Senior/Lead Data Analyst

Reposted 25 Days Ago
New York City, NY
In-Office
150K-300K Annually
Senior level
Gaming • Mobile • Software
The Role
As a Senior/Lead Data Analyst, you'll ensure video data quality for ML features, audit datasets using SQL and Python, collaborate on data issues, and run experiments to improve model performance, while mentoring others in best practices.
Summary Generated by Built In
About General Intuition

We are the frontier research lab dedicated to building foundation models for environments that require deep spatial and temporal reasoning. For the past year, we've been pushing the forefront of AI across agents capable of navigating space and time, world models that provide training environments for those agents, and video understanding models with a focus on transfer to the real world.

We raised a seed round of $133M from General Catalyst and Khosla to discover the next generation of intelligence.

The Role

We’re looking for a Senior/Lead Data Analyst to own the quality of the video data that powers Medal’s machine learning features. You’ll partner closely with ML researchers, data engineering, and product to measure, diagnose, and improve the accuracy, completeness, and reliability of our video datasets and labels.

If you love turning messy, high-volume media data into trustworthy, measurable assets—and you get excited about building feedback loops that make ML systems smarter—this is for you.

You Will
  • Own the video data quality program: define quality KPIs (coverage, precision/recall, calibration, temporal alignment, label latency, drift) and build dashboards that make them visible company-wide.

  • Audit datasets at scale using SQL and Python: create automated checks for codec/bitrate/fps/resolution, audio/video sync, corruption, duplicates, and long-tail coverage by game, device, and region.

  • Design ground-truth pipelines: human-in-the-loop reviews and labeling guidelines; measure annotator agreement, and iterate to improve label quality.

  • Diagnose model-data issues: collaborate with ML to localize failure modes, quantify data gaps, and prioritize data collection or relabeling to move accuracy on real user content.

  • Detect bias and drift across games, platforms, and cohorts; propose mitigations and monitor post-launch.

  • Instrument product and ingestion to capture the metadata ML needs (e.g., encoding, device, frame rate, content type) while respecting privacy and safety constraints.

  • Run experiments: design and analyze A/Bs and holdouts to connect data quality improvements to model and product outcomes.

  • Champion best practices in data contracts, validation, reproducibility, and documentation; mentor analysts and influence data quality culture.

  • Work on-site at our NYC office 5 days a week.

You Need
  • 5+ years in data analytics or data science with a focus on media or ML data quality in production systems.

  • Fluency in SQL and Python (Pandas/NumPy); you’re comfortable building reproducible notebooks and code-reviewed pipelines.

  • Strong measurement chops: you’ve defined and computed label & model quality metrics (precision/recall/F1, mAP, AUROC, calibration, temporal IoU) and can explain their trade-offs.

  • Data validation & ETL experience: Great Expectations/TFDV (or equivalent), dbt, and an orchestrator (Airflow/Prefect).

  • Warehouse & BI: BigQuery (or similar) plus Looker/Mode/Tableau (or similar); you build clear dashboards and know when to run deep dives.

  • Experimentation: A/B testing design and analysis; comfort with pitfalls and guardrails.

  • Product sense & communication: you turn ambiguous problems into measurable roadmaps and communicate findings clearly to technical and non-technical partners.

  • A love for gaming, however you define it.

Bonus Points
  • Experience running annotation programs (Label Studio, CVAT, Scale or custom tooling) and crafting labeling taxonomies for actions/events/scenes.

  • Hands-on with video tooling: ffmpeg/ffprobe for metadata & probes; familiarity with OpenCV (and running lightweight inference with PyTorch/TensorFlow for scoring/spot checks).

  • Duplicate/near-duplicate detection (perceptual hashing, embeddings/FAISS/Milvus) and dataset dedup at scales

  • Privacy, safety, and policy consider.ations for user-generated video (GDPR/COPPA basics, PII redaction, content safety heuristics).

  • Spark/PySpark or distributed compute for heavy lifts.

  • Familiarity with CV/ASR signals (scene boundary, keypoint/action recognition, speech-to-text) to enrich labels and audits.

  • Prior history as a Medal user—share a clip or your profile!

Why Join Us
  • Directly shape the data foundation behind ML features used by millions of gamers.

  • Work with a passionate team that values ownership, craftsmanship, and speed.

  • Competitive salary, equity options, comprehensive health insurance, and 401k.

  • See your work translate into more accurate models and better creator experiences—fast

Benefits
  • Competitive salary and meaningful equity

  • Comprehensive health insurance including dental and vision insurance

  • 401k

Top Skills

Airflow
BigQuery
Dbt
Ffmpeg
Ffprobe
Great Expectations
Looker
Mode
Numpy
Opencv
Pandas
Prefect
Pyspark
Python
PyTorch
Spark
SQL
Tableau
TensorFlow
Tfdv
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York, New York
57 Employees
Year Founded: 2015

What We Do

About Us: Create gaming memories while apart — Medal enables you to reliably capture and meaningfully share online memories with friends (..that would otherwise be lost to time).

Similar Jobs

Traba Logo Traba

Senior Data Analyst

Information Technology • Logistics • Software • 3PL: Third Party Logistics • Industrial • Manufacturing
In-Office
New York, NY, USA
100 Employees
145K-185K Annually

Stensul Logo Stensul

Data Analyst

Cloud • Marketing Tech • Professional Services • Software
Easy Apply
Hybrid
New York, NY, USA
166 Employees
115K-135K Annually

Rent the Runway Logo Rent the Runway

Senior Data Analyst

eCommerce • Fashion • Logistics
Easy Apply
Hybrid
Brooklyn, NY, USA
1000 Employees
140K-175K Annually
In-Office
Getzville, NY, USA
223850 Employees
125K-188K Annually

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account