Robotics Data Quality Engineer

Posted Yesterday
Be an Early Applicant
San Mateo, CA, USA
Hybrid
Mid level
Artificial Intelligence • Machine Learning • Robotics • Software
The Role
Analyze and validate multi-modal robotics data, build automated validation pipelines and visualization tools, define quality metrics and schemas, synchronize cross-modal timestamps, catalog failure modes, and work with collection teams to trace and fix data issues.
Summary Generated by Built In

At XDOF, we’re at an inflection point. Frontier labs are racing to build general-purpose robots, and high-quality training data is the bottleneck. We’re building the foundation behind the foundation models – the data collection systems, operational capability, exabyte-scale data warehouse, and software toolchain – to help our partners drive the field forward.

The models are only as good as the data. We’re looking for a Robotics Data Quality Engineer to be the person who knows whether our data is trustworthy, across every modality we collect. You’ll analyze, validate, and build tooling around data from teleoperation on real hardware, egocentric capture, UMI-style grippers, and more. If something is wrong with the data, you’re the first to catch it and the one who helps us fix the process.

What You’ll Do

Data quality engineers are the bridge between raw collection and usable training data. Sample projects include:

  • analyzing robotics data across modalities to identify quality issues: plotting joint velocities, validating camera poses, checking gripper encoder accuracy, and flagging anomalous collection sessions

  • building automated validation pipelines that run on ingestion and catch problems before data enters the warehouse

  • designing and documenting data formats and schemas across collection modalities, ensuring they are consistent, versioned, and well-understood by partners and internal researchers

  • building data visualization tools and dashboards so the broader team can inspect and understand the data without writing custom scripts

  • validating cross-modal temporal alignment, including timestamp synchronization, dropped frame detection, and clock drift across camera, joint, and gripper streams

  • defining quality metrics and thresholds per modality and tracking whether data quality is improving or degrading as collection scales

  • cataloging edge cases and failure modes into a shared taxonomy so the team has a common language for data issues

  • working closely with data collection operators to trace quality issues back to their root cause, whether systemic (hardware calibration, sensor drift) or operator-specific

About You

Baseline skills:

  • bachelor’s or master’s degree (or equivalent experience) in robotics, computer science, mechanical engineering, or a related field

  • strong Python data skills (numpy, pandas, matplotlib or plotly) and comfort working with large, messy datasets

  • solid understanding of 3D geometry, coordinate frames, and spatial transformations

  • intuition for physical systems: you can look at a trajectory or a joint velocity plot and tell when something is off

  • experience designing or working with structured data formats (protobuf, HDF5, ROS bags, or similar)

You might be a good fit if you:

  • have hands-on experience with robotics data, whether from a research lab, a robotics startup, or a manipulation/locomotion project

  • have worked with teleoperation systems, motion capture, or egocentric data collection

  • have experience with signal processing, sensor fusion, or time-series analysis

  • have built internal data visualization tools or dashboards for technical teams

  • have worked on data versioning, lineage tracking, or schema migration in a production setting

  • are very comfortable working in 0→1 environments

  • are mission-driven and passionate about robotics: work at XDOF is fast-paced and constant. We hope you love what you’re going to be doing, because you’ll be doing a lot of it!

Skills Required

  • Bachelor's or Master's degree (or equivalent experience) in robotics, computer science, mechanical engineering, or related field
  • Strong Python data skills (NumPy, pandas) and experience with matplotlib or Plotly
  • Experience working with large, messy datasets and building validation pipelines
  • Solid understanding of 3D geometry, coordinate frames, and spatial transformations
  • Intuition for physical systems and ability to interpret trajectories and joint velocity plots
  • Experience designing or working with structured data formats (Protobuf, HDF5, ROS bags or similar)
  • Hands-on experience with robotics data (teleoperation, egocentric capture, manipulation/locomotion projects)
  • Experience with signal processing, sensor fusion, or time-series analysis
  • Experience building internal data visualization tools or dashboards for technical teams
  • Experience with data versioning, lineage tracking, or schema migration in production
  • Comfort working in 0->1 environments and fast-paced startups
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
0 Employees
Year Founded: 2024

What We Do

XDOF is the infrastructure partner for the world's most ambitious robotics builders, developing the tools, data, and services that accelerate the future of physical AI. They build the foundational infrastructure for robotics foundation models, including data collection systems, annotation pipelines, and exabyte-scale data infrastructure, helping frontier labs build general-purpose robots by solving the critical bottleneck of high-quality training data.

Similar Jobs

ServiceNow Logo ServiceNow

Director Of Product Management

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Santa Clara, CA, USA
29000 Employees
221K-387K Annually

ServiceNow Logo ServiceNow

Vice President, CEG Global Partner Strategy and Management

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Santa Clara, CA, USA
29000 Employees
264K-449K Annually

ServiceNow Logo ServiceNow

Machine Learning Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Hybrid
Mountain View, CA, USA
29000 Employees

ServiceNow Logo ServiceNow

Senior Software Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Hybrid
Mountain View, CA, USA
29000 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account