Data Operations Engineer

Posted 2 Days Ago
San Francisco, CA, USA
In-Office
Junior
Artificial Intelligence • Big Data • Information Technology • Software • Analytics
The Role
Own end-to-end data labeling pipelines and vendor relationships, build annotation tooling and dataset browsers, define labeling taxonomies and QC standards, partner with researchers to design data collection strategies, monitor dataset metrics, integrate new data sources, and close the loop from labeled data to model improvement.
Summary Generated by Built In

Company Background:
Specter's mission is to help automate the physical world.

Today, we build video sensors with state-of-the-art AI agents that answer any question, anywhere in their environments. Our systems can automatically detect and reason about any physical activity captured on camera, from security incidents (e.g. perimeter intrusion, theft, LPR), to safety monitoring (e.g. PPE detection, injured people), to operational efficiency (e.g. material tracking, congestion monitoring). We offer both long range wireless (1km range) and wired sensor variants to suit any deployment.

Our co-founders Xerxes and Philip are passionate about empowering our partners in the fast approaching world of physical AI and robotics. We are a small, fast growing team who hail from Anduril, Tesla, Uber, and the U.S. Special Forces.

Role:

Specter is hiring a data operations engineer to build our research data operation. This individual will own the full pipeline from defining what data we need, to getting it labeled at high quality, to ensuring it meets the needs of our research team and ultimately improves our models. The role sits at the intersection of engineering and research, with a focus on building systems and tooling.

Responsibilities:

  • Own the end-to-end relationship with our data labeling provider, including task scoping, timeline management, and issue resolution

  • Build and maintain internal tooling for labelers, including annotation interfaces, task pipelines, and dataset browsers

  • Define and enforce quality control standards across all labeled data, implementing automated checks and audit workflows

  • Partner with researchers to translate perception model needs into data collection strategies, identifying gaps in coverage across object types, scenes, lighting conditions, and sensor modalities

  • Build dashboards and metrics to monitor dataset diversity, class balance, and domain coverage

  • Close the loop on the data flywheel: track how labeled data flows into training, surface failure modes, and drive iteration on the pipeline from collection through to model improvement

  • Evaluate and integrate new data sources

  • Define labeling taxonomies and annotation specifications

Qualifications:

  • 1-3+ years of experience in data operations, project management, or a technical coordination role, ideally supporting ML or engineering teams

  • Proficiency in Python and comfort building lightweight tools, scripts, and dashboards

  • Strong written and verbal communication skills, with experience managing external vendors or cross-functional stakeholders

  • Familiarity with ML workflows and how training data impacts model performance

  • Highly organized, with a track record of managing multiple concurrent workstreams

  • Self-directed and autonomous

  • Bonus: experience with computer vision data, annotation platforms, or labeling operations

Skills Required

  • 1-3+ years experience in data operations, project management, or technical coordination, ideally supporting ML or engineering teams
  • Proficiency in Python and comfort building lightweight tools, scripts, and dashboards
  • Strong written and verbal communication skills, with experience managing external vendors or cross-functional stakeholders
  • Familiarity with ML workflows and how training data impacts model performance
  • Highly organized, with a track record of managing multiple concurrent workstreams
  • Self-directed and autonomous
  • Experience with computer vision data, annotation platforms, or labeling operations
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
50 Employees
Year Founded: 2020

What We Do

Specter is an AI-powered platform that delivers real-time data and insights on private companies, enabling investors to make informed decisions. It offers an AI-driven deal sourcing platform and a dual-lens classification system for industries and tech verticals.

Similar Jobs

Deutsche Bank Logo Deutsche Bank

Data Engineer

Fintech • Financial Services
In-Office or Remote
2 Locations
68787 Employees
65K-118K Annually
In-Office
Sunnyvale, CA, USA
3411 Employees
125K-220K Annually

Xenith Solutions Logo Xenith Solutions

Data Ops Engineer

Big Data • Software • App development • Defense • Manufacturing
In-Office
San Diego, CA, USA
41 Employees
In-Office
Sunnyvale, CA, USA
472 Employees
125K-222K Annually

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account