Sieve is the only AI research lab exclusively focused on video data. We combine exabyte-scale video infrastructure, novel video understanding techniques, and dozens of data sources to develop datasets that push the frontier of video modeling. Video makes up 80% of internet traffic and has become the enabling digital medium powering creativity, communication, gaming, AR/VR, and robotics. Sieve exists to solve the biggest bottleneck in growth of these applications: high-quality training data.
We've partnered with top AI labs and did $XXM last quarter alone, as a team of just 12 people. We also raised our Series A earlier this year from Tier 1 firms such as Matrix Partners, Swift Ventures, Y Combinator, and AI Grant.
About the RoleAs a software engineer at Sieve, you’ll work across the stack to build and scale the data pipelines that create the datasets we deliver to customers. You’ll have ownership over projects end-to-end: from how data is sourced and curated, to developing ML filters, improving system efficiency, and building internal dashboards for QA and delivery. You’ll play a critical role in ensuring customers receive high-quality data on time, every time.
This role is ideal for someone who thrives on solving hard problems, enjoys working directly with customers pushing the frontier of Video AI, and wants to be challenged to achieve at the highest level.
RequirementsStrong Python developer, experience in Go or Typescript also welcome
Excellent communication skills, especially with customers and external teams
Motivated by hard problems that require working late nights and weekends
Writes clean, maintainable code
Able to context switch at a moment's notice
Bonus: Experience as an early hire at a startup
Bonus: Active GitHub or portfolio projects
In-person at our SF HQ
Top Skills
What We Do
Sieve is the only AI research lab exclusively focused on video data.
Video already makes up 80% of internet traffic and has become the dominant medium driving creativity, communication, gaming, AR/VR, and robotics. Unlocking the ability to truly model video is the key to breakthroughs across all of these domains but progress has been bottlenecked by one thing: high-quality training data. That’s where Sieve comes in.
We bring together exabyte-scale video infrastructure, novel video understanding techniques, and dozens of diverse data sources to create datasets that push the frontier of video modeling. This unique combination allows us to deliver data with unmatched precision, quality, and speed which has earned the trust of frontier AI labs, Fortune 100 companies, and fast-growing generative AI startups.