Reducto is the agentic document platform for leading AI teams who demand enterprise performance at scale. We provide a complete toolkit for handling any workflow by understanding documents the way a human would. Using an agentic architecture to orchestrate dozens of vision and frontier models behind the scenes, Reducto delivers high accuracy on complex, real-world documents where other tools fall short.
We’ve grown rapidly, increasing revenue 8x year over year and partnering with hundreds of companies, from leading AI teams like Harvey, Vanta, and Scale, to enterprise customers across FAANG and top trading firms.
Reducto has raised over $100M from world-class investors including a16z, Benchmark, and First Round Capital.
The OpportunityAs Data Labeling Lead, you’ll play a key role in leading and managing our in-house data labeling team to ensure the highest quality of training data for our models. You’ll collaborate closely with the ML and engineering teams to refine labeling guidelines and maintain rigorous accuracy standards. This is a high-impact role where you’ll help shape the foundation of our AI capabilities from the ground up.
What You’ll Do:Lead, train, and manage our in-house data labeling team.
Define, execute, and continuously improve data annotation processes with a very high attention to detail.
Ensure high-quality data outputs and meet rigorous accuracy and consistency standards.
Work closely with ML engineers to understand data requirements and edge cases.
Manage team schedules, specifically coordinating and working nighttime hours Pacific Time (PT).
Possess a very high attention to detail.
Have 3+ years of experience working in Python and are comfortable managing basic data labeling apps.
Have the ability to work nighttime hours PT.
Hold yourself to a high bar for quality and precision.
Enjoy solving complex problems and building from first principles.
Have 3+ years of experience in data labeling, operations, or team management.
Operate well in fast-changing, high-growth environments.
Collaborate effectively across technical and non-technical teams.
Take full ownership from strategy through execution.
Have experience at an early-stage or high-growth startup.
Are familiar with AI/ML data pipelines and labeling tools.
Care deeply about combining operational excellence with business impact.
Impact: Your work directly shapes how the world’s best AI companies access and use enterprise data.
Speed: We move fast, ship often, and iterate in days, not months.
Learning: Work alongside world-class engineers, operators, and founders who care deeply about product, precision, and velocity.
This is a contract role based anywhere in North America. We’re an early-stage company, we move fast and work hard. Please apply only if that excites you.
Equal OpportunityReducto is an Equal Opportunity Employer committed to diversity and inclusion in the workplace. All qualified applicants will receive consideration without regard to sex, race, color, age, national origin, religion, disability, sexual orientation, gender identity, veteran status, or any other protected category.
Skills Required
- 3+ years of experience in data labeling
- 3+ years of experience working in Python
- Ability to work nighttime hours PT
- Experience in operations or team management
Reducto Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Reducto and has not been reviewed or approved by Reducto.
-
Healthcare Strength — Health coverage is described as comprehensive, including medical, dental, and vision insurance. Company materials present this as a core part of the benefits package.
-
Retirement Support — A 401(k) plan with a 4% company match is explicitly offered. This adds meaningful long-term financial support to the total rewards package.
-
Leave & Time Off Breadth — Unlimited PTO is included. Parental leave is available with a schedule tailored in coordination with the company.
Reducto Insights
What We Do
Reducto turns unstructured documents—like PDFs, images, and spreadsheets—into clean, structured data ready for any workflow. Our multi-pass parsing system combines OCR and vision-language models to deliver state of the art accuracy, reliability, and scalability. We currently power the best AI teams ranging from startups to Fortune 10 companies across all industries -- technology, healthcare, legal, finance, and more.








