Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises looking to deploy advanced AI systems. Turing accelerates frontier research with high-quality data, specialized talent, and training pipelines that advance thinking, reasoning, coding, multimodality, and STEM. For enterprises, Turing builds proprietary intelligence systems that integrate AI into mission-critical workflows, unlock transformative outcomes, and drive lasting competitive advantage.
Recognized by Forbes, The Information, and Fast Company among the world’s top innovators, Turing’s leadership team includes AI technologists from Meta, Google, Microsoft, Apple, Amazon, McKinsey, Bain, Stanford, Caltech, and MIT. Learn more at www.turing.com
Overview
We are seeking a seasoned techno-functional leader to drive the development and execution of large-scale LLM training programs. This leader will partner with research teams at our clients (leading LLM labs) to:
- Identify opportunities for building training datasets to improve model capabilities and performance
- Generate these datasets with high quality and speed
- Build automation tools and processes for scalability
- Deliver the datasets so that they are easily usable by our clients
Key Responsibilities
Operational Leadership & Performance Management
- Lead and scale global delivery teams of 100+, distributed across functions, regions, and levels (ICs, leads, and managers)
- Implement performance management systems that go beyond managerial reporting, using data-driven metrics, tools, and products to assess productivity, quality, and output consistency
- Build strong operational structures that allow for transparency, accountability, and early detection of underperformance
- Partner with cross-functional leads to optimize workflows and improve internal tool adoption for delivery efficiency
Data Quality & Scripting-Driven Automation
- Own the quality, accuracy, and scalability of data generated for LLM training
- Move beyond manual QA layers by leveraging Python scripting, APIs, and automation frameworks to measure, validate, and improve dataset integrity
- Design and oversee tools or scripts for data validation, annotation accuracy checks, and pipeline consistency
- Ensure datasets adhere to compliance standards (PII, GDPR, HIPAA) and can be programmatically tested for usability and quality
LLM Training & Evaluation
- Lead generation and delivery of high-quality, scalable datasets focused on SFT, RLHF, reasoning, and agentic workflows
- Oversee the entire data lifecycle from client intake and annotation workflow design to delivery
- Partner with product, research, and engineering teams to implement evaluation metrics (e.g., win rate, inter-annotator agreement, and pairwise preference scoring)
Client Partnership & Communication
- Serve as the primary point of contact for enterprise AI clients; manage expectations, delivery timelines, and escalations
- Build relationships with engineering and research stakeholders by delivering consistently high-quality data
- Communicate effectively across technical and non-technical audiences; provide transparency through structured updates and quality reporting
Team Development & Tooling
- Recruit, mentor, and coach cross-functional leaders (Eng, Data, Ops, and Program Management)
- Drive adoption and improvement of internal tools (e.g., task management systems, quality dashboards)
- Champion continuous improvement across data quality, tools, and delivery processes
Required Qualifications
- 10+ years of experience leading large-scale technical delivery organizations, ideally across AI, ML, or data operations
- Bachelor's degree in Engineering, Computer Science, or equivalent technical discipline
- Demonstrated ability to act as a strategic business partner with our clients, researchers, and engineers at leading LLM labs
- Proven success in building and scaling multi-level, high-performance teams with distributed global operations
- Experience managing managers
- Skip-level performance management
- Hands-on technical fluency: ability to write and review data validation scripts
- Demonstrated experience managing dataset generation or annotation for machine learning model evaluation and/or training
- Familiarity with ML tools and data workflows (e.g., HuggingFace, LangChain, Weights & Biases, Databricks)
Preferred Qualifications
- Experience evaluating large language model performance and/or improving model performance via fine-tuning
- Strong understanding of data quality frameworks, including automation, tooling, and manual processes
- Experience in AI data annotation, model evaluation, and fine-tuning platforms
- Strong communication and storytelling skills with executive stakeholders
Location: SF Bay Area (Hybrid)
Compensation: $255,000 to $325,000 OTE + Equity
We are client first: We put our clients at the center of everything we do, because their success is the ultimate measure of our value.
We work at Start-Up Speed: We move fast, stay agile, and favor action because momentum is the foundation of perfection.
We are AI forward: We help our clients build the future of AI and implement it in our own roles and workflows to amplify productivity.
Amazing work culture (Super collaborative & supportive work environment; 5 days a week)
Awesome colleagues (Surround yourself with top talent from Meta, Google, LinkedIn etc. as well as people with deep startup experience)
Competitive compensation
Flexible working hours
Don’t meet every single requirement? Studies have shown that women and people of color are less likely to apply to jobs unless they meet every single qualification. Turing is proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender identity, sexual orientation, age, marital status, disability, protected veteran status, or any other legally protected characteristics. At Turing, we are dedicated to building a diverse, inclusive, and authentic workplace, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right candidate for this or other roles.
For applicants from the European Union, please review Turing's GDPR notice here.
Turing Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Turing and has not been reviewed or approved by Turing.
- Fair & Transparent Compensation: Feedback suggests USD-denominated pay and access to higher-paying clients can outpace local benchmarks for many non‑U.S. developers. Payout timing and processing are described as predictable once engagements begin, which supports confidence in earnings.
- Wellbeing & Lifestyle Benefits: Feedback suggests remote‑first work with flexible hours is a consistent positive that enhances day‑to‑day balance. The ability to work from anywhere and maintain autonomy is frequently highlighted as part of the overall rewards experience.
- Healthcare Strength: Feedback suggests some U.S. corporate roles include comprehensive health benefits, with individual accounts referencing employer‑covered medical insurance. These signals indicate stronger healthcare support for certain employee populations.
Turing Insights
What We Do
We now live in a remote-first world, and every company is in a race to find the best remote engineers. There are many amazing engineers all over the world, and Turing’s mission is to help unleash the world’s untapped human potential. More than 300 companies, including those backed by Google Ventures, Bloomberg, Andreessen, Founders Fund, and Kleiner, already use Turing to spin up their engineering dream teams. Turing’s hiring platform combines planetary reach with AI to deliver your ideal engineers. Our deep matching intelligence finds the best Turing developers across 100+ skills such as React, Node, Python, Golang, Angular, Swift, Java, and many more. As part of our rigorous vetting process, we also review software engineers’ technical abilities, English skills, and remote working capabilities. After the match, Turing ensures time zone overlap, transparency, and reliable communication to make remote development easy for you. The Turing team has deep expertise in AI and in building engineering dream teams at top companies in the U.S. Turing is backed by well-known investors, including Facebook’s first CTO (Adam D’Angelo); executives from Google, Facebook, Amazon, and Twitter; and Founders Fund (investors in Facebook, Tesla, Asana, and others). Turing is led by serial AI entrepreneurs Jonathan Siddharth and Vijay Krishnan, whose previous AI firm leveraged remote talent and was successfully acquired.






