Forward Deployed Research Scientist

Reposted 3 Days Ago
7 Locations
In-Office or Remote
140K-200K Annually
Mid level
Artificial Intelligence • Information Technology • Machine Learning
Our mission is to build the best products to align with artificial intelligence.
The Role
The Forward Deployed Research Scientist engages directly with AI labs to develop data strategies, fine-tune models, run experiments, and collaborate on publishing research, all in a fast-paced client-focused setting.
Summary Generated by Built In
Shape the Future of AI

At Labelbox, we're building the critical infrastructure that powers breakthrough AI models at leading research labs and enterprises. Since 2018, we've been pioneering data-centric approaches that are fundamental to AI development, and our work becomes even more essential as AI capabilities expand exponentially.

About Labelbox

We're the only company offering three integrated solutions for frontier AI development:

  1. Enterprise Platform & Tools: Advanced annotation tools, workflow automation, and quality control systems that enable teams to produce high-quality training data at scale
  2. Frontier Data Labeling Service: Specialized data labeling through Alignerr, leveraging subject matter experts for next-generation AI models
  3. Expert Marketplace: Connecting AI teams with highly skilled annotators and domain experts for flexible scaling
Why Join Us
  • High-Impact Environment: We operate like an early-stage startup, focusing on impact over process. You'll take on expanded responsibilities quickly, with career growth directly tied to your contributions.
  • Technical Excellence: Work at the cutting edge of AI development, collaborating with industry leaders and shaping the future of artificial intelligence.
  • Innovation at Speed: We celebrate those who take ownership, move fast, and deliver impact. Our environment rewards high agency and rapid execution.
  • Continuous Growth: Every role requires continuous learning and evolution. You'll be surrounded by curious minds solving complex problems at the frontier of AI.
  • Clear Ownership: You'll know exactly what you're responsible for and have the autonomy to execute. We empower people to drive results through clear ownership and metrics.
Role Overview

Alignerr is Labelbox's human data organization — we produce the training data that frontier AI labs use to build their most capable models. Our Forward Deployed Research Team sits at the intersection of research science and client delivery, embedding research capability directly into the engagements that drive our business.

This is not a traditional research scientist role. You will not spend months pursuing a single research question. You will work on multiple client engagements simultaneously, operating on timescales of days to weeks. You will sit in scoping meetings with research teams at major AI labs, reason scientifically about data strategy in real time, fine-tune open-weight models to validate our data methodology, and collaborate with our Applied Research team to turn client-grounded findings into published work. The pace is fast, the problems are applied, and the feedback loops are short.

We are looking for someone who finds that energizing, not compromising.

Your Impact
  • Engage directly with frontier lab research teams. You will be in the room during client scoping meetings — not as support staff, but as a technical peer. You'll engage on methodology, challenge assumptions about data requirements, and shape project specifications based on a scientific understanding of how data composition affects model outcomes.

    Develop deep scientific understanding of client engagements. For each project, you will build a working model of the client's architecture, training methodology, and target capabilities. You'll use this understanding to reason about why a particular data strategy will or won't work, identify risks early, and iterate with empirical grounding — not intuition.

    Run ablation studies and fine-tune open-weight models. You will fine-tune models on client data (and proxy data) to empirically measure the impact of our data on model performance. This is how we validate that what we deliver actually improves our customers' models — and how we catch problems before the client does.

    Consult on workflow and quality systems. You will partner with our Human Data Operations team to review annotation schemas, task designs, and quality rubrics before projects go into execution. Your job is to ensure the spec is technically sound — that the data we produce will actually serve the client's training objectives.

    Collaborate with Applied Research on publications and benchmarks. Our Applied Research team owns the long-horizon research agenda. Your role is to feed them signal from the field — generalizable findings, reusable methodologies, empirical results — and help drive joint projects to completion. You will contribute to benchmarks, white papers, and conference submissions that establish Labelbox's research credibility.

What You Bring
  • Required

    • MS or PhD in Machine Learning, NLP, Computer Science, or a related quantitative field.
    • Hands-on experience fine-tuning large language models (open-weight models such as Llama, Mistral, Qwen, or similar).
    • Strong understanding of LLM training pipelines — pretraining, supervised fine-tuning, RLHF/DPO, and how data quality and composition affect each stage.
    • Experience designing and executing experiments with rigor — hypothesis formation, controlled comparisons, statistical analysis of results.
    • Ability to operate at speed. You should be comfortable going from problem definition to experimental results in days, not months.
    • Strong written and verbal communication. You will present findings to client research teams and contribute to published work.

    Strongly Preferred

    • Prior experience at a frontier AI lab, applied ML startup, or in a research role with direct client/stakeholder interaction.
    • Experience with evaluation and benchmarking of LLMs — designing metrics, building eval harnesses, interpreting results critically.
    • Familiarity with human data pipelines — annotation workflows, quality assurance methodology, inter-annotator agreement analysis.
    • Experience with reinforcement learning, reward modeling, or RLHF environments.
    • Published research (conferences, journals, or technical reports) in ML/NLP or adjacent fields.

    What Matters More Than Credentials

    • Applied instinct over academic purity. The measure of success here is client impact and publishable-but-practical results — not methodological novelty for its own sake. If your first instinct when handed a problem is to build a framework, this isn't the role. If your first instinct is to run an experiment and get a result, it is.
    • Comfort with ambiguity and incomplete information. Client engagements rarely come with clean problem statements. You'll need to extract the real question from a noisy conversation, scope an approach quickly, and iterate.
    • Cross-functional fluency. You will work daily with field engineers, project managers, operations teams, and an independent Applied Research team. Someone who can only operate within a pure research silo will struggle here.
    • Intellectual honesty. When an ablation study shows the data isn't working, you need to say so — clearly and constructively — even when it's inconvenient for the deal timeline.
What You Should Know About This Team
  • We are small and high-leverage. The FDRT is a team of five today. Every person's work directly influences client outcomes and Labelbox's market position.
  • We operate at the tempo of client delivery. Two-week sprints. SLAs measured in days. If you want months of uninterrupted focus on a single problem, our Applied Research team is a better fit.
  • We are at the intersection of several teams. FDRT works with Field Delivery Engineers, Human Data Operations, Applied Research, and client research teams. The role requires navigating those interfaces with credibility and without ego.
  • We protect time for research. 25–30% of team capacity is allocated to research collaboration with Applied Research. This is not aspirational — it is a structural commitment. You will have the opportunity to publish.

Labelbox strives to ensure pay parity across the organization and discuss compensation transparently.  The expected annual base salary range for United States-based candidates is below. This range is not inclusive of any potential equity packages or additional benefits. Exact compensation varies based on a variety of factors, including skills and competencies, experience, and geographical location.

Annual base salary range
$140,000$200,000 USD
Life at Labelbox
  • Location: Join our dedicated tech hubs in San Francisco or Wrocław, Poland
  • Work Style: Hybrid model with 2 days per week in office, combining collaboration and flexibility
  • Environment: Fast-paced and high-intensity, perfect for ambitious individuals who thrive on ownership and quick decision-making
  • Growth: Career advancement opportunities directly tied to your impact
  • Vision: Be part of building the foundation for humanity's most transformative technology
Our Vision

We believe data will remain crucial in achieving artificial general intelligence. As AI models become more sophisticated, the need for high-quality, specialized training data will only grow. Join us in developing new products and services that enable the next generation of AI breakthroughs.

Labelbox is backed by leading investors including SoftBank, Andreessen Horowitz, B Capital, Gradient Ventures, Databricks Ventures, and Kleiner Perkins. Our customers include Fortune 500 enterprises and leading AI labs.

Your Personal Data Privacy: Any personal information you provide Labelbox as a part of your application will be processed in accordance with Labelbox’s Job Applicant Privacy notice.

Any emails from Labelbox team members will originate from a @labelbox.com email address. If you encounter anything that raises suspicions during your interactions, we encourage you to exercise caution and suspend or discontinue communications.

Skills Required

  • MS or PhD in Machine Learning, NLP, Computer Science, or a related quantitative field
  • Hands-on experience fine-tuning large language models
  • Strong understanding of LLM training pipelines
  • Experience designing and executing experiments with rigor
  • Strong written and verbal communication

Labelbox Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Labelbox and has not been reviewed or approved by Labelbox.

  • Fair & Transparent Compensation Pay is considered competitive and clearly communicated through pay transparency, with equity included in total rewards. People are often satisfied with overall packages that combine salary, stock/equity, and benefits.
  • Leave & Time Off Breadth Time off options are broad, with unlimited PTO alongside paid sick time, volunteer time, company‑wide vacations, and bereavement leave. This breadth supports flexibility for rest and personal needs.
  • Healthcare Strength Health coverage includes medical, dental, and vision insurance with added protections like life insurance and travel support for sensitive care. Wellness resources and supportive programs further reinforce the healthcare offering.

Labelbox Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
115 Employees
Year Founded: 2017

What We Do

Labelbox is the data factory for generative AI, providing the highest quality training data for frontier and task-specific models. Labelbox’s comprehensive platform combines on-demand labeling services with the industry-leading data labeling platform. The Boost labeling service is powered by the Alignerr community of highly-educated experts, who span all major languages and a diverse range of advanced subjects. They are available on-demand to rapidly generate new data for supervised fine-tuning, RLHF, and more. Labelbox’s software-first approach delivers unmatched control and transparency into the labeling process, leading to the generation of high-quality, consistent data at scale. Customers include Fortune 500 enterprises and leading AI labs, and Labelbox is backed by leading investors including SoftBank, Andreessen Horowitz, B Capital, Gradient Ventures, Databricks Ventures, and Kleiner Perkins.

Why Work With Us

Labelboxers are driven to master their craft and build the best products for AI, which is quickly becoming one of the most significant technologies of our time. Join us in supporting our customers as they create AI breakthroughs.

Gallery

Gallery

Similar Jobs

Affirm Logo Affirm

Manager, Software Engineering (Code to Environments)

Big Data • Fintech • Mobile • Payments • Financial Services
Easy Apply
Remote
Canada
2200 Employees
178K-228K Annually

Affirm Logo Affirm

Technical Program Manager

Big Data • Fintech • Mobile • Payments • Financial Services
Easy Apply
Remote
Canada
2200 Employees
150K-200K Annually

Applied Systems Logo Applied Systems

Digital Marketing Manager

Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
Remote or Hybrid
2 Locations
3040 Employees
130K-163K Annually

VelocityEHS Logo VelocityEHS

Customer Experience Intern (Fall 2026)

Cloud • Greentech • Social Impact • Software • Consulting
Remote
2 Locations
500 Employees

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account