Founding Research Engineer

Posted 12 Days Ago
Be an Early Applicant
San Francisco, CA
In-Office
150K-200K Annually
Mid level
Artificial Intelligence • Information Technology • Software
The Role
Design and implement scalable reinforcement learning recipes, develop environments and reward functions, and conduct foundational research to publish open-source environments and training data.
Summary Generated by Built In

About The LLM Data Company

The LLM Data Company (YC X25) provides post-training data and RL environments to foundation model labs and frontier applied AI companies. We have raised $3.6m from Tier 1 VCs and are growing 200%+ month-over-month.

Responsibilities

  • Design and implement scalable RL recipes for post-training task-specific models

  • Develop modular environments, reward functions, and evaluator scaffolds for internal and customer-facing tasks

  • Drive research at the intersection of scalable infra and modern RL frameworks to enable RL-as-a-service

  • Drive foundational research to publish open source environments and training data

  • Build data generation and curation pipelines to support frontier post-training

  • Collaborate with product teams to deliver a user friendly interface for non-technical users to generate data

Qualifications

  • Bachelor or Master in Computer Science or related field

  • Comfort with core tooling (verl, PyTorch, etc.)

  • Familiarity with modern post-training techniques (GRPO, etc.)

  • Experience with evaluations and reward engineering

  • Published in top journals (ICLR, NeurIPS, ICML, etc.)

Why you should join

  • Cutting-edge research: Work on unpublished, novel training environments

  • Direct lab exposure: Projects that labs actually use and validate in production

  • High autonomy: Wide design space to propose and run experiments with minimal oversight

  • Early team member: Join as one of the first 10 people with significant equity upside

Top Skills

PyTorch
Verl
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
7 Employees

What We Do

Post-training data research

Similar Jobs

The LLM Data Company Logo The LLM Data Company

Founding Senior Research Engineer

Artificial Intelligence • Information Technology • Software
In-Office
San Francisco, CA, USA
7 Employees
200K-275K Annually

Anduril Logo Anduril

Senior Machine Learning Engineer

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
In-Office
Mountain View, CA, USA
6000 Employees
240K-318K Annually

Anduril Logo Anduril

Staff Software Engineer

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
In-Office
Costa Mesa, CA, USA
6000 Employees
220K-292K Annually

SoFi Logo SoFi

Senior Project Manager

Fintech • Mobile • Software • Financial Services
Easy Apply
Hybrid
San Francisco, CA, USA
4500 Employees

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account