Applied AI Researcher, Post-Training

Reposted 23 Days Ago
Be an Early Applicant
2 Locations
Hybrid
100K-150K Annually
Senior level
Artificial Intelligence • Software
The Role
Researcher will adapt foundation models for real-world applications, focusing on performance, alignment, and human-centric objectives to optimize AI systems for enterprises.
Summary Generated by Built In
About Distyl AI

Distyl is an applied AI technology company partnering with the world’s most ambitious institutions to rearchitect critical operations for the frontier of AI. Our customers include the largest companies in telecom, healthcare, insurance, manufacturing, consumer goods, and global social organizations.
We research and deploy technologies that power AI-native operations — both for our partners and for Distyl itself. Our work spans research into self-constructing systems, the development of the most reliable execution of AI systems, and products that transform mission-critical workflows. As a result, Distyl's technologies affect some of the world's largest operations — from hundreds of millions of consumer interactions to tens of millions of supply chain transactions and millions of patient journeys.
Distyl is backed by leading investors including Lightspeed Venture Partners, Khosla Ventures, Coatue, DST Global, and the board-members of 20+ F500s. The results reflect this approach: a 100% production deployment success rate for our customers and one of the few enterprise AI companies to run a profitable business.

What We Are Looking For

At Distyl we’re pushing the envelope of AI utilization in enterprise. This requires creative researchers who don’t just want to drive incremental improvements on benchmarks or optimize an existing process but instead are looking to creatively redefine how software is used.

Our researchers come from many academic backgrounds but have strong research track records, operate in an AI-native way, and would be bored staying on the rails of a traditional research org.

Key Responsibilities
  • The Post-Training team focuses on adapting foundation models to real-world performance and alignment requirements. Researchers develop and evaluate techniques such as supervised fine-tuning, preference optimization (DPO, RLHF, RLAIF), and continual adaptation to align models with Distyl’s enterprise systems. The goal is to bridge raw model capability with trustworthy, contextually aligned system behavior

  • Researchers in Post-Training investigate new methods for aligning large models with human and system-level objectives. They explore trade-offs between generalization and specialization, data efficiency and robustness, capability and controllability. Their work informs how Distyl leverages foundation models safely, effectively, and at scale across industries

What We Require
  • Deep Understanding of Post-training Techniques: Familiarity with supervised fine-tuning, preference optimization (RLHF/DPO), LoRA/PEFT, and instruction-tuning pipelines.

  • Experience Adapting Frontier Models: You’ve tuned or adapted LLMs/SLMs to specialized domains or behaviors through data curation, reward modeling, or continual pretraining.

  • Experience Building with Models, Not Just Building Models: We develop intelligent systems using models rather than training or fine-tuning them. Ideal candidates have expertise in compound AI systems, agentic collaboration, and associated techniques (ensembling, ReAct, graph-of-thoughts, etc.).

  • Proven Track Record of Research Results: Whether you’ve published in top journals, posted amazing work on twitter, or somewhere else we want to see what you've done.

  • Uses AI Every Day: Before you can revolutionize someone else’s workflow, you need to revolutionize yours. You should be using tools like ChatGPT, Cursor, and Perplexity to accelerate your workflow.

  • Strong Programming and Data Analysis Skills: While you might not consider yourself a software engineer you need to be able to build prototypes of your ideas and then perform the experiments to prove the effectiveness to a F500 Head of AI.

  • Biases Towards Showing vs Telling: Our customers want to see the power of AI today vs discuss the most elegant idea that will take 5 years to realize.

What We Offer
  • The base salary range for this role is $150K – $250K, depending on experience, location, and level. In addition to base compensation, this role is eligible for meaningful equity, along with a comprehensive benefits package

  • 100% covered medical, dental, and vision for employees and dependents

  • 401(k) with additional perks (e.g., commuter benefits, in‑office lunch)

  • Access to state‑of‑the‑art models, generous usage of modern AI tools, and real‑world business problems

  • Ownership of high‑impact projects across top enterprises

  • A mission‑driven, fast‑moving culture that prizes curiosity, pragmatism, and excellence

Distyl has offices in San Francisco and New York. This role follows a hybrid collaboration model with 3+ days per week (Tuesday–Thursday) in‑office.

We believe diverse perspectives make our work stronger and more impactful. We are an equal opportunity employer and evaluate all applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, disability, veteran status, or any other legally protected characteristic. We encourage candidates from all backgrounds to apply.

Skills Required

  • Deep understanding of post-training techniques, including supervised fine-tuning and RLHF.
  • Experience adapting frontier models to specialized domains or behaviors.
  • Proven track record of research results published in top journals or notable platforms.
  • Strong programming and data analysis skills for building and testing prototypes.
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, California
45 Employees

What We Do

Distyl AI is on a mission to create the most customer-centric AI company that revolutionizes how enterprises thrive in the AI-assisted economy. We collaborate with leading institutions worldwide to enhance their AI readiness and build dependable, seamlessly integrated AI-driven solutions tailored to their distinct data, workflows, and employee requirements. Using our proprietary platform of in-house tools and alliances such as the one with OpenAI, our team diligently develops and deploys generative AI products that adhere to the highest standards of integrity and reliability, empowering the institutions that require them the most.

Similar Jobs

Optimum Logo Optimum

Product Manager

AdTech • Digital Media • Internet of Things • Marketing Tech • Mobile • Retail • Software
Hybrid
New York, NY, USA
9000 Employees
123K-203K Annually

Optimum Logo Optimum

Construction Project Specialist

AdTech • Digital Media • Internet of Things • Marketing Tech • Mobile • Retail • Software
Hybrid
Bethpage, NY, USA
9000 Employees
64K-106K Annually

Wipfli Logo Wipfli

Senior Consultant

Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
Remote or Hybrid
United States
3000 Employees
80K-108K Annually

Wells Fargo Logo Wells Fargo

Quant Engineer

Fintech • Financial Services
Hybrid
2 Locations
205000 Employees
215K-355K Annually

Similar Companies Hiring

Fairly Even Thumbnail
Hardware • Other • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account