Machine Learning Engineer — Distillation

Posted Yesterday
Hiring Remotely in World Golf Village, FL
In-Office or Remote
Mid level
Artificial Intelligence • Information Technology • Software
The Role
Design and implement knowledge distillation pipelines, optimize training and inference performance, and collaborate with research on production-ready ML models.
Summary Generated by Built In
About the Role

We’re looking for a Machine Learning Engineer focused on model distillation to help us build smaller, faster, and more efficient models without sacrificing quality. You’ll work at the intersection of research and production—taking cutting-edge techniques and turning them into systems that scale.

This is a hands-on role with real ownership: you’ll design distillation pipelines, run large-scale experiments, and ship models used in production.

What You’ll Do
  • Design and implement knowledge distillation pipelines (teacher–student, self-distillation, multi-teacher, etc.)

  • Distill large foundation models into smaller, faster, and cheaper models for inference

  • Run and analyze large-scale training experiments to evaluate quality, latency, and cost tradeoffs

  • Collaborate with research to translate new distillation ideas into production-ready code

  • Optimize training and inference performance (memory, throughput, latency)

  • Contribute to internal tooling, evaluation frameworks, and experiment tracking

  • (Optional) Contribute back to open-source models, tooling, or research

What We’re Looking For
  • Strong background in machine learning or deep learning

  • Hands-on experience with model distillation (LLMs or other neural networks)

  • Solid understanding of training dynamics, loss functions, and optimization

  • Experience with PyTorch (or JAX) and modern ML tooling

  • Comfort running experiments on multi-GPU or distributed setups

  • Ability to reason about model quality vs. performance tradeoffs

  • Pragmatic mindset: you care about shipping, not just papers

Nice to Have
  • Experience distilling LLMs or large sequence models

  • Experience with inference optimization (quantization, pruning, kernels, etc.)

  • Familiarity with evaluation for language models

  • Open-source contributions or research publications

  • Experience in early-stage or fast-moving startups

Why Join
  • Work on core model quality and cost efficiency—not side projects

  • High ownership and direct impact on product and roadmap

  • Small, senior team with strong research + engineering culture

  • Competitive compensation + meaningful equity

  • Remote-friendly, async-first environment

Top Skills

Deep Learning
Distributed Computing
Jax
Machine Learning
Model Distillation
Multi-Gpu
Pruning
PyTorch
Quantization
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, California
20 Employees
Year Founded: 2023

What We Do

We enable serverless inference via our GPU orchestration and model load-balancing system. We unlock fine-tuning by enabling organizations to size their server fleet to throughput needs, not number of models in the catalogue.

See it in action on our public cloud, which offers inference for 10k+ open weight models.

Similar Jobs

Tiger Analytics Logo Tiger Analytics

Machine Learning Engineer

Big Data • Analytics • Business Intelligence • Big Data Analytics
Remote
United States
5000 Employees

Samsara Logo Samsara

Senior Machine Learning Engineer

Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
United States
4000 Employees
135K-228K Annually

Adapter Logo Adapter

Machine Learning Engineer

Information Technology • Software
Easy Apply
Remote
United States
12 Employees
180K-225K Annually

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account