AI Researcher — Training Optimization

Reposted 24 Days Ago
Hiring Remotely in World Golf Village, FL, USA
In-Office or Remote
Mid level
Artificial Intelligence • Information Technology • Software
The Role
The AI Researcher will develop training optimization techniques, improve large model training efficiency, run experiments and publish research findings.
About the Role

We’re looking for an AI Researcher focused on training optimization to help us push the efficiency, stability, and scalability of large-scale model training. You’ll work at the intersection of research and systems, developing novel techniques to reduce training cost, accelerate convergence, and improve model quality, while validating ideas through rigorous experiments and publications.

This role is ideal for someone who enjoys turning research insights into practical training wins, and who has a track record of publishing applied ML research or a strong ambition to do so.

What You’ll Work On
  • Design and evaluate training optimization techniques for large models (e.g. optimization algorithms, schedulers, normalization, curriculum strategies)

  • Improve training efficiency and stability across long runs and large datasets

  • Research and implement methods such as:

    • Optimizer and scheduler innovations

    • Mixed-precision, low-precision, and memory-efficient training

    • Gradient noise reduction, scaling laws, and convergence analysis

    • Training-time regularization and robustness techniques

  • Run large-scale experiments, analyze results, and translate findings into actionable improvements

  • Author or co-author research papers, technical reports, or blog posts

  • Collaborate closely with infrastructure and inference teams to ensure training decisions translate to real-world performance

What We’re Looking For
  • Strong background in machine learning research, with emphasis on training dynamics and optimization

  • Experience training large neural networks (LLMs, multimodal models, or large sequence models)

  • Publication experience in ML venues (e.g. NeurIPS, ICML, ICLR, ACL, EMNLP, COLM, arXiv) or equivalent high-quality open research

  • Solid understanding of:

    • Optimization theory and practice

    • Backpropagation, gradient flow, and training stability

    • Distributed and large-batch training

  • Proficiency in Python and modern ML frameworks (PyTorch preferred)

  • Ability to independently design experiments and reason from data

Nice to Have
  • Experience with non-standard architectures (e.g. RNN variants, long-context models, hybrid systems)

  • Experience optimizing training on GPUs at scale (FSDP, ZeRO, custom kernels)

  • Contributions to open-source ML or research codebases

  • Comfort operating in fast-moving, ambiguous startup environments

Why This Role
  • Real influence over core model training decisions

  • Freedom to pursue and publish novel research

  • Direct access to large-scale experiments and real production constraints

  • A small, senior team that values thinking deeply and shipping thoughtfully

Top Skills

Python
PyTorch
The Company
HQ: San Francisco, California
20 Employees
Year Founded: 2023

What We Do

We enable serverless inference via our GPU orchestration and model load-balancing system. We unlock fine-tuning by enabling organizations to size their server fleet to throughput needs, not to the number of models in the catalogue. See it in action on our public cloud, which offers inference for 10k+ open-weight models.
