Research Engineer, Machine Learning

Posted 2 Days Ago
Palo Alto, CA
Hybrid
Mid level
Artificial Intelligence
The Role
As a Research Engineer in Machine Learning, you'll optimize large-scale ML systems, integrate research with production, and conduct experiments on deep-learning techniques, all while collaborating closely with Research Scientists.
Summary Generated by Built In
About Mistral 

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users.

We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.

Role Summary 

About the Research Engineering team

The team spans Platform (shared infra & clean code) and Embedded (inside research squads). Engineers can move along the research↔production spectrum as needs or interests evolve.

As a Research Engineer – ML track, you’ll build and optimise the large-scale learning systems that power our open-weight models. Working hand-in-hand with Research Scientists, you’ll either join:

- Platform RE Team: Enhance the shared training framework, data pipelines and cluster tooling used by every team; or
- Embedded RE Team: Sit inside a research squad (Alignment, Pre-training, Multimodal, …) and turn fresh ideas into repeatable, scalable code.


What will you do

• Accelerate researchers by taking on the heavy parts of large-scale ML pipelines and building robust tools.
• Interface cutting-edge research with production: integrate checkpoints, streamline evaluation, and expose APIs.
• Conduct experiments on the latest deep-learning techniques (sparsified 70 B + runs, distributed training on thousands of GPUs).
• Design, implement and benchmark ML algorithms; write clear, efficient code in Python.
• Deliver prototypes that become production-grade components for Le Chat and our enterprise API.

About you

• Master’s or PhD in Computer Science (or equivalent proven track record).
• 4 + years working on large-scale ML codebases.
• Hands-on with PyTorch, JAX or TensorFlow; comfortable with distributed training (DeepSpeed / FSDP / SLURM / K8s).
• Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops.
• Strong software-design instincts: testing, code review, CI/CD.
• Self-starter, low-ego, collaborative.


What we offer

  • 💰 Competitive salary and equity.
  • 🚑 Healthcare: Medical/Dental/Vision covered for you and your family.
  • 👴🏻 Pension : 401K (6% matching)
  • 🏝️ PTO : 18 days 
  • 🚗 Transportation: Reimburse office parking charges, or $120/month for public transport
  • 🏀 Sport: $120/month reimbursement for gym membership
  • 🥕 Meal stipend: $400 monthly allowance for meals (solution might evolve as we grow bigger)
  • 🌎 Visa sponsorship 
  • 🤝 Coaching: we offer BetterUp coaching on a voluntary basis

Top Skills

Deepspeed
Fsdp
Jax
K8S
Python
PyTorch
Slurm
TensorFlow
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Paris
92 Employees
Year Founded: 2023

What We Do

Fast, open-source and secure language models. Facilitated specialisation of models on business use-cases, leveraging private data and usage feedback.

Built from a world-class team in Europe, targeting global market. Join the team ! https://jobs.lever.co/mistral/

Similar Jobs

In-Office
San Francisco, CA, USA
6 Employees

X, The Moonshot Factory Logo X, The Moonshot Factory

Senior Applied ML Research Engineer, Early Stage Project

Artificial Intelligence • Greentech • Hardware • Internet of Things • Transportation • Cybersecurity • Automation
In-Office
Mountain View, CA, USA
2277 Employees
165K-238K Annually

Aldea Logo Aldea

Research Engineer (Machine Learning)

Artificial Intelligence • Information Technology • Software
In-Office or Remote
San Francisco, CA, USA
6 Employees

Sunday (sunday.ai) Logo Sunday (sunday.ai)

Scientist

Artificial Intelligence • Information Technology • Robotics • Software
In-Office
Mountain View, CA, USA
70 Employees

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account