Forward Deployed Engineer - ML

Reposted 17 Days Ago
Be an Early Applicant
Stockholm, SWE
In-Office
Mid level
Machine Learning • Generative AI
The Role
The Forward Deployed Engineer will work with AI companies to optimize production workloads, contribute to open-source projects, and collaborate with customers on technical solutions in ML.
Summary Generated by Built In
About Us:

AI needs a new infrastructure layer. We're building it at Modal.

Every era of computing brought new workloads that previous infrastructure couldn't support: mainframes, databases, and the cloud. Each time, the company that rebuilt the layer underneath defined the decade. AI is no different, except it touches everything instead of one slice, and the window to build the layer underneath it is open right now.

Our customers include category-defining companies like Lovable, Ramp, Cognition, DoorDash, and Suno. They rely on Modal for instant GPU access, sub-second container starts, and native storage, so it's simple to serve low-latency inference, fine-tune models, and access production-ready sandboxes at scale.

We recently raised a $355M Series C at a $4.65B valuation, led by General Catalyst and Redpoint Ventures. We've crossed $300M+ ARR and grown fivefold since September.

Our team includes creators of popular open-source projects (e.g.,Seaborn,Luigi), academic researchers, international olympiad medalists, and experienced engineering and product leaders with decades of experience.

The Role:

We're looking for Forward Deployed ML Engineers who want to work at the intersection of deep technical work and direct customer impact. As an ML FDE, you'll partner with leading AI companies and foundation model labs to help them achieve state-of-the-art performance on their most demanding workloads — LLM serving, model training (SFT, RLHF), audio pipelines, scientific computing, and more. You're helping teams reach outcomes most engineers can't on their own.

The FDE team today includes world-class software engineers, computational scientists, ML engineers, and former founders. We're looking for people with strong engineering fundamentals, deep curiosity across the AI stack, and energy for working directly with customers on hard problems. You will:

  • Work hands-on with companies like Suno, Lovable, Cognition, and Meta to architect and optimize production AI workloads on Modal

  • Contribute to open-source projects — members of the team are active contributors to SGLang — and publish technical content that demonstrates Modal's capabilities across the AI stack

  • Collaborate with Modal's product and sales teams, contributing to the platform as both an engineer and a product stakeholder

  • Build trusted relationships with technical leaders (CTOs, VPs of Engineering, ML leads) at companies doing frontier AI work

  • Conduct technical demos, experiments, and proof-of-concepts that make Modal's performance advantages tangible

Requirements:
  • 2+ years of professional ML engineering experience, ideally with hands-on work in inference optimization, model training, GPU programming, or ML infrastructure

  • Familiarity with the serving (e.g., vLLM, SGLang) and training (e.g., slime, verl, TRL) toolchains. You don't need all of these, but you should be able to go deep on at least one.

  • Strong communicator who can go deep on technical architecture with an engineering team and clearly articulate tradeoffs to technical leadership

  • Genuine interest in working directly with customers — you find it energizing to understand someone else's problem and help them solve it

  • Bonus: side projects, open-source contributions, or published work you're proud of in ML or systems performance

  • Willing to work in-person in Stockholm

Skills Required

  • 2+ years of professional ML engineering experience
  • Hands-on work in inference optimization, model training, GPU programming, or ML infrastructure
  • Familiarity with ML toolchains (e.g., vLLM, SGLang)
  • Strong communication skills, able to articulate technical architecture
  • Genuine interest in customer interaction and problem solving
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York City, New York
50 Employees

What We Do

Deploy generative AI models, large-scale batch jobs, job queues, and more on Modal's platform. We help data science and machine learning teams accelerate development, reduce costs, and effortlessly scale workloads across thousands of CPUs and GPUs. Our pay-per-use model ensures you're billed only for actual compute time, down to the CPU cycle. No more wasted resources or idle costs—just efficient, scalable computing power when you need it.

Similar Jobs

CSC Logo CSC

Junior redovisningskonsult för internationella klienter till CSC Global

Fintech • Legal Tech • Software • Financial Services • Cybersecurity • Data Privacy
Hybrid
Stockholm, SWE
8500 Employees
50K-70K Annually

Datadog Logo Datadog

Sales Engineer

Artificial Intelligence • Cloud • Security • Software • Cybersecurity
Easy Apply
Remote or Hybrid
2 Locations
6500 Employees

ServiceNow Logo ServiceNow

Sales Director

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Stockholm, SWE
29000 Employees

Palantir Technologies Logo Palantir Technologies

Deployment Strategist

Artificial Intelligence • Information Technology • Software
In-Office
Stockholm, SWE
4400 Employees

Similar Companies Hiring

ClickMint Thumbnail
AdTech • eCommerce • Marketing Tech • Generative AI
Malibu, CA
9 Employees
Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
LTX Thumbnail
Conversational AI • Generative AI
Jerusalem, Israel
360 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account