ML Engineer, Generative Video

Posted 5 Days Ago
New York, Union, IA, USA
In-Office
175K-275K Annually
Junior
Artificial Intelligence • Software
Mirage is redefining short-form video with frontier AI research.
The Role
Design, train, and optimize large-scale video and multimodal models. Improve training and inference efficiency (memory, latency, cost) using distillation, quantization, and pruning. Build and maintain distributed training systems, optimize GPU utilization and throughput, develop experimentation and evaluation tooling, and translate research prototypes into low-latency, production-ready systems while monitoring real-world performance.
Summary Generated by Built In

Mirage is an AI-native video platform that intelligently orchestrates production and editing through natural language. Our models leverage contextual awareness to execute the same creative decisions a professional editor would — dramatically improving productivity for experienced teams, while making video creation accessible to anyone.
We’re an interdisciplinary team addressing some of the most difficult technical and creative challenges in generative media. As an early member of our team, you’ll tackle foundational problems that remain largely unsolved across the industry, driving an outsized impact on the future of creative expression.

More about us

Product (Captions by Mirage)

Research (Seeing Voices, technical-white-paper)

Updates (Mirage on X / twitter)

TechCrunch, Forbes AI 50, Fast Company (press)

Our Investors

We’re very fortunate to have some the best investors and entrepreneurs backing us, including Index Ventures, Kleiner Perkins, Sequoia Capital, Andreessen Horowitz, General Catalyst, Uncommon Projects, Kevin Systrom, Mike Krieger, Lenny Rachitsky, Antoine Martin, Julie Zhuo, Ben Rubin, Jaren Glover, SVAngel, 20VC, Ludlow Ventures, Chapter One, and more.

Please note that all of our roles will require you to be in-person at our NYC HQ (located in Union Square)

About the Role
Mirage is seeking an ML Engineer to build and scale the systems powering our video generation models. You’ll work on novel modeling approaches, training objectives, scaling strategies, and inference optimization and efficiency to bring cutting-edge models into production.

This role sits at the intersection of research and systems engineering, focusing on making advanced models faster, more efficient, and capable of ultra-low latency, real-time generation.

Responsibilities

  • Train and optimize large-scale video and multimodal models

  • Improve efficiency across training and inference (memory, latency, cost)

  • Implement techniques such as distillation, quantization, and pruning to aggressively accelerate diffusion and autoregressive generation

  • Build and maintain distributed training systems

  • Optimize GPU utilization, parallelism, and throughput

  • Develop tooling for experimentation, evaluation, and debugging

  • Translate research models into robust, production-ready systems

  • Monitor and improve model performance in real-world usage

What makes you a great fit

  • BS/MS/PhD in CS, ML, or related field

  • 2+ years of professional industry experience

  • Strong experience in deep learning systems and infrastructure

  • Expertise in PyTorch, CUDA, Triton, and distributed training (FSDP, etc.)

  • Experience scaling and optimizing large models under low-latency inference constraints

  • Strong debugging and performance profiling skills

  • Ability to move quickly from prototype to production

Benefits:
  • Comprehensive medical, dental, and vision plans

  • 401K with employer match

  • Commuter Benefits

  • Catered lunch multiple days per week

  • Dinner stipend every night if you're working late and want a bite!

  • Grubhub subscription

  • Health & Wellness Perks

  • Multiple team offsites per year with team events every month

  • Generous PTO policy

Captions provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

Please note benefits apply to full time employees only.

Skills Required

  • In-person work at NYC HQ (Union Square)
  • BS/MS/PhD in Computer Science, Machine Learning, or related field
  • 2+ years of professional industry experience
  • Strong experience in deep learning systems and infrastructure
  • Expertise in PyTorch, CUDA, Triton, and distributed training (FSDP, etc.)
  • Experience scaling and optimizing large models for low-latency inference
  • Strong debugging and performance profiling skills
  • Ability to move quickly from prototype to production
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York, New York
75 Employees
Year Founded: 2021

What We Do

We’re building full-stack foundation models and products that are changing video creation, production and editing more broadly. Over 20 million creators and businesses use Mirage’s products to reach their full creative and commercial potential. We are a rapidly growing team of ambitious, experienced, and devoted engineers, researchers, designers, marketers, and operators based in NYC. As an early member of our team, you’ll have an opportunity to have an outsized impact on our products and our company's culture.

Gallery

Gallery

Similar Jobs

MetLife Logo MetLife

Customer Care Advocate Disability Intake - Cary, NC 9.21.26 - 18274

Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Remote or Hybrid
United States
43000 Employees
42K-42K Annually

MetLife Logo MetLife

Customer Care Advocate Disability Intake - Omaha, NE 9.14.26 - 18270

Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Remote or Hybrid
United States
43000 Employees
42K-42K Annually

MetLife Logo MetLife

Customer Care Advocate Disability Intake - Cary, NC 9.14.26 - 18272

Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Remote or Hybrid
United States
43000 Employees
42K-42K Annually

ChowNow Logo ChowNow

Back-end Engineer

Food • Software
Easy Apply
Remote or Hybrid
USA
208 Employees
170K-221K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account