The Role
The role involves refining and personalizing Luma's multimodal AI models, improving their controllability and adaptability for creative workflows, while collaborating with cross-functional teams.
About Luma AI
Luma's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality is critical for intelligence. To go beyond language models and build more aware, capable, and useful systems, the next step function change will come from vision. So, we are working on training and scaling up multimodal foundation models for systems that can see and understand, show and explain, and eventually interact with our world to effect change.
About the Role
This is a foundational opportunity to refine, personalize, and build the final capabilities and control interface of Luma’s foundation models and drive real-world value.
You’ll sit at the intersection of research, product, and partnerships, helping close the gap between state-of-the-art and production-ready. Your mission is to make our video foundation models more expressive, controllable, and personalized – solving the “last mile” challenges demanded by top-tier creative workflows.
What You'll Do
You will work as a full-stack applied researcher across modeling, data, systems, and evaluation to adapt models and deploy them to production.
- Controllability and Features: You will leverage a toolkit spanning SFT, RL, personalization, distillation, control adapters, and more, to develop and maintain model variants purpose-built for user environments and creative partners.
- Personalization: Architect the data engine for rapid adaptation. You will leverage proprietary, vertical-specific datasets to create specialized finetunes and improve future training recipes, ensuring our models rely on data that reflects real-world use cases.
- End-User Quality: You will define and drive end-user quality – setting success metrics, building user-aligned evaluations, and iterating on the model/data/evals loop to meet strict fidelity and reliability targets in specific enterprise verticals.
- Cross-functional Collaboration: Partner closely with Product, Research, and Design to translate creative intent and user feedback into model behavior, intuitive controls, and production-ready capabilities for users and partners.
Who You Are
- Product-Obsessed Researcher/Engineer: You treat end users and partners as collaborators and enjoy solving specific “last mile” problems, not just optimizing public metrics.
- ML Expert: Strong ML fundamentals with deep experience in visual generative models (diffusion/transformers or related architectures). Ideal candidates also have a deep understanding of at least one of the following: fine-tuning, personalization, domain adaptation, data curation, targeted distillation, interpretability, or human-feedback-driven refinement.
- Hands-On Builder: Strong Python and deep learning engineering skills (ideally PyTorch), comfortable moving between research prototypes and production systems.
- Contributions to state-of-the-art models in image/video generation.
- Experience collaborating with creative partners (VFX, animation, film, design tools).
- Track record building workflows/tools that materially improve iteration speed and evaluation rigor.
- Familiarity with large-scale training infrastructure and distributed systems (Ray, Slurm, Kubernetes).
The base pay range for this role is $200,000 – $450,000 per year.
About Luma
Luma’s mission is to build unified general intelligence that can generate, understand, and operate in the physical world.
Skills Required
- Strong ML fundamentals with deep experience in visual generative models
- Hands-on experience with Python and deep learning engineering
- Understanding of fine-tuning, personalization, domain adaptation
- Experience with large-scale training infrastructure and distributed systems
Luma AI Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Luma AI and has not been reviewed or approved by Luma AI.
- Fair & Transparent Compensation: Pay is considered competitive for senior technical and some non-technical roles, with posted bands indicating strong market alignment in key locations. Publicly listed ranges provide directional clarity for certain roles and markets.
- Equity Value & Accessibility: Equity is positioned as a meaningful component of total compensation, and language in postings emphasizes ownership alongside cash pay. Signals indicate equity can be significant in senior roles where competition for talent is intense.
- Healthcare Strength: Core medical, dental, and vision coverage are referenced in multiple postings, aligning with standard expectations for venture-backed tech companies. These inclusions suggest baseline health benefits are part of the package.
The Company
What We Do
Luma AI’s mission is to build Multimodal AGI: AI that can generate, understand, and operate in the physical world. We develop multimodal models across video, 3D, and generative media, and ship them in products like Dream Machine to help creators and teams turn ideas into compelling visuals—fast.









