The Role
You will work on building generative AI inference infra at an unprecedented scale. You’ll be responsible for Luma’s REST APIs, and backend systems. You'll work closely with the research team to rapidly prototype, build, and optimize inference pipelines, and compute.
Summary Generated by Built In
You will work on building generative AI inference infra at an unprecedented scale. You’ll be responsible for Luma’s REST APIs, and backend systems. You will build infrastructure with thousands of GPUs in production serving SOTA machine-learning models to millions of luma users. You'll work closely with the research team to rapidly prototype, build, and optimize inference pipelines, and compute.
Experience
- Requirement of 5+ years of experience as an industry software engineer, we will not consider new grads for this role.
- Proficiency in Python.
- Experience deploying with Docker and Kubernetes.
- Good understanding of security and authentication.
- Experience designing and shipping state of the art and high-traffic REST APIs.
- Experience deploying ML models is a strong plus but not required.
- Please note this role is not meant for recent grads.
Your application is reviewed by real people.
Top Skills
Python
The Company
What We Do
Luma is a multimedia platform that delivers personalized movie and TV program selections from a range of sources to its viewers.