Principal Software Engineer, ML Infrastructure

Reposted 20 Days Ago
Foster City, CA
Hybrid
373K-448K Annually
Senior level
Artificial Intelligence • Machine Learning • Robotics • Software • Transportation • Design • Manufacturing
Zoox is an autonomous mobility company that’s created a purpose-built robotaxi to give the world a better way to ride.
The Role
Lead the design and development of ML infrastructure for autonomous driving, collaborating with ML teams and mentoring engineers in the process.
Summary Generated by Built In
Zoox is on a mission to reimagine transportation and ground-up build autonomous robotaxis that are safe, reliable, clean, and enjoyable for everyone. We are still in the early stages of deploying our robotaxis, and it's a great time to join Zoox and make a significant impact on executing this mission. The ML Infrastructure team at Zoox plays a crucial role in enabling innovations in ML and CV and making autonomous driving as seamless as possible.

The Opportunity
We are seeking a deeply technical, influential, and hands-on Principal Software Engineer to shape and build our next-generation ML Infrastructure and significantly reduce the time to develop and deploy large-scale ML and Foundational models to our robotaxi. You will lead the design and development of our Data, Compute, Model Training, and Serving Infrastructure. You will work across all AI teams within Zoox, including Perception, Prediction, Planner, Simulation, Collision Avoidance, and have the opportunity to significantly push the boundaries of how ML is practiced within Zoox.

We build and operate the data infrastructure responsible for ingesting PBs of sensor data and the systems used to assemble training datasets. We operate the compute infrastructure that powers Zoox’s model training, serving, and large-scale validation pipelines across tens of thousands of GPUs. We also operate the base layer of ML tools, deep learning frameworks, and inference systems used by our applied research teams for in- and off-vehicle ML use cases. You will lead a team of strong software engineers and act as a force multiplier for our teams. You can learn more about our ML Infrastructure here and our stack behind autonomous driving here. 

In this role, you will

  • Vision: Develop and execute a strategic vision for ML Infrastructure that will unlock innovation in autonomous driving and enhance our rider experience. 
  • Technical acumen: Lead the design and implementation of cutting-edge infrastructure spanning all stages of an ML lifecycle from data preparation to training to evaluation, deployment, and serving. 
  • Partnership: Collaborate closely with cross-functional teams, including ML researchers, software engineers, data engineers, and hardware engineers, to define requirements and align on architectural decisions.
  • Mentorship: Enable the engineers in the team to grow their careers by providing technical guidance and mentorship.

Qualifications

  • Experience building and managing large-scale ML infrastructure that powers the development of large-scale ML models
  • Excellent leadership skills with a demonstrated ability to lead high-performing engineering teams.
  • Strong experience with training frameworks like PyTorch, JAX, etc., leveraging GPUs efficiently for distributed model training.
  • Experience with GPU-accelerated inference using TensorRT, Ray Serve, or similar frameworks.
  • Proficient in Python and/or C++.

Bonus Qualifications

  • Experience enabling the development and deployment of large-scale Foundation models.
  • Experience working on large-scale data infrastructure and big data processing frameworks like Apache Spark.
  • Experience working in the AV domain supporting Perception, Prediction, Planner et al.

Compensation
There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. The salary will range from $324,000-$470,000. A sign-on bonus may be part of a compensation package. Compensation will vary based on geographic location, job-related knowledge, skills, and experience.  

Zoox also offers a comprehensive package of benefits including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.

Top Skills

Spark
C++
Jax
Python
PyTorch
Ray Serve
Tensorrt
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Foster City, CA
2,500 Employees
Year Founded: 2014

What We Do

Zoox is an autonomous mobility company that was founded to provide a safer, cleaner, and more enjoyable future on the road. To achieve that goal, the company has spent the past 10 years creating a purpose-built robotaxi that gives the world a better way to ride.

Why Work With Us

At Zoox, we are working to solve one of the greatest technological challenges of our generation.
From the beginning, we have been focused on our goal of reimagining transportation from the ground up. We are a mission-driven community of innovators working together to create a safer, cleaner, and more enjoyable future on the road.

Gallery

Gallery

Similar Jobs

Datadog Logo Datadog

Solutions Architect

Artificial Intelligence • Cloud • Security • Software • Cybersecurity
Easy Apply
Remote or Hybrid
4 Locations
6500 Employees
161K-237K Annually

Attain Logo Attain

Senior Software Engineer

AdTech • Fintech • Financial Services
Easy Apply
Hybrid
Redwood City, CA, USA
145 Employees

Commerce Logo Commerce

Lead Software Engineer

Artificial Intelligence • Cloud • Consumer Web • eCommerce • Information Technology • Software
In-Office
2 Locations
1200 Employees
134K-275K Annually

ZS Logo ZS

Consultant

Artificial Intelligence • Healthtech • Professional Services • Analytics • Consulting
Hybrid
South San Francisco, CA, USA
13000 Employees
155K-190K Annually

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account