Machine Learning Engineer Intern, Perception

Reposted 3 Days Ago
2 Locations
Hybrid
5.5K-9.5K Monthly
Internship
Artificial Intelligence • Machine Learning • Robotics • Software • Transportation • Design • Manufacturing
Zoox is an autonomous mobility company that’s created a purpose-built robotaxi to give the world a better way to ride.
The Role
The Machine Learning Engineer Intern at Zoox will work on perception systems, developing algorithms for autonomous driving, collaborating with teams, and leveraging advanced machine learning models and datasets.
Zoox’s internship program provides hands-on experience with state-of-the-art technology, mentorship from some of the industry's brightest minds, and the opportunity to play a part in our success. Internships at Zoox are reserved for those who demonstrate outstanding academic performance, activities outside their coursework, aptitude, curiosity, and a passion for Zoox's mission.

Perception at Zoox is the "Retina of Zoox" — the system responsible for understanding the world around the autonomous vehicle.

As an MLE intern working on Perception, you may be assigned to one of the following teams:

On the Offline Driving Intelligence team, you will develop advanced multimodal large language models that enhance scenario understanding and driving. You'll develop and fine-tune models with driving data, ensuring they can efficiently identify hazards, interpret driving restrictions, drive, and answer questions about the scenario. Working alongside world-class engineers and researchers, you'll leverage premium sensor data and cutting-edge infrastructure to validate your algorithms in real-world conditions, directly impacting the productivity, safety, and capability of Zoox's autonomous system.

On the Perception Attributes team, you will collect and generate datasets for specialized vehicle classification and semantic enrichment; design and frame machine learning problems for real-world autonomous driving scenarios; and train and evaluate state-of-the-art machine learning models with a focus on computer vision. You will also collaborate with engineers to deploy models for real-time inference on our vehicles and contribute to improving our vehicle's ability to recognize and respond to emergency vehicles, school buses, construction vehicles, and other specialized road actors.

On the Perception Scene Understanding team, you will develop advanced ML models that perceive our vehicle's surroundings to identify hazards and driving restrictions. You will use vision-language models to detect rare events and ensure safe driving in these situations. You'll work with state-of-the-art machine learning models that operate in real time on our robotaxi platform with minimal latency. Collaborating with world-class engineers and researchers across sensors, planning, and other teams, you'll have access to premium sensor data and cutting-edge infrastructure to validate your algorithms in real-world conditions.

On the Occupancy and Rare Events team, you will develop multimodal foundation models that serve as the common backbone for on-vehicle perception, enhancing the system's ability to detect long-tail events and generalize to new geofences. In this role, you will develop effective tokenization techniques for vision, LiDAR, and radar modalities and leverage LLM techniques to align token embeddings across modalities into a common feature space that supports various 3D tasks (detection, segmentation, tracking, feature matching, dense depth). You'll collaborate with top-notch engineers across the PCP, MLInfra, and Offboard Driving Intelligence teams, using Zoox's large-scale dataset to train and evaluate models that directly impact the autonomous system's real-world performance.

On the Perception Optimization team, you will build optimized inference pipelines for on-bot algorithms. A major focus of optimization is ML models: techniques such as quantization and pruning, along with transformer-specific optimizations such as token pruning, token merging, and layer pruning, are used to deploy large models on the bot so that they run in real time. In this role, you will experiment with optimizing SOTA large ML models to fit into on-bot compute, using both post-training optimization (e.g., quantization) and architectural approaches (e.g., token merging).

Requirements:

  • Currently working towards a B.S., M.S., Ph.D., or advanced degree in a relevant engineering program
  • Must be returning to school to continue your education upon completing this internship
  • Good academic standing
  • Able to commit to a 12-week internship beginning in May or June 2026
  • At least one previous industry internship, co-op, or project completed in a relevant area
  • Ability to relocate to the Bay Area, California or Boston for the duration of the internship
  • Interns at Zoox may not use any proprietary information they work on in their thesis or in any work published with their university, and may not distribute it to anyone outside of Zoox

Qualifications (It’s helpful if you meet a majority of the following qualifications, but it isn’t a requirement):

  • Advanced understanding of Python or C++ (C++ preferred)
  • Experience with production ML pipelines: dataset creation, labeling, training, metrics
  • Experience training/fine-tuning MLLMs, or at least LLMs (SFT/RL)
  • Experience with Vision-Language Models
  • Experience deploying models with TensorRT
  • Experience with Neural Network design and implementation
  • Experience working with LiDAR, Camera and Radar data
  • Experience building and processing large-scale datasets
  • GPU/CUDA programming experience

Bonus Qualifications:

  • Experience with multimodal foundation model optimization techniques
  • Experience in algorithm development for Autonomous Driving software

Compensation:
The monthly salary range for this position is $5,500 to $9,500. Compensation will vary based on geographic location and level of education. Additional benefits may include medical insurance and a housing stipend (relocation assistance will be offered based on eligibility).

About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.

Accommodations
If you need an accommodation to participate in the application or interview process, please reach out to [email protected] or your assigned recruiter.

A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

Top Skills

C++
CUDA
GPU
Python
TensorRT

The Company
HQ: Foster City, CA
2,500 Employees
Year Founded: 2014

What We Do

Zoox is an autonomous mobility company that was founded to provide a safer, cleaner, and more enjoyable future on the road. To achieve that goal, the company has spent the past 10 years creating a purpose-built robotaxi that gives the world a better way to ride.

Why Work With Us

At Zoox, we are working to solve one of the greatest technological challenges of our generation.
From the beginning, we have been focused on our goal of reimagining transportation from the ground up. We are a mission-driven community of innovators working together to create a safer, cleaner, and more enjoyable future on the road.
