In this role, you will:
- Design and implement visual scene understanding solutions for robotaxis
- Lead end-to-end data strategy, including mining, auto-labeling, and dataset construction to power our ML flywheel
- Build and utilize vision-language models for rare hazard detection and scene reasoning
- Utilize our large-scale data pipelines and ML infrastructure to research, prototype, and deploy solutions that improve driving behavior
- Partner with cross-functional teams to integrate perception signals
Qualifications
- MS or PhD in Computer Science or related field
- Background in deep learning for vision-language models, large language models, object detection, or scene understanding
- Track record of training and deploying state-of-the-art deep learning models
- Hands-on experience with production ML pipelines, including dataset creation, training frameworks, and metrics
- Expertise in Python libraries (PyTorch, NumPy, Pandas)
Bonus Qualifications
- Deep knowledge of cutting-edge computer vision techniques
- Publications in top-tier conferences (CVPR, ICCV, RSS, ICRA)
- Experience integrating large language models into a variety of tasks
What We Do
Zoox is an autonomous mobility company that was founded to provide a safer, cleaner, and more enjoyable future on the road. To achieve that goal, the company has spent the past 10 years creating a purpose-built robotaxi that gives the world a better way to ride.
Why Work With Us
At Zoox, we are working to solve one of the greatest technological challenges of our generation.
From the beginning, we have been focused on our goal of reimagining transportation from the ground up. We are a mission-driven community of innovators working together to create a safer, cleaner, and more enjoyable future on the road.