In this role, you will:
- Evaluate new distributed system paradigms and technologies to meet Zoox’s ever-growing computational and storage needs.
- Strike a balance between incremental improvements to Zoox’s existing in-house HPC infrastructure and greenfield services and abstractions.
- Create production-grade web service APIs, SDKs, and other tools to provide a world-class developer experience for all of Zoox’s software teams.
Qualifications
- 7+ years of experience
- Experience with Ray.io, particularly Ray Core and Ray Data
- Experience with Kubernetes, particularly for heterogeneous workloads and clusters
- Experience with Ray.io and Kubernetes deployed on Amazon Web Services (AWS) or other similar cloud providers such as Azure or GCP
- Proficiency with Python
Bonus Qualifications
- Exposure to machine learning workloads (training, inference, data generation, etc.) from the perspective of a compute infrastructure service provider
- Experience with Kubernetes or SLURM at scale (10k+ nodes)
- Experience with the SLURM workload manager
What We Do
Zoox is an autonomous mobility company that was founded to provide a safer, cleaner, and more enjoyable future on the road. To achieve that goal, the company has spent the past 10 years creating a purpose-built robotaxi that gives the world a better way to ride.
Why Work With Us
At Zoox, we are working to solve one of the greatest technological challenges of our generation.
From the beginning, we have been focused on our goal of reimagining transportation from the ground up. We are a mission-driven community of innovators working together to create a safer, cleaner, and more enjoyable future on the road.