Software Engineer - Site Reliability Engineering

Foster City, CA, USA
Hybrid
140K-230K Annually
Senior level
Artificial Intelligence • Machine Learning • Robotics • Software • Transportation • Design • Manufacturing
Zoox is an autonomous mobility company that’s created a purpose-built robotaxi to give the world a better way to ride.
The Role
The Site Reliability Engineer at Zoox will manage the availability and resilience of services for autonomous vehicles, design systems, and lead incident resolution.
Zoox is seeking a Site Reliability Engineer to help ensure the availability, performance, and resilience of the services that power the development and operation of our autonomous vehicles. In this role, you will own the full lifecycle of our services—from designing fault-tolerant, maintainable systems to deploying, operating, and continuously improving them in production. As a robotics company, Zoox embraces automation at every layer of our infrastructure, and you’ll help drive that ethos forward. You’ll work hands-on with systems that process massive volumes of data and support compute-intensive pipelines running on both CPUs and GPUs. 

In this role, you will:

  • Architect and optimize scalable systems: You will design, implement, and continuously improve highly reliable infrastructure, directly impacting the success and safety of Zoox's autonomous vehicle platform.

  • Build proactive monitoring solutions: You will develop advanced monitoring, alerting, and reporting tools to ensure potential issues are identified and resolved before they affect production.

  • Collaborate across engineering: You will partner closely with software engineering teams to elevate our system architecture, streamline deployment processes, and drive automation initiatives.

  • Lead incident resolution: You will conduct thorough root cause analyses on production issues and rapidly deploy corrective actions to maintain a resilient and stable environment.

  • Ensure business continuity: You will safeguard the company's operations by designing and implementing robust disaster recovery plans to keep the Zoox fleet running smoothly under any circumstances.

Qualifications

  • SRE & Distributed Systems Experience: 5+ years of experience in site reliability engineering or a similar role, with a strong background in managing large-scale distributed systems.

  • Cloud & Infrastructure as Code (IaC): Proven experience operating within major cloud platforms (AWS, GCP, or Azure) and utilizing IaC tools like Terraform, Ansible, Salt, or CloudFormation.

  • Container Orchestration: Technical expertise in deploying, managing, and scaling systems using container orchestration technologies such as Kubernetes.

  • Core Infrastructure Knowledge: Deep, foundational understanding of networking protocols, storage solutions, and database technologies.

  • Programming Proficiency: Strong, demonstrable programming and scripting skills in languages such as Python, Go, C/C++, or Java.

Bonus Qualifications

  • Experience in the automotive or autonomous vehicle industry.

  • Knowledge of security best practices and compliance requirements.

About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.

Accommodations
If you need an accommodation to participate in the application or interview process, please reach out to [email protected] or your assigned recruiter.

A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

Skills Required

  • 5+ years of experience in site reliability engineering or similar role
  • Proven experience operating within major cloud platforms
  • Technical expertise in deploying and managing container orchestration systems
  • Deep understanding of networking protocols and storage solutions
  • Strong programming skills in Python, Go, C/C++, or Java

Zoox Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Zoox and has not been reviewed or approved by Zoox.

  • Healthcare Strength: Healthcare is extensive, with broad medical and vision options, company‑paid disability coverage, and multiple mental‑health resources. Feedback suggests coverage breadth and auxiliary programs support a wide range of needs.
  • Parental & Family Support: Family supports include paid parental leave, additional pregnancy disability time, fertility coverage, and adoption/surrogacy assistance. Backup care and family‑oriented programs further reinforce support across life stages.
  • Wellbeing & Lifestyle Benefits: Day‑to‑day perks are robust, including free daily meals, fitness subsidies, commuter support, and on‑site amenities. Feedback suggests these lifestyle benefits enhance convenience and workplace experience, especially for office‑based roles.

The Company
HQ: Foster City, CA
2,900 Employees
Year Founded: 2014

What We Do

Zoox is an autonomous mobility company that was founded to provide a safer, cleaner, and more enjoyable future on the road. To achieve that goal, the company has spent the past 10 years creating a purpose-built robotaxi that gives the world a better way to ride.

Why Work With Us

At Zoox, we are working to solve one of the greatest technological challenges of our generation. From the beginning, we have been focused on our goal of reimagining transportation from the ground up. We are a mission-driven community of innovators working together to create a safer, cleaner, and more enjoyable future on the road.
