Technical Program Manager, ML Developer Experience and Infrastructure Reliability

Reposted 2 Days Ago
Be an Early Applicant
Mountain View, CA
In-Office
230K-292K Annually
Senior level
Automotive
The Role
As a Technical Program Manager, oversee ML development processes, manage infrastructure reliability, and ensure effective project execution to enhance developer experience.
Summary Generated by Built In

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo’s fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.

Waymo’s Technical Program Managers and Program Managers are accountable for Waymo’s roadmap execution by providing thoughtful cross-functional planning, clarity, and proactive risk management. In the face of complex technical and operational challenges with no established playbooks to follow, we act with thoughtful urgency, driving conversations, discussions, and outcomes. Our team partners closely with every function of Waymo to structure, own and drive work towards real-world deployments of the Waymo Driver across platforms and geographies.

In this hybrid role, you will report to a Technical Program Management Director. 

You will:

  • Drive the "Golden Path" for ML: Lead cross-functional execution to define and invest in a simplified "golden path" for ML development for Onboard and Waymo Foundation Model (WaymoFM) development, targeting the reduction of friction and low reliability in the "inner loop"
  • Manage Reliability Operations: Ensure smooth day-to-day operations of the reliability triage ecosystem, keeping queues healthy through interaction with rotation members and driving automation of queue management
  • Program Implementation for Infra Stability: Drive "contract-based reliability" programs across Onboard domains
  • Bridge ML and Infra: Facilitate communication and alignment between ML research, infrastructure foundations, and onboard teams to resolve blockers in core workflows like root-causing brittle pipelines
  • Strategic Roadmap Tracking: Contribute to strategic planning and track project progress, risks, and KPIs related to ML developer productivity and infrastructure reliability for leadership reporting
  • Resolve Systemic Blockers: Proactively identify and resolve roadblocks in the ML development cycle, such as data fragmentation and complex tooling that currently hinders developer velocity

You have:

  • Technical Education: A Bachelor's degree in Computer Science, Engineering, or a related technical field
  • TPM Experience: 5+ years of experience as a Technical Program Manager in a software engineering or large-scale infrastructure environment
  • ML/Reliability Track Record: Proven track record of managing complex technical projects involving machine learning infrastructure, developer experience (DevX), or site reliability engineering (SRE)
  • Program Ownership: Experience owning and driving programs end-to-end, including managing timelines, risks, and dependencies across multiple senior stakeholders
  • Analytical Problem Solving: Strong analytical and technical judgment skills, with the ability to use data to diagnose and solve systemic engineering bottlenecks
  • Communication Mastery: Excellent communication and interpersonal skills, with a demonstrated ability to convey complex technical concepts to both researchers and infrastructure engineers

We prefer:

  • Advanced ML Operations: Experience with ML observability, root-causing production pipelines, and automating large-scale offline inference or model training experiments
  • Large-Scale Data Management: Background in managing multi-petabyte scale datasets, data validation frameworks, or unified data management solutions
  • Reliability Frameworks: Familiarity with contract-based reliability models, SLO management for autonomous systems, or reliability triage ecosystems
  • Developer Platforms: Experience building or managing "golden path" developer platforms or developer tooling that simplifies complex, fragmented tech stacks
  • Advanced Degree: Master's degree or PhD in a related technical field
  • Autonomous Domain Knowledge: Experience with simulation environments for autonomous systems, model validation strategies, or onboard/offboard infrastructure dependencies

The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process. 

Waymo employees are also eligible to participate in Waymo’s discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements. 

Salary Range
$230,000$292,000 USD

Top Skills

Data Management
Infrastructure Management
Machine Learning
Software Engineering
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Mountain View, CA
2,359 Employees
Year Founded: 2009

What We Do

Waymo is an autonomous driving technology company with a mission to make it safe and easy for people and things to move around. With the Waymo Driver, we can improve the world’s mobility while saving thousands of lives.
Waymo reaches out to candidates from official channels only (e.g. directly from @waymo.com email addresses, or through our recruiters or sourcers who are noted as such on LinkedIn). We do not contact candidates about career opportunities through instant messaging apps like Telegram, email addresses from domains other than waymo.com (such as Gmail addresses), direct messages on Twitter, Facebook, and Instagram, or text messages. Visit waymo.com to check out our official job listings.

Similar Jobs

Airwallex Logo Airwallex

Senior Software Engineer

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
In-Office or Remote
San Francisco, CA, USA
2000 Employees

Airwallex Logo Airwallex

Analyst, Transaction Monitoring

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
Remote or Hybrid
San Francisco, CA, USA
2000 Employees

General Motors Logo General Motors

Manager - Human Interface Design, Commercial

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Hybrid
2 Locations
165000 Employees
173K-266K Annually

General Motors Logo General Motors

Senior Software Engineer

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Hybrid
3 Locations
165000 Employees

Similar Companies Hiring

Cox Enterprises Thumbnail
Software • Other • Information Technology • Greentech • Cybersecurity • Cloud • Automotive
Atlanta, GA
50000 Employees
UL Solutions Thumbnail
Software • Renewable Energy • Professional Services • Energy • Consulting • Chemical • Automotive
Chicago, IL
15000 Employees
HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account