Principal Software Engineer, ML System Architect

Reposted 19 Days Ago
Mountain View, CA, USA
In-Office
349K-431K Annually
Senior level
Automotive
The Role
Seeking a Principal Software Engineer for architecting ML systems, unifying codebases, and leading cross-team architecture for Waymo's foundation models.
Summary Generated by Built In

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo’s fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.

Waymo’s Systems Intelligence and ML team works with Research and Production teams to develop and deploy models that are core to our autonomous driving software. Waymo's AI is at the heart of this mission, and we are increasingly leveraging large-scale Foundation Models to unlock new capabilities for the Waymo Driver. Join Waymo to architect and build a unified, large-scale AI platform leveraging Google DeepMind's latest foundation models (like Gemini) for comprehensive world understanding and generation, to accelerate the development and distillation of models powering the world's most experienced driver.

In this hybrid role, you will report to our Director of Engineering who leads Systems Intelligence and Machine Learning.
We are seeking a deeply experienced Principal Software Engineer to provide the overarching technical vision, architectural design, and cross-team leadership needed to make Waymo’s next-generation foundation model systems a success. This role is pivotal in transforming Waymo's offboard ML landscape from a fragmented set of models and tools into a cohesive, efficient, and powerful platform centered around a unified foundation model recipe and deeply integrated with Google DeepMind's latest Gemini innovations. You will be the technical authority defining how Waymo builds, trains, and utilizes these large models offboard to ultimately accelerate onboard deployment and improvements.


You will:

  • Architect ML Systems: Define and drive the technical roadmap for the platform, encompassing codebase unification, data pipelines, model architecture, training recipes, and evaluation frameworks.
  • Codebase Consolidation & Best Practices: Lead the unification of the existing forked copies of foundation model component codebases into a single production-hardened, shared repository. Establish and enforce rigorous coding standards, testing practices, and API designs to ensure long-term codebase health and developer velocity.
  • Google DeepMind Integration & API Definition: Serve as the primary technical interface between Waymo's offboard model development and Google DeepMind's core model and framework teams. Define clear APIs and integration patterns, ensuring Waymo can seamlessly leverage and contribute to Google DeepMind's advancements while maintaining stability and control.
  • Unify Core Components: Drive the consolidation of tokenization/de-tokenization strategies, data formats, input pipelines, and evaluation methodologies across all offboard Foundation Model use cases.
  • Scalable Training & Distillation: Architect for efficient large-scale distributed training and establish a common, efficient distillation setup to transfer knowledge from large teacher models to onboard student models.
  • Technical Leadership & Influence: Provide technical mentorship, guidance, and direction to engineers across multiple teams within SIML and AI Foundations. Drive alignment on technical decisions with senior stakeholders across Waymo and Google DeepMind.
  • Drive Efficiency: Instill a culture of efficiency in model development, training, and resource utilization, aiming for high ML Productivity.


You have:

  • Master's degree or PhD in Computer Science or a related field.
  • 12+ years of experience in software engineering, with at least 8 years focused on large-scale machine learning systems, deep learning frameworks, and AI infrastructure.
  • A track record of architecting and delivering complex, high-impact ML platforms or models.
  • Deep expertise in Python, C++, and ML frameworks like JAX and TensorFlow.
  • Extensive experience with large-scale distributed training on TPUs/GPUs and associated challenges.
  • Demonstrated ability to design robust, scalable, and maintainable software architectures and APIs.
  • Understanding of data pipelines, storage systems, and tokenization techniques.
  • Experience working effectively with research and product teams, and influencing across organizational boundaries.
  • Technical leadership skills, with the ability to drive strategy, influence across teams, and mentor other engineers.
  • Communication skills, with the ability to articulate a complex technical vision clearly and drive alignment.


We prefer:

  • Experience with multimodal and generative models.
  • Experience in autonomous vehicle systems or robotics.
  • Contributions to open-source ML frameworks or widely used internal tools.
  • Experience with simulation systems.

The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. During the hiring process, your recruiter can share more about the specific salary range for the role location or, if the role can be performed remotely, for your preferred location.

Waymo employees are also eligible to participate in Waymo’s discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements. 

Salary Range
$349,000-$431,000 USD

Skills Required

  • Master's degree or PhD in Computer Science or a related field
  • 12+ years of experience in software engineering
  • 8+ years focused on large-scale machine learning systems
  • Track record of architecting and delivering complex ML platforms
  • Deep expertise in Python, C++, and ML frameworks
  • Experience with large-scale distributed training on TPUs/GPUs
  • Ability to design robust, scalable software architectures
  • Technical leadership skills

The Company
Mountain View, CA
2,359 Employees
Year Founded: 2009

What We Do

Waymo is an autonomous driving technology company with a mission to make it safe and easy for people and things to move around. With the Waymo Driver, we can improve the world’s mobility while saving thousands of lives. Waymo reaches out to candidates from official channels only (e.g. directly from @waymo.com email addresses, or through our recruiters or sourcers who are noted as such on LinkedIn). We do not contact candidates about career opportunities through instant messaging apps like Telegram, email addresses from domains other than waymo.com (such as Gmail addresses), direct messages on Twitter, Facebook, and Instagram, or text messages. Visit waymo.com to check out our official job listings.

