ML System Performance Tech Lead, Machine Learning Platform
Waymo is an autonomous driving technology company with a mission to make it safe and easy for people and things to get where they’re going. Since our start as the Google Self-Driving Car Project in 2009, Waymo has been focused on building the World’s Most Experienced Driver in hopes of improving the world's access to mobility while saving thousands of lives now lost to traffic crashes. Our Waymo Driver powers Waymo One, our fully autonomous ride-hailing service, as well as Waymo Via, our trucking and local delivery service. To date, Waymo has driven over 20 million miles autonomously on public roads across 25 U.S. cities and conducted over 20 billion miles of simulation testing.
Software Engineering builds the brains of Waymo's fully autonomous driving technology. Our software allows the Waymo Driver to perceive the world around it, make the right decision for every situation, and deliver people safely to their destinations. We think deeply and solve complex technical challenges in areas like robotics, perception, decision-making and deep learning, while collaborating with hardware and systems engineers. If you’re a software engineer or researcher who’s curious and passionate about Level 4 autonomous driving, we'd like to meet you.
The ML Platform team at Waymo provides a set of tools and technologies to support and automate the lifecycle of the machine learning workflow, including feature and experiment management, model development, debugging & evaluation, deployment, and monitoring. These efforts have resulted in making machine learning more accessible to teams at Waymo, including Perception, Behavior Prediction, Planner, Routing, Maps and Research, ensuring greater degrees of consistency and repeatability, and addressing the “last mile” of getting models into production and managing them once they are in place. We work hand in hand with machine learning experts in all parts of the company as well as our collaborators across Alphabet.
We are looking for a technical lead (TL) with a strong background in system performance analysis and optimization to help us improve the complex ML inference workloads on the car. Non-exhaustive examples of our work include:
- Developing performance/benchmark/profiling tools to help analyze the performance issues on the car.
- Identifying performance bottlenecks for the ML workloads onboard and in simulation.
- Proposing and implementing solutions to optimize the ML inference workloads
At a minimum, we’d like you to have:
- B.Sc in Computer Science, EE or a related field
- Strong C++ programming skills
- Strong problem solving
- Performance analysis and optimization with 5+ years of experience
- Experience with multi-core systems and multi-thread interactions
- Experience with CPU, GPU, or computer system microarchitecture
It’s preferred if you have:
- Python programming skills
- Embedded / real time / robotics systems
- Strong experience working with ML workload on edge device