Machine Learning Engineer - Reinforcement Learning

Posted 4 Days Ago
Fremont, CA, USA
In-Office
150K-250K Annually
Expert/Leader
Artificial Intelligence • Software • Transportation
The Role
The Machine Learning Engineer will design, implement, and optimize reinforcement learning models for autonomous driving systems, enhancing data systems for scalable evaluation and performance assessment.
Summary Generated by Built In

Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies and services at a rapidly expanding footprint of sites around the world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony.ai is an industry leader in the commercialization of autonomous driving and is committed to developing the safest autonomous driving capabilities on a global scale. Pony.ai’s leading position has been recognized, with CNBC ranking Pony.ai #10 on its CNBC Disruptor list of the 50 most innovative and disruptive tech companies of 2022. In June 2023, Pony.ai was recognized on the XPRIZE and Bessemer Venture Partners inaugural “XB100” 2023 list of the world’s top 100 private deep tech companies, ranking #12 globally. As of August 2023, Pony.ai has accumulated nearly 21 million miles of autonomous driving globally. Pony.ai went public at NASDAQ in November 2024.

Responsibility
  • Build scalable systems for training and fine-tuning large generative models that produce realistic, informative driving behaviors for evaluation and scenario coverage.
  • Implement and iterate on RL-style methods: algorithms, reward / preference objectives, and training setups suited to high-fidelity, insightful behaviors in simulation-aligned workflows (closed-loop evaluation mindset).
  • Ship deep learning solutions (including LLM / VLM where appropriate) that improve human-led triaging, automate high-volume workflows, and support nuanced analysis of self-driving behavior to surface critical anomalies.
  • Own production-oriented ML for fleet-scale assessment: training, optimization, monitoring, and iteration of models used to judge performance across large real-world exposure.
  • Design and evolve data + evaluation systems inspired by RL from human preferences (RLHF) and related paradigms—turning preference/judgment signals into repeatable, scalable training and evaluation loops.
  • Partner broadly with teams such as Prediction, Planning, Research, and platform/engineering leads to land cross-cutting improvements with clear metrics.

Requirements
  • M.S. or Ph.D. in Computer Science, Machine Learning, AI, or a related field—or equivalent practical experience.
  • Hands-on experience building and applying ML in production-grade settings, with a strong RL component (policy learning, preference/feedback optimization, or offline/online RL pipelines).
  • Depth in deep learning, sequence modeling, and generative models.
  • Demonstrated impact via strong publications or a clear history of shipping impactful ML systems end-to-end.
  • Experience with large-scale distributed training and large-scale data processing.
  • Ability to lead ambiguous technical work from problem framing through reliable delivery.
Preferred
  • Background in autonomous vehicles, robotics, or complex simulation environments.
  • Strong grasp of modern RL and post-training techniques in LLM, dLLM, VLA and video generations.
  • Hands-on integration of simulation platforms with ML training and evaluation workflows.
  • Python fluency and frameworks such as PyTorch
  • Experience defining and operating metrics for complex, safety-critical AI systems.
  • Technical leadership: influencing stakeholders, aligning teams, and raising the bar for evaluation rigor.
  • Excellent communication—simple explanations of complex trade-offs.
Compensation and Benefits

Base Salary Range: $150,000 - $250,000 Annually

Compensation may vary outside of this range depending on many factors, including the candidate’s qualifications, skills, competencies, experience, and location. Base pay is one part of the Total Compensation and this role may be eligible for bonuses/incentives and restricted stock units.

Also, we provide the following benefits to the eligible employees:

  • Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (Traditional and Roth 401k)
  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off (Vacation & Public Holidays)
  • Family Leave (Maternity, Paternity)
  • Short Term & Long Term Disability
  • Free Food & Snacks

Please click here for our privacy disclosure.

Skills Required

  • M.S. or Ph.D. in Computer Science, Machine Learning, AI, or equivalent practical experience
  • Hands-on experience building and applying ML in production-grade settings with a strong RL component
  • Depth in deep learning, sequence modeling, and generative models
  • Demonstrated impact via strong publications or clear history of shipping impactful ML systems end-to-end
  • Experience with large-scale distributed training and large-scale data processing
  • Ability to lead ambiguous technical work from problem framing through reliable delivery

Pony.AI Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Pony.AI and has not been reviewed or approved by Pony.AI.

  • Healthcare Strength Healthcare coverage is characterized as comprehensive, including medical, dental, and vision, alongside disability/life insurance and an Employee Assistance Program. This makes the core insurance offering a consistent strength within the overall package.
  • Wellbeing & Lifestyle Benefits Daily meals and on-site food perks stand out as especially generous, with free meals, snacks, and drinks repeatedly highlighted. This creates a tangible day-to-day benefit that can materially reduce employees’ out-of-pocket costs during in-office work.
  • Equity Value & Accessibility Equity participation is presented as a standard part of the compensation mix, contributing to the perceived competitiveness of total packages in technical roles. At the same time, perceived value can vary with liquidity timing and broader market performance, shaping how accessible that value feels in practice.

Pony.AI Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Fremont, CA
512 Employees
Year Founded: 2016

What We Do

Pony AI Inc. (“Pony.ai”) is a global leader in the large-scale commercialization of autonomous mobility. Leveraging its vehicle-agnostic Virtual Driver technology, full-stack autonomous driving technology that seamlessly integrates its proprietary software, hardware, and services, Pony.ai is developing a commercially viable and sustainable business model that enables the mass production and deployment of vehicles across transportation use cases. Founded in 2016, Pony.ai has expanded its presence across China, Europe, East Asia, the Middle East, and other regions, ensuring widespread accessibility to its advanced technology. Pony.ai is among the first in China to obtain licenses to operate fully driverless vehicles in all four Tier-1 cities in China (Beijing, Guangzhou, Shanghai, Shenzhen) and has begun to offer public-facing, fare-charging robotaxi services without safety drivers in Beijing, Guangzhou and Shenzhen. Pony.ai operates a fleet consisting of over 250 robotaxis. To date, Pony.ai has driven nearly 45 million autonomous testing and operation kilometers on open roads worldwide.

Similar Jobs

Hybrid
San Francisco, CA, USA
56 Employees

Anthropic Logo Anthropic

Research Engineer, Machine Learning (Reinforcement Learning)

Artificial Intelligence • Natural Language Processing • Generative AI
In-Office
2 Locations
2500 Employees
500K-850K Annually

Skild AI Logo Skild AI

Machine Learning Engineer

Artificial Intelligence • Robotics • Business Intelligence
In-Office
San Mateo, CA, USA
24 Employees
100K-300K Annually

Similar Companies Hiring

Fairly Even Thumbnail
Hardware • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account