ML/RL Engineer, Behavior Planning

Posted Yesterday
8 Locations
In-Office or Remote
Senior level
Logistics • Transportation
Transforming American Transportation
The Role
Develop and train conditioned policies and MARL systems to simulate realistic driving behaviors, implement safety-constrained RL algorithms, design rewards and evaluation metrics, optimize large-scale training pipelines, advance neural architectures for long-horizon planning and spatial reasoning, and integrate research models with production simulation and planning teams.
Company Introduction

At Bot Auto, we are revolutionizing the transportation of goods with our cutting-edge autonomous trucks, enhancing the quality of life for communities around the globe. With the agility of a startup and the wisdom of seasoned experts, our team has achieved numerous world-firsts and unparalleled innovations. United by a shared vision, we create groundbreaking solutions that propel the future of transportation. Join us and transform your ideas into reality.

Role Overview

We are seeking an ML/RL Engineer to join our Algo team and drive the development of our unified behavioral architecture. In this role, you will help bridge the gap between simulation and the real world by developing a scalable policy framework that represents both our L4 ego-policy and a diverse population of simulated agents. You will work at the intersection of Multi-Agent Reinforcement Learning (MARL) and safety-critical system design to ensure our autonomous semi-trucks navigate highways with superhuman safety and precision.

Key Responsibilities
  • Behavioral Modeling: Develop and train diverse, conditioned policies that simulate realistic driving behaviors to stress-test and validate our autonomous driving stack.
  • Safety-Constrained Learning: Lead the research and implementation of advanced RL algorithms that treat safety metrics as primary constraints in the learning process.
  • Reward & Objective Design: Collaborate with cross-functional teams to design robust reward functions and evaluation metrics that balance safety, progress, and comfort.
  • Scalable Training Pipelines: Contribute to the optimization of our large-scale, high-throughput training environments to enable rapid iteration on complex multi-agent scenarios.
  • Model Architecture: Advance our state-of-the-art neural architectures to improve spatial reasoning, long-horizon planning, and interaction modeling.
  • Cross-Team Collaboration: Work closely with Simulation and Planning teams to integrate research-grade models into production-quality, safety-critical software.

Required Qualifications
  • Professional RL Experience: Proven track record of training and deploying deep RL algorithms (e.g., PPO, SAC) for complex, real-world robotic or autonomous systems.
  • Technical Mastery: Expertise in Python and PyTorch; strong understanding of modern deep learning architectures and optimization techniques.
  • Academic Background: MS or PhD in Computer Science, Robotics, or a related quantitative field.
  • Scientific Intuition: Ability to diagnose and solve fundamental challenges in RL training, such as variance management and distribution shift.

Preferred Qualifications
  • Safe RL Specialization: Experience with constrained optimization or safety-critical learning frameworks.
  • Multi-Agent Systems: Background in MARL training stability, including self-play and decentralized execution strategies.
  • Autonomous Driving Domain: Familiarity with vehicle dynamics and behavior planning, particularly for long-haul highway environments.

Additional Information
  • Compensation: Competitive salary based on experience, with opportunities for performance bonuses and equity.
  • Benefits: Comprehensive health insurance, paid time off, and the opportunity to work at the forefront of the autonomous trucking industry.

Skills Required

  • Proven experience training and deploying deep reinforcement learning algorithms (e.g., PPO, SAC) for real-world robotic or autonomous systems
  • Expertise in Python
  • Expertise in PyTorch
  • MS or PhD in Computer Science, Robotics, or a related quantitative field
  • Ability to diagnose and solve RL training challenges such as variance management and distribution shift
  • Experience with constrained optimization or safety-critical learning frameworks
  • Background in multi-agent systems, MARL training stability, self-play, and decentralized execution
  • Familiarity with autonomous driving, vehicle dynamics, and behavior planning for long-haul highway environments

The Company
HQ: Houston, TX
76 Employees
Year Founded: 2023

What We Do

We are an L4 autonomous trucking company based in Houston, Texas. We operate our own autonomous truck fleet and offer Transportation as a Service (TaaS) to our freight customers. Our team combines visionary leadership, top-tier science and engineering talent, financial and governance experts, and industry veterans to build an AI-driven autonomous trucking company. We focus on fleet operations, transforming autonomous trucking into a commercially profitable product. This blend of experience, industry maturity, and innovative technology positions us to commercialize autonomous trucking at scale.
