Research Scientist / Engineer - Post-training & Robot Learning

Posted Yesterday
Mountain View, CA, USA
In-Office
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics
The Role
Develop and implement post-training pipelines that adapt web-pretrained causal video models to real robot tasks. Build RL training pipelines, reward models, inverse dynamics for action execution, and evaluation frameworks. Iterate between simulation and hardware, diagnose failures, and improve generalization with few-shot adaptation and preference alignment on robot-collected demonstration data.
Summary Generated by Built In

At Rhoda AI, we’re building the next generation of generalist intelligent robots. We own the full robotics stack, from high-performance hardware and robot systems to the infrastructure and state-of-the-art foundation world models that control our robots. Our robots are designed to be generalists capable of operating in complex, real-world environments and handling long-tail edge cases, made possible by our cutting-edge research and end-to-end system design. We’ve raised over $400M and are investing aggressively in model research, infrastructure, hardware development, and manufacturing scale-up to make generalist robotics a reality.

We're looking for Research Scientists and Research Engineers with deep robotics or autonomous systems domain knowledge to adapt our web-pretrained video model to real robot tasks. Post-training at Rhoda means taking a causal video generation model pretrained on internet-scale data and fine-tuning it on robot-collected demonstrations to produce reliable, generalizable behavior — with as little task-specific data as possible. We hire across levels — from senior to staff.

What You'll Do

  • Design and implement RL training pipelines to improve robot policy performance beyond what imitation learning alone achieves — reward design, online data collection, and policy optimization

  • Develop and apply RL algorithms (PPO, GRPO, or similar) adapted to the video prediction setting, including reward modeling and feedback collection strategies for physical task performance

  • Design and implement broader post-training pipelines: supervised fine-tuning, preference optimization, and behavioral alignment on robot-collected demonstration data

  • Work on the inverse dynamics model that translates video predictions into executable robot actions

  • Build evaluation frameworks for post-trained policies: task success, generalization to novel objects and environments, and failure mode analysis on real hardware

  • Research methods to efficiently adapt models to new tasks with minimal demonstration data, including in-context generalization and few-shot adaptation

  • Identify failure modes and systematic weaknesses in deployed robot policies and drive targeted improvements

  • Iterate quickly between simulation and real robot evaluation to close the feedback loop

  • Collaborate with the pre-training team to surface what capabilities are missing from the base model and need to be addressed upstream

What We're Looking For

  • Hands-on experience with robot systems, robotic policy learning, or autonomous systems in an industry or research setting (robotics, self-driving, or similar physical AI domains)

  • Strong understanding of robot policy learning: imitation learning, behavior cloning, and how RL builds on top of it

  • Practical familiarity with real robot hardware, deployment constraints, and sensor modalities (vision, proprioception)

  • Solid ML skills with hands-on PyTorch experience

  • Ability to diagnose policy failures, reason about distribution shift, and iterate effectively on data and training strategies

  • Comfort with ambiguity and fast-changing research priorities

  • Staff-level candidates are expected to define technical direction and drive research strategy independently; senior candidates execute complex projects with strong fundamentals and growing scope

Nice to Have (But Not Required)

  • Hands-on experience with reinforcement learning — reward design, policy optimization, and online RL training loops — applied to real or near-real environments (robotics, games, simulated physics, or similar); this is a significant plus

  • Prior industry experience in robotics, autonomous driving, or physical AI (e.g., manipulation, mobile robotics, self-driving stacks)

  • Experience with teleoperation systems or robot demonstration collection at scale

  • Familiarity with robot middleware (ROS/ROS2) and real-time control systems

  • Experience with simulation environments for robotics (MuJoCo, Isaac Sim, Genesis)

  • Understanding of video generation models and how they connect to action prediction

  • PhD in Robotics, ML, or a related field

  • Publication record at ICRA, CoRL, RSS, NeurIPS, or related venues

Why This Role

  • Your work is what makes our robots actually perform tasks reliably in the real world — the direct connection between pre-trained capability and deployed behavior

  • Work at a rare intersection: state-of-the-art video generation models applied to real robot hardware, not simulation

  • Fast feedback loop between model changes and real robot performance

  • High ownership on a small team where robotics domain expertise is core to the mission


The Company
73 Employees
Year Founded: 2024

What We Do

Rhoda AI builds robot foundation models that learn from internet-scale video to enable manipulation-capable robots to generalize in real-world industrial environments. Using a Direct Video Action architecture and its FutureVision intelligence layer, Rhoda focuses on turnkey deployments in manufacturing, logistics, and e-commerce—aiming to move robots out of controlled labs and into reliable, adaptive production settings.
