Principal AI Research Engineer - RL

Reposted 13 Days Ago
2 Locations
In-Office
Senior level
Robotics
The Role
The role involves developing robust robot policies with reinforcement learning algorithms, improving their sample efficiency, and deploying them on real-world robot hardware.

Company Overview

Reflex Robotics is building affordable ($10k) wheeled humanoid robots to automate dangerous and repetitive tasks in manufacturing and logistics.

We envision a future where intelligent robots do all kinds of boring work that people hate doing: loading chicken nuggets into Costco boxes, lifting forty-pound bags of dog food at Petco stores, and cleaning up cranberry juice spills in your apartment.

We are a three-year-old startup backed by Khosla Ventures, with $60M/year of revenue lined up pending successful pilots with e-commerce warehouses in 2025.


How Does It Work?

Our robots are designed and built entirely in-house by an engineering team that led development of the Stretch robot at Boston Dynamics and key systems on the Tesla Model S, X, and Y production lines. Reflex robots are high-performance, low-inertia, and optimized for low-cost manufacturing.

We’ve built the best real-time teleoperation system in the world, allowing a remote operator in South America to “play a video game” to control our robots at human-level speeds. This has already allowed us to ship robots with positive unit economics, and it enables us to create a powerful human-intervention + RL product feedback loop.

Our system allows us to collect high-quality demonstrations at scale—giving us the proprietary data engine needed to train increasingly capable AI systems. We're on track to build the largest robotics dataset in the world, which will serve as an important long-term advantage.


Key Company Beliefs

  • High-quality, proprietary robotics data is the next foundation for generational AI companies (like Tesla FSD and ChatGPT).

  • Being nerd-sniped by maximizing an engineering metric is way less important than solving our customers’ biggest pain points.

  • An insane work ethic is required for outsized success—and you'll be rewarded for it.


What We’re Looking For

We’re looking for stellar on-policy RL engineers to work on creating robust robot policies.

We’re still a small team—which means high ownership, high equity, and the chance to shape the product from the ground up.

VLAs and other great “base policies” for robotics achieve ~80% success rates, but in real robot deployments, it’s essential to achieve 99.99% success rates. We can’t ask our customers to tolerate our robots packing three socks into a bin instead of four, or swapping shipping labels between two packages, not even once!

You should apply for this role if:

  • You’ve re-implemented core RL algorithms (SAC, DDPG) from scratch and can debug unstable gradients / tune hyperparameters correctly

  • You’ve made meaningful intellectual contributions to sample-efficient RL algorithms (e.g., DreamerV3 and MuZero)

  • You’ve shipped on-policy RL on hardware that learns in the real world (e.g., for quadruped walking or drone racing)

You’d be joining a company that already has a solid core business, with working hardware, delighted customers, and profitable unit economics. Reflex is de-risked enough to see the hazy outlines of success, but small enough that enormous upside is still up for grabs.


Come Join Us

This is a rare opportunity to help build a flagship robotics company from the ground up—and to do work that will truly matter, reshaping what people believe is possible in robotics.

We love to see the things you’ve worked on. Have a portfolio or an insane project? Share it. We’re looking for people who push past the status quo, who are passionate at work and in their own time, and who want to win.

Skills Required

  • Re-implemented core RL algorithms from scratch
  • Debugging unstable gradients and tuning hyperparameters
  • Meaningful contributions to sample-efficient RL algorithms
  • Experience with on-policy RL on real-world hardware

The Company
HQ: Brooklyn, New York
19 Employees
Year Founded: 2022

What We Do

We're building affordable general-purpose robots to free humanity from the drudgery of boring and repetitive tasks.
