Machine Learning Engineer: Deep Reinforcement Learning

Sorry, this job was removed at 06:14 p.m. (CST) on Monday, Aug 04, 2025
Hiring Remotely in United States
Remote
Software
The Role

What We Do:

Primordial Labs is building Anura, the next-generation human-machine interface. Anura allows operators to command, control and collaborate with uncrewed systems through natural language, just like they would with a human teammate. Our job is to make it easier for warfighters to wield the power of autonomous systems on the battlefield.

Job Description:

(Please Note: This is a 100% remote position that requires residence in the United States.)
We're looking for machine learning engineers with a background in reinforcement learning. Candidates with expertise in any of the following areas are highly desirable:

  • Closing the sim2real gap
  • Sparse rewards and credit assignment over long time horizons
  • Offline learning
  • Options learning and other hierarchical approaches
  • Unsupervised exploration techniques
  • Gym development (especially around complex DoD wargaming and platform simulation systems!)

As a company co-founded and led by engineers, we are focused on developer experience. We promise to minimize distractions (read: meetings) and to provide tools which maximize productivity. We are also committed to providing competitive total compensation, including profit sharing.

What You'll Do:

We are a small company, so you can expect to be an integral part of all phases of product development, including lots and lots of flight testing. Here are a few examples of how you’ll contribute to the development of Anura:

  • Develop policies for tasks ranging from low-level aircraft control to high-level task assignment.
  • Transition policies to a number of physical unmanned vehicles and evaluate performance in the field.
  • Incorporate new aircraft/sensor models and simulation environments into our multi-fidelity hierarchical M&S framework.

About You:

There are no educational requirements. We're more interested in projects you've worked on, both on the job and on the side, so please be prepared to discuss those! The ideal candidate will have excellent Python skills and be adept with at least one ML framework (PyTorch, TensorFlow, etc.).

Application Process:

We have a two-stage interview process.

The first stage is a screener that combines technical discussion of relevant engineering concepts with a conversation about your past experience. This screener generally takes around 30 minutes.

After the screener, if both sides believe it makes sense, we will provide you with an 'offline' coding test. This test is done on your own time, at your own pace, so you do not feel the pressure of coding live with us watching. We want you to do your best with a clear mind and free of interview jitters. The questions cover a combination of algorithms and system design. Once you complete the test, we'll all hop back on another call and you will walk us through your code. We are not just looking for the right answers; we're interested in your thought processes and technical approach.

We are committed to a low-stress interview experience and promise there will be no trick questions or brain teasers!

The Company
HQ: New Haven, Connecticut
15 Employees
Year Founded: 2021

What We Do

Primordial Labs is building the next-generation human-machine teaming interface, Anura, to empower warfighters to collaborate with uncrewed systems (UxS) using natural language.

With Anura, operators become active collaborators, not passive observers, in the human-machine team. Anura allows operators to verbally task robotic teammates and act as a “quarterback” while remaining heads-up and focused on the mission objective.

Primordial Labs was founded in 2021 by Lee Ritholtz and Adrian Pope. The company is headquartered in New Haven, Connecticut.
