Research Engineer / Scientist – Reinforcement Learning (RL)

Posted 15 Days Ago
Be an Early Applicant
2 Locations
In-Office
Senior level
Artificial Intelligence • Software
Ethical AI for Socially Responsible Retail (Techstars '20)
The Role
As a Research Engineer/Scientist in Reinforcement Learning, you will develop RL methods, maintain experimental infrastructure, and collaborate to enhance AI solutions in critical sectors.
Summary Generated by Built In
Who we are

Percepta’s mission is to transform critical institutions with applied AI. We care that industries that power the world (e.g. healthcare, manufacturing, energy) benefit from frontier technology. To make that happen, we embed with industry-leading customers to drive AI transformation. We bring together:

  • Forward-deployed expertise in engineering, product, and research

  • Mosaic, our in-house toolkit for rapidly deploying agentic workflows

  • Strategic partnerships with Anthropic, McKinsey, AWS, companies within the General Catalyst portfolio, and more

Our team is a quickly growing group of Applied AI Engineers, Embedded Product Managers and Researchers motivated by diffusing the promise of AI into improvements we can feel in our day to day lives. Percepta is a direct partnership with General Catalyst, a global transformation and investment company.

About the role

As a Research Engineer/Scientist (Reinforcement Learning) at Percepta, you will work at the intersection of RL research and real-world deployment. You will advance the frontier of capabilities through research on decision-making for critical industries. You will collaborate closely with our Embedded Product Managers (EPMs) and engineers to ensure that our solutions transform how companies operate.

Role and responsibilities
  • Identifying which real-world challenges are tractable for RL-guided decision making.

  • Develop RL methods to perform complex tasks in domains like planning, decision-making, or optimization.

  • Develop and maintain the experimental infrastructure that powers our research, from simulation environments and data pipelines to training and evaluation frameworks.

  • Conduct in-the-wild evaluations at scale that drive millions of dollars in value.

  • Partner with our applied AI engineers to transition successful research ideas into robust features of our Mosaic platform.

  • Communicate research outcomes to both technical and non-technical stakeholders, making sure everyone understands the “so what” of research and how to apply it.

Indicators of a good fit
  • Have an MS/PhD in Computer Science, ML, or related field, or equivalent experience.

  • Have a track record of effective RL work.

  • Are motivated by impact in critical industries including healthcare, supply chains, energy, and finance.

  • Understand how to perform rigorous RL experimentation.

  • Enjoy extreme ownership.

  • Believe that AI can drive transformative change in critical industries.

The following list can be a sign that you might be a good technical fit:

  • High performance, large scale distributed systems.

  • Large scale LLM training or RL training.

  • Possess strong programming skills, especially in Python.

  • Implementing LLM post-training algorithms.

  • Experience with vLLM/SGLang, Ray, Kubernetes (or AWS EKS).

  • Experience with distributed checkpointing, multi-node, multi-gpu training, custom KV-caching.

  • Experience with asynchronous training and inference, either with VeRL, ROLL, SkyRL, AReal, or with RL libraries like CleanRL.

We're working against an incredibly ambitious mission. It won't be easy but it will likely be the most fulfilling work of your career. If that excites you, let's chat, even if you don't meet all of the qualifications above.

Top Skills

Aws Eks
Kubernetes
Python
Ray
Sglang
Vllm
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Philadelphia, Pennsylvania
3 Employees
Year Founded: 2019

What We Do

Percepta develops real-time, AI-based video analytics technology that detects
shoplifting in retail stores while preserving shopper privacy and mitigating racial and gender biases.

Similar Jobs

Liberty Mutual Insurance Logo Liberty Mutual Insurance

Solutions Architect

Artificial Intelligence • Fintech • Insurance • Marketing Tech • Software • Analytics
Hybrid
Boston, MA, USA
40000 Employees
134K-254K Annually

Liberty Mutual Insurance Logo Liberty Mutual Insurance

Solutions Engineer

Artificial Intelligence • Fintech • Insurance • Marketing Tech • Software • Analytics
Hybrid
Boston, MA, USA
40000 Employees
134K-254K Annually

CarGurus Logo CarGurus

Engineering Manager

Consumer Web • eCommerce • Software
Hybrid
Boston, MA, USA
1121 Employees
186K-233K Annually

CarGurus Logo CarGurus

Service Desk Analyst

Consumer Web • eCommerce • Software
Hybrid
Boston, MA, USA
1121 Employees
71K-90K Annually

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account