Research Intern – Reinforcement Learning (RL) - Onsite

Posted Yesterday
Be an Early Applicant
8 Locations
In-Office or Remote
Internship
Artificial Intelligence • Natural Language Processing • Software • Conversational AI
The Role
As a Research Intern, you'll build reinforcement learning environments and agents, define reward models using real-world data, and collaborate on deploying learning systems.
Summary Generated by Built In

🚀 Build the next generation of Agentic AI with us

Our platform combines conversation intelligence, multimodal understanding, and agentic AI systems to power both human agents and autonomous AI agents across the entire customer experience lifecycle.

A core part of this vision is our investment in custom Small Language Models (SLMs)—purpose-built for CX workflows—paired with reinforcement learning systems that continuously improve decision-making in real-world environments.

We’re looking for a Research Intern (Reinforcement Learning) to join us in shaping this future.

What you’ll do
 
  • Design and build reinforcement learning environments that model real-world customer interaction workflows.

  • Design RL agents that learn from these environments using real-world interaction data, rewards, and feedback loops

  • Define reward models and feedback loops using real-world signals (outcomes and human feedback)

  • Enable learning from production data by structuring interaction traces into training-ready datasets for offline and online learning

  • Experiment with multi-agent systems and simulation frameworks for complex coordination and decision-making

  • Collaborate with engineering and product teams to deploy, evaluate, and iterate on learning systems in production at scale.

 

What we’re looking for
  • Currently pursuing (or recently completed) a degree in Computer Science, AI, Machine Learning, or related field

  • Strong understanding of reinforcement learning fundamentals

  • Familiarity with RL environments and training libraries such as Verl and Tinker

  • Strong foundation in probability, math, and optimization

  • Passion for building real-world AI systems

Nice to have
  • Experience with RLHF, LLM/SLM fine-tuning, or model alignment

  • Exposure to agent-based systems or multi-agent RL

  • Prior research, projects, or publications in RL or applied ML

  • Experience working with large-scale or production datasets

 

Why Level AI
  • Work on production-grade Agentic AI systems used by leading enterprises

  • Build alongside a team with deep expertise from Amazon, Google, and Meta

  • Be part of a fast-growing Series C AI company.

  • Direct exposure to 0→1 AI innovation in CX and decisioning systems

Top Skills

Agent-Based Systems
Conversation Intelligence
Llm/Slm Fine-Tuning
Multimodal Understanding
Optimization
Probability
Reinforcement Learning
Rl Environments
Rl Training Libraries
Rlhf
Small Language Models
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Mountain View, CA
122 Employees
Year Founded: 2018

What We Do

Level AI (https://thelevel.ai) is a Mountain View, CA and Delhi, India based startup innovating in the Voice AI space. We are backed by top VCs, technologists from Silicon Valley and industry experts. We are on a mission for AI to augment the worker and not replace them. We are innovating in speech AI, NLP and information retrieval systems to bring customers and businesses closer to one another. The team has experience from Amazon Alexa, Google, and other leading AI organizations.

Similar Jobs

Gusto Logo Gusto

Staff Software Engineer

Fintech • HR Tech
Easy Apply
Remote or Hybrid
6 Locations
4405 Employees
190K-250K Annually

Samsara Logo Samsara

Senior Program Manager

Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
Canada
4000 Employees
111K-144K Annually

Xero Logo Xero

Principal Engineer

Cloud • Fintech • Information Technology • Machine Learning • Software
Remote or Hybrid
British Columbia, BC, CAN
4500 Employees
288K-338K Annually

Xero Logo Xero

Senior Engineer

Cloud • Fintech • Information Technology • Machine Learning • Software
Remote or Hybrid
British Columbia, BC, CAN
4500 Employees
198K-248K Annually

Similar Companies Hiring

Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account