Applied Research - RL & Agents

Reposted 5 Days Ago
Be an Early Applicant
2 Locations
In-Office or Remote
100K-150K
Senior level
Artificial Intelligence • Software
The Role
Develop and implement advanced reinforcement learning methods and distributed systems for AI agents. Collaborate with customers to understand needs and create tailored solutions.
Summary Generated by Built In

Building Open Superintelligence Infrastructure
Prime Intellect is building the open superintelligence stack - from frontier agentic models to the infra that enables anyone to create, train, and deploy them. We aggregate and orchestrate global compute into a single control plane and pair it with the full rl post-training stack: environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run end-to-end reinforcement learning at frontier scale, adapting models to real tools, workflows, and deployment contexts.

We recently raised $15mm in funding (total of $20mm raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.


Role Impact

This is a customer facing role at the intersection of cutting-edge RL/post-training methods and applied agent systems. You’ll have a direct impact on shaping how advanced models are aligned, deployed, and used in the real world by:

  • Advancing Agent Capabilities: Designing and iterating on next-generation AI agents that tackle real workloads—workflow automation, reasoning-intensive tasks, and decision-making at scale.

  • Building Robust Infrastructure: Developing the distributed systems and coordination frameworks that enable these agents to operate reliably, efficiently, and at massive scale.

  • Bridge Between Customers & Research: Translate customer needs into clear technical requirements that guide product and research priorities.

  • Prototype in the Field: Rapidly design and deploy agents, evals, and harnesses alongside customers to validate solutions.

Customer-Facing Engineering

  • Work side-by-side with customers to deeply understand workflows and bottlenecks.

  • Prototype agents and eval harnesses tailored to real use cases, then hand off hardened systems to core teams.

  • Translate customer insights into roadmap and research direction.

Post-training & Reinforcement Learning

  • Design and implement novel RL and post-training methods (RLHF, RLVR, GRPO, etc.) to align large models with domain-specific tasks.

  • Build evaluation harnesses and verifiers to measure reasoning, robustness, and agentic behavior in real-world workflows.

  • Prototype multi-agent and memory-augmented systems to expand capabilities for customer-facing solutions.

Agent Development & Infrastructure

  • Rapidly prototype and iterate on AI agents for automation, workflow orchestration, and decision-making.

  • Extend and integrate with agent frameworks to support evolving feature requests and performance requirements.

  • Architect and maintain distributed training/inference pipelines, ensuring scalability and cost efficiency.

  • Develop observability and monitoring (Prometheus, Grafana, tracing) to ensure reliability and performance in production deployments..

Requirements
  • Strong background in machine learning engineering, with experience in post-training, RL, or large-scale model alignment.

  • Deep expertise in distributed training/inference frameworks (e.g., vLLM, sglang, Ray, Accelerate).

  • Experience deploying containerized systems at scale (Docker, Kubernetes, Terraform).

  • Track record of research contributions (publications, open-source contributions, benchmarks) in ML/RL.

  • Passion for advancing the state-of-the-art in reasoning and building practical, agentic AI systems.

What We Offer
  • Competitive Compensation + equity incentives

  • Flexible Work (remote or San Francisco)

  • Visa Sponsorship & relocation support

  • Professional Development budget

  • Team Off-sites & conference attendance


Growth Opportunity

You’ll join a mission-driven team working at the frontier of open, superintelligence infra. In this role, you’ll have the opportunity to:

  • Shape the evolution of agent-driven solutions—from research breakthroughs to production systems used by real customers.

  • Collaborate with leading researchers, engineers, and partners pushing the boundaries of RL and post-training.

  • Grow with a fast-moving organization where your contributions directly influence both the technical direction and the broader AI ecosystem.

If you’re excited to move fast, build boldly, and help define how agentic AI is developed and deployed, we’d love to hear from you.

Ready to build the open superintelligence infrastructure of tomorrow?
Apply now to help us make powerful, open AGI accessible to everyone.

Top Skills

Accelerate
Distributed Training
Docker
Grafana
Grpo
Kubernetes
Machine Learning
Prometheus
Ray
Reinforcement Learning
Rlhf
Rlvr
Sglang
Terraform
Vllm
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
16 Employees

What We Do

Prime Intellect democratizes AI development at scale. Our platform makes it easy to find global compute resources and train state-of-the-art models through distributed training across clusters. Collectively own the resulting open AI innovations, from language models to scientific breakthroughs.

Similar Jobs

GitLab Logo GitLab

Senior Solutions Architect

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
United States
2500 Employees

Upstart Logo Upstart

Recruiting Manager, Engineering

Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
Easy Apply
Remote
2 Locations
1500 Employees
135K-186K Annually

Upstart Logo Upstart

Senior Software Engineer

Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
Easy Apply
Remote
2 Locations
1500 Employees
164K-226K Annually

Mission Cloud Logo Mission Cloud

Senior Project Manager

Artificial Intelligence • Cloud • Information Technology • Machine Learning • Consulting • Generative AI • Big Data Analytics
In-Office or Remote
Los Angeles, CA, USA
258 Employees

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account