Reinforcement Learning Engineer ($400k - $800k salary)

2 Locations
In-Office
400K-500K Annually
Expert/Leader
Blockchain • Information Technology • Software • Cryptocurrency
The Role
As a Reinforcement Learning Engineer, you will own a production trading system that deploys real capital, design reward functions, build evaluation and validation frameworks, and lead RL efforts to increase trading volume safely.
Who We Are

Baton Corporation is the development company that builds and operates the entire technology stack behind pump.fun, the largest memecoin launchpad in production today. The systems are low latency, high throughput, live under constant load, and break if you get them wrong.

What You’ll Do

As our Reinforcement Learning Engineer, you will own a production trading system that directly deploys real capital. This is not a research role - it’s about building learning systems that are robust, measurable, and safe under real-world constraints.

  • Own and ship an RL-driven trading agent using real capital to increase trading volume and user participation in a memecoin ecosystem

  • Design reward functions and policies aligned with product goals while enforcing strict downside risk constraints

  • Build evaluation and validation frameworks (simulation, offline analysis) to minimize reliance on live sequential testing

  • Safely transition an existing heuristic-based production system toward learning-based approaches

  • Take end-to-end ownership and technical leadership as the sole RL expert, from data and modeling through deployment, monitoring, and safeguards

Who You Are
  • You have previously put an autonomous learning system into production that directly controlled capital, pricing, traffic, or resources, and you can explain what broke and how you fixed it

  • You have personally designed and enforced hard risk limits (capital caps, loss bounds, circuit breakers) in a live system, not just talked about “risk-aware objectives”

  • You have built a policy evaluation loop from scratch (simulators, replay, counterfactuals, shadow deployments) before trusting a live rollout

  • You can make and defend uncomfortable tradeoffs (e.g. heuristic > RL, bandit > deep RL) based on empirical results instead of ideology

  • You have operated as the single owner of a complex ML system in a small team, with no safety net of research orgs, infra teams, or “ML platforms”

What It’s Like to Work Here
  • We work in person

  • Hours can be long and unconventional

  • The pace is intense

  • Expectations are high, and impact is immediate

  • Working at Baton is not for everyone

Why Join Us?
  • Unmatched ownership and autonomy

  • Exposure to systems operating at the edge of crypto scale

  • The ability to ship fast and see real-world impact immediately

If you’re motivated by responsibility, speed, and building products used by massive audiences, you’ll feel at home here.

Skills Required

  • Experience productionizing autonomous learning systems that control capital or resources
  • Experience designing and enforcing hard risk limits in live systems
  • Experience building policy evaluation loops, including simulators and shadow deployments
  • Ability to make tradeoffs based on empirical results
  • Experience operating complex ML systems independently

The Company
100 Employees
Year Founded: 2023

What We Do

Baton Corporation is the development company that builds and operates the technology stack behind pump.fun, the largest memecoin launchpad in production today. It develops and maintains blockchain platforms, including pump.fun, on behalf of third parties.
