Senior Machine Learning Engineer, Agentic AI

Posted 2 Days Ago
3 Locations
In-Office
209K-245K Annually
Senior level
Fintech • Cryptocurrency
Robinhood's mission is to democratize finance for all.
The Role
Design and lead development of agentic AI systems: build evaluation frameworks, select and optimize models, improve reliability and latency, investigate production failures, and partner with product and engineering to set launch criteria and quality standards while mentoring engineers.
Summary Generated by Built In
Join us in building the future of finance.

Our mission is to democratize finance for all. An estimated $124 trillion of assets will be inherited by younger generations in the next two decades. The largest transfer of wealth in human history. If you’re ready to be at the epicenter of this historic cultural and financial shift, keep reading.

About the team + role
We are building an elite team, applying frontier technologies to the world’s biggest financial problems. We’re looking for bold thinkers. Sharp problem-solvers. Builders who are wired to make an impact. Robinhood isn’t a place for complacency, it’s where ambitious people do the best work of their careers. We’re a high-performing, fast-moving team with ethics at the center of everything we do. Expectations are high, and so are the rewards.

The Agentic AI team builds agentic AI systems that power intelligent, reliable customer experiences across Robinhood products. The team focuses on reducing the time to ship agents with fine-tuned models and while doing so enables other teams to build, evaluate, and improve their own agents. You will contribute to a culture grounded in first-principles thinking, high performance, and strong focus on customer outcomes!

As a Senior Machine Learning Engineer (IC5), you will define and uphold the quality bar for agentic systems across the organization. You will design evaluation frameworks, guide model selection, and partner with product, data science, and engineering teams to ensure systems meet clear standards for correctness, safety, latency, and user satisfaction. Your work will shape how agentic systems are built, evaluated, and improved across Robinhood!

At Robinhood, we believe in the power of in-person work to accelerate progress, spark innovation, and strengthen community. Our office experience is intentional, energizing, and designed to fully support high-performing teams. This role is based in our Bellevue, WA, New York, NY, or Menlo Park, CA office, with in-person attendance expected at least 3 days per week.

What you'll do

  • Lead the design and evolution of agentic AI systems that power intelligent customer experiences across Robinhood.
  • Define the technical direction for evaluating autonomous agents, including reasoning quality, planning, tool selection, memory, task completion, safety, latency, and overall user experience.
  • Design and build scalable evaluation frameworks for agentic systems using automated evals, benchmark datasets, LLM-as-a-Judge techniques, and human feedback to continuously improve agent performance.
  • Drive model selection and optimization across frontier foundation models, fine-tuned models, retrieval systems, and tool-using agents, balancing quality, latency, cost, and reliability.
  • Partner closely with Product, Data Science, and Engineering to establish launch criteria, quality standards, and measurable success metrics for production agentic systems.
  • Improve agent reliability by investigating production failures, identifying root causes across reasoning, planning, retrieval, and tool execution, and driving architectural improvements.
  • Mentor engineers and influence technical direction across teams while helping establish best practices for building reliable, production-ready agentic AI systems.

What you bring

  • Significant experience building and deploying production AI systems powered by large language models, autonomous agents, or multi-step reasoning workflows.
  • Deep understanding of modern agent architectures, including tool calling, planning, memory, retrieval-augmented generation (RAG), orchestration, and multi-agent systems.
  • Experience designing evaluation frameworks for agentic AI, including automated evals, benchmark datasets, LLM-as-a-Judge methodologies, human evaluation pipelines, and continuous quality measurement.
  • Strong understanding of the tradeoffs between prompting, fine-tuning, retrieval, and agent orchestration, and when to apply each approach.
  • Experience evaluating frontier foundation models across quality, latency, safety, cost, robustness, and production readiness.
  • Proven ability to debug complex agent behaviors, identify failure modes, and improve reasoning, reliability, and overall system performance.
  • Strong software engineering skills with experience building scalable distributed systems and production ML infrastructure.
  • Demonstrated technical leadership through architecture design, mentorship, and influencing engineering direction across multiple teams.
  • Experience with agent frameworks, AI observability platforms, model evaluation tooling, or regulated AI applications is a strong plus.

What we offer

  • Challenging, high-impact work to grow your career
  • Performance driven compensation with multipliers for outsized impact, bonus programs, equity ownership, and 401(k) matching
  • Best in class benefits to fuel your work, including 100% paid health insurance for employees with 90% coverage for dependents
  • Lifestyle wallet - a highly flexible benefits spending account for wellness, learning, and more
  • Employer-paid life & disability insurance, fertility benefits, and mental health benefits
  • Time off to recharge including company holidays, paid time off, sick time, parental leave, and more!
  • Exceptional office experience with catered meals, events, and comfortable workspaces

In addition to the base pay range listed below, this role is also eligible for bonus opportunities + equity + benefits.

Base pay for the successful applicant will depend on a variety of job-related factors, which may include education, training, experience, location, business needs, or market demands. The expected base pay range for this role is based on the location where the work will be performed and is aligned to one of 3 compensation zones. For other locations not listed, compensation can be discussed with your recruiter during the interview process.

Base Pay Range:

Zone 1 (Menlo Park, CA; New York, NY; Bellevue, WA; Washington, DC)
$209,000$245,000 USD
Zone 2 (Denver, CO; Westlake, TX; Chicago, IL)
$184,000$216,000 USD
Zone 3 (Lake Mary, FL; Clearwater, FL; Gainesville, FL)
$163,000$191,000 USD

Click here to learn more about our Total Rewards, which vary by region and entity.

If our mission energizes you and you’re ready to build the future of finance, we look forward to seeing your application.

Robinhood provides equal opportunity for all applicants, offers reasonable accommodations upon request, and complies with applicable equal employment and privacy laws. Inclusion is built into how we hire and work—welcoming different backgrounds, perspectives, and experiences so everyone can do their best. Please review the Privacy Policy for your country of application.

Skills Required

  • Significant experience building and deploying production AI systems powered by large language models, autonomous agents, or multi-step reasoning workflows.
  • Deep understanding of agent architectures including tool calling, planning, memory, retrieval-augmented generation (RAG), orchestration, and multi-agent systems.
  • Experience designing evaluation frameworks for agentic AI: automated evals, benchmark datasets, LLM-as-a-Judge, human evaluation pipelines, continuous quality measurement.
  • Strong understanding of tradeoffs between prompting, fine-tuning, retrieval, and agent orchestration.
  • Experience evaluating foundation models across quality, latency, safety, cost, robustness, and production readiness.
  • Proven ability to debug complex agent behaviors, identify failure modes, and improve reasoning and system reliability.
  • Strong software engineering skills with experience building scalable distributed systems and production ML infrastructure.
  • Demonstrated technical leadership through architecture design, mentorship, and influencing engineering direction across teams.
  • Experience with agent frameworks, AI observability platforms, model evaluation tooling, or regulated AI applications.

Robinhood Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Robinhood and has not been reviewed or approved by Robinhood.

  • Healthcare Strength Company materials emphasize best-in-class medical, dental, and vision coverage alongside mental-health support and life/disability insurance. Feedback suggests employer-paid coverage levels are strong for employees and solid for dependents.
  • Parental & Family Support Paid parental leave and fertility benefits are consistently highlighted, with leave lengths described as generous for the industry. Feedback suggests these family-forming supports compare favorably to many fintech peers.
  • Wellbeing & Lifestyle Benefits A flexible lifestyle wallet, commuter support, and rich office perks (catered meals, snacks, events) are prominently featured. Feedback suggests these everyday perks enhance the overall value of the package.

Robinhood Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Menlo Park, CA
Year Founded: 2013

What We Do

Robinhood Markets, Inc. (NASDAQ: HOOD) transformed financial services by introducing commission-free stock trading and democratizing access to the markets for millions of investors. Today, Robinhood, through its subsidiaries, lets you trade stocks, options, futures (which includes event contracts), and crypto, invest for retirement, earn with Robinhood Gold, and access an expert-managed portfolio with Robinhood Strategies. Headquartered in Menlo Park, California, Robinhood puts customers in the driver’s seat, delivering unprecedented value and products intentionally designed for a new generation of investors. Additional information about Robinhood can be found at www.robinhood.com.

Why Work With Us

At Robinhood, we are building an elite team, applying frontier technologies to the world’s biggest financial problems. Every role can make a real impact. We’re a team of builders, doers, and bold thinkers who thrive in a high-performance environment and take pride in raising the bar for ourselves and the millions of customers we serve.

Gallery

Gallery

Similar Jobs

SailPoint Logo SailPoint

Senior Machine Learning Engineer

Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
Remote or Hybrid
United States
2461 Employees
119K-201K Annually

Snap Inc. Logo Snap Inc.

Principal Software Engineer

Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Remote or Hybrid
6 Locations
5000 Employees
235K-414K Annually

Snap Inc. Logo Snap Inc.

Account Manager

Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Hybrid
5 Locations
5000 Employees
91K-161K Annually

Brigit Logo Brigit

Senior Software Engineer

Fintech • Mobile • Social Impact • Financial Services
Remote or Hybrid
USA
132 Employees
170K-190K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account