Senior Applied AI Scientist

Posted 4 Hours Ago
Easy Apply
Be an Early Applicant
New York, NY, USA
Hybrid
182K-220K Annually
Senior level
Healthtech • Pharmaceutical • Telehealth
Ro's mission is to help patients achieve their health goals by delivering the easiest, most effective care possible.
The Role
Own evaluation, measurement, and optimization of production LLM-powered features. Design reproducible evaluation frameworks, run experiments and analyses to identify failure modes and regressions, build metrics and dashboards, partner with engineering to productionize improvements, and mentor teammates on experimental design and measurement best practices.
Summary Generated by Built In

Join Tech @ Ro to build the future of healthcare, from the ground up!

At Ro, we believe that when people achieve their health goals, they can achieve their life goals. The highest-leverage way to move society forward is to give people their health, and the current healthcare system isn’t built to do that. It was built to bill, not to serve patients.

We’re building a new system. One where the patient is in control. One designed from scratch for the digital age.

At Ro, technology isn’t just a function… It's core to how we deliver care. We’ve built a vertically integrated healthcare platform that connects telehealth, diagnostics, pharmacy, and logistics into a seamless, end-to-end experience for millions of patients.

…and we’re just getting started. 

As part of Tech @ Ro, you’ll work on systems that operate at scale, with an opportunity to:

  • Solve complex, high-concurrency problems across a full-stack platform
  • Build and ship quickly with tight feedback loops and real-world impact
  • Own systems end-to-end, from architecture to production performance
  • Work alongside experienced operators, technical leaders, and clinicians
  • Help define how modern healthcare should be delivered

We’re a performance-driven team with a strong sense of ownership and urgency. We move fast, learn quickly, and hold a high bar for what we build, and do so with a big heart — because patients depend on it.

If you’re motivated by impact, scale, and the chance to help lead the patient revolution, come build with us.


The Role
Ro is building a team focused on shipping LLM-powered products across the patient experience, clinical operations, and internal tooling.

We're hiring a Senior Applied AI Scientist to own the evaluation, measurement, and optimization of our AI systems. This role sits at the intersection of data science, applied machine learning, and product engineering. You'll design the frameworks that tell us whether our AI systems are actually working and use those insights to continuously improve them.

This is not a research role. You'll work closely with engineers and product teams to evaluate production systems, run experiments, identify failure modes, and ensure our AI products become more accurate, reliable, and cost-effective over time.

What You'll Do

  • Design and own evaluation frameworks for production LLM features, including LLM-as-a-judge evaluations, regression suites, synthetic datasets, golden datasets, and human review workflows.
  • Analyze production behavior to identify quality issues, hallucinations, latency bottlenecks, cost regressions, and emerging failure modes.
  • Design and run experiments including prompt variations, workflow changes, retrieval improvements, and model comparisons; and quantify their impact on quality, operational metrics, and user outcomes.
  • Define the metrics that matter and build dashboards that make AI performance visible across the organization.
  • Partner with engineering to determine which optimizations should be productionized and how to measure ongoing success.
  • Mentor teammates on experimental design, statistical rigor, evaluation methodology, and measurement best practices.

Who You Are

  • 5+ years of experience in data science, applied machine learning, experimentation, or a closely related field, with at least the last year focused on applied LLMs or AI evaluation.
  • Strong Python and SQL skills with experience working on production data pipelines and experimentation.
  • You have experience designing reproducible evaluation frameworks rather than relying on manual spot checks or qualitative assessments.
  • You have strong statistical intuition: you think in terms of distributions, confidence intervals, variance, and sample sizes rather than anecdotes.
  • You’re comfortable working closely with engineers and product teams to translate experimental findings into production improvements
  • Bonus: Experience with evaluation platforms (e.g. Braintrust, LangSmith, OpenAI Evals), experimentation platforms, causal inference, healthcare, or operations-heavy environments.

A note on reporting structure

This is a new function at Ro, and we're being deliberate about not over-defining it. Your manager and where you sit on the org chart will depend on the specific shape of the team we end up with. We'd rather find the right people and figure out the lines around them than pre-draw boxes and miss great candidates. If that ambiguity is a deal-breaker, this isn't the right role; if it sounds like an opportunity, we want to talk.

-
 
The target base salary for this position ranges from $182,300 to $220,000, in addition to a competitive equity and benefits package (as applicable). When determining compensation, we analyze and carefully consider several factors, including location, job-related knowledge, skills and experience. These considerations may cause your compensation to vary.

Ro is consistently recognized as a top workplace in Health Care, in New York, and for Women and Parents—earning more than 20 honors from Fortune, Great Place to Work, and PEOPLE since 2021. In 2025 alone, we ranked top 5 among medium workplaces in Health Care and New York, and top 50 nationwide.
 
At Ro, we believe that our diverse perspectives are our biggest strengths — and that embracing them will create real change in healthcare. As an equal opportunity employer, we provide equal opportunity in all aspects of employment, including recruiting, hiring, compensation, training and promotion, termination, and any other terms and conditions of employment without regard to race, ethnicity, color, religion, sex, sexual orientation, gender identity, gender expression, familial status, age, disability and/or any other legally protected classification protected by federal, state, or local law.
 
Ro is committed to providing reasonable accommodations for qualified individuals with disabilities in our application and interview process. If you require a reasonable accommodation in the application or interview process, please contact us at [email protected].
 
See our California Privacy Policy here.

Skills Required

  • 5+ years in data science, applied machine learning, experimentation, or closely related field with at least one year focused on applied LLMs or AI evaluation.
  • Strong Python skills.
  • Strong SQL skills.
  • Experience working on production data pipelines and experimentation systems.
  • Experience designing reproducible evaluation frameworks for production LLM features (LLM-as-judge, regression suites, synthetic/golden datasets, human review workflows).
  • Strong statistical intuition (distributions, confidence intervals, variance, sample sizes) and rigorous experimental design.
  • Ability to partner with engineering and product teams to translate experimental findings into production improvements.
  • Mentoring teammates on experimentation, evaluation methodology, and measurement best practices.
  • Experience with evaluation platforms (e.g., Braintrust, LangSmith, OpenAI Evals), experimentation platforms, causal inference, healthcare, or operations-heavy environments.

What the Team is Saying

Kim
Rachel
Andres
Ross
Kerry
Jay
Zach
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York, NY
824 Employees
Year Founded: 2017

What We Do

Ro is a direct-to-patient healthcare company with a mission of helping patients achieve their health goals by delivering the easiest, most effective care possible. Ro is the only company to offer nationwide telehealth, labs, and pharmacy services. This is enabled by Ro's vertically integrated platform that helps patients achieve their goals through a convenient, end-to-end healthcare experience spanning from diagnosis, to delivery of medication, to ongoing care. Since 2017, Ro has helped millions of patients in nearly every single county in the United States, including 98% of primary care deserts.

Why Work With Us

Ro is powering quality care at scale. The Ro Operating System (ro.OS) vertically integrates the core parts of healthcare, bringing together nationwide telehealth, lab, and pharmacy services on one platform. The result? ro.OS makes it easier for patients to access and providers to deliver high-quality care – millions of times over.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Ro (Ro.co) Teams

Team
Technology
Team
Clinical
Team
Pharmacy
About our Teams

Ro (Ro.co) Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Ro’ers in the tri-state area join their colleagues in the NY Hub twice a week for in-person collaboration.

Typical time on-site: 2 days a week
HQRo HQ
US
Learn more

Similar Jobs

Ro (Ro.co) Logo Ro (Ro.co)

Data Scientist

Healthtech • Pharmaceutical • Telehealth
Easy Apply
Hybrid
New York, NY, USA
824 Employees
150K-184K Annually

Ro (Ro.co) Logo Ro (Ro.co)

Product Engineer

Healthtech • Pharmaceutical • Telehealth
Easy Apply
Hybrid
New York, NY, USA
824 Employees
150K-184K Annually

Ro (Ro.co) Logo Ro (Ro.co)

Product Engineer

Healthtech • Pharmaceutical • Telehealth
Easy Apply
Hybrid
New York, NY, USA
824 Employees
182K-220K Annually

Ro (Ro.co) Logo Ro (Ro.co)

Artificial Intelligence Engineer

Healthtech • Pharmaceutical • Telehealth
Easy Apply
Hybrid
New York, NY, USA
824 Employees
150K-184K Annually

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account