AI Engineer - Data Intelligence

Posted 4 Days Ago
Hiring Remotely in US
Remote
150K-180K Annually
Junior
Artificial Intelligence • Healthtech • Software
The Role
As an AI Engineer at Clarium, you will build and maintain data enrichment pipelines, design classification workflows, analyze datasets, and ensure data quality, primarily using Python and SQL, and work closely with senior engineers and data scientists.
Summary Generated by Built In

Why Clarium?

The healthcare industry overspends on its supply chain by over $25B each year, the result of fragmented data, inefficient workflows, and wasted supplies. Clarium is fixing that. Our AI-powered platform, Astra OS, gives hospitals end-to-end visibility into their supply chain operations, automating workflows and surfacing actionable insights so supply chain teams can focus on what matters most: patient care. We're trusted by some of the world's leading health systems, including Yale New Haven Health, Stanford, Geisinger, Cleveland Clinic, and Kaiser Permanente.

Founded in 2020, Clarium has raised $43M in total funding. Our Series A was led by Northzone, with participation from General Catalyst, AlleyCorp, Kaiser Permanente Ventures, Texas Medical Center Ventures, and 1984 Ventures.

The Opportunity

AI-powered platforms, like Clarium’s, deliver the highest impact when they are supported by high-quality data. As we scale to more health systems and deepen our offering of intelligent, data-driven workflows, the master data enrichment pipeline (the system that classifies and contextualizes every product flowing through a hospital's supply chain) has become a critical growth lever. We're investing in the team and infrastructure to make that layer faster, smarter, and more reliable.

You'll join the Data Products team, a small, unusually senior group responsible for the data assets, data science, and analytics that drive measurable value for our clients. Day-to-day, you'll build and own components of our enrichment pipeline: classification workflows, entity resolution systems, evaluation harnesses, and the production tooling that keeps it all running. You'll work closely with engineers and data scientists who've shipped real ML systems at scale, and your work will feed directly into decisions made by supply chain teams at some of the country's leading health systems.

A rare early-career opportunity to learn fast and own real work from day one. As the first junior hire on the team, you won't be buried under layers of abstraction. You'll work directly alongside people who've done this before, on problems that actually matter. Short feedback loops, real stakes, and the kind of hands-on growth that's hard to find this early in a career. It's the opportunity many of us wish we'd had starting out.

In This Role You Will

  • Build and maintain components of Clarium's master data enrichment pipeline, the system that classifies and enriches every product flowing through our platform

  • Design and own classification and entity resolution workflows that combine deterministic logic and LLMs for production data processing

  • Build and operate evaluation harnesses, label sets, and regression suites (we use Braintrust) to measure and improve pipeline quality with confidence

  • Write production Python and SQL; the majority of your time will be spent in code, not in configuration tools

  • Analyze complex datasets using statistics and ML to surface actionable insights and inform pipeline improvements

  • Proactively audit data for quality issues; find the problems no one else has noticed yet, diagnose root causes, and ship fixes

What You'll Bring

  • Strong Python skills and a track record of writing production code, not just scripts or notebooks

  • Strong SQL, including complex joins, window functions, performance tuning, and data modeling

  • Comfort working in ambiguous environments; you can scope a problem, make a plan, and execute without hand-holding

  • A genuine, non-negotiable commitment to data quality; you treat silent bugs as real failures

  • Ability to go deep on an unfamiliar domain and develop meaningful expertise over time

Nice to Have

  • Experience with LLM integrations, prompt evaluation, or classification at scale

  • Familiarity with eval frameworks such as Braintrust, Promptfoo, or equivalent

  • Prior work in healthcare, supply chain, or another domain where data quality has direct operational consequences

Skills & Tools You'll Use

Need to Know: Python · SQL · PostgreSQL · CI/CD · Production observability

Nice to Know: Temporal · Braintrust · Snowflake · AWS · Sigma

What You Get at Clarium

Target Base Salary Range: $150K - $180K

The base salary Clarium offers may vary depending upon the ultimate scope and responsibilities of the position and on the candidate’s job-related knowledge, skills, and experience. The total package will include equity, in addition to a full range of medical and/or other benefits, depending on the position offered. Pay and benefits are subject to change at any time, consistent with the terms of any applicable compensation or benefit plans.

Incentive Stock Options proportionate to your salary

Fully remote, with a NYC co-working space available; distributed team across multiple time zones with opportunities for in-person time

Unlimited PTO

Top-tier health, vision, and dental benefits

401K

The opportunity to build on a strong foundational team with deep data and engineering roots at a stage where your work genuinely shapes the product

Equal Opportunity Statement

Clarium is committed to promoting an inclusive work environment free of discrimination and harassment. We value a diverse and balanced team where everyone can belong.

Skills Required

  • Strong Python skills and a track record of writing production code
  • Strong SQL, including complex joins and performance tuning
  • Comfort working in ambiguous environments
  • Commitment to data quality
  • Ability to develop expertise over time
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York, New York
47 Employees
Year Founded: 2020

What We Do

Clarium is building the next-gen supply chain platform that healthcare needs now. We proactively manage supply disruptions, identify substitutes, forecast surgical demand, optimize inventory planning and eliminate match exceptions ever-present in our volatile era. Our AI-enabled workflow and data tools are dramatically enhancing productivity for healthcare staff at forward-thinking institutions. All so they can focus exclusively on providing exceptional patient care.

Similar Jobs

CrowdStrike Logo CrowdStrike

Data Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
7 Locations
10000 Employees
195K-320K Annually

Order.co Logo Order.co

Data Engineer

eCommerce • Fintech • Payments • Software
Remote or Hybrid
United States
120 Employees
175K-200K Annually

PNC Bank Logo PNC Bank

Software Engineer

Machine Learning • Payments • Security • Software • Financial Services
Remote or Hybrid
USA
55000 Employees

Citizens Logo Citizens

Senior Data Engineer

Digital Media • Fintech • Information Technology • Machine Learning • Financial Services • Cybersecurity • Automation
In-Office or Remote
2 Locations
17000 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account