AI Engineer | Agentic Systems

Reposted 24 Days Ago
Hiring Remotely in United States
Remote
Mid level
Machine Learning • Software
We develop software that helps people get the right medical care, at the right time, at the right price.
The Role
The NLP Scientist will solve complex medical document understanding problems using NLP and CV methodologies, optimize models for performance, and make data-driven decisions.
Summary Generated by Built In

Machinify is a leading healthcare intelligence company with expertise across the payment continuum, delivering unmatched value, transparency, and efficiency to health plan clients across the country. Deployed by over 85 health plans, including many of the top 20, and representing more than 270 million lives, Machinify brings together a fully configurable and content-rich, AI-powered platform along with best-in-class expertise. We’re constantly reimagining what’s possible in our industry, creating disruptively simple, powerfully clear ways to maximize financial outcomes and drive down healthcare costs.

Machinify is a leading healthcare intelligence company with expertise across the payment continuum, delivering unmatched value, transparency, and efficiency to health plan clients across the country. Deployed by over 85 health plans — including many of the top 20 and representing more than 270 million lives — Machinify brings together a fully configurable, content-rich, AI-powered platform along with best-in-class expertise. We're constantly reimagining what's possible in our industry, creating disruptively simple, powerfully clear ways to maximize financial outcomes and drive down healthcare costs.

The Role

We're building production-grade agentic systems that audit medical claims end-to-end — reading raw medical records, reasoning over coding and clinical guidelines, and producing defensible findings that hold up to clinical and regulatory review. Reaching human-expert accuracy on noisy, long-context documents is one of the hardest unsolved problems in applied AI, and the field is moving weekly.

We're hiring an L4 AI Engineer who can step into an ambiguous problem, design an agent system from scratch, and ship it. You won't be plugging into someone else's architecture — you'll be deciding what the architecture should be.

What You'll Do

- Design agent systems from first principles. Decide the loop, the tools, the context strategy, the evaluation harness. Choose between single-agent and multi-agent topologies, between LLM reasoning and deterministic post-passes, between retrieval and direct context loading — and defend the choice with data.
- Engineer the context. The hardest part of building a good agent is what goes into the prompt and what comes out. You'll obsess over context windows, tool surfaces, structured outputs, citation grounding, and the prompt itself.
- Drive evaluation rigor. Build evals before you build the agent. Diagnose where it fails, fix the root cause, and prove the fix moved the metric.
- Use AI tooling like a power user. A meaningful fraction of your day will be spent driving Claude Code, Codex, and similar tools to plan, scaffold, refactor, and debug your own work. We expect you to be faster with these tools than most engineers are without them.
- Become a domain expert. Healthcare claims, coding guidelines, and the medical record itself are unavoidable parts of the job. Strong engineers who lean into the domain become outsized contributors here.

What We're Looking For

Required

- 2–4 years of applied ML / AI engineering experience with a Bachelor's in CS, Math, Engineering or equivalent — or a Master's in a similar program with no prior industry experience required. Either way, at least one production-quality system (industry, research, or substantial open-source) you owned end-to-end.
- Strong Python engineering. Clean abstractions, type discipline, async, tested code.
- Deep, hands-on understanding of agent loops — how a model decides to call a tool, how a tool result re-enters context, how loops terminate, where they fail.
- Hands-on experience with at least one major agent SDK — OpenAI Agents SDK, Anthropic SDK / claude-agent-sdk, LangGraph, or equivalent — and an opinion on the tradeoffs.
- Working knowledge of how modern coding agents are built and how they engineer context — what goes in the system prompt, how files are read and edited, how long-running tasks are planned and tracked, where they break.
- Fluency with Claude Code / Codex as a power user. You should be able to brainstorm, plan, and execute non-trivial engineering tasks with these tools — including reading their source when needed to understand or extend behavior.
- Solid command of VS Code and git — branches, rebases, worktrees, conflict resolution, PR workflows. Not optional.
- A bias toward measurement: you don't ship without an eval, and you don't believe a number you can't reproduce.

Strongly preferred

- Experience designing structured outputs (Pydantic / JSON Schema) and tool interfaces that LLMs reliably call correctly.
- Familiarity with reasoning models (o-series, Claude extended thinking, Gemini thinking) and a sense of when they earn their cost.
- Prior work on long-context, citation-grounded systems where the model must point to evidence, not just answer.
- Healthcare, legal, finance, or any other domain where "mostly right" is unacceptable.

Nice to have

- Document understanding (OCR, layout-aware models, table extraction).
- Vision-language models, multimodal retrieval.
- Production experience with caching, observability, and cost control on LLM workloads.

What We Offer 

  • Work from anywhere in the US! Machinify is digital-first.
  • Top Medical/Dental/Vision offerings
  • FSA/HSA
  • Tuition reimbursement
  • Competitive salary, 401(k) with company match
  • Unlimited PTO
  • Additional health and wellness benefits and perks
  • Flexible and trusting environment where you’ll feel empowered to do your best work 

The salary for this position is based on an array of factors unique to each candidate: Such as years and depth of experience, set skills, certifications, etc. We are hiring for different levels and the base salary can range from $130k-$200k+ based on your assessed level. Compensation also includes meaningful equity, healthcare, unlimited PTO, and more.

Equal Employment Opportunity at Machinify
 
We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender, gender identity or expression, or veteran status. We are proud to be an equal opportunity workplace. Machinify is an employment at will employer. We participate in E-Verify as required by applicable law. In accordance with applicable state laws, we do not inquire about salary history during the recruitment process. If you require a reasonable accommodation to complete any part of the application or recruitment process, please let our recruiters know. See our Candidate Privacy Notice at: https://www.machinify.com/candidate-privacy-notice/

Skills Required

  • Expertise in NLP and/or Computer Vision
  • Strong Software Engineering Skills
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Palo Alto, CA
96 Employees
Year Founded: 2016

What We Do

Machinify is an AI start-up in the Healthcare space. Our software platform leverages the latest advances in machine learning, large language models, data analytics, and cloud processing to solve previously intractable problems in the healthcare industry impacting millions of lives.

Why Work With Us

We are a heavily cross-functional, collaborative diverse team working together to solve big problems that matter. If you are looking for an exciting environment where you'll be challenged to do the best work of your career, consider joining us!

Gallery

Gallery

Similar Jobs

ServiceNow Logo ServiceNow

Software Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Santa Clara, CA, USA
29000 Employees
201K-352K Annually

Kindo Logo Kindo

Systems Engineer

Artificial Intelligence • Software
In-Office or Remote
Los Angeles, CA, USA
38 Employees
170K-220K Annually

Kindo Logo Kindo

Systems Engineer

Artificial Intelligence • Software
In-Office or Remote
Los Angeles, CA, USA
38 Employees
250K-330K Annually

Kindo Logo Kindo

Systems Engineer

Artificial Intelligence • Software
In-Office or Remote
Los Angeles, CA, USA
38 Employees
210K-260K Annually

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account