The Role
Lead development of production agentic AI systems: reasoning, planning, tool orchestration, memory and grounding, evaluation frameworks, and multimodal agent coordination across chat, voice, and UI. Translate research into reliable, testable systems and iterate with founders and customers.
Summary Generated by Built In
Mission
About the Role
What You’ll Do
What You’ll Bring
Bonus
What We Offer
If you ask a hospital today how much a treatment will cost, the answer is usually: “We don’t know.”
Not because providers don’t want to tell you but because patient cost is computed across insurance plans, negotiated rates, and billing rules that are fragmented, opaque, and not designed for real-time transparency.
What should be a simple question requires navigating a web of systems that don’t talk to each other.
So humans do that work instead.
They log into portals, call payers, follow decision trees, and manually stitch together answers across disconnected systems. Even when providers want to give a clear answer, the system makes it nearly impossible.
At Bravebird, we’re changing that.
We’re building agents that do the work between systems - end to end.
We believe the future of work is machines talking to machines, handling fragmented, system-to-system workflows so humans can focus on decisions, judgment, and care.
About the Role
We’re building agentic AI systems that can reason, act, and operate reliably in messy, real-world environments - across chat, voice, and full computer-use interfaces.
As a Founding Engineer (Applied ML), you’ll help build the core intelligence powering these systems: reasoning, planning, grounding, memory, evaluation, and reliability.
This is a deeply technical, high-ownership role. You’ll work directly with founders, shape the technical direction, and ship systems that are used in real-world, high-stakes environments.
• Design and implement reasoning and planning systems for real production workflows
• Build robust tool orchestration, grounding, and execution frameworks
• Develop memory, context, and state management for long-horizon tasks
• Create evaluation systems that measure correctness, reliability, and performance
• Work across chat, voice, and UI-based agents to make them coordinated and dependable
• Translate research ideas into production systems
• Partner closely with founders and customers to iterate quickly and ship
• Help define engineering culture, standards, and early technical direction
• 5+ years of software engineering experience
• 2+ years building and shipping LLM or agentic systems in production
• Experience with evals, memory, retrieval, and grounding architectures
• Comfort operating in ambiguous, open-ended problem spaces
• Strong bias toward building reliable, measurable, testable systems
• Product intuition - you care about real users and real outcomes
• Clear communication and strong collaboration skills
• Experience with fine-tuning or reinforcement learning
• Experience building multimodal agents (voice, computer-use)
• Experience in healthcare or other regulated environments
• Founding-level ownership and impact
• Competitive compensation with meaningful equity
• Direct access to founders and customers
• The opportunity to ship production systems in high-stakes environments
We’re based in San Francisco and prefer working in person, but are flexible for exceptional remote candidates. Visa sponsorship available.
Skills Required
- 5+ years of software engineering experience
- 2+ years building and shipping LLM or agentic systems in production
- Experience with evals, memory, retrieval, and grounding architectures
- Strong bias toward building reliable, measurable, testable systems
- Product intuition and focus on real user outcomes
- Clear communication and strong collaboration skills
- Experience with fine-tuning or reinforcement learning
- Experience building multimodal agents (voice, computer-use)
- Experience in healthcare or other regulated environments
Am I A Good Fit?
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.
Success! Refresh the page to see how your skills align with this role.
The Company
What We Do
Bravebird is an AI company that automates healthcare eligibility and benefit verification. It utilizes agentic AI to retrieve real-time patient insurance information directly from payer portals, integrating critical data such as co-pays and deductibles into practice management systems. Their mission is to replace fragmented, manual workflows between disparate systems with reliable AI agents, allowing healthcare providers to focus more on patient care.









