AI Engineer Backend (Remote - ES/UK only)

Reposted 9 Days Ago
2 Locations
In-Office or Remote
Senior level
Artificial Intelligence • Healthtech
The Role
Design, build, and operate low-latency multi-agent LLM systems. Ensure high-quality real-time communication and optimize prompt programming and retrieval processes.
About Us

Quadrivia is the health technology company behind Qu, a comprehensive, controllable, and customizable AI assistant built by clinicians, for clinicians. Addressing the urgent shortage of healthcare professionals, Qu provides real-time, personal, and reliable support for clinical tasks across the care continuum. Designed for providers, payers, and pharmaceutical companies, Qu is easy to customize and integrates seamlessly into existing workflows.

The Role

You'll build and run Cortex, the core AI architecture behind Qu, and the services that sit on top of it: automated AI audits, patient simulators, retrieval (RAG), and the escalation agents that take over in red-guardrail situations. This is a backend role first. The job is to make our AI systems reliable, fast, and observable in production, not to invent new ML. You own the software underneath the agents.

We're a small team that ships real systems. We've built our entire AI-driven evaluation system, our voice orchestrator, and a multi-hierarchical RAG platform from scratch.

What You'll Do
  • Design and maintain robust, modular backend systems using clean architectural (SOLID) principles to ensure long-term maintainability, scalability and flexibility as the agentic stack evolves.

  • Own Cortex end-to-end: architecture, API design, service boundaries, reliability targets, and proactive management of failure modes.

  • Build the platform services around it: the automated audit and eval pipeline, patient simulators for testing agents at scale, and the retrieval layer.

  • Write fast, well-tested Python services with FastAPI, asyncio, and pydantic, and get the queues, caching, and data stores right.

  • Wire up the multi-agent orchestration: routing between agents, shared state, and clean tool interfaces.

  • Engineer the RAG pipeline for high-signal retrieval (chunking, hybrid search, re-ranking, caching) and prove the grounding holds.

  • Make the whole thing observable: structured logs, OTEL tracing across the agent graph, cost, latency and token visibility, dashboards, and CI gates that catch regressions before they ship.
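
To give a flavour of the retrieval work, here is a minimal sketch of one common way to merge keyword and vector results in a hybrid search: reciprocal rank fusion (RRF). This is an illustration only, not a description of how Cortex does it. The function name, the document ids, and the constant k=60 (a conventional default for RRF) are all assumptions.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document ids into one fused ranking.

    Each document scores 1 / (k + rank) per list it appears in, so items
    ranked highly by more than one retriever rise to the top.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by id so output is deterministic.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a keyword index and a vector store.
keyword_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_c", "doc_a", "doc_d"]

fused = reciprocal_rank_fusion([keyword_hits, vector_hits])
print(fused)  # ['doc_a', 'doc_c', 'doc_b', 'doc_d']
```

In a real pipeline the fused list would typically feed a re-ranker and then be checked against grounding metrics; RRF is only the merge step.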

Minimum Qualifications
  • Your core is backend and software engineering. You write clean, maintainable services and you care how they behave in production.

  • Deep understanding of architectural design patterns (e.g., Clean/Hexagonal Architecture, Domain-Driven Design, SOLID, event-driven) to manage complex system boundaries.

  • At least 2 years of demonstrable experience building or scaling user-facing AI software that real users touched. We'll want to see it.

  • Expert Python, with strong FastAPI, asyncio, pydantic, and production observability.

  • Comfortable with agent patterns and eval-driven development.

  • You've worked at a startup before and know what wearing several hats actually costs.

Nice to Have
  • Real-time and voice: WebRTC, LiveKit, SIP, VAD, barge-in, turn-taking. Useful here, not required.

  • Programmatic prompt optimization techniques.

  • LLM-as-judge setups and other evaluation tooling.

  • GCP: Cloud Run or GKE, Pub/Sub, Vertex AI, GCS, Secret Manager, Cloud Logging and Trace.

  • Healthcare data familiarity.

Example Problems You'll Tackle
  • Stand up the AI audit pipeline so evals run automatically on slices of production traffic, with regression gates wired into CI.

  • Build a patient simulator that lets us stress-test agents at scale before they ever reach a real call.

  • Improve the RAG pipeline with hybrid retrieval and re-ranking, then prove the gains with faithfulness and context metrics.

  • Get OTEL-first tracing across the agent graph, with automated eval triggers on live traffic.

  • Turn EHR integrations into reliable tools the agents can call.
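
As a rough idea of what a regression gate can look like, the sketch below compares eval metrics from a candidate build against a stored baseline and reports anything that dropped by more than a tolerance. It is a simplified illustration under stated assumptions: the metric names, values, and tolerance are hypothetical, and it assumes higher scores are better for every metric.

```python
def find_regressions(baseline, candidate, tolerance=0.02):
    """Return metrics where the candidate is worse than the baseline.

    A metric regresses if it is missing from the candidate or has dropped
    by more than `tolerance`. Assumes higher values are better.
    """
    regressions = {}
    for name, base_value in baseline.items():
        new_value = candidate.get(name)
        if new_value is None or base_value - new_value > tolerance:
            regressions[name] = (base_value, new_value)
    return regressions


# Hypothetical eval scores; real numbers would come from the eval pipeline.
baseline = {"faithfulness": 0.92, "context_precision": 0.85}
candidate = {"faithfulness": 0.93, "context_precision": 0.80}

failed = find_regressions(baseline, candidate)
print(failed)  # {'context_precision': (0.85, 0.8)}
```

A CI job could run this after the eval step and exit with a non-zero status whenever `failed` is non-empty, which blocks the merge.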

Tech Stack

Python, FastAPI, pydantic, asyncio, Redis, Postgres, vector stores, Docker, Kubernetes, Terraform, ArgoCD, OTEL, TypeScript, React. Real-time stacks (WebRTC, LiveKit, SIP, STT/TTS) where the work touches voice.

What Success Looks Like
  • Quadrivia's backend becomes a reference for reliability, safety, and performance.

  • Your services run above 99.99% availability under strict regulatory constraints.

  • Other engineers build new clinical workflows and agent capabilities quickly and safely.

  • AI-generated code gets reviewed, corrected, and owned by you.

Skills Required

  • 5+ years in ML or backend engineering
  • Expert in Python
  • Strong FastAPI, asyncio, pydantic, and production observability
  • Experience with low-latency text/voice systems
  • Hands-on with ReAct and CoT

The Company
HQ: London
29 Employees

What We Do

Quadrivia is building Qu, a personal clinical assistant with wide capabilities across the care spectrum. It is designed by clinicians, for clinicians and the communities we serve. According to the World Health Organization, there is a global shortfall of up to 18 million healthcare workers. This structural imbalance between elastic demand from the population and constrained supply of providers is a root cause of the worldwide challenge with the quality, accessibility and affordability of healthcare. Care is delivered globally in many different ways. By making Qu comprehensive, controllable and customizable, we aim to help healthcare professionals be in control of AI’s utility, safety and quality. Qu’s cognitive architecture of expanding agents is designed to support clinicians not just for a single task but across the full stack of routine clinical and administrative tasks, patient interactions, decision-making, chronic and postoperative care, and continuous monitoring and support, in multiple languages.
