Lead AI Engineer

Posted 5 Days Ago
Be an Early Applicant
San Francisco, CA
Hybrid
Senior level
Artificial Intelligence • Information Technology • Machine Learning • Software
The Role
Lead the technical direction and hands-on implementation of Hilbert's AI stack: design and ship agent-based workflows, production-grade LLM systems, evaluation pipelines, and monitoring. Set architecture and engineering standards, prioritize work, and hire/mentor engineers while collaborating across product, data, and GTM.
Summary Generated by Built In
Hilbert is a scalable, data science-first growth engine that gives B2C teams predictive clarity into user behavior, revenue drivers, and the actions that drive sustainable growth. Fully agentic by design, Hilbert shrinks months-long decision cycles to minutes.

From Fortune 10 enterprises to beloved brands like FreshDirect, Blank Street, and Levain Bakery, operators run their growth on Hilbert. We're also co-building alongside leading AI companies.

We're looking for a Lead AI Engineer who can own the technical direction of Hilbert's AI stack, ship production-grade systems hands-on, and elevate a growing engineering team; all with the ownership and urgency of a founder.

This is not a "manage from a distance" role. You'll write code, make architecture calls, set the bar for quality and velocity, and build the engineering culture as we scale. You're the person the team looks to when the problem is ambiguous, the stakes are high, and the path forward hasn't been written yet. If you combine deep technical craft with the ability to lead people and communicate with clarity, we want to meet you.

THE ROLE

You'll work directly with the founding team and across product, data, and GTM to lead the design, development, and evolution of the AI systems at the heart of Hilbert. You'll be hands-on and in the code daily — but you'll also be the person who defines how we build, how we prioritize, and how we grow the engineering team. The environment is high-autonomy and high-ambiguity — the nature of building AI-native products means requirements shift, approaches evolve, and the person closest to the problem often makes the call. As Lead, you make sure the team is equipped to make those calls well.

Build: hands-on, every day

  • Design, build, and maintain AI-driven features and pipelines that serve enterprise customers at scale

  • Architect and implement agent-based workflows using LangChain, LangGraph, or equivalent orchestration frameworks

  • Own critical systems end-to-end — from experimentation through production deployment and monitoring

  • Build and improve evaluation pipelines to measure, validate, and iterate on AI system performance

  • Make pragmatic engineering decisions under ambiguity — ship, learn, iterate

Lead: set direction and raise the bar

  • Define and own the technical roadmap for the AI stack in partnership with the founding team

  • Make architecture and infrastructure decisions that balance speed today with scalability tomorrow

  • Set engineering standards — code quality, review practices, testing, documentation, and deployment discipline

  • Prioritize ruthlessly across competing demands, keeping the team focused on highest-impact work

  • Communicate technical strategy, tradeoffs, and progress clearly to founders and non-technical stakeholders

  • Be the tiebreaker when the team is stuck — on architecture, approach, or prioritization

Grow: build the team and the culture

  • Hire, mentor, and develop AI engineers as the team scales

  • Create an environment of ownership, intellectual honesty, and high-velocity shipping

  • Run effective processes without bureaucracy — standups, reviews, retros that actually help

  • Identify skill gaps and build the team to fill them — whether through hiring, upskilling, or restructuring work

  • Lead by example: the team sees you in the code, in the reviews, in the hard problems — not just in meetings

WHO THRIVES IN THIS ROLE

We care about how you think, how you ship, and how you make others around you better.

The profile:
  • You're a strong Python engineer first. Your code is clean, testable, and production-ready. You haven't left the codebase behind — you lead from inside it

  • You have deep experience with LangChain, LangGraph, or equivalent agent/orchestration frameworks. You've built with them at scale, hit their limits, designed around them, and have opinions about when to use them and when not to

  • You communicate with clarity and conviction. You can align a team around a technical direction, explain a tradeoff to a non-technical founder, and give direct, constructive feedback to an engineer — all in the same day. Communication is not a nice-to-have here — it's the job

  • You take ownership at the team level. You don't just own your own output — you own the team's output. If something falls through the cracks, you treat it as your problem

  • You thrive in ambiguity and help others do the same. AI products evolve fast. You bring structure to chaos without killing speed — and you coach the team to operate the same way

  • You move at startup speed and expect the same from your team. You understand what it means to be available, responsive, and biased toward action in a fast-moving, early-stage environment. You set that tempo

Strong pluses:
  • Experience building evals pipelines — designing metrics, running systematic evaluations, and using results to drive iteration on AI systems

  • Backend software engineering experience — building APIs, services, data infrastructure, or production systems beyond the ML/AI layer

  • Exposure to retrieval-augmented generation (RAG), vector databases, or LLM-powered search and recommendation systems

  • Prior experience as a tech lead, engineering manager, or founding engineer at an early-stage or high-growth company

  • Track record of hiring and developing engineers — not just managing them

A senior engineer or tech lead at a startup who's ready to own the entire AI function. A founding engineer who built a team around themselves and wants to do it again at a company where the AI stack is the product. An engineering manager who refuses to stop coding. Someone who's been leading agents and LLM infrastructure work at a larger company and wants full ownership and zero bureaucracy. What matters: you ship, you lead from the front, you raise the bar for everyone around you, and you communicate like a partner — not a silo.


Location
San Francisco, with occasional travel for team meets, offsites or customer engagements.
CompensationCompetitive salary + equity package, commensurate with experience.
Performance-based bonuses tied to project milestones and customer impact.
The Hiring JourneyShort form → Intro call → Technical working session → Team conversations → OfferFast, human, no bureaucracy.

Top Skills

Python,Langchain,Langgraph,Llms,Retrieval-Augmented Generation (Rag),Vector Databases,Evals Pipelines,Agent Orchestration Frameworks
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
20 Employees

What We Do

Hilbert is a scalable, data science-first growth engine that gives B2C teams predictive clarity into user behavior, revenue drivers, and the actions that drive sustainable growth. Fully agentic by design, Hilbert shrinks months-long decision cycles to minutes. One Engine. Four Intelligence Layers. Built for Momentum. Hilbert turns fragmented data into clear answers and decisive actions using a proprietary architecture of AI/ML algorithms accessible through natural language. No code, no dashboards, no guesswork.

Similar Jobs

Capital One Logo Capital One

Artificial Intelligence Engineer

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
4 Locations
55000 Employees
230K-286K Annually

Capital One Logo Capital One

Artificial Intelligence Engineer

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
4 Locations
55000 Employees
197K-246K Annually

Mastercard Logo Mastercard

Devops Engineer

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Remote or Hybrid
San Carlos, CA, USA
38800 Employees
140K-277K Annually

Mastercard Logo Mastercard

Devops Engineer

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Remote or Hybrid
San Diego, CA, USA
38800 Employees
140K-277K Annually

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account