Senior AI Operations (AI Ops) Engineer

Reposted 6 Days Ago
Easy Apply
Palo Alto, CA, USA
Hybrid
116K-258K Annually
Senior level
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
Travel & expense made easy.
The Role
The role involves orchestrating AI services, optimizing model inference, ensuring reliability and standardization, and collaborating with AI researchers on deployments.
Summary Generated by Built In

At Navan, we aren't building a single, generic chatbot. We are building a Composable AI Microservice Architecture, a swarm of hundreds of hyper-specialized AI services, each meticulously "programmed" to solve small, focused tasks with high precision. This fleet powers Ava, our AI support engine, and a suite of cutting-edge generative tools for travel and expense management.

As a Senior AI Operations (AI Ops) Engineer, you are the architect of the platform that makes this scale possible. You will move beyond traditional MLOps to manage a "factory" of Language Models. Your challenge is one of orchestration and standardization, ensuring that every service in the swarm meets a rigorous bar for quality, reliability, and cost-efficiency.

What You’ll Do
  • Orchestrate the AI Fleet: Build and own the runtime environment for 100+ specialized AI services. Manage model routing, context versioning, and standardized memory/history stores.
  • High-Density Inference Optimization: Design and implement SageMaker Multi-Model Endpoints (MME) and Inference Components to serve multiple tuned SLMs per GPU, maximizing hardware utilization while minimizing latency.
  • Deterministic Service Excellence: Treat reliability as a layered engineering problem. Build deterministic "shells" around probabilistic LM outputs, prioritizing data-layer validation and strict serialization.
  • Automated Evaluation & Observability: Implement "LLM-as-a-judge" patterns and automated benchmarking to detect semantic drift and hallucinations across the fleet before they impact the user.
  • Standardize the Workflow: Obsess over building reusable patterns and Terraform-based infrastructure that eliminate "snowflake" configurations, allowing us to deploy new specialized AI tasks in minutes.
  • Agency Strategy: Partner with AI Researchers to find the "Goldilocks zone" for agentic autonomy—balancing the flexibility of LLM tool-use with the precision required for production stability.
What We’re Looking For
  • Experience: 5+ years in SRE, Platform Engineering, or MLOps, with at least 2 years focused on deploying LLMs/SLMs in production environments.
  • SageMaker Mastery: Deep hands-on expertise with AWS SageMaker, specifically configuring Multi-Model Endpoints (MME), Inference Components, and GPU-backed instances (G5/P4).
  • SLM Expertise: Proven experience with Small Language Models (e.g., Mistral, Llama 3, Phi) and parameter-efficient fine-tuning (PEFT) deployment strategies like LoRA/QLoRA.
  • Technical Stack: * Languages: Strong proficiency in Python and Terraform.
    • Orchestration: Experience with Docker, Kubernetes (EKS), or AWS ECS/Fargate.
    • Data: Familiarity with Snowflake and Vector Databases.
  • The "AI Ops" Mindset: You understand that AI at scale is a statistical challenge. You are comfortable debugging issues at the data/serialization layer rather than defaulting to prompt tweaks.
  • CI/CD & Automation: Experience building robust pipelines (Jenkins, GitHub Actions) for non-deterministic software, including automated "eval" stages.
  • Education: BS or MS in Computer Science, Engineering, Mathematics, or a related technical field.

The posted pay range represents the anticipated low and high end of the compensation for this position and is subject to change based on business need. To determine a successful candidate’s starting pay, we carefully consider a variety of factors, including primary work location, an evaluation of the candidate’s skills and experience, market demands, and internal parity.
For roles with on-target-earnings (OTE), the pay range includes both base salary and target incentive compensation. Target incentive compensation for some roles may include a ramping draw period. Compensation is higher for those who exceed targets. Candidates may receive more information from the recruiter.

Pay Range
$116,100$258,000 USD

Skills Required

  • 5+ years in SRE, Platform Engineering, or MLOps
  • At least 2 years focused on deploying LLMs/SLMs in production
  • Deep hands-on expertise with AWS SageMaker
  • Experience with Small Language Models like Mistral, Llama 3, Phi
  • Strong proficiency in Python and Terraform
  • Experience with Docker, Kubernetes, AWS ECS/Fargate
  • Familiarity with Snowflake and Vector Databases
  • Experience building robust CI/CD pipelines
  • BS or MS in Computer Science, Engineering, Mathematics or related field

What the Team is Saying

Brian Guimond
Adamas Victória Cavalcante Robitz
Bastian Martino
Charlotte Delafosse
Daniella Schuh
Alice Rao-Wyckoff
Mily O Loughlin
Anna
Roshni
Henry Statfeld
Jose Soares

Navan Compensation & Benefits Highlights

  • Fair & Transparent Compensation Pay aligns with mid‑ to upper‑market in core engineering and GTM roles, with competitive cash, equity, and bonus plans. Defined pay bands and commission tiers provide clarity on how earnings are structured.
  • Leave & Time Off Breadth Flexible/unlimited PTO is part of the package alongside paid parental leave durations for birthing and non‑birthing parents. Time‑off policies are positioned as broad and supportive across the company.
  • Wellbeing & Lifestyle Benefits Travel‑centric perks (IATAN access and discounted personal travel) combine with connectivity/home‑office stipends, commuter benefits, in‑office meals/snacks, and pet insurance. Access to Headspace supports mental‑health resources.

Navan Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Palo Alto, CA
3,300 Employees
Year Founded: 2015

What We Do

Navan (Nasdaq: NAVN) is the leading all-in-one business travel, payments, and expense management platform that makes travel easy for frequent travelers. From finding flights and hotels to automating expense reconciliation, with 24/7 support along the way, Navan delivers an intuitive experience travelers love and finance teams rely on. See how Navan customers benefit and learn more at navan.com.

Why Work With Us

At Navan, we’re never satisfied with the status quo, and we know breakthrough ideas come from diverse perspectives. We are committed to cultivating a workplace that reflects the diversity of the customers we serve while fostering leadership and innovation.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Navan Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

In-person connections is the foundation of Navan, the connections forged through face-to-face interactions improve company culture and what we can achieve together. We operate on a hybrid working model, which we define as four days a week in-office.

Typical time on-site: 4 days a week
HQPalo Alto, CA
Austin, TX
Bengaluru, IN
Berlin, DE
Boston, MA
Dallas, TX
Gurugram, IN
Lisbon, PT
London, GB
New Delhi, Delhi
New York, NY
Paris, FR
San Francisco, CA
Singapore
Sydney, AU
Tel Aviv-Yafo, IL
Learn more

Similar Jobs

Navan Logo Navan

Senior Software Engineer

Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
Easy Apply
Hybrid
2 Locations
3300 Employees
113K-252K Annually

Navan Logo Navan

Account Manager

Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
Easy Apply
Hybrid
2 Locations
3300 Employees
131K-175K Annually

Navan Logo Navan

Accounting Manager

Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
Easy Apply
Hybrid
2 Locations
3300 Employees
88K-195K Annually

Navan Logo Navan

Enterprise Account Executive

Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
Easy Apply
Hybrid
2 Locations
3300 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account