Principal Research & Engineering, Realtime Voice AI

Palo Alto, CA, USA
In-Office
400K-550K Annually
Expert/Leader
Generative AI
The Role
Lead and build Inflection's realtime Voice AI stack across speech models, streaming systems, voice-agent runtime, and evaluation. Define roadmap, direct research and engineering, design evaluation metrics beyond WER, deploy enterprise voice agents, debug production behavior, and mentor a team focused on speech research, realtime systems, and audio infrastructure.
About Inflection AI

Inflection AI is a Public Benefit Corporation empowering people with human-centered, emotionally intelligent AI. We’re shaping the future of AI by combining emotional intelligence (EQ) and raw intelligence (IQ) to elevate people’s potential.
Inflection AI created Pi, the world’s first emotionally intelligent AI, to help people work through decisions, emotions, and challenges. Pi is a personal AI agent powered by Inflection AI’s foundation model, proving that AI can be personal, empathetic, and contextually aware.

About the Role

Voice is becoming the highest-stakes interface for AI, where quality depends on speed, naturalness, interruption handling, emotional nuance, and reliability in real-world conditions. We are looking for a hands-on technical leader to define and build Inflection’s realtime Voice AI stack across speech models, streaming systems, voice-agent runtime, and evaluation. This person will help shape how emotionally intelligent AI shows up in spoken interactions, partnering across research, engineering, product, and design to deliver voice agents that feel responsive, trustworthy, and useful in enterprise settings.

What You’ll Do

  • Establish the technical roadmap for Inflection's realtime Voice AI stack, encompassing streaming ASR, TTS, speech-to-speech, speech LLMs, turn-taking, barge-in, latency, and reliability.
  • Use a 1,000-GPU cluster for performance benchmarking and large-scale experimentation.
  • Determine build-vs-buy-vs-train strategies for core audio, speech, and realtime interaction components.
  • Direct research and engineering efforts focused on speech quality, naturalness, expressiveness, emotional fit, controllability, and production readiness.
  • Collaborate with infrastructure, product, design, and agentic AI teams to deploy voice agents for enterprise workflows.
  • Develop evaluation systems that measure voice quality beyond standard word error rate (WER), using metrics such as clarity, emotional appropriateness, interruption handling, task success, user preference, latency, and reliability.
  • Refine production voice behavior by debugging across runtime, model, evaluation, data, and product layers.
  • Mentor and coach a team specializing in speech research, audio infrastructure, realtime systems, and evaluation.


What We’re Looking For

  • Experience leading or serving as a principal research and engineering contributor to realtime voice, speech, audio AI, or conversational AI systems in production.
  • Experience with one or more of: streaming ASR, TTS, speech-to-speech systems, speech LLMs, audio tokenization, multimodal models, barge-in, low-latency inference, or realtime agents.
  • Strong technical judgment across both speech modeling and production systems.
  • Ability to define voice quality in terms of user and customer outcomes, not only offline model metrics.
  • Experience designing or using evaluation systems that capture real user experience.
  • Strong product intuition for natural, trustworthy, emotionally appropriate voice interactions.
  • Ability to lead senior technical talent while staying close to the code, architecture, and debugging work.
  • Bachelor’s degree or equivalent in a field related to the position.
Employee Pay Disclosures

At Inflection AI, we aim to attract and retain the best employees and compensate them in a way that appropriately and fairly values their individual contributions to the company. For this role, Inflection AI estimates that the starting annual base salary will fall within the range of $400,000 to $550,000, depending on a candidate’s qualifications and level of experience. This role also includes a meaningful equity component, allowing employees to share in the long-term success of the company.


Benefits

Inflection AI values and supports our team’s mental and physical health. We are focused on building a positive, safe, inclusive and inspiring place to work. Our benefits include: 

  • Diverse medical, dental and vision options 
  • 401k matching program 
  • Unlimited paid time off 
  • Parental leave and flexibility for all parents and caregivers
  • Support of country-specific visa needs for international employees living in the Bay Area

Skills Required

  • Experience leading or serving as a principal research and engineering contributor to realtime voice, speech, audio AI, or conversational AI systems in production.
  • Experience with one or more of: streaming ASR, TTS, speech-to-speech systems, speech LLMs, audio tokenization, multimodal models, barge-in, or low-latency inference.
  • Strong technical judgment across both speech modeling and production systems.
  • Ability to define voice quality in terms of user and customer outcomes, not only offline model metrics.
  • Experience designing or using evaluation systems that capture real user experience and metrics beyond WER.
  • Strong product intuition for natural, trustworthy, emotionally appropriate voice interactions.
  • Ability to lead senior technical talent while staying close to code, architecture, and debugging work.
  • Bachelor's degree or equivalent in a related field.

The Company
HQ: Palo Alto, CA
31 Employees
Year Founded: 2022

What We Do

We are an AI studio creating a personal AI for everyone. Our first AI is called Pi, for personal intelligence, a supportive and empathetic conversational AI. Our studio is made up of the world’s leading AI developers, creative designers, writers and innovators working together to create a brand new class of digital experiences. This is an era of exponential change. Our name Inflection embraces this moment of transformation, while our status as a public benefit corporation provides us with the legal mandate to prioritize the well-being and happiness of our users and wider stakeholders above all else.
