AI Engineer

Kitchener, ON, CAN
In-Office
145K-173K Annually
Mid level
Information Technology
The Role
Design and implement TTS backend integrations, optimize linguistic inputs for naturalness, craft conversational turns and personas, manage prompt and voice-parameter systems, expose voice attributes to product UI, and collaborate with ASR and Audio AI engineers to ensure end-to-end voice quality and low latency.
Summary Generated by Built In

About Dialpad
Dialpad is the AI-native business communications platform. We unify calling, messaging, meetings, and contact center on a single platform - powered by AI that understands every conversation in real time.

More than 70,000 companies around the globe, including WeWork, Asana, NASDAQ, AAA Insurance, COMPASS Realty, Uber, Randstad, and Tractor Supply, rely on Dialpad to build stronger customer connections using real-time, AI-driven insights.

We’re now leading the shift to Agentic AI: intelligent agents that don’t just analyze conversations but take action by automating workflows, resolving customer issues, and accelerating revenue in real time. Our DAART initiative (Dialpad Agentic AI in Real Time) is redefining what a communications platform can do.

Visit dialpad.com to learn more.

Being a Dialer
At Dialpad, AI isn’t just a feature; it’s how our teams do their best work every day. We put powerful AI tools in every employee’s hands so they can move faster, think bigger, and achieve more.

We believe every conversation matters. And we’ve built the platform that turns those conversations into insight and action, for our customers and ourselves.

We look for people who are intensely curious and hold themselves to a high bar. Our ambition is significant, and achieving it requires a team that operates at the highest level. We seek individuals who embody our core traits: Scrappy, Curious, Optimistic, Persistent, and Empathetic.

Your Role
As an AI Engineer: Voice Designer, you’ll own the back-end implementation and linguistic optimization of the Text-to-Speech (TTS) layer for our next-generation AI voice agents. You’ll work squarely within our Speech Team—a high-impact R&D and engineering group focused on speech recognition, enhancement, and synthesis. You will bridge the gap between core speech science and product engineering, ensuring our voice agents sound human, context-aware, and trustworthy. You’ll also help create the systems that manage voice personas, tone, and conversational fillers, eventually exposing these as tweakable parameters to our customer-facing UI.

This position reports to our Senior Manager, AI Speech, is based at our Kitchener hub, and operates on a hybrid schedule.

What You’ll Do

  • TTS Backend Implementation: Own the integration and optimization of multiple TTS vendor APIs while leading research and prototyping for open-source or in-house TTS architectures.
  • Linguistic Optimization: Apply expertise in phonetics and sociolinguistics to ensure TTS input is formatted for maximum naturalness, including SSML orchestration and pronunciation handling.
  • Conversational Turn Design: Craft context-specific utterances to optimize turn handling and build caller trust during agentic "thought" processes.
  • Prompt & Persona Management: Design and manage LLM and TTS prompts and parameters to define and refine agent personalities across different industry verticals.
  • UI Parameter Exposure: Architect the logic to expose voice attributes (speed, pitch, tone, style) to the product UI, allowing customers to customize their agent’s voice profile.
  • Cross-Functional R&D: Partner with ASR and Audio AI engineers to ensure end-to-end voice quality and minimize latency in the ASR → LLM → TTS pipeline.

Skills You’ll Bring

  • Technical Foundation: Strong Python programming skills and experience with deep learning frameworks (e.g., PyTorch).
  • Speech Expertise: 3+ years of experience in Speech Synthesis (TTS) or Voice Design, including hands-on work with frameworks like NVIDIA NeMo, ESPnet, or Coqui, and hands-on experience with major TTS APIs such as ElevenLabs, Rime, and Cartesia.
  • Linguistic Background: Degree in Computational Linguistics, Computer Science, or AI/ML with a deep understanding of phonetics, prosody, and syntax.
  • Prompt Engineering: Proven experience crafting and evaluating LLM prompts (system, few-shot) and managing structured prompt templates.
  • Backend Engineering: Experience building production-grade APIs and integrating multi-vendor services in a cloud environment (GCP preferred).
  • Evaluation Mindset: Knowledge of speech quality metrics (MOS, intelligibility, latency) and the ability to design rigorous A/B tests for voice personas.

For exceptional talent based in Ontario, Canada, the target base salary range for this position is posted below. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the target range for new hire salaries for the position. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process. Please note that the compensation details listed in Ontario role postings reflect the base salary only and do not include bonus, equity, or benefits.

Ontario Pay Transparency Range
$145,000 - $172,500 CAD

Why Join Dialpad

  • Work at the center of the AI transformation in business communications
  • Build and ship agentic AI products that are redefining how companies operate
  • Join a team where AI amplifies every employee’s impact
  • Competitive salary, comprehensive benefits, and real opportunities for growth

We believe in investing in our people. Dialpad offers competitive benefits and perks, cutting-edge AI tools, and a robust training program that help you reach your full potential. We have designed our offices to be inclusive, offering a vibrant environment to cultivate collaboration and connection. Our exceptional culture, repeatedly recognized as a Great Place to Work, ensures that every employee feels valued and empowered to contribute to our collective success.

Don’t meet every single requirement? If you’re excited about this role and possess the fundamental traits, drive, and strong ambition we seek, but your experience doesn’t meet every qualification, we encourage you to apply. 

Dialpad is an equal-opportunity employer. We are dedicated to creating a community of inclusion and an environment free from discrimination or harassment.

Skills Required

  • 3+ years of experience in Speech Synthesis (TTS) or Voice Design, including hands-on work with frameworks like NVIDIA NeMo, ESPnet, or Coqui
  • Hands-on experience with major TTS APIs such as ElevenLabs, Rime, and Cartesia
  • Strong Python programming skills and experience with deep learning frameworks (e.g., PyTorch)
  • Degree in Computational Linguistics, Computer Science, or AI/ML with deep understanding of phonetics, prosody, and syntax
  • Proven experience crafting and evaluating LLM prompts (system, few-shot) and managing structured prompt templates
  • Experience building production-grade APIs and integrating multi-vendor services in a cloud environment
  • Experience with GCP (preferred)
  • Knowledge of speech quality metrics (MOS, intelligibility, latency) and ability to design rigorous A/B tests for voice personas
  • Experience applying phonetics and sociolinguistics for pronunciation handling and SSML orchestration

Dialpad Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Dialpad and has not been reviewed or approved by Dialpad.

  • Fair & Transparent Compensation: Compensation is viewed as competitive across many roles, combining salary, bonuses, equity, and benefits into a well-rounded package. Overall satisfaction with pay and total compensation is characterized as positive.
  • Leave & Time Off Breadth: Paid time off is described as generous, with an unlimited PTO policy highlighted as a standout element. This breadth of time off is positioned as a central strength of the benefits package.
  • Healthcare Strength: Healthcare coverage is characterized as comprehensive, spanning medical, dental, vision, disability, life insurance, and mental health benefits. Such coverage depth is presented as a core strength of the overall package.

The Company
HQ: San Francisco, CA
841 Employees
Year Founded: 2011

What We Do

Dialpad is a cloud-based business phone system that turns conversations into opportunities and helps global teams make smarter calls, anywhere, anytime. We bring simplicity to the professional phone experience, and some of the world’s most innovative companies use our platform. Dialpad's products span video meetings, cloud call centers, sales coaching, dialers, and enterprise phone systems, all infused with the latest AI technologies to help every business make smarter calls. Customers include WeWork, Uber, Motorola Solutions, Domo, and Xero. Investors include Amasia, Andreessen Horowitz, Felicis Ventures, GV, ICONIQ Capital, Salesforce Ventures, Scale Venture Partners, Section 32, SoftBank, and Work-Bench.
