Conversational Modelling Research Engineer

Reposted 9 Days Ago
Be an Early Applicant
Hiring Remotely in USA
Remote
Expert/Leader
Artificial Intelligence • Software
The Role
Conduct research on Large Multimodal Models for Conversational Avatars, developing real-time modeling methods for verbal and non-verbal conversations, and partnering with Applied ML to transition prototypes into production.
Summary Generated by Built In

About Us

Tavus is a research lab pioneering human computing. We’re building AI Humans: a new interface that closes the gap between people and machines, free from the friction of today’s systems. Our real-time human simulation models let machines see, hear, respond, and even look real—enabling meaningful, face-to-face conversations. AI Humans combine the emotional intelligence of humans with the reach and reliability of machines, making them capable, trusted agents available 24/7, in every language, on our terms.

Imagine a friend who can discuss any topic with you. A personal trainer that adapts to your schedule. A fleet of medical assistants that can give every patient the attention they need. With Tavus, individuals, enterprises, and developers can all build AI Humans to connect, understand, and act with empathy at scale.

We’re a Series B company backed by world-class investors including Sequoia Capital, Y Combinator, and Scale Venture Partners.

Be part of shaping a future where humans and machines truly understand each other.

The Role

We’re looking for an AI Researcher to join our core AI team and push the boundaries of Foundation Multimodal Conversational Models. If you thrive in fast-moving startup environments, enjoy experimenting with new ideas, and love seeing your work come to life in production then you’ll feel right at home.

Your Mission 🚀

  • Conduct research on Large Multimodal Models in the context of Conversational Avatars (e.g. Neural Avatars, Talking-Heads).

  • Develop methods to model both verbal and non-verbal aspects of conversation, adapting and controlling avatar behavior in real time, with low-latency.

  • Experiment with fine-tuning, adaptation, and conditioning techniques to make AudioVisual Multimodal Models, more expressive, controllable, and task-specific.

  • Partner with the Applied ML team to take research from prototype to production.

  • Stay up to date with cutting-edge advancements — and help define what comes next.

You’ll Be Great At This If You Have:

  • A PhD (or near completion) in a relevant field, or equivalent research experience.

  • Hands-on experience with Large Multimodal Models and a strong foundation in generative (language) models. This could be in the context of tasks such as VQA, Audio/Video understanding tasks, captioning behavioral analysis, Translation tasks, Speech to Speech systems.

  • Experience in fine-tuning/adapting VLMs for control, conditioning, or downstream tasks.

  • Solid background in deep learning and foundation modes.

  • Strong PyTorch skills and comfort building deep learning pipelines.

Nice-to-Haves

  • Knowledge of large-scale model training and optimization.

  • Experience in duplex-conversational model.

  • Broader understanding of generative AI across modalities.

  • Exposure to software development best practices.

  • A flexible, experimental mindset i.e. comfortable working across research and engineering.

  • (Bonus) Publications at EMNLP, COLING, NeurIPS, ICLR, CVPR, ICCV.

Location


Preferred: San Francisco (hybrid) or London (office opening soon).

Remote within the U.S. or Europe available for exceptional candidates.

Skills Required

  • PhD or near completion in a relevant field or equivalent research experience
  • Hands-on experience with Large Multimodal Models
  • Strong foundation in generative language models
  • Experience in fine-tuning/adapting VLMs for control or downstream tasks
  • Solid background in deep learning and foundation models
  • Strong PyTorch skills and comfort building deep learning pipelines
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
17 Employees
Year Founded: 2020

What We Do

Use your voice for sales, without ever saying a word. Hyper-personalized AI reach-outs to increase your outreach. Request a demo today to drive more meetings, build deeper relationships, and spike conversions.

Similar Jobs

Commerce Logo Commerce

GTM Enablement Lead

Artificial Intelligence • Cloud • Consumer Web • eCommerce • Information Technology • Software
In-Office or Remote
Austin, TX, USA
1200 Employees
123K-185K Annually

SEON Logo SEON

Product Marketing Manager

Artificial Intelligence • Cybersecurity
Remote
US
415 Employees
5-7 Annually

Capital One Logo Capital One

Manager, Associate Relations Investigator (Remote)

Fintech • Machine Learning • Payments • Software • Financial Services
Remote or Hybrid
Richmond, VA, USA
55000 Employees
138K-158K Annually

RethinkFirst Logo RethinkFirst

Senior Database Engineer

Edtech • Healthtech • HR Tech • Information Technology • Professional Services • Software • Telehealth
In-Office or Remote
Chicago, IL, USA
300 Employees

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account