Machine Learning Researcher, Multimodal LLMs

Reposted 7 Days Ago
Be an Early Applicant
San Francisco, CA, USA
In-Office
140K-250K Annually
Mid level
Artificial Intelligence • Information Technology
The Role
The role involves developing multimodal LLMs for conversational AI, focusing on real-time interaction combining speech, text, and tools. Responsibilities include taking ideas from research to production.
Summary Generated by Built In
Machine Learning Researcher, Multimodal LLMs

Location: San Francisco, CA or Remotes

About Bland

At Bland.com, our mission is to empower enterprises to build AI phone agents at scale. Voice is quickly becoming the primary interface between businesses and their customers, and we are building the models and infrastructure that make those interactions feel natural, reliable, and genuinely human.

We’ve raised $65M from leading investors including Emergence Capital, Scale Venture Partners, Y Combinator, and founders of Twilio, Affirm, and ElevenLabs.

The Role

We are looking for someone to contribute to the development of our next-generation multimodal LLM stack, combining speech, text, tools, and real-time reasoning into a single unified system. You’ll be responsible for building industry-leading conversational AI models that power Bland's agent, and taking them all the way from idea to production.

At Bland, we're not just thinking about text modeling. You will define how our agents listen, think, and act in real time, integrating streaming audio, tool execution, and dynamic context into a single coherent system. You will take ideas from research through production systems serving millions of calls per day.

What Makes You a Great Fit

Strong LLM / Multimodal Background

  • Experience with LLMs, multimodal models, or speech-language systems

  • Deep understanding of prompting, fine-tuning, and alignment techniques

  • Familiarity with neural audio codecs and modern multimodal LLM techniques

Fast Experimental Loop

  • You can go from idea → dataset → experiment → conclusion in days

  • You know how to design experiments that actually answer the question

Product Intuition

  • Strong sense for what makes an interaction feel natural vs robotic

  • Ability to translate abstract modeling ideas into user-facing improvements

Builder Mentality

  • You take ownership from research through deployment

  • You thrive in ambiguous, fast-moving environments

  • You care about impact, not just elegance

How You Show Up
  • You think in systems, not just models

  • You obsess over latency, correctness, and real-world behavior

  • You are comfortable discarding ideas quickly when data disagrees

  • You push toward simple abstractions for complex problems

Bonus Points
  • Experience with real-time voice systems or conversational AI

  • Background in tool-using agents or agent frameworks

  • Experience with multimodal datasets (audio + text + actions)

  • Contributions to LLM or speech-related research or open source

Compensation & Benefits
  • Competitive salary: $180,000 – $260,000

  • Meaningful equity

  • Full healthcare, dental, vision

  • Office in Jackson Square, SF

  • High autonomy, high impact

Skills Required

  • Experience with LLMs
  • Experience with multimodal models or speech-language systems
  • Deep understanding of prompting, fine-tuning, and alignment techniques
  • Familiarity with neural audio codecs
  • Experience with real-time voice systems or conversational AI
  • Background in tool-using agents or agent frameworks
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
27 Employees
Year Founded: 2023

What We Do

The enterprise first solution for AI phone calls

Similar Jobs

ServiceNow Logo ServiceNow

Software Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Hybrid
Mountain View, CA, USA
29000 Employees
126K-195K Annually

ServiceNow Logo ServiceNow

Sales Executive

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
San Francisco, CA, USA
29000 Employees

ServiceNow Logo ServiceNow

Senior Software Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Hybrid
Mountain View, CA, USA
29000 Employees
143K-243K Annually

ServiceNow Logo ServiceNow

Software Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Hybrid
Mountain View, CA, USA
29000 Employees
126K-195K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account