Machine Learning Enginer, Core Evaluations

Reposted 3 Days Ago
Be an Early Applicant
27 Locations
Remote
Mid level
Artificial Intelligence • Software
The Role
The role involves designing and developing model evaluation pipelines, user studies for subjective evaluations, and automated dashboards to report results, alongside leading the evaluation team and communicating with other teams to improve model performance.
Summary Generated by Built In

About Cantina:

Cantina Labs is a social AI company, developing a suite of advanced real-time models that push the boundaries of expression, personality, and realism. We bring characters to life, transforming how people tell stories, connect, and create. We build and power ecosystems. Cantina, our flagship social AI platform, is just the beginning.

If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future!

About the Role:

We are seeking an experienced Machine Learning Engineer (MLE) to focus on audio model evaluation, specifically for speech generation and recognition models.

This role involves designing and developing comprehensive model evaluation pipelines for both development and production environments, as well as creating automated dashboards for reporting evaluation results.

As the founding member of our evaluation team, the ideal candidate is expected to leverage their experience to lead our evaluation efforts and play a key role in the future growth of the evaluation team.

What You’ll Do:

  • Designing model evaluation pipelines for models in development and production

  • Designing user studies for subjective model evaluations.

  • Converting requirements into measurable metrics.

  • Designing and developing automated evaluation dashboard to see model performances and compare results.

  • Training new models to capture new and different evaluation metrics.

  • Communicating with the model team to help design better models based on the evaluation results.

  • Communicating with the data team to help decide the type of data necessary to improve model performance.

  • Communication with the product-manager to make sure product requirements are correctly measured.

  • Help grow the evaluation team as the founding member.

  • Lead the evaluation team in the future.

What You’ll Bring:

  • Strong experience and intuition for designing metrics that capture model performance.

  • Strong experience with designing user studies on Mechanical Turk or similar platforms. .

  • Strong experience with model training and fine-tuning for model evaluation.

  • Strong statistical knowledge and experience to statistically compare evaluation results and take decisions.

  • Very strong engineering and programming skills.

  • Experience with training ASR, TTS models.

  • Experience at ML teams working on large-scale machine learning problems. (>3B models with >1m hours of data)

Skills Required

  • Strong experience designing metrics for model performance
  • Experience designing user studies on platforms like Mechanical Turk
  • Experience with model training and fine-tuning
  • Strong statistical knowledge for comparing evaluation results
  • Strong engineering and programming skills
  • Experience with training ASR and TTS models
  • Experience in ML teams on large-scale machine learning problems
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, California
364 Employees
Year Founded: 2023

What We Do

Cantina Labs, founded by Sean Parker, is a new social platform with the most advanced AI character creator. Build, share, and interact with AI bots and your friends directly in the Cantina or across the internet. Cantina bots are lifelike, social creatures, capable of interacting wherever humans go on the internet. Recreate yourself using powerful AI, imagine someone new, or choose from thousands of existing characters. Bots are a new media type that offer a way for creators to share infinitely scalable and personalized content experiences combined with seamless group chat across voice, video, and text.

Similar Jobs

Zapier Logo Zapier

Systems Engineer

Artificial Intelligence • Productivity • Software • Automation
Remote
27 Locations
800 Employees

Zapier Logo Zapier

Staff Engineer

Artificial Intelligence • Productivity • Software • Automation
Remote
32 Locations
800 Employees
211K-316K Annually

SEON Logo SEON

Senior Site Reliability Engineer

Artificial Intelligence • Cybersecurity
In-Office or Remote
28 Locations
415 Employees

SEON Logo SEON

Senior Software Engineer

Artificial Intelligence • Cybersecurity
Remote
27 Locations
415 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account