Research, ML

Reposted 23 Hours Ago
San Francisco, CA, USA
In-Office
150K-300K Annually
Mid level
Artificial Intelligence • Software
Building embeddings-based search infrastructure
The Role
The ML Research Engineer will develop and train embedding models for a search engine, focusing on novel transformer architectures and dataset creation.
Summary Generated by Built In

Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop super high performant vector databases in Rust to search over it. We also own a $5M H200 GPU cluster that regularly lights up tens of thousands of machines.

On the ML team, we train foundational models for search. Our goal is to build systems that can instantly filter the world's knowledge to exactly what you want, no matter how complex your query. Basically, put the web into an extremely powerful database.

We're looking for an ML Research Engineer to train embedding models for perfect search over the web. The role involves dreaming up novel transformer-based search architectures, creating datasets, creating evals, beating our internal SoTA, and repeat.

Desired Experience

  • You have graduate-level ML experience (or are an exceptionally strong undergrad)

  • You can code up a transformer from scratch in PyTorch

  • You like creating large-scale datasets and diving deeply into the data

  • You care about the problem of finding high quality knowledge and recognize how important this is for the world

Example Projects

  • Pre-training: Train a hundred billion parameter model

  • Fine-tuning: Build an RLAIF pipeline for search

  • Dream up a novel architecture for search in the shower, then code it up and beat our best model's top score

  • Build an eval system that answers how do we know we're advancing our search quality? (this is an incredibly difficult question to answer)

This is an in-person opportunity in San Francisco. We're happy to sponsor international candidates (e.g., STEM OPT, OPT, H1B, O1, E3). In addition to premium healthcare benefits (medical, dental, vision), we also offer fertility benefits and a monthly wellness stipend to all of our employees.

Top Skills

PyTorch
Rust
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
San Francisco, , California
86 Employees
Year Founded: 2021

What We Do

Exa was built with a simple goal — to organize all knowledge. After several years of heads-down research, we developed novel representation learning techniques and crawling infrastructure so that LLMs can intelligently find relevant information.

Similar Jobs

General Motors Logo General Motors

Machine Learning Engineer

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Hybrid
Mountain View, CA, USA
165000 Employees

Deepgram Logo Deepgram

Research Engineer, Machine Learning Systems

Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
In-Office or Remote
3 Locations
150 Employees
150K-250K Annually

General Motors Logo General Motors

Machine Learning Engineer

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Hybrid
Sunnyvale, CA, USA
165000 Employees
219K-335K Annually

Cognitiv Logo Cognitiv

Director, ML Research Science

Artificial Intelligence • Marketing Tech
Easy Apply
Hybrid
San Mateo, CA, USA
150 Employees
250K-330K Annually

Similar Companies Hiring

Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account