Research, ML

Reposted 12 Days Ago
San Francisco, CA, USA
In-Office
180K-350K Annually
Mid level
Artificial Intelligence • Software
Building embeddings-based search infrastructure
The Role
The ML Research Engineer will develop and train embedding models for a search engine, focusing on novel transformer architectures and dataset creation.
Summary Generated by Built In

We raised a $250M Series C to build the search engine for AIs. Led by a16z, with existing investors Benchmark, Lightspeed, and YC doubling down, the round brings Exa's valuation to $2.2 billion. Read more

Exa is building a search engine from scratch to serve every AI agent. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to process it, and design super high performant vector databases in rust to search over it. If you like compute, we also own a $5M H200 GPU cluster (and soon 5x'ing that) and regularly spin up batchjobs with tens of thousands of machines.

We are rapidly building the most intelligent search engine in history. We’re high agency, low-ego, and united by the feeling that this is one of the last problems worth getting right.

On the ML team, we train foundational models for search. Our goal is to build systems that can instantly filter the world's knowledge to exactly what you want, no matter how complex your query. Basically, put the web into an extremely powerful database.

We're looking for an ML Research Engineer to train embedding models for perfect search over the web. The role involves dreaming up novel transformer-based search architectures, creating datasets, creating evals, beating our internal SoTA, and repeat.

Desired Experience

  • You have graduate-level ML experience (or are an exceptionally strong undergrad)

  • You can code up a transformer from scratch in PyTorch

  • You like creating large-scale datasets and diving deeply into the data

  • You care about the problem of finding high quality knowledge and recognize how important this is for the world

Example Projects

  • Pre-training: Train a hundred billion parameter model

  • Fine-tuning: Build an RLAIF pipeline for search

  • Dream up a novel architecture for search in the shower, then code it up and beat our best model's top score

  • Build an eval system that answers how do we know we're advancing our search quality? (this is an incredibly difficult question to answer)

This is an in-person opportunity in San Francisco. We're happy to sponsor international candidates (e.g., STEM OPT, OPT, H1B, O1, E3). In addition to premium healthcare benefits (medical, dental, vision), we also offer fertility benefits and a monthly wellness stipend to all of our employees.

Skills Required

  • Graduate-level Machine Learning experience or strong undergraduate
  • Ability to code a transformer model from scratch in PyTorch
  • Experience creating large-scale datasets
  • Passion for finding high quality knowledge
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
San Francisco, , California
86 Employees
Year Founded: 2021

What We Do

Exa was built with a simple goal — to organize all knowledge. After several years of heads-down research, we developed novel representation learning techniques and crawling infrastructure so that LLMs can intelligently find relevant information.

Similar Jobs

Lawrence Livermore National Laboratory Logo Lawrence Livermore National Laboratory

Machine Learning Research Engineer

Information Technology • Security • Energy • Defense
Hybrid
Livermore, CA, USA
9757 Employees
146K-223K Annually

Together AI Logo Together AI

Technical Recruiter

Artificial Intelligence • Information Technology
In-Office
San Francisco, CA, USA
84 Employees
165K-210K Annually

The Walt Disney Company Logo The Walt Disney Company

Lead Machine Learning Engineer

Digital Media • Gaming • News + Entertainment • Sports
In-Office
3 Locations
219548 Employees
172K-241K Annually

Deccan AI Logo Deccan AI

Founding Engineer - ML Research

Artificial Intelligence • Big Data • Information Technology • Software
In-Office
Mountain View, CA, USA
430 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account