Software Engineer, Web Crawling

Reposted 10 Days Ago
Hiring Remotely in USA
Remote
150K-300K Annually
Mid level
Artificial Intelligence • Software
Building embeddings-based search infrastructure
The Role
As a Web Crawler Engineer, you will develop a search engine by crawling the web, optimizing systems, and scaling infrastructure for high performance and efficiency.
Summary Generated by Built In

Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop high-performance vector databases in Rust to search over it. We also own a $5M H200 GPU cluster that regularly lights up tens of thousands of machines.

As a Web Crawler Engineer, you'd be responsible for crawling the entire web. Basically, you'd build Google-scale crawling!

Desired Experience
  • You have extensive experience building and scaling web crawlers, or would be excited to ramp up very quickly

  • You have experience with a high-performance language (C++, Rust, etc.)

  • You’re comfortable optimizing a system to an exceptional degree

  • You care about the problem of finding high-quality knowledge and recognize how important this is for the world

Example Projects
  • Build a distributed crawler that can handle 100M+ pages per day

  • Optimize crawl politeness and rate limiting across thousands of domains

  • Design systems to detect and handle dynamic content, JavaScript rendering, and anti-bot measures

  • Create intelligent crawl scheduling and prioritization algorithms for maximum coverage efficiency

This is an in-person opportunity in San Francisco. We're happy to sponsor international candidates (e.g., STEM OPT, OPT, H-1B, O-1, E-3).

Top Skills

C++
Rust

The Company
San Francisco, California
86 Employees
Year Founded: 2021

What We Do

Exa was built with a simple goal — to organize all knowledge. After several years of heads-down research, we developed novel representation learning techniques and crawling infrastructure so that LLMs can intelligently find relevant information.
