Data Engineer, Web Scraping

Posted 4 Days Ago
San Francisco, CA, USA
In-Office
105K-125K Annually
Mid level
Artificial Intelligence • Analytics • Consulting • Cybersecurity
Research Services
The Role
Design and optimize data pipelines for web scraping and data processing. Collaborate with ML engineers, contribute to API development, and prepare data for analysis.

About 10a Labs: 10a Labs is the safety and threat-intelligence layer trusted by frontier AI labs, AI unicorns, Fortune 10 companies, and leading global technology platforms. Our adversarial red teaming, model evaluations, and intelligence collection enable engineering, safety, and security teams to stay ahead of evolving threats and deploy AI systems safely.

 


In this role, you will:

  • Design, implement, and optimize end-to-end data pipelines for scraping and processing structured and unstructured data on Google Cloud Platform (or a similar platform), following best practices;
  • Conduct ad hoc web scraping and data collection to support research and intelligence initiatives;
  • Prepare data for further analysis, including data cleaning, transformation, anonymization, and masking;
  • Contribute to the development of internal and external APIs, following best practices; 
  • Collaborate with ML engineers, other data engineers, and software developers to deliver actionable insights and functional tools, including internal and external dashboards, APIs, and data dumps; and 
  • Drive other critical initiatives. 

Requirements:

  • Degree (or equivalent work experience) in Computer Science, Engineering, Information Science, Data Science, or a related field (graduate degree preferred)
  • 2+ years of professional experience in data engineering or a closely related field
  • Ability to communicate complex technical ideas clearly to non-technical audiences
  • Proficiency in Python and SQL
  • Experience with web scraping/crawling (e.g., Beautiful Soup, Selenium, Scrapy)
  • Experience with Google Cloud Platform (or similar), including storage and database services (e.g., Cloud Storage, Cloud SQL, Cloud Spanner) and workflow orchestration (e.g., Cloud Composer/Airflow, Cloud Run, Pub/Sub)
  • Experience building and managing data pipelines, especially for text data
  • Comfort working in fast-moving, high-impact environments, such as startups, AI research labs, or security-focused teams

Compensation & Benefits:

  • Salary Range: $105K–$125K, depending on experience and location
  • Bonus: Performance-based annual bonus
  • Professional Development: Support for conferences, continuing education, or leadership training
  • Work Environment: Fully remote, U.S.-based
  • Health Benefits: Comprehensive health, dental, and vision coverage
  • Time Off: Generous PTO and paid holiday schedule
  • Retirement: 401(k) plan

Top Skills

Beautiful Soup
Google Cloud Platform
Python
Scrapy
Selenium
SQL

The Company
28 Employees

What We Do

10a Labs is an applied research and technology company specializing in AI security. We deliver intelligence collection, investigative research, and analysis for AI unicorns, Fortune 10 companies, and U.S. tech leaders.
