Data Engineer, Web Scraping

Reposted 5 Days Ago
Washington, DC, USA
In-Office
105K-125K Annually
Mid level
Artificial Intelligence • Analytics • Consulting • Cybersecurity
Research Services
The Role
Design and optimize data pipelines for web scraping and processing data. Collaborate with ML engineers and develop APIs while preparing data for analysis.
Summary Generated by Built In

About 10a Labs: 10a Labs is the safety and threat-intelligence layer trusted by frontier AI labs, AI unicorns, Fortune 10 companies, and leading global technology platforms. Our adversarial red teaming, model evaluations, and intelligence collection enable engineering, safety, and security teams to stay ahead of evolving threats and deploy AI systems safely.

In this role, you will:

  • Design, implement, and optimize end-to-end data pipelines for scraping and processing structured and unstructured data using Google Cloud Platform (or similar) and best practices; 
  • Conduct ad hoc web scraping and data collection to support research and intelligence initiatives;
  • Prepare data for further analysis, including data cleaning, transformation, anonymization, and masking;
  • Contribute to the development of internal and external APIs, following best practices; 
  • Collaborate with ML engineers, other data engineers, and software developers to deliver actionable insights and functional tools, including internal and external dashboards, APIs, and data dumps; and 
  • Drive other critical initiatives. 

Requirements:

  • Degree (or equivalent work experience) in Computer Science, Engineering, Information Science, Data Science or a related field (graduate degree preferred)
  • 2+ years of professional experience in data engineering or a closely related field
  • Ability to communicate complex technical ideas clearly to non-technical audiences
  • Proficiency in Python, SQL
  • Experience with web scraping/crawling (e.g., Beautiful Soup, Selenium, Scrapy)
  • Experience with Google Cloud Platform (or similar), including storage and database services (e.g., Cloud Storage, CloudSQL, Cloud Spanner) and workflow orchestration (e.g., Cloud Composer/Airflow, Cloud Run, Pub/Sub)
  • Experience building and managing data pipelines, especially for text data
  • Comfort working in fast-moving, high-impact environments, such as startups, AI research labs, or security-focused teams

Compensation & Benefits:

  • Salary Range: $105K–$125K, depending on experience and location
  • Bonus: Performance-based annual bonus
  • Professional Development: Support for conferences, continuing education, or leadership training
  • Work Environment: Fully remote, U.S.-based
  • Health Benefits: Comprehensive health, dental, and vision coverage
  • Time Off: Generous PTO and paid holiday schedule

Retirement: 401(k) plan

Skills Required

  • Degree in Computer Science, Engineering, Information Science, Data Science or equivalent experience
  • 2+ years of professional experience in data engineering or a closely related field
  • Proficiency in Python and SQL
  • Experience with web scraping and crawling (Beautiful Soup, Selenium, Scrapy)
  • Experience with Google Cloud Platform, including storage and database services
  • Experience building and managing data pipelines for text data

10a Labs Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about 10a Labs and has not been reviewed or approved by 10a Labs.

  • Healthcare Strength Benefits include comprehensive medical, dental, and vision coverage for full-time roles, listed across multiple postings. Coverage is presented as a core part of the package rather than a role-specific perk.
  • Leave & Time Off Breadth Positions frequently advertise generous PTO and paid holidays, with some roles noting unlimited PTO and flexible hours. This indicates substantial time-off provisions alongside remote work arrangements.
  • Strong & Reliable Incentives Compensation commonly includes performance-based annual bonuses and occasional spot bonuses. These incentives are presented as standard components for many roles.

10a Labs Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
28 Employees

What We Do

10a Labs is an applied research and technology company specializing in AI security. We deliver intelligence collection, investigative research, and analysis for AI unicorns, Fortune 10 companies, and U.S. tech leaders.

Similar Jobs

Easy Apply
Remote or Hybrid
2 Locations
180 Employees
110K-160K Annually

Eve Logo Eve

Software Engineer

Legal Tech • Software • Generative AI
Easy Apply
Remote or Hybrid
United States
180 Employees
250K-300K Annually

Cox Enterprises Logo Cox Enterprises

Dealer.com Performance Manager

Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Remote or Hybrid
DC, USA
50000 Employees
75K-113K Annually

Cox Enterprises Logo Cox Enterprises

Senior Machine Learning Engineer

Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Remote or Hybrid
United States
50000 Employees
112K-186K Annually

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account