Data Scraping 1743

Posted 2 Days Ago
Hiring Remotely in COL
Remote
Mid level
Artificial Intelligence • Big Data • Information Technology • Software
The Role
Remote role to research, extract, normalize, and deliver structured data from public and government sources. Build reusable scraping workflows, document sources and methods, troubleshoot acquisition issues, version-control code, and collaborate with stakeholders to provide accurate, repeatable outputs.
Summary Generated by Built In

This is a remote position.

We are seeking a data scraping specialist to help collect, organize, and normalize data from public and government sources into a consistent, structured format. This role focuses on solving complex data acquisition challenges, researching unfamiliar sources, extracting information from websites and feeds, and transforming it into predefined formats that downstream systems can consume. The ideal candidate enjoys working with messy datasets, investigating how websites and data sources are structured, and creating reusable solutions that produce consistent results on every run. This position requires strong problem-solving skills, attention to detail, and the ability to work independently while documenting findings and processes clearly.
Schedule: Monday to Friday - 12:00 PM – 8:00 PM CST
Responsibilities:
  • Research and identify public and government data sources.
  • Extract and normalize data from websites, APIs, feeds, and online repositories.
  • Build reusable, maintainable, and re-runnable scripts and scraping workflows.
  • Deliver structured outputs in predefined formats.
  • Provide sample outputs for review before processing larger datasets.
  • Document data sources, extraction methodologies, challenges encountered, and re-run procedures.
  • Capture and report any relevant information discovered during extraction, including inconsistencies, amendments, effective dates, repeal notes, or related metadata.
  • Troubleshoot data acquisition issues and propose alternative approaches when needed.
  • Collaborate with stakeholders through regular check-ins and written communication.
  • Maintain version-controlled code repositories and follow standard development practices.

Requirements:
  • Strong experience with web scraping and data extraction.
  • Practical programming experience using Python or similar scripting languages.
  • Experience working with HTML parsing, APIs, HTTP requests, FTP sources, and structured or unstructured data.
  • Ability to evaluate, debug, and improve scraping solutions.
  • Strong analytical and problem-solving skills.
  • Experience building reusable automation workflows rather than one-off scripts.
  • Familiarity with relational databases (PostgreSQL preferred) and a normal Git workflow.
  • Strong documentation and communication skills.
  • Ability to work independently and take ownership of technical challenges.
  • High attention to detail and commitment to data accuracy.

Nice to Have:
  • Experience working with government, regulatory, compliance, or public-sector datasets.
  • Experience with Playwright, Selenium, Puppeteer, Scrapy, or similar scraping frameworks.
  • Experience with data versioning, change detection, or document lineage.
  • Familiarity with AI-assisted development tools and workflows.

The Company
139 Employees

What We Do

The company provides services in automation and process optimization, UX innovation, software design and testing, big data, artificial intelligence, and machine learning. It also develops computer systems and offers IT consulting.
