The Role
Research and identify public and government data sources; extract, transform, and normalize data from websites, APIs, feeds, FTP, and repositories; design reusable, scalable ETL workflows; apply advanced web scraping techniques using Python, HTTP requests, and HTML parsing; validate and document data and methodologies; maintain code and version control with Git while communicating with stakeholders.
Summary Generated by Built In
We are looking for a Data Scraping Specialist to collect, organize, and normalize data from public and government sources into reliable formats.
🕒 Schedule
Monday to Friday: 12:00 PM – 8:00 PM CST 🎯 What will be your main challenges? (Responsibilities)
Research and identify public and government data sources. Extract, transform, and normalize data from websites, APIs, feeds, FTP sources, and online repositories. Design and build reusable, scalable, and maintainable ETL processes and workflows (no one-off scripts!). Apply advanced web scraping techniques using Python, HTTP requests, and HTML parsing. Ensure quality: Identify inconsistencies, validate data samples, and document methodologies and processes. Collaborate and version control: Maintain repositories using Git under development best practices and maintain clear communication with stakeholders.
🛠️ What are we looking for? (Requirements)
Solid experience in web scraping, data scraping, and structured/unstructured data extraction. Technical proficiency: Hands-on experience programming in Python (or similar languages), knowledge of APIs, HTTP, FTP, HTML parsing, and relational databases like PostgreSQL. Language: Advanced English level (fluent written and technical communication). Analytical mindset: Ability to solve complex data acquisition problems, optimize solutions, and work independently while taking on technical challenges. Quality focus: Strong emphasis on data validation, normalization, and documentation.
📩 Ready for the challenge?
If you fit the profile and want to take your automation skills to the next level, apply now!
#SolvoGlobal
#LI-PROMOTED
#LI-Remote
Skills Required
- Hands-on experience in web scraping and data scraping from public and government sources
- Proficient programming in Python (or similar languages) for scraping and ETL development
- Knowledge of APIs, HTTP, FTP, and HTML parsing techniques
- Experience designing and building reusable, scalable, maintainable ETL processes and workflows
- Familiarity with relational databases such as PostgreSQL
- Strong data validation, normalization, and documentation practices
- Experience using Git and following development best practices/version control
- Advanced English level for fluent written and technical communication
Am I A Good Fit?
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.
Success! Refresh the page to see how your skills align with this role.
The Company
What We Do
Distro is an AI-powered platform designed to enhance the efficiency and productivity of distributor sales teams. By automating manual recruiting and sales tasks, the company helps teams move faster, reduce costs, and improve hiring outcomes. Their technology focuses on optimizing counter and inside sales operations, providing tools that assist recruitment and sales processes while maintaining human oversight in final decision-making.








