Distro

Data Scraper

Hiring Remotely in Nairobi, KEN

In-Office or Remote

Mid level

Artificial Intelligence • Sales • Software • Automation

The Role

Research and identify public and government data sources; extract, transform, and normalize data from websites, APIs, feeds, FTP, and repositories; design reusable, scalable ETL workflows; apply advanced web scraping techniques using Python, HTTP requests, and HTML parsing; validate and document data and methodologies; maintain code and version control with Git while communicating with stakeholders.

Summary Generated by Built In

We are looking for a Data Scraping Specialist to collect, organize, and normalize data from public and government sources into reliable formats.

🕒 Schedule

Monday to Friday: 12:00 PM – 8:00 PM CST

🎯 What will be your main challenges? (Responsibilities)

Research and identify public and government data sources.
Extract, transform, and normalize data from websites, APIs, feeds, FTP sources, and online repositories.
Design and build reusable, scalable, and maintainable ETL processes and workflows (no one-off scripts!).
Apply advanced web scraping techniques using Python, HTTP requests, and HTML parsing.
Ensure quality: Identify inconsistencies, validate data samples, and document methodologies and processes.
Collaborate and use version control: Maintain repositories in Git following development best practices, and communicate clearly with stakeholders.

🛠️ What are we looking for? (Requirements)

Solid experience in web scraping, data scraping, and structured/unstructured data extraction.
Technical proficiency: Hands-on experience programming in Python (or similar languages), knowledge of APIs, HTTP, FTP, HTML parsing, and relational databases like PostgreSQL.
Language: Advanced English level (fluent written and technical communication).
Analytical mindset: Ability to solve complex data acquisition problems, optimize solutions, and work independently while taking on technical challenges.
Quality focus: Strong emphasis on data validation, normalization, and documentation.

📩 Ready for the challenge?

If you fit the profile and want to take your automation skills to the next level, apply now!

Skills Required

Hands-on experience in web scraping and data scraping from public and government sources
Proficient programming in Python (or similar languages) for scraping and ETL development
Knowledge of APIs, HTTP, FTP, and HTML parsing techniques
Experience designing and building reusable, scalable, maintainable ETL processes and workflows
Familiarity with relational databases such as PostgreSQL
Strong data validation, normalization, and documentation practices
Experience using Git and following development best practices/version control
Advanced English level for fluent written and technical communication

The Company

11 Employees

Year Founded: 2022

What We Do

Distro is an AI-powered platform designed to enhance the efficiency and productivity of distributor sales teams. By automating manual recruiting and sales tasks, the company helps teams move faster, reduce costs, and improve hiring outcomes. Its technology focuses on optimizing counter and inside sales operations, providing tools that assist recruitment and sales processes while maintaining human oversight in final decision-making.