The Role
Build and maintain web scraping scripts to extract and structure product data from eCommerce sites. Clean, validate, and organize large datasets, monitor crawl performance, troubleshoot errors, and collaborate with product and tech teams to ensure accurate, efficient data collection.
Summary Generated by Built In
Job Title: Product Matching
Role Overview
Location: Remote / Bangalore (Hybrid)
Work Type: Consultant
About us:
Rubick.ai is one of the fastest-growing eCommerce enablement platforms. We specialise in Product Discovery, Search, and Market Intelligence for marketplaces, brands, and sellers. We offer an end-to-end full-stack Product Information, Cataloging, and Marketing platform as a solution for eCommerce.
Rubick works with companies like Myntra, Amazon, Ajio, TataCliq, Hudson Bay US, The Luxury Closet, and Myers, across India, US, Singapore, Australia, UAE and other international markets.
Visit us: https://os.rubick.ai/
We are seeking a Product Matching Consultant to extract and structure product data from eCommerce websites and marketplaces. This role involves using web scraping tools, Python scripts, or browser automation to gather high-quality data efficiently.
Key Responsibilities- Develop and maintain scripts to crawl and extract data from websites.
- Clean, validate, and organize large datasets for cataloging and analysis.
- Collaborate with tech and product teams to ensure accurate data collection.
- Monitor crawl performance, troubleshoot scraping errors, and optimize efficiency.
- Stay updated on web crawling tools, libraries, and techniques.
- 1+ year of experience in product matching, data crawling, scraping, or similar roles.
- Hands-on experience with Python, BeautifulSoup, Scrapy, or similar libraries.
- Understanding of HTML, CSS, XPath, and JSON data structures.
- Strong analytical and problem-solving skills.
- Familiarity with proxies, API extraction, or browser automation (Selenium, Playwright) is a plus.
- Experience with large-scale eCommerce product matching.
- Exposure to AI/ML data pipelines or catalog automation systems.
Skills Required
- 1+ year experience in product matching, data crawling, scraping, or similar roles.
- Hands-on experience with Python.
- Experience with BeautifulSoup.
- Experience with Scrapy.
- Understanding of HTML, CSS, XPath, and JSON data structures.
- Strong analytical and problem-solving skills.
- Familiarity with proxies, API extraction, or browser automation (Selenium, Playwright).
- Experience with large-scale eCommerce product matching.
- Exposure to AI/ML data pipelines or catalog automation systems.
Am I A Good Fit?
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.
Success! Refresh the page to see how your skills align with this role.
The Company
What We Do
Rubick.ai is an AI-powered eCommerce enablement platform that specializes in cataloging solutions and digital readiness for marketplaces, brands, and sellers. It provides an AI-powered operating system to automate catalog enrichment, product discovery, and scale growth for e-commerce businesses.






