Software Engineer – Web Crawling

Reposted 15 Days Ago
Hiring Remotely in USA
Remote
Mid level
Machine Learning • Marketing Tech • Software
The Role
Develop, enhance, and maintain web crawlers, optimize scraping techniques, collaborate with teams, and integrate AI solutions for data extraction accuracy.

Woflow is a technology startup creating products and solutions to support a high-growth, on-demand economy. Our flagship product is an end-to-end platform that allows our customers to request and receive merchant data (think structured menu data, images, store information, etc.) through a combination of web applications and public APIs. Behind the scenes, that data is created through a series of ML/AI models and workflow products, along with a fully automated, distributed workforce management platform. We are the world’s first Merchant Data Platform.

Our customers include food delivery companies, online ordering platforms, and ecommerce marketplaces. We provide the data infrastructure to help these companies scale and grow.

About the Role

We are looking for a Software Engineer with at least 3 years of experience to join our Web Crawling team. This team plays a critical role in our infrastructure, ensuring high-quality data collection at scale. The right candidate will work closely with both the Web Crawling and Application teams but will primarily focus on writing and optimizing web crawlers.

Our ideal candidate is someone who enjoys solving complex web scraping challenges, has strong reverse engineering skills, and thrives in a fast-paced, high-growth environment. This role is a great opportunity for an engineer looking to make a significant impact at a company where data is the product.

What You’ll Do
  • Develop, enhance, and maintain web crawlers and scraping infrastructure.

  • Optimize scraping techniques to handle anti-bot mechanisms and to address performance and security challenges.

  • Collaborate with a geographically distributed team to identify and resolve issues.

  • Ensure high availability, efficiency, and reliability of crawling operations.

  • Integrate AI solutions to enhance automation and data extraction accuracy.

What We’re Looking For
  • 3+ years of experience in software engineering with a focus on web crawling and data extraction.

  • Strong expertise in Node.js (preferred) for web crawling applications.

  • Deep understanding of HTML, JavaScript, and reverse engineering techniques.

  • Hands-on experience with Playwright, Puppeteer, and Cheerio for automation and scraping.

  • Knowledge of security and performance best practices related to web crawling.

Nice to Have
  • Experience with Apify or Crawlee for large-scale crawling solutions.

  • Proficiency in TypeScript.

Benefits

  • Unlimited PTO (we mean it!)

  • Comprehensive medical, dental, and vision insurance plans

  • STD, LTD, AD&D, and life insurance coverage

  • Free membership to TalkSpace, Teladoc and Health Advocate

  • Free annual membership to One Medical in participating regions

  • 401(k) retirement plan with company matching

  • Pre-tax commuter benefits

  • Free equipment: laptop and home office stipend

*Please note: the only valid application forms are those at woflow.com and jobs.ashbyhq.com/woflow.

Top Skills

Cheerio
HTML
JavaScript
Node.js
Playwright
Puppeteer
TypeScript

The Company
HQ: San Francisco, CA
204 Employees
Year Founded: 2017

What We Do

Woflow automates merchant data onboarding for the world's fastest-growing platforms and marketplaces. Food ordering apps, POS systems, and eCommerce companies trust our data infrastructure to manage their merchant data at scale. Woflow's technology is powered by the Woflow Engine, a sophisticated ML-powered task automation system that structures data from anywhere. Woflow has raised over $10M to date and is backed by top investors including Base 10 Partners, Craft Ventures, and Construct Capital.
