Spice Data

Senior Software Engineer

Reposted 11 Days Ago

San Francisco, CA, USA

In-Office

$160K-$220K Annually

Senior level

Big Data • Analytics • Business Intelligence • Hospitality

The Role

Design, maintain, and expand end-to-end data pipelines and collection tooling. Build crawling/automation scripts, scheduling (Dagster), and data cleaning/normalization, and help with DevOps. Mentor teammates through code and architecture review, and work independently in a hybrid role based in the San Francisco office.

Summary Generated by Built In

About the role

Join our small and nimble engineering team! We’re looking for an experienced engineer who enjoys working on a variety of different projects and large(ish) data pipelines (100M+ data points collected every month).

Work in our downtown San Francisco office a few times per week
Provide mentorship to your teammates via code and architecture review
Help improve, maintain, and expand our end-to-end data pipeline:
- Create and manage data collection/crawling scripts that utilize everything from basic HTTP requests to browser and mobile app automation (deep experience in this area is a huge plus)
- Build and maintain automation/scheduling tooling to ensure data is collected on a timely and consistent basis (we’re currently using self-hosted Dagster for scheduling)
- Create data cleaning and normalization scripts (with some opportunity to integrate ML/LLMs for labeling/cleaning)
- Help with miscellaneous DevOps tasks to manage the infrastructure running all of the above
Our codebase is mostly Python, with TypeScript and Golang used when they make sense. We rely almost entirely on self-hosted open-source/open-core services running on bare VMs via Docker Compose.

About you

Proficient (intermediate or higher) in:
- Python (e.g. can write a decorator; understands async/await, MRO, and context managers)
- SQL (e.g. familiar with complex joins, can efficiently bulk update 10k+ rows, and can handle schema migrations)
Basic understanding of Linux and Docker
Excited to work on a variety of different projects, sometimes many at the same time
Able to execute with minimal supervision
Bonus skills: Web Crawling, Kubernetes, LLM Pipelines, DB Administration, HomeLab/Server Administration, Background in Statistics, Full Stack Web Development, Mobile App Development
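
For illustration only, here is a minimal Python sketch of the concepts named in the Python bullet above (a decorator, async/await, MRO, and a context manager). It is not taken from Spice Data's codebase or interview, and every name in it (retry, recorded, flaky_fetch, Base, Left, Right, Both) is hypothetical.

```python
import asyncio
import functools
from contextlib import contextmanager


def retry(times):
    """Decorator factory: await the wrapped coroutine up to `times` attempts."""
    def wrap(func):
        @functools.wraps(func)
        async def inner(*args, **kwargs):
            for attempt in range(1, times + 1):
                try:
                    return await func(*args, **kwargs)
                except ValueError:
                    # Re-raise only once the final attempt has failed.
                    if attempt == times:
                        raise
        return inner
    return wrap


@contextmanager
def recorded(log, label):
    """Context manager: record entry and exit of a block in `log`."""
    log.append(f"start {label}")
    try:
        yield
    finally:
        log.append(f"end {label}")


# Diamond inheritance: the MRO of Both is Both, Left, Right, Base, object,
# so each super().name() call moves to the next class in that order.
class Base:
    def name(self):
        return "base"


class Left(Base):
    def name(self):
        return "left>" + super().name()


class Right(Base):
    def name(self):
        return "right>" + super().name()


class Both(Left, Right):
    pass


calls = {"n": 0}  # hypothetical counter standing in for a flaky remote call


@retry(times=3)
async def flaky_fetch():
    calls["n"] += 1
    if calls["n"] < 3:
        raise ValueError("transient failure")
    return "ok"


async def main(log):
    # The context manager wraps the awaited, retried call.
    with recorded(log, "fetch"):
        return await flaky_fetch()
```

Running asyncio.run(main([])) drives the retried coroutine to completion, and Both.__mro__ shows the resolution order that the super() calls follow.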

Benefits

Lunch provided when in office
Unlimited PTO
401k
Company-paid Platinum PPO health insurance, plus comparable dental and vision insurance
$160K - $220K salary, 0.5% - 3% equity

About the interview

15-minute call with the CTO about the position, your experience, and career goals
First 30-minute technical screening call (coding interview)
Second 30-minute technical screening call (coding interview)
Onsite at our office to further review your skills, learn more about us, and to get a feel for how we’d work together
Offer extended

Skills Required

Proficient in Python
Proficient in SQL
Basic understanding of Linux
Basic understanding of Docker (including Docker Compose)
Experience building and maintaining large-scale data pipelines (100M+ data points/month)
Experience creating and managing data collection and crawling scripts
Experience building and maintaining automation/scheduling tooling (Dagster)
Ability to mentor teammates via code and architecture review
Able to execute with minimal supervision and manage multiple projects
Familiarity with TypeScript
Familiarity with Golang
Bonus skills: web crawling/browser/mobile automation, Kubernetes, LLM pipelines, DB administration, server/HomeLab administration, statistics, full-stack or mobile development

The Company

Year Founded: 2019

What We Do

Spice Data provides enterprise-level restaurant data, licensing this information to leading Fortune 500 companies. Founded in 2019 and based in San Francisco, the company focuses on the restaurant industry, delivering high-quality data solutions designed for large-scale enterprise use to help organizations gain critical insights into the restaurant sector and optimize their business strategies.