Data Engineer

Reposted 21 Days Ago
San Francisco, CA
In-Office
170K-220K Annually
Mid level
Cloud • Information Technology
The Role
The role involves building ETL pipelines for cloud billing data, ensuring data quality, designing data models, and optimizing workflows.
Summary Generated by Built In
About Duckbill

We are developing a SaaS product that simplifies financial planning and analysis of cloud billing data for large enterprises with complex cloud spending requirements. We're looking for a data engineer to wrangle complex cloud billing data by designing the pipelines that power our product.

We have fascinating technical challenges around data modeling and continuous quality control. We're analyzing massive amounts of semistructured data at scale: processing cloud bills with constantly evolving schemas—complexity that only increases as we expand functionality and provider support. On the frontend, customers use the data to drive large financial decisions, so full data product ownership and quality is key.

What You'll Do
  • Build and maintain ETL pipelines processing hundreds of millions of rows of cloud billing data

  • Work with ClickHouse, Parquet files, and S3 to design efficient data storage and retrieval systems

  • Develop data validation and quality control systems using Python and SQL

  • Design data models for complex, evolving cloud billing schemas (AWS CUR and beyond)

  • Build and optimize Airflow workflows for reliable data processing

  • Collaborate with the entire engineering team to investigate and resolve data quality issues

  • Scale data infrastructure as we expand to new cloud providers and use cases

About You
  • 3+ years experience with data products: warehouses/lakehouses/OLAPs, ETL pipelines, or job queues

  • Software engineering experience, with intermediate Python experience

  • Strong SQL skills including CTEs, window functions, and query optimization

  • Experience with data validation and quality control systems

  • Comfortable with columnar databases, Parquet, and cloud storage (S3)

  • Ability to deliver results in hours instead of days

  • Some experience in a startup environment, or ability to work well in a startup environment

  • Fastidiousness about data quality and comfort when there's no answer key

Nice-to-have
  • Experience with ClickHouse or other OLAP datastores

  • Past experience with cost management tools and/or cloud billing data

  • Experience with Airflow or similar workflow orchestration tools

  • Backend engineering experience beyond data pipelines

Why This Role is Exciting

You'll tackle genuinely complex technical challenges at scale while building expertise in cloud financial data—a rapidly growing and specialized field. You'll have end-to-end ownership of data products that directly impact customer success, working in a small team where your contributions immediately matter.

About Us

We are a small and growing team (less than 10 people!), which means you get the opportunity to be on the ground floor of building the product and company. Our founders are the founders of The Duckbill Group, who bring their wealth of domain expertise and deep industry and customer connections in cloud cost management to the product. We're currently in a semi-stealth mode while we're focusing on building the initial product.

This is an in-office role

We work together in the office in San Francisco three days per week, so you must be located in the SF Bay Area and willing to work in the office on a regular basis.

Top Skills

Airflow
Clickhouse
Parquet
Python
S3
SQL
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
6 Employees
Year Founded: 2019

What We Do

Our cloud cost management experts help companies fix their AWS bill by making it smaller and less horrifying. You may know us from our publications: Last Week in AWS, AWS Morning Brief, and Screaming in the Cloud.

Similar Jobs

BlackRock Logo BlackRock

Data Engineer

Fintech • Information Technology • Financial Services
In-Office
3 Locations
25000 Employees
133K-162K Annually

Mastercard Logo Mastercard

Data Engineer

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Hybrid
San Francisco, CA, USA
35300 Employees
120K-120K Annually

Notion Logo Notion

Data Engineer

Artificial Intelligence • Productivity • Software
Hybrid
San Francisco, CA, USA
1000 Employees
150K-177K Annually

Mastercard Logo Mastercard

Data Engineer

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Hybrid
San Francisco, CA, USA
35300 Employees
179K-318K Annually

Similar Companies Hiring

Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees
LayerOne Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
15 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account