Distinguished Engineer, Data Platform

2 Locations
Remote or Hybrid
275K-330K Annually
Expert/Leader
Cloud • Software
The Role
As the Distinguished Engineer, you will lead the design and implementation of CloudZero's next-generation data platform, focusing on streaming architecture, data modeling, and query engine optimization to improve cost attribution and performance at scale.
Summary Generated by Built In
About the Role

CloudZero is growing fast. Our customer base is expanding, the data challenges we're solving are getting more complex, and the platform is scaling to match. As a Distinguished Engineer on the Data Engineering team, you'll own some of the hardest infrastructure problems at CloudZero: shaping the next-generation streaming data platform, the dimensional cost model underlying every attribution decision, the hot/cold storage architecture serving both real-time and historical queries, and the query engine that powers our entire product.

This is real platform architecture work at real scale, not a consulting role or a review-and-advise job. You'll define the roadmap, drive the foundational decisions, and be a force multiplier for a talented engineering team — evolving CloudZero from batch-oriented pipelines toward a streaming-first architecture where cost attribution reaches engineers within seconds of a resource being used, not the next morning.

This role is ideal for an architect who has built systems like this before, has the scars to prove it, and wants to see their decisions matter in direct and measurable ways for customers and for the business.

What You'll Do

Define the Data Platform Architecture

  • Lead end-to-end technical design for CloudZero's next-generation data platform, from event ingestion and stream processing through hot/cold storage and the query layer to the API surface

  • Document architectural decisions, tradeoffs, and migration strategies with the rigor of an RFC-driven process

  • Shape and drive every layer of the new architecture: event ingestion, stream processing and enrichment, real-time serving, analytical storage, query layer, and API

Drive Streaming Infrastructure to Production

  • Design and deliver CloudZero's real-time data pipeline from ingestion through enrichment to serving

  • Establish SLOs for throughput, latency, and correctness, and build the operational playbooks that make this system trustworthy enough to replace the batch pipelines our entire product currently depends on

  • Tackle real-time streaming at scale across thousands of customers simultaneously, with fault tolerance, backpressure awareness, and correctness as non-negotiables

Tackle the Dimension Cardinality Problem

  • Redesign CloudZero's dimensional cost model to support high-cardinality, multi-dimensional cost attribution without runaway materialization costs

  • Drive incremental, delta-based materialization strategies using modern open table formats, dramatically reducing expensive full-rebuild jobs and unlocking millions in annual infrastructure savings

Evolve the Query Layer

  • Assess CloudZero's current query infrastructure, drive in-flight migrations to completion, and lead the evolution of the query engine layer going forward

  • Own performance optimization across partition pruning, predicate pushdown, and query planning, and set the vision for how the query layer grows as data volumes scale 10x

Extend Cost Attribution to Real-Time

  • Evolve CloudZero's proprietary cost attribution engine from a batch-oriented model to one that assigns complex cost dimensions by team, feature, and customer within seconds of resource usage

  • Rethink enrichment, data lineage, and correctness guarantees in a streaming context

Shape the Data Engineering Roadmap

  • Partner with product, infrastructure, and analytics engineering to define a multi-year data platform roadmap

  • Build consensus across engineering leadership on foundational investments including table formats, streaming frameworks, query engines, and schema management

Elevate the Engineering Team

  • Participate in architecture reviews, contribute to design patterns and best practices, and mentor senior and staff engineers through code review, pairing, and structured feedback

  • Make everyone around you better, not by directing, but by raising the collective craft

What You Bring

Data Platform & Architecture

  • 10+ years in data engineering with a clear trajectory toward principal or staff-level architecture

  • Built and operated large-scale data platforms serving tens of millions of events per day in production

  • Deep experience with streaming systems such as Kafka, Kinesis, Flink, or Spark Streaming at real production throughput

  • Strong hands-on fluency with modern open table formats such as Apache Iceberg, Delta Lake, and Hudi, covering compaction, partitioning strategy, and time-travel queries

  • Designed hot/cold storage architectures with explicit latency SLOs per tier

  • Proven ability to drive a data platform end to end, not just a single layer

Data Modeling & Dimensional Design

  • Expert in dimensional data modeling including fact/dimension schema design, slowly changing dimensions, and cardinality management

  • Deep understanding of the materialization tradeoff space: full vs. incremental, push vs. pull, pre-aggregate vs. query-time

  • Experience with cost attribution, showback/chargeback, or multi-tenant data partitioning patterns

  • Strong SQL and query optimization background across predicate pushdown, partition pruning, and cost-based query planning

Query Engines & Compute

  • Hands-on with distributed query engines such as Trino, Presto, Spark SQL, or DuckDB including configuration, optimization, and production operations

  • Understands catalog and metadata management and how it couples to query engines

  • Comfortable with cloud data warehouses such as Snowflake, BigQuery, and Redshift and how they integrate with open table formats

  • Experience driving query engine migrations while maintaining production SLAs

Engineering Leadership

  • Track record as a technical anchor for a data platform or data engineering team

  • Writes clear ADRs, RFCs, and technical design docs that bring engineers along

  • Can drive multi-month, multi-team technical initiatives from inception to production without heavy process overhead

  • Communicates complex tradeoffs to non-technical stakeholders including product and business leadership

  • Comfortable in a high-autonomy environment: builds consensus, influences through expertise, and helps teams move forward

Bonus If You Have...

  • FinOps or cloud cost domain experience

  • Multi-cloud data ingestion across AWS, Azure, and GCP

  • Apache Flink at production scale

  • Lakehouse architecture patterns

  • Real-time feature engineering for ML

  • Data mesh or domain-oriented design patterns

  • Prior startup or high-growth SaaS experience

  • Open source contributions to the data ecosystem

About CloudZero

Cloud cost management is one of the biggest challenges organizations face today. As cloud adoption continues to accelerate, so do the complexities and costs associated with it, and macroeconomic conditions only increase pressure to prove cloud efficiency.

CloudZero is a SaaS platform at the intersection of next-generation cloud cost management and FinOps. We ingest billing and usage data from all cloud, SaaS, and PaaS providers, organize it in real time according to our customers' business structures, and empower organizations to make more informed business decisions.

Since our founding in 2016, our mission has been to make efficient innovation a reality for every cloud-driven organization. We believe every engineering decision is a buying decision, and we're applying proven reliability engineering principles to financial efficiency.

We believe the best AI empowers users with clear insights and confident decisions, transforming complex cloud cost data into actionable intelligence that drives meaningful business outcomes.

To date, we've raised over $56 million from leading venture capital firms. We're solving problems of massive scale, business importance, and complexity in a space that needs it more than ever.

Equal Opportunity Employer

CloudZero is an equal opportunity employer and values diversity. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status or disability status. All job offers are contingent upon the candidate passing background and reference checks.

Skills Required

  • 10+ years in data engineering
  • Deep experience with streaming systems
  • Strong hands-on fluency with modern open table formats
  • Expert in dimensional data modeling
  • Experience driving query engine migrations

CloudZero Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about CloudZero and has not been reviewed or approved by CloudZero.

  • Healthcare Strength: Healthcare coverage is described as comprehensive, spanning medical, dental, and vision. This breadth is consistently presented as a core part of the total rewards package.
  • Leave & Time Off Breadth: Paid time off is presented as flexible and generous, with practices like Focus Fridays supporting balance. Remote-first policies and periodic meetups complement the time-off approach.
  • Equity Value & Accessibility: Equity grants are included broadly, giving employees a stake in the company’s success. This equity component is positioned as a meaningful part of total compensation.

The Company
HQ: Boston, MA
180 Employees
Year Founded: 2016

What We Do

CloudZero is the only cloud cost intelligence platform that puts engineering in control by connecting technical decisions to business results. CloudZero ingests cost data from AWS and Snowflake, organizes it for analysis, and delivers the insights to engineering teams who can understand how their work is impacting the business. You can answer questions like:

  • Who are my most expensive customers?
  • Which product, feature, and team is spending the most?
  • Has the profitability of my product changed quarter over quarter?

The outcome is real-time intelligence that helps companies control their cost of goods sold (COGS) and gross margins, aligning engineering and finance teams once and for all.
