Lead Data Engineer

Reposted 8 Days Ago
3 Locations
Remote
Senior level
Artificial Intelligence
The Role
Lead the design and implementation of a modern data architecture, manage ETL pipelines, ensure data quality, and collaborate with cross-functional teams.
Summary Generated by Built In
Mission, Vision, Values

Verdigris is on a mission to sustain and enrich human life through responsive energy intelligence. Our AI sensors automate energy management and predict unseen equipment failures in mission-critical buildings. This is a critical step for autonomous, sustainable environments responsive to their inhabitants.


About You

You are deeply interested in how data flows — not just pipelines and tooling, but also how data is modeled, validated, and used to make decisions. You care about the structure and quality of data, and you take pride in designing systems that are reliable, scalable, and performant.

You’re execution-oriented: you like to ship, iterate, and improve. You’re comfortable navigating ambiguity and thrive in environments where the architecture is evolving. You enjoy tracking down data anomalies, validating assumptions, and making the invisible visible. You take ownership of your work, ask thoughtful questions, and collaborate well across disciplines.

You’re motivated by purpose building something that has impact, not just technically, but in the real world. You’re excited by the opportunity to shape the foundations of a modern data platform that supports climate-focused outcomes at scale.


About the Team

At Verdigris, our cloud software (data, web, ML) are a single team, collaborating to deliver insights that help data centers and other critical facilities optimize energy use and reduce carbon impact. We design and maintain APIs and data products that transform raw sensor data into real-time, actionable intelligence.

We partner closely with the Edge Hardware team, which streams high-fidelity, sub-second energy data from our IoT sensors to the cloud. Our team is responsible for modeling, storing, and serving that data to support real-time applications, machine learning, and customer-facing analytics.

We’re currently evolving our core architecture to embrace a modern, scalable data stack, including stream and OLAP-integrated databases like ClickHouse or StarTree (under evaluation), and are laying the foundation for a data mesh architecture. This will enable decentralized, domain-oriented data ownership and empower us to move faster with more reliable, discoverable, and performant data. You will help us design and implement this data architecture and migrate existing data.

We operate as a fully remote team with daily virtual standups and a two-week sprint cadence. We primarily work from 10:00am PST to 6:00pm PST. We’re committed to cross-functional collaboration and high-impact delivery.

Core Responsibilities

  • Collaborate with Product Management, Understand use cases and personas, and engineer product to support a strong user experience.
  • Own schema design and data modeling for energy metering and building management system (BMS) data.
  • Architect and maintain cost-effective and performant next generation data storage (e.g. ClickHouse, StarTree, etc).
  • Lead data architecture decisions, including evaluating and integrating tools in our modern data stack.
  • Build and manage robust, scalable ETL/ELT pipelines to ingest, transform, and serve data
  • Ensure performance and efficiency of analytical queries across large datasets
  • Develop and enforce data quality, validation, and governance standards

Adjacent Responsibilities

  • Support real-time IoT analytics and streaming pipelines.
  • Owning BI tooling (e.g. Superset, Looker, Tableau, etc).
  • Contribute to building internal data tools for engineers and analysts.
  • Collaborate with AI/ML teams to support model training and inference pipelines.
  • Work with web and application teams to ensure real-time and batch data access needs are met.
  • Manage team projects and coordinate with other technical leads.
  • Mentor junior engineers and contribute to technical hiring.

Required Qualifications

  • Align with core working hours, 10:00AM PST to 5:00PM PST in either pacific, mountain, or central timezones.
  • 5+ years of experience in data engineering with large-scale, high-throughput systems
  • Proven experience designing dimensional models and OLAP schema (fact/dimension tables)
  • Deep understanding of columnar stores and database internals (e.g., ClickHouse, Druid, StarTree, Pinot)
  • Strong SQL skills and proficiency with Python for data pipelines
  • Experience handling updates/inserts/type-2 dimensions for time-series or large-scale event stores

Preferred Qualifications

  • Experience with BMS/HVAC or Energy data is a plus
  • Experience with usage of time series and energy data used for diagnostics and efficiency.
  • Experience with IoT or sensor data systems.
  • Experience working in AWS Cloud.
  • Experience with Postgres.
  • Proficiency in orchestrating ETL workflows (e.g. Dagster, Airflow, AWS Step Functions, etc.)
  • Familiarity with stream processing tools (e.g., Kafka, Flink, Spark Streaming)
  • Exposure to machine learning feature stores or MLOps tooling
  • Experience with data observability and data cataloging tools
  • Experience managing a team or others.

Applying to Verdigris is a chance to make an impact by joining a mission-driven startup. We’re innovating for the energy management industry hoping to positively affect climate change. Verdigrisians aim to be ego-free authorities in our fields. We take our work seriously and strive for an opportunity-filled environment supportive of curious minds.

You can expect thoughtful, hardworking, and funny teammates. We value differing perspectives and embrace candid, direct and constant feedback. We are an equal opportunity employer. We do not discriminate on the basis of race, religion, color, origin, gender, orientation, age, or status.

Top Skills

Airflow
AWS
Clickhouse
Dagster
Flink
Kafka
Postgres
Python
Spark Streaming
SQL
Startree
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Moffett Field, CA
40 Employees
Year Founded: 2012

What We Do

Verdigris is an artificial intelligence IoT platform that makes buildings smarter and more connected while reducing energy consumption and costs. By combining proprietary hardware sensors, machine learning, and software, Verdigris “learns” the energy patterns of a building. Their AI software produces comprehensive reports including energy forecasts, alerts about faulty equipment, maintenance reminders, and detailed energy usage information for each and every device and appliance. Verdigris offers a suite of applications that gives building engineers a comprehensive overview, an “itemized utility bill”, powerful reporting, and simple automation tools for their facility. For more information, visit www.verdigris.co.

Similar Jobs

GE Vernova Logo GE Vernova

Lead Data Engineer

Energy • Manufacturing • Solar • Renewable Energy
In-Office or Remote
2 Locations
98K-140K
Remote
CAN

Aviso Logo Aviso

Data Engineer

Fintech • Payments • Financial Services
In-Office or Remote
2 Locations
105K-120K Annually

Afresh Logo Afresh

Data Engineer

Artificial Intelligence • Machine Learning • Retail • Social Impact • Software
Easy Apply
Remote or Hybrid
Ontario, ON, CAN

Similar Companies Hiring

Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account