Senior Software Engineer - Polaris & Data Lake Catalog

Posted 7 Days Ago
Be an Early Applicant
San Mateo, CA
Senior level
Artificial Intelligence • Big Data • Cloud • Machine Learning • Software • Database • Analytics
We're shattering barriers that prevent organizations from unleashing the true value of their data.
The Role
As a Senior Software Engineer for the Polaris and Data Lake Catalog team, you will design and implement scalable distributed systems, work on interoperability with external Iceberg catalogs, develop high-performance solutions, and contribute to data governance features. Your role will also involve collaborating with open-source teams and enhancing services related to data lakes and catalogs.
Summary Generated by Built In

Build the future of data. Join the Snowflake team.

Snowflake’s vision is to enable every organization to be data-driven. We’re at the forefront of innovation, helping customers realize the full potential of their data with our data cloud. We are now going beyond the traditional data warehouse and helping customers unlock the power of the open data lake house architecture with our Polaris project—an open-source implementation of the Iceberg REST catalog.

As a Senior Software Engineer on the Polaris and Data Lake Catalog team, you’ll play a key role in building and evolving our open and interoperable data lake ecosystem. You’ll work on some of the most complex and exciting challenges in distributed systems, contributing to Snowflake’s mission of providing a truly open data lake architecture, free from vendor lock-in.

AS A SENIOR SOFTWARE ENGINEER, YOU WILL:

  • Design and implement scalable, distributed systems to enable support for Iceberg DML/DDL transactions, schema evolution, partitioning, time travel, and more.
  • Architect and build systems that integrate Snowflake queries with external Iceberg catalogs (e.g., AWS Glue, Databricks Unity) and various data lake architectures, enabling seamless interoperability across cloud providers.
  • Develop high-performance, low-latency solutions for catalog federation, allowing customers to manage and query their data lake assets across multiple catalogs from a single interface.
  • Collaborate with Snowflake’s open-source team and the Apache Iceberg community to contribute new features and enhance the Iceberg REST specification.
  • Work on core data access control and governance features for Polaris, including fine-grained permissions such as row-level security, column masking, and multi-cloud federated access control.
  • Contribute to our managed Polaris service, ensuring that external query engines like Spark and Trino can read from and write to Iceberg tables through Polaris in a way that’s decoupled from Snowflake’s core data platform.
  • Build tooling and services that automate data lake table maintenance, including compaction, clustering, and data retention for enhanced query performance and efficiency.

OUR IDEAL SENIOR SOFTWARE ENGINEER WILL HAVE:

  • 8+ years of experience designing and building scalable, distributed systems.
  • Strong programming skills in Java, Scala, or C++ with an emphasis on performance and reliability.
  • Deep understanding of distributed transaction processing, concurrency control, and high-performance query engines.
  • Experience with open-source data lake formats (e.g., Apache Iceberg, Parquet, Delta) and the challenges associated with multi-engine interoperability.
  • Experience building cloud-native services and working with public cloud providers like AWS, Azure, or GCP.
  • A passion for open-source software and community engagement, particularly in the data ecosystem.
  • Familiarity with data governance, security, and access control models in distributed data systems.

BONUS POINTS FOR EXPERIENCE WITH:

  • Contributing to open-source projects, especially in the data infrastructure space.
  • Designing or implementing REST APIs, particularly in the context of distributed systems.
  • Managing large-scale data lakes or data catalogs in production environments.
  • Working on highly-performant and scalable query engines such as Spark, Flink, or Trino.

WHY JOIN THE POLARIS & DATA LAKE CATALOG TEAM AT SNOWFLAKE?

  • Be part of a pioneering effort to build the most open and interoperable data lake ecosystem in the industry.
  • Work on a high-impact open-source project that solves real-world data challenges for enterprise customers like Netflix, AWS, and others.
  • Collaborate with some of the brightest minds in the data ecosystem, including core contributors to Apache Iceberg.
  • Have the opportunity to innovate in one of the fastest-growing and evolving areas in data infrastructure, where you can make a direct impact on Snowflake’s growth and the broader open-source community.

Every Snowflake employee is expected to follow the company’s confidentiality and security standards for handling sensitive data. Snowflake employees must abide by the company’s data security plan as an essential part of their duties. It is every employee's duty to keep customer information secure and confidential.

Top Skills

C++
Java
Scala
The Company
HQ: Bozeman, MT
7,630 Employees
Hybrid Workplace
Year Founded: 2012

What We Do

Snowflake makes enterprise AI easy, efficient and trusted. Thousands of companies around the globe, including hundreds of the world’s largest, use Snowflake’s AI Data Cloud to share data, build applications, and power their business with AI. The era of enterprise AI is here. Learn more at snowflake.com.

Why Work With Us

We’re at the forefront of the AI data revolution, committed to building the world’s greatest data and applications platform. Our ‘get it done’ culture allows everyone at Snowflake to have an equal opportunity to innovate on new ideas, create work with a lasting impact, and excel in a culture of collaboration.

Gallery

Gallery

Similar Jobs

The Walt Disney Company Logo The Walt Disney Company

Lead Software Engineer, Ad Platforms

AdTech • Digital Media • News + Entertainment
Hybrid
Santa Monica, CA, USA
200000 Employees
152K-224K Annually

Roblox Logo Roblox

Senior Software Engineer- Creator Data Storage

Computer Vision • Gaming • Software • Virtual Reality • Web3 • Metaverse
San Mateo, CA, USA
2500 Employees
234K-284K Annually

Notion Logo Notion

Software Engineer, Experimentation Platform

Artificial Intelligence • Productivity • Software
San Francisco, CA, USA
800 Employees
Remote
San Francisco, CA, USA
13 Employees

Similar Companies Hiring

InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account