Experienced Data Engineer - Data Engineering Technical Leader

Posted 12 Days Ago
San Francisco, CA
Hybrid
204K-360K Annually
Senior level
Fintech • Software
The Role
The Data Engineer will drive Plaid's data engineering strategy, focusing on building and scaling golden datasets, enhancing data architecture, and fostering a data-driven culture. Responsibilities include serving as the technical leader, mentoring junior engineers, collaborating with teams to deliver reliable data solutions, and ensuring the quality and effectiveness of data processes.
Summary Generated by Built In

Data Engineering is a cornerstone of Plaid's data-focused vision, playing a pivotal role in fostering a data-driven culture and advancing our vision of delivering trustworthy, truly differentiated data products powered by the largest network of user financial profiles. To support our data-focused vision, we need to scale our data systems while maintaining correct and complete data. We provide golden datasets and tooling to teams across engineering, product, and business and help them explore our data quickly and safely to get the data insights they need, which ultimately helps Plaid serve our customers more effectively. In addition, Plaid will not be successful if we can't move quickly. We build the data systems and tools that enable everyone at Plaid to be data-driven, making analytics easy, obvious, and proactive across the company.


Data Engineers heavily leverage SQL and Python to build data workflows that integrate with our Golang applications. We use tools like DBT, Airflow, Redshift, Atlan, and Retool to orchestrate data pipelines and define workflows. We work with data scientists, product analytics, business intelligence, software engineers, product managers, and many other teams to build Plaid's data strategy and a data-first mindset. Our engineering culture is IC-driven -- we favor bottom-up ideation and empowerment of our incredibly talented team. We are looking for engineers and technical leaders who are motivated by creating impact for our consumers and customers, growing together as a team, shipping the MVP, and leaving things better than we found them.


Drive Data Standards and Culture @ Plaid: You will play a high-impact role in defining and promoting data standards and fostering a strong data-driven culture across Plaid.

Deliver and Scale Golden Datasets: You will own the creation and future iterations of Golden Datasets for all major Plaid data models, driving their adoption across key data use cases to empower teams with reliable and actionable insights

Shape Unified Data Architecture: You will collaborate with other data practitioners and leaders to design a unified data architecture and conventions, making data at Plaid more accessible and intuitive. You’ll also enhance data exploration tools to enable efficient and effective usage of data across the organization.

Mentor and Build Technical Excellence: You will mentor junior engineers on the team, providing guidance on designs and implementations. You’ll play a key role in fostering a strong technical culture, empowering the team to deliver high-quality, scalable solutions.

Responsibilities

  • Serving as the primary technical DRI, defining and executing the long-term technical roadmap to build and sustain a data-driven culture at Plaid.
  • Working closely with senior leadership and executives to shape Plaid’s data engineering strategy and roadmap, ensuring alignment with the company’s data-focused product goals and overall vision.
  • Deep understanding of Plaid's products and strategy in order to inform the design of Golden Datasets, set data usage principles, and ensure data is structured for maximum impact and usability.
  • Focus on delivering well-documented datasets with clearly defined quality metrics, uptime guarantees, and demonstrable usefulness.
  • Leading critical data engineering projects that foster collaboration across teams, driving innovation and efficiency throughout the company.
  • Mentoring engineers, operations, and data practitioners on best practices for data organization.
  • Advocating for the adoption of emerging industry tools and practices, evaluating their fit and implementing them at the right time to keep Plaid at the forefront of data engineering.
  • Owning core SQL and Python data pipelines that power our data lake and data warehouse, ensuring their reliability, scalability, and efficiency.
  • Upholding Plaid’s commitment to data quality and privacy, embedding these principles into every aspect of data work.

Qualifications

  • 10+ years of experience in data engineering, with a proven track record of building scalable data systems and pipelines.
  • Experience working with massive datasets (500TB to petabytes) and developing robust data models and pipelines on top of them
  • Lead major data modeling, schema design, and data architecture efforts in past roles
  • Deep appreciation for schema design and the ability to evolve analytics schemas on top of unstructured data.
  • Advanced knowledge of SQL as a flexible, extensible tool and experience with modern orchestration tools like DBT, Mode, and Airflow.
  • Hands-on experience with performant data warehouses and lakes such as Redshift, Snowflake, and Databricks.
  • Building and maintaining both batch and real-time pipelines using technologies like Spark and Kafka.
  • Excited about exploring new technologies and building proof-of-concepts that balance technical innovation with user experience and adoption.
  • Enjoy getting into the technical details to manage, deploy, and optimize low-level data infrastructure.
  • Champion for data privacy and integrity, and always act in the best interest of consumers.

Top Skills

Go
Python
SQL
The Company
HQ: San Francisco, CA
1,012 Employees
On-site Workplace
Year Founded: 2013

What We Do

Plaid is used by thousands of digital financial apps and services like Betterment, Expensify, Microsoft and Venmo, and by many of the largest banks to make it easy for consumers to connect their financial accounts with the apps and services they want to use. Plaid connects with over 11,000 financial institutions across the U.S, Canada and Europe.

At Plaid, we have diverse backgrounds and skills, but we're all passionate about building a more efficient and inclusive financial infrastructure—together.

Similar Jobs

The Walt Disney Company Logo The Walt Disney Company

Lead Data Engineer, Data Reliability

AdTech • Digital Media • News + Entertainment
Hybrid
Santa Monica, CA, USA
200000 Employees
149K-219K Annually

Anduril Logo Anduril

Senior Data Engineer

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
Costa Mesa, CA, USA
1400 Employees
150K-225K Annually

Snap Inc. Logo Snap Inc.

Data Engineer, 7+ Years of Experience

Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
4 Locations
5000 Employees
172K-303K Annually

BlackLine Logo BlackLine

Principal Data Engineer - Snowflake

Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
Hybrid
Pleasanton, CA, USA
1810 Employees
201K-269K Annually

Similar Companies Hiring

Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account