Principal Data Engineer

Posted 4 Days Ago
4 Locations
In-Office or Remote
Senior level
Artificial Intelligence • Fintech • Software • Financial Services
The Role
The Principal Data Engineer will oversee data architecture, design scalable data pipelines, ensure data quality and governance, and enable analytics and machine learning across the organization.
Summary Generated by Built In

Worth AI, a leader in the computer software industry, is looking for a talented and experienced Principal Data Engineer to join their innovative team. At Worth AI, we are on a mission to revolutionize decision-making with the power of artificial intelligence while fostering an environment of collaboration, and adaptability, aiming to make a meaningful impact in the tech landscape.. Our team values include extreme ownership, one team and creating reaving fans both for our employees and customers.

Worth is looking for a Principal Data Engineer to own the company-wide data architecture and platform. Design and scale reliable batch/streaming pipelines, institute data quality and governance, and enable analytics/ML with secure, cost-efficient systems. Partner with engineering, product, analytics, and security to turn business needs into durable data products.

Responsibilities

What you will do:

  • Architecture & Strategy
    • Define end-to-end data architecture (lake/lakehouse/warehouse, batch/streaming, CDC, metadata).
    • Set standards for schemas, contracts, orchestration, storage layers, and semantic/metrics models.
    • Publish roadmaps, ADRs/RFCs, and “north star” target states; guide build vs. buy decisions.
  • Platform & Pipelines
    • Design and build scalable, observable ELT/ETL and event pipelines.
    • Establish ingestion patterns (CDC, file, API, message bus) and schema-evolution policies.
    • Provide self-service tooling for analysts/scientists (dbt, notebooks, catalogs, feature stores).
    • Ensure workflow reliability (idempotency, retries, backfills, SLAs).
  • Data Quality & Governance
    • Define dataset SLAs/SLOs, freshness, lineage, and data certification tiers.
    • Enforce contracts and validation tests; deploy anomaly detection and incident runbooks.
    • Partner with governance on cataloging, PII handling, retention, and access policies.
  • Reliability, Performance & Cost
    • Lead capacity planning, partitioning/clustering, and query optimization.
    • Introduce SRE-style practices for data (error budgets, postmortems).
    • Drive FinOps for storage/compute; monitor and reduce cost per TB/query/job.
  • Security & Compliance
    • Implement encryption, tokenization, and row/column-level security; manage secrets and audits.
    • Align with SOC 2 and privacy regulations (e.g., GDPR/CCPA; HIPAA if applicable).
  • ML & Analytics Enablement
    • Deliver versioned, documented datasets/features for BI and ML.
    • Operationalize training/serving data flows, drift signals, and feature-store governance.
    • Build and maintain the semantic layer and metrics consistency for experimentation/BI.
  • Leadership & Collaboration
    • Provide technical leadership across squads; mentor senior/staff engineers.
    • Run design reviews and drive consensus on complex trade-offs.
    • Translate business goals into data products with product/analytics leaders.

Requirements
    • 10+ years in data engineering (including 3+ years as staff/principal or equivalent scope).
    • Proven leadership of company-wide data architecture and platform initiatives.
    • Deep experience with at least one cloud (AWS) and a modern warehouse or lakehouse (e.g., Snowflake, Redshift, Databricks).
    • Strong SQL and one programming language (Python or Scala/Java).
    • Orchestration (Airflow/Dagster/Prefect), transformations (dbt or equivalent), and streaming (Kafka/Kinesis/PubSub).
    • Data modeling (3NF, star, data vault) and semantic/metrics layers.
    • Data quality testing, lineage, and observability in production environments.
    • Security best practices: RBAC/ABAC, encryption, key management, auditability.

Nice to Have

    • Feature stores and ML data ops; experimentation frameworks.
    • Cost optimization at scale; multi-tenant architectures.
    • Governance tools (DataHub/Collibra/Alation), OpenLineage, and testing frameworks (Great Expectations/Deequ).
    • Compliance exposure (SOC 2, GDPR/CCPA; HIPAA/PCI where relevant).
    • Model features sourced from complex 3rd-party data (KYB/KYC, credit bureaus, fraud detection APIs)

Benefits
    • Health Care Plan (Medical, Dental & Vision)
    • Retirement Plan (401k, IRA)
    • Life Insurance
    • Unlimited Paid Time Off
    • 9 paid Holidays
    • Family Leave
    • Work From Home
    • Free Food & Snacks (Access to Industrious Co-working Membership!)
    • Wellness Resources

Top Skills

Airflow
AWS
Dagster
Databricks
Dbt
Java
Kafka
Kinesis
Prefect
Pubsub
Python
Redshift
Scala
Snowflake
SQL
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Orlando, Florida
32 Employees
Year Founded: 2023

What We Do

Worth AI is a minority-founded SaaS platform powering inclusive financial underwriting through AI.

Our Values:

- Economic inclusion for everyone.
- Equitable decision-making through technology.
- Tackling financial inclusion for all - from enterprises to small businesses.

Get to Know Worth AI’s Technology:

Worth AI is an underwriting platform that leverages AI algorithms to deliver lightning-fast processing times, unparalleled accuracy, and a unique user experience.

Our patent-pending platform provides:

- Real-time data on the SMB market.
- Enabling advanced risk modeling.
- Comprehensive portfolio visibility.
- Continuous risk monitoring.

Our data-driven platform reduces risks and eliminates biases by processing thousands of traditional and non-traditional data sources.

We're pushing boundaries and rewriting the rulebook in fintech.

Let's work together to make real change happen.

Similar Jobs

Recorded Future Logo Recorded Future

Data Engineer

Information Technology • Machine Learning • Natural Language Processing • Security • Software • Analytics • Cybersecurity
Remote
United States

BlackCloak Logo BlackCloak

Data Engineer

Information Technology • Security • Software • Cybersecurity
Easy Apply
Remote
United States
190K-215K Annually
Remote
USA
134K-174K Annually

Motional Logo Motional

Principal Engineer

Artificial Intelligence • Automotive • Machine Learning • Transportation
Remote or Hybrid
3 Locations
175K-234K Annually

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Rain Thumbnail
Web3 • Payments • Infrastructure as a Service (IaaS) • Fintech • Financial Services • Cryptocurrency • Blockchain
New York, NY
40 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account