Data Engineer

Posted 2 Days Ago
Hiring Remotely in Sunnyvale, CA, USA
In-Office or Remote
Mid level
Artificial Intelligence • Machine Learning • Software • Analytics
The Role
Design, build, and maintain ETL data pipelines on Microsoft Fabric or equivalent tools; implement data quality checks, incremental loads, and transformations using SQL, Power Query, or Python; develop dimensional models and semantic metrics; troubleshoot operational issues; document lineage and support UAT and stakeholders while escalating architectural decisions to the Lead.
Summary Generated by Built In

Data Engineer   INDIVIDUAL CONTRIBUTOR

Mortgage Cadence Platform (MCP/LOS)  |  Role Profile  |  Draft for Recruiting

Bottom line: The Data Engineer operates within the framework established by the Lead — designing, building, and maintaining robust data pipelines and transformation logic that power analytics, compliance, and operational reporting across the Mortgage Cadence Platform. The role is execution-focused with increasing ownership of end-to-end data workflows as familiarity with the platform grows. Strong SQL, ETL, and data quality skills are required; the ability to build reports and leverage semantic models is secondary to data engineering excellence.

CORE RESPONSIBILITIES

DATA PIPELINE DEVELOPMENT

  • Design and build extraction, transformation, and loading (ETL) pipelines using Microsoft Fabric (Dataflow Gen2, Notebooks, or equivalent tools)
  • Write optimized SQL queries and transformations for data ingestion from designated source systems
  • Apply data quality rules and validation logic at each pipeline stage
  • Implement incremental loads and manage refresh schedules for performance
  • Escalate to Lead for architectural decisions or complex transformation patterns

DATA QUALITY & VALIDATION

  • Define and implement data quality checks at ingestion, transformation, and output stages
  • Perform ongoing data validation to ensure pipeline outputs align with business logic and source system expectations
  • Identify, document, and escalate data quality issues with root cause analysis
  • Maintain data quality dashboards and SLA monitoring
  • Support UAT for new data sources or transformation logic

TRANSFORMATION & MODELING

  • Build and maintain data transformations using Power Query, SQL, or Python as appropriate
  • Develop dimensional models and define aggregation logic aligned with analytics requirements
  • Optimize data structures for performance and maintainability
  • Document transformation logic, lineage, and assumptions per team standards
  • Collaborate with Lead to define semantic models and calculated metrics

OPERATIONAL SUPPORT

  • Troubleshoot pipeline failures and performance issues; coordinate resolution with IT/Engineering
  • Respond to data discrepancy reports from business users and analysts
  • Maintain documentation of data sources, data dictionaries, and transformation specifications
  • Support capacity planning and optimization of Fabric environments and pipelines

REQUIRED SKILLS

Technical

  • Advanced SQL — query optimization, window functions, performance tuning, debugging complex transformations
  • Proficient with Microsoft Fabric — (Dataflow Gen2, Notebooks, Lakehouse) OR equivalent ETL tools (Python, dbt, Talend, Informatica)
  • Strong understanding of relational database design and dimensional modeling
  • Power Query / M — complex data shaping, merging, error handling, and transformation logic
  • Python or similar scripting language — data manipulation, pipeline automation
  • Git/version control basics — able to collaborate on code and track changes
  • Data quality and testing frameworks — unit tests, assertions, validation rules

Functional

  • Ability to interpret business requirements and design efficient data solutions
  • Data governance mindset — understands data lineage, documentation, and quality standards
  • Proactive about identifying edge cases and potential data issues
  • Mortgage/lending domain familiarity preferred; willingness to learn domain required
  • Works effectively within defined standards and escalates architectural questions to Lead
  • Able to balance speed with quality; advocates for technical excellence

COMMUNICATION REQUIREMENTS BY STAKEHOLDER

Stakeholder

Interaction Context

Communication Requirements

Analytics / BI Team

Data pipeline requirements, data quality issues, model design collaboration

  • Translate analytical requirements into robust data solutions
  • Communicate data lineage and transformation logic clearly
  • Document assumptions and limitations of data sources and transforms
  • Set realistic timelines for new pipelines or data source onboarding

Data Lead

Daily collaboration, code/design review, escalation of technical blockers

  • Provide detailed status updates on assigned pipelines; flag performance or quality concerns early
  • Document design decisions and trade-offs for Lead review — escalate architecture questions rather than assume
  • Demonstrate commitment to code quality and maintainability; accept technical feedback constructively

IT / Engineering

Data access provisioning, source system clarifications, infrastructure support

  • Communicate data requirements precisely — schema details, volume expectations, refresh frequency
  • Escalate data access or infrastructure needs through Lead; provide business context
  • Provide detailed defect reports with query examples and expected vs. actual results

Business / Operations

Data quality escalations, new data source requests

  • Explain data quality issues and timelines in business terms; avoid over-technical language
  • Ask clarifying questions about data requirements and business logic expectations
  • Set expectations transparently; communicate delays or blockers early through Lead

Disclaimer: HeadSpin does not charge any fees at any stage of the recruitment or selection process. We will never ask candidates to pay money or share financial information in exchange for a job offer. If you receive any communication requesting payment on behalf of HeadSpin, please treat it as fraudulent and report it immediately to [email protected]

Skills Required

  • Advanced SQL (query optimization, window functions, performance tuning)
  • Proficient with Microsoft Fabric (Dataflow Gen2, Notebooks, Lakehouse) or equivalent ETL tools (dbt, Talend, Informatica)
  • Strong understanding of relational database design and dimensional modeling
  • Power Query / M for complex data shaping and transformations
  • Python or similar scripting language for data manipulation and pipeline automation
  • Git / version control basics for collaborative code management
  • Experience with data quality and testing frameworks (unit tests, assertions, validation rules)
  • Ability to interpret business requirements and translate to data solutions
  • Data governance mindset: documentation, lineage, and quality standards
  • Mortgage/lending domain familiarity
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Sunnyvale, CA
236 Employees
Year Founded: 2015

What We Do

HeadSpin is the world’s first Digital Experience AI Platform combining cloud-hosted and on-prem global device infrastructure, test automation, and ML-driven performance & quality of experience analytics for mobile, web, audio, and video. HeadSpin empowers engineering, QA, operations, and product teams to assure optimal digital experiences throughout the development lifecycle. Learn more at www.HeadSpin.io.

Similar Jobs

Pie Insurance Logo Pie Insurance

Data Engineer

Fintech • Insurance • Machine Learning • Analytics • Financial Services • Automation
Easy Apply
Remote
United States
350 Employees
115K-145K Annually

CrowdStrike Logo CrowdStrike

Data Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
USA
10000 Employees
85K-120K Annually

CrowdStrike Logo CrowdStrike

Data Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
7 Locations
10000 Employees
195K-320K Annually

Jellyfish Logo Jellyfish

Data Engineer

Big Data • Cloud • Productivity • Software • Database • Analytics • Automation
Remote or Hybrid
United States
225 Employees
165K-205K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account