Data Architect

Posted Yesterday
2 Locations
In-Office or Remote
123K-150K Annually
Mid level
Automotive
The Role
Design and automate scalable ETL/ELT pipelines across relational, event, and unstructured sources; implement data governance, metadata and quality frameworks; architect cloud and distributed storage solutions; optimize physical data models and performance; integrate heterogeneous systems (Snowflake, Fabric, SAP, graph DBs); test, monitor, and modernize data infrastructure to support analytics.
Summary Generated by Built In

We are looking for a talented Data Architect to join our team specializing in Systems/Information Technology for Cummins, Inc. as part of DBU Data & Analytics, Remote.

In this role, you will make an impact in the following ways: 

  • Design and automate scalable data ingestion and transformation pipelines across relational, event-based, and unstructured sources.
  • Build and maintain frameworks to monitor, detect, and resolve data quality and integrity issues. Implement data governance practices, including metadata management, data access, and retention policies. 
  • Architect and guide development of reliable, efficient, and scalable ETL/ELT data pipelines with monitoring and alerting.
  • Design physical data models and optimize database structures, indexing, and relationships for performance. 
  • Test, optimize, and troubleshoot data pipelines to ensure stability and performance.
  • Develop and manage large-scale data storage solutions using distributed and cloud platforms (e.g., data lakes, Hadoop, NoSQL databases).
  • Drive automation and modernization of data infrastructure and integration processes to support agile analytics initiatives.
Responsibilities

To be successful in this role you will need the following: 

  • Data Extraction - Build scalable, automated ETL pipelines that deliver accurate, timely data. Choose the right tools and optimize transformations for performance and usability.
  • Programming - Write clean, well-documented, and testable code using best practices. Leverage version control and automation to ensure reliability and efficiency
  • Solution Validation Testing - Follow SDLC standards to thoroughly test and validate all solutions. Ensure outputs meet business requirements and perform correctly in production.
  • Data Quality - Proactively monitor and resolve data issues. Establish strong governance practices to maintain data accuracy and trust across the organization.
Qualifications

Education/Experience: 

  • College, university, or equivalent degree in relevant technical discipline, or relevant equivalent experience required. 
  • This position may require licensing for compliance with export controls or sanctions regulations. 
  • Intermediate experience in a relevant discipline area is required. Knowledge of the latest technologies and trends in data engineering are highly preferred and includes:
    - Familiarity analyzing complex business systems, industry requirements, and/or data regulations
    - Background in processing and managing large data sets
    - Design and development for a Big Data platform using open source and third-party tools
    - SPARK, Scala/Java, Map-Reduce, Hive, Hbase, and Kafka or equivalent college coursework
    - SQL query language
    - Clustered compute cloud-based implementation experience
    - Experience developing applications requiring large file movement for a Cloud-based environment and other data extraction tools and methods from a variety of sources
    - Experience in building analytical solutions 
    Intermediate experiences in the following are preferred:
    - Experience with IoT technology 
    - Experience in Agile software development

Additional Responsibilities:

Preferred Job Specific Skills – Data Architect

  • Dimensional Modeling Mastery — Deep expertise in designing enterprise‑scale dimensional models (star, snowflake, constellation) with strong command of fact table grain definition, surrogate key strategies, slowly changing dimensions (Types 1–6), bridge tables, and late‑arriving data handling.
  • Advanced SQL Engineering — Highly proficient in writing complex, high‑performance SQL, including window functions, CTE‑driven transformations, query plan analysis, cost‑based optimization, partitioning strategies, and performance tuning across large, distributed datasets.
  • Snowflake Architecture & Engineering — Hands‑on experience with Snowflake internals including micro‑partitioning, clustering keys, result‑set caching layers, warehouse sizing/auto‑suspend tuning, Snowpipe/Streams/Tasks orchestration, Time Travel, Zero‑Copy Cloning, and secure data sharing patterns.
  • Graph Database & Cypher Proficiency — Strong experience with Neo4j or equivalent graph platforms, including graph schema design, Cypher query optimization, graph algorithms (PageRank, community detection, pathfinding), and integration of graph workloads with analytical and relational systems.
  • Microsoft Fabric Ecosystem — Practical experience with Fabric Lakehouse architecture, Delta Lake optimization, Data Engineering pipelines, Data Factory orchestration, KQL‑based Real‑Time Analytics, semantic model creation, and integration with Power BI and OneLake governance.
  • SAP S/4HANA Data Structures —Familiarity of SAP S/4HANA data models (FI/CO, MM, SD, PP), CDS views, OData services, SLT/SDI/ODP‑based extraction patterns, and harmonization of SAP transactional data into cloud‑based analytical platforms.
  • Cloud Data Architecture — Strong understanding of distributed data processing, ELT/ETL orchestration, event‑driven ingestion (Kafka/Event Hub), metadata‑driven frameworks, schema evolution, and data lifecycle management across cloud environments (Azure preferred).
  • Data Governance & Metadata Management — Experience implementing enterprise data catalogs, lineage tracking, data quality rules, master data integration, and security models (RBAC/ABAC, row‑level and column‑level security).
  • Performance Engineering & Optimization — Ability to diagnose bottlenecks across compute, storage, and network layers; optimize workloads for cost and performance; and design scalable, fault‑tolerant data architectures.
  • Cross‑Platform Integration — Experience integrating heterogeneous systems (SAP, Snowflake, Fabric, graph DBs, APIs, streaming platforms) into unified analytical ecosystems with strong focus on interoperability and data consistency.

Compensation: 

Please note that the salary range provided is a good faith estimate on the applicable range. The final salary offer will be determined after considering relevant factors, including a candidate’s qualifications and experience, where appropriate.

Premium Range:

Minimum: $123,030

Maximum: $150,370

About UsCummins is an equal opportunity employer. Our policy is to provide equal employment opportunities to all qualified persons without regard to race, sex, color, disability, national origin, age, religion, union affiliation, sexual orientation, veteran status, citizenship, gender identity, or other status protected by law.

Skills Required

  • College, university, or equivalent degree in a relevant technical discipline or equivalent experience
  • May require export control or sanctions compliance licensing
  • Intermediate experience in a relevant discipline (data engineering/architecture)
  • Build scalable, automated ETL/ELT pipelines and data extraction from varied sources
  • Write clean, testable code and use version control and automation
  • Knowledge of Big Data platform design and development (open source and third-party tools)
  • Experience with Spark, Scala or Java, Map-Reduce, Hive, HBase, and Kafka (or equivalents)
  • Proficiency with SQL and high-performance query development/tuning
  • Clustered/cloud compute implementation experience and large-file cloud data movement
  • Experience building analytical solutions and processing large datasets
  • Experience with Snowflake architecture and features (Snowpipe, Streams, Tasks, Time Travel)
  • Experience with Neo4j or graph DBs and Cypher
  • Familiarity with Microsoft Fabric, Delta Lake, Azure Data Factory, KQL, Power BI, OneLake
  • Familiarity with SAP S/4HANA data extraction patterns (CDS views, OData, SLT/SDI/ODP)
  • Experience with IoT technologies
  • Experience with Agile software development practices

Cummins Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Cummins and has not been reviewed or approved by Cummins.

  • Retirement Support A 401(k) with company contribution/match and both defined contribution and defined benefit pension plans are offered, alongside profit sharing and an employee stock purchase plan. This mix supports long-term savings and financial security.
  • Healthcare Strength Multiple medical plan options (HSA, HSA Plus, PPO) with dental, vision, life and long-term disability coverage are provided, along with telehealth, mental-health support, and wellness tools. In-network protections and HSA/HSA Plus structures are described to help manage costs.
  • Parental & Family Support Paid maternity and paternity leave, family medical leave, and adoption assistance are offered. Reduced or flexible hours and unpaid extended leave options further support caregiving needs.

Cummins Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Columbus, IN
35,251 Employees
Year Founded: 1919

What We Do

At Cummins, we empower everyone to grow their careers through meaningful work, building inclusive and equitable teams, coaching, development and opportunities to make a difference. Across our entire organization, you'll find engineers, developers, and technicians who are innovating, designing, testing, and building. You'll also find accountants, marketers, as well as manufacturing, quality and supply chain specialists who are working with technology that's just as innovative and advanced.

Similar Jobs

Jellyfish Logo Jellyfish

Data Architect

Big Data • Cloud • Productivity • Software • Database • Analytics • Automation
Remote or Hybrid
United States
225 Employees
200K-260K Annually

Cedar Logo Cedar

Data Architect

Artificial Intelligence • Fintech • Healthtech • Software
Easy Apply
Remote
United States
420 Employees
247K-312K Annually

CUNA Mutual Group Logo CUNA Mutual Group

Data Architect

Fintech • Insurance • Financial Services
Remote
USA
3634 Employees
158K-237K Annually

Skillable Logo Skillable

Data Architect

Edtech • Software
Remote
United States
288 Employees
170K-200K Annually

Similar Companies Hiring

Cox Enterprises Thumbnail
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Atlanta, GA
50000 Employees
UL Solutions Thumbnail
Automotive • Professional Services • Software • Consulting • Energy • Chemical • Renewable Energy
Chicago, IL
15000 Employees
HERE Technologies Thumbnail
Artificial Intelligence • Automotive • Computer Vision • Information Technology • Internet of Things • Logistics • Software
Amsterdam, NL
6000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account