Internal Audit - Data Engineer III

Posted Yesterday
Be an Early Applicant
Pune, Maharashtra, IND
In-Office
Mid level
Healthtech • Logistics • Pharmaceutical
We are united in our responsibility to create healthier futures
The Role
The role involves designing and maintaining scalable data pipelines using Databricks, ensuring data quality and availability for audits, and optimizing ETL processes.
Summary Generated by Built In
Our team members are at the heart of everything we do. At Cencora, we are united in our responsibility to create healthier futures, and every person here is essential to us being able to deliver on that purpose. If you want to make a difference at the center of health, come join our innovative company and help us improve the lives of people and animals everywhere. Apply today!
Job Details
POSITION SUMMARY
The Internal Audit Data Analytics team is seeking experienced Data Engineer to support the build-out and ongoing enhancement of Internal Audit's Databricks-based analytics environment. This role will focus on designing, building, and maintaining scalable data pipelines and data lake solutions used to support stand-alone audits, continuous auditing, and risk monitoring initiatives across the enterprise.
Reporting to the Data Analytics Sr. Manager - Internal Audit, the Data Engineers will play a critical role in enabling high-quality, governed, and automated data flows into Internal Audit's Databricks cube. This position will partner closely with auditors, data analysts, IT organization, and business stakeholders to ensure reliable data ingestion, data quality, and availability of analytical datasets for use in audit execution, risk assessments, and strategic data-driven initiatives.
This is a hands-on engineering role requiring deep technical expertise in Databricks, cloud platforms (Azure preferred), data modeling, ETL/ELT design, and development of scalable data engineering solutions.
PRIMARY DUTIES AND RESPONSIBILITIES
Data Engineering & Pipeline Development
  • Design, build, and maintain large-scale, fault-tolerant data pipelines using Python/PySpark, Databricks, Delta Lake, and orchestration tools (e.g., Airflow, Azure Data Factory).
  • Develop and optimize ETL/ELT workflows to support ingestion, transformation, and modeling of large datasets into a Lakehouse using Delta Lake: batch ingestion from files, databases, APIs; streaming using Structured Streaming; handling semi-structured data (JSON, Parquet, Avro); ELT patterns using Spark SQL / PySpark; Incremental processing patterns; Databricks Jobs; External orchestrators (ADF, Airflow, etc.)
  • Hands-on experience with SAP ECC or SAP S/4HANA data extraction and processing
  • Implement CDC, incremental loads, and full refresh patterns; handle schema evolution and data reconciliation.
  • Develop and maintain curated data models (bronze/silver/gold) and support BI/analytics consumption.
  • Optimize performance and cost (partitioning, Z-ORDER, file sizing, caching, cluster policies, job tuning).
  • Implement scalable data lake and analytical platform architectures on Azure, ensuring security, governance, and cost efficiency.
  • Automate repeatable ingestion processes using infrastructure as code (IaC) and Continuous Integration (CI)/Continuous Delivery (CD) deployment methodologies.
  • Develop robust data models and semantic layers to facilitate analytical consumption by auditors and Data Analytics teammates.

Data Quality, Monitoring & Governance
  • Create and manage data quality checks, anomaly detection routines, and automated alerting to ensure accuracy and integrity of audit datasets, and SLA-driven operations.
  • Establish repeatable processes for documenting data lineage, validation, reconciliation, and test coverage.
  • Implement scalable frameworks for metadata management, schema validation, and versioning of data pipelines.

Audit Collaboration & Analytics Support
  • Support IA audit execution by enabling access to clean, reliable, and well-documented datasets.
  • Provide SME-level guidance on data availability, data structures, pipeline behavior, and data limitations.

Standards, Innovation & Best Practices
  • Establish consistency in design patterns, coding approaches, documentation, and engineering standards.
  • Identify opportunities to modernize or optimize existing pipelines, architecture, or data processing patterns.
  • Contribute to the continuous improvement of the Internal Audit analytics program through automation, performance tuning, and new capability development.
  • Create and maintain technical documentation, runbooks, and onboarding guides.
  • Participate in code reviews and promote engineering best practices (testing, CI/CD, version control).

EXPERIENCE AND EDUCATIONAL REQUIREMENTS
  • Bachelor's or Masters degree in Computer Science, Data Engineering, Information Systems, Analytics, or related discipline; equivalent work experience considered.
  • Minimum 3-5 years of relevant experience required; 5-7 years preferred including 2-4 years of hands-on Data Engineering experience with Databricks.
  • Build and manage scalable ETL/ELT pipelines integrating data from SAP ECC or SAP S/4HANA.
  • Deep expertise working with Databricks, including cluster design, notebook development, Spark optimization, Delta Lake, Delta Live Tables, Unity Catalog (Centralized permissions, Data lineage, Table & schema access controls), and data governance/access controls.
  • Strong proficiency in Python, PySpark/Spark, and SQL; understanding of Spark architecture: Driver, Executors, Stages, Tasks and Shuffle, portioning, caching. Performance tuning and optimization on large datasets.
  • Experience designing and managing large-scale data ingestion from complex enterprise systems (ERP, financial systems, operational platforms).
  • Hands-on experience with Azure (preferred), Amazon Web Services (AWS), or Google Cloud Platform (GCP) cloud services.
  • Solid understanding of data warehousing/Lake house concepts, Delta Lake (Delta Lake table design, ACID transactions, schema enforcement & evolution, time travel, handling late arriving data) and medallion architecture.
  • Experience in creating and supporting end-to-end ETL/ELT workflows.
  • Experience handling semi-structured data (JSON, Parquet, Avro).
  • Prior experience developing semantic models for analytics consumption.
  • Strong experience with data quality frameworks, validation routines, and monitoring strategies.
  • Experience with Git-based development and CI/CD practices.
  • Experience with cloud storage and services (Azure Data Lake Storage).
  • Experience with data integration tools (e.g., Fivetran, ADF, etc).
  • Experience collaborating with US-based onshore teams is strongly preferred.
  • Databricks Data Engineer Professional or Associate Certification is preferred.
  • Azure Data Engineer Associate (DP-203) Certification is preferred

What Cencora offers
Benefit offerings outside the US may vary by country and will be aligned to local market practice. The eligibility and effective date may differ for some benefits and for team members covered under collective bargaining agreements.
Full time
Affiliated Companies
Affiliated Companies: CENCORA BUSINESS SERVICES INDIA PRIVATE LIMITED
Equal Employment Opportunity
Cencora is committed to providing equal employment opportunity without regard to race, color, religion, sex, sexual orientation, gender identity, genetic information, national origin, age, disability, veteran status or membership in any other class protected by federal, state or local law.
The company's continued success depends on the full and effective utilization of qualified individuals. Therefore, harassment is prohibited and all matters related to recruiting, training, compensation, benefits, promotions and transfers comply with equal opportunity principles and are non-discriminatory.
Cencora is committed to providing reasonable accommodations to individuals with disabilities during the employment process which are consistent with legal requirements. If you wish to request an accommodation while seeking employment, please call 888.692.2272 or email [email protected]. We will make accommodation determinations on a request-by-request basis. Messages and emails regarding anything other than accommodations requests will not be returned

Skills Required

  • Bachelor's or Master's degree in Computer Science, Data Engineering, Information Systems, Analytics, or related discipline
  • 3-5 years of relevant experience; 5-7 years preferred
  • Hands-on Data Engineering experience with Databricks
  • Strong proficiency in Python, PySpark/Spark, and SQL
  • Experience with Azure (preferred), AWS, or GCP cloud services
  • Experience with data integration tools (e.g., Fivetran, Azure Data Factory)
  • Databricks Data Engineer Professional or Associate Certification is preferred
  • Azure Data Engineer Associate (DP-203) Certification is preferred

What the Team is Saying

Jason
Silvana
Paul Fritzsch
Denesha Thompson
Cindy Aviles
Denesha Thompson
Tina Martinez

Cencora Compensation & Benefits Highlights

  • Healthcare Strength Benefits begin on Day 1 and include medical, dental, vision, prescription coverage, behavioral health/EAP, virtual musculoskeletal physical therapy, tobacco-cessation, and a wellness program that can lower premiums. Immediate eligibility and broad coverage are consistently emphasized across the package.
  • Parental & Family Support Paid parental leave of 12 weeks, two weeks of paid caregiver leave, backup child care, and fertility and family‑building supports (including adoption and surrogacy assistance) are offered. Family supports are positioned to cover multiple paths to parenthood and caregiving needs.
  • Retirement Support A 401(k) program provides a company match on contributions with flexibility to apply an equivalent match to qualifying student‑loan payments. This structure underscores support for long‑term savings alongside debt management.

Cencora Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Conshohocken, PA
51,000 Employees
Year Founded: 2023

What We Do

Cencora is a leading pharmaceutical solutions organization centered on improving the lives of people and animals everywhere. With 46,000+ global team members, we have the opportunity to make a positive impact on healthcare in communities everywhere. Our team members are empowered to activate their careers through a collective of tools and resources designed to support individual career interests and aspirations. We value our listening culture that actions real outcomes and our team members appreciate and recognize one another for contributions that are making a meaningful global impact. No matter what your role is here, the work we do together has meaning. When you join our team, you become a crucial part of a greater purpose. We’re committed to supporting you personally and professionally, so we can achieve more together at the center of health. Protect yourself from job scams: Recruitment scams are on the rise. To protect yourself, we urge you to be vigilant and follow these guidelines > https://careers.cencora.com/us/en/job-scams

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Cencora Teams

Team
Early Careers
Team
Information Technology
About our Teams

Cencora Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: Flexible
Company Office Image
HQConshohocken, PA
Bolsover, GB
București, RO
Carrollton, TX
Chessington, GB
Dříteň, CZ
Feltham, GB
Gennevilliers, FR
Marseille, FR
Oakville, ON
Company Office Image
Pune, Maharashtra
Villanueva de Gállego, Zaragoza
Vilniaus miesto, LT
Woking, GB
Zaragoza, Zaragoza
Learn more

Similar Jobs

Cencora Logo Cencora

Automation Engineer

Healthtech • Logistics • Pharmaceutical
In-Office
Pune, Maharashtra, IND
51000 Employees

Cencora Logo Cencora

Integration Engineer

Healthtech • Logistics • Pharmaceutical
In-Office
Pune, Maharashtra, IND
51000 Employees

Cencora Logo Cencora

Analyst III - ERP IT Solutions

Healthtech • Logistics • Pharmaceutical
In-Office
Pune, Maharashtra, IND
51000 Employees

Cencora Logo Cencora

Sr Analyst People Analytics & Insights

Healthtech • Logistics • Pharmaceutical
In-Office
Pune, Maharashtra, IND
51000 Employees
1-1 Annually

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account