Principal Data Engineer - AI Program

Posted Yesterday
Be an Early Applicant
Rochester, MN, USA
In-Office
Senior level
Healthtech
The Role
Designs, builds, and operates large-scale healthcare data pipelines and ecosystems to support analytics and AI/ML initiatives. Partners with product owners and clinical stakeholders to retrieve, transform, and optimize structured and unstructured data across hybrid and multi-cloud environments. Provides technical leadership, architects scalable cost-efficient solutions, ensures secure compliant access to data, and supports development of foundation models and agentic systems.
Summary Generated by Built In

The Senior Data Engineer - AI Program develops and deploys data pipelines, integrations and transformations to support analytics and machine learning applications and solutions as part of an assigned product team using various open-source programming languages and vended software to meet the desired design functionality for products and programs. The position requires maintaining an understanding of the organization's current solutions, coding languages, tools, and regularly requires the application of independent judgment. Will provide consultative services to departments/divisions and leadership committees. Demonstrated experience designing, building, and operating large-scale healthcare data platforms and data ecosystems, including the movement, transformation, and optimization of structured and unstructured clinical, operational, and research data across on-premises and cloud environments. Candidate will partner with product owners, clinical stakeholders and AI/ML experts to identify and retrieve data, conduct exploratory analysis, pipeline and transform data to support the creation of agentic systems and the build of state-of-the-art multi-modal foundation models. Candidate will provide technical leadership in architecting scalable, cost-efficient data solutions, optimizing data movement and storage strategies, and ensuring secure, compliant access to healthcare data assets across hybrid and multi-cloud environments.
This is a full-time remote position within the United States. 
Mayo Clinic will not sponsor or transfer visas for this position including F1 OPT STEM.

Qualifications

A Bachelor's degree in a relevant field such as engineering, mathematics, computer science, information technology, health science, or other analytical/quantitative field and a minimum of seven years of professional or research experience in data visualization, data engineering, analytical modeling techniques; OR an Associate's degree in a relevant field such as engineering, mathematics, computer science, information technology, health science, or other analytical/quantitative field and a minimum of nine years of professional or research experience in data visualization, data engineering, analytical modeling techniques. In-depth business or practice knowledge will also be considered. 

Incumbent must have the ability to manage a varied workload of projects with multiple priorities and stay current on healthcare trends and enterprise changes. Interpersonal skills, time management skills, and demonstrated experience working on cross functional teams are required. Requires strong analytical skills and the ability to identify and recommend solutions and a commitment to customer service. The position requires excellent verbal and written communication skills, attention to detail, and a high capacity for learning and problem resolution. Advanced experience in SQL is required. Advanced Experience in scripting languages such as Python, JavaScript, PHP, C++ or Java & API integration is required. Experience in hybrid data processing methods (batch and streaming) such as Apache Spark, Hive, Pig, Kafka is required. Experience with big data, statistics, and machine learning is required. The ability to navigate linux and windows operating systems is required. Knowledge of workflow scheduling (Apache Airflow Google Composer), Infrastructure as code (Kubernetes, Docker) CI/CD (Jenkins, Github Actions) is required. Experience in DataOps/DevOps and agile methodologies is required. Experience with hybrid data virtualization such as Denodo is preferred. Working knowledge of Tableau, Power BI, SAS, ThoughtSpot, DASH, d3, React, Snowflake, SSIS, and Google Big Query is preferred. 
Preferred qualifications:
•   An advanced degree is preferred. 
•   Strong healthcare data knowledge including electronic health records (EHR), clinical, operational, imaging, genomic, and research data domains, as well as familiarity with healthcare interoperability standards such as HL7, FHIR, DICOM, OMOP, and related healthcare data models.

• Demonstrated experience designing and optimizing large-scale data movement, integration, and transformation solutions involving terabyte- to petabyte-scale datasets, with consideration for performance, scalability, reliability, and cost efficiency.

• Experience architecting and supporting hybrid data platforms spanning cloud and on-premises environments, including data residency, security, governance, and compliance requirements.

• Experience with multiple cloud platforms such as Google Cloud Platform (GCP), Amazon Web Services (AWS), and Microsoft Azure, including cloud-native data engineering services and cross-cloud data integration patterns.

• Experience evaluating and optimizing data transfer, storage, and compute costs while meeting performance, availability, and service-level objectives.

• Knowledge of healthcare data governance, data quality frameworks, master data management, metadata management, and regulatory requirements including HIPAA and related healthcare privacy standards.

• Experience supporting AI/ML, generative AI, and foundation model initiatives through the development of scalable, high-quality data pipelines and data products.

• Demonstrated ability to provide technical leadership and architectural guidance for enterprise-scale data engineering initiatives.

About Us
Why Mayo Clinic

Mayo Clinic is top-ranked in more specialties than any other care provider according to U.S. News & World Report. As we work together to put the needs of the patient first, we are also dedicated to our employees, investing in competitive compensation and comprehensive benefit plans – to take care of you and your family, now and in the future. And with continuing education and advancement opportunities at every turn, you can build a long, successful career with Mayo Clinic.

Benefits Highlights
  • Medical: Multiple plan options.
  • Dental: Delta Dental or reimbursement account for flexible coverage.
  • Vision: Affordable plan with national network.
  • Pre-Tax Savings: HSA and FSAs for eligible expenses.
  • Retirement: Competitive retirement package to secure your future.
About the Team
Just as our reputation has spread beyond our Minnesota roots, so have our locations. Today, our employees are located at our three major campuses in Phoenix/Scottsdale, Arizona, Jacksonville, Florida, Rochester, Minnesota, and at Mayo Clinic Health System campuses throughout Midwestern communities, and at our international locations. Each Mayo Clinic location is a special place where our employees thrive in both their work and personal lives. Learn more about what each unique Mayo Clinic campus has to offer, and where your best fit is. 

Equal Opportunity

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender identity, sexual orientation, national origin, protected veteran status or disability status. Learn more about the "EOE is the Law".  Mayo Clinic participates in E-Verify and may provide the Social Security Administration and, if necessary, the Department of Homeland Security with information from each new employee's Form I-9 to confirm work authorization.

Skills Required

  • Bachelor's degree in relevant field and minimum seven years of related experience OR Associate's degree and nine years of related experience
  • Authorized to work in the United States without visa sponsorship (Mayo Clinic will not sponsor or transfer visas, including F1 OPT STEM)
  • Ability to manage varied workload, prioritize multiple projects, and stay current on healthcare trends
  • Strong interpersonal, time management, verbal and written communication skills, attention to detail
  • Advanced experience in SQL
  • Advanced experience in scripting languages such as Python, JavaScript, PHP, C++ or Java and API integration
  • Experience in hybrid data processing methods (batch and streaming) such as Apache Spark, Hive, Pig, Kafka
  • Experience with big data, statistics, and machine learning
  • Ability to navigate Linux and Windows operating systems
  • Knowledge of workflow scheduling (Apache Airflow, Google Composer)
  • Familiarity with containerization/orchestration (Docker, Kubernetes) and Infrastructure as Code practices
  • CI/CD experience (Jenkins, GitHub Actions)
  • Experience with DataOps/DevOps practices and agile methodologies
  • Provide consultative services and technical leadership/architectural guidance for enterprise-scale data engineering initiatives
  • Experience designing and optimizing large-scale data movement, integration, and transformation solutions for terabyte- to petabyte-scale datasets
  • Experience architecting and supporting hybrid data platforms spanning cloud and on-premises environments (data residency, security, governance, compliance)
  • Experience with cloud platforms (GCP, AWS, Azure) and cloud-native data engineering services
  • Experience evaluating and optimizing data transfer, storage, and compute costs
  • Knowledge of healthcare data governance, data quality frameworks, master data management, metadata management, and HIPAA/privacy standards
  • Experience supporting AI/ML, generative AI, and foundation model initiatives through scalable data pipelines
  • Experience with hybrid data virtualization such as Denodo
  • Working knowledge of Tableau, Power BI, SAS, ThoughtSpot, DASH, d3, React, Snowflake, SSIS, Google BigQuery
  • Advanced degree (MS/PhD) in relevant field

Mayo Clinic Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Mayo Clinic and has not been reviewed or approved by Mayo Clinic.

  • Retirement Support A no-cost pension plus an employer-matched 403(b)/401(k) is positioned as a standout differentiator, offering strong long-term financial security. Feedback suggests this retirement combination elevates overall total rewards even when base pay is moderate.
  • Healthcare Strength Expanded medical networks, enhanced fertility coverage, and employer absorption of a plan year’s premium increases point to robust healthcare offerings. Feedback suggests annual updates maintain breadth and competitiveness of coverage.
  • Parental & Family Support Adoption assistance, dependent scholarships, child and elder-care resources, and EAP services provide meaningful family-oriented support. Feedback suggests these programs add tangible value beyond salary alone.

Mayo Clinic Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Rochester, MN
54,000 Employees

What We Do

Mayo Clinic is the first and largest integrated, not-for-profit medical group practice in the world. Doctors from every medical specialty work together to care for patients, joined by common systems and a philosophy of "the needs of the patient come first."​ More than 3,800 physicians and scientists and 50,900 allied health staff work at Mayo Clinic, which has sites in Rochester, Minn., Jacksonville, Fla., and Scottsdale/Phoenix, Ariz. Mayo Clinic also serves over 70 communities through Mayo Clinic Health System with locations in MN, IA, and WI. Collectively, these locations care for more than 1 million people each year.

Similar Jobs

Tapestry - Coach and Kate Spade Logo Tapestry - Coach and Kate Spade

Store Manager

eCommerce • Fashion • Retail • Sales • Wearables • Design
Remote or Hybrid
14 Locations
16000 Employees
62K-94K Annually

Samsara Logo Samsara

Operations Analyst

Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
United States
4000 Employees
89K-134K Annually

Crunchyroll Logo Crunchyroll

Senior Manager, CRM Marketing, APAC

Digital Media • eCommerce • Gaming • Mobile • News + Entertainment
Remote or Hybrid
21 Locations
1300 Employees

Collectors Logo Collectors

Director Of Engineering

Consumer Web • eCommerce • Machine Learning • Software • Sports • Analytics
Remote or Hybrid
2 Locations
2246 Employees
212K-300K Annually

Similar Companies Hiring

Camber Thumbnail
Fintech • Healthtech • Social Impact
New York, New York
90 Employees
Sailor Health Thumbnail
Healthtech • Social Impact • Telehealth
New York City, NY
20 Employees
Granted Thumbnail
Mobile • Insurance • Healthtech • Financial Services • Artificial Intelligence
New York, New York
23 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account