At Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters.
The PositionDESCRIPTIONWe are looking for a Principal Data Engineer to join our growing team of Advanced Data Analytics experts. This role calls for a seasoned professional with deep expertise in data engineering, data modeling, and analytics, along with strong technical leadership and architectural thinking.
The ideal candidate thrives on solving complex data challenges—ensuring data quality, integrity, and scalability—and driving data initiatives that enable analytics, machine learning, and evidence-based decision-making. The Data Engineer will work closely with System and Enterprise Architects, Technical Project Managers, Product Owners, and Software Engineers on data-driven initiatives, supporting the data needs of multiple teams, systems, and products.
The perfect fit for this role is self-motivated, collaborative, and enthusiastic about advancing Roche’s next generation of products and data initiatives. As Roche advances toward AI-enabled diagnostics, hands-on experience with Generative AI (GenAI) POCs will be considered a plus, especially in leveraging data assets for automation, insight generation, and feature engineering.
Location- Baner, Pune
KEY RESPONSIBILITIESDesign, implement, and optimize data architectures and pipelines using AWS services for scalable, high-performance, and reliable data systems.
Lead data modeling, exploratory data analysis (EDA), and transformation activities, ensuring high-quality, well-structured data for analytics and machine learning.
Collaborate with data scientists to support feature extraction and data preparation for AI/ML models.
Partner with software engineers and architects to integrate data-driven capabilities into Roche’s SaaS and digital health solutions.
Ensure data governance, lineage, and compliance across cloud environments.
Stay current with advancements in data engineering, analytics, and emerging AI/GenAI trends, evaluating their applicability to Roche’s data platforms.
Present data strategies, insights, and technical solutions clearly to both technical and business audiences.
12–15 years of experience in data engineering and analytics.
Expertise in AWS services (e.g., S3, Redshift, Glue, Athena, EMR).
Strong in data modeling, SQL, Python, and distributed frameworks like Apache Spark.
Hands-on experience in exploratory data analysis (EDA), data quality improvement, and feature engineering.
Experience developing SaaS and cloud-native data applications.
Sound understanding of software architecture patterns and agile methodologies.
Proven ability to work closely with cross-functional teams and drive data-centric initiatives.
Excellent communication skills to bridge technical and business discussions.
Experience in the Healthcare / Diagnostics domain is a plus.
Familiarity with healthcare data standards (HIPAA, HL7, FHIR).
Experience designing data lakes, data warehouses, or data mesh architectures.
Strong knowledge of data governance, security, and compliance in cloud ecosystems.
Generative AI (GenAI) Experience: Hands-on exposure to GenAI, including POCs for data summarization, feature extraction, or AI-driven insights, leveraging LLMs and AI pipelines in production or experimental setups.
A healthier future drives us to innovate. Together, more than 100’000 employees across the globe are dedicated to advance science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact.
Let’s build a healthier future, together.
Roche is an Equal Opportunity Employer.
Top Skills
What We Do
Roche is a global pioneer in pharmaceuticals and diagnostics focused on advancing science to improve people’s lives. The combined strengths of pharmaceuticals and diagnostics under one roof have made Roche the leader in personalised healthcare – a strategy that aims to fit the right treatment to each patient in the best way possible.
Roche is the world’s largest biotech company, with truly differentiated medicines in oncology, immunology, infectious diseases, ophthalmology and diseases of the central nervous system. Roche is also the world leader in in vitro diagnostics and tissue-based cancer diagnostics, and a frontrunner in diabetes management.
Founded in 1896, Roche continues to search for better ways to prevent, diagnose and treat diseases and make a sustainable contribution to society. The company also aims to improve patient access to medical innovations by working with all relevant stakeholders. Thirty medicines developed by Roche are included in the World Health Organization Model Lists of Essential Medicines, among them life-saving antibiotics, antimalarials and cancer medicines. Roche has been recognised as the Group Leader in sustainability within the Pharmaceuticals, Biotechnology & Life Sciences Industry ten years in a row by the Dow Jones Sustainability Indices (DJSI).








