Role Summary
As an AI and Analytics Data Engineer , you will be part of the Data Science Industrialization team charged with building and automating high quality data science pipelines that power key business applications with advanced analytics/AI/ML. You will be a member of a global team that defines and maintains ML Ops best practices and deploys and maintains production analytics and data science modeling workflows.
Role Responsibilities
- Convert data/ML pipelines into scalable pipelines based on the infrastructure available (e.g. convert Python based data science code into PySpark/SQL for scalable pushdown execution)
- Enable production models across the ML lifecycle
- Determine model performance metrics and implement monitoring dashboards
- Determine and implement model retraining trigger mechanisms
- Design champion/challenger model and A/B testing automation
- Implement CI/CD orchestration for data science pipelines
- Manage the production deployments and post-deployment model lifecycle management activities: drift monitoring, model retraining, and model technical evaluation & business validation
- Work with stakeholders to assist with ML pipeline -related technical issues and support modeling infrastructure needs
- Partner with C4 Data team to integrate developed ML pipelines into enterprise-level analytics data products where appropriate
- Partner with C4 Platforms team on continuous development and end to end capability integration between OOB platforms and internal engineered components (API registry, ML library / workflow management, enterprise connectors); Performance and resource optimization of managed pipelines and models
Qualifications
Must-Have
- Bachelor's degree in ML engineering related area (Data Science, Computer Engineering, Computer Science, Information Systems, Engineering or a related discipline)
- 5-10 years of work experience in Data science, Analytics, or Engineering for a diverse range of projects
- Understanding of data science development lifecycle (CRISP)
- Strong hands-on skills in ML engineering and data science (e.g., Python, R, SQL, industrialized ETL software)
- Experience working in a cloud based analytics ecosystem (AWS, Snowflake, etc)
- Highly self-motivated to deliver both independently and with strong team collaboration
- Ability to creatively take on new challenges and work outside comfort zone
- Strong English communication skills (written & verbal)
Nice-to-Have
- Advanced degree in Data Science, Computer Engineering, Computer Science, Information Systems or related discipline
- Experience with data science enabling technology, such as Data Science Studio or other data science platforms
- Hands on experience working in Agile teams, processes, and practices
- Understanding of MLOps principles and tech stack (e.g. MLFlow)
- Experience in CI/CD integration (e.g. Git Hub, Git Hub Action or Jenkins)
- Experience in software/product engineering
- Strong hands-on skills for data and machine learning pipeline orchestration via Dataiku (DSS 9 or 10) platform
- Pharma & Life Science commercial functional knowledge
- Pharma & Life Science commercial data literacy
- Experience with Dataiku Science Studio
Work Location Assignment: Hybrid
Pfizer is an equal opportunity employer and complies with all applicable equal employment opportunity legislation in each jurisdiction in which it operates.
Information & Business Tech
Top Skills
What We Do
Our purpose ensures that patients remain at the center of all we do. We live our purpose by sourcing the best science in the world; partnering with others in the healthcare system to improve access to our medicines; using digital technologies to enhance our drug discovery and development, as well as patient outcomes; and leading the conversation to advocate for pro-innovation/pro-patient policies.
Why Work With Us
We are the inventors, the problem solvers, the big thinkers — those who surmount any hurdle to deliver breakthrough medicines to the people who are counting on them the most.
Gallery
Pfizer Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.







