ML Ops & Observability Engineer

Sorry, this job was removed at 06:08 p.m. (CST) on Friday, May 01, 2026
2 Locations
Hybrid
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
We’re in relentless pursuit of breakthroughs that change patients’ lives.
The Role
Use Your Power for Purpose
At Pfizer, technology drives everything we do. You will play a pivotal role in implementing impactful and innovative technology solutions across all functions, from research to manufacturing. Whether you are digitizing drug discovery and development, identifying innovative solutions, or streamlining our processes, you will be making a significant impact on countless lives.
What You Will Achieve
MLOps Platform Execution & Model Operations
  • Lead the design, implementation, and operation of MLOps platforms supporting model development, deployment, monitoring, and lifecycle management.
  • Own production workflows for:
    • Model packaging and deployment
    • Versioning and rollback
    • Promotion across environments (dev/test/prod)
  • Implement standardized CI/CD pipelines for ML workloads, integrating with enterprise DevOps and infrastructure platforms.
  • Partner with infrastructure and DataOps teams to ensure ML workloads run on secure, scalable, and cost-effective cloud-native environments (AWS/Azure).
  • Translate Director-level AI platform strategy into reliable, repeatable ML operational capabilities.

Model, Data & System Observability
  • Own end-to-end observability for ML systems, spanning:
    • Model performance and behavior
    • Data quality and drift
    • Pipeline health and system reliability
  • Implement and operate observability tooling using:
    • OpenTelemetry for distributed tracing
    • Metrics and dashboards (Prometheus, Grafana)
    • Logs and analytics (ELK or equivalent)
  • Define and track ML-specific reliability signals, such as:
    • Model performance degradation
    • Data drift and feature anomalies
    • Prediction latency and failure rates
  • Establish SLOs and alerting strategies for ML services in production.

Testing, Validation & Responsible AI Enablement
  • Ensure testing and validation are embedded throughout the ML lifecycle, including:
    • Model validation and regression testing
    • Data and feature consistency checks
    • Deployment verification and rollback testing
  • Integrate automated ML testing and quality gates into CI/CD pipelines.
  • Support non-functional testing for ML systems, including:
    • Performance and scalability testing
    • Reliability and resilience testing
    • Security and access validation
  • Partner with AI, data, and compliance teams to support responsible and compliant AI operations, including auditability, traceability, and explainability hooks (where required).

AI Platform Enablement & Cross‑Team Collaboration
  • Enable data scientists and ML engineers to move models from experimentation to production efficiently and safely.
  • Provide reusable tooling, templates, and paved paths for:
    • Experiment tracking
    • Model registry usage
    • Deployment and monitoring patterns
  • Collaborate closely with:
    • Infrastructure Engineering (runtime, scaling, security)
    • DataOps Engineering (data pipelines, feature stores, data quality)
    • Product and analytics leaders to align ML capabilities to business outcomes.

Reliability, Incident Management & Continuous Improvement
  • Own operational reliability for ML platforms and services.
  • Lead response to ML-related production incidents, including:
    • Model failures or degradations
    • Data drift-driven issues
    • Pipeline or inference outages
  • Conduct post-incident reviews and drive systemic improvements.
  • Continuously improve MLOps maturity using SRE-inspired practices and metrics.

People Leadership & Engineering Ways of Working
  • Set clear expectations for operational ownership, quality, and delivery.
  • Coach engineers on:
    • MLOps best practices
    • Observability and reliability mindset
    • Secure and compliant AI operations
  • Establish strong engineering discipline through design reviews, runbooks, documentation, and continuous learning.
  • Act as the primary execution partner to the Director-level Commercial AI Analytics Solutions & Engineering Lead for ML operations and observability.

Here Is What You Need (Minimum Requirements)
  • 8+ years of experience in ML engineering, MLOps, platform engineering, or related roles, with 3+ years of people leadership.
  • Strong hands-on experience operationalizing ML systems in AWS or Azure environments.
  • Proven expertise in:
    • MLOps pipelines and tooling (experiment tracking, model registry, deployment, monitoring)
    • CI/CD for ML workloads (e.g., GitHub Actions or equivalent)
    • Containerized and cloud-native ML runtimes
  • Solid understanding of testing and validation for ML systems, including:
    • Model regression and performance testing
    • Data and feature validation
    • Deployment and rollback verification
  • Strong experience implementing observability and reliability practices using tools such as OpenTelemetry, Prometheus, Grafana, and ELK.
  • Demonstrated experience with DevSecOps and secure SDLC for AI/ML systems, including secrets management and access controls.
  • Proficiency in programming and scripting (e.g., Python, Bash, SQL; familiarity with ML frameworks).
  • Strong communication and collaboration skills; ability to deliver outcomes through teams and influence cross-functionally.

Bonus Points If You Have (Preferred Requirements)
  • Master's degree in Computer Science, Data Science, AI/ML, or related field.
  • Experience with MLOps platforms and tools (e.g., MLflow, Kubeflow, feature stores).
  • Background in data drift detection, model monitoring, and ML reliability engineering.
  • Familiarity with responsible AI, governance, or regulated environments.
  • Relevant certifications:
    • AWS/Azure Professional
    • Kubernetes (CKA/CKAD)
    • Cloud security or data/AI platform certifications

Work Location Assignment: Hybrid
Pfizer is an equal opportunity employer and complies with all applicable equal employment opportunity legislation in each jurisdiction in which it operates.
Information & Business Tech

Pfizer Compensation & Benefits Highlights

  • Healthcare Strength: Health coverage includes comprehensive medical with robust mental‑health networks, plus dental and vision options, and coverage for infertility/family‑building and transgender‑affirming care. Recent U.S. summaries name mental‑health partners and outline multiple plan choices.
  • Retirement Support: The retirement program provides a 401(k) with company match plus an additional employer Retirement Savings Contribution, along with financial‑planning support and company‑paid life and disability insurance. These elements are highlighted as part of the core U.S. package.
  • Parental & Family Support: Parental leave is described as up to 26 weeks in the U.S. when combining paid non‑medical parental leave with medical recovery where applicable, with exact pay and weeks dependent on circumstances and plan elections. Family‑building support includes egg preservation, adoption, and surrogacy coverage.

The Company
HQ: New York, NY
121,990 Employees
Year Founded: 1848

What We Do

Our purpose ensures that patients remain at the center of all we do. We live our purpose by sourcing the best science in the world; partnering with others in the healthcare system to improve access to our medicines; using digital technologies to enhance our drug discovery and development, as well as patient outcomes; and leading the conversation to advocate for pro-innovation/pro-patient policies.

Why Work With Us

We are the inventors, the problem solvers, the big thinkers — those who surmount any hurdle to deliver breakthrough medicines to the people who are counting on them the most.

Pfizer Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: 2.5 days a week
HQ: Hudson Yards
Provincia de Buenos Aires
Andover, MA
Athens, GR
Chennai, IN
Collegeville, PA
Durham, NC
Groton, CT
Madison, NJ
Madrid, ES
Mumbai, Maharashtra
Rochester, MI
San Diego, CA
Seattle, WA
Tampa, FL
Center for Digital Innovation