Senior Manager Site Reliability Engineering

Job Posted 22 Days Ago Posted 22 Days Ago
Be an Early Applicant
Richardson, TX
118K-261K Annually
Senior level
Fitness • Healthtech • Retail • Pharmaceutical
The Role
The Senior Manager of SRE will lead a team to ensure system reliability and performance, manage incident processes, design infrastructure solutions, and implement observability tools while mentoring team members.
Summary Generated by Built In

At CVS Health, we’re building a world of health around every consumer and surrounding ourselves with dedicated colleagues who are passionate about transforming health care.

As the nation’s leading health solutions company, we reach millions of Americans through our local presence, digital channels and more than 300,000 purpose-driven colleagues – caring for people where, when and how they choose in a way that is uniquely more connected, more convenient and more compassionate. And we do it all with heart, each and every day.

Position Summary

As a Senior Manager of Site Reliability Engineering (SRE) at CVS Health, you will lead a team of SREs responsible for ensuring the reliability, availability, and performance of our critical systems and services. This is a high performing integration platform which processes about 6 billion Transactions every month. You will collaborate with cross-functional teams to design, implement, and maintain scalable and resilient infrastructure solutions that support our business objectives. Your leadership will drive the adoption of best practices in site reliability, incident management, and continuous improvement.

As a Senior Manager of Site Reliability Engineering (SRE) you will

  • Lead and mentor a team of Site Reliability Engineers, fostering a culture of collaboration, innovation, and continuous learning
  • Ensure the availability, reliability, and performance of critical services through proactive monitoring, capacity planning, and performance tuning.
  • Design, implement, and maintain observability solutions using tools such as AppDynamics, Splunk, Prometheus, Grafana, or Open Telemetry.
  • Collaborate with software engineering, operations, and product teams to design and deploy scalable and resilient systems
  • Oversee incident management processes, ensuring timely resolution of incidents and minimizing downtime
  • Establish and monitor key performance indicators (KPIs) to measure system reliability and performance
  • Conduct post-incident reviews and implement lessons learned to prevent future occurrences
  • Stay current with industry trends and emerging technologies to continuously improve SRE practices
  • Manage budgets and resources effectively to support SRE initiatives and projects
  • Incident Management: Lead incident response efforts, perform root cause analysis (RCA), and drive post-mortem processes to improve system reliability
  • Automation & Infrastructure as Code (IaC): Develop automation to reduce manual operational tasks using Terraform, Ansible, or Kubernetes 
  • CI/CD & Deployment Pipelines: Work closely with development teams to enhance deployment strategies and improve continuous integration/continuous deployment (CI/CD) workflows
  • Cloud & Kubernetes Operations: Manage and optimize cloud infrastructure (AWS, Azure, or GCP) and container orchestration platforms (Kubernetes, Docker)
  • Security & Compliance: Implement best practices for security, compliance, and cost optimization in cloud environments 

Required Qualifications

  • 7+ years of experience in site reliability engineering, DevOps, or a related field
  • 5+ years of experience of cloud computing platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes, Docker)
  • 3+ years of experience in a leadership or management role, with a proven track record of managing high-performing teams
  • 3+ years of experience in scripting and programming languages (e.g., Python, Go, Java)
  • 3+ years of experience in monitoring and observability tools (e.g., Prometheus, Grafana, Splunk)
  • Familiarity with CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI etc)
  • Excellent communication and interpersonal skills, with the ability to collaborate effectively across teams
  • Strong problem-solving skills and a proactive approach to identifying and addressing issues
  • Ability to thrive in a fast-paced, dynamic environment and manage multiple priorities
  • Experience with Agile methodologies and DevOps practices

Preferred Qualifications

  • Ability to multi-task and rapid context switch between Applications, programs, and architecture initiatives
  • Ability to assess the impact of architecture changes on the business, application relationships and information flow
  • Strong understanding of SDLC - must have participated on many projects through complete lifecycles (requirements, design, development, testing, launch)
  • Strong ability to facilitate collaboration among senior technical team members and senior business leaders
  • Strong organizational, leadership and consensus building skills; ability to motivate and lead teams in matrix organization
  • Excellent interpersonal and communication skills to work with all levels
  • A strong base of experience in many disciplines of information technology, including operating systems, systems management and development tools, application program interfaces (APIs), database management systems, development methodologies, transaction processing monitors, messaging software, security, directory services, hardware, telecommunications, interoperability techniques and standards, services monitoring and alerting
  • Experience in multiple technologies in stack (Data Power, IIB, Splunk) is a PLUS
  • Healthcare experience or big box retail experience is a significant plus and will be given utmost consideration
  • ITCAM/Splunk experience is a PLUS
  • Experience in delivering projects using Agile methodology in addition to waterfall is desirable
  • Agile/PM certifications are a PLUS
  • Experience with large-scale distributed systems using message queues, TPMs, or other related technologies in a mobile/portal environment
  • Experience om data architecture concepts and governance, including acting as a design authority for information and data within projects and programs is critical for this role
  • Mastery of design considerations for high volume transaction systems

Education

  • Bachelor’s degree in computer science engineering, or a related field; Master’s degree preferred

Pay Range

The typical pay range for this role is:

$118,450.00 - $260,590.00

This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls.  The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors.  This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above.  This position also includes an award target in the company’s equity award program. 
 

Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong.

Great benefits for great people

We take pride in our comprehensive and competitive mix of pay and benefits – investing in the physical, emotional and financial wellness of our colleagues and their families to help them be the healthiest they can be. In addition to our competitive wages, our great benefits include:

  • Affordable medical plan options, a 401(k) plan (including matching company contributions), and an employee stock purchase plan.

  • No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs, confidential counseling and financial coaching.

  • Benefit solutions that address the different needs and preferences of our colleagues including paid time off, flexible work schedules, family leave, dependent care resources, colleague assistance programs, tuition assistance, retiree medical access and many other benefits depending on eligibility.

For more information, visit https://jobs.cvshealth.com/us/en/benefits

We anticipate the application window for this opening will close on: 05/30/2025

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.

Top Skills

Ansible
Appdynamics
AWS
Azure
Docker
GCP
Gitlab Ci
Go
Grafana
Java
Jenkins
Kubernetes
Prometheus
Python
Splunk
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Woonsocket, RI
119,959 Employees
On-site Workplace
Year Founded: 1963

What We Do

CVS Health is the leading health solutions company that delivers care in ways no one else can. We reach people in more ways and improve the health of communities across America through our local presence, digital channels and our nearly 300,000 dedicated colleagues – including more than 40,000 physicians, pharmacists, nurses and nurse practitioners.

Wherever and whenever people need us, we help them with their health – whether that’s managing chronic diseases, staying compliant with their medications, or accessing affordable health and wellness services in the most convenient ways. We help people navigate the health care system – and their personal health care – by improving access, lowering costs and being a trusted partner for every meaningful moment of health. And we do it all with heart, each and every day.

Similar Jobs

4 Locations

IAC Group Logo IAC Group

Supplier Quality Engineer

Automotive • Industrial • Manufacturing
Arlington, TX, USA
Houston, TX, USA
68K-96K Annually

Citi Logo Citi

Senior Java Developer- (Hybrid)

Fintech • Financial Services
2 Locations
97K-145K Annually

Similar Companies Hiring

Mochi Health Thumbnail
Telehealth • Healthtech
San Francisco, CA
70 Employees
Cencora Thumbnail
Pharmaceutical • Logistics • Healthtech
Conshohocken, PA
46000 Employees
Stepful Thumbnail
Software • Healthtech • Edtech • Artificial Intelligence
New York, New York
60 Employees
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account