Elsevier

Manager Site Reliability Engineering

Reposted 3 Days Ago

Be an Early Applicant

Alpharetta, GA

In-Office

136K-253K Annually

Senior level

Artificial Intelligence • Healthtech • Information Technology • Other • Analytics

The Role

The role involves leading multiple Site Reliability Engineering teams, implementing best practices, and driving cloud reliability, automation, and performance initiatives.

Summary Generated by Built In

Are you an experienced Site Reliability Engineering leader ready to shape strategy, inspire teams, and drive innovation at scale?

Are you looking to lead a high-impact SRE team where your leadership will directly influence innovation, reliability, and engineering excellence across the organization?

LexisNexis® Risk Solutions provides customers with innovative technologies, information-based analytics, decisioning tools and data management services that help them solve problems, make better decisions, stay compliant, reduce risk and improve operations. Headquartered in metro-Atlanta, Georgia it operates within the Risk market segment of RELX, a global provider of information-based analytics and decision tools for professional and business customers.

About the role, this is an advanced management level role. Individuals are required to manage multiple SRE teams within a single product group. You will ensure teams are working in alignment with the SRE framework, including leading sustainable incident response, blameless post-mortems, and production reliability improvement projects. You will mentor other team members on SRE practices and cultivate innovation and collaboration across multiple teams. Manages delivery of and may provide input to strategy and departmental plans.

About the team, this role is part of the Business Systems SRE team within LexisNexis Risk Solutions Group. As a SRE Manager, you will act as a technical and strategic leader, partnering with engineering and business stakeholders to drive cloud reliability, automation, observability, and performance initiatives across critical platforms. This role combines technical depth with managerial acumen, including leading Proof-of-Concept (PoC) initiatives, guiding teams, and aligning SRE outcomes with leadership expectations and business goals.

Responsibilities:

Managing high performance SRE teams ideally in multiple counties. We are not looking for an individual contributor.
Promoting and implementing Site Reliability Engineering best practices and principles across product and platform teams
Architecting, implementing, and managing infrastructure using Infrastructure as Code (IaC) and DevOps principles
Designing and maintaining secure-by-default cloud-native systems with a focus on continuous improvement of security posture
Defining and enforcing SLA/SLI/SLO standards for production systems
Developing and maintaining automated frameworks for provisioning, deployment, scaling, and monitoring
Conducting in-depth troubleshooting of complex production issues across application, infrastructure, and network layers
Leading proof-of-concept efforts to evaluate and introduce new technologies
Implement policy and compliance checks within CI/CD pipelines

Essential Skills & Experience:

Current and extensive experience managing teams of SRE’s. We are not looking to hire an individual contributor in this role.
Proficiency with at least one major public cloud provider: Azure, AWS
Extensive experience with Terraform, Ansible, and other IaC/orchestration tools
Expertise in Kubernetes (AKS/EKS/GKE), containerized workloads, and deployment strategies (e.g., Blue Green)
Deep knowledge of Linux and Windows server environments
Proven experience in building and enforcing automation frameworks for CI/CD and infrastructure provisioning
Hands-on experience with observability platforms such as Grafana, Kibana, Splunk, ELK Stack (Elasticsearch, Logstash, Kibana), OpenTelemetry, Prometheus, Loki
Strong knowledge of SLAs, SLIs, and SLOs and their application in production environments
Experience with monitoring, alerting, and logging best practices
Solid understanding of cloud-native security, identity management, and secrets management (e.g., HashiCorp Vault)
Skilled in scripting and programming (e.g., Python, Bash, Golang, PowerShell, C#)
Strong knowledge of networking, application performance tuning, and troubleshooting
Familiarity with common CI/CD and version control tools (e.g., Git, GitLab, GitHub, Jenkins)

U.S. National Base Pay Range: $136,100 - $252,800. Geographic differentials may apply in some locations to better reflect local market rates. This job is eligible for an annual incentive bonus.

We know your well-being and happiness are key to a long and successful career. We are delighted to offer country specific benefits. Click here to access benefits specific to your location.

We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-855-833-5120.

Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here.

Please read our Candidate Privacy Policy.

We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.

USA Job Seekers:

EEO Know Your Rights.

Top Skills

Ansible

AWS

Azure

Bash

Elk Stack

Grafana

Kibana

Kubernetes

Linux

Loki

Opentelemetry

Powershell

Prometheus

Python

Splunk

Terraform

Windows

View all jobs at Elsevier

View Elsevier Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

0 Employees

Year Founded: 1880

What We Do

Elsevier is a world-leading provider of information solutions that enhance the performance of science, health, and technology professionals, empowering them to make better decisions, and deliver better care.

Because informed decisions lead to better outcomes, Elsevier is a leader in information and analytics for customers across the global research and health ecosystems.

Elsevier helps researchers and healthcare professionals advance science and improve health outcomes for the benefit of society.

We do this by facilitating insights and critical decision-making for customers across the global research and health ecosystems.