Manager Site Reliability Engineering

Alpharetta, GA
In-Office
136K-253K Annually
Senior level
Artificial Intelligence • Healthtech • Information Technology • Other • Analytics
The Role
The role involves leading multiple Site Reliability Engineering teams, implementing best practices, and driving cloud reliability, automation, and performance initiatives.
Summary Generated by Built In

Are you an experienced Site Reliability Engineering leader ready to shape strategy, inspire teams, and drive innovation at scale?

Are you looking to lead a high-impact SRE team where your leadership will directly influence innovation, reliability, and engineering excellence across the organization?
 

LexisNexis® Risk Solutions provides customers with innovative technologies, information-based analytics, decisioning tools and data management services that help them solve problems, make better decisions, stay compliant, reduce risk and improve operations. Headquartered in metro Atlanta, Georgia, it operates within the Risk market segment of RELX, a global provider of information-based analytics and decision tools for professional and business customers.

About the role: This is an advanced management-level role. You will manage multiple SRE teams within a single product group. You will ensure teams work in alignment with the SRE framework, including leading sustainable incident response, blameless post-mortems, and production reliability improvement projects. You will mentor other team members on SRE practices and cultivate innovation and collaboration across multiple teams. You will also manage delivery of, and may provide input to, strategy and departmental plans.

About the team: This role is part of the Business Systems SRE team within LexisNexis Risk Solutions Group. As an SRE Manager, you will act as a technical and strategic leader, partnering with engineering and business stakeholders to drive cloud reliability, automation, observability, and performance initiatives across critical platforms. This role combines technical depth with managerial acumen, including leading Proof-of-Concept (PoC) initiatives, guiding teams, and aligning SRE outcomes with leadership expectations and business goals.

Responsibilities:

  • Managing high-performance SRE teams, ideally in multiple countries. We are not looking for an individual contributor.
  • Promoting and implementing Site Reliability Engineering best practices and principles across product and platform teams
  • Architecting, implementing, and managing infrastructure using Infrastructure as Code (IaC) and DevOps principles
  • Designing and maintaining secure-by-default cloud-native systems with a focus on continuous improvement of security posture
  • Defining and enforcing SLA/SLI/SLO standards for production systems
  • Developing and maintaining automated frameworks for provisioning, deployment, scaling, and monitoring
  • Conducting in-depth troubleshooting of complex production issues across application, infrastructure, and network layers
  • Leading proof-of-concept efforts to evaluate and introduce new technologies
  • Implementing policy and compliance checks within CI/CD pipelines

Essential Skills & Experience:

  • Current and extensive experience managing teams of SREs. We are not looking to hire an individual contributor in this role.
  • Proficiency with at least one major public cloud provider (Azure or AWS)
  • Extensive experience with Terraform, Ansible, and other IaC/orchestration tools
  • Expertise in Kubernetes (AKS/EKS/GKE), containerized workloads, and deployment strategies (e.g., blue-green)
  • Deep knowledge of Linux and Windows server environments
  • Proven experience in building and enforcing automation frameworks for CI/CD and infrastructure provisioning
  • Hands-on experience with observability platforms such as Grafana, Kibana, Splunk, ELK Stack (Elasticsearch, Logstash, Kibana), OpenTelemetry, Prometheus, Loki
  • Strong knowledge of SLAs, SLIs, and SLOs and their application in production environments
  • Experience with monitoring, alerting, and logging best practices
  • Solid understanding of cloud-native security, identity management, and secrets management (e.g., HashiCorp Vault)
  • Skilled in scripting and programming (e.g., Python, Bash, Golang, PowerShell, C#)
  • Strong knowledge of networking, application performance tuning, and troubleshooting
  • Familiarity with common CI/CD and version control tools (e.g., Git, GitLab, GitHub, Jenkins)
U.S. National Base Pay Range: $136,100 - $252,800. Geographic differentials may apply in some locations to better reflect local market rates. This job is eligible for an annual incentive bonus.

We know your well-being and happiness are key to a long and successful career. We are delighted to offer country-specific benefits. Click here to access benefits specific to your location.

We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or by calling 1-855-833-5120.

Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here.

Please read our Candidate Privacy Policy.

We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.

USA Job Seekers:

EEO Know Your Rights.

Top Skills

Ansible
AWS
Azure
Bash
C#
ELK Stack
Go
Grafana
Kibana
Kubernetes
Linux
Loki
OpenTelemetry
PowerShell
Prometheus
Python
Splunk
Terraform
Windows
The Company
0 Employees
Year Founded: 1880

What We Do

Elsevier is a world-leading provider of information solutions that enhance the performance of science, health, and technology professionals, empowering them to make better decisions, and deliver better care.

Because informed decisions lead to better outcomes, Elsevier is a leader in information and analytics for customers across the global research and health ecosystems.

Elsevier helps researchers and healthcare professionals advance science and improve health outcomes for the benefit of society.

We do this by facilitating insights and critical decision-making for customers across the global research and health ecosystems.
