Get the job you really want.

Job Title, Company or Keyword

Location

Maximum of 25 job preferences reached.

Top Senior Site Reliability Engineer Jobs

Miris

Site Reliability Engineer - All Levels

Reposted 6 Days AgoSaved

Easy Apply

Remote

USA

Easy Apply

89K-287K Annually

Mid level

89K-287K Annually

Mid level

3D Printing • Artificial Intelligence • Software • Design

The role involves building reliable platforms for 3D/4D content delivery to AR/VR devices, monitoring system health, and improving operational practices in collaboration with the engineering team.

Top Skills: Aws FargateCoreweaveGrafanaKubernetesPrometheusTerraform

Okta

Staff Site Reliability Engineer, Kubernetes w/ active TS/SCI

Reposted 6 Days AgoSaved

In-Office

Washington, DC, USA

188K-235K Annually

Senior level

188K-235K Annually

Senior level

Cloud

The Staff Site Reliability Engineer will manage large-scale cloud production systems, ensuring reliability and performance, while automating processes and responding to incidents.

Top Skills: AWSBashCloudFormationDockerGoHelmKubernetesPythonRubyTerraform

Nubank

Staff SRE Engineer

Reposted 6 Days AgoSaved

Easy Apply

In-Office

Miami, FL, USA

Easy Apply

Senior level

Financial Services

As a Staff SRE Engineer, you'll lead the Data Infra team in improving reliability, architecture, and automation for the Data Platform while mentoring engineers.

Top Skills: AWSClojureDatomicEc2KubernetesLambdasScalaSparkStep Functions

ECS

Site Reliability Engineer (SRE) / Operations Engineer

7 Days AgoSaved

In-Office

2 Locations

145K-180K Annually

Senior level

145K-180K Annually

Senior level

Artificial Intelligence • Cloud • Information Technology • Security • Software

The Site Reliability Engineer ensures system reliability and performance, manages incidents, implements automation, and collaborates with teams for software delivery and operational readiness.

Top Skills: AutomationCloud ServicesInfrastructure-As-CodeObservability Tools

PIMCO

Java Site Reliability Engineer, Messaging Platforms

Reposted 7 Days AgoSaved

In-Office

Austin, TX, USA

175K-240K Annually

Senior level

175K-240K Annually

Senior level

Financial Services

The Staff Engineer will support and optimize messaging platforms, design solutions to improve operational efficiency, and collaborate with teams on business-focused solutions.

Top Skills: AmpsAWSEksFixJavaKafkaKubernetesLinuxMqSpringSQL

January

Senior SRE, Software Engineering

Reposted 12 Days AgoSaved

Hybrid

New York City, NY, USA

205K-225K Annually

Senior level

205K-225K Annually

Senior level

Artificial Intelligence • Fintech • Payments • Social Impact • Analytics • Financial Services • Automation

As a Senior SRE, you'll ensure reliable and scalable systems, develop observability solutions and infrastructure as code, and lead incident response efforts.

Top Skills: AWSCloudFormationDatadogElkPrometheusTerraform

Fidelity Investments

Principal, Site Reliability Engineer

Reposted 7 Days AgoSaved

In-Office

Merrimack, NH, USA

Senior level

Fintech

The Principal Site Reliability Engineer at Fidelity will enhance system reliability, manage large-scale infrastructures, and automate processes using various technologies.

Top Skills: AnsibleAWSCi/CdDatadogGrafanaJenkinsPythonTerraformYugabyte

MongoDB

Senior Site Reliability Engineer, Fleet Management

13 Days AgoSaved

Easy Apply

Remote or Hybrid

9 Locations

Easy Apply

127K-249K Annually

Senior level

127K-249K Annually

Senior level

Big Data • Cloud • Software • Database

Develop and maintain Kubernetes runtime environments, support developers, resolve critical issues, and participate in on-call rotations for production systems.

Top Skills: AWSAzureCert-ManagerCorednsCrdsCriCsiGatekeeperGCPGoHelmKubernetesKustomizeOperatorsPythonTerraform

Air Apps

Site Reliability Engineer (SRE)

Reposted 7 Days AgoSaved

In-Office

San Francisco, CA, USA

116K-200K Annually

Mid level

116K-200K Annually

Mid level

Information Technology • Mobile • Software

As a Site Reliability Engineer, you'll ensure system reliability and scalability, automate processes, optimize performance, and collaborate on system design.

Top Skills: AWSAzureBashCloudFormationDatadogDockerElkGoGoogle Cloud PlatformGrafanaHelmKubernetesNew RelicPrometheusPulumiPythonTerraform

People Finders

Site Reliability Engineer (SRE)

Reposted 7 Days AgoSaved

Easy Apply

Remote

California, USA

Easy Apply

100K-130K Annually

Mid level

100K-130K Annually

Mid level

AdTech • Big Data • eCommerce • Marketing Tech • Real Estate • Software

The Site Reliability Engineer will manage AWS infrastructure, optimize Kubernetes environments, build CI/CD pipelines, and enhance system security and performance.

Top Skills: AnsibleAWSBashCloudflareCloudwatchDockerGitlabGoGrafanaKubernetesPrometheusPythonTerraform

Coalition

Site Reliability Engineer II

Reposted 7 Days AgoSaved

Remote

Location, WV, USA

111K-163K Annually

Mid level

111K-163K Annually

Mid level

Insurance • Cybersecurity

The Site Reliability Engineer II will build and operate infrastructure, improve system reliability, and enhance developer tools while collaborating across teams using AWS, Terraform, and IaC principles.

Top Skills: AWSEcsGithub ActionsGoKafkaKinesisKubernetesPythonTerraform

Deutsche Bank

Deployment Site Reliability Engineer - Associate

Reposted 8 Days AgoSaved

In-Office

Centre, Green, OH, USA

85K-130K Annually

Entry level

85K-130K Annually

Entry level

Fintech • Financial Services

Responsible for network deployments, automation, and system monitoring. Collaborates with teams to enhance network design and performance, ensuring scalability and security.

Top Skills: AnsibleAristaBgpCiscoCloudFormationDatadogFortinetGitJSONJuniperLinuxMplsOspfPrometheusPythonStpTerraformUnixVxlanYaml

New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free

Also

Staff Site Reliability Engineer - Cloud Platform & Vehicle Telemetry

8 Days AgoSaved

In-Office

Palo Alto, CA, USA

200K-245K Annually

Expert/Leader

200K-245K Annually

Expert/Leader

Automotive

The Staff Site Reliability Engineer will optimize cloud-native systems for vehicle telemetry using Kubernetes and AWS, ensuring reliability and operational excellence through advanced observability and automation.

Top Skills: AirflowAWSCi/CdDatadogGrafanaGrpcJavaKafkaKinesisKubernetesPythonRestScalaTerraform

Visa Inc,

Staff Site Reliability Engineer

8 Days AgoSaved

Hybrid

Austin, TX, USA

132K-210K Annually

Senior level

132K-210K Annually

Senior level

Fintech • Information Technology • Payments

The Staff Platform Engineer is responsible for maintaining and improving cloud-native platforms, managing operations, ensuring reliability, and implementing automation, particularly on Azure while also supporting AWS environments.

Top Skills: AWSAzureKubernetesTerraform

DFIN

Senior Site Reliability Engineer - Cloud

13 Days AgoSaved

Remote or Hybrid

United States

Senior level

Fintech • Software

The Senior Site Reliability Engineer ensures fast, stable SaaS products through automation, collaboration, monitoring, and implementing AI tools to enhance performance and reliability.

Top Skills: Ai ToolsAnsibleAppdynamicsAWSAzureAzure DevopsBashC# .NetCosmosDatadogDynatraceHarnessJavaJenkinsKubernetesNew RelicPowershellPythonSaaSSQLTerraform

BAE Systems, Inc.

Senior Site Reliability Engineer

Reposted 13 Days AgoSaved

Hybrid

Nashua, NH, USA

97K-165K Annually

Senior level

97K-165K Annually

Senior level

Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense

The Senior Site Reliability Engineer will oversee the deployment and reliability of digital engineering tools, enhance performance, and mentor junior engineers.

Top Skills: AnsibleFluent BitGrafanaLokiPostgresPrometheusPython

Apptronik

Senior Site Reliability Engineer

Reposted 13 Days AgoSaved

Easy Apply

Hybrid

Austin, TX, USA

Easy Apply

Senior level

Computer Vision • Hardware • Machine Learning • Robotics • Software

The role involves maintaining cloud infrastructure, collaborating with engineering teams, troubleshooting issues, deploying solutions, and ensuring system reliability.

Top Skills: AnsibleC++GrafanaHelmKubernetesPagerdutyPythonTerraformTypescript

Okta

Staff Site Reliability Engineer - Kubernetes

8 Days AgoSaved

In-Office

4 Locations

194K-267K Annually

Senior level

194K-267K Annually

Senior level

Cloud

The Site Reliability Engineer will manage Kubernetes platforms, optimize AWS cloud infrastructure, ensure high availability, and automate deployment while handling troubleshooting and security compliance.

Top Skills: AWSBashCi/CdCloudwatchElk StackGoGrafanaHelmIstioKubernetesPrometheusPythonTerraform

Jump Trading Group

Site Reliability Engineer

8 Days AgoSaved

In-Office

New York, NY, USA

200K-225K Annually

Senior level

200K-225K Annually

Senior level

Financial Services

The Site Reliability Engineer will enhance global infrastructure through coding, monitoring tools, and optimizing systems to ensure efficiency and resilience.

Top Skills: Apache KafkaBigtableC/C++CassandraCi/CdClickhouseGoKubernetesLinuxPythonRabbitMQRust

Unify (unifygtm.com)

Senior Staff Site Reliability Engineer, Tech Lead

8 Days AgoSaved

In-Office or Remote

2 Locations

250K-295K Annually

Senior level

250K-295K Annually

Senior level

Artificial Intelligence • Software

As a Senior Staff SRE Tech Lead, you'll oversee reliability and scalability, mentor engineers, optimize systems, and enhance data infrastructure.

Top Skills: ClickhouseGoPostgresPythonTypescript

Five9

Site Reliability Engineer

Reposted 8 Days AgoSaved

Remote

United States

72K-190K Annually

Mid level

72K-190K Annually

Mid level

Cloud • Software

The Site Reliability Engineer (SRE) will manage reliable, scalable systems, focusing on software development, infrastructure automation, and incident response. Responsibilities include monitoring, CI/CD pipeline management, security compliance, and cost optimization while collaborating with various teams.

Top Skills: AWSAzureDockerElk StackGCPGitGrafanaJavaKubernetesPHPPrometheusPythonShellTerraform

E-SPACE

Site Reliability Engineer (SRE) / DevOps Engineer

Reposted 8 Days AgoSaved

In-Office

Saratoga, CA, USA

100K-165K Annually

Senior level

100K-165K Annually

Senior level

Other

As a Platform Engineer/Dev Ops, you will expand cloud infrastructure, implement monitoring systems, manage databases, and leverage CI/CD tools, working collaboratively with various teams.

Top Skills: AWSAzureBashDatadogElk StackKubernetesOpentofuPrometheusPythonTerraform

Axiom (axiom.co)

Site Reliability Engineer

Reposted 8 Days AgoSaved

Remote

United States

Mid level

Security • Software • Analytics

Design, operate, and automate scalable, secure infrastructure for Axiom Cloud. Define SLOs, plan disaster recovery and capacity, tune performance, improve deployment practices, build reliability tooling, respond to incidents, and promote monitoring and observability across teams.

Top Skills: Amazon EksAWSCircleCIDockerGithub ActionsGitlabGoKubernetesLinuxLlmsMonitoring And Observability ToolsPulumiTerraform

Cloud Support Engineering Manager (SRE Manager)

Reposted 8 Days AgoSaved

In-Office

City of Broomfield, CO, USA

155K-233K Annually

Senior level

155K-233K Annually

Senior level

Cloud • Information Technology • Security • Software

Lead and grow a global Cloud Support/SRE team to ensure SaaS and self-hosted infrastructure reliability. Own incident response for Severity 1 events, refine support workflows, track KPIs (CSAT, MTTR, first-response), and collaborate with Product, Engineering, and Solutions teams to drive product improvements and operational excellence.

Top Skills: AWSAzureBashDnsGCPGoKubernetesLinuxLoad BalancingPythonSsl/TlsTcp/Ip

Scientific Games

Sr. Site Reliability Engineer (SRE)

Reposted 8 Days AgoSaved

In-Office

Alpharetta, GA, USA

Senior level

Gaming • Mobile

The Site Reliability Engineer (SRE) will enhance production system stability and performance, collaborate with DevOps, manage on-call responsibilities, and improve observability. Responsibilities include monitoring, reliability engineering, incident management, and documentation.