Get the job you really want.

Top Senior Site Reliability Engineer Jobs

Reposted 2 Days AgoSaved
In-Office
2 Locations
91K-125K Annually
Mid level
91K-125K Annually
Mid level
Hardware • Information Technology • Other • Software • Analytics
Responsible for developing and maintaining FedRAMP-compliant SaaS and PaaS systems in AWS GovCloud, ensuring reliability, automation, and security of cloud infrastructure.
Top Skills: AnsibleAWSAzureBashEcsEksGrafanaKubernetesPerlPowershellPrtgPythonSumo LogicTerraform
2 Days AgoSaved
In-Office
6 Locations
118K-261K Annually
Senior level
118K-261K Annually
Senior level
Fitness • Healthtech • Retail • Pharmaceutical
Lead the reliability and scalability of integration platforms, manage operations teams, define SLOs/SLIs, and improve automation and system resilience.
Top Skills: AceApicApigeeApimDatapowerJwtKongKubernetesMqOauth 2.0Splunk
Reposted 2 Days AgoSaved
Easy Apply
In-Office or Remote
3 Locations
Easy Apply
124K-206K Annually
Senior level
124K-206K Annually
Senior level
Analytics
The Site Reliability Engineer will ensure the reliability and performance of IaaS services, perform incident resolution, and enhance system reliability through automation while supporting mobility across hybrid infrastructures and collaborating extensively with various teams.
Top Skills: AnsibleAWSAzureBashGitlab CiJenkinsKubernetesLinuxOpenshiftPythonTerraformVmware Vsphere
Reposted 2 Days AgoSaved
In-Office or Remote
4 Locations
124K-206K Annually
Senior level
124K-206K Annually
Senior level
Cloud • Information Technology
The Site Reliability Engineer will support IaaS services, monitor infrastructure health, perform root cause analysis, automate processes, and collaborate with teams for service reliability.
Top Skills: AnsibleAWSAzureBashGitlab CiJenkinsKubernetesLinuxOpenshiftPythonTerraformVmware Vsphere
7 Days AgoSaved
Hybrid
2 Locations
Senior level
Senior level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Senior Site Reliability Engineer will ensure the performance, availability, and resilience of GM Motorsports' data platforms, focusing on high-throughput telemetry and analytics. Responsibilities include designing reliability practices, managing data pipelines, building observability frameworks, and driving infrastructure automation while mentoring team engineers.
Top Skills: DatabricksDatadogDevOpsFlinkGrafanaKafkaKubernetesLinuxOpentelemetryPlatform EngineeringPrometheusSite Reliability EngineeringSparkTerraform
Reposted 2 Days AgoSaved
In-Office
Irving, TX, USA
88K-137K Annually
Senior level
88K-137K Annually
Senior level
Consulting
As a Site Reliability Engineer, you'll enhance system performance and reliability through automation, monitor service levels, manage incidents, and improve application stability while collaborating with agile teams.
Top Skills: .Net CoreApi GatewayAppdynamicsAWSC#DatadogDockerDynatraceEc2EksHibernateJ2EeJavaScriptJdbcJenkinsJqueryKubernetesLambdaNew RelicNode.jsReactSplunkSpringTomcat
Reposted 7 Days AgoSaved
Easy Apply
Remote or Hybrid
USA
Easy Apply
180K-220K Annually
Senior level
180K-220K Annually
Senior level
Healthtech • Information Technology • Software • Telehealth
The Senior Site Reliability Engineer will develop, monitor, and maintain distributed production systems, ensuring uptime for patients and providers while automating processes and supporting a large engineering team.
Top Skills: AWSDockerGCPKubernetes
Reposted 2 Days AgoSaved
In-Office
San Francisco, CA, USA
200K-275K Annually
Senior level
200K-275K Annually
Senior level
Artificial Intelligence • Healthtech • Information Technology • Software
As a Site Reliability Engineer, you will manage the production environment, focusing on infrastructure design, automation, and optimizing deployment pipelines to ensure high availability.
Top Skills: HelmKafkaKubernetesPostgresPythonRedisTerraformTypescript
Reposted 2 Days AgoSaved
In-Office or Remote
5 Locations
Senior level
Senior level
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Generative AI
The Site Reliability Engineer will develop, deploy, and operate AI infrastructure, focusing on high-performance and scalable machine learning systems using Kubernetes and cloud platforms.
Top Skills: AWSAzureC++GCPGoKubernetesOci
Reposted 2 Days AgoSaved
In-Office
San Francisco, CA, USA
Mid level
Mid level
Information Technology • Software
As a DevOps Engineer, you'll design and scale secure systems, manage AWS environments, automate operations, and ensure operational excellence for revenue teams.
Top Skills: Amazon AuroraAWSDockerDynamoDBGithub ActionsKafkaS3SnowflakeSparkSqsTerraform
Reposted 2 Days AgoSaved
In-Office
Sunnyvale, CA, USA
141K-162K Annually
Mid level
141K-162K Annually
Mid level
Software • Cybersecurity
As an SRE Engineer II, manage multi-cloud infrastructure, enhance reliability, design services, implement IaC, develop CI/CD pipelines, and automate tasks with a focus on security and scalability.
Top Skills: Arm TemplatesAWSAws CodepipelineAzureAzure DevopsCloudFormationGCPGoJenkinsPowershellPythonTerraform
Reposted 2 Days AgoSaved
In-Office
Headquarters, AZ, USA
Senior level
Senior level
Information Technology • Consulting
The Lead Site Reliability Engineer will create an IT support automation strategy to reduce ticket volume and automate workflows, focusing on eliminating recurring issues and driving measurable outcomes across IT support systems.
Top Skills: AWSAzureGCPGoKubernetesOauth2PulumiPythonRest ApisSAMLTerraform
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
Reposted 2 Days AgoSaved
Remote or Hybrid
3 Locations
160K-260K Annually
Senior level
160K-260K Annually
Senior level
Software
As a Site Reliability Engineer, you will manage the reliability and scalability of platform infrastructure, build observability tools, and automate processes to enhance operational excellence.
Top Skills: AWSGCPGoKubernetesPulumiPythonTerraform
Reposted 2 Days AgoSaved
In-Office
Lovelace, NC, USA
Senior level
Senior level
Artificial Intelligence • Machine Learning • Security • Database • Analytics • Big Data Analytics
As a Site Reliability Engineer, you'll ensure the availability and performance of AI applications, maintain infrastructure, automate tasks, and troubleshoot issues in high-scale environments.
Top Skills: AnsibleAWSAzureBashCircleCICloudFormationDatadogDockerDynatraceEc2Elk StackGCPGitlab CiGoGrafanaJenkinsKubernetesLambdaLinuxPrometheusPythonS3TerraformUnix
Reposted 2 Days AgoSaved
In-Office or Remote
Berkeley, CA, USA
205K-235K Annually
Senior level
205K-235K Annually
Senior level
Financial Services
The Senior Cluster Site Reliability Engineer will enhance the research compute cluster's uptime, reliability, and performance through engineering and operational improvements, ensuring high availability for researchers working on machine learning problems.
Top Skills: AnsibleAWSAWSCephDockerElkGCPGCPGrafanaHorovodHpcInfinibandKubeflowKueueLokiLustreMlflowOpentelemetryPodmanPrometheusPythonRdmaRubyS3SingularitySlurmTerraform
Reposted 2 Days AgoSaved
In-Office
San Jose, CA, USA
87K-186K Annually
Mid level
87K-186K Annually
Mid level
Artificial Intelligence • Information Technology • Software
The role involves developing and maintaining tools for data center networks, ensuring performance and security while collaborating with teams on network management.
Top Skills: Azure DevopsGitlabGoGrafanaJenkinsLinuxNagiosPrometheusPythonSolarwindsZabbix
Reposted 2 Days AgoSaved
In-Office
3 Locations
138K-206K Annually
Senior level
138K-206K Annually
Senior level
Cloud • Information Technology • Security • Software
The Site Reliability Engineer III role involves developing services, automating infrastructure, managing CI/CD environments, and improving system reliability for applications and technology stacks.
Top Skills: AnsibleAzureChefGitGitlabGoJavaScriptKubernetesMs Sql ServerPostgresPuppetPythonTerraform
Reposted 2 Days AgoSaved
Easy Apply
Remote
US
Easy Apply
160K-180K Annually
Senior level
160K-180K Annually
Senior level
Software
The Senior SRE Manager will establish an SRE team, implement best practices, manage incidents, and enhance system reliability, scaling operations effectively.
Top Skills: Cloud InfrastructureDistributed SystemsObservability
Reposted 2 Days AgoSaved
Easy Apply
In-Office or Remote
Washington, DC, USA
Easy Apply
125K-155K Annually
Mid level
125K-155K Annually
Mid level
Information Technology • Security • Software
As a Site Reliability Engineer, you'll build and maintain cloud infrastructure, support Kubernetes workloads, and enhance CI/CD pipelines within a collaborative team environment.
Top Skills: AnsibleAWSGCPGoKubernetesLinuxPythonTerraform
8 Days AgoSaved
Hybrid
3 Locations
147K-278K Annually
Senior level
147K-278K Annually
Senior level
Cloud • Software
Responsible for maintaining FedRAMP-compliant infrastructure, collaborating with software engineers, and ensuring system availability and security. Duties include infrastructure design, automation, monitoring, and incident response.
Top Skills: AWSGoKubernetesPuppetPythonTerraform
Reposted 8 Days AgoSaved
Remote or Hybrid
United States
170K-206K Annually
Senior level
170K-206K Annually
Senior level
HR Tech • Information Technology • Professional Services • Sales • Software
Own and operate production-grade Kubernetes infrastructure on AWS, build GitOps CI/CD with GitHub Actions and ArgoCD, develop AI agents and internal DevOps tooling, maintain Datadog-based observability, and manage on-call incident response while collaborating with engineering teams to improve reliability and delivery speed.
Top Skills: Ai/LlmArgocdAWSCi/CdDatadogGithub ActionsGitopsGoKubernetesPython
Reposted 8 Days AgoSaved
Easy Apply
Remote
USA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will build and scale identity management tools, automate operations, ensure security, and support AWS, GCP, and Azure environments.
Top Skills: AnsibleAWSAzureC#Cloud Identity ProvidersDockerGCPGoInfrastructure As CodeJavaKubernetesPythonRubyTerraform
Reposted 8 Days AgoSaved
Remote or Hybrid
17 Locations
130K-180K Annually
Senior level
130K-180K Annually
Senior level
Information Technology • Productivity • Software • Infrastructure as a Service (IaaS)
The role involves diagnosing infrastructure issues, participating in on-call rotations, improving application availability, and enhancing automation in cloud environments.
Top Skills: AnsibleAWSC++CloudFormationDatadogGoHelmJavaKotlinKubernetesNew RelicPostgresSplunkTerraform
Reposted 3 Days AgoSaved
Remote
USA
Mid level
Mid level
Information Technology • Other • Software • Consulting
The Site Reliability Engineer at CardioOne will enhance the reliability and performance of production systems, implement automation, and lead incident response efforts while collaborating with development teams.
Top Skills: AnsibleAWSAzureDatadogDockerEcsJavaKubernetesPythonTerraformTerragrunt
Reposted 3 Days AgoSaved
In-Office
San Mateo, CA, USA
Senior level
Senior level
Software
Design, implement, and maintain scalable backend systems and APIs; build cloud infrastructure (preferably GCP) using Terraform; operate containerized workloads with Kubernetes; ensure reliability, security, and performance; participate in on-call rotations, architecture discussions, and cross-functional delivery.
Top Skills: Ci/CdCloud AutomationContainer OrchestrationGoGoogle Cloud PlatformIamInfrastructure As CodeKubernetesMicroservicesPythonService-Oriented ArchitectureTerraform
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account