Get the job you really want.

Top Site Reliability Engineer Jobs

Reposted 16 Days AgoSaved
In-Office
Indianapolis, IN, USA
65K-185K Annually
Senior level
65K-185K Annually
Senior level
Healthtech • Biotech • Pharmaceutical
Design, operate, and automate enterprise Oracle Database platforms across on‑prem, hybrid, and cloud environments. Ensure availability, performance tuning, HA/DR, backup/recovery, security/compliance, monitoring, and lifecycle management while driving IaC automation and SRE practices to reduce toil and improve reliability.
Top Skills: Active Data GuardAiopsAnsibleAWSAzureCi/CdData PumpExaccExadataGitGrafanaInfrastructure As CodeKubernetesOciOci Resource ManagerOemOracle Cloud InfrastructureOracle Data GuardOracle Database (19C+)Oracle RacPythonRmanServicenowShell ScriptingSplunkSQL ServerTerraform
22 Days AgoSaved
Easy Apply
Remote or Hybrid
USA
Easy Apply
180K-220K Annually
Senior level
180K-220K Annually
Senior level
Healthtech • Information Technology • Software • Telehealth
The Senior Site Reliability Engineer will develop, monitor, and maintain distributed production systems, ensuring uptime for patients and providers while automating processes and supporting a large engineering team.
Top Skills: AWSDockerGCPKubernetes
Reposted 16 Days AgoSaved
Easy Apply
Remote or Hybrid
United States
Easy Apply
150K-225K Annually
Senior level
150K-225K Annually
Senior level
Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Business Intelligence
Lead architecture and build reliability platforms, drive AIOps automation, champion SRE practices, lead incident response and postmortems, advance observability, and mentor engineers to improve system reliability and performance.
Top Skills: AiopsAWSAzureContinuous ProfilingDatadogDnsElkGCPGoGrafanaHttp/SKubernetesLoad BalancingOpentelemetryPrometheusPythonTcp/Ip
Reposted 16 Days AgoSaved
Easy Apply
Remote
United States
Easy Apply
175K-195K Annually
Senior level
175K-195K Annually
Senior level
Marketing Tech • Mobile • Software
Lead the Site Reliability Engineering team, ensuring platform reliability, scalability, and developer support while fostering an inclusive environment and coaching team members.
Top Skills: EmberGoReact
Reposted 16 Days AgoSaved
Easy Apply
In-Office
San Francisco, CA, USA
Easy Apply
150K-200K Annually
Junior
150K-200K Annually
Junior
Artificial Intelligence • Information Technology
As a Site Reliability Engineer, maintain user-facing services, implement best practices for reliability, and manage production incidents.
Top Skills: AnsibleCloud ServicesKubernetesProgramming LanguagesTerraform
Reposted 16 Days AgoSaved
In-Office or Remote
5 Locations
Senior level
Senior level
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Generative AI
The Site Reliability Engineer will develop, deploy, and operate AI infrastructure, focusing on high-performance and scalable machine learning systems using Kubernetes and cloud platforms.
Top Skills: AWSAzureC++GCPGoKubernetesOci
Reposted 16 Days AgoSaved
In-Office
Seattle, WA, USA
176K-221K Annually
Senior level
176K-221K Annually
Senior level
eCommerce
Responsible for platform reliability, monitoring, automation, and system health for Coupang's customer-facing services, ensuring scalable solutions and handling production incidents.
Top Skills: AWSAzureDatadogDockerElastic StackGoGoogle Cloud PlatformGrafanaJavaKubernetesNew RelicPrometheusPythonRuby
Reposted 16 Days AgoSaved
In-Office
San Francisco, CA, USA
200K-275K Annually
Senior level
200K-275K Annually
Senior level
Artificial Intelligence • Healthtech • Information Technology • Software
As a Site Reliability Engineer, you will manage the production environment, focusing on infrastructure design, automation, and optimizing deployment pipelines to ensure high availability.
Top Skills: HelmKafkaKubernetesPostgresPythonRedisTerraformTypescript
Reposted 16 Days AgoSaved
In-Office
Lovelace, NC, USA
Senior level
Senior level
Artificial Intelligence • Machine Learning • Security • Database • Analytics • Big Data Analytics
As a Site Reliability Engineer, you'll ensure the availability and performance of AI applications, maintain infrastructure, automate tasks, and troubleshoot issues in high-scale environments.
Top Skills: AnsibleAWSAzureBashCircleCICloudFormationDatadogDockerDynatraceEc2Elk StackGCPGitlab CiGoGrafanaJenkinsKubernetesLambdaLinuxPrometheusPythonS3TerraformUnix
Reposted 16 Days AgoSaved
In-Office
San Francisco, CA, USA
Mid level
Mid level
Information Technology • Software
As a DevOps Engineer, you'll design and scale secure systems, manage AWS environments, automate operations, and ensure operational excellence for revenue teams.
Top Skills: Amazon AuroraAWSDockerDynamoDBGithub ActionsKafkaS3SnowflakeSparkSqsTerraform
Reposted 16 Days AgoSaved
In-Office or Remote
Berkeley, CA, USA
205K-235K Annually
Senior level
205K-235K Annually
Senior level
Financial Services
The Senior Cluster Site Reliability Engineer will enhance the research compute cluster's uptime, reliability, and performance through engineering and operational improvements, ensuring high availability for researchers working on machine learning problems.
Top Skills: AnsibleAWSAWSCephDockerElkGCPGCPGrafanaHorovodHpcInfinibandKubeflowKueueLokiLustreMlflowOpentelemetryPodmanPrometheusPythonRdmaRubyS3SingularitySlurmTerraform
Reposted 16 Days AgoSaved
Easy Apply
Remote
US
Easy Apply
160K-180K Annually
Senior level
160K-180K Annually
Senior level
Software
The Senior SRE Manager will establish an SRE team, implement best practices, manage incidents, and enhance system reliability, scaling operations effectively.
Top Skills: Cloud InfrastructureDistributed SystemsObservability
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
Reposted 16 Days AgoSaved
In-Office
Sunnyvale, CA, USA
141K-162K Annually
Mid level
141K-162K Annually
Mid level
Software • Cybersecurity
As an SRE Engineer II, manage multi-cloud infrastructure, enhance reliability, design services, implement IaC, develop CI/CD pipelines, and automate tasks with a focus on security and scalability.
Top Skills: Arm TemplatesAWSAws CodepipelineAzureAzure DevopsCloudFormationGCPGoJenkinsPowershellPythonTerraform
17 Days AgoSaved
In-Office
3 Locations
138K-206K Annually
Senior level
138K-206K Annually
Senior level
Cloud • Information Technology • Security • Software
The Site Reliability Engineer III role involves developing services, automating infrastructure, managing CI/CD environments, and improving system reliability for applications and technology stacks.
Top Skills: AnsibleAzureChefGitGitlabGoJavaScriptKubernetesMs Sql ServerPostgresPuppetPythonTerraform
17 Days AgoSaved
In-Office
San Jose, CA, USA
87K-186K Annually
Mid level
87K-186K Annually
Mid level
Artificial Intelligence • Information Technology • Software
The role involves developing and maintaining tools for data center networks, ensuring performance and security while collaborating with teams on network management.
Top Skills: Azure DevopsGitlabGoGrafanaJenkinsLinuxNagiosPrometheusPythonSolarwindsZabbix
Reposted 22 Days AgoSaved
Easy Apply
Remote
USA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will build and scale identity management tools, automate operations, ensure security, and support AWS, GCP, and Azure environments.
Top Skills: AnsibleAWSAzureC#Cloud Identity ProvidersDockerGCPGoInfrastructure As CodeJavaKubernetesPythonRubyTerraform
Reposted 22 Days AgoSaved
Remote or Hybrid
17 Locations
130K-180K Annually
Senior level
130K-180K Annually
Senior level
Information Technology • Productivity • Software • Infrastructure as a Service (IaaS)
The role involves diagnosing infrastructure issues, participating in on-call rotations, improving application availability, and enhancing automation in cloud environments.
Top Skills: AnsibleAWSC++CloudFormationDatadogGoHelmJavaKotlinKubernetesNew RelicPostgresSplunkTerraform
17 Days AgoSaved
Remote or Hybrid
3 Locations
160K-260K Annually
Senior level
160K-260K Annually
Senior level
Software
As a Site Reliability Engineer, you will manage the reliability and scalability of platform infrastructure, build observability tools, and automate processes to enhance operational excellence.
Top Skills: AWSGCPGoKubernetesPulumiPythonTerraform
17 Days AgoSaved
In-Office
Headquarters, AZ, USA
Senior level
Senior level
Information Technology • Consulting
The Lead Site Reliability Engineer will create an IT support automation strategy to reduce ticket volume and automate workflows, focusing on eliminating recurring issues and driving measurable outcomes across IT support systems.
Top Skills: AWSAzureGCPGoKubernetesOauth2PulumiPythonRest ApisSAMLTerraform
17 Days AgoSaved
Easy Apply
In-Office or Remote
Washington, DC, USA
Easy Apply
125K-155K Annually
Mid level
125K-155K Annually
Mid level
Information Technology • Security • Software
As a Site Reliability Engineer, you'll build and maintain cloud infrastructure, support Kubernetes workloads, and enhance CI/CD pipelines within a collaborative team environment.
Top Skills: AnsibleAWSGCPGoKubernetesLinuxPythonTerraform
Reposted 17 Days AgoSaved
In-Office or Remote
Miami, FL, USA
Senior level
Senior level
Software
Seeking a Senior Site Reliability Engineer to enhance reliability and speed for Stream Aligned teams, ensuring ownership of services, building tooling, and improving deployment processes.
Top Skills: AWSJavaKubernetesLinuxPostgresTerraform
Reposted 17 Days AgoSaved
In-Office
3 Locations
172K-258K Annually
Expert/Leader
172K-258K Annually
Expert/Leader
Fintech
The Principal Site Reliability Engineer designs and implements software to enhance application performance and resilience while ensuring security standards. Responsibilities include automating application management, providing observability, and leading cross-functional teams. Mentorship and on-call rotation participation are expected.
Top Skills: AuroraAWSChefDockerDynamo DbGitGoJavaJenkinsJmsKafkaKubernetesMavenMemcachedOraclePythonRedisSqsSwarm
Reposted 17 Days AgoSaved
Easy Apply
In-Office
Sandy, UT, USA
Easy Apply
Senior level
Senior level
Cloud • Software • Analytics
The Senior Cloud SRE enhances reliability and availability of solutions by automating tasks, providing on-call support, and mentoring teams.
Top Skills: AnsibleAppdynamicsBmcC#C++DatadogDockerDynatraceGitGrafanaJavaJIRAKubernetesNew RelicPerlPrometheusPythonRubySplunkTerraform
23 Days AgoSaved
Remote or Hybrid
United States
170K-215K Annually
Senior level
170K-215K Annually
Senior level
HR Tech • Information Technology • Professional Services • Sales • Software
Own and operate production-grade Kubernetes infrastructure on AWS, build GitOps CI/CD with GitHub Actions and ArgoCD, develop AI agents and internal DevOps tooling, maintain Datadog-based observability, and manage on-call incident response while collaborating with engineering teams to improve reliability and delivery speed.
Top Skills: Ai/LlmArgocdAWSCi/CdDatadogGithub ActionsGitopsGoKubernetesPython
18 Days AgoSaved
Easy Apply
Remote or Hybrid
USA
Easy Apply
136K-170K Annually
Senior level
136K-170K Annually
Senior level
Cloud • Security • Software
As a Staff Site Reliability Engineer, you will design, build, and maintain cloud infrastructure, improve deployment processes, and collaborate across teams.
Top Skills: Ci/CdDockerGoKubernetes
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account