Get the job you really want.

Top Site Reliability Engineer Jobs

7 Days AgoSaved
Remote
USA
156K-288K Annually
Mid level
156K-288K Annually
Mid level
Computer Vision • Machine Learning • Software
As a Site Reliability Engineer, ensure the reliability, performance, and scalability of Ditto's cloud infrastructure by developing observability solutions, leading incident management, and collaborating with product engineering teams.
Top Skills: AWSAzureCDatadogGCPGoGrafanaHelmJavaKubernetesPrometheusRustTerraform
7 Days AgoSaved
Remote
United States
120K-150K Annually
Senior level
120K-150K Annually
Senior level
Information Technology
The Lead Site Reliability Engineer will ensure platform reliability and performance, guiding SRE principles, managing incidents, and fostering collaboration across teams while leveraging cloud technologies and automation.
Top Skills: AWSAzureAzure DevopsBashBicepCloudFormationGithub ActionsGoJenkinsPowershellPythonTerraform
7 Days AgoSaved
In-Office
3 Locations
75K-100K Annually
Junior
75K-100K Annually
Junior
Aerospace
The Site Reliability Engineer will ensure system reliability, assist with incident response, improve operational quality, and automate processes to reduce toil. Responsibilities include incident resolution, reliability evaluations, and platform enablement.
Top Skills: Argo CdDockerGitGitlabGoGrafanaJenkinsKubernetesOtel StandardsPrometheusPython
7 Days AgoSaved
Easy Apply
Remote
United States
Easy Apply
230K-250K Annually
Expert/Leader
230K-250K Annually
Expert/Leader
Artificial Intelligence • Healthtech • Software
The Staff Site Reliability Engineer will lead the reliability of production systems by defining SRE practices, improving observability, and ensuring fault-tolerance in cloud environments.
Top Skills: AWSGoKubernetesPostgresPythonTerraformTypescript
Reposted 7 Days AgoSaved
In-Office
City of New Home, TX, USA
175K-335K Annually
Expert/Leader
175K-335K Annually
Expert/Leader
Fitness • Healthtech • Retail • Pharmaceutical
The Executive Director of Digital SRE & Operations will lead strategy and execution for enterprise-scale reliability and operational excellence, overseeing AIOps, automation, and DevOps while mentoring SRE teams.
Top Skills: AiopsAWSAzureDatadogGCPGrafanaOpentelemetryPrometheusSplunk
Reposted 7 Days AgoSaved
Remote
United States
Senior level
Senior level
Digital Media • Social Media • Software • Sports
Lead the technical architecture and execution of migration to AWS, drive developer enablement, and automate infrastructure using code-first principles.
Top Skills: Aws EksDatadogGithub ActionsGoIstioK6KubernetesNode.jsTerraform
Reposted 7 Days AgoSaved
Easy Apply
In-Office
Atlanta, GA, USA
Easy Apply
Mid level
Mid level
Healthtech • Software
Seeking a Site Reliability Engineer to ensure platform reliability, scalability, and performance by leveraging AI and automation in cloud infrastructure. Responsibilities include incident response, monitoring, and operational efficiency enhancement.
Top Skills: AIAWSCi/CdTerraform
Reposted 7 Days AgoSaved
Remote
United States
175K-275K Annually
Mid level
175K-275K Annually
Mid level
Software
As a Site Reliability Engineer, you'll enhance system reliability, collaborate on production readiness, define SLIs/SLOs, and improve incident response.
Top Skills: AWSDatadogGrafanaKubernetesOpentelemetryPrometheusTypescript
Reposted 7 Days AgoSaved
In-Office
San Francisco, CA, USA
255K-490K Annually
Mid level
255K-490K Annually
Mid level
Artificial Intelligence • Machine Learning • Generative AI
As a Site Reliability Engineer, you will manage Kubernetes clusters, automate infrastructure, improve operational metrics, and enhance reliability across data centers.
Top Skills: CloudFormationGoGpuKubernetesLinuxPythonTerraform
8 Days AgoSaved
In-Office or Remote
2 Locations
107K-221K Annually
Senior level
107K-221K Annually
Senior level
Cloud • Security • Software • Cybersecurity
The Senior Lead Site Reliability Engineer will ensure performance and uptime of security products, develop automation pipelines, and improve monitoring systems, working closely with various teams.
Top Skills: AzureDatabricksDockerGoJenkinsKubernetesPythonTerraform
Senior level
Financial Services
As a Principal Application Support Engineer, you'll ensure system reliability and operational resilience, implementing SRE practices, and leading incident management efforts.
Top Skills: AWSAzureDynatraceGCPGoItsiJavaLinuxPythonSplunkUnix
13 Days AgoSaved
Hybrid
Denver, CO, USA
110K-125K Annually
Senior level
110K-125K Annually
Senior level
Information Technology • Insurance • Software
The Senior Site Reliability Engineer is responsible for the reliability and performance of production services, including incident response, service design, and automation of operations.
Top Skills: .NetAWSC#Ci/CdInfrastructure As CodeJavaKubernetesLinuxPythonReactWindows
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 13 Days AgoSaved
Easy Apply
Remote or Hybrid
United States
Easy Apply
140K-170K Annually
Senior level
140K-170K Annually
Senior level
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
The Senior Site Reliability Engineer will enhance system reliability, develop production-grade code, implement observability tools, conduct root cause analyses, and collaborate on system design for scalability.
Top Skills: ArgocdCi/CdDockerGitopsGoGrafanaHoneycombJenkinsKubernetesOpentelemetryPrometheusPythonTerraform
Reposted 13 Days AgoSaved
Hybrid
Chicago, IL, USA
100K-115K Annually
Senior level
100K-115K Annually
Senior level
Artificial Intelligence • Hardware • Information Technology • Security • Software • Cybersecurity • Big Data Analytics
The Senior Site Reliability Engineer will support cloud operations, implement observability strategies, and optimize applications for availability and performance.
Top Skills: .NetAnsibleC#GitGrafanaKubernetesPrometheus
Reposted 13 Days AgoSaved
Easy Apply
Hybrid
New York City, NY, USA
Easy Apply
129K-232K Annually
Senior level
129K-232K Annually
Senior level
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you will maintain service uptime, improve automation, and ensure infrastructure reliability while collaborating with engineering teams at Braze.
Top Skills: ChefDockerKafkaKubernetesLinuxMongoDBRedisRuby On RailsTerraformUnix Shell
Reposted 14 Days AgoSaved
Hybrid
Austin, TX, USA
Senior level
Senior level
Gaming • Information Technology • Mobile • Software • Esports
Seeking a Senior Site Reliability Engineer to design and operate scalable platform solutions, enhance reliability, and improve developer experience and operational efficiency across engineering teams.
Top Skills: AWSGCP
14 Days AgoSaved
In-Office
Atlanta, GA, USA
144K-191K Annually
Senior level
144K-191K Annually
Senior level
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
Responsible for deploying and managing cloud environments, integrating platform services, enhancing data pipelines, and collaborating on operational testing for TRS systems.
Top Skills: AnsibleAWSAzureConfluenceCudaDockerGCPGitGithub ActionsGrafanaJfrog ArtifactoryJIRAKubernetesNominalOpenclPythonTerraform
8 Days AgoSaved
In-Office
San Francisco, CA, USA
194K-267K Annually
Senior level
194K-267K Annually
Senior level
Cloud
The role involves building and managing observability infrastructure in GCP, automating deployments, and optimizing data processes for high reliability.
Top Skills: GkeGoGCPGrafanaKubernetesOpentelemetryPythonRubySplunkTerraform
Reposted 8 Days AgoSaved
Easy Apply
Remote
United States
Easy Apply
Senior level
Senior level
Edtech
The Lead Software Engineer will lead the SRE team, focusing on reliability, performance optimization, security, and mentoring developers, while improving overall platform resilience.
Top Skills: ActivejobAnsibleAWSAws CloudwatchEc2EcsElasticsearchGitGCPGoogle Cloud StackdriverJenkinsJIRAKubernetesMemcachedMongoDBNew RelicNode.jsPostgresRedisRuby On RailsSidekiqSpinnakerTerraformTerragrunt
9 Days AgoSaved
In-Office
Saint Louis, MO, USA
Senior level
Senior level
Fintech • Analytics
The role involves managing application services, driving improvements, handling incidents, and leveraging domain knowledge to enhance service quality and efficiency.
Top Skills: DatadogItrs
9 Days AgoSaved
In-Office
Omaha, NE, USA
3-3 Annually
Mid level
3-3 Annually
Mid level
Software
The Site Reliability Engineer will enhance monitoring systems, improve user experience, optimize alerting, and analyze data for informed decision-making.
Top Skills: AnsibleAWSAzureBashDatadogElk StackGCPGitGrafanaJenkinsNagiosNew RelicPowershellPrometheusPythonTerraform
Reposted 9 Days AgoSaved
In-Office
Austin, TX, USA
175K-240K Annually
Senior level
175K-240K Annually
Senior level
Financial Services
The Staff Engineer will support and optimize messaging platforms, design solutions to improve operational efficiency, and collaborate with teams on business-focused solutions.
Top Skills: AmpsAWSEksFixJavaKafkaKubernetesLinuxMqSpringSQL
Reposted 9 Days AgoSaved
In-Office
Norwalk, CT, USA
80K-120K Annually
Junior
80K-120K Annually
Junior
Travel
Deploy, operate, and automate large-scale cloud-native and Kubernetes workloads (GKE) with emphasis on reliability, observability, SLO/SLA design, GitOps deployments, on-call incident response, and building self-service platforms and automation to reduce operational toil.
Top Skills: Alpine/Distroless)AnsibleArgocdBashClaude CodeCursorGCPGithub ActionsGithub CopilotGitopsGoGrafanaHelmIstioKubernetes (Gke)KustomizeKyvernoLinux (Rhel/RockyNew RelicOpentelemetryPrometheusPythonSplunkTerraformUbuntu
Reposted 9 Days AgoSaved
In-Office
2 Locations
112K-137K Annually
Senior level
112K-137K Annually
Senior level
Fintech
The Site Reliability Engineer will manage AWS infrastructures, oversee application deployments, and ensure system reliability and security while collaborating with teams.
Top Skills: AWSBashCodebuildCodedeployCodepipelineEc2IamPythonRdsRoute 53S3TerraformVpc
Reposted 9 Days AgoSaved
In-Office
New York, NY, USA
140K-225K Annually
Senior level
140K-225K Annually
Senior level
Fintech
Lead adoption of SRE practices to improve reliability, observability, automation, and incident response. Implement and maintain observability tooling, instrumentation, CI/CD, and infrastructure-as-code. Partner with developers, participate in on-call rotations, drive postmortems, and reduce operational overhead through automation.
Top Skills: AnthropicAWSAws EcsAws EksAzureC#DockerGitlab CiGrafanaLinuxOpenaiPrometheusPuppetPythonSplunkTerraformTypescriptWindows
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account