Top Site Reliability Engineer Jobs

Reposted 4 Days AgoSaved
In-Office
St. Louis, MO, USA
100K-120K Annually
Senior level
100K-120K Annually
Senior level
Fintech • Analytics
As a Senior Site Reliability Engineer, you'll lead incident recovery, enhance production stability, automate processes, and collaborate with development teams to improve operational efficiency.
Top Skills: AWSAzureBigpandaCloud-Native ApplicationsDatadogDnsDockerGitHTTPKubernetesShell ScriptingTcp/IpUnix
Reposted 4 Days AgoSaved
In-Office
St. Louis, MO, USA
Senior level
Senior level
Fintech • Analytics
As a Senior Site Reliability Engineer, you will enhance the reliability and performance of the FX trading platform, focusing on system health, automation, and collaboration with development teams. Responsibilities include measuring availability, incident response, and supporting cloud migration.
Top Skills: AWSAzureBashC#DatadogJavaKubernetesPythonSQL
Reposted 4 Days AgoSaved
In-Office
Burlingame, CA, USA
170K-197K Annually
Mid level
170K-197K Annually
Mid level
Aerospace • Artificial Intelligence
The Site Reliability Engineer will architect and manage ground infrastructure for satellite systems, ensuring high availability, automating deployments, and optimizing data management systems.
Top Skills: AnsibleAWSAzureC++CloudFormationEksElkGCPGrafanaHelmKubernetesPrometheusPythonTerraform
Reposted 4 Days AgoSaved
In-Office
2 Locations
Senior level
Senior level
Blockchain • Fintech • Cryptocurrency
Responsible for the design, implementation, and reliability of systems across hybrid cloud and on-premises environments, while leading technical initiatives and mentoring engineers.
Top Skills: AnsibleDatadogGithub ActionsHelmKubernetesPythonTerraform
10 Days AgoSaved
Hybrid
San Francisco, CA, USA
167K-226K Annually
Senior level
167K-226K Annually
Senior level
Security • Software • Cybersecurity • Automation
As a Senior Site Reliability Engineer, you will enhance the reliability of Drata’s product teams through automation, architecture reviews, and operational excellence using cloud-native technologies.
Top Skills: AiopsAWSBashDatadogDockerGitGithub ActionsKubernetesLinuxMySQLPythonTerraform
5 Days AgoSaved
In-Office
Wacker, IL, USA
132K-220K Annually
Expert/Leader
132K-220K Annually
Expert/Leader
Financial Services
The Staff Site Reliability Engineer will lead Platform Engineering's SRE efforts by defining technical strategy, overseeing architecture, and enhancing operational excellence through mentorship and governance.
Top Skills: ArgocdGCPGkeGoKafkaNode.jsPythonTerraform
Reposted 5 Days AgoSaved
Remote
USA
156K-288K Annually
Mid level
156K-288K Annually
Mid level
Computer Vision • Machine Learning • Software
As a Site Reliability Engineer, ensure the reliability, performance, and scalability of Ditto's cloud infrastructure by developing observability solutions, leading incident management, and collaborating with product engineering teams.
Top Skills: AWSAzureCDatadogGCPGoGrafanaHelmJavaKubernetesPrometheusRustTerraform
Reposted 5 Days AgoSaved
Hybrid
Redwood City, CA, USA
156K-261K Annually
Senior level
156K-261K Annually
Senior level
Consumer Web • eCommerce • Fashion • Retail
The Senior Site Reliability Engineer ensures the health of systems, automates processes, and collaborates on architecture to maintain uptime and reliability in production environments.
Top Skills: AnsibleAWSAzureDatadogDockerElasticsearchGCPGraphiteHaproxyJavaScriptJenkinsKubernetesMongoDBNagiosNew RelicNginxNode.jsRabbitMQRedisRubyTerraformTomcat
Reposted 5 Days AgoSaved
Hybrid
Redwood City, CA, USA
Entry level
Entry level
Consumer Web • eCommerce • Fashion • Retail
The Software Engineer, SRE will develop, deploy, and support new product features while ensuring operational excellence and quality support in a fast-paced environment.
Top Skills: AWSDockerElasticsearchHaproxyJavaScriptKubernetesMongoDBNginxNode.jsRabbitMQRedisRubyTomcat
Reposted 5 Days AgoSaved
In-Office
San Francisco, CA, USA
194K-267K Annually
Senior level
194K-267K Annually
Senior level
Cloud
The role involves building and managing observability infrastructure in GCP, automating deployments, and optimizing data processes for high reliability.
Top Skills: GkeGoGCPGrafanaKubernetesOpentelemetryPythonRubySplunkTerraform
Reposted 5 Days AgoSaved
In-Office
3 Locations
75K-100K Annually
Junior
75K-100K Annually
Junior
Aerospace
The Site Reliability Engineer will ensure system reliability, assist with incident response, improve operational quality, and automate processes to reduce toil. Responsibilities include incident resolution, reliability evaluations, and platform enablement.
Top Skills: Argo CdDockerGitGitlabGoGrafanaJenkinsKubernetesOtel StandardsPrometheusPython
Reposted 5 Days AgoSaved
In-Office
San Francisco, CA, USA
255K-490K Annually
Mid level
255K-490K Annually
Mid level
Artificial Intelligence • Machine Learning • Generative AI
As a Site Reliability Engineer, you will manage Kubernetes clusters, automate infrastructure, improve operational metrics, and enhance reliability across data centers.
Top Skills: CloudFormationGoGpuKubernetesLinuxPythonTerraform
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 5 Days AgoSaved
In-Office
Frederick, MD, USA
140K-155K Annually
Senior level
140K-155K Annually
Senior level
Information Technology • Consulting
The Site Reliability Engineer designs monitoring frameworks, manages SLIs/SLOs, automates cloud infrastructure, ensures compliance, and promotes SRE best practices.
Top Skills: .Net CoreAmazon AwsAnsibleBashChefDockerElkGithub ActionsGoogle GcpGrafanaJavaJenkinsKubernetesAzureNode.jsOpen-TelemetryPowershellPrometheusPuppetPythonRSplunkTerraform
Senior level
Financial Services
As a Principal Application Support Engineer, you'll ensure system reliability and operational resilience, implementing SRE practices, and leading incident management efforts.
Top Skills: AWSAzureDynatraceGCPGoItsiJavaLinuxPythonSplunkUnix
Reposted 5 Days AgoSaved
Remote
United States
Senior level
Senior level
Digital Media • Social Media • Software • Sports
Lead the technical architecture and execution of migration to AWS, drive developer enablement, and automate infrastructure using code-first principles.
Top Skills: Aws EksDatadogGithub ActionsGoIstioK6KubernetesNode.jsTerraform
Reposted 5 Days AgoSaved
Remote
United States
175K-275K Annually
Mid level
175K-275K Annually
Mid level
Software
As a Site Reliability Engineer, you'll enhance system reliability, collaborate on production readiness, define SLIs/SLOs, and improve incident response.
Top Skills: AWSDatadogGrafanaKubernetesOpentelemetryPrometheusTypescript
Reposted 5 Days AgoSaved
Remote
United States
Senior level
Senior level
Edtech
The Lead Software Engineer will lead the SRE team, focusing on reliability, performance optimization, security, and mentoring developers, while improving overall platform resilience.
Top Skills: ActivejobAnsibleAWSAws CloudwatchEc2EcsElasticsearchGitGCPGoogle Cloud StackdriverJenkinsJIRAKubernetesMemcachedMongoDBNew RelicNode.jsPostgresRedisRuby On RailsSidekiqSpinnakerTerraformTerragrunt
Reposted 11 Days AgoSaved
Hybrid
Austin, TX, USA
Senior level
Senior level
Gaming • Information Technology • Mobile • Software • Esports
Seeking a Senior Site Reliability Engineer to design and operate scalable platform solutions, enhance reliability, and improve developer experience and operational efficiency across engineering teams.
Top Skills: AWSGCP
Reposted 11 Days AgoSaved
Easy Apply
Remote
31 Locations
Easy Apply
130K-140K Annually
Senior level
130K-140K Annually
Senior level
Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
The Senior Site Reliability Engineer will manage system incidents, improve monitoring and logging, optimize database infrastructure, and collaborate on scaling systems efficiently.
Top Skills: AWSClickhouseKubernetesMySQLPostgresRedis
Reposted 6 Days AgoSaved
Remote
United States
Senior level
Senior level
Healthtech
Develop and implement processes to ensure high availability and reliability of services. Responsibilities include incident management, automation, capacity planning, and risk mitigation.
Top Skills: AWSAzureDatadogDockerGrafanaJavaScriptNew RelicPrometheusPythonRubySplunkTerraform
Reposted 6 Days AgoSaved
Remote
United States
190K-215K Annually
Senior level
190K-215K Annually
Senior level
Internet of Things • Cybersecurity
The Site Reliability Engineer will manage AWS GovCloud infrastructure, ensuring compliance and high availability while driving automation, security, and incident response best practices.
Top Skills: AnsibleAws GovcloudBashDockerElk StackGitlab Ci/CdGrafanaJenkinsKubernetesPrometheusPythonTerraform
Reposted 6 Days AgoSaved
In-Office
New York, NY, USA
140K-225K Annually
Senior level
140K-225K Annually
Senior level
Fintech
Lead adoption of SRE practices to improve reliability, observability, automation, and incident response. Implement and maintain observability tooling, instrumentation, CI/CD, and infrastructure-as-code. Partner with developers, participate in on-call rotations, drive postmortems, and reduce operational overhead through automation.
Top Skills: AnthropicAWSAws EcsAws EksAzureC#DockerGitlab CiGrafanaLinuxOpenaiPrometheusPuppetPythonSplunkTerraformTypescriptWindows
Reposted 6 Days AgoSaved
In-Office
2 Locations
106K-170K Annually
Junior
106K-170K Annually
Junior
Fintech
Support and evolve SRE practices: implement and maintain observability, monitoring, alerting, automation, and resilience for services. Participate in on-call rotations, incident response, postmortems, and collaborate with engineering teams to improve reliability and operational efficiency.
Top Skills: AnthropicAWSAws EcsAws EksAzureC#DockerGitlab CiGrafanaLinuxOpenaiPrometheusPuppetPythonSplunkTerraformTypescriptWindows
7 Days AgoSaved
In-Office
Columbus, OH, USA
118K-222K Annually
Senior level
118K-222K Annually
Senior level
Insurance
The Site Reliability Engineer enhances systems reliability and performance while developing software for observability and supports incident management in a dynamic environment.
Top Skills: Ci/Cd PipelinesInformaticaKubernetesNew RelicOraclePostgresPythonSnowflakeSplunkSQL
Reposted 12 Days AgoSaved
Easy Apply
Hybrid
Los Angeles, CA, USA
Easy Apply
160K-214K Annually
Senior level
160K-214K Annually
Senior level
Real Estate • Sales • Software • PropTech
The Senior Site Reliability Engineer will lead infrastructure reliability, performance, and scalability while mentoring engineers and overseeing system health and incident responses.
Top Skills: AWSAzureC#CloudFormationDockerGCPGoKubernetesPulumiPythonTerraformTypescript
All Filters
JobType
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account