Get the job you really want.

Top Site Reliability Engineer Jobs

Reposted 9 Days AgoSaved
In-Office
New York, NY, USA
140K-225K Annually
Senior level
140K-225K Annually
Senior level
Fintech
Lead adoption of SRE practices to improve reliability, observability, automation, and incident response. Implement and maintain observability tooling, instrumentation, CI/CD, and infrastructure-as-code. Partner with developers, participate in on-call rotations, drive postmortems, and reduce operational overhead through automation.
Top Skills: AnthropicAWSAws EcsAws EksAzureC#DockerGitlab CiGrafanaLinuxOpenaiPrometheusPuppetPythonSplunkTerraformTypescriptWindows
Reposted 9 Days AgoSaved
In-Office
2 Locations
106K-170K Annually
Junior
106K-170K Annually
Junior
Fintech
Support and evolve SRE practices: implement and maintain observability, monitoring, alerting, automation, and resilience for services. Participate in on-call rotations, incident response, postmortems, and collaborate with engineering teams to improve reliability and operational efficiency.
Top Skills: AnthropicAWSAws EcsAws EksAzureC#DockerGitlab CiGrafanaLinuxOpenaiPrometheusPuppetPythonSplunkTerraformTypescriptWindows
Reposted 9 Days AgoSaved
In-Office
Chicago, IL, USA
110K-150K Annually
Mid level
110K-150K Annually
Mid level
Artificial Intelligence • Other • Software • Industrial • Manufacturing
As a Site Reliability Engineer, you will enhance and secure our AI platform infrastructure, focusing on performance, compliance, incident response, and automation while collaborating with teams globally.
Top Skills: ArgocdBashCi/CdDatadogDockerElkGCPGithub ActionsGitlab CiGoGrafanaJenkinsKubernetesPrometheusPythonSpinnakerTerraform
Reposted 9 Days AgoSaved
In-Office
San Francisco, CA, USA
125K-195K Annually
Mid level
125K-195K Annually
Mid level
Hardware • Semiconductor • Manufacturing
Design, build, deploy, and manage on-prem backend infrastructure and services for a small semiconductor fab. Manage bare-metal Linux servers, networking, observability, backups, and automation. Deploy and operate services like consul, vault, grafana, and postgres; build high-frequency telemetry ingestion, OS image automation, and security/secret management. Contribute to an in-house infra-as-code tool and collaborate across hardware and software teams.
Top Skills: AlertmanagerBare-Metal ServersConsulDnsGiteaGoGrafanaLinuxOs Image AutomationPostgresPythonRedpandaReverse ProxiesRustS3-Compatible StorageSingle-Board ComputersSystemdTlsVaultVectorVictoriametricsVpns
Reposted 9 Days AgoSaved
In-Office or Remote
New York, NY, USA
133K-190K Annually
Mid level
133K-190K Annually
Mid level
Music
As a Site Reliability Engineer, you'll build and maintain cloud infrastructure for Spotify's AI-native developer platform, ensuring reliability and performance, while collaborating with senior engineers.
Top Skills: AWSGCPPythonReactTerraformTypescript
Reposted 9 Days AgoSaved
In-Office
Home, PA, USA
105K-165K Annually
Senior level
105K-165K Annually
Senior level
Software
As a Senior Platform Engineer/SRE, you'll design, implement, and manage scalable cloud solutions, focusing on platform reliability, automation, and compliance, while collaborating with cross-functional teams.
Top Skills: Argo WorkflowsArgocdAWSCi/CdCircleCICloudFormationDockerEnvoyGithub ActionsGoGrafanaIstioKubernetesPythonTerraform
Reposted 9 Days AgoSaved
Remote
United States
Mid level
Mid level
Information Technology • Legal Tech
The role involves maintaining and improving Azure infrastructure, managing Infrastructure as Code with Terraform, enhancing security measures, and operating CI/CD pipelines.
Top Skills: AzureAzure DevopsBashCircleCIDatadogEfkElkGithub ActionsPowershellPythonTerraform
Reposted 9 Days AgoSaved
In-Office or Remote
Newton, MA, USA
92K-135K Annually
Mid level
92K-135K Annually
Mid level
Security • Software
Work with Cloud Engineering to improve availability, performance, security, and scalability of CyberArk SaaS. Monitor, triage, and automate remediation of production incidents, enhance monitoring and dashboards, participate in on-call rotation, and influence system design to prevent failures. Focus on automation (Ansible, scripting), IaC, cloud platforms, and secure operations.
Top Skills: AnsibleAWSAzureBashChefCloudFormationGCPLinuxPowershellPuppetPythonRubyTerraformUnixWindows
Reposted 9 Days AgoSaved
In-Office or Remote
Newton, MA, USA
126K-185K Annually
Senior level
126K-185K Annually
Senior level
Security • Software
Design, implement, and architect AWS cloud infrastructure and automation for SaaS reliability. Lead SRE/DevOps practices, configuration management, observability, and recovery planning while mentoring engineers and driving platform improvements.
Top Skills: AnsibleAWSC#C++CatchpointCicdCloudFormationCloudwatchDatadogDockerEc2EksElasticsearchElkGrafanaHelmInfluxdbJavaKubernetesLogstashLogz.IoPythonS3SaltTerraformVpc
Reposted 9 Days AgoSaved
Easy Apply
In-Office
2 Locations
Easy Apply
180K-440K Annually
Senior level
180K-440K Annually
Senior level
Information Technology
Design and operate secure infrastructure for government projects. Optimize performance, manage storage with IaC tools, and ensure system reliability in high-security environments.
Top Skills: AnsibleArgocdGoGpu HardwareKubernetesKyvernoPulumiTerraform
10 Days AgoSaved
Remote
USA
388K-558K Annually
Senior level
388K-558K Annually
Senior level
News + Entertainment
As an Ads Reliability Engineer, you will ensure the reliability of Netflix's Ad Suite by designing scalable infrastructure, collaborating with teams, and implementing automation for monitoring and incident response.
Top Skills: AWSAzureGCPGoJavaKubernetesPythonTerraform
Reposted 10 Days AgoSaved
Easy Apply
Remote or Hybrid
United States
Easy Apply
186K-255K Annually
Senior level
186K-255K Annually
Senior level
Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Business Intelligence
The Senior Director of SRE leads and defines reliability and operational excellence across products, manages the SRE team, and scales reliability practices within the organization.
Top Skills: AWSAzureCloud-Native NetworkingDistributed SystemsGCPKubernetesMicroservicesSite Reliability Engineering Principles
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
10 Days AgoSaved
In-Office
Whippany, NJ, USA
151K-160K Annually
Senior level
151K-160K Annually
Senior level
Fintech • Financial Services
Ensure the reliability and scalability of systems through monitoring, automation, incident response, and collaboration with diverse teams while managing risk and providing technical guidance.
Top Skills: CorvilElasticItrsLinuxPython
Reposted 10 Days AgoSaved
In-Office
San Francisco, CA, USA
Senior level
Senior level
Artificial Intelligence • Healthtech
The Site Reliability Engineer will enhance system reliability, define observability standards, respond to incidents, and collaborate with engineering teams on performance and compliance improvements.
Top Skills: AWSContainerized ServicesDistributed WorkflowsObservability ToolingPostgresServerless Compute
Reposted 10 Days AgoSaved
Remote
United States
116K-255K Annually
Senior level
116K-255K Annually
Senior level
Big Data • Information Technology • Security • Software
The Senior Developer will drive observability roadmaps using SRE Golden Signals, establish monitoring strategies, enhance system reliability, and act as an expert in New Relic technology for performance management.
Top Skills: BashCri-OCshKubernetesNew RelicPerlWindows Powershell
15 Days AgoSaved
Easy Apply
Remote
USA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves supporting network infrastructure, automating cloud infrastructure, managing CI/CD workflows, and ensuring operational excellence in IT support, including incident response and security practices.
Top Skills: AnsibleAWSBashDockerGitKubernetesPythonRubyTerraform
Reposted 10 Days AgoSaved
In-Office or Remote
Field, KY, USA
Mid level
Mid level
Healthtech
The SRE Engineer will ensure system reliability of digital products, implement monitoring solutions, optimize performance, and improve deployment automation.
Top Skills: AWSCi/CdCloudwatchDatadogKubernetesPython
Reposted 10 Days AgoSaved
In-Office
4 Locations
81K-97K Annually
Mid level
81K-97K Annually
Mid level
Fintech • Payments
Join WEX as a Site Reliability Engineer to enhance system reliability in Azure, automate tasks, and collaborate with teams to optimize performance and incident response.
Top Skills: AnsibleAzureBashCloudFormationDockerElk StackGoGrafanaKubernetesPrometheusPythonSplunkTerraform
16 Days AgoSaved
Easy Apply
In-Office or Remote
Canada, KS, USA
Easy Apply
124K-266K Annually
Senior level
124K-266K Annually
Senior level
Cloud • Security • Software • Cybersecurity • Automation
As a Senior Site Reliability Engineer, you'll automate and manage GitLab environments, ensuring reliability, and scalability while troubleshooting production issues, and improving operational efficiency.
Top Skills: AnsibleCloud Services (AwsElkGcp)GoGrafanaHelm ChartsKubernetesOmnibus-GitlabPrometheusRubyTerraform
Reposted 10 Days AgoSaved
Hybrid
New York, NY, USA
210K-260K Annually
Mid level
210K-260K Annually
Mid level
Software
The Site Reliability Engineer will ensure the reliability and performance of Cloaked's services, contributing to the company's mission of protecting consumer data privacy.
Reposted 10 Days AgoSaved
Easy Apply
In-Office
Everett, WA, USA
Easy Apply
149K-184K Annually
Senior level
149K-184K Annually
Senior level
Energy
As a Site Reliability Engineer, you will design resilient systems, manage incident responses, and implement automation for infrastructure management, ensuring high operational standards.
Top Skills: AWSAzureBashGCPGoKubernetesPythonTerraform
Reposted 10 Days AgoSaved
Remote or Hybrid
3 Locations
148K-249K Annually
Senior level
148K-249K Annually
Senior level
Transportation
Design and develop Waabi's observability stack, optimize performance, build automation tooling, and support application requirements while leading projects and mentoring teams.
Top Skills: AWSC/C++DockerGoGrafanaJavaKubernetesOpentelemetryPythonRust
Reposted 10 Days AgoSaved
In-Office or Remote
Santa Clara, CA, USA
126K-185K Annually
Senior level
126K-185K Annually
Senior level
Security • Software
The role involves architecting and leading deployment automation, managing SaaS reliability, guiding teams on cloud tools, and responding to incidents.
Top Skills: AnsibleAWSCloudFormationCloudwatchDatadogGrafanaHelmKubernetesOpensearchPager DutyPythonSaltTerraform
Reposted 10 Days AgoSaved
In-Office or Remote
Newton, MA, USA
92K-135K Annually
Mid level
92K-135K Annually
Mid level
Security • Software
The Site Reliability Engineer will enhance production services' reliability and security, automate tasks, monitor systems, and manage incidents.
Top Skills: AnsibleAWSAzureBashCloudFormationGCPLinux/UnixPowershellPythonRubyTerraformWindows Os
Reposted 11 Days AgoSaved
In-Office
Aurora, CO, USA
87K-198K Annually
Senior level
87K-198K Annually
Senior level
Information Technology
As a Senior Site Reliability Engineer, you'll enhance system resilience, automate tasks, and improve infrastructure for the Intelligence Community. You'll need significant Linux experience and programming knowledge.
Top Skills: ConfluenceDockerGitGoHpJavaJenkinsJIRAKubernetesLinuxNessusPackerPythonRust
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account