Get the job you really want.

Top Site Reliability Engineer Jobs

18 Days AgoSaved
In-Office
Santa Clara, CA, USA
147K-235K Annually
Senior level
147K-235K Annually
Senior level
Cybersecurity
As a Principal SRE, build and maintain secure cloud infrastructure, drive automation, and ensure operational excellence in a FedRAMP compliant environment.
Top Skills: BackstageBashDockerFirehydrantGCPGitlab Ci/CdGitopsGoGrafanaJavaKubernetesLokiMySQLNode.jsPagerdutyPrometheusPythonTerraform
18 Days AgoSaved
Hybrid
Burlingame, CA, USA
195K-250K Annually
Senior level
195K-250K Annually
Senior level
Healthtech • Software • Biotech
Lead the evolution of the company's cloud-native infrastructure on AWS and Kubernetes, enhance CI/CD processes, and mentor engineering teams.
Top Skills: Aws,Eks,S3,Iam,Circleci,Github Actions,Argocd,Terrraform,Datadog,Honeycomb
18 Days AgoSaved
Remote
5 Locations
Mid level
Mid level
Blockchain • Fintech • Information Technology • Payments • Software • Financial Services • Cryptocurrency
The DevOps/SRE Engineer will ensure the reliability and scalability of crypto and fintech products, automate operations, and optimize system performance.
Top Skills: AnsibleAWSBashChefDigitaloceanGCPGitGnu/LinuxGrafanaHetznerJavaScriptKubernetesPostgresPrometheusPulumiPuppetPythonRabbitMQRedisTerraformTypescriptZabbix
Reposted 18 Days AgoSaved
In-Office
Boston, MA, USA
Senior level
Senior level
Hardware • Quantum Computing
The Sr. SRE will design and operate reliable systems, focusing on automation, incident management, and infrastructure optimization. Collaborates cross-functionally to ensure operational excellence.
Top Skills: AnsibleAWSAzureBashGCPGoGrafanaHelmKubernetesOpentelemetryPrometheusPythonTerraform
Reposted 19 Days AgoSaved
Remote
USA
200K-250K
Senior level
200K-250K
Senior level
Software • Cryptocurrency
Manage and scale Kubernetes clusters, automate infrastructure, optimize performance, maintain blockchain nodes, and improve system reliability while collaborating with product teams.
Top Skills: Aws (Ec2Aws EksDatadogDockerIam)KubernetesOpentelemetryPulumiRdsS3Terraform
Reposted 19 Days AgoSaved
Hybrid
San Jose, CA, USA
119K-170K Annually
Senior level
119K-170K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
The Staff Site Reliability Engineer will manage FedRAMP cloud products, perform operational duties, enhance monitoring systems, and automate cloud infrastructure.
Top Skills: AnsibleAWSGovcloudKubernetesLinuxPythonTerraform
Reposted 19 Days AgoSaved
Hybrid
Bellevue, WA, USA
119K-170K Annually
Senior level
119K-170K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
The Staff Site Reliability Engineer will manage FedRAMP cloud product operations, automate processes, handle incidents, and ensure compliance in a hybrid role.
Top Skills: AnsibleAws GovcloudKubernetesLinuxNetworkingPythonTerraform
Reposted 19 Days AgoSaved
In-Office
2 Locations
Senior level
Senior level
eCommerce
The Staff Back-end Engineer (SRE) will build, run, and scale ecommerce systems, ensuring reliability and performance for customer-facing services, while utilizing automation and best practices.
Top Skills: AWSAzureDatadogDockerElastic StackGoGoogle Cloud PlatformGrafanaJavaKubernetesNew RelicPrometheusPythonRuby
Reposted 20 Days AgoSaved
In-Office
Reston, VA, USA
109K-147K
Senior level
109K-147K
Senior level
Information Technology • Software
The Site Reliability Engineer will manage and scale infrastructure, automate deployments, and lead efforts in operational process management while participating in a 24x7 on-call rotation.
Top Skills: AnsibleDockerFreebsdFreeipaJenkinsKubernetesLinuxOpenstackPythonRedhat Enterprise LinuxTerraform
Reposted 20 Days AgoSaved
In-Office or Remote
2 Locations
Senior level
Senior level
Artificial Intelligence • Software • Generative AI
As a Site Reliability Engineer, you'll design and maintain cloud infrastructure, automate provisioning, ensure system reliability, and mentor junior engineers while leveraging various technologies to optimize performance and security.
Top Skills: AWSAzureDockerElk StackGCPGoGrafanaJavaKubernetesPrometheusPythonScalaTerraform
Reposted 20 Days AgoSaved
Hybrid
Atlanta, GA, USA
Senior level
Senior level
Software
The Principal Site Reliability Engineer will enhance system reliability, implement monitoring systems, collaborate across teams, and ensure platform uptime and performance.
Top Skills: AWSAzureDatadogGCPGrafanaJavaKubernetesNode.jsPrometheusPython
Reposted 20 Days AgoSaved
In-Office or Remote
San Francisco, CA, USA
Mid level
Mid level
Artificial Intelligence • Generative AI
Lead GPU cluster design and operations, manage Kubernetes, implement Infrastructure-as-Code, and develop observability stacks for high-performance AI models.
Top Skills: AnsibleArgo CdBashEbpfFluxGitopsGrafanaHelmInfinibandKubernetesNvidia DcgmOpentelemetryPrometheusPythonRdmaTerraform
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 20 Days AgoSaved
In-Office or Remote
2 Locations
Senior level
Senior level
Healthtech
The role involves designing and implementing the platform, managing CI/CD pipelines, automating tasks, and maintaining cloud environments.
Top Skills: AnsibleAWSDockerGCPGitGroovyIacMySQLPHPTerraform
Reposted 20 Days AgoSaved
In-Office
Midtown, TN, USA
Mid level
Mid level
Gaming
Manage operational tasks for gaming services, design runtime environments, monitor metrics, optimize architecture, and research software solutions.
Top Skills: C/C++GoIstioJavaK8SLinuxMySQLNginxPythonRustShell
Reposted 20 Days AgoSaved
Remote
United States
201K-287K Annually
Senior level
201K-287K Annually
Senior level
Cloud • Security • Software • Cybersecurity
As a Staff Site Reliability Engineer, you will lead SRE initiatives, mentor engineers, ensure system reliability, and drive strategic engineering practices globally.
Top Skills: C#GoGrafanaJavaJavaScriptKubernetesOpentelemetryPrometheusPulumiTerraformTypescript
Reposted 20 Days AgoSaved
Remote
United States
215K-307K Annually
Expert/Leader
215K-307K Annually
Expert/Leader
Cloud • Security • Software • Cybersecurity
The Principal Site Reliability Engineer will lead Veeam's global SRE efforts, focusing on architecture, reliability strategies, and mentorship while influencing cross-functional teams.
Top Skills: Automation ToolingCloud InfrastructureCloud-Native DevelopmentDistributed Systems
21 Days AgoSaved
In-Office
2 Locations
159K-230K Annually
Senior level
159K-230K Annually
Senior level
Artificial Intelligence • Big Data • Machine Learning • Software
The role involves designing and implementing custom installations of the C3 AI Platform for Federal customers, ensuring uptime, and automating system processes while collaborating with cross-functional teams.
Top Skills: AnsibleAWSAzureBashKubernetesLinuxPuppetPythonRubyTerraform
21 Days AgoSaved
In-Office
Gateway Trailer Park, Jacksonville, FL, USA
60K-86K Annually
Entry level
60K-86K Annually
Entry level
Fintech • Financial Services
Responsible for network deployments, automation, and system monitoring. Collaborates with teams to enhance network design and performance, ensuring scalability and security.
Top Skills: AnsibleAristaBgpCiscoCloudFormationDatadogFortinetGitJSONJuniperLinuxMplsOspfPrometheusPythonStpTerraformUnixVxlanYaml
21 Days AgoSaved
In-Office
Chicago, IL, USA
130K-170K
Senior level
130K-170K
Senior level
Agency • Cloud • Information Technology • Mobile • Software
The role involves designing and implementing observability solutions using OpenTelemetry, managing infrastructure through IaC, and establishing SRE practices. Strong expertise in cloud and DevOps engineering is required.
Top Skills: ArgocdAWSAzureBashCloudFormationDockerGCPGithub ActionsGitlab CiGoJavaJenkinsKubernetesNode.jsOpentelemetryPowershellPulumiPythonRustTerraform
Reposted 21 Days AgoSaved
In-Office
Chicago, IL, USA
70K-80K
Junior
70K-80K
Junior
Food • Internet of Things
As a Site Reliability Engineer, you will manage cloud infrastructure, implement SRE best practices, automate tasks, and collaborate with teams for system reliability and performance.
Top Skills: AnsibleAWSAzureBashCircleCIDockerElk StackGCPGithub ActionsGrafanaJenkinsKubernetesLinuxPrometheusPythonTerraformUnix
Reposted 21 Days AgoSaved
In-Office
Korea, KY, USA
Senior level
Senior level
Software
As a Lead SRE at Commvault, you'll ensure the quality and reliability of the Clumio Data Platform in AWS, collaborating across teams to enhance infrastructure and maintain SLAs.
Top Skills: AWSDockerIp NetworkingItilKubernetesLinuxPythonTerraform
Reposted 21 Days AgoSaved
In-Office
3 Locations
120K-200K Annually
Senior level
120K-200K Annually
Senior level
Software
The Site Reliability Engineer will enhance system reliability, improve tooling, oversee incident processes, and collaborate on software maintenance across distributed systems.
Top Skills: ClickhouseGrpcKafkaMongoDBNoSQLPostgresRedpanda
Reposted 21 Days AgoSaved
In-Office or Remote
47 Locations
Senior level
Senior level
Artificial Intelligence • Blockchain • Internet of Things • Machine Learning • Software • App development • Automation
As a Staff SRE, you will ensure the reliability, scalability, and performance of systems, lead incident management, and drive automation efforts.
Top Skills: AnsibleAWSAzureBashDockerElk StackGCPGitlab CiGoGrafanaJavaJenkinsKubernetesPrometheusPythonTerraform
Reposted 21 Days AgoSaved
Remote
USA
Senior level
Senior level
Artificial Intelligence • Blockchain • Internet of Things • Machine Learning • Software • App development • Automation
Join the Gigster Talent Network as an SRE Support Engineer, providing support for scalable applications and cloud services, including troubleshooting and improving internal tools.
Top Skills: AnsibleAWSBashDatadogDockerGCPGrafanaKafkaKubernetesPrometheusPuppetPythonSparkSplunkTerraform
Reposted 21 Days AgoSaved
Remote
United States
Senior level
Senior level
Cloud • Software
As a Site Reliability Engineer, you'll manage technical escalations, ensure system reliability, collaborate with engineering teams, and participate in on-call rotations.
Top Skills: AnsibleAzureBashC#ChefElkGitGithub ActionsGitlabGrafanaJenkinsLinux/UnixPrometheusPulumiPythonSplunkSvnTcp/IpTerraform
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account