Get the job you really want.

Top Site Reliability Engineer Jobs

12 Days AgoSaved
In-Office
2 Locations
164K-222K Annually
Expert/Leader
164K-222K Annually
Expert/Leader
Security
The Director of DevSecOps and SRE will lead teams in SRE, Cloud Infrastructure, and DevOps practices, focusing on automation, infrastructure reliability, and security policies while mentoring engineers and managing software projects.
Top Skills: Aws Cloud TechnologiesGitlabGrafanaJavaKubernetesLokiMaterial UiPostgresPrometheusRabbitMQReactReduxSentrySpringTailwindTerraform
Reposted 12 Days AgoSaved
In-Office
Springfield, MO, USA
Senior level
Senior level
Retail • Software
The Lead Architect is responsible for designing, implementing, and managing cloud infrastructure and DevOps/SRE solutions, focusing on Azure and infrastructure automation.
Top Skills: AnsibleAzureBashDockerElastic CloudGCPGithub ActionsKubernetesOpentelemetryPythonTerraform
Reposted 12 Days AgoSaved
In-Office
2 Locations
150K-224K Annually
Senior level
150K-224K Annually
Senior level
Cloud • Information Technology • Security • Software
Design, build, and operate network infrastructure for cloud and on-prem environments, ensuring reliability, scalability, and security through automation and observability.
Top Skills: AnsibleAws VpcAzure VnetsBgpDnsElkEnvoyFirewallsGcp VpcGoGrafanaNginxOpentelemetryOspfPrometheusPythonTcp/IpTerraformTransit GatewayVlans
Reposted 12 Days AgoSaved
In-Office
2 Locations
177K-265K Annually
Senior level
177K-265K Annually
Senior level
Cloud • Information Technology • Security • Software
The role involves designing, building, and operating infrastructure systems, focusing on automation, reliability, and security for cloud and on-prem environments while collaborating closely with engineering teams.
Top Skills: AnsibleBashCi/CdCloudFormationDockerElkGoGrafanaKubernetesLinuxOpentelemetryPrometheusPythonTerraform
Reposted 12 Days AgoSaved
Easy Apply
In-Office
Cape Canaveral, FL, USA
Easy Apply
Senior level
Senior level
Aerospace • Other
The Sr. IT Linux Site Reliability Engineer will manage and optimize Kubernetes clusters, automate systems, and foster collaboration to support SpaceX's engineering teams and infrastructure needs.
Top Skills: AnsibleDockerGitGoGrafanaHelmJSONKubernetesLinuxPrometheusPythonTerraformYaml
Reposted 12 Days AgoSaved
Easy Apply
In-Office
Redmond, WA, USA
Easy Apply
160K-220K Annually
Senior level
160K-220K Annually
Senior level
Aerospace • Other
The Sr. IT Linux Site Reliability Engineer will manage and optimize Kubernetes environments, automate infrastructure, and collaborate on software platforms ensuring high performance and reliability.
Top Skills: AnsibleGitGoGrafanaHelmInfluxdbJenkinsJsonnetKubernetesLinuxPrometheusPythonTerraformVMwareYaml
Reposted 12 Days AgoSaved
Remote
USA
200K-250K Annually
Senior level
200K-250K Annually
Senior level
Software • Cryptocurrency
Manage and scale Kubernetes clusters, automate infrastructure, optimize performance, maintain blockchain nodes, and improve system reliability while collaborating with product teams.
Top Skills: Aws (Ec2Aws EksDatadogDockerIam)KubernetesOpentelemetryPulumiRdsS3Terraform
Reposted 12 Days AgoSaved
Easy Apply
In-Office
Midtown, TN, USA
Easy Apply
Mid level
Mid level
Gaming
Manage operational tasks for gaming services, design runtime environments, monitor metrics, optimize architecture, and research software solutions.
Top Skills: C/C++GoIstioJavaK8SLinuxMySQLNginxPythonRustShell
Reposted 12 Days AgoSaved
Remote
Texas, USA
Mid level
Mid level
Blockchain
The Blockchain Site Reliability Engineer is responsible for maintaining blockchain nodes' reliability, monitoring, incident response, and building automation tools to enhance operations.
Top Skills: DockerElkGoGrafanaJavaScriptKubernetesLinuxPrometheusPythonRustShell
Reposted 12 Days AgoSaved
Easy Apply
In-Office
San Jose, CA, USA
Easy Apply
175K-250K Annually
Senior level
175K-250K Annually
Senior level
Artificial Intelligence • Robotics • Automation • Manufacturing
Responsible for managing and setting up internal systems infrastructure, migrating SaaS to self-hosted solutions, implementing monitoring systems, and ensuring security compliance.
Top Skills: AnsibleAWSAzureCloudFormationDatadogDnsGCPGrafanaHTTPLinux/UnixPrometheusTcp/IpTerraform
Reposted 12 Days AgoSaved
In-Office
San Jose, CA, USA
146K-339K Annually
Expert/Leader
146K-339K Annually
Expert/Leader
Artificial Intelligence • Information Technology • Software
Responsible for configuring, monitoring, and maintaining systems in global data centers, developing automation, and enhancing system performance. Mentors teams and collaborates with various departments on architectural solutions.
Top Skills: AnsibleDistributed Storage SystemsDockerGitlabJenkinsKubernetesLinuxPackerPythonTerraform
Reposted 18 Days AgoSaved
Easy Apply
Remote
31 Locations
Easy Apply
130K-140K Annually
Senior level
130K-140K Annually
Senior level
Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
The Senior Site Reliability Engineer will manage system incidents, enhance monitoring and database infrastructure, and collaborate on scalable systems to maintain reliability as usage scales.
Top Skills: AWSClickhouseKubernetesMySQLPostgresRedis
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
Reposted 18 Days AgoSaved
Hybrid
Austin, TX, USA
105K-155K Annually
Senior level
105K-155K Annually
Senior level
Gaming • Information Technology • Mobile • Software • Esports
As a Senior Site Reliability Engineer, you will enhance service infrastructure, develop automation tools, and support integration and maintenance of backend services in a large-scale environment.
Top Skills: AWSBambooC#C++CassandraCiscoDataflowF5FlinkGitJavaJenkinsKinesisLinuxMavenMongodbMySQLPerforcePuppetPythonRedisShell ScriptTeam CityVertica
Reposted 18 Days AgoSaved
Easy Apply
Remote or Hybrid
7 Locations
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
Manage continuous delivery infrastructure for reliable code deployment. Collaborate with teams to streamline onboarding, support deployment systems, and participate in on-call rotations.
Top Skills: Argo WorkflowsArgocdAWSAzureGoGoogle Cloud PlatformKubernetesPython
Reposted 13 Days AgoSaved
Remote
USA
Mid level
Mid level
Blockchain • Web3
As a Site Reliability Engineer, you'll enhance observability, logging, and tracing, collaborating with engineers to optimize performance and security of infrastructure.
Top Skills: AnsibleAWSAws CdkGCPGitGoGrafanaKubernetesLgtmLokiMimirOpentelemetryPrometheusRustSentryTempoTerraformTypescriptWebassembly
Reposted 13 Days AgoSaved
Hybrid
San Francisco, CA, USA
120K-140K Annually
Junior
120K-140K Annually
Junior
Artificial Intelligence • Security • Software
You will develop and improve cloud infrastructure, support distributed systems, and write infrastructure-as-code while collaborating across teams.
Top Skills: AWSCloudFormationDockerGoJavaKubernetesPythonTerraform
Reposted 13 Days AgoSaved
Remote
United States
165K-200K Annually
Senior level
165K-200K Annually
Senior level
Cloud • Information Technology
As a Staff Site Reliability Engineer, you will enhance cloud product lines, ensuring real-time scalability, collaborating with teams, and automating builds.
Top Skills: AnsibleAWSAzureBashDnsDockerEnvoyGCPGitGoGrafanaHaproxyHTTPJenkinsKafkaKubernetesLinuxMySQLOciOpentelemetryPostgresPrometheusPuppetPythonRedisTcp/IpTelegrafTerraformTls
Reposted 13 Days AgoSaved
In-Office
Auburn Hills, MI, USA
3-3 Annually
Mid level
3-3 Annually
Mid level
Automotive
The SRE Engineer will monitor and improve cloud and data platforms, lead incident response, maintain AWS infrastructure, and drive service reliability using monitoring tools and automation scripts.
Top Skills: AWSAws CliBashCi/CdDatabricksDelta LakeFlinkGrafanaHadoopIcebergKafkaKinesisKubernetesPowershellPrometheusPythonSparkSqsTerraform
Reposted 13 Days AgoSaved
In-Office
Chicago, IL, USA
20-43 Hourly
Internship
20-43 Hourly
Internship
Artificial Intelligence • Automotive • Internet of Things • Software
As an SRE intern, you will work on system reliability, design dashboards, collaborate with teams, and apply machine learning techniques.
Top Skills: AWSJavaJavaScriptKubernetesPythonTerraform
Reposted 13 Days AgoSaved
In-Office or Remote
Location, WV, USA
Mid level
Mid level
Healthtech • Telehealth
Seeking a Site Reliability Engineer to ensure availability and performance of cloud infrastructure. Responsibilities include observability solutions, incident response, and collaboration with teams to improve reliability and service health.
Top Skills: AnsibleAWSAzureAzure MonitorBashCloudwatchDynatraceElasticGrafanaPowershellPythonTerraform
Reposted 22 Days AgoSaved
In-Office
Costa Mesa, CA, USA
191K-208K Annually
Senior level
191K-208K Annually
Senior level
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
As a Site Reliability Engineer, develop solutions for deployment engineers, ensure scalable system delivery, and improve operational capabilities for military technologies.
Top Skills: C++Cloud TechnologiesCybersecurityGoNetworkingPythonRust
14 Days AgoSaved
Easy Apply
In-Office
2 Locations
Easy Apply
175K-185K Annually
Senior level
175K-185K Annually
Senior level
Fintech • Mobile • Security • Software • Cybersecurity
Seeking a Staff Site Reliability Engineer to design, deploy and maintain high-availability infrastructure, ensuring reliability, performance, and security while collaborating with engineering teams.
Top Skills: AWSElk StackGoGrafanaJaegerJavaKubernetesOpentelemetryOpentofuPrometheusPythonTerraform
Reposted 14 Days AgoSaved
Remote
United States
165K-200K Annually
Expert/Leader
165K-200K Annually
Expert/Leader
Cloud • Information Technology
As a Staff Platform Engineer, you'll develop and maintain infrastructure components using Go and Node.js, improve service reliability, mentor juniors, and manage data ecosystems.
Top Skills: EnvoyExpressGoJenkinsKafkaMySQLNode.jsPostgresPuppetPythonReactRedis
Reposted 14 Days AgoSaved
In-Office or Remote
8 Locations
Mid level
Mid level
Sports
Manage and improve the AWS infrastructure, deploy into new regions, monitor releases, and implement new technologies in a fast-paced environment.
Top Skills: AWSDockerGrafanaKubernetesPrometheusPython
14 Days AgoSaved
Easy Apply
In-Office
San Jose, CA, USA
Easy Apply
133K-200K Annually
Senior level
133K-200K Annually
Senior level
Aerospace
The Staff Site Reliability Engineer will design SRE procedures, engineer SLOs, build tooling, and create dashboards. The role includes automating tasks and collaborating within the SRE team.
Top Skills: DatadogGoJaegerKubernetesPrometheusPulumiPythonTerraform
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account