Get the job you really want.

Top Site Reliability Engineer Jobs

Reposted 20 Days AgoSaved
Easy Apply
In-Office
San Francisco, CA, USA
Easy Apply
150K-200K Annually
Junior
150K-200K Annually
Junior
Artificial Intelligence • Information Technology
As a Site Reliability Engineer, maintain user-facing services, implement best practices for reliability, and manage production incidents.
Top Skills: AnsibleCloud ServicesKubernetesProgramming LanguagesTerraform
Reposted 20 Days AgoSaved
Easy Apply
In-Office
Chicago, IL, USA
Easy Apply
130K-170K Annually
Senior level
130K-170K Annually
Senior level
Artificial Intelligence • Cloud • Information Technology • Mobile • Software • Consulting
The role involves designing and implementing observability solutions using OpenTelemetry, managing infrastructure through IaC, and establishing SRE practices. Strong expertise in cloud and DevOps engineering is required.
Top Skills: ArgocdAWSAzureBashCloudFormationDockerGCPGithub ActionsGitlab CiGoJavaJenkinsKubernetesNode.jsOpentelemetryPowershellPulumiPythonRustTerraform
Reposted 20 Days AgoSaved
Easy Apply
In-Office
Palo Alto, CA, USA
Easy Apply
180K-440K Annually
Senior level
180K-440K Annually
Senior level
Information Technology
The Senior Site Reliability Engineer will design and optimize Kubernetes clusters, manage infrastructure with IaC tools, and enhance system reliability while collaborating with teams.
Top Skills: AnsibleCluster ApiCniCriCsiKubernetesPulumiTerraform
Reposted 20 Days AgoSaved
Easy Apply
In-Office
New York, NY, USA
Easy Apply
175K-245K Annually
Mid level
175K-245K Annually
Mid level
Financial Services
As a Site Reliability Engineer, you'll ensure high availability of Commodities Technology applications, automate processes, and contribute to incident analysis and monitoring systems.
Top Skills: AnsibleAWSC#DatadogDockerKubernetesLinuxPowershellPythonTerraformWindows
Reposted 20 Days AgoSaved
In-Office
11 Locations
125K-150K Annually
Senior level
125K-150K Annually
Senior level
Legal Tech
Lead the design and automation of enterprise network infrastructures, managing cloud and on-premises networks with a focus on security and scalability.
Top Skills: AnsibleAWSAzureBashBgpEvpnFortianalyzerFortinetFortinet Sd-WanFrroutingLinuxNsxNvidia CumulusOspfPalo AltoPanoramaPowershellPythonSolarwindsSonicTerraformVcfVMware
Reposted 2 Days AgoSaved
Remote
United States
150K-200K Annually
Mid level
150K-200K Annually
Mid level
Software
As a Senior Site Reliability Engineer at Regrello, you'll shape the developer platform, collaborate with customers, and ensure the reliability and security of infrastructure and applications.
Top Skills: AWSAzureCircleCIGCPGithub ActionsGitlab CiGoKubernetesTerraform
Reposted 21 Days AgoSaved
Easy Apply
In-Office
Fort Meade, MD, USA
Easy Apply
Senior level
Senior level
Information Technology • Security • Software
Manage daily operations of a classified NOC, focusing on Kubernetes services, incident response, system monitoring, and ensuring security and availability.
Top Skills: Aws GovcloudAzure GovernmentC2EC2SDockerElastic StackFluentdFluxGrafanaHelmJIRAJwccKubernetesOsticketPrometheusTerraform
21 Days AgoSaved
Remote
United States
180K-220K Annually
Expert/Leader
180K-220K Annually
Expert/Leader
Information Technology • Cybersecurity
The Director of SRE will oversee cloud infrastructure scalability, reliability, COGS optimization, and lead a team of SRE professionals while ensuring compliance and security of services.
Top Skills: AWSAzureCi/CdCloudFormationDatadogGCPGrafanaKubernetesPrometheusPulumiSplunkTerraform
Reposted 21 Days AgoSaved
In-Office or Remote
Berkeley, CA, USA
205K-235K Annually
Senior level
205K-235K Annually
Senior level
Financial Services
The Senior Cluster Site Reliability Engineer will enhance the research compute cluster's uptime, reliability, and performance through engineering and operational improvements, ensuring high availability for researchers working on machine learning problems.
Top Skills: AnsibleAWSAWSCephDockerElkGCPGCPGrafanaHorovodHpcInfinibandKubeflowKueueLokiLustreMlflowOpentelemetryPodmanPrometheusPythonRdmaRubyS3SingularitySlurmTerraform
22 Days AgoSaved
In-Office
Ashburn, VA, USA
125K-181K Annually
Senior level
125K-181K Annually
Senior level
Fintech • Information Technology • Payments
The Staff SRE will improve system reliability, lead incident resolution, automate tasks, and support cloud migration efforts while ensuring secure software delivery.
Top Skills: AWSEnterprise Monitoring ToolsMicrosoft StackMiddleware TechnologiesOrchestration ToolsPowershell
22 Days AgoSaved
In-Office
San Francisco, CA, USA
165K-250K Annually
Senior level
165K-250K Annually
Senior level
Artificial Intelligence • Healthtech • Information Technology • Software
As a Site Reliability Engineer, you will manage the production environment, focusing on infrastructure design, automation, and optimizing deployment pipelines to ensure high availability.
Top Skills: HelmKafkaKubernetesPostgresPythonRedisTerraformTypescript
22 Days AgoSaved
In-Office
Lovelace, NC, USA
Senior level
Senior level
Artificial Intelligence • Machine Learning • Security • Database • Analytics • Big Data Analytics
As a Site Reliability Engineer, you'll ensure the availability and performance of AI applications, maintain infrastructure, automate tasks, and troubleshoot issues in high-scale environments.
Top Skills: AnsibleAWSAzureBashCircleCICloudFormationDatadogDockerDynatraceEc2Elk StackGCPGitlab CiGoGrafanaJenkinsKubernetesLambdaLinuxPrometheusPythonS3TerraformUnix
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
Reposted 22 Days AgoSaved
In-Office
2 Locations
125K-135K Annually
Senior level
125K-135K Annually
Senior level
Fintech • Financial Services
Lead the monitoring, automation, and incident response processes for platform stability, collaborating with cross-functional teams to enhance service reliability and performance optimization.
Top Skills: AnsibleAWSAzureBashBigpandaDynatraceGCPGithub ActionsJenkinsLogscaleMonproPowershellPythonTerraform
Reposted 4 Days AgoSaved
In-Office
Seattle, WA, USA
160K-250K Annually
Mid level
160K-250K Annually
Mid level
Artificial Intelligence • Cloud • Software
The Senior Site Reliability Engineer will automate operations, optimize workflows for teams, manage secure infrastructure, and participate in on-call duties.
Top Skills: AristaAWSBashCephChefCifsCiscoDnsDockerElk StackFortinetHpHTTPIcmpIscsiJenkinsKubernetesLinux/Debian Family/UbuntuMesosphereNfsNode.jsPivotal GreenplumPostgresPythonRabbitMQRubyS3ScyllaSshSslSupermicroTcpTls
Reposted 22 Days AgoSaved
Easy Apply
In-Office or Remote
Columbia, MD, USA
Easy Apply
162K-216K Annually
Senior level
162K-216K Annually
Senior level
Security • Software
The role involves developing and managing Tenable's cloud products, ensuring reliability and availability, automating systems, and collaborating on cloud technologies while meeting FedRAMP compliance.
Top Skills: AWSAzureDockerGCPGradleHelmKubernetesNode.jsPythonTerraform
Reposted 22 Days AgoSaved
Easy Apply
In-Office
2 Locations
Easy Apply
180K-440K Annually
Mid level
180K-440K Annually
Mid level
Information Technology
As a Site Reliability Engineer, you'll design and operate scalable storage systems and optimize performance for AI research data management.
Top Skills: GoKubernetesPulumiRust
Reposted 22 Days AgoSaved
Remote or Hybrid
Colorado, USA
125K-150K Annually
Mid level
125K-150K Annually
Mid level
Artificial Intelligence • HR Tech • Legal Tech • Marketing Tech • Software • Conversational AI • Generative AI
The Site Reliability Engineer will enhance SaaS solutions' stability and scalability by automating workflows, monitoring systems, and responding to incidents.
Top Skills: AnsibleAWSAzureDatadogDynatraceNew RelicPuppetTerraform
Reposted 22 Days AgoSaved
Easy Apply
In-Office
Reston, VA, USA
Easy Apply
109K-147K Annually
Mid level
109K-147K Annually
Mid level
Information Technology • Software
The SRE will manage Verisign's data platform by architecting, deploying, and ensuring the stability and performance of large-scale data systems, while collaborating with multiple teams for customer support and infrastructure improvements.
Top Skills: AnsibleDockerDruidHadoopJenkinsKafkaKubernetesPythonSpark
Reposted 22 Days AgoSaved
Easy Apply
In-Office
Reston, VA, USA
Easy Apply
136K-184K Annually
Senior level
136K-184K Annually
Senior level
Information Technology • Software
Build and maintain Verisign's Kubernetes platform, enforce security practices, monitor performance, and provide tier 3 support. Requires extensive experience with Kubernetes and related technologies.
Top Skills: GitJIRAKubernetesLinuxPythonTerraformUnix
Reposted 22 Days AgoSaved
Remote
2 Locations
Senior level
Senior level
Artificial Intelligence • Fintech • Software • Financial Services
Seeking a seasoned SRE to lead reliability for a cloud-native platform, overseeing infrastructure, CI/CD pipelines, observability, and mentoring engineers.
Top Skills: AWSClickhouseGoJavaKafkaKubernetesPulumiTerraform
Reposted 22 Days AgoSaved
In-Office
The Center, IN, USA
175K-175K Annually
Senior level
175K-175K Annually
Senior level
eCommerce • Retail • Software
The Director of Site Reliability Engineering will lead cloud deployment strategies, enhance automation and scalability, and mentor the engineering team.
Top Skills: AnsibleApacheChefDockerGithub ActionsJenkinsKubernetesMongoDBMySQLNginxTerraform
Reposted 22 Days AgoSaved
Remote
US
166K-293K Annually
Senior level
166K-293K Annually
Senior level
Artificial Intelligence • Software
As a Principal Site Reliability Engineer, you will design hybrid infrastructure, integrate edge devices and cloud resources, optimize performance and costs, and collaborate with cross-functional teams to ensure robust systems.
Top Skills: AWSGoKubernetesLinuxPythonTerraformTerragrunt
Reposted 22 Days AgoSaved
Hybrid
Sunnyvale, CA, USA
204K-247K Annually
Senior level
204K-247K Annually
Senior level
Cloud • Greentech • Other • Energy
As a Staff Site Reliability Engineer focused on storage, you'll ensure the reliability and performance of cloud storage systems while optimizing distributed, fault-tolerant architectures for AI workloads.
Top Skills: AnsibleCCephDockerGlusterfsGoIscsiJavaKubernetesNfsNvme-OfOpenebsPuppetPythonSmbTerraform
Reposted 22 Days AgoSaved
Hybrid
San Francisco, CA, USA
204K-247K Annually
Senior level
204K-247K Annually
Senior level
Cloud • Greentech • Other • Energy
The role involves ensuring reliability of AI-optimized cloud services, focusing on design, automation, and performance for AI workloads.
Top Skills: C++GoJavaKubernetesPython
23 Days AgoSaved
In-Office
Palo Alto, CA, USA
106K-199K Annually
Senior level
106K-199K Annually
Senior level
Gaming • Software • Metaverse
The Senior Distributed Storage SRE Engineer manages distributed storage systems, ensuring stability, designing disaster recovery solutions, and optimizing performance. Responsibilities include incident response, tool development, and resource management.
Top Skills: GoLinuxPythonShellTcp/IpUnix
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account