Get the job you really want.
Be the first applicant
Apply to jobs posted less than 24 hours ago to maximize your visibility.
Use the Date Posted filter to view jobs posted within the last 24 hours.

Top Site Reliability Engineer Jobs

14 Days Ago
Hoffman Estates, IL, USA
110K-140K Annually
Senior level
110K-140K Annually
Senior level
Artificial Intelligence • Machine Learning • Software
The Sr. Site Reliability Engineer will manage solutions and cloud infrastructures, ensuring reliability, scalability, and performance of enterprise-grade systems while collaborating across teams and responding to incidents efficiently.
Top Skills: AngularApi GatewayAWSCloudFormationDynamoDBEcsF5IstioJavaKafkaMicroservicesMongoDBNginxNode.jsOraclePostgresRabbitMQReactTerraform
13 Days Ago
4 Locations
140K-180K Annually
Mid level
140K-180K Annually
Mid level
Fintech
Responsible for the reliability, automation, and scalability of application infrastructure while implementing observability tools and improving performance and security standards.
Top Skills: AWSCi/CdDockerGoJavaJavaScriptKubernetesPythonRuby
114K-170K Annually
Senior level
Fintech • Payments • Financial Services
The Senior Site Reliability Engineer will ensure IT service reliability, manage observability tools, drive automation, and support operational excellence.
Top Skills: AkamaiCdnCloudtrailCloudwatchDynatraceGitJenkinsNexusRallySonarqubeSplunk
14 Days Ago
Remote
3 Locations
154K-287K Annually
Senior level
154K-287K Annually
Senior level
Artificial Intelligence • Digital Media • Marketing Tech • Software
The Senior Site Reliability Engineer will enhance application reliability through automation, manage infrastructure, and collaborate across teams to improve security and performance.
Top Skills: Apollo ClientArgoAws ServicesElixirGitGoGraphQLK8SNext.JsPostgresPythonReactTerraformTypescript
133K-164K Annually
Mid level
Insurance
Join as a Senior Site Reliability Engineer to improve system resilience, automate processes, and lead incident management for a billion-dollar company.
Top Skills: AnsibleBashC#GitJavaPowershellPythonTerraform
10 Days Ago
Remote
2 Locations
Senior level
Senior level
Logistics • Robotics
Lead Root Cause Analysis for critical production outages, improving system resilience by collaborating with cross-functional teams and driving closure on investigation tickets.
Top Skills: ElasticGitlabGrafanaKubernetesLogic MonitorPower BIPrometheusTableauVMware
10 Days Ago
Newark, NJ, USA
Senior level
Senior level
Fintech • Financial Services
As a Senior Site Reliability Engineer, you'll enhance application reliability and performance, manage infrastructure, and lead technical projects. You'll also oversee on-call support and report to senior leadership, requiring strong problem-solving skills and ability to adapt to change.
Top Skills: Active DirectoryAnsibleCertificate ManagementCrushftpGoodsyncJenkinsLinux ServerMessage QueuesMonitoring ToolsNetworkingStorage AdministrationVirtualizationWindows Server
10 Days Ago
Remote
USA
150K-215K Annually
Senior level
150K-215K Annually
Senior level
Hardware • Machine Learning • Security • Software
The Senior Site Reliability Engineer will design systems, improve reliability, manage CI/CD processes, and enhance monitoring platforms for the Aerodome platform.
Top Skills: AWSGrafanaPrometheusTerraform
16 Days Ago
3 Locations
151K-227K Annually
Senior level
151K-227K Annually
Senior level
Cloud • Information Technology • Security • Software
Seeking a Senior Site Reliability Engineer to lead the implementation of data fabric capabilities, manage internal pipelines, mentor junior team members and drive technology adoption while ensuring cloud service delivery.
Top Skills: AWSCi/CdGitlabGCPKubernetesLinuxTerraformUnix
17 Days Ago
2 Locations
100K-168K Annually
Mid level
100K-168K Annually
Mid level
eCommerce • Fintech • Information Technology • Payments • Software
As a Senior Site Reliability Engineer, you will operate payment infrastructure, leading incident management, automation, monitoring, and collaboration for stability and performance improvements.
Top Skills: AnsibleAzureDynatraceHarnessHelmIbm TechnologiesOraclePerlPythonSplunkTerraformUnixWebsphere
Reposted 10 Days Ago
San Jose, CA, USA
Senior level
Senior level
Artificial Intelligence • Cloud • Fintech • Healthtech • Biotech
The role involves designing automated solutions for large-scale systems, enhancing system stability, and developing monitoring tools to improve operational performance.
Top Skills: C++DockerGoJavaJavaScriptKubernetesMySQLNginxPythonRedis
16 Days Ago
Remote
United States
Senior level
Senior level
Blockchain • Software
As a Senior SRE, you will enhance reliability across our cloud infrastructure by automating tasks, designing robust systems, and improving collaboration between dev and ops teams.
Top Skills: ArgocdAWSCi/CdGithub ActionsGoHelmJavaScriptKubernetesOpentelemetryPrometheusPythonRustTerraform
17 Days Ago
Frisco, TX, USA
Senior level
Senior level
Gaming • Software • Esports
The Senior Site Reliability Engineer will design and implement cloud-native solutions, focusing on observability and reliability, while mentoring developers and participating in on-call support.
Top Skills: AWSBashCi/CdDockerGoOpentelemetryPythonTerraform
21 Days Ago
Hybrid
San Mateo, CA, USA
Mid level
Mid level
Cloud • Fintech • Information Technology • Machine Learning • Software
As an Engineer in SRE Observability, you will enhance operational excellence, build monitoring tools, and support teams to improve software reliability.
Top Skills: C#CicdDatadogDockerDynatraceGoIacJavaScriptKubernetesLinuxNew RelicOpen TelemetryPythonScalyrSignalfxSplunkSumo Logic
Senior level
Fintech • Information Technology • Payments • Software
The role involves improving application performance and availability through automation and collaboration on disaster recovery strategies, while managing cloud infrastructure and promoting innovative technologies.
Top Skills: AksAnsibleArgo CdArtifactoryAzureBashBigQueryBigtableCassandraChefCi/CdCloud SqlCosmos DbFlux CdGCPGithub ActionsGkeHadoopHelmJenkinsKafkaKubernetesMicrosoft Sql ServerPostgresPowershellTerraformZookeeper
Reposted 11 Days Ago
Hybrid
New York, NY, USA
Senior level
Senior level
Music
The Senior SRE Engineer will take operational responsibility for services, design infrastructure, ensure security compliance, and promote SRE methodologies, while collaborating with cross-functional teams.
Top Skills: AWSGCPNode.jsReactTerraformTypescript
18 Days Ago
5 Locations
133K-188K Annually
Senior level
133K-188K Annually
Senior level
Artificial Intelligence • Cloud • Information Technology • Software • Semiconductor
The role involves designing a scalable core service layer, automating deployments, managing network hardware, and scripting for operations.
Top Skills: Arista EosBashCisco IosDockerGitGoGrafanaJenkinsKubernetesLinuxPrometheusPythonRest Apis
13 Days Ago
8 Locations
150K-189K Annually
Senior level
150K-189K Annually
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services • Data Privacy
The role involves designing and maintaining cloud platforms, ensuring reliability, implementing monitoring and automation solutions, and collaborating with teams to drive continuous improvement in service stability and performance.
Top Skills: AnsibleAWSAzureAzure MonitorCi/CdCloudtrailCloudwatchDatabricksDynatraceGCPGitGrafanaJavaJenkinsPrometheusPythonSplunkTerraform
13 Days Ago
Remote
USA
Senior level
Senior level
eCommerce • Food • Software
The Site Reliability Engineer will enhance system reliability and observability, manage incident response, and mentor teams. Requires strong technical expertise.
Top Skills: CircleCICloudwatchDatadogFirehydrantGrafanaJenkinsNew RelicOctopusOpsgeniePagerdutyRaygunSplunk SignalfxSumo LogicTeamcityVictorops
Reposted 3 Days Ago
Remote
8 Locations
Mid level
Mid level
Cloud • Software
As an SRE & GitOps Engineer, you will automate IT operations, improve infrastructure as code, and provide critical feedback on services used by millions of Ubuntu users.
Top Skills: Ci/CdDebianElasticsearchGrafanaIacLinuxPrometheusPythonUbuntu
Mid level
Artificial Intelligence • Cloud • Software
The Senior Site Reliability Engineer will manage and optimize compute resources, ensuring system reliability, collaborating with teams, and responding to incidents in AI-driven environments.
Top Skills: AnsibleBashGitGrafanaLinuxPrometheusPython
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Design and implement GPU compute clusters, optimize operations for efficiency, troubleshoot and maintain large-scale infrastructure, and enhance researcher productivity.
Top Skills: BashDockerEnrootGpfsKubernetesLustreMySQLPythonPyTorchSlurmTensorFlowTerraform
19 Days Ago
Remote
United States
204K-240K Annually
Senior level
204K-240K Annually
Senior level
Cloud • Information Technology • Security • Software
Lead the design and optimization of AWS networking infrastructure to support HashiCorp's cloud products. Develop automation workflows, ensure network reliability, and provide mentorship to engineers.
Top Skills: AlbsAWSConsulNlbsPrivatelinkRoute 53TerraformTransit GatewayVpc
Reposted 24 Days Ago
Greenwood Village, CO, USA
67K-107K Annually
Junior
67K-107K Annually
Junior
Information Technology • Internet of Things • Mobile • On-Demand • Software
The Associate Site Reliability Engineer ensures the reliability and scalability of PaaS infrastructure, collaborating with teams to implement high availability and manage cloud services and automation to meet stakeholder needs.
Top Skills: AnsibleAWSGitGoogle Cloud PlatformsHelmJenkinsKubernetesLog InsightAzureOpenshiftPerlPowershellPythonRancherShellVMwareVrilVrniVrops
14 Days Ago
Home, PA, USA
146K-181K Annually
Senior level
146K-181K Annually
Senior level
Big Data • Cloud • Marketing Tech • Social Impact • Software
The Senior SRE will support global product deployments, provide engineering support, maintain infrastructure, and enhance CI/CD tooling while ensuring security and compliance in production environments.
Top Skills: AWSCircleCIGCPGoJenkinsKubernetesPythonTerraform

Top Companies Hiring Site Reliability Engineers

See All
Flock Safety Thumbnail
Software • Security • Machine Learning • Hardware
Atlanta, GA
800 Employees
Canonical Thumbnail
Software • Cloud
3 Offices
880 Employees
Spotify Thumbnail
Music
3 Offices
9574 Employees
Broadridge Thumbnail
Fintech • Financial Services
24 Offices
14000 Employees
Symbotic Thumbnail
Robotics • Logistics
Wilmington, MA
1200 Employees
Hireio, Inc. Thumbnail
Healthtech • Fintech • Cloud • Biotech • Artificial Intelligence
San Jose, CA
55 Employees
All Filters

Total selected (1)

Job Category

Skills
Date Posted
Job Category
Experience
Industry
Show more
Show less
Company Name
Company Size

Sign up now Access later

Create Free Account