Top Site Reliability Engineer Jobs

An Hour AgoSaved
Easy Apply
Hybrid
Palo Alto, CA, USA
Easy Apply
86K-192K Annually
Mid level
86K-192K Annually
Mid level
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
The Site Reliability Engineer will design and develop automated solutions and infrastructure to enhance service reliability and efficiency, collaborating closely with various teams to meet customer needs.
Top Skills: AIAWSCloudFormationDatadogGoJavaJenkinsKibanaMavenMlNewrelicNode.jsPythonSignalfxTerraform
Reposted An Hour AgoSaved
In-Office
3 Locations
Senior level
Senior level
Healthtech • Payments • Software
The Senior Site Reliability Engineer II manages infrastructure for Waystar products, enhancing system reliability, observability, and performance while collaborating with engineering teams and mentoring juniors.
Top Skills: Apache AirflowAWSAzureCloudFormationGCPGrafanaKafkaKubernetesPowershellPrometheusPythonSparkSplunkTerraform
An Hour AgoSaved
In-Office or Remote
Eden Prairie, MN, USA
135K-231K Annually
Senior level
135K-231K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
This role involves leading site reliability engineering initiatives, ensuring operational excellence and security for digital platforms within the organization, and collaborating across teams to improve system performance.
Top Skills: Automation ToolsAWSAzureGCPIds/IpsMonitoring SystemsSecurity FrameworksSIEM
An Hour AgoSaved
In-Office
Minnetonka, MN, USA
135K-231K Annually
Expert/Leader
135K-231K Annually
Expert/Leader
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Oversee the SRE, SecOps, and TechOps teams, develop strategies for system reliability and security, and manage high-performing teams.
Top Skills: AiopsAutomation ToolsIdsIncident ResponseIpsMlopsSIEM
6 Hours AgoSaved
Hybrid
Lakewood, CO, USA
90K-122K Annually
Junior
90K-122K Annually
Junior
Machine Learning • Payments • Security • Software • Financial Services
As a Site Reliability Engineer, you will design resilient systems, implement monitoring tools, troubleshoot incidents, and collaborate across teams to improve application performance and reliability.
Top Skills: Application DevelopmentSoftware SolutionsUser Experience Design
7 Hours AgoSaved
Hybrid
2 Locations
119K-187K Annually
Senior level
119K-187K Annually
Senior level
Fintech • Financial Services
Lead the Site Reliability Engineering team for critical cybersecurity platforms; ensure operational excellence, reliability, and compliance while managing incidents and driving automation.
Top Skills: ArmAWSAzureBashCloudFormationElasticGCPGoGrafanaOpentelemetryPowershellPrometheusPythonSplunkTerraform
7 Hours AgoSaved
Hybrid
3 Locations
Senior level
Senior level
Fintech • Financial Services
The role involves overseeing systems operations, ensuring reliability and performance of data pipelines, managing a team, and driving SRE principles and automation initiatives.
Top Skills: Big Data TechnologiesDevops Ci/CdKubernetesOcpScalaSpark
Junior
Machine Learning • Payments • Security • Software • Financial Services
The Site Reliability Engineer designs resilient systems, develops monitoring for critical applications, collaborates on performance tuning, and supports large-scale distributed applications while promoting engineering best practices.
Top Skills: Application DevelopmentIt StandardsProcedures & PoliciesSoftware Process ImprovementSoftware SolutionsSystem TestingTechnical Troubleshooting
14 Hours AgoSaved
Easy Apply
Remote or Hybrid
Crystal City, VA, USA
Easy Apply
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
Responsible for managing operations within classified environments, overseeing cloud infrastructure, automating tasks, and ensuring system stability in a high-security setting.
Top Skills: AnsibleAws EcsKubernetesLinuxPythonTerraform
Reposted 21 Hours AgoSaved
In-Office
Eden Prairie, MN, USA
135K-231K Annually
Expert/Leader
135K-231K Annually
Expert/Leader
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
The Principal Site Reliability Engineer defines the reliability strategy for digital platforms, ensuring operational excellence, collaborating with teams, and improving system stability and performance.
Top Skills: Automation ToolsIds/IpsMonitoring SystemsSIEMSite Reliability EngineeringSoftware EngineeringTechnology Operations
Reposted 21 Hours AgoSaved
Remote or Hybrid
New York, NY, USA
130K-170K Annually
Senior level
130K-170K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Oversee operational support of SAP BTP CPI applications, manage incidents, lead support specialists, and collaborate on architecture and governance for finance processes.
Top Skills: Abap ProxiesAemCapmCloud ConnectorCloud FoundryEdge Integration CellIdocJSONMessage QueuesOauthOdataRestSAMLSap BtpSfapiSftpSoapXML
Reposted 21 Hours AgoSaved
Hybrid
Menlo Park, CA, USA
169K-224K Annually
Senior level
169K-224K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Machine Learning • Software • Biotech
Lead the design and operation of a fault-tolerant cloud infrastructure, implement infrastructure-as-code, manage Kubernetes reliability, and mentor engineers.
Top Skills: AnsibleAWSAzureBashCloudFormationDatadogGCPGithub ActionsGitlab CiGoGrafanaJenkinsKubernetesOpentelemetryPowershellPrometheusPythonTerraform
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
YesterdaySaved
Easy Apply
Hybrid
New York, NY, USA
Easy Apply
187K-240K Annually
Senior level
187K-240K Annually
Senior level
Artificial Intelligence • Cloud • Security • Software • Cybersecurity
The role involves developing AI-assisted product experiences for Datadog by building systems for chat, remediations, and codefixes, alongside collaboration with cross-functional teams to enhance user outcomes.
Top Skills: Ai Coding ToolsGoKubernetesLlm-Based Systems
YesterdaySaved
Hybrid
O'Fallon, MO, USA
122K-207K Annually
Senior level
122K-207K Annually
Senior level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
As a Lead Site Reliability Engineer, you will drive SRE practices, influence system architecture, and ensure operational excellence for critical platforms, emphasizing automation and risk mitigation.
Top Skills: AnsibleBashChefDynatraceGitGoJenkinsPythonSplunk
YesterdaySaved
Hybrid
Irving, TX, USA
148K-222K Annually
Senior level
148K-222K Annually
Senior level
Artificial Intelligence • Cloud • Internet of Things • Software • Cybersecurity • Industrial
The Engineering Manager leads the IAM Platform team, focusing on operational leadership, SRE implementation, automation, AI integration, and team development to ensure stability and innovation in identity management.
Top Skills: AIAutomationGoIamPythonSre
Reposted YesterdaySaved
Remote
USA
150K-220K Annually
Senior level
150K-220K Annually
Senior level
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
The engineer will build and operate AI/ML infrastructure, managing services on AWS and bare metal, using tools like Kubernetes and Terraform.
Top Skills: AWSBashGoKubernetesPythonSlurmTerraform
2 Days AgoSaved
Remote or Hybrid
2 Locations
160K-235K Annually
Senior level
160K-235K Annually
Senior level
Artificial Intelligence • Healthtech • Logistics • Social Impact • Software • Telehealth
The Senior Site Reliability Engineer will enhance the reliability and security of infrastructure for in-home healthcare services, using cloud technology and automation to improve systems and processes.
Top Skills: AWSBashGCPPythonTerraformTypescript
Reposted 2 Days AgoSaved
Hybrid
New York, NY, USA
175K-230K Annually
Senior level
175K-230K Annually
Senior level
Hardware • Healthtech • Software • Analytics
The Site Reliability Engineer will ensure high availability of Sage's platform, lead incident response, design reliable systems, and improve operational workflows.
Top Skills: Amazon Web ServicesDatadogGoGoogle Cloud PlatformGrafanaJavaKubernetesMySQLPostgresPrometheusPulumiPythonTerraform
Reposted 3 Days AgoSaved
Hybrid
Chicago, IL, USA
103K-159K Annually
Mid level
103K-159K Annually
Mid level
Artificial Intelligence • Cloud • Information Technology • Legal Tech • Productivity • Software
As a Site Reliability Engineer, you'll develop resilient cloud platforms, automate processes, engage in cross-team collaboration, and oversee incident management, focusing on scaling and security.
Top Skills: AksAzureBashChefCi/CdDockerElkGoGrafanaJavaKubernetesPowershellPrometheusPythonRubyTerraform
Reposted 3 Days AgoSaved
Easy Apply
Remote or Hybrid
6 Locations
Easy Apply
126K-248K Annually
Senior level
126K-248K Annually
Senior level
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will develop and support distributed storage services, ensuring reliability and operational safety, with a focus on automation and efficiency.
Top Skills: AWSAzureDnsGoGoogle Cloud PlatformKubernetesLinuxPythonTcp/IpTls
Reposted 3 Days AgoSaved
Easy Apply
Remote or Hybrid
United States
Easy Apply
127K-249K Annually
Expert/Leader
127K-249K Annually
Expert/Leader
Big Data • Cloud • Software • Database
Seeking a Site Reliability Engineer with expertise in networking and distributed systems for building secure multi-cloud infrastructure. Responsibilities include maintaining network architecture and ensuring reliable service-to-service communication, involving a 24/7 on-call rotation.
Top Skills: AWSAzureBgpDnsGCPIpv6KubernetesLoad BalancingMtlsService MeshTcp/IpTlsVpcsVpns
Reposted 4 Days AgoSaved
Hybrid
Chicago, IL, USA
100K-157K Annually
Mid level
100K-157K Annually
Mid level
AdTech • Digital Media • Marketing Tech
The role involves ensuring the reliability and performance of FreeWheel systems, managing infrastructure, automating operations, responding to incidents, and collaborating with teams to optimize system capabilities.
Top Skills: AnsibleAWSAzureDockerElk StackGCPGoGrafanaJavaKubernetesOciPrometheusPythonScalaTerraform
Reposted 4 Days AgoSaved
Remote or Hybrid
United States
160K-180K Annually
Senior level
160K-180K Annually
Senior level
Artificial Intelligence • Other • Security • Software • Analytics • Big Data Analytics
The Lead Site Reliability Engineer will oversee the Infrastructure SRE team, focusing on system reliability, automation, and mentoring while collaborating with product engineering.
Top Skills: Ci/CdDatadogDockerElk StackGitopsGoKubernetesLinux/UnixNew RelicNoSQLPrometheusPythonSQLStackdriverTerraform
Reposted 4 Days AgoSaved
Hybrid
Chicago, IL, USA
103K-159K Annually
Mid level
103K-159K Annually
Mid level
Artificial Intelligence • Cloud • Information Technology • Legal Tech • Productivity • Software
As a Site Reliability Engineer, you will optimize data services, improve system resilience, automate processes, and collaborate with cross-functional teams to enhance reliability and performance.
Top Skills: AzureBashCi/CdDockerElasticsearchGoGrafanaHashicorp TerraformJavaKubernetesLinuxMariadbMaxscalePowershellPrometheusPythonRuby
Reposted 4 Days AgoSaved
In-Office or Remote
New York, NY, USA
150K-250K Annually
Mid level
150K-250K Annually
Mid level
Mobile • Software
Site Reliability Engineers will work on production infrastructure, focusing on AWS and Kubernetes while ensuring high availability and customer satisfaction.
Top Skills: AirflowAWSCircleCICloudwatchEksGrafanaMongoDBPagerdutyPingdomRustScala SparkTerraformTypescript
All Filters
JobType
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account