Top Site Reliability Engineer Jobs
Information Technology • Security • Software
Manage daily operations of a classified NOC, focusing on Kubernetes services, incident response, system monitoring, and ensuring security and availability.
Top Skills:
AWS GovCloud, Azure Government, C2E, C2S, Docker, Elastic Stack, Fluentd, Flux, Grafana, Helm, JIRA, JWCC, Kubernetes, osTicket, Prometheus, Terraform
Software
The Principal Site Reliability Engineer will design and improve systems for reliability in payments software, guiding development cycles and incident response, while ensuring service health and organizational efficiency.
Top Skills:
Cassandra, Go, Java, Kafka, Oracle, Postgres, Python, RabbitMQ, Shell
Software
The Principal Site Reliability Engineer will enhance system reliability, promote SRE practices, lead organizational improvements, and ensure efficient software development and incident response processes.
Top Skills:
Cassandra, Go, Java, Kafka, Oracle, Postgres, Python, RabbitMQ, Shell
Artificial Intelligence • Cloud • Social Impact • Software • Wearables
As a Site Reliability Engineer II, you'll develop automation workflows, manage cloud operations, and enhance service reliability while participating in incident response and code reviews.
Top Skills:
APM, AWS, AWS CloudFormation, Azure, C#, CI/CD, Go, Java, Kubernetes, Observability Tools, Python, Temporal, Terraform
Other • Energy
Lead SRE practices for GCP-based data platforms, automate workflows, design reliable architectures, mentor engineers, and improve operational processes.
Top Skills:
BigQuery, CI/CD, Cloud Logging, Cloud Monitoring, Cloud Storage, Compute Engine, Dataflow, Datastream, GitHub Actions, GitLab CI, GKE, Google Cloud Platform, IAM, Kubernetes, Pub/Sub, Python, Terraform
Aerospace • Energy
The Site Reliability Engineer ensures performance and availability of compute and network infrastructure, automating solutions and addressing potential issues proactively while mentoring junior staff.
Top Skills:
.NET, Ansible, AWS, Azure, Chef, Datadog, FTP, Go, Grafana, Java, Node.js, Puppet, Python, Salt, Sensu, SNMP, Splunk, TCP/IP, Terraform
Fintech • Consulting
The SRE at Equifax ensures reliability and performance of large-scale systems, automating operational tasks and collaborating with dev and ops teams in a hybrid work environment.
Top Skills:
Ansible, Bash, Chef, Docker, GitHub Actions, Go, Java, JavaScript, Jenkins, Kubernetes, Node.js, Python, Terraform
Fintech • Payments
The Senior Staff SRE leads reliability engineering initiatives, drives operational excellence, mentors staff, and influences architecture to enhance system reliability and performance.
Top Skills:
AI/ML, AWS, Azure, Docker, ELK Stack, GCP, Grafana, Kubernetes, MySQL, NoSQL, Postgres, Splunk
Fitness • Healthtech • Retail • Pharmaceutical
The Senior Manager, SRE Release Engineering oversees Release Engineering for the Pharmacy & Consumer Wellness line, ensuring high-quality technology releases through collaboration with IT teams and managing end-to-end change releases.
Top Skills:
AWS, Azure, Docker, GCP, Kubernetes, ServiceNow, SharePoint
Retail • Sports
Lead global D2C Site Reliability and Platform Operations to ensure availability, performance, and scalability of eCommerce and omnichannel systems. Define SRE strategy, SLIs/SLOs, incident management, observability, cloud operations, FinOps, vendor management, and global on-call models while building and developing high-performing teams and operational playbooks.
Top Skills:
Alerting, CI/CD, Cloud Infrastructure, Error Budgets, FinOps, Incident Management, Monitoring, Observability, Site Reliability Engineering (SRE), SLAs, SLIs, SLOs
Artificial Intelligence • eCommerce • Retail
Lead the SRE and DevOps team, ensure infrastructure reliability, oversee cloud operations, drive automation, and collaborate cross-functionally.
Top Skills:
Azure, Bash, CI/CD, Datadog, Docker, ELK Stack, Go, Grafana, Kubernetes, PowerShell, Prometheus, Python, Terraform
Aerospace • Big Data • Greentech • Hardware • Social Impact
Design, deploy, and operate compute services for on-premises and cloud satellite imaging platforms. Build reproducible, scalable, highly available deployments, troubleshoot distributed systems, optimize constrained environments, document and automate operations, and participate in on-call rotations to ensure reliability for customer-facing and air-gapped deployments.
Top Skills:
Alloy, Ansible, Bash, CUDA, GitOps, Grafana, Helm, JIRA, K3s, Kubernetes, Kustomize, OpenTelemetry, Prometheus, Proxmox, Python, RKE2, Talos, Terraform
Artificial Intelligence • Cloud • Information Technology • Mobile • Software • Consulting
The role involves designing and implementing observability solutions using OpenTelemetry, managing platform engineering tasks, and ensuring site reliability through various engineering practices.
Top Skills:
AWS, Azure, CI/CD, CloudFormation, Docker, GCP, Go, Java, Kubernetes, Node.js, OpenTelemetry, Pulumi, Python, Rust, Terraform
Artificial Intelligence • Cloud • Information Technology • Mobile • Software • Consulting
The role involves designing and implementing OpenTelemetry solutions, optimizing telemetry infrastructure, establishing SRE practices, and managing observability across cloud platforms.
Top Skills:
ArgoCD, AWS, Azure, Bash, CloudFormation, Docker, GCP, GitHub Actions, GitLab CI, Go, Java, Jenkins, Node.js, OpenTelemetry, PowerShell, Pulumi, Python, Rust, Terraform
Software
Join the SRE team to improve monitoring, alerting, observability, and reliability of Fireblocks' production systems. Triage incidents, run RCA, create runbooks and automation (Python, Lambda, shell, Ansible, ArgoCD), collaborate with R&D/support, and participate in on-call rotation.
Top Skills:
Ansible, ArgoCD, AWS, AWS Lambda, Azure, Bash, Bitbucket, C++, Chef, Coralogix, Datadog, Docker, Gerrit, Git, GitLab, GCP, Helm, JavaScript, Kubernetes, Linux, MySQL, New Relic, Nginx, Node.js, Phabricator, Prometheus, Puppet, Python, Shell, Splunk
Software
As an AI Support Engineer, you'll manage support requests, resolve user issues, optimize ML models, and contribute to product development.
Top Skills:
TensorRT
Real Estate • Financial Services • PropTech
As a Site Reliability Engineer, you will support AWS Cloud products, optimize processes, enhance automation, and ensure system reliability and performance.
Top Skills:
ArgoCD, AWS, Azure DevOps, Bash, CI/CD, CloudWatch, Docker, EKS, FluxCD, Git, Kubernetes, PowerShell, Python, SQL, Terraform
Fintech • Analytics
As a Senior Site Reliability Engineer, you'll lead incident recovery, enhance production stability, automate processes, and collaborate with development teams to improve operational efficiency.
Top Skills:
AWS, Azure, BigPanda, Cloud-Native Applications, Datadog, DNS, Docker, Git, HTTP, Kubernetes, Shell Scripting, TCP/IP, Unix
Artificial Intelligence • Legal Tech • Professional Services • Software
As a Staff Software Engineer in Site Reliability, you'll manage infrastructure for reliability and scalability, lead incident management, and automate operational tasks.
Top Skills:
AWS, Azure, Bash, CloudFormation, Datadog, GCP, Go, incident.io, PagerDuty, Pulumi, Python, Sentry, Terraform
Artificial Intelligence • Legal Tech • Professional Services • Software
As a Software Engineer in Site Reliability, you will ensure the reliability and performance of our AI platform through automation and strategic infrastructure management.
Top Skills:
AWS, Azure, Bash, CloudFormation, Datadog, GCP, Go, Kubernetes, PagerDuty, Python, Sentry, Terraform
Other • Utilities
The intern will support the maintenance of critical internet systems, focusing on automation, monitoring, and testing for performance and uptime. Responsibilities include collaborating with teams, managing infrastructure, and conducting operational analysis to improve services.
Top Skills:
Configuration Management, DevOps-Centric Automation Tools
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you'll ensure site reliability, improve infrastructure automation, manage incidents, and collaborate with engineering teams to enhance systems.
Top Skills:
Docker, Go, Kafka, Kubernetes, Linux, MongoDB, Postgres, Redis, Ruby, Terraform
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you will ensure the reliability of internal services, improving automation and infrastructure, and collaborating with engineering teams to resolve issues and enhance product performance.
Top Skills:
Docker, Go, Kafka, Kubernetes, MongoDB, Postgres, Redis, Ruby, Terraform
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer at Braze, you'll ensure uptime for internal services, improve automation, and develop infrastructure tools, collaborating across teams to enhance reliability and scalability.
Top Skills:
Chef, Docker, Kafka, Kubernetes, MongoDB, Redis, Ruby on Rails, Terraform
Cloud • Software
In this role, you'll support large-scale applications, improve observability, mentor team members, and ensure reliability by collaborating on deployments and writing automation scripts while providing 24/7 support.
Top Skills:
Ansible, AWS, Bash, Confluence, Docker, ELK Stack, GCP, GitLab CI/CD, Grafana, Jenkins, JIRA, Kubernetes, Linux, MongoDB, MySQL, Nagios, OCI, Perl, Postgres, Prometheus, Puppet, Python, Terraform