Get the job you really want.
Maximum of 25 job preferences reached.
Top Senior Site Reliability Engineer Jobs
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
The Senior Observability Engineer maintains monitoring systems, designs log aggregation solutions, automates tasks with scripts, and ensures platform performance.
Top Skills:
AnsibleBashDynatraceElasticsearchElkFilebeatFluentbitFluentdGrafanaLinuxLogstashOtelPowershellPrometheusPythonTerraform
Reposted YesterdaySaved
Easy Apply
Easy Apply
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves supporting network infrastructure, automating cloud infrastructure, managing CI/CD workflows, and ensuring operational excellence in IT support, including incident response and security practices.
Top Skills:
AnsibleAWSBashDockerGitKubernetesPythonRubyTerraform
AdTech • eCommerce • Food • Marketing Tech • Retail
The Senior Site Reliability Engineer is responsible for ensuring production system reliability, scalability, and performance through automation, monitoring, and infrastructure engineering. The role includes mentoring junior engineers and managing production environments, while collaborating with engineering teams to improve system resilience.
Top Skills:
AksArgocdAWSAzureBashDatadogDockerElkGCPGithub ActionsGoJavaKafkaKubernetesPrometheusPythonRedisSpring BootTerraformTomcat
AdTech • eCommerce • Food • Marketing Tech • Retail
The Senior Site Reliability Engineer is responsible for ensuring production systems' reliability, scalability, and performance through automation, observability, and infrastructure engineering.
Top Skills:
AksArgocdBashDatadogDockerElkGithub ActionsGoJavaKafkaKubernetesPrometheusPythonRedisSpring BootTerraformTomcat
AdTech • eCommerce • Food • Marketing Tech • Retail
Responsible for maintaining and improving the reliability of production systems through automation, monitoring, and incident response in a cloud-native environment, while mentoring junior engineers.
Top Skills:
AksArgocdBashDatadogDockerElkGithub ActionsGoJavaKafkaKubernetesPrometheusPythonRedisTerraformTomcat
Legal Tech • Real Estate • Security • Software • Cybersecurity • PropTech
The Senior Site Reliability Engineer will enhance reliability in production SaaS systems, implement AI agents, improve observability, and mentor junior engineers.
Top Skills:
.NetAksAWSAzureBashC#DatadogEksGCPGoGrafanaKubernetesLinuxOpentelemetryPrometheusPythonTerraform
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
As a Site Reliability Engineer, develop solutions for deployment engineers, ensure scalable system delivery, and improve operational capabilities for military technologies.
Top Skills:
C++Cloud TechnologiesCybersecurityGoNetworkingPythonRust
Artificial Intelligence • Big Data • Information Technology • Security • Software
The Site Reliability Engineer ensures operational excellence in a telecommunication solution on the public cloud, handling automation, incident management, performance planning, and security collaboration.
Top Skills:
AnsibleAWSDatadogDockerGCPGitlabHelmJavaJenkinsKubernetesNoSQLTerraform
Fintech • Consulting
The Site Reliability Engineer at Equifax manages system uptime across cloud and hybrid architectures, builds infrastructure as code, creates CI/CD pipelines, and leads availability postmortems while ensuring service reliability and performance.
Top Skills:
AnsibleAWSBashChefDockerGCPGoJavaJavaScriptJenkinsKubernetesNode.jsPythonTerraform
Fintech • Consulting
The SRE at Equifax ensures reliability and performance of large-scale systems, automating operational tasks and collaborating with dev and ops teams in a hybrid work environment.
Top Skills:
AnsibleBashChefDockerGithub ActionsGoJavaJavaScriptJenkinsKubernetesNode.jsPythonTerraform
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Responsible for developing incident management guidelines, supporting production systems, defining reliability metrics, and driving automation for high service availability.
Top Skills:
GoGrafanaPerlPrometheusPythonRuby
Cloud • Software
Maintain 99.99% uptime for GovCloud services through 24/7 monitoring, incident response (Sev0/Sev1), RCAs, automation of detection/remediation, smart-hands support, and mentoring teammates while ensuring compliance and security.
Top Skills:
AWSBsdC2SGoLinuxMonitoring SystemsPythonRed Hat Enterprise LinuxSolarisTcp/Ip
New
Track Smarter, Apply Better.
Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.
Use For Free
Software
Seeking a Site Reliability Engineer to enhance cloud service reliability, automation and performance, while implementing DevSecOps best practices in a remote setup.
Top Skills:
AnsibleAWSAws CloudwatchAzureDockerElkGCPJavaJenkinsKubernetesPythonShell ScriptingTerraform
Artificial Intelligence • Big Data • Machine Learning • Software
The role involves designing and implementing custom installations of the C3 AI Platform for Federal customers, ensuring uptime, and automating system processes while collaborating with cross-functional teams.
Top Skills:
AnsibleAWSAzureBashKubernetesLinuxPuppetPythonRubyTerraform
Healthtech
The Staff Site Reliability Engineer will lead uptime strategy, incident management, business continuity, disaster recovery, and scalability efforts while mentoring and driving SRE culture within the organization.
Top Skills:
DatadogGCPGkeGoGrafanaKubernetesPrometheusPythonTerraformTypescript
Blockchain • Software
As a Senior Engineer, SRE/DevOps, you will enhance blockchain infrastructure reliability, automate deployment, and collaborate on CI/CD practices while ensuring security and performance optimization.
Top Skills:
AnsibleAWSBashCloudtrailCloudwatchCosmosDockerElk-StackEthereumGCPK8SKubernetesOpsgeniePingdomPythonTerraform
Reposted 11 Hours AgoSaved
Easy Apply
Easy Apply
Aerospace
The role involves ensuring the reliability and security of systems, developing automation solutions, and optimizing cloud infrastructure to support operations.
Top Skills:
AirflowAmazon EksArgocdAWSBashDockerElk StackGitlab CiGrafanaJenkinsKafkaPowershellPrometheusPythonSpark
Aerospace
The Staff Site Reliability Engineer will design SRE procedures, engineer SLOs, build tooling, and create dashboards. The role includes automating tasks and collaborating within the SRE team.
Top Skills:
DatadogGoJaegerKubernetesPrometheusPulumiPythonTerraform
News + Entertainment
The Site Reliability Engineer will manage cloud infrastructure, ensure platform reliability, optimize performance, and implement security best practices for Courier Newsroom's digital properties.
Top Skills:
AnsibleCi/CdDockerKubernetesPythonSaaSTerraformWordpress
Fintech
The Site Reliability Engineer will manage Kubernetes clusters, automate infrastructure, ensure cloud resource reliability, and collaborate across teams to enhance operational efficiency.
Top Skills:
Amazon S3Apache MesosAWSAzureC/C++CephCloud InfrastructureDockerHdfsHelmInfrastructure As CodeJavaJavaScriptKubernetesLinuxNfsPostgresPythonRubyTerraformYarn
Aerospace • Other
The role involves managing Kubernetes and Linux servers, supporting containerized applications, implementing automation solutions, and mentoring peers in a fast-paced environment.
Top Skills:
AnsibleDockerGoGrafanaHelmInfluxdbJSONKubernetesLinuxPrometheusPythonRkeTerraformYaml
Aerospace • Other
The Sr. IT Linux Site Reliability Engineer will manage and optimize Kubernetes clusters, automate systems, and collaborate with teams to ensure system resilience and performance.
Top Skills:
AnsibleDockerGoGrafanaKubernetesLinuxPrometheusPythonTerraform
Reposted 11 Hours AgoSaved
Easy Apply
Easy Apply
AdTech • Marketing Tech
The Senior Software Engineer for Core Services SRE will maintain infrastructure, develop reliable systems, lead technical initiatives, and conduct security reviews.
Top Skills:
AerospikeAWSBoundaryConsulElasticsearchEnvoyGoGrafanaKafkaNginxNomadPackerPrometheusRdsRedisScylladbTerraformVagrantVaultWaypoint
Reposted 11 Hours AgoSaved
Easy Apply
Easy Apply
Financial Services
As a Site Reliability Engineer, you'll ensure high availability of Commodities Technology applications, automate processes, and contribute to incident analysis and monitoring systems.
Top Skills:
AnsibleAWSC#DatadogDockerKubernetesLinuxPowershellPythonTerraformWindows
Security • Cybersecurity
The Staff Site Reliability Engineer will lead reliability strategy, architecture, and incident response while mentoring engineers and improving operational excellence.
Top Skills:
AWSCi/CdGithub ActionsJavaScriptPythonRubyTerraform
Top Companies Hiring Senior Site Reliability Engineers
See AllPopular Job Searches
All Filters
Total selected ()
No Results
No Results

.png)




.png)









_0.png)













