Get the job you really want.
Maximum of 25 job preferences reached.
Top Senior Site Reliability Engineer Jobs
Cloud • Fintech • Other • Software
The Senior Site Reliability Engineer will support development teams, enhancing service reliability and automation, while ensuring compliance and security commitments.
Top Skills:
AWSBashDatadogEcsGoPythonTerraform
Reposted 14 Days AgoSaved
Financial Services
Own reliability and scalability of on-prem observability platforms (ELK, Grafana); handle production escalations, capacity planning, SLOs, onboarding, automation, IaC (Terraform/Helm/Ansible), upgrades, security hardening, and platform modernization.
Top Skills:
AnsibleApm InstrumentationBashBeatsChefElasticsearchElk StackFluent BitFluentdGrafanaHelmKibanaLinuxLogstashNew RelicOpentelemetryPrometheusPuppetPythonShell Scripting/Linux ShellSolarwindsTerraform
Healthtech • Payments • Software
Lead and mentor a team of client integration engineers, manage production and integration efforts for large-scale distributed systems, gather client requirements, drive SDLC activities, provide datacenter and customer support, and continuously improve system reliability and operations.
Top Skills:
JavaObject-Oriented Programming
Information Technology • Software • Cryptocurrency • Web3
The Senior Site Reliability Engineer will design, build, and manage Azure infrastructure for HashSphere, ensuring secure and scalable deployments while enhancing system reliability and operational excellence in partnership with cross-functional teams.
Top Skills:
ArgoAzureGoGrafanaKubernetesPrometheusPythonSpaceliftTerraform
Cloud • Information Technology
As a Sr. Site Reliability Engineer, you'll ensure service reliability, build automation, and collaborate on infrastructure improvements while mentoring others.
Top Skills:
AnsibleCatchpointDockerElkGoGrafanaHashicorp VaultJenkinsKubernetesLinuxPrometheusPythonTerraform
Security • Software
The Senior Site Reliability Engineer will manage AWS infrastructure, automate deployment, ensure architecture meets requirements, and develop tools for reliability.
Top Skills:
AnsibleAWSC#C++CloudFormationCloudwatchDatadogDockerEc2EksElkGrafanaJavaPythonS3TerraformVpc
Software
Lead the modernization of AWS cloud infrastructure, implement automation, ensure system reliability, and manage performance with a focus on security and incident response.
Top Skills:
AngularApexAWSC#ElasticacheNew RelicNode.jsNpmPm2PythonRedisShell ScriptingTerraform
Artificial Intelligence • Healthtech
The Senior Site Reliability Engineer will design and implement infrastructure automation, CI/CD pipelines, and monitoring for a healthcare AI platform while ensuring reliability and security.
Top Skills:
AnsibleAWSAws KmsAzureAzure Key VaultDatadogDockerElkGCPGrafanaHashicorp VaultJenkinsKubernetesTerraform
Blockchain • Software • Cryptocurrency • Web3
As a Senior Site Reliability Engineer, you will oversee AWS/GCP infrastructure, ensure system reliability, deploy applications, and enhance automation in a collaborative team environment.
Top Skills:
AnsibleAWSCi/CdElkGCPJenkinsKubernetesLinuxPrometheusPuppetTerraform
Blockchain • Software • Cryptocurrency • Web3
Responsible for ensuring reliability and scalability of systems, maintaining AWS/GCP infrastructure, deploying applications, and improving operational processes.
Top Skills:
AnsibleAWSDnsElkGCPHTTPHttpsJenkinsKubernetesLinuxPrometheusPuppetTcpTerraformUdp
Artificial Intelligence • Software
As a Senior SRE, you'll enhance data infrastructure, optimize performance, build reliability, automate processes, and manage incident responses while supporting enterprise clients' uptime requirements.
Top Skills:
ClickhouseGoPostgresPythonTypescript
Artificial Intelligence • Software • Generative AI
Lead reliability and performance for Plaud.ai's AI products. Design and operate scalable cloud-native systems, run on-call/incident response, build observability and reliability automation, define SLOs/SLIs, and drive postmortems and platform reliability improvements.
Top Skills:
AWSAzureDistributed SystemsGCPGoJavaKubernetesLogsObservability (MetricsPythonTracing)
New
Track Smarter, Apply Better.
Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.
Use For Free
Artificial Intelligence • Marketing Tech • Software • Big Data Analytics
The Senior Site Reliability Engineer will design and maintain scalable infrastructure, improve system reliability, manage CI/CD pipelines, and collaborate across teams for operational excellence.
Top Skills:
AnsibleArgocdAWSBashDatadogDockerElkGithub ActionsGrafanaKubernetesLinuxOpentelemetryPrometheusPythonTerraform
Artificial Intelligence • Healthtech • Software
Design, build, and maintain secure and scalable infrastructure for critical healthcare applications, lead incident responses, and support engineering teams.
Top Skills:
BashGCPGoGrafanaHelmKubernetesPrometheusPythonTerraform
Automotive
Design and implement scalable cloud infrastructure, monitor performance, automate processes, ensure security and compliance, and lead a DevOps team.
Top Skills:
AWSBashCi/CdDockerElk StackGCPGrafanaKubernetesPrometheusPythonTerraform
Fintech
The Senior Site Reliability Engineer will enhance system reliability, design SRE platforms, automate workflows, and ensure scalability in a Kubernetes-based environment.
Top Skills:
AirflowAnsibleArgocdAWSBashDockerGoGrafanaKafkaKubernetesMqOpentelemetryPythonRundeckSqsTerraform
Fintech • Payments
Senior Site Reliability Engineers at PayPal ensure the reliability and performance of mobile and backend systems, implementing standards, automation, and observability while managing incidents and mentoring junior staff.
Top Skills:
AWSAzureDatadogFirebase CrashlyticsGCPGoPythonSentry
Information Technology • Security • Software
The Senior Site Reliability Engineer will manage and ensure uptime, performance, and reliability of cloud services while optimizing resource allocation and leading incident responses.
Top Skills:
AnsibleAWSAzureDatadogJenkinsNetappOctopus DeployPowershellPrometheusSplunkTerraformVMware
Software
The Senior SRE will ensure reliability of production systems, design monitoring processes, and build automation tools while collaborating in a regulated environment. The role blends operational tasks and coding responsibilities.
Top Skills:
AWSCdkCloudfrontEcsGithub ActionsHoneycombLambdaNixOpentelemetryPythonRdsSQLTerraformTypescript
Software
The Senior Site Reliability Engineer will design and maintain scalable systems, enhance automation, manage CI/CD pipelines, and mentor junior engineers, utilizing multiple programming languages and IaC tools.
Top Skills:
AWSAzureC#CoralogixDatadogDockerGoJavaJavaScriptKubernetesPrometheusPythonTerraform
Marketing Tech • Cryptocurrency
The Senior Site Reliability Engineer will support high-performance trading infrastructure, enhance security and reliability, and develop automation tools while collaborating with teams.
Top Skills:
AWSAzureBashDockerEbpfGoGCPJavaScriptKubernetesLinuxOpentelemetryPrometheusPythonSIEMTypescript
Other
The Sr. Site Reliability Engineer will maintain and administer enterprise systems, troubleshoot operational issues, and develop scripts. This role requires collaboration across teams and participation in project planning and execution.
Top Skills:
AnsibleApacheAzureC#ChefIisJavaJbossPerlPowershellPuppetPythonRubyTomcat
Aerospace • Hardware • Logistics • Robotics • Software • Transportation
The Senior Site Reliability Engineer will lead cloud infrastructure initiatives, develop best practices, write software, and manage systems while working closely with developers. They will also participate in an on-call rotation and set high technical standards for interviews.
Top Skills:
AWSKafkaKubernetes
Aerospace • Hardware • Defense
Lead design, build, and operation of scalable, reliable cloud infrastructure; mentor engineers; make architecture and technology decisions; introduce new tools; lead cross-team initiatives; participate in on-call rotations and incident response.
Top Skills:
AlertingAWSEc2GitopsInfrastructure-As-Code (Iac)KubernetesLambdaMonitoringOn-CallS3Service MeshService RegistrationTerraformVpc
Other • Social Impact
The Senior Site Reliability Engineer is responsible for maintaining Wikimedia's infrastructure, improving reliability, automating tasks, and mentoring peers while participating in incident management.
Top Skills:
Apache Traffic ServerBashDebianEnvoyGoGrafanaHaproxyKubernetesNginxPrometheusPuppetPythonRubyVarnish
Top Companies Hiring Senior Site Reliability Engineers
See AllPopular Job Searches
All Filters
Total selected ()
No Results
No Results
.png)
































