Top Senior Site Reliability Engineer Jobs
Fintech • Machine Learning • Payments • Software • Financial Services
This role involves driving the product management strategy for Cyber SRE by embedding reliability in products, creating automated solutions, and enhancing cybersecurity practices while collaborating with engineering leaders.
Top Skills:
AI, OpenTelemetry
Big Data • Healthtech • HR Tech • Machine Learning • Software • Telehealth • Big Data Analytics
The Staff Site Reliability Engineer will architect, operate, and improve the platform while ensuring security compliance and enhancing development processes.
Top Skills:
AWS, Elasticsearch, Istio, Kubernetes, NATS, Node.js, Postgres, Python, React, Terraform, TypeScript
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves improving software reliability, automating processes, collaborating with teams on system optimization, and mentoring engineers to establish reliability as a core value.
Top Skills:
AWS, Azure, Datadog, Docker, EC2, GCP, Go, Kibana, Kubernetes, Ruby, Terraform
Enterprise Web • Hardware • Internet of Things • Software
The Senior Site Reliability Engineer will mentor teams on observability practices, architect systems for growth, automate developer tasks, and debug production issues.
Top Skills:
Go, Kubernetes, LGTM Stack, OpenTelemetry, Prometheus, TypeScript
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Lead a team responsible for the reliability of hybrid cloud systems, defining SLOs/SLIs, managing on-prem utilities, and ensuring environment integrity for autonomous vehicle operations.
Top Skills:
Ansible, Chef, Configuration Management Tools, DHCP, Hybrid Cloud, Linux, NTP, PXE, Site Reliability Engineering, SLO Frameworks
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Staff Engineer will define reliability architecture, automate foundational utilities, develop observability tools, ensure environment integrity, and mentor colleagues.
Top Skills:
Ansible, Chef, DHCP, Kubernetes, Linux, NTP, PXE
Big Data • Cloud • Software • Database
This role involves building and maintaining observability services, ensuring service reliability, and collaborating with other teams on best practices.
Top Skills:
AWS, Fluent Bit, GCP, Jaeger, Kubernetes, Azure, Quickwit, Splunk, Vector, VictoriaMetrics
Information Technology • Web3
The Site Reliability Engineer manages AWS Kubernetes infrastructure, ensuring operational excellence, security, and scalability, while implementing reliability improvements and best practices.
Top Skills:
ArgoCD, AWS, Bash, Datadog, EKS, Go, Kafka, Kubernetes, Postgres, Python, Sysdig, Terraform
Fintech • Information Technology • Financial Services
As a Site Reliability Engineer, you'll ensure operational reliability of trading platforms, troubleshoot issues, and drive SRE best practices within a collaborative environment.
Top Skills:
AWS, GenAI Tools, Java, Jenkins, Python, Shell Scripting, SQL
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
The Site Reliability Engineer II will focus on improving reliability and engineering practices by automating processes, building observability solutions, and mentoring teams to enhance operational excellence.
Top Skills:
AWS, AWS CDK, CloudWatch, Git, GitHub Actions, Java, New Relic, Python, Terraform, TypeScript
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
The Site Reliability Engineer II will enhance observability, drive engineering maturity, and automate processes across teams while mentoring engineers.
Top Skills:
AWS, AWS CDK, CloudWatch, Git, GitHub Actions, Java, New Relic, Python, Terraform, TypeScript
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
The Site Reliability Engineer II role focuses on driving reliability and observability across teams through automation, AI solutions, and engineering maturity. Responsibilities include establishing SLIs/SLOs, architecting observability strategies, leading workshops, and guiding engineering practices.
Top Skills:
AWS, AWS CDK, CloudWatch, Git, GitHub Actions, Java, New Relic, Python, Terraform, TypeScript
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
The Site Reliability Engineer II focuses on enhancing reliability and observability, driving process improvements, mentoring engineers, and leveraging AI for automation within the SRE team at Cox Automotive.
Top Skills:
AWS, AWS CDK, CloudWatch, Git, GitHub Actions, Java, New Relic, Python, Terraform, TypeScript
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
The Site Reliability Engineer II will enhance reliability and observability across teams, automate processes, mentor engineers, and implement AI solutions.
Top Skills:
Agentic Automation, AI, AWS, AWS CDK, CloudWatch, Git, GitHub Actions, Java, New Relic, Python, Terraform, TypeScript
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
The Site Reliability Engineer II enhances reliability and observability in engineering practices, drives improvements using metrics, leads workshops, and promotes automation practices across teams.
Top Skills:
AI, AWS, AWS CDK, CloudWatch, Git, GitHub Actions, Java, LLMs, New Relic, Python, Terraform, TypeScript
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
The Site Reliability Engineer II will drive reliability and observability across teams, design automation processes, create engineering maturity assessments, and utilize AI solutions to enhance operational workflows.
Top Skills:
AWS, AWS CDK, CloudWatch, Git, GitHub Actions, Java, New Relic, Python, Terraform, TypeScript
Information Technology • Software • Financial Services • Quantitative Trading
The Site Reliability Engineer will provide support and diagnose issues within a real-time, distributed environment, focusing on large-scale application and infrastructure management. Required skills include UNIX/Linux, networking, SQL, and scripting languages.
Top Skills:
Bash, Python, SQL, TCP/IP, UDP, Unix/Linux
Artificial Intelligence • Hardware • Information Technology • Security • Software • Cybersecurity • Big Data Analytics
Lead the SRE team in managing NG911 call routing systems, ensuring 99.999% availability through technical leadership, incident management, and process automation.
Top Skills:
.NET Core, Angular, Ansible, AWS, Azure, C#, GitHub Actions, Java, Jenkins, Kafka, MS SQL Server, Postgres, RabbitMQ, Redis, Terraform
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
As a Site Reliability Engineer, you will design Kubernetes infrastructure, build CI/CD pipelines, manage cloud resources, and enhance operational workflows with automation and observability tools, ensuring system reliability and performance.
Top Skills:
AI/ML Tooling, GitHub Actions, Go, Google Cloud Platform, Grafana, Helm, Java, Kafka, Kubernetes, Prometheus, Python, Terraform
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
The Lead Site Reliability Engineer will ensure reliability, scalability, and performance of Mastercard's applications, enhancing operational practices and developer collaboration in a proactive environment.
Top Skills:
CI/CD, DevOps, Go, Java, Python, Spring Framework
eCommerce • Legal Tech • Professional Services • Software • Data Privacy
The Site Reliability Engineer will ensure systems run smoothly, work with automation tools, resolve issues, and drive operational improvements.
Top Skills:
AWS, Azure, CloudFormation, Docker, GCP, Grafana, Kubernetes, Memcached, New Relic, OpenTelemetry, Postgres, Prometheus, Pulumi, Redis, Sentry, Terraform
Big Data • Cloud • Productivity • Software • Database • Analytics • Automation
The Site Reliability Engineer will automate tasks, enhance platform infrastructure, improve observability, and lead incident response efforts for optimal performance.
Top Skills:
AWS, Grafana, Honeycomb, Linux, Python, Terraform
Fintech • Machine Learning • Payments • Software • Financial Services
Lead a team of developers to create solutions for regulatory needs, utilizing various technologies and collaborating with product managers.
Top Skills:
Ansible, Apache Mesos, Spark, AWS, Docker, Go, Java, Jenkins, Kubernetes, Marathon, Python, Ruby, SQL, Terraform
AdTech • Digital Media • Internet of Things • Marketing Tech • Mobile • Retail • Software
The role involves monitoring and maintaining IP networks, troubleshooting incidents, automating tasks, and collaborating with cross-functional teams to ensure network reliability.
Top Skills:
AWS, Azure, Bash, BGP, DHCP, DNS, EIGRP, ELK, GCP, Grafana, IS-IS, MPLS, OSPF, Prometheus, Python, SNMP, Splunk, TCP/IP, VLANs
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Oversee operational support of SAP BTP CPI applications, manage incidents, lead support specialists, and collaborate on architecture and governance for finance processes.
Top Skills:
ABAP Proxies, AEM, CAPM, Cloud Connector, Cloud Foundry, Edge Integration Cell, IDoc, JSON, Message Queues, OAuth, OData, REST, SAML, SAP BTP, SFAPI, SFTP, SOAP, XML