Get the job you really want.
Maximum of 25 job preferences reached.
Top Site Reliability Engineer Jobs
Edtech
The Senior Site Reliability Engineer will ensure product reliability and performance, develop monitoring and alerting systems, and propose architectural changes.
Top Skills:
AWSBashCC++DockerGCPJavaKubernetesPerlPython
Artificial Intelligence • Cloud • Information Technology • Security • Software
Responsibilities include defining, implementing, and growing the SRE practice, ensuring reliability and performance of production environments, and collaborating with cross-functional teams.
Top Skills:
Aws GovcloudBashDockerElasticGrafanaKubernetesPrometheusPythonSplunkTypescript
Software
As a Senior Site Reliability Engineer at Regrello, you'll shape the developer platform, collaborate with customers, and ensure the reliability and security of infrastructure and applications.
Top Skills:
AWSAzureCircleCIGCPGithub ActionsGitlab CiGoKubernetesTerraform
Database
The Senior Site Reliability Engineer at Niche will manage cloud infrastructure, oversee incident responses, mentor team members, and promote best practices to ensure reliability across distributed systems and applications.
Top Skills:
AWSBashDockerGCPGitGoGrafanaKafkaKubernetesPrometheusPythonSQLSumo LogicTerraform
Fintech • Information Technology • Payments
Seeking a Site Reliability Engineer to manage application reliability, incident resolution, and project delivery while collaborating with teams and automating tasks.
Top Skills:
.NetAnsibleApacheAWSAzureBash ScriptingC#DockerJavaKubernetesLinuxMongoDBMssqlPowershellPythonSplunkTerraformTomcatUnix
3D Printing • Aerospace • Hardware • Robotics • Software
Lead the reliability and scalability of BRINC's production systems, building secure cloud infrastructure and improving incident response. Collaborate with teams for optimal system performance.
Top Skills:
AWSInfrastructure As CodeJavaScriptNode.jsPython
Artificial Intelligence
As a Senior Site Reliability Engineer, you will design automation for infrastructure, lead initiatives, ensure reliability and security, and collaborate across teams to enhance system scaling and efficiency.
Top Skills:
AnsibleAWSGCPGoJavaPythonRubySpinnakerTerraform
Other • Software • Analytics
The Sr. Site Reliability Engineer will manage SaaS capabilities, implement monitoring systems, automate operational tasks, and provide on-call support.
Top Skills:
AWSBashDockerEksElkGitJavaKubernetesPrometheusPythonTerraform
Other • Software • Analytics
The role involves deploying and managing SaaS solutions, automating infrastructure processes, troubleshooting system issues, and collaborating with a team of engineers.
Top Skills:
Arcgis VelocityArcgis Workflow ManagerAWSAws LambdaBashDockerEksElkGitKafkaKubernetesOpensearchPrometheusPythonTerraform
Other • Software • Analytics
As a Sr. Site Reliability Engineer, you will manage cloud-based SaaS products, automate infrastructure, troubleshoot issues, and provide technical support while collaborating with a team of engineers.
Top Skills:
AWSAws LambdaBashDockerEcsEksElkGitJavaKafkaKubernetesOpensearchPrometheusPythonSecurity GroupsTerraformVpc
Other • Software • Analytics
The role involves deploying and managing SaaS capabilities on AWS, including monitoring systems, automation solutions, and troubleshooting incidents. Collaboration with SRE engineers is key to operational success across multiple regions.
Top Skills:
AWSAws LambdaBashDockerEcsElkGitGitKafkaKubernetesOpensearchPrometheusPythonTerraform
Cloud
The Senior Site Reliability Engineer will enhance the Splunk ecosystem and develop an Observability Platform by automating infrastructure and managing complex distributed systems, while optimizing log collection and incident response.
Top Skills:
AWSGCPGoKubernetesLinuxOpentelemetryPythonRubySplunkTerraform
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Artificial Intelligence • Information Technology • Software
Design, build, and operate large-scale Kubernetes platforms while developing Kubernetes Operators, enhancing reliability and scalability, and troubleshooting production issues.
Top Skills:
Ci/CdCrossplaneGoHelmKubernetesTerraform
Cloud • Security
As a Senior Site Reliability Engineer, you'll ensure the reliability of production SaaS applications in Azure, automate operational processes, maintain compliance in a FedRAMP environment, and address incident responses.
Top Skills:
AzureAzure Application InsightsAzure DevopsDatadogKubernetesLog AnalyticsPowershellPythonTerraform
Software
As a Site Reliability Engineer at Podium, you'll ensure product stability and scalability, collaborate with engineering teams, handle on-call production issues, and mentor junior engineers.
Top Skills:
AnsibleAWSCi/CdDatadogDockerGitGitlabGoHelmHoneycombKubernetesPrometheusPythonRubyStrongdmTerraform
Cloud • Fintech • Other • Software
The Senior Site Reliability Engineer will support development teams, enhancing service reliability and automation, while ensuring compliance and security commitments.
Top Skills:
AWSBashDatadogEcsGoPythonTerraform
Information Technology • Real Estate • Analytics
The Senior Site Reliability Engineer will design observability frameworks, own reliability for large-scale systems, and mentor others. Responsibilities include collaborating with teams, optimizing performance, and ensuring the infrastructure's resilience and scalability.
Top Skills:
AWSAzureBashC#CloudFormationDockerGCPJavaKubernetesNode.jsPythonTerraform
Financial Services
Own reliability and scalability of on-prem observability platforms (ELK, Grafana); handle production escalations, capacity planning, SLOs, onboarding, automation, IaC (Terraform/Helm/Ansible), upgrades, security hardening, and platform modernization.
Top Skills:
Elasticsearch,Logstash,Kibana,Elk Stack,Grafana,New Relic,Solarwinds,Prometheus,Terraform,Helm,Ansible,Chef,Puppet,Python,Shell Scripting/Linux Shell,Bash,Linux,Beats,Fluentd,Fluent Bit,Opentelemetry,Apm Instrumentation
Healthtech • Payments • Software
Lead and mentor a team of client integration engineers, manage production and integration efforts for large-scale distributed systems, gather client requirements, drive SDLC activities, provide datacenter and customer support, and continuously improve system reliability and operations.
Top Skills:
JavaObject-Oriented Programming
Artificial Intelligence • Software
As a Senior SRE, you'll enhance data infrastructure, optimize performance, build reliability, automate processes, and manage incident responses while supporting enterprise clients' uptime requirements.
Top Skills:
ClickhouseGoPostgresPythonTypescript
Blockchain • Software • Cryptocurrency • Web3
As a Senior Site Reliability Engineer, you will oversee AWS/GCP infrastructure, ensure system reliability, deploy applications, and enhance automation in a collaborative team environment.
Top Skills:
AnsibleAWSCi/CdElkGCPJenkinsKubernetesLinuxPrometheusPuppetTerraform
Blockchain • Software • Cryptocurrency • Web3
Responsible for ensuring reliability and scalability of systems, maintaining AWS/GCP infrastructure, deploying applications, and improving operational processes.
Top Skills:
AnsibleAWSDnsElkGCPHTTPHttpsJenkinsKubernetesLinuxPrometheusPuppetTcpTerraformUdp
Artificial Intelligence • Healthtech • Software
Design, build, and maintain secure and scalable infrastructure for critical healthcare applications, lead incident responses, and support engineering teams.
Top Skills:
BashGCPGoGrafanaHelmKubernetesPrometheusPythonTerraform
Automotive
Design and implement scalable cloud infrastructure, monitor performance, automate processes, ensure security and compliance, and lead a DevOps team.
Top Skills:
AWSBashCi/CdDockerElk StackGCPGrafanaKubernetesPrometheusPythonTerraform
Artificial Intelligence • Software • Generative AI
Lead reliability and performance for Plaud.ai's AI products. Design and operate scalable cloud-native systems, run on-call/incident response, build observability and reliability automation, define SLOs/SLIs, and drive postmortems and platform reliability improvements.
Top Skills:
Aws,Gcp,Azure,Kubernetes,Distributed Systems,Go,Python,Java,Observability (MetricsLogsTracing)
Top Companies Hiring Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs
.NET Developer Jobs
Aerospace Thermal Engineering Jobs
AI Engineer Jobs
Android Developer Jobs
Automation Engineer Jobs
Backend Developer Jobs
Blockchain Developer Jobs
C# Jobs
C++ Jobs
Cloud Architect Jobs
Cloud Engineer Jobs
Design Engineer Jobs
DevOps Engineer Jobs
Director Of Engineering Jobs
Electrical Engineering Jobs
Embedded Software Engineer Jobs
Engineering Jobs
Engineering Manager Jobs
Environmental Engineering Jobs
Field Engineer Jobs
Front End Developer Jobs
Full Stack Developer Jobs
Game Developer Jobs
Golang Jobs
Hardware Engineer Jobs
Industrial Engineering Jobs
iOS Developer Jobs
Java Developer Jobs
Javascript Developer Jobs
Linux Jobs
Manufacturing Engineer Jobs
Mechanical Engineering Jobs
Network Engineer Jobs
PHP Developer Jobs
Process Engineer Jobs
Project Engineer Jobs
Prompt Engineering Jobs
Python Jobs
QA Jobs
Robotics Engineer Jobs
Ruby on Rails Jobs
Salesforce Administrator Jobs
Salesforce Developer Jobs
Scala Jobs
Sharepoint Developer Jobs
Site Reliability Engineer Jobs
Software Engineering Manager Jobs
Solutions Architect Jobs
SQL Developer Jobs
Structural Engineer Jobs
System Engineer Jobs
Test Engineer Jobs
Web Developer Jobs
All Filters
Total selected ()
No Results
No Results












.png)



















