Get the job you really want.
Maximum of 25 job preferences reached.
Top Site Reliability Engineer Jobs
Fintech • Software
The Site Reliability Engineer leads tech teams for resilient infrastructure, enhances reliability via automation, and integrates DevSecOps practices. They improve application reliability and work with cloud-native platforms.
Top Skills:
Cloud-Native PlatformsKubernetesOpenshiftOpenstackPrometheusSplunkVMware
Information Technology • Software
The Site Reliability Engineer automates IT operations, enhances system reliability, collaborates on architecture, and ensures efficient deployment for the Platform team.
Top Skills:
AnsibleAWSAzureChefGoGCPJavaJavaScriptKubernetesLinux/UnixPuppetPython
Information Technology • Cryptocurrency
The Site Reliability Engineer will lead technical initiatives, architect solutions, troubleshoot issues, mentor team members, and improve observability practices.
Top Skills:
ArgocdBashElk StackGCPGoGrafanaHelmKubernetesPrometheusPythonTerraform
HR Tech • Information Technology • Professional Services • Software • Business Intelligence • Consulting • Automation
Seeking a Site Reliability Engineer with expertise in Unix/Linux, scripting languages, and experience in containerization, cloud platforms, and application monitoring tools.
Top Skills:
AnsibleApache TomcatAWSCassandraChefCoradiantDockerDynatraceElasticGomezGCPJenkinsKafkaLinuxMq SeriesOraclePuppetPythonShell ScriptingSplunkTealeafUnixVagrantWebsphere
3D Printing • Artificial Intelligence • Software • Design
The role involves building reliable platforms for 3D/4D content delivery to AR/VR devices, monitoring system health, and improving operational practices in collaboration with the engineering team.
Top Skills:
Aws FargateCoreweaveGrafanaKubernetesPrometheusTerraform
Fintech • Information Technology • Payments
As a Staff Site Reliability Engineer, you will improve system reliability, lead incident response, automate tasks, support cloud migration, and manage enterprise systems.
Top Skills:
Active DirectoryAWSIisMqMs Sql ServerPowershellWindows Server
Energy
The Site Reliability Engineer will design and implement high-availability systems, automate IT infrastructures, and ensure effective monitoring and alerting across teams.
Top Skills:
Active DirectoryAnsibleAzure)ChefCloud Infrastructure (AwsJSONLinuxPuppetPythonRestVMwareWindows ServerYaml
Reposted 19 Days AgoSaved
Cloud • Information Technology
The Site Reliability Engineer will support IaaS services, monitor infrastructure health, perform root cause analysis, automate processes, and collaborate with teams for service reliability.
Top Skills:
AnsibleAWSAzureBashGitlab CiJenkinsKubernetesLinuxOpenshiftPythonTerraformVmware Vsphere
Reposted 19 Days AgoSaved
Easy Apply
Easy Apply
Healthtech
The SRE will design and implement platform solutions, maintain cloud environments, monitor and troubleshoot production issues, and automate tasks to improve efficiency.
Top Skills:
AnsibleAWSDockerGCPGitIacLinuxMySQLPHPTerraform
Financial Services
As a Staff SRE Engineer, you'll lead the Data Infra team in improving reliability, architecture, and automation for the Data Platform while mentoring engineers.
Top Skills:
AWSClojureDatomicEc2KubernetesLambdasScalaSparkStep Functions
Cloud • Information Technology
As a Site Reliability Engineer, you will design and implement monitoring solutions, establish monitoring frameworks, automate incident management, and integrate monitoring into IT processes to enhance system reliability.
Top Skills:
AiopsNagiosScience LogicServicenowVMwareVrealize Operations ManagerZabbix
Cloud
The Staff Site Reliability Engineer will manage large-scale cloud production systems, ensuring reliability and performance, while automating processes and responding to incidents.
Top Skills:
AWSBashCloudFormationDockerGoHelmKubernetesPythonRubyTerraform
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Artificial Intelligence • Healthtech
The Senior Site Reliability Engineer will design and implement infrastructure automation, CI/CD pipelines, and monitoring for a healthcare AI platform while ensuring reliability and security.
Top Skills:
AnsibleAWSAws KmsAzureAzure Key VaultDatadogDockerElkGCPGrafanaHashicorp VaultJenkinsKubernetesTerraform
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
The Senior Site Reliability Engineer designs, deploys, and maintains cloud infrastructure, enhances system resilience and leads complex projects, collaborating with teams and promoting best practices.
Top Skills:
ArgocdAWSAzureC++DockerGoHelmKubernetesPythonRustTerraform
Artificial Intelligence • Software
The SRE at Fluidstack is responsible for ensuring infrastructure reliability and performance, handling complex production issues, and improving platform stability.
Top Skills:
AnsibleBashGoKubernetesPythonSlurmTerraform
Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Payments • Software • Financial Services
The Lead Site Reliability Engineer will drive reliability strategies, architect and maintain infrastructure, lead incident responses, and influence engineering practices for operational excellence while mentoring team members.
Top Skills:
AWSDockerFastapiKubernetesPostgresPythonTypescriptVue
Reposted 20 Days AgoSaved
Easy Apply
Easy Apply
Analytics
The Site Reliability Engineer will ensure the reliability and performance of IaaS services, perform incident resolution, and enhance system reliability through automation while supporting mobility across hybrid infrastructures and collaborating extensively with various teams.
Top Skills:
AnsibleAWSAzureBashGitlab CiJenkinsKubernetesLinuxOpenshiftPythonTerraformVmware Vsphere
AdTech • Big Data • eCommerce • Marketing Tech • Real Estate • Software
The Site Reliability Engineer will manage AWS infrastructure, optimize Kubernetes environments, build CI/CD pipelines, and enhance system security and performance.
Top Skills:
AnsibleAWSBashCloudflareCloudwatchDockerGitlabGoGrafanaKubernetesPrometheusPythonTerraform
Insurance • Cybersecurity
The Site Reliability Engineer II will build and operate infrastructure, improve system reliability, and enhance developer tools while collaborating across teams using AWS, Terraform, and IaC principles.
Top Skills:
AWSEcsGithub ActionsGoKafkaKinesisKubernetesPythonTerraform
Information Technology • Mobile • Software
As a Site Reliability Engineer, you'll ensure system reliability and scalability, automate processes, optimize performance, and collaborate on system design.
Top Skills:
AWSAzureBashCloudFormationDatadogDockerElkGoGoogle Cloud PlatformGrafanaHelmKubernetesNew RelicPrometheusPulumiPythonTerraform
Cloud • Software
The Site Reliability Engineer (SRE) will manage reliable, scalable systems, focusing on software development, infrastructure automation, and incident response. Responsibilities include monitoring, CI/CD pipeline management, security compliance, and cost optimization while collaborating with various teams.
Top Skills:
AWSAzureDockerElk StackGCPGitGrafanaJavaKubernetesPHPPrometheusPythonShellTerraform
Gaming • Mobile
The Site Reliability Engineer (SRE) will enhance production system stability and performance, collaborate with DevOps, manage on-call responsibilities, and improve observability. Responsibilities include monitoring, reliability engineering, incident management, and documentation.
Top Skills:
ArgocdAWSBashEc2EksGithub ActionsGitlab Ci/CdGraylogHashicorp VaultHelmIamKubernetesNew RelicPythonRoute53S3Terraform
Reposted 22 Days AgoSaved
eCommerce • Fashion • Mobile • Software
Lead SRE team ensuring reliability and performance of systems. Drive automation, collaborate on reliability goals, and measure effectiveness through SLOs and SLAs.
Top Skills:
AWSEksGoJavaKubernetesNew RelicPythonSplunk
Blockchain • Fintech • Social Media • Cryptocurrency • NFT • Web3
Design, build, and operate scalable, highly available infrastructure and platform software for Zora's blockchain services (indexer, APIs, data pipelines). Automate workflows, maintain core systems, improve developer experience, participate in on-call rotation, and contribute strategic technical direction.
Top Skills:
AsyncioBaseBridgesCephCloudflare Pages FunctionsDatadogDockerEthereumGoIpfsKubernetesMongoDBOpentelemetryOptimismOptimistic RollupsPlasmaPolygonPostgresPythonRpc NodesSidechainsVercelZk-Rollups
Security • Software • Analytics
Design, operate, and automate scalable, secure infrastructure for Axiom Cloud. Define SLOs, plan disaster recovery and capacity, tune performance, improve deployment practices, build reliability tooling, respond to incidents, and promote monitoring and observability across teams.
Top Skills:
Aws,Docker,Kubernetes,Amazon Eks,Terraform,Pulumi,Linux,Github Actions,Gitlab,Circleci,Llms,Golang,Monitoring And Observability Tools
Top Companies Hiring Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs
.NET Developer Jobs
Aerospace Thermal Engineering Jobs
AI Engineer Jobs
Android Developer Jobs
Automation Engineer Jobs
Backend Developer Jobs
Blockchain Developer Jobs
C# Jobs
C++ Jobs
Cloud Architect Jobs
Cloud Engineer Jobs
Design Engineer Jobs
DevOps Engineer Jobs
Director Of Engineering Jobs
Electrical Engineering Jobs
Embedded Software Engineer Jobs
Engineering Jobs
Engineering Manager Jobs
Environmental Engineering Jobs
Field Engineer Jobs
Front End Developer Jobs
Full Stack Developer Jobs
Game Developer Jobs
Golang Jobs
Hardware Engineer Jobs
Industrial Engineering Jobs
iOS Developer Jobs
Java Developer Jobs
Javascript Developer Jobs
Linux Jobs
Manufacturing Engineer Jobs
Mechanical Engineering Jobs
Network Engineer Jobs
PHP Developer Jobs
Process Engineer Jobs
Project Engineer Jobs
Prompt Engineering Jobs
Python Jobs
QA Jobs
Robotics Engineer Jobs
Ruby on Rails Jobs
Salesforce Administrator Jobs
Salesforce Developer Jobs
Scala Jobs
Sharepoint Developer Jobs
Site Reliability Engineer Jobs
Software Engineering Manager Jobs
Solutions Architect Jobs
SQL Developer Jobs
Structural Engineer Jobs
System Engineer Jobs
Test Engineer Jobs
Web Developer Jobs
All Filters
Total selected ()
No Results
No Results
















.png)



.png)












