Get the job you really want.
Maximum of 25 job preferences reached.
Top Site Reliability Engineer Jobs
Healthtech • Biotech • Pharmaceutical
Design, operate, and automate enterprise Oracle Database platforms across on‑prem, hybrid, and cloud environments. Ensure availability, performance tuning, HA/DR, backup/recovery, security/compliance, monitoring, and lifecycle management while driving IaC automation and SRE practices to reduce toil and improve reliability.
Top Skills:
Active Data GuardAiopsAnsibleAWSAzureCi/CdData PumpExaccExadataGitGrafanaInfrastructure As CodeKubernetesOciOci Resource ManagerOemOracle Cloud InfrastructureOracle Data GuardOracle Database (19C+)Oracle RacPythonRmanServicenowShell ScriptingSplunkSQL ServerTerraform
Healthtech • Information Technology • Software • Telehealth
The Senior Site Reliability Engineer will develop, monitor, and maintain distributed production systems, ensuring uptime for patients and providers while automating processes and supporting a large engineering team.
Top Skills:
AWSDockerGCPKubernetes
Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Business Intelligence
Lead architecture and build reliability platforms, drive AIOps automation, champion SRE practices, lead incident response and postmortems, advance observability, and mentor engineers to improve system reliability and performance.
Top Skills:
AiopsAWSAzureContinuous ProfilingDatadogDnsElkGCPGoGrafanaHttp/SKubernetesLoad BalancingOpentelemetryPrometheusPythonTcp/Ip
Reposted 16 Days AgoSaved
Easy Apply
Easy Apply
Marketing Tech • Mobile • Software
Lead the Site Reliability Engineering team, ensuring platform reliability, scalability, and developer support while fostering an inclusive environment and coaching team members.
Top Skills:
EmberGoReact
Artificial Intelligence • Information Technology
As a Site Reliability Engineer, maintain user-facing services, implement best practices for reliability, and manage production incidents.
Top Skills:
AnsibleCloud ServicesKubernetesProgramming LanguagesTerraform
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Generative AI
The Site Reliability Engineer will develop, deploy, and operate AI infrastructure, focusing on high-performance and scalable machine learning systems using Kubernetes and cloud platforms.
Top Skills:
AWSAzureC++GCPGoKubernetesOci
eCommerce
Responsible for platform reliability, monitoring, automation, and system health for Coupang's customer-facing services, ensuring scalable solutions and handling production incidents.
Top Skills:
AWSAzureDatadogDockerElastic StackGoGoogle Cloud PlatformGrafanaJavaKubernetesNew RelicPrometheusPythonRuby
Artificial Intelligence • Healthtech • Information Technology • Software
As a Site Reliability Engineer, you will manage the production environment, focusing on infrastructure design, automation, and optimizing deployment pipelines to ensure high availability.
Top Skills:
HelmKafkaKubernetesPostgresPythonRedisTerraformTypescript
Artificial Intelligence • Machine Learning • Security • Database • Analytics • Big Data Analytics
As a Site Reliability Engineer, you'll ensure the availability and performance of AI applications, maintain infrastructure, automate tasks, and troubleshoot issues in high-scale environments.
Top Skills:
AnsibleAWSAzureBashCircleCICloudFormationDatadogDockerDynatraceEc2Elk StackGCPGitlab CiGoGrafanaJenkinsKubernetesLambdaLinuxPrometheusPythonS3TerraformUnix
Information Technology • Software
As a DevOps Engineer, you'll design and scale secure systems, manage AWS environments, automate operations, and ensure operational excellence for revenue teams.
Top Skills:
Amazon AuroraAWSDockerDynamoDBGithub ActionsKafkaS3SnowflakeSparkSqsTerraform
Financial Services
The Senior Cluster Site Reliability Engineer will enhance the research compute cluster's uptime, reliability, and performance through engineering and operational improvements, ensuring high availability for researchers working on machine learning problems.
Top Skills:
AnsibleAWSAWSCephDockerElkGCPGCPGrafanaHorovodHpcInfinibandKubeflowKueueLokiLustreMlflowOpentelemetryPodmanPrometheusPythonRdmaRubyS3SingularitySlurmTerraform
Reposted 16 Days AgoSaved
Easy Apply
Easy Apply
Software
The Senior SRE Manager will establish an SRE team, implement best practices, manage incidents, and enhance system reliability, scaling operations effectively.
Top Skills:
Cloud InfrastructureDistributed SystemsObservability
New
Track Smarter, Apply Better.
Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.
Use For Free
Software • Cybersecurity
As an SRE Engineer II, manage multi-cloud infrastructure, enhance reliability, design services, implement IaC, develop CI/CD pipelines, and automate tasks with a focus on security and scalability.
Top Skills:
Arm TemplatesAWSAws CodepipelineAzureAzure DevopsCloudFormationGCPGoJenkinsPowershellPythonTerraform
Cloud • Information Technology • Security • Software
The Site Reliability Engineer III role involves developing services, automating infrastructure, managing CI/CD environments, and improving system reliability for applications and technology stacks.
Top Skills:
AnsibleAzureChefGitGitlabGoJavaScriptKubernetesMs Sql ServerPostgresPuppetPythonTerraform
Artificial Intelligence • Information Technology • Software
The role involves developing and maintaining tools for data center networks, ensuring performance and security while collaborating with teams on network management.
Top Skills:
Azure DevopsGitlabGoGrafanaJenkinsLinuxNagiosPrometheusPythonSolarwindsZabbix
Reposted 22 Days AgoSaved
Easy Apply
Easy Apply
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will build and scale identity management tools, automate operations, ensure security, and support AWS, GCP, and Azure environments.
Top Skills:
AnsibleAWSAzureC#Cloud Identity ProvidersDockerGCPGoInfrastructure As CodeJavaKubernetesPythonRubyTerraform
Information Technology • Productivity • Software • Infrastructure as a Service (IaaS)
The role involves diagnosing infrastructure issues, participating in on-call rotations, improving application availability, and enhancing automation in cloud environments.
Top Skills:
AnsibleAWSC++CloudFormationDatadogGoHelmJavaKotlinKubernetesNew RelicPostgresSplunkTerraform
Software
As a Site Reliability Engineer, you will manage the reliability and scalability of platform infrastructure, build observability tools, and automate processes to enhance operational excellence.
Top Skills:
AWSGCPGoKubernetesPulumiPythonTerraform
Information Technology • Consulting
The Lead Site Reliability Engineer will create an IT support automation strategy to reduce ticket volume and automate workflows, focusing on eliminating recurring issues and driving measurable outcomes across IT support systems.
Top Skills:
AWSAzureGCPGoKubernetesOauth2PulumiPythonRest ApisSAMLTerraform
Information Technology • Security • Software
As a Site Reliability Engineer, you'll build and maintain cloud infrastructure, support Kubernetes workloads, and enhance CI/CD pipelines within a collaborative team environment.
Top Skills:
AnsibleAWSGCPGoKubernetesLinuxPythonTerraform
Software
Seeking a Senior Site Reliability Engineer to enhance reliability and speed for Stream Aligned teams, ensuring ownership of services, building tooling, and improving deployment processes.
Top Skills:
AWSJavaKubernetesLinuxPostgresTerraform
Fintech
The Principal Site Reliability Engineer designs and implements software to enhance application performance and resilience while ensuring security standards. Responsibilities include automating application management, providing observability, and leading cross-functional teams. Mentorship and on-call rotation participation are expected.
Top Skills:
AuroraAWSChefDockerDynamo DbGitGoJavaJenkinsJmsKafkaKubernetesMavenMemcachedOraclePythonRedisSqsSwarm
Cloud • Software • Analytics
The Senior Cloud SRE enhances reliability and availability of solutions by automating tasks, providing on-call support, and mentoring teams.
Top Skills:
AnsibleAppdynamicsBmcC#C++DatadogDockerDynatraceGitGrafanaJavaJIRAKubernetesNew RelicPerlPrometheusPythonRubySplunkTerraform
HR Tech • Information Technology • Professional Services • Sales • Software
Own and operate production-grade Kubernetes infrastructure on AWS, build GitOps CI/CD with GitHub Actions and ArgoCD, develop AI agents and internal DevOps tooling, maintain Datadog-based observability, and manage on-call incident response while collaborating with engineering teams to improve reliability and delivery speed.
Top Skills:
Ai/LlmArgocdAWSCi/CdDatadogGithub ActionsGitopsGoKubernetesPython
Cloud • Security • Software
As a Staff Site Reliability Engineer, you will design, build, and maintain cloud infrastructure, improve deployment processes, and collaborate across teams.
Top Skills:
Ci/CdDockerGoKubernetes
Top Companies Hiring Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs
.NET Developer Jobs
Aerospace Thermal Engineering Jobs
AI Engineer Jobs
Android Developer Jobs
Automation Engineer Jobs
Backend Developer Jobs
Blockchain Developer Jobs
C# Jobs
C++ Jobs
Cloud Architect Jobs
Cloud Engineer Jobs
Design Engineer Jobs
DevOps Engineer Jobs
Director Of Engineering Jobs
Electrical Engineering Jobs
Embedded Software Engineer Jobs
Engineering Jobs
Engineering Manager Jobs
Environmental Engineering Jobs
Field Engineer Jobs
Front End Developer Jobs
Full Stack Developer Jobs
Game Developer Jobs
Golang Jobs
Hardware Engineer Jobs
Industrial Engineering Jobs
iOS Developer Jobs
Java Developer Jobs
Javascript Developer Jobs
Linux Jobs
Manufacturing Engineer Jobs
Mechanical Engineering Jobs
Network Engineer Jobs
PHP Developer Jobs
Process Engineer Jobs
Project Engineer Jobs
Prompt Engineering Jobs
Python Jobs
QA Jobs
Robotics Engineer Jobs
Ruby on Rails Jobs
Salesforce Administrator Jobs
Salesforce Developer Jobs
Scala Jobs
Sharepoint Developer Jobs
Site Reliability Engineer Jobs
Software Engineering Manager Jobs
Solutions Architect Jobs
SQL Developer Jobs
Structural Engineer Jobs
System Engineer Jobs
Test Engineer Jobs
Web Developer Jobs
All Filters
Total selected ()
No Results
No Results














.png)


















