Get the job you really want.
Maximum of 25 job preferences reached.
Top Site Reliability Engineer Jobs
Fintech
Lead adoption of SRE practices to improve reliability, observability, automation, and incident response. Implement and maintain observability tooling, instrumentation, CI/CD, and infrastructure-as-code. Partner with developers, participate in on-call rotations, drive postmortems, and reduce operational overhead through automation.
Top Skills:
AnthropicAWSAws EcsAws EksAzureC#DockerGitlab CiGrafanaLinuxOpenaiPrometheusPuppetPythonSplunkTerraformTypescriptWindows
Reposted 9 Days AgoSaved
Fintech
Support and evolve SRE practices: implement and maintain observability, monitoring, alerting, automation, and resilience for services. Participate in on-call rotations, incident response, postmortems, and collaborate with engineering teams to improve reliability and operational efficiency.
Top Skills:
AnthropicAWSAws EcsAws EksAzureC#DockerGitlab CiGrafanaLinuxOpenaiPrometheusPuppetPythonSplunkTerraformTypescriptWindows
Artificial Intelligence • Other • Software • Industrial • Manufacturing
As a Site Reliability Engineer, you will enhance and secure our AI platform infrastructure, focusing on performance, compliance, incident response, and automation while collaborating with teams globally.
Top Skills:
ArgocdBashCi/CdDatadogDockerElkGCPGithub ActionsGitlab CiGoGrafanaJenkinsKubernetesPrometheusPythonSpinnakerTerraform
Hardware • Semiconductor • Manufacturing
Design, build, deploy, and manage on-prem backend infrastructure and services for a small semiconductor fab. Manage bare-metal Linux servers, networking, observability, backups, and automation. Deploy and operate services like consul, vault, grafana, and postgres; build high-frequency telemetry ingestion, OS image automation, and security/secret management. Contribute to an in-house infra-as-code tool and collaborate across hardware and software teams.
Top Skills:
AlertmanagerBare-Metal ServersConsulDnsGiteaGoGrafanaLinuxOs Image AutomationPostgresPythonRedpandaReverse ProxiesRustS3-Compatible StorageSingle-Board ComputersSystemdTlsVaultVectorVictoriametricsVpns
Music
As a Site Reliability Engineer, you'll build and maintain cloud infrastructure for Spotify's AI-native developer platform, ensuring reliability and performance, while collaborating with senior engineers.
Top Skills:
AWSGCPPythonReactTerraformTypescript
Software
As a Senior Platform Engineer/SRE, you'll design, implement, and manage scalable cloud solutions, focusing on platform reliability, automation, and compliance, while collaborating with cross-functional teams.
Top Skills:
Argo WorkflowsArgocdAWSCi/CdCircleCICloudFormationDockerEnvoyGithub ActionsGoGrafanaIstioKubernetesPythonTerraform
Information Technology • Legal Tech
The role involves maintaining and improving Azure infrastructure, managing Infrastructure as Code with Terraform, enhancing security measures, and operating CI/CD pipelines.
Top Skills:
AzureAzure DevopsBashCircleCIDatadogEfkElkGithub ActionsPowershellPythonTerraform
Security • Software
Work with Cloud Engineering to improve availability, performance, security, and scalability of CyberArk SaaS. Monitor, triage, and automate remediation of production incidents, enhance monitoring and dashboards, participate in on-call rotation, and influence system design to prevent failures. Focus on automation (Ansible, scripting), IaC, cloud platforms, and secure operations.
Top Skills:
AnsibleAWSAzureBashChefCloudFormationGCPLinuxPowershellPuppetPythonRubyTerraformUnixWindows
Security • Software
Design, implement, and architect AWS cloud infrastructure and automation for SaaS reliability. Lead SRE/DevOps practices, configuration management, observability, and recovery planning while mentoring engineers and driving platform improvements.
Top Skills:
AnsibleAWSC#C++CatchpointCicdCloudFormationCloudwatchDatadogDockerEc2EksElasticsearchElkGrafanaHelmInfluxdbJavaKubernetesLogstashLogz.IoPythonS3SaltTerraformVpc
Information Technology
Design and operate secure infrastructure for government projects. Optimize performance, manage storage with IaC tools, and ensure system reliability in high-security environments.
Top Skills:
AnsibleArgocdGoGpu HardwareKubernetesKyvernoPulumiTerraform
News + Entertainment
As an Ads Reliability Engineer, you will ensure the reliability of Netflix's Ad Suite by designing scalable infrastructure, collaborating with teams, and implementing automation for monitoring and incident response.
Top Skills:
AWSAzureGCPGoJavaKubernetesPythonTerraform
Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Business Intelligence
The Senior Director of SRE leads and defines reliability and operational excellence across products, manages the SRE team, and scales reliability practices within the organization.
Top Skills:
AWSAzureCloud-Native NetworkingDistributed SystemsGCPKubernetesMicroservicesSite Reliability Engineering Principles
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Fintech • Financial Services
Ensure the reliability and scalability of systems through monitoring, automation, incident response, and collaboration with diverse teams while managing risk and providing technical guidance.
Top Skills:
CorvilElasticItrsLinuxPython
Artificial Intelligence • Healthtech
The Site Reliability Engineer will enhance system reliability, define observability standards, respond to incidents, and collaborate with engineering teams on performance and compliance improvements.
Top Skills:
AWSContainerized ServicesDistributed WorkflowsObservability ToolingPostgresServerless Compute
Big Data • Information Technology • Security • Software
The Senior Developer will drive observability roadmaps using SRE Golden Signals, establish monitoring strategies, enhance system reliability, and act as an expert in New Relic technology for performance management.
Top Skills:
BashCri-OCshKubernetesNew RelicPerlWindows Powershell
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves supporting network infrastructure, automating cloud infrastructure, managing CI/CD workflows, and ensuring operational excellence in IT support, including incident response and security practices.
Top Skills:
AnsibleAWSBashDockerGitKubernetesPythonRubyTerraform
Healthtech
The SRE Engineer will ensure system reliability of digital products, implement monitoring solutions, optimize performance, and improve deployment automation.
Top Skills:
AWSCi/CdCloudwatchDatadogKubernetesPython
Fintech • Payments
Join WEX as a Site Reliability Engineer to enhance system reliability in Azure, automate tasks, and collaborate with teams to optimize performance and incident response.
Top Skills:
AnsibleAzureBashCloudFormationDockerElk StackGoGrafanaKubernetesPrometheusPythonSplunkTerraform
16 Days AgoSaved
Easy Apply
Easy Apply
Cloud • Security • Software • Cybersecurity • Automation
As a Senior Site Reliability Engineer, you'll automate and manage GitLab environments, ensuring reliability, and scalability while troubleshooting production issues, and improving operational efficiency.
Top Skills:
AnsibleCloud Services (AwsElkGcp)GoGrafanaHelm ChartsKubernetesOmnibus-GitlabPrometheusRubyTerraform
Software
The Site Reliability Engineer will ensure the reliability and performance of Cloaked's services, contributing to the company's mission of protecting consumer data privacy.
Energy
As a Site Reliability Engineer, you will design resilient systems, manage incident responses, and implement automation for infrastructure management, ensuring high operational standards.
Top Skills:
AWSAzureBashGCPGoKubernetesPythonTerraform
Transportation
Design and develop Waabi's observability stack, optimize performance, build automation tooling, and support application requirements while leading projects and mentoring teams.
Top Skills:
AWSC/C++DockerGoGrafanaJavaKubernetesOpentelemetryPythonRust
Security • Software
The role involves architecting and leading deployment automation, managing SaaS reliability, guiding teams on cloud tools, and responding to incidents.
Top Skills:
AnsibleAWSCloudFormationCloudwatchDatadogGrafanaHelmKubernetesOpensearchPager DutyPythonSaltTerraform
Security • Software
The Site Reliability Engineer will enhance production services' reliability and security, automate tasks, monitor systems, and manage incidents.
Top Skills:
AnsibleAWSAzureBashCloudFormationGCPLinux/UnixPowershellPythonRubyTerraformWindows Os
Information Technology
As a Senior Site Reliability Engineer, you'll enhance system resilience, automate tasks, and improve infrastructure for the Intelligence Community. You'll need significant Linux experience and programming knowledge.
Top Skills:
ConfluenceDockerGitGoHpJavaJenkinsJIRAKubernetesLinuxNessusPackerPythonRust
Top Companies Hiring Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs
.NET Developer Jobs
Aerospace Thermal Engineering Jobs
AI Engineer Jobs
Android Developer Jobs
Automation Engineer Jobs
Backend Developer Jobs
Blockchain Developer Jobs
C# Jobs
C++ Jobs
Cloud Architect Jobs
Cloud Engineer Jobs
Design Engineer Jobs
DevOps Engineer Jobs
Director Of Engineering Jobs
Electrical Engineering Jobs
Embedded Software Engineer Jobs
Engineering Jobs
Engineering Manager Jobs
Environmental Engineering Jobs
Field Engineer Jobs
Front End Developer Jobs
Full Stack Developer Jobs
Game Developer Jobs
Golang Jobs
Hardware Engineer Jobs
Industrial Engineering Jobs
iOS Developer Jobs
Java Developer Jobs
Javascript Developer Jobs
Linux Jobs
Manufacturing Engineer Jobs
Mechanical Engineering Jobs
Network Engineer Jobs
PHP Developer Jobs
Process Engineer Jobs
Project Engineer Jobs
Prompt Engineering Jobs
Python Jobs
QA Jobs
Robotics Engineer Jobs
Ruby on Rails Jobs
Salesforce Administrator Jobs
Salesforce Developer Jobs
Scala Jobs
Sharepoint Developer Jobs
Site Reliability Engineer Jobs
Software Engineering Manager Jobs
Solutions Architect Jobs
SQL Developer Jobs
Structural Engineer Jobs
System Engineer Jobs
Test Engineer Jobs
Web Developer Jobs
All Filters
Total selected ()
No Results
No Results



.png)








.png)


















