Maximum of 25 job preferences reached.
Top Site Reliability Engineer Jobs
Reposted 4 Days AgoSaved
Fintech • Analytics
As a Senior Site Reliability Engineer, you'll lead incident recovery, enhance production stability, automate processes, and collaborate with development teams to improve operational efficiency.
Top Skills:
AWSAzureBigpandaCloud-Native ApplicationsDatadogDnsDockerGitHTTPKubernetesShell ScriptingTcp/IpUnix
Reposted 4 Days AgoSaved
Fintech • Analytics
As a Senior Site Reliability Engineer, you will enhance the reliability and performance of the FX trading platform, focusing on system health, automation, and collaboration with development teams. Responsibilities include measuring availability, incident response, and supporting cloud migration.
Top Skills:
AWSAzureBashC#DatadogJavaKubernetesPythonSQL
Aerospace • Artificial Intelligence
The Site Reliability Engineer will architect and manage ground infrastructure for satellite systems, ensuring high availability, automating deployments, and optimizing data management systems.
Top Skills:
AnsibleAWSAzureC++CloudFormationEksElkGCPGrafanaHelmKubernetesPrometheusPythonTerraform
Blockchain • Fintech • Cryptocurrency
Responsible for the design, implementation, and reliability of systems across hybrid cloud and on-premises environments, while leading technical initiatives and mentoring engineers.
Top Skills:
AnsibleDatadogGithub ActionsHelmKubernetesPythonTerraform
Security • Software • Cybersecurity • Automation
As a Senior Site Reliability Engineer, you will enhance the reliability of Drata’s product teams through automation, architecture reviews, and operational excellence using cloud-native technologies.
Top Skills:
AiopsAWSBashDatadogDockerGitGithub ActionsKubernetesLinuxMySQLPythonTerraform
Financial Services
The Staff Site Reliability Engineer will lead Platform Engineering's SRE efforts by defining technical strategy, overseeing architecture, and enhancing operational excellence through mentorship and governance.
Top Skills:
ArgocdGCPGkeGoKafkaNode.jsPythonTerraform
Computer Vision • Machine Learning • Software
As a Site Reliability Engineer, ensure the reliability, performance, and scalability of Ditto's cloud infrastructure by developing observability solutions, leading incident management, and collaborating with product engineering teams.
Top Skills:
AWSAzureCDatadogGCPGoGrafanaHelmJavaKubernetesPrometheusRustTerraform
Consumer Web • eCommerce • Fashion • Retail
The Senior Site Reliability Engineer ensures the health of systems, automates processes, and collaborates on architecture to maintain uptime and reliability in production environments.
Top Skills:
AnsibleAWSAzureDatadogDockerElasticsearchGCPGraphiteHaproxyJavaScriptJenkinsKubernetesMongoDBNagiosNew RelicNginxNode.jsRabbitMQRedisRubyTerraformTomcat
Consumer Web • eCommerce • Fashion • Retail
The Software Engineer, SRE will develop, deploy, and support new product features while ensuring operational excellence and quality support in a fast-paced environment.
Top Skills:
AWSDockerElasticsearchHaproxyJavaScriptKubernetesMongoDBNginxNode.jsRabbitMQRedisRubyTomcat
Cloud
The role involves building and managing observability infrastructure in GCP, automating deployments, and optimizing data processes for high reliability.
Top Skills:
GkeGoGCPGrafanaKubernetesOpentelemetryPythonRubySplunkTerraform
Aerospace
The Site Reliability Engineer will ensure system reliability, assist with incident response, improve operational quality, and automate processes to reduce toil. Responsibilities include incident resolution, reliability evaluations, and platform enablement.
Top Skills:
Argo CdDockerGitGitlabGoGrafanaJenkinsKubernetesOtel StandardsPrometheusPython
Artificial Intelligence • Machine Learning • Generative AI
As a Site Reliability Engineer, you will manage Kubernetes clusters, automate infrastructure, improve operational metrics, and enhance reliability across data centers.
Top Skills:
CloudFormationGoGpuKubernetesLinuxPythonTerraform
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Information Technology • Consulting
The Site Reliability Engineer designs monitoring frameworks, manages SLIs/SLOs, automates cloud infrastructure, ensures compliance, and promotes SRE best practices.
Top Skills:
.Net CoreAmazon AwsAnsibleBashChefDockerElkGithub ActionsGoogle GcpGrafanaJavaJenkinsKubernetesAzureNode.jsOpen-TelemetryPowershellPrometheusPuppetPythonRSplunkTerraform
Reposted 5 Days AgoSaved
Financial Services
As a Principal Application Support Engineer, you'll ensure system reliability and operational resilience, implementing SRE practices, and leading incident management efforts.
Top Skills:
AWSAzureDynatraceGCPGoItsiJavaLinuxPythonSplunkUnix
Digital Media • Social Media • Software • Sports
Lead the technical architecture and execution of migration to AWS, drive developer enablement, and automate infrastructure using code-first principles.
Top Skills:
Aws EksDatadogGithub ActionsGoIstioK6KubernetesNode.jsTerraform
Software
As a Site Reliability Engineer, you'll enhance system reliability, collaborate on production readiness, define SLIs/SLOs, and improve incident response.
Top Skills:
AWSDatadogGrafanaKubernetesOpentelemetryPrometheusTypescript
Edtech
The Lead Software Engineer will lead the SRE team, focusing on reliability, performance optimization, security, and mentoring developers, while improving overall platform resilience.
Top Skills:
ActivejobAnsibleAWSAws CloudwatchEc2EcsElasticsearchGitGCPGoogle Cloud StackdriverJenkinsJIRAKubernetesMemcachedMongoDBNew RelicNode.jsPostgresRedisRuby On RailsSidekiqSpinnakerTerraformTerragrunt
Gaming • Information Technology • Mobile • Software • Esports
Seeking a Senior Site Reliability Engineer to design and operate scalable platform solutions, enhance reliability, and improve developer experience and operational efficiency across engineering teams.
Top Skills:
AWSGCP
Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
The Senior Site Reliability Engineer will manage system incidents, improve monitoring and logging, optimize database infrastructure, and collaborate on scaling systems efficiently.
Top Skills:
AWSClickhouseKubernetesMySQLPostgresRedis
Healthtech
Develop and implement processes to ensure high availability and reliability of services. Responsibilities include incident management, automation, capacity planning, and risk mitigation.
Top Skills:
AWSAzureDatadogDockerGrafanaJavaScriptNew RelicPrometheusPythonRubySplunkTerraform
Internet of Things • Cybersecurity
The Site Reliability Engineer will manage AWS GovCloud infrastructure, ensuring compliance and high availability while driving automation, security, and incident response best practices.
Top Skills:
AnsibleAws GovcloudBashDockerElk StackGitlab Ci/CdGrafanaJenkinsKubernetesPrometheusPythonTerraform
Fintech
Lead adoption of SRE practices to improve reliability, observability, automation, and incident response. Implement and maintain observability tooling, instrumentation, CI/CD, and infrastructure-as-code. Partner with developers, participate in on-call rotations, drive postmortems, and reduce operational overhead through automation.
Top Skills:
AnthropicAWSAws EcsAws EksAzureC#DockerGitlab CiGrafanaLinuxOpenaiPrometheusPuppetPythonSplunkTerraformTypescriptWindows
Reposted 6 Days AgoSaved
Fintech
Support and evolve SRE practices: implement and maintain observability, monitoring, alerting, automation, and resilience for services. Participate in on-call rotations, incident response, postmortems, and collaborate with engineering teams to improve reliability and operational efficiency.
Top Skills:
AnthropicAWSAws EcsAws EksAzureC#DockerGitlab CiGrafanaLinuxOpenaiPrometheusPuppetPythonSplunkTerraformTypescriptWindows
Insurance
The Site Reliability Engineer enhances systems reliability and performance while developing software for observability and supports incident management in a dynamic environment.
Top Skills:
Ci/Cd PipelinesInformaticaKubernetesNew RelicOraclePostgresPythonSnowflakeSplunkSQL
Real Estate • Sales • Software • PropTech
The Senior Site Reliability Engineer will lead infrastructure reliability, performance, and scalability while mentoring engineers and overseeing system health and incident responses.
Top Skills:
AWSAzureC#CloudFormationDockerGCPGoKubernetesPulumiPythonTerraformTypescript
Let Your Resume Do The Work
Upload your resume to be matched with jobs you're a great fit for.
Success! We'll use this to further personalize your experience.
Top Companies Hiring Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs
.NET Developer Jobs
Aerospace Thermal Engineering Jobs
AI Engineer Jobs
Android Developer Jobs
Automation Engineer Jobs
Backend Developer Jobs
Blockchain Developer Jobs
C# Jobs
C++ Jobs
Cloud Architect Jobs
Cloud Engineer Jobs
Design Engineer Jobs
DevOps Engineer Jobs
Director Of Engineering Jobs
Electrical Engineering Jobs
Embedded Software Engineer Jobs
Engineering Jobs
Engineering Manager Jobs
Environmental Engineering Jobs
Field Engineer Jobs
Front End Developer Jobs
Full Stack Developer Jobs
Game Developer Jobs
Golang Jobs
Hardware Engineer Jobs
Industrial Engineering Jobs
iOS Developer Jobs
Java Developer Jobs
Javascript Developer Jobs
Linux Jobs
Manufacturing Engineer Jobs
Mechanical Engineering Jobs
Network Engineer Jobs
PHP Developer Jobs
Process Engineer Jobs
Project Engineer Jobs
Prompt Engineering Jobs
Python Jobs
QA Jobs
Robotics Engineer Jobs
Ruby on Rails Jobs
Salesforce Administrator Jobs
Salesforce Developer Jobs
Scala Jobs
Sharepoint Developer Jobs
Site Reliability Engineer Jobs
Software Engineering Manager Jobs
Solutions Architect Jobs
SQL Developer Jobs
Structural Engineer Jobs
System Engineer Jobs
Test Engineer Jobs
Web Developer Jobs
All Filters
Total selected ()
No Results
No Results
.jpg)










.jpeg)





















