Get the job you really want.
Maximum of 25 job preferences reached.
Top Site Reliability Engineer Jobs
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
As a Site Reliability Engineer, you'll support NBCU's broadcasting and streaming infrastructure, ensuring system reliability, deployment automation, and incident response for critical systems.
Top Skills:
AnsibleArgocdAWSBashCloudFormationCloudwatchDockerEksElkGithub ActionsGrafanaJenkinsKubernetesPythonSplunkTerraform
AdTech • eCommerce • Food • Marketing Tech • Retail
The Principal Site Reliability Engineer will design and lead site reliability practices, ensuring system resiliency and operational excellence, while mentoring junior staff and driving large-scale reliability initiatives.
Top Skills:
AksArgocdDatadogDockerGithub ActionsJavaKubernetesPythonRedisSpring BootTerraformTomcat
AdTech • eCommerce • Food • Marketing Tech • Retail
The Principal Site Reliability Engineer will lead operational excellence initiatives, overseeing high-availability systems and scalability strategies while mentoring teams.
Top Skills:
AksArgocdBashDatadogDockerGithub ActionsGoJavaKubernetesPythonRedisSpring BootTerraformTomcat
Financial Services
Lead the Site Reliability Engineering team, enhancing application reliability, scalability, and performance. Mentor engineers, manage incidents, and implement best practices.
Top Skills:
DatadogDockerDynatraceEcsGit LabGrafanaJava Spring BootJenkinsKubernetesMicroservicesPrometheusPythonSplunkTerraform
Artificial Intelligence • Computer Vision • HR Tech • Machine Learning • Software
The Site Reliability Engineer II will manage and ensure the reliability and efficiency of SaaS application platforms, leveraging tools for automation, monitoring, and incident response while collaborating with various teams.
Top Skills:
AnsibleArgocdAWSAzureCisDockerElasticsearchFips 140-2Fips 140-3GCPGoGrafanaHelmIptablesJavaJenkinsKubernetesLinuxMongoDBMssqlMySQLPostgresPrometheusPythonSelinuxSolrStigTerraform
Cloud • Hardware • Security • Software
Manage and enhance infrastructure, automate processes, define roadmaps, and support engineering teams while ensuring uptime and efficiency.
Top Skills:
ArgocdAWSKubernetesPythonTerraform
Fintech • Machine Learning • Payments • Software • Financial Services
Lead a team to create cloud-based solutions, drive tech transformations, and mentor engineers while collaborating with product managers.
Top Skills:
AnsibleAWSDockerGoJavaKubernetesPythonRubySQLTerraform
Artificial Intelligence • Legal Tech • Machine Learning • Natural Language Processing • Software • Financial Services • Generative AI
As a Site Reliability Engineer, you will ensure system uptime, manage CI/CD pipelines, and enhance security and observability while troubleshooting issues in a collaborative environment.
Top Skills:
AWSAzureCloudFormationDatadogDockerGCPGrafanaKubernetesPrometheusTerraform
Financial Services
Lead Site Reliability Engineer responsible for implementing site reliability principles, mentoring engineers, and enhancing system observability and stability through AI-driven solutions.
Top Skills:
.NetAws CloudDatadogDockerDynatraceEcsGitlabGrafanaJava Spring BootJenkinsKubernetesPrometheusPythonSplunkTerraform
Fintech • Software
The Site Reliability Engineer will optimize development infrastructure, manage source control systems, and support production incidents while mentoring junior staff.
Top Skills:
AWSDockerGitlab CiGoJenkinsKubernetesLinuxPythonShell ScriptingTeamcityUnix
Reposted 12 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Fintech • Mobile • Payments • Financial Services
This role involves setting technical strategies, collaborating across teams, managing operations and availability, and fostering a culture of quality and ownership within the Site Reliability Engineering team.
Top Skills:
AWSKotlinKubernetesMySQLPythonSpark
Big Data • Cloud • Software • Database
The Site Reliability Engineer will design and build cloud infrastructure for MongoDB Atlas, optimize performance, and automate services worldwide.
Top Skills:
AWSDnsGCPHTTPKubernetesLinuxAzureProgramming LanguagesTls
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Cloud • Mobile • Software
The Director of Engineering, DevOps will lead DevOps & SRE functions, ensuring infrastructure reliability and driving innovation across the organization.
Top Skills:
AnsibleAWSAzureGCPGrafanaJenkinsKubernetesPrometheusTerraform
Mobile • Software
Site Reliability Engineers will work on production infrastructure, focusing on AWS and Kubernetes while ensuring high availability and customer satisfaction.
Top Skills:
AirflowAWSCircleCICloudwatchEksGrafanaMongoDBPagerdutyPingdomRustScala SparkTerraformTypescript
Cloud • Mobile • Software
The Director of Engineering, DevOps leads DevOps and Site Reliability Engineering, ensuring technical infrastructure reliability, scalability, and security while fostering automation and optimizing processes.
Top Skills:
AnsibleAWSAzureGCPGrafanaJenkinsKubernetesPrometheusTerraform
Financial Services
Lead site reliability engineering efforts, mentor team members, advocate for reliability principles, and enhance operational efficiency through AI and automation.
Top Skills:
AIAWSDatadogDynatraceElkGCPGrafanaKubernetesOpentelemetryPrometheusSplunkTerraform
Financial Services
The Site Reliability Engineer III collaborates on building reliable applications and infrastructure, implements CI/CD, and optimizes operational performance.
Top Skills:
.NetDatadogDockerDynatraceEcsGitlabGrafanaJava/Spring BootJenkinsKubernetesPrometheusPythonSplunkTerraform
Financial Services
As a Software Engineer III, you will design and deliver technology products, ensuring secure and scalable solutions while solving complex problems and improving software applications.
Top Skills:
Cloud TechnologiesDatabase Querying LanguagesFront-End TechnologiesModern Programming Languages
Financial Services
As a Senior Lead Site Reliability Engineer, you will design reliability frameworks, mentor engineers, lead observability implementations, and ensure effective service levels for applications and platforms.
Top Skills:
DatadogDynatraceGrafanaPrometheusSplunk
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Staff Production Service Engineer will maintain cloud infrastructure, drive reliability improvements, troubleshoot issues, mentor team members, and utilize software development and systems engineering skills.
Top Skills:
AnsibleAWSAzureBashDockerGCPGrafanaJavaJavaScriptKafkaKubernetesLinuxMariadbMySQLNginxOpenstackOraclePostgresPrometheusPuppetPythonSplunkTerraform
Financial Services
Responsible for maintaining system reliability, coding solutions, incident resolution, and implementing observability practices in a financial institution.
Top Skills:
CloudDatadogDynatraceGitlabGrafanaJenkinsLinuxPrometheusSplunkTerraformWindows
Cloud • Information Technology • Security • Software • Cybersecurity
Join a talented team as a Systems Reliability Engineer to enhance the Cloudflare platform's availability and performance using automation and monitoring tools.
Top Skills:
AnsibleApache AirflowChefConsulDockerGoGrafanaLinuxNginxNomadPostgresPrometheusPuppetPythonRustSaltstackSQLTemporal
Artificial Intelligence • Digital Media • eCommerce • Marketing Tech • Software • Automation
Lead the Production Engineering team, ensuring the reliability of cloud infrastructure, managing talent development, and driving strategic initiatives.
Top Skills:
AWSEksGCPKafkaPulsarSnsSqs
Artificial Intelligence • Machine Learning • Software
As a Staff Site Reliability Engineer, you will enhance the reliability, scalability, and performance of production services by applying SRE principles, implementing observability practices, automating processes, and collaborating with engineering teams.
Top Skills:
AWSAzureCloudFormationDatadogDockerElk StackGCPGoGrafanaJaegerKubernetesOpentelemetryOpentofuPrometheusPythonTerraform
Financial Services
The Site Reliability Engineer III will optimize and maintain applications and infrastructure, implement automated solutions, and support SRE practices, collaborating across teams.
Top Skills:
.NetAWSAzureCockroachdbDatadogDockerDynatraceEcsGitlabGrafanaJavaJenkinsKubernetesMySQLOraclePrometheusPythonSplunkSpring BootTerraform
Top Companies Hiring Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs
.NET Developer Jobs
Aerospace Thermal Engineering Jobs
AI Engineer Jobs
Android Developer Jobs
Automation Engineer Jobs
Backend Developer Jobs
Blockchain Developer Jobs
C# Jobs
C++ Jobs
Cloud Architect Jobs
Cloud Engineer Jobs
Design Engineer Jobs
DevOps Engineer Jobs
Director Of Engineering Jobs
Electrical Engineering Jobs
Embedded Software Engineer Jobs
Engineering Jobs
Engineering Manager Jobs
Environmental Engineering Jobs
Field Engineer Jobs
Front End Developer Jobs
Full Stack Developer Jobs
Game Developer Jobs
Golang Jobs
Hardware Engineer Jobs
Industrial Engineering Jobs
iOS Developer Jobs
Java Developer Jobs
Javascript Developer Jobs
Linux Jobs
Manufacturing Engineer Jobs
Mechanical Engineering Jobs
Network Engineer Jobs
PHP Developer Jobs
Process Engineer Jobs
Project Engineer Jobs
Prompt Engineering Jobs
Python Jobs
QA Jobs
Robotics Engineer Jobs
Ruby on Rails Jobs
Salesforce Administrator Jobs
Salesforce Developer Jobs
Scala Jobs
Sharepoint Developer Jobs
Site Reliability Engineer Jobs
Software Engineering Manager Jobs
Solutions Architect Jobs
SQL Developer Jobs
Structural Engineer Jobs
System Engineer Jobs
Test Engineer Jobs
Web Developer Jobs
All Filters
Total selected ()
No Results
No Results