Top Site Reliability Engineer Jobs
Aerospace • Artificial Intelligence • Logistics • Machine Learning • Software • Transportation • Defense
Lead the deployment, scaling, and maintenance of the Flyways AI Platform in a secure cloud infrastructure, coding software solutions and managing complex systems.
Top Skills:
AWS • CircleCI • Docker • Grafana • Helm • Jenkins • K8S • Postgres • Python • Terraform
Information Technology • Software • Financial Services
The Site Reliability Engineer will support real-time systems, perform capacity planning, manage application migrations, and troubleshoot in a collaborative environment.
Top Skills:
Bash • Python • SQL • TCP/IP • UDP • Unix/Linux
Edtech • Enterprise Web • HR Tech • Social Impact • Software
The Principal Site Reliability Engineer will design and optimize AWS environments, lead technical initiatives, mentor engineers, and enhance platform reliability through cloud architecture and automation, focusing on security, compliance, and operational excellence.
Top Skills:
AWS • AWS CDK • AWS EKS • Bash • CloudFormation • Go • Grafana • Helm • Kubernetes • Prometheus • Python • Terraform • TypeScript
Information Technology • Software • Financial Services
The role involves supporting and diagnosing problems in a real-time environment, managing large-scale application deployments, and collaborating with teams to solve technical challenges.
Top Skills:
Bash • Python • SQL • TCP/IP • UDP • Unix/Linux
Artificial Intelligence • Cloud • Information Technology • Legal Tech • Productivity • Software
The Site Reliability Engineer will automate operations, collaborate with teams, design cloud platforms, scale infrastructure, and enhance security for cloud services.
Top Skills:
AKS • Azure • Bash • Chef • Docker • EFK • ELK • Go • Grafana • Java • Kubernetes • PowerShell • Prometheus • Python • Ruby • Terraform
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Lead day-to-day support strategies ensuring high availability of services, drive quality engineering best practices, and mentor technical talent.
Top Skills:
Automation Tools • SRE Methodologies
Sales • Software • Automation
As a Site Reliability Engineer, you'll maintain and enhance infrastructure systems, manage databases, ensure system stability, and automate processes using various DevOps tools and technologies.
Top Skills:
Ansible • AWS • Docker • Elasticsearch • Flask • Kubernetes • MongoDB • Postgres • Python • Redis • Terraform
Artificial Intelligence • Enterprise Web • Information Technology • Machine Learning • Mobile • Software • Analytics
The Site Reliability Engineer will improve alert quality, maintain infrastructure, and enhance operational security while collaborating with teams.
Top Skills:
Cloud Technologies • GKE • Kubernetes
Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
Lead the SRE team to ensure continuous service operation, mentor engineers, automate infrastructure, and implement monitoring strategies.
Top Skills:
Ansible • AWS • C#/.NET • Chef • Docker • GCP • Go • Java • Kubernetes • Nutanix • Python • Terraform • vSphere
Artificial Intelligence • Big Data • Information Technology • Software
Design, build, and maintain infrastructure for a multi-tenant SaaS platform, focusing on reliability, security, and scalability while enhancing incident response and system monitoring.
Top Skills:
AWS • Bash • CI/CD • Datadog • Go • Grafana • Kubernetes • Prometheus • Python • Terraform
AdTech • Digital Media • Internet of Things • Marketing Tech • Mobile • Retail • Software
Design, implement, and maintain complex IP networks, utilizing MPLS technologies, managing projects, and ensuring operational reliability while providing Layer-3 support.
Top Skills:
Bash • BGP • Cisco IOS • Cisco IOS-XR • IP/MPLS Technologies • IS-IS • LDP • OSPF • PHP • Python
Fintech • Machine Learning • Payments • Software • Financial Services
Lead a technology portfolio as a DevOps Engineer, driving major transformation while collaborating with product managers to deliver cloud-based solutions.
Top Skills:
Ansible • AWS • Docker • Go • Java • Kubernetes • Python • Ruby • SQL • Terraform
Consumer Web • Digital Media • Information Technology • News + Entertainment • Social Media
The Site Reliability Engineer will enhance and optimize server infrastructure, improve system performance, and handle incident responses, driving efficiency in cloud-based environments.
Top Skills:
Ansible • Argo CD • C • C# • C++ • Docker • Go • Helm • Java • Kubernetes • Linux • Python • Rust • Terraform
Machine Learning • Payments • Security • Software • Financial Services
The Senior Site Reliability Engineer stabilizes environments, manages capacity and performance, creates monitoring systems, and mentors junior staff.
Top Skills:
Application Development • Disaster Recovery • Infrastructure Management • Monitoring Systems • Software Solutions
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
As a Site Reliability Engineer, you will develop automation, troubleshoot live environments, collaborate with teams, and ensure system reliability for NBCUniversal's broadcast and streaming platforms.
Top Skills:
Ansible • Argo CD • AWS • Bash • CloudFormation • CloudWatch • Docker • EKS • ELK • GitHub Actions • Grafana • Jenkins • Kubernetes • Python • Splunk • Terraform
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
As a Site Reliability Engineer, you'll support NBCU's broadcasting and streaming infrastructure, ensuring system reliability, deployment automation, and incident response for critical systems.
Top Skills:
Ansible • Argo CD • AWS • Bash • CloudFormation • CloudWatch • Docker • EKS • ELK • GitHub Actions • Grafana • Jenkins • Kubernetes • Python • Splunk • Terraform
AdTech • eCommerce • Food • Marketing Tech • Retail
The Principal Site Reliability Engineer will design and lead site reliability practices, ensuring system resiliency and operational excellence, while mentoring junior staff and driving large-scale reliability initiatives.
Top Skills:
AKS • Argo CD • Datadog • Docker • GitHub Actions • Java • Kubernetes • Python • Redis • Spring Boot • Terraform • Tomcat
AdTech • eCommerce • Food • Marketing Tech • Retail
The Principal Site Reliability Engineer will lead operational excellence initiatives, overseeing high-availability systems and scalability strategies while mentoring teams.
Top Skills:
AKS • Argo CD • Bash • Datadog • Docker • GitHub Actions • Go • Java • Kubernetes • Python • Redis • Spring Boot • Terraform • Tomcat
Financial Services
Lead the Site Reliability Engineering team, enhancing application reliability, scalability, and performance. Mentor engineers, manage incidents, and implement best practices.
Top Skills:
Datadog • Docker • Dynatrace • ECS • GitLab • Grafana • Java Spring Boot • Jenkins • Kubernetes • Microservices • Prometheus • Python • Splunk • Terraform
Artificial Intelligence • Computer Vision • HR Tech • Machine Learning • Software
The Site Reliability Engineer II will manage and ensure the reliability and efficiency of SaaS application platforms, leveraging tools for automation, monitoring, and incident response while collaborating with various teams.
Top Skills:
Ansible • Argo CD • AWS • Azure • CIS • Docker • Elasticsearch • FIPS 140-2 • FIPS 140-3 • GCP • Go • Grafana • Helm • iptables • Java • Jenkins • Kubernetes • Linux • MongoDB • MSSQL • MySQL • Postgres • Prometheus • Python • SELinux • Solr • STIG • Terraform
Cloud • Hardware • Security • Software
Manage and enhance infrastructure, automate processes, define roadmaps, and support engineering teams while ensuring uptime and efficiency.
Top Skills:
Argo CD • AWS • Kubernetes • Python • Terraform
Fintech • Machine Learning • Payments • Software • Financial Services
Lead a team to create cloud-based solutions, drive tech transformations, and mentor engineers while collaborating with product managers.
Top Skills:
Ansible • AWS • Docker • Go • Java • Kubernetes • Python • Ruby • SQL • Terraform
Artificial Intelligence • Legal Tech • Machine Learning • Natural Language Processing • Software • Financial Services • Generative AI
As a Site Reliability Engineer, you will ensure system uptime, manage CI/CD pipelines, and enhance security and observability while troubleshooting issues in a collaborative environment.
Top Skills:
AWS • Azure • CloudFormation • Datadog • Docker • GCP • Grafana • Kubernetes • Prometheus • Terraform
Financial Services
Lead Site Reliability Engineer responsible for implementing site reliability principles, mentoring engineers, and enhancing system observability and stability through AI-driven solutions.
Top Skills:
.NET • AWS Cloud • Datadog • Docker • Dynatrace • ECS • GitLab • Grafana • Java Spring Boot • Jenkins • Kubernetes • Prometheus • Python • Splunk • Terraform
Fintech • Software
The Site Reliability Engineer will optimize development infrastructure, manage source control systems, and support production incidents while mentoring junior staff.
Top Skills:
AWS • Docker • GitLab CI • Go • Jenkins • Kubernetes • Linux • Python • Shell Scripting • TeamCity • Unix