Get the job you really want.
Maximum of 25 job preferences reached.
Top Site Reliability Engineer Jobs
Security
The Director of DevSecOps and SRE will lead teams in SRE, Cloud Infrastructure, and DevOps practices, focusing on automation, infrastructure reliability, and security policies while mentoring engineers and managing software projects.
Top Skills:
Aws Cloud TechnologiesGitlabGrafanaJavaKubernetesLokiMaterial UiPostgresPrometheusRabbitMQReactReduxSentrySpringTailwindTerraform
Retail • Software
The Lead Architect is responsible for designing, implementing, and managing cloud infrastructure and DevOps/SRE solutions, focusing on Azure and infrastructure automation.
Top Skills:
AnsibleAzureBashDockerElastic CloudGCPGithub ActionsKubernetesOpentelemetryPythonTerraform
Cloud • Information Technology • Security • Software
Design, build, and operate network infrastructure for cloud and on-prem environments, ensuring reliability, scalability, and security through automation and observability.
Top Skills:
AnsibleAws VpcAzure VnetsBgpDnsElkEnvoyFirewallsGcp VpcGoGrafanaNginxOpentelemetryOspfPrometheusPythonTcp/IpTerraformTransit GatewayVlans
Cloud • Information Technology • Security • Software
The role involves designing, building, and operating infrastructure systems, focusing on automation, reliability, and security for cloud and on-prem environments while collaborating closely with engineering teams.
Top Skills:
AnsibleBashCi/CdCloudFormationDockerElkGoGrafanaKubernetesLinuxOpentelemetryPrometheusPythonTerraform
Aerospace • Other
The Sr. IT Linux Site Reliability Engineer will manage and optimize Kubernetes clusters, automate systems, and foster collaboration to support SpaceX's engineering teams and infrastructure needs.
Top Skills:
AnsibleDockerGitGoGrafanaHelmJSONKubernetesLinuxPrometheusPythonTerraformYaml
Aerospace • Other
The Sr. IT Linux Site Reliability Engineer will manage and optimize Kubernetes environments, automate infrastructure, and collaborate on software platforms ensuring high performance and reliability.
Top Skills:
AnsibleGitGoGrafanaHelmInfluxdbJenkinsJsonnetKubernetesLinuxPrometheusPythonTerraformVMwareYaml
Software • Cryptocurrency
Manage and scale Kubernetes clusters, automate infrastructure, optimize performance, maintain blockchain nodes, and improve system reliability while collaborating with product teams.
Top Skills:
Aws (Ec2Aws EksDatadogDockerIam)KubernetesOpentelemetryPulumiRdsS3Terraform
Gaming
Manage operational tasks for gaming services, design runtime environments, monitor metrics, optimize architecture, and research software solutions.
Top Skills:
C/C++GoIstioJavaK8SLinuxMySQLNginxPythonRustShell
Blockchain
The Blockchain Site Reliability Engineer is responsible for maintaining blockchain nodes' reliability, monitoring, incident response, and building automation tools to enhance operations.
Top Skills:
DockerElkGoGrafanaJavaScriptKubernetesLinuxPrometheusPythonRustShell
Artificial Intelligence • Robotics • Automation • Manufacturing
Responsible for managing and setting up internal systems infrastructure, migrating SaaS to self-hosted solutions, implementing monitoring systems, and ensuring security compliance.
Top Skills:
AnsibleAWSAzureCloudFormationDatadogDnsGCPGrafanaHTTPLinux/UnixPrometheusTcp/IpTerraform
Artificial Intelligence • Information Technology • Software
Responsible for configuring, monitoring, and maintaining systems in global data centers, developing automation, and enhancing system performance. Mentors teams and collaborates with various departments on architectural solutions.
Top Skills:
AnsibleDistributed Storage SystemsDockerGitlabJenkinsKubernetesLinuxPackerPythonTerraform
Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
The Senior Site Reliability Engineer will manage system incidents, enhance monitoring and database infrastructure, and collaborate on scalable systems to maintain reliability as usage scales.
Top Skills:
AWSClickhouseKubernetesMySQLPostgresRedis
New
Track Smarter, Apply Better.
Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.
Use For Free
Gaming • Information Technology • Mobile • Software • Esports
As a Senior Site Reliability Engineer, you will enhance service infrastructure, develop automation tools, and support integration and maintenance of backend services in a large-scale environment.
Top Skills:
AWSBambooC#C++CassandraCiscoDataflowF5FlinkGitJavaJenkinsKinesisLinuxMavenMongodbMySQLPerforcePuppetPythonRedisShell ScriptTeam CityVertica
Reposted 18 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
Manage continuous delivery infrastructure for reliable code deployment. Collaborate with teams to streamline onboarding, support deployment systems, and participate in on-call rotations.
Top Skills:
Argo WorkflowsArgocdAWSAzureGoGoogle Cloud PlatformKubernetesPython
Blockchain • Web3
As a Site Reliability Engineer, you'll enhance observability, logging, and tracing, collaborating with engineers to optimize performance and security of infrastructure.
Top Skills:
AnsibleAWSAws CdkGCPGitGoGrafanaKubernetesLgtmLokiMimirOpentelemetryPrometheusRustSentryTempoTerraformTypescriptWebassembly
Artificial Intelligence • Security • Software
You will develop and improve cloud infrastructure, support distributed systems, and write infrastructure-as-code while collaborating across teams.
Top Skills:
AWSCloudFormationDockerGoJavaKubernetesPythonTerraform
Cloud • Information Technology
As a Staff Site Reliability Engineer, you will enhance cloud product lines, ensuring real-time scalability, collaborating with teams, and automating builds.
Top Skills:
AnsibleAWSAzureBashDnsDockerEnvoyGCPGitGoGrafanaHaproxyHTTPJenkinsKafkaKubernetesLinuxMySQLOciOpentelemetryPostgresPrometheusPuppetPythonRedisTcp/IpTelegrafTerraformTls
Automotive
The SRE Engineer will monitor and improve cloud and data platforms, lead incident response, maintain AWS infrastructure, and drive service reliability using monitoring tools and automation scripts.
Top Skills:
AWSAws CliBashCi/CdDatabricksDelta LakeFlinkGrafanaHadoopIcebergKafkaKinesisKubernetesPowershellPrometheusPythonSparkSqsTerraform
Artificial Intelligence • Automotive • Internet of Things • Software
As an SRE intern, you will work on system reliability, design dashboards, collaborate with teams, and apply machine learning techniques.
Top Skills:
AWSJavaJavaScriptKubernetesPythonTerraform
Healthtech • Telehealth
Seeking a Site Reliability Engineer to ensure availability and performance of cloud infrastructure. Responsibilities include observability solutions, incident response, and collaboration with teams to improve reliability and service health.
Top Skills:
AnsibleAWSAzureAzure MonitorBashCloudwatchDynatraceElasticGrafanaPowershellPythonTerraform
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
As a Site Reliability Engineer, develop solutions for deployment engineers, ensure scalable system delivery, and improve operational capabilities for military technologies.
Top Skills:
C++Cloud TechnologiesCybersecurityGoNetworkingPythonRust
Fintech • Mobile • Security • Software • Cybersecurity
Seeking a Staff Site Reliability Engineer to design, deploy and maintain high-availability infrastructure, ensuring reliability, performance, and security while collaborating with engineering teams.
Top Skills:
AWSElk StackGoGrafanaJaegerJavaKubernetesOpentelemetryOpentofuPrometheusPythonTerraform
Cloud • Information Technology
As a Staff Platform Engineer, you'll develop and maintain infrastructure components using Go and Node.js, improve service reliability, mentor juniors, and manage data ecosystems.
Top Skills:
EnvoyExpressGoJenkinsKafkaMySQLNode.jsPostgresPuppetPythonReactRedis
Sports
Manage and improve the AWS infrastructure, deploy into new regions, monitor releases, and implement new technologies in a fast-paced environment.
Top Skills:
AWSDockerGrafanaKubernetesPrometheusPython
Aerospace
The Staff Site Reliability Engineer will design SRE procedures, engineer SLOs, build tooling, and create dashboards. The role includes automating tasks and collaborating within the SRE team.
Top Skills:
DatadogGoJaegerKubernetesPrometheusPulumiPythonTerraform
Top Companies Hiring Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs
.NET Developer Jobs
Aerospace Thermal Engineering Jobs
AI Engineer Jobs
Android Developer Jobs
Automation Engineer Jobs
Backend Developer Jobs
Blockchain Developer Jobs
C# Jobs
C++ Jobs
Cloud Architect Jobs
Cloud Engineer Jobs
Design Engineer Jobs
DevOps Engineer Jobs
Director Of Engineering Jobs
Electrical Engineering Jobs
Embedded Software Engineer Jobs
Engineering Jobs
Engineering Manager Jobs
Environmental Engineering Jobs
Field Engineer Jobs
Front End Developer Jobs
Full Stack Developer Jobs
Game Developer Jobs
Golang Jobs
Hardware Engineer Jobs
Industrial Engineering Jobs
iOS Developer Jobs
Java Developer Jobs
Javascript Developer Jobs
Linux Jobs
Manufacturing Engineer Jobs
Mechanical Engineering Jobs
Network Engineer Jobs
PHP Developer Jobs
Process Engineer Jobs
Project Engineer Jobs
Prompt Engineering Jobs
Python Jobs
QA Jobs
Robotics Engineer Jobs
Ruby on Rails Jobs
Salesforce Administrator Jobs
Salesforce Developer Jobs
Scala Jobs
Sharepoint Developer Jobs
Site Reliability Engineer Jobs
Software Engineering Manager Jobs
Solutions Architect Jobs
SQL Developer Jobs
Structural Engineer Jobs
System Engineer Jobs
Test Engineer Jobs
Web Developer Jobs
All Filters
Total selected ()
No Results
No Results



















%20Logo.png)













