Get the job you really want.
Maximum of 25 job preferences reached.
Top Site Reliability Engineer Jobs
Artificial Intelligence • Software
The Site Reliability Engineer will manage infrastructure, drive enterprise deployments, and ensure the reliability of the Freeplay platform by working closely with customers and optimizing cloud architectures.
Top Skills:
AWSAzureDatadogElasticsearchGCPHelmKotsNats JetstreamPostgresReplicatedTerraform
Artificial Intelligence • Software
As Staff SRE Tech Lead, you'll oversee platform reliability and scalability, lead the SRE team, architect data infrastructures, and optimize systems while implementing automation and observability practices.
Top Skills:
ClickhouseGoPostgresPythonTypescript
Information Technology • Software • Automation
The Site Reliability Engineer ensures operational stability in a cloud environment, providing customer support and troubleshooting while collaborating in a fast-paced team.
Top Skills:
AccumuloAnsibleAWSBashDockerGrafanaHadoopHadoop Distributed File SystemJavaJIRAKubernetesLinuxOpenstackPrometheusPythonSaltVirtualization
Information Technology • Software • Automation
The Senior Site Reliability Engineer will manage AWS environments, develop Infrastructure as Code, and automate operational tasks to ensure high availability in cloud systems.
Top Skills:
Amazon Web Services (Aws)AnsibleAws Certified Developer-AssociateAws Certified Solutions Architect-AssociateAws Certified Solutions Architect-ProfessionalAws Certified Sysops Administrator-AssociateCertified Kubernetes Administrator (Ckad)Ci/CdDockerElastic Certified EngineerElastic Certified Observability EngineerKubernetesTerraform
Artificial Intelligence • Software
The Senior Site Reliability Engineer will ensure the reliability and scalability of our Generative AI SaaS platform, implement automation, and support incident response efforts.
Top Skills:
AWSAzureBashCloudFormationDockerElk StackGCPGoGrafanaKubernetesPrometheusPythonTerraform
Payments
As a Principal Site Reliability Engineer, you'll architect scalable infrastructure, drive reliability, mentor engineers, and lead AI enablement efforts, ensuring high-performance across systems.
Top Skills:
AWSCi/CdDatadogElasticsearchGoGrafanaKubernetesNew RelicPrometheusPythonRds (Mysql/Postgres)Sql-Based RdbmsTypescript
Aerospace • Manufacturing
As a Site Reliability Engineer, you'll build and manage observability platforms for satellite communications, define SLOs/SLIs, and collaborate on incident response and deployment automation.
Top Skills:
ArgocdAWSElkGCPGoGrafanaIstioJaegerKubernetesLinkerdLokiOpentelemetryPrometheusPythonTempoTerraform
Aerospace • Defense • Manufacturing
As Lead Site Reliability Engineer, you'll ensure reliability and performance of AI infrastructure, manage deployments, and mentor junior engineers.
Top Skills:
AnsibleBmcCi/CdCudaIdracImpiKubernetesLinuxNvidia GpusOpenshiftTerraform
Information Technology • Software • Big Data Analytics
The Site Reliability Engineer will design, analyze, and troubleshoot large-scale distributed systems, focusing on operating systems and performance tuning.
Top Skills:
ApacheJava
Fintech
The Site Reliability Engineer will manage and optimize Kubernetes clusters and cloud infrastructure, focusing on reliability, monitoring, and automation processes.
Top Skills:
AWSAzureC/C++CloudFormationDockerGCPHelmJavaJavaScriptKubernetesLinuxPostgresPythonRubyTerraform
Big Data • Cloud • Healthtech • Software • Big Data Analytics
The Senior Site Reliability Engineer will ensure the reliability and scalability of enterprise applications, lead incident management, develop automation tools, mentor team members, and collaborate with cross-functional teams.
Top Skills:
AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
Big Data • Cloud • Healthtech • Software • Big Data Analytics
The Senior Software Engineer - SRE will ensure the reliability and scalability of enterprise applications, handle incident management, and mentor team members, requiring expertise in Java and open-source technologies.
Top Skills:
AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Big Data • Cloud • Healthtech • Software • Big Data Analytics
As a Senior Software Engineer, you'll ensure the scalability and reliability of enterprise applications, leading incident management, automation, and strategic engineering efforts while mentoring team members.
Top Skills:
AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
Big Data • Cloud • Healthtech • Software • Big Data Analytics
As a Senior Site Reliability Engineer, ensure system reliability and scalability, lead incident management, develop automation tools, and mentor team members.
Top Skills:
AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
Big Data • Cloud • Healthtech • Software • Big Data Analytics
As a Senior Site Reliability Engineer at Veeva, you will enhance the reliability and scalability of applications, lead incident management, and mentor team members while working with modern technologies.
Top Skills:
AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
Big Data • Cloud • Healthtech • Software • Big Data Analytics
The role involves ensuring the scalability and reliability of enterprise applications through operational experience in Java environments, incident management, and full-stack diagnostics, in collaboration with cross-functional teams.
Top Skills:
AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
Big Data • Cloud • Healthtech • Software • Big Data Analytics
The Senior Site Reliability Engineer will ensure the scalability and reliability of enterprise applications, manage incidents, automate operations, mentor team members, and support cross-team collaborations across a technology stack, primarily focusing on backend development.
Top Skills:
AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
Payments • Software • Automation
Lead platform and infrastructure direction on AWS, evolve CI/CD and ephemeral environments, set observability and SLO standards, drive incident response and postmortems, mentor engineers, and build automation to reduce operational risk.
Top Skills:
AWSCi/CdDistributed SystemsEcsEphemeral Environments/Preview DeploysFargateGithub ActionsLogsObservability (MetricsSlos/Slis/Error BudgetsTracing)
Artificial Intelligence • Information Technology • Software
Lead end-to-end platform reliability: define SLIs/SLOs, harden production architecture, ensure Kubernetes runtime and queue safety, run incident command for Sev1/Sev2, own observability/on-call/runbooks, and gate risky releases while delivering a prioritized reliability roadmap.
Top Skills:
BullmqKoaKubernetesNode.jsPostgraphilePostgresReactRedisTypescript
Security • Software
Maintain, automate, and improve operational tools and customer deployment processes; monitor and ensure service SLOs, backup/restore, alerting, and incident response; drive GitOps/IaC practices, cost tracking, and automation of repetitive tasks while supporting outages and upgrades.
Top Skills:
Ansible,Terraform,Helm,Kubernetes,Aws,Gcp,Azure,Prometheus,Grafana,Bash,Python,Gitops
24 Days AgoSaved
Easy Apply
Easy Apply
Hardware • Quantum Computing
Maintain and integrate hardware and software systems for quantum controls, manage lab and test infrastructure (HIL, K8s, networking, rack servers), automate provisioning and CI/CD, implement monitoring/alerting and observability, support incident response and root-cause analysis, and define operational procedures to ensure reliability across development and production environments.
Top Skills:
Python,Bash,Go,Docker,Git,Kubernetes,Grafana,Prometheus,Elk Stack,Gitlab Ci,Jenkins,Ansible,Terraform,Ubuntu,Debian,Red Hat,Windows,Dns,Dhcp,Tcp/Ip,Vlan,Lan,Wan,Routers,Switches,Rack Mount Servers,Hardware-In-The-Loop (Hil)
Artificial Intelligence • Information Technology • Software
The Site Reliability Engineer will maintain systems, develop automation, monitor performance, and ensure uptime for Zoom infrastructure, participating in on-call support.
Top Skills:
AnsibleCi/CdGitGitlab CiJenkinsKubernetesPackerPythonShellTerraform
Fintech
The Principal Site Reliability Engineer at Fidelity will enhance system reliability, manage large-scale infrastructures, and automate processes using various technologies.
Top Skills:
AnsibleAWSCi/CdDatadogGrafanaJenkinsPythonTerraformYugabyte
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Design, build, and operate global, multi-cloud HPC service platforms. Own IaC-driven provisioning, reliability, observability, capacity planning, incident response, and automation to ensure high uptime and QoS for internal customers.
Top Skills:
AiopsAWSCi/CdContainer ManagementGCPGoInfrastructure As CodeKubernetesLog CollectionLsfMetricsMonitoringObservabilityOciPerlPythonRubySlurm
Other • Social Impact
The Senior Site Reliability Engineer is responsible for maintaining Wikimedia's infrastructure, improving reliability, automating tasks, and mentoring peers while participating in incident management.
Top Skills:
Apache Traffic ServerBashDebianEnvoyGoGrafanaHaproxyKubernetesNginxPrometheusPuppetPythonRubyVarnish
Top Companies Hiring Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs
.NET Developer Jobs
Aerospace Thermal Engineering Jobs
AI Engineer Jobs
Android Developer Jobs
Automation Engineer Jobs
Backend Developer Jobs
Blockchain Developer Jobs
C# Jobs
C++ Jobs
Cloud Architect Jobs
Cloud Engineer Jobs
Design Engineer Jobs
DevOps Engineer Jobs
Director Of Engineering Jobs
Electrical Engineering Jobs
Embedded Software Engineer Jobs
Engineering Jobs
Engineering Manager Jobs
Environmental Engineering Jobs
Field Engineer Jobs
Front End Developer Jobs
Full Stack Developer Jobs
Game Developer Jobs
Golang Jobs
Hardware Engineer Jobs
Industrial Engineering Jobs
iOS Developer Jobs
Java Developer Jobs
Javascript Developer Jobs
Linux Jobs
Manufacturing Engineer Jobs
Mechanical Engineering Jobs
Network Engineer Jobs
PHP Developer Jobs
Process Engineer Jobs
Project Engineer Jobs
Prompt Engineering Jobs
Python Jobs
QA Jobs
Robotics Engineer Jobs
Ruby on Rails Jobs
Salesforce Administrator Jobs
Salesforce Developer Jobs
Scala Jobs
Sharepoint Developer Jobs
Site Reliability Engineer Jobs
Software Engineering Manager Jobs
Solutions Architect Jobs
SQL Developer Jobs
Structural Engineer Jobs
System Engineer Jobs
Test Engineer Jobs
Web Developer Jobs
All Filters
Total selected ()
No Results
No Results


























