Get the job you really want.

Top Site Reliability Engineer Jobs

Reposted 23 Days AgoSaved
In-Office
Boulder, CO, USA
135K-180K Annually
Mid level
135K-180K Annually
Mid level
Artificial Intelligence • Software
The Site Reliability Engineer will manage infrastructure, drive enterprise deployments, and ensure the reliability of the Freeplay platform by working closely with customers and optimizing cloud architectures.
Top Skills: AWSAzureDatadogElasticsearchGCPHelmKotsNats JetstreamPostgresReplicatedTerraform
Reposted 23 Days AgoSaved
Remote or Hybrid
2 Locations
250K-295K Annually
Senior level
250K-295K Annually
Senior level
Artificial Intelligence • Software
As Staff SRE Tech Lead, you'll oversee platform reliability and scalability, lead the SRE team, architect data infrastructures, and optimize systems while implementing automation and observability practices.
Top Skills: ClickhouseGoPostgresPythonTypescript
Reposted 23 Days AgoSaved
In-Office
Annapolis Junction, MD, USA
165K-230K Annually
Expert/Leader
165K-230K Annually
Expert/Leader
Information Technology • Software • Automation
The Site Reliability Engineer ensures operational stability in a cloud environment, providing customer support and troubleshooting while collaborating in a fast-paced team.
Top Skills: AccumuloAnsibleAWSBashDockerGrafanaHadoopHadoop Distributed File SystemJavaJIRAKubernetesLinuxOpenstackPrometheusPythonSaltVirtualization
Reposted 23 Days AgoSaved
In-Office
Annapolis Junction, MD, USA
165K-230K Annually
Expert/Leader
165K-230K Annually
Expert/Leader
Information Technology • Software • Automation
The Senior Site Reliability Engineer will manage AWS environments, develop Infrastructure as Code, and automate operational tasks to ensure high availability in cloud systems.
Top Skills: Amazon Web Services (Aws)AnsibleAws Certified Developer-AssociateAws Certified Solutions Architect-AssociateAws Certified Solutions Architect-ProfessionalAws Certified Sysops Administrator-AssociateCertified Kubernetes Administrator (Ckad)Ci/CdDockerElastic Certified EngineerElastic Certified Observability EngineerKubernetesTerraform
Reposted 23 Days AgoSaved
In-Office
Dublin, CA, USA
Senior level
Senior level
Artificial Intelligence • Software
The Senior Site Reliability Engineer will ensure the reliability and scalability of our Generative AI SaaS platform, implement automation, and support incident response efforts.
Top Skills: AWSAzureBashCloudFormationDockerElk StackGCPGoGrafanaKubernetesPrometheusPythonTerraform
Reposted 23 Days AgoSaved
Easy Apply
Remote
United States
Easy Apply
200K-300K Annually
Expert/Leader
200K-300K Annually
Expert/Leader
Payments
As a Principal Site Reliability Engineer, you'll architect scalable infrastructure, drive reliability, mentor engineers, and lead AI enablement efforts, ensuring high-performance across systems.
Top Skills: AWSCi/CdDatadogElasticsearchGoGrafanaKubernetesNew RelicPrometheusPythonRds (Mysql/Postgres)Sql-Based RdbmsTypescript
Reposted 23 Days AgoSaved
Remote
United States
115K-135K Annually
Mid level
115K-135K Annually
Mid level
Aerospace • Manufacturing
As a Site Reliability Engineer, you'll build and manage observability platforms for satellite communications, define SLOs/SLIs, and collaborate on incident response and deployment automation.
Top Skills: ArgocdAWSElkGCPGoGrafanaIstioJaegerKubernetesLinkerdLokiOpentelemetryPrometheusPythonTempoTerraform
Reposted 23 Days AgoSaved
In-Office
Washington, DC, USA
Mid level
Mid level
Aerospace • Defense • Manufacturing
As Lead Site Reliability Engineer, you'll ensure reliability and performance of AI infrastructure, manage deployments, and mentor junior engineers.
Top Skills: AnsibleBmcCi/CdCudaIdracImpiKubernetesLinuxNvidia GpusOpenshiftTerraform
Reposted 23 Days AgoSaved
In-Office
San Francisco, CA, USA
Mid level
Mid level
Information Technology • Software • Big Data Analytics
The Site Reliability Engineer will design, analyze, and troubleshoot large-scale distributed systems, focusing on operating systems and performance tuning.
Top Skills: ApacheJava
Reposted 23 Days AgoSaved
In-Office
Honolulu, HI, USA
100K-170K Annually
Junior
100K-170K Annually
Junior
Fintech
The Site Reliability Engineer will manage and optimize Kubernetes clusters and cloud infrastructure, focusing on reliability, monitoring, and automation processes.
Top Skills: AWSAzureC/C++CloudFormationDockerGCPHelmJavaJavaScriptKubernetesLinuxPostgresPythonRubyTerraform
Reposted 23 Days AgoSaved
In-Office or Remote
Los Angeles, CA, USA
110K-270K Annually
Senior level
110K-270K Annually
Senior level
Big Data • Cloud • Healthtech • Software • Big Data Analytics
The Senior Site Reliability Engineer will ensure the reliability and scalability of enterprise applications, lead incident management, develop automation tools, mentor team members, and collaborate with cross-functional teams.
Top Skills: AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
Reposted 23 Days AgoSaved
In-Office or Remote
Portland, OR, USA
110K-270K Annually
Senior level
110K-270K Annually
Senior level
Big Data • Cloud • Healthtech • Software • Big Data Analytics
The Senior Software Engineer - SRE will ensure the reliability and scalability of enterprise applications, handle incident management, and mentor team members, requiring expertise in Java and open-source technologies.
Top Skills: AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 23 Days AgoSaved
In-Office or Remote
Honolulu, HI, USA
110K-270K Annually
Senior level
110K-270K Annually
Senior level
Big Data • Cloud • Healthtech • Software • Big Data Analytics
As a Senior Software Engineer, you'll ensure the scalability and reliability of enterprise applications, leading incident management, automation, and strategic engineering efforts while mentoring team members.
Top Skills: AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
Reposted 23 Days AgoSaved
In-Office or Remote
Boston, MA, USA
110K-270K Annually
Senior level
110K-270K Annually
Senior level
Big Data • Cloud • Healthtech • Software • Big Data Analytics
As a Senior Site Reliability Engineer, ensure system reliability and scalability, lead incident management, develop automation tools, and mentor team members.
Top Skills: AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
Reposted 23 Days AgoSaved
In-Office or Remote
Boston, MA, USA
110K-270K Annually
Senior level
110K-270K Annually
Senior level
Big Data • Cloud • Healthtech • Software • Big Data Analytics
As a Senior Site Reliability Engineer at Veeva, you will enhance the reliability and scalability of applications, lead incident management, and mentor team members while working with modern technologies.
Top Skills: AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
Reposted 23 Days AgoSaved
In-Office or Remote
Bend, OR, USA
110K-270K Annually
Senior level
110K-270K Annually
Senior level
Big Data • Cloud • Healthtech • Software • Big Data Analytics
The role involves ensuring the scalability and reliability of enterprise applications through operational experience in Java environments, incident management, and full-stack diagnostics, in collaboration with cross-functional teams.
Top Skills: AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
Reposted 23 Days AgoSaved
In-Office or Remote
San Luis Obispo, CA, USA
110K-270K Annually
Senior level
110K-270K Annually
Senior level
Big Data • Cloud • Healthtech • Software • Big Data Analytics
The Senior Site Reliability Engineer will ensure the scalability and reliability of enterprise applications, manage incidents, automate operations, mentor team members, and support cross-team collaborations across a technology stack, primarily focusing on backend development.
Top Skills: AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
24 Days AgoSaved
In-Office
New York, NY, USA
200K-250K Annually
Expert/Leader
200K-250K Annually
Expert/Leader
Payments • Software • Automation
Lead platform and infrastructure direction on AWS, evolve CI/CD and ephemeral environments, set observability and SLO standards, drive incident response and postmortems, mentor engineers, and build automation to reduce operational risk.
Top Skills: AWSCi/CdDistributed SystemsEcsEphemeral Environments/Preview DeploysFargateGithub ActionsLogsObservability (MetricsSlos/Slis/Error BudgetsTracing)
24 Days AgoSaved
Hybrid
San Francisco, CA, USA
Senior level
Senior level
Artificial Intelligence • Information Technology • Software
Lead end-to-end platform reliability: define SLIs/SLOs, harden production architecture, ensure Kubernetes runtime and queue safety, run incident command for Sev1/Sev2, own observability/on-call/runbooks, and gate risky releases while delivering a prioritized reliability roadmap.
Top Skills: BullmqKoaKubernetesNode.jsPostgraphilePostgresReactRedisTypescript
24 Days AgoSaved
Easy Apply
Remote
US
Easy Apply
Senior level
Senior level
Security • Software
Maintain, automate, and improve operational tools and customer deployment processes; monitor and ensure service SLOs, backup/restore, alerting, and incident response; drive GitOps/IaC practices, cost tracking, and automation of repetitive tasks while supporting outages and upgrades.
Top Skills: Ansible,Terraform,Helm,Kubernetes,Aws,Gcp,Azure,Prometheus,Grafana,Bash,Python,Gitops
24 Days AgoSaved
Easy Apply
In-Office
Boston, MA, USA
Easy Apply
Senior level
Senior level
Hardware • Quantum Computing
Maintain and integrate hardware and software systems for quantum controls, manage lab and test infrastructure (HIL, K8s, networking, rack servers), automate provisioning and CI/CD, implement monitoring/alerting and observability, support incident response and root-cause analysis, and define operational procedures to ensure reliability across development and production environments.
Top Skills: Python,Bash,Go,Docker,Git,Kubernetes,Grafana,Prometheus,Elk Stack,Gitlab Ci,Jenkins,Ansible,Terraform,Ubuntu,Debian,Red Hat,Windows,Dns,Dhcp,Tcp/Ip,Vlan,Lan,Wan,Routers,Switches,Rack Mount Servers,Hardware-In-The-Loop (Hil)
Reposted 24 Days AgoSaved
In-Office
San Jose, CA, USA
87K-186K Annually
Junior
87K-186K Annually
Junior
Artificial Intelligence • Information Technology • Software
The Site Reliability Engineer will maintain systems, develop automation, monitor performance, and ensure uptime for Zoom infrastructure, participating in on-call support.
Top Skills: AnsibleCi/CdGitGitlab CiJenkinsKubernetesPackerPythonShellTerraform
Reposted 24 Days AgoSaved
In-Office
Merrimack, NH, USA
Senior level
Senior level
Fintech
The Principal Site Reliability Engineer at Fidelity will enhance system reliability, manage large-scale infrastructures, and automate processes using various technologies.
Top Skills: AnsibleAWSCi/CdDatadogGrafanaJenkinsPythonTerraformYugabyte
Reposted YesterdaySaved
In-Office
3 Locations
152K-288K Annually
Senior level
152K-288K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Design, build, and operate global, multi-cloud HPC service platforms. Own IaC-driven provisioning, reliability, observability, capacity planning, incident response, and automation to ensure high uptime and QoS for internal customers.
Top Skills: AiopsAWSCi/CdContainer ManagementGCPGoInfrastructure As CodeKubernetesLog CollectionLsfMetricsMonitoringObservabilityOciPerlPythonRubySlurm
YesterdaySaved
Easy Apply
Remote
USA
Easy Apply
113K-176K Annually
Senior level
113K-176K Annually
Senior level
Other • Social Impact
The Senior Site Reliability Engineer is responsible for maintaining Wikimedia's infrastructure, improving reliability, automating tasks, and mentoring peers while participating in incident management.
Top Skills: Apache Traffic ServerBashDebianEnvoyGoGrafanaHaproxyKubernetesNginxPrometheusPuppetPythonRubyVarnish
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account