Top Site Reliability Engineer Jobs

Reposted 21 Days Ago
In-Office
San Francisco, CA, USA
150K-200K
Junior
Artificial Intelligence • Information Technology
As a Site Reliability Engineer, maintain user-facing services, implement best practices for reliability, and manage production incidents.
Top Skills: Ansible, Cloud Services, Kubernetes, Programming Languages, Terraform
Reposted 21 Days Ago
In-Office
Palo Alto, CA, USA
100K-200K
Mid level
Mobile • Other • Manufacturing
The Site Reliability Engineer is responsible for the stability and performance of application systems, focusing on cloud infrastructure, automation, and maintaining high availability of services.
Top Skills: AWS, Azure, CI/CD, Go, GCP, Kubernetes, Linux, Python, Shell Scripting
Reposted 21 Days Ago
In-Office
Chevy Chase, MD, USA
110K-260K Annually
Senior level
Insurance
As a Senior Staff Engineer at GEICO, you'll lead technical initiatives in SRE, ensuring reliability and performance of systems while mentoring teams and driving improvements.
Top Skills: Ansible, Azure, C, C++, Docker, Go, Grafana, Helm, Java, Kubernetes, Loki, NoSQL, OpenStack, OpenTelemetry, Prometheus, Puppet, Python, Spinnaker, SQL, Terraform
Reposted 21 Days Ago
In-Office
Chevy Chase, MD, USA
100K-215K Annually
Senior level
Insurance
The Senior Engineer SRE Incident Response (NOC) at GEICO is responsible for overseeing incident response operations, ensuring efficient resolution of technical issues, and maintaining system integrity. The role involves collaboration with various teams and continuous improvement of incident management processes.
Reposted 21 Days Ago
In-Office
Marlborough, MA, USA
112K-112K
Senior level
Retail
The Lead Site Reliability Engineer will improve digital platform applications, manage projects, ensure system reliability, and handle production incidents.
Top Skills: CI/CD, Istio, Kubernetes, Linux, Prometheus, Redis, Terraform
Reposted 21 Days Ago
In-Office
Palo Alto, CA, USA
180K-440K
Expert/Leader
Information Technology
The role focuses on backend services for grok.com, requiring expertise in Kubernetes, CI systems, monitoring, and infrastructure as code technologies.
Top Skills: Argo CD, Buildkite, Grafana, Kubernetes, PagerDuty, Prometheus, Pulumi, Terraform
Reposted 21 Days Ago
In-Office
New York, NY, USA
175K-245K
Mid level
Financial Services
As a Site Reliability Engineer, you'll ensure high availability of Commodities Technology applications, automate processes, and contribute to incident analysis and monitoring systems.
Top Skills: Ansible, AWS, C#, Datadog, Docker, Kubernetes, Linux, PowerShell, Python, Terraform, Windows
Reposted 21 Days Ago
In-Office
Palo Alto, CA, USA
120K-140K Annually
Senior level
Hardware • Manufacturing
As an SRE, you'll maintain service reliability, operate monitoring tools, automate tasks in Python, and manage incident responses.
Top Skills: Ansible, AWS, Bash, GitLab, Grafana, Kubernetes, Loki, Prometheus, Python, Tempo, Terraform
Reposted 22 Days Ago
In-Office
Newark, NJ, USA
130K-140K Annually
Expert/Leader
Fintech • Financial Services
As the Lead Site Reliability Engineer, oversee the modernization and operation of investment management platforms, ensuring high availability, security, and performance while collaborating with technical teams.
Top Skills: AWS, Citrix, Datadog, IIS, Linux, Microsoft SQL, Power BI, PowerShell, VMware, Windows
22 Days Ago
Hybrid
Colorado, USA
125K-150K Annually
Mid level
Artificial Intelligence • HR Tech • Legal Tech • Marketing Tech • Software • Conversational AI • Generative AI
The Site Reliability Engineer will enhance SaaS solutions' stability and scalability by automating workflows, monitoring systems, and responding to incidents.
Top Skills: Ansible, AWS, Azure, Datadog, Dynatrace, New Relic, Puppet, Terraform
22 Days Ago
In-Office
Reston, VA, USA
109K-147K
Mid level
Information Technology • Software
The SRE will manage Verisign's data platform by architecting, deploying, and ensuring the stability and performance of large-scale data systems, while collaborating with multiple teams for customer support and infrastructure improvements.
Top Skills: Ansible, Docker, Druid, Hadoop, Jenkins, Kafka, Kubernetes, Python, Spark
Reposted 22 Days Ago
Hybrid
Sunnyvale, CA, USA
204K-247K Annually
Senior level
Cloud • Greentech • Other • Energy
As a Staff Site Reliability Engineer focused on storage, you'll ensure the reliability and performance of cloud storage systems while optimizing distributed, fault-tolerant architectures for AI workloads.
Top Skills: Ansible, C, Ceph, Docker, GlusterFS, Go, iSCSI, Java, Kubernetes, NFS, NVMe-oF, OpenEBS, Puppet, Python, SMB, Terraform
Reposted 22 Days Ago
Hybrid
San Francisco, CA, USA
204K-247K Annually
Senior level
Cloud • Greentech • Other • Energy
The role involves ensuring reliability of AI-optimized cloud services, focusing on design, automation, and performance for AI workloads.
Top Skills: C++, Go, Java, Kubernetes, Python
Reposted 3 Days Ago
Remote
United States
150K-200K Annually
Mid level
Software
As a Senior Site Reliability Engineer at Regrello, you'll shape the developer platform, collaborate with customers, and ensure the reliability and security of infrastructure and applications.
Top Skills: AWS, Azure, CircleCI, GCP, GitHub Actions, GitLab CI, Go, Kubernetes, Terraform
22 Days Ago
Remote
US
113K-233K Annually
Mid level
Social Media
As a Site Reliability Engineer II at Pinterest, you will develop software for improving the reliability of distributed systems, automate processes, manage capacity, and enhance engineering collaboration through frameworks and tools.
Top Skills: Ansible, AWS, BSD, Chef, Docker, Elasticsearch, Envoy, Fabric, Go, Hadoop, HAProxy, HBase, Kafka, Linux, Memcache, MySQL, Nginx, Puppet, Python, Salt, Terraform, Unix, ZooKeeper
22 Days Ago
Remote
US
166K-293K
Senior level
Artificial Intelligence • Software
As a Principal Site Reliability Engineer, you will design hybrid infrastructure, integrate edge devices and cloud resources, optimize performance and costs, and collaborate with cross-functional teams to ensure robust systems.
Top Skills: AWS, Go, Kubernetes, Linux, Python, Terraform, Terragrunt
22 Days Ago
Remote
United States
154K-210K Annually
Senior level
Information Technology • Security • Cybersecurity
The Sr. Site Reliability Engineer will ensure the smooth operation of infrastructure, oversee incident management, and improve operational efficiency while leading a small team.
Top Skills: AWS, Bazel, Cloud, GitOps, Grafana, Helm, Kubernetes, Linux, Prometheus, SaaS, Terraform
Reposted 4 Days Ago
In-Office
Seattle, WA, USA
160K-250K
Mid level
Artificial Intelligence • Cloud • Software
The Senior Site Reliability Engineer will automate operations, optimize workflows for teams, manage secure infrastructure, and participate in on-call duties.
Top Skills: Arista, AWS, Bash, Ceph, Chef, CIFS, Cisco, DNS, Docker, ELK Stack, Fortinet, HP, HTTP, ICMP, iSCSI, Jenkins, Kubernetes, Linux/Debian Family/Ubuntu, Mesosphere, NFS, Node.js, Pivotal Greenplum, Postgres, Python, RabbitMQ, Ruby, S3, Scylla, SSH, SSL, Supermicro, TCP, TLS
Reposted 22 Days Ago
In-Office or Remote
2 Locations
Senior level
Artificial Intelligence • Software • Generative AI
This role involves designing and maintaining cloud infrastructure, automating provisioning, and enhancing system reliability through monitoring, collaboration, and mentorship.
Top Skills: AWS, Azure, Docker, ELK Stack, GCP, Go, Grafana, Java, Kubernetes, Prometheus, Python, Terraform
Reposted 22 Days Ago
In-Office
Santa Clara, CA, USA
147K-238K Annually
Senior level
Cybersecurity
As a Principal Site Reliability Engineer, you will build and maintain scalable cloud infrastructure, ensuring reliability and security while automating deployments and orchestrating monitoring solutions.
Top Skills: Bash, Docker, FireHydrant, GCP, GitLab CI/CD, GitOps, Go, Grafana, Java, Kubernetes, Loki, MySQL, Node.js, PagerDuty, Prometheus, Python, Terraform
Reposted 22 Days Ago
In-Office
Langley, VA, USA
Senior level
Information Technology • Security • Software
The Principal Site Reliability Engineer ensures system availability and performance, automates processes, and supports a multi-tenant microservices application suite with a focus on incident response and system reliability.
Top Skills: AWS, Bash, CloudFormation, Datadog, ELK Stack, Go, Grafana, Java, Kubernetes, OpenTelemetry, Postgres, Prometheus, Python, Terraform
Reposted 22 Days Ago
In-Office
Reston, VA, USA
136K-184K
Senior level
Information Technology • Software
Build and maintain Verisign's Kubernetes platform, enforce security practices, monitor performance, and provide tier 3 support. Requires extensive experience with Kubernetes and related technologies.
Top Skills: Git, JIRA, Kubernetes, Linux, Python, Terraform, Unix
Reposted 22 Days Ago
In-Office
Austin, TX, USA
125K-181K
Senior level
Fintech • Information Technology • Payments
The Staff Site Reliability Engineer maintains and optimizes Hadoop and Kafka clusters on cloud platforms, driving innovation by ensuring system availability and performance while collaborating with teams and developing monitoring tools.
Top Skills: Ansible, AWS, Azure, Big Data, GCP, Grafana, Hadoop, HDFS, Java, Kafka, Linux, MapReduce, Opera, Python, Spark, Splunk
Reposted 22 Days Ago
Hybrid
New York City, NY, USA
90K-120K
Mid level
Software
The Site Reliability Engineer will manage deployment pipelines, implement observability solutions, enhance infrastructure security, and troubleshoot issues to ensure system reliability and performance.
Top Skills: AWS, CloudFormation, Datadog, Docker, GitLab, Grafana, Kubernetes, MongoDB, MySQL, Postgres, Prometheus, Terraform
Reposted 22 Days Ago
In-Office
2 Locations
180K-440K
Mid level
Information Technology
As a Site Reliability Engineer, you'll design and operate scalable storage systems and optimize performance for AI research data management.
Top Skills: Go, Kubernetes, Pulumi, Rust