Top Site Reliability Engineer Jobs

11 Days Ago
Remote
USA
184K-240K Annually
Senior level
Information Technology • Security • Cybersecurity
The Staff/Principal Site Reliability Engineer leads infrastructure initiatives, architects solutions for cloud and SaaS, and collaborates cross-functionally to enhance reliability and innovation.
Top Skills: AWS, Bash, Bazel, Cuelang, Datadog, GitOps, Go, Grafana, Helm, Kubernetes, Linux, Prometheus, Python, Terraform
11 Days Ago
In-Office
Columbus, GA, USA
2-5 Annually
Mid level
eCommerce • Fintech • Payments
The Site Reliability Engineer ensures system reliability through monitoring, automation, incident response, and performance improvement, bridging development and operations.
Top Skills: Ansible, AWS, AWS EKS, Jenkins, Kubernetes, Red Hat OpenShift, Terraform
16 Days Ago
In-Office
Colorado Springs, CO, USA
180K-220K Annually
Senior level
Software • Defense
As a Senior Site Reliability Engineer, you will ensure service quality and issue resolution, leading incident responses and mentoring teams. You'll work on scaling applications and managing secure cloud environments through automation, focusing on metrics and reliability within DoD settings.
Top Skills: Ansible, AWS, Datadog, Docker, ELK Stack, Grafana, Helm, Kubernetes, Linux, Loki, Prometheus, Terraform, VMware
Reposted 11 Days Ago
In-Office
11 Locations
125K-150K Annually
Senior level
Legal Tech
Lead the design and automation of enterprise network infrastructures, managing cloud and on-premises networks with a focus on security and scalability.
Top Skills: Ansible, AWS, Azure, Bash, BGP, EVPN, FortiAnalyzer, Fortinet, Fortinet SD-WAN, FRRouting, Linux, NSX, NVIDIA Cumulus, OSPF, Palo Alto, Panorama, PowerShell, Python, SolarWinds, SONiC, Terraform, VCF, VMware
Reposted 11 Days Ago
In-Office
Palo Alto, CA, USA
100K-200K Annually
Mid level
Mobile • Other • Manufacturing
The Site Reliability Engineer is responsible for the stability and performance of application systems, focusing on cloud infrastructure, automation, and maintaining high availability of services.
Top Skills: AWS, Azure, CI/CD, Go, GCP, Kubernetes, Linux, Python, Shell Scripting
Reposted 11 Days Ago
In-Office
New York, NY, USA
175K-245K Annually
Mid level
Financial Services
As a Site Reliability Engineer, you'll ensure high availability of Commodities Technology applications, automate processes, and contribute to incident analysis and monitoring systems.
Top Skills: Ansible, AWS, C#, Datadog, Docker, Kubernetes, Linux, PowerShell, Python, Terraform, Windows
Reposted 11 Days Ago
Remote
US
175K-200K Annually
Senior level
Blockchain • Software
As a Senior Engineer, SRE/DevOps, you will enhance blockchain infrastructure reliability, automate deployment, and collaborate on CI/CD practices while ensuring security and performance optimization.
Top Skills: Ansible, AWS, Bash, CloudTrail, CloudWatch, Cosmos, Docker, ELK Stack, Ethereum, GCP, K8s, Kubernetes, Opsgenie, Pingdom, Python, Terraform
Reposted 11 Days Ago
In-Office
San Francisco, CA, USA
150K-200K Annually
Junior
Artificial Intelligence • Information Technology
As a Site Reliability Engineer, you will maintain user-facing services, implement best practices for reliability, and manage production incidents.
Top Skills: Ansible, Cloud Services, Kubernetes, Programming Languages, Terraform
Reposted 11 Days Ago
In-Office
Dallas, TX, USA
Senior level
Blockchain • Fintech • Financial Services • Cryptocurrency
The Vice President, Site Reliability Engineer will design, implement, and maintain large-scale Linux infrastructures, develop automation scripts, manage network devices, and ensure security compliance while leading a team and collaborating on complex issues.
Top Skills: Ansible, AWS, Azure, Bash, Docker, ELK Stack, GCP, Grafana, Kubernetes, Linux, Perl, Prometheus, Puppet, Python, SaltStack
Reposted 11 Days Ago
Remote
USA
Senior level
Blockchain • Fintech • Financial Services • Cryptocurrency
The VP of Site Reliability Engineering will design and maintain large-scale Linux infrastructure, develop automation scripts, manage network devices, ensure security compliance, and support critical infrastructure.
Top Skills: Ansible, AWS, Azure, Bash, BGP, Docker, GCP, Kubernetes, Linux, OSPF, Perl, Puppet, Python, SaltStack, VLANs
Reposted 11 Days Ago
In-Office or Remote
2 Locations
205K-235K Annually
Senior level
Financial Services
The Senior Cluster Site Reliability Engineer will enhance the research compute cluster's uptime, reliability, and performance through engineering and operational improvements, ensuring high availability for researchers working on machine learning problems.
Top Skills: Ansible, AWS, Ceph, Docker, ELK, GCP, Grafana, Horovod, HPC, InfiniBand, Kubeflow, Kueue, Loki, Lustre, MLflow, OpenTelemetry, Podman, Prometheus, Python, RDMA, Ruby, S3, Singularity, Slurm, Terraform
Reposted 11 Days Ago
In-Office
Chevy Chase, MD, USA
110K-260K Annually
Senior level
Insurance
As a Senior Staff Engineer at GEICO, you'll lead technical initiatives in SRE, ensuring reliability and performance of systems while mentoring teams and driving improvements.
Top Skills: Ansible, Azure, C, C++, Docker, Go, Grafana, Helm, Java, Kubernetes, Loki, NoSQL, OpenStack, OpenTelemetry, Prometheus, Puppet, Python, Spinnaker, SQL, Terraform
Reposted 11 Days Ago
In-Office
Chevy Chase, MD, USA
100K-215K Annually
Senior level
Insurance
The Senior Engineer SRE Incident Response (NOC) at GEICO is responsible for overseeing incident response operations, ensuring efficient resolution of technical issues, and maintaining system integrity. The role involves collaboration with various teams and continuous improvement of incident management processes.
Reposted 11 Days Ago
In-Office
Palo Alto, CA, USA
120K-140K Annually
Senior level
Hardware • Manufacturing
As an SRE, you'll maintain service reliability, operate monitoring tools, automate tasks in Python, and manage incident responses.
Top Skills: Ansible, AWS, Bash, GitLab, Grafana, Kubernetes, Loki, Prometheus, Python, Tempo, Terraform
Reposted 11 Days Ago
In-Office
Palo Alto, CA, USA
180K-440K Annually
Senior level
Information Technology
The Senior Site Reliability Engineer will design and optimize Kubernetes clusters, manage infrastructure with IaC tools, and enhance system reliability while collaborating with teams.
Top Skills: Ansible, Cluster API, CNI, CRI, CSI, Kubernetes, Pulumi, Terraform
12 Days Ago
Remote
United States
Junior
Software
As an Associate Site Reliability Engineer, you will automate processes, improve platform stability, support incident response, and contribute to compliance efforts in a SaaS environment.
Top Skills: Ansible, AWS, Azure, Bash, CloudFormation, Datadog, GCP, Git, Go, Grafana, Kubernetes, Prometheus, Python, Terraform
12 Days Ago
Remote
United States
Junior
Software
The Site Reliability Engineer will enhance the stability and efficiency of the SaaS platform through automation and will support FedRAMP compliance.
Top Skills: Ansible, AWS, Azure, Bash, CloudFormation, Datadog, GCP, Git, Go, Grafana, Kubernetes, Linux, Prometheus, Python, Terraform
12 Days Ago
Remote
US
101K-161K Annually
Senior level
Cloud • Security • Software • Analytics
As an SRE, you'll ensure scalability and reliability for Arista's CloudVision service, focusing on automation, performance, and safety in production environments.
Top Skills: Ansible, Bash, Go, Google Cloud Platform, Google Kubernetes Engine, Kubernetes, Pulumi, Python
12 Days Ago
In-Office
8 Locations
110K-230K Annually
Senior level
Insurance
The Staff SRE Engineer will design and maintain scalable distributed systems, automate infrastructure, manage CI/CD processes, and mentor engineers, focusing on enhancing billing platform efficiency.
Top Skills: Airflow, Ansible, AWS, Azure, Azure Automation, Azure DevOps, C#, Chef, Docker, GCP, Git, Go, Java, Kafka, Kotlin, Kubernetes, NoSQL, Python, Spark, SQL, Terraform
12 Days Ago
In-Office
Lehi, UT, USA
Senior level
Software
As a Site Reliability Engineer at Podium, you'll ensure product stability and scalability, collaborate with engineering teams, handle on-call production issues, and mentor junior engineers.
Top Skills: Ansible, AWS, CI/CD, Datadog, Docker, Git, GitLab, Go, Helm, Honeycomb, Kubernetes, Prometheus, Python, Ruby, StrongDM, Terraform
Reposted 12 Days Ago
In-Office
Palo Alto, CA, USA
180K-440K Annually
Expert/Leader
Information Technology
The role focuses on backend services for grok.com, requiring expertise in Kubernetes, CI systems, monitoring, and infrastructure as code technologies.
Top Skills: Argo CD, Buildkite, Grafana, Kubernetes, PagerDuty, Prometheus, Pulumi, Terraform
Reposted 12 Days Ago
In-Office
Reston, VA, USA
109K-147K Annually
Mid level
Information Technology • Software
The SRE will manage Verisign's data platform by architecting, deploying, and ensuring the stability and performance of large-scale data systems, while collaborating with multiple teams for customer support and infrastructure improvements.
Top Skills: Ansible, Docker, Druid, Hadoop, Jenkins, Kafka, Kubernetes, Python, Spark
Reposted 12 Days Ago
Remote
US
166K-293K Annually
Senior level
Artificial Intelligence • Software
As a Principal Site Reliability Engineer, you will design hybrid infrastructure, integrate edge devices and cloud resources, optimize performance and costs, and collaborate with cross-functional teams to ensure robust systems.
Top Skills: AWS, Go, Kubernetes, Linux, Python, Terraform, Terragrunt
Reposted 12 Days Ago
Hybrid
Colorado, USA
125K-150K Annually
Mid level
Artificial Intelligence • HR Tech • Legal Tech • Marketing Tech • Software • Conversational AI • Generative AI
The Site Reliability Engineer will enhance SaaS solutions' stability and scalability by automating workflows, monitoring systems, and responding to incidents.
Top Skills: Ansible, AWS, Azure, Datadog, Dynatrace, New Relic, Puppet, Terraform
Reposted 12 Days Ago
In-Office
2 Locations
130K-140K Annually
Expert/Leader
Fintech • Financial Services
As the Lead Site Reliability Engineer, you will oversee the modernization and operation of investment management platforms, ensuring high availability, security, and performance while collaborating with technical teams.
Top Skills: AWS, Citrix, Datadog, IIS, Linux, Microsoft SQL, Power BI, PowerShell, VMware, Windows