Top Senior Site Reliability Engineer Jobs

Reposted 16 Days Ago
In-Office
Los Angeles, CA, USA
181K-237K Annually
Senior level
Healthtech • Insurance
The Senior Software Engineer will lead technical projects in cloud infrastructure, mentoring teams, improving DevOps practices, and ensuring system resilience.
Top Skills: AWS, Ci/Cd, GCP, Github Actions, Grafana, Iam, Istio, Kubernetes, Prometheus, Terraform, Vpc Peering
Reposted 16 Days Ago
In-Office
San Francisco, CA, USA
181K-237K Annually
Senior level
Healthtech • Insurance
The Senior Software Engineer will lead complex projects, mentor engineers, and ensure cloud infrastructure is resilient and automated. Responsibilities include developing software, managing production environments, and enforcing coding standards.
Top Skills: Argocd, AWS, GCP, Github Actions, Grafana, Istio, Kubernetes, Prometheus, Terraform
Reposted 16 Days Ago
In-Office
Tempe, AZ, USA
181K-237K Annually
Senior level
Healthtech • Insurance
The Senior Software Engineer will lead cloud infrastructure projects, mentor junior engineers, ensure system reliability, and drive technical roadmaps.
Top Skills: AWS, Ci/Cd, GCP, Github Actions, Grafana, Istio, Kubernetes, Prometheus, Terraform
Reposted 16 Days Ago
In-Office
Boston, MA, USA
181K-237K Annually
Senior level
Healthtech • Insurance
The Senior Software Engineer will lead technical projects, mentor engineers, and build resilient cloud infrastructures focusing on SRE best practices.
Top Skills: AWS, Ci/Cd, GCP, Github Actions, Grafana, Kubernetes, Prometheus, Terraform
Reposted 16 Days Ago
In-Office
Overland Park, KS, USA
Senior level
Healthtech • Professional Services • Software
Lead design, architecture, and reliability of scalable systems; own incident response, monitoring, and CI/CD automation. Mentor engineers, drive tooling and AI adoption, and collaborate across teams to meet business needs and maintain high system availability.
Top Skills: Argo Cd, Azure Devops Pipelines, Ci/Cd, Elk Stack, GCP, Grafana, Istio, Kubernetes, New Relic, Opentelemetry, Terraform
Reposted 16 Days Ago
In-Office
Birmingham, AL, USA
Senior level
Automotive • Hardware • Logistics
The Site Reliability Engineer III enhances system reliability by building automation and supporting large-scale systems, ensuring critical platforms function optimally.
Top Skills: APIs, Azure Devops, Dynatrace, Google Cloud Platform, Grafana, HTTP, Java, Kubernetes, Microservices, Prometheus, Terraform
17 Days Ago
In-Office
San Francisco, CA, USA
230K-390K Annually
Senior level
Artificial Intelligence • Software
As a Software Engineer on the Site Reliability team, you'll ensure system reliability, scalability, and observability while partnering with engineering teams and improving incident management processes.
Top Skills: AWS, Ci/Cd Tooling, Container Orchestration, Datadog, Grafana, Prometheus, Terraform
17 Days Ago
Remote
Location, WV, USA
150K-200K Annually
Senior level
Insurance • Cybersecurity
This role involves leading AI enablement, developing tools for AI-assisted development, and ensuring reliable, secure production environments. Responsibilities include integrating AI tools into workflows and mentoring engineering teams.
Top Skills: Ai-Assisted Development Tools, AWS, Ci/Cd Tools, Cursor, Datadog, Ecs, Github Actions, Github Copilot, Go, Kubernetes, Python, Terraform
17 Days Ago
Easy Apply
In-Office
Boston, MA, USA
135K-165K Annually
Mid level
Artificial Intelligence
The Site Reliability Engineer II will enhance infrastructure and software reliability, write efficient code, collaborate across teams, and maintain platforms and monitoring tools.
Top Skills: AWS, Ci/Cd, Coralogix, Docker, JavaScript, Kubernetes, Python, Sentry, Terraform, Unix Shell
17 Days Ago
Easy Apply
In-Office
2 Locations
135K-165K Annually
Mid level
Artificial Intelligence
In this role, the Site Reliability Engineer will improve reliability and performance of infrastructure, write clean code, collaborate across teams, and maintain platforms for deployed software.
Top Skills: AWS, Ci/Cd, Docker, JavaScript, Kubernetes, Python, Terraform, Unix Shell
17 Days Ago
Remote or Hybrid
4 Locations
5-5 Annually
Senior level
Database
The Site Reliability Engineer will oversee the Digital Realty interconnection fabric network infrastructure, focusing on network operations, automation, and development. Responsibilities include maintaining global network infrastructure, responding to alerts, and working with various cloud platforms and automation tools.
Top Skills: Ansible, AWS, Azure, Git, GCP, Ibm Cloud, Jenkins, Linux, Oracle Cloud, Python, Terraform
17 Days Ago
In-Office
Tacoma, WA, USA
172K-262K Annually
Senior level
Cloud • Information Technology • Security • Software
The Senior Manager will lead the SRE and DevOps teams, manage software engineering, collaborate on cloud infrastructure, and drive innovation, ensuring resilience and quality.
Top Skills: Amazon Web Services, Apm, C, Ci/Cd, CloudFormation, Ec2, Elb, Git, Go, Grafana, Java, Jenkins, Kubernetes, Pagerduty, Python, S3, Spinnaker, Splunk, Vpc
Reposted 17 Days Ago
In-Office
Seattle, WA, USA
Senior level
Other
In this role, you will manage day-to-day operations of Internet-based enterprise systems, identify operational issues, develop tools for maintenance, and collaborate on infrastructure documentation and project execution.
Top Skills: .Net, Ansible, Apache, Azure, Chef, Iis, Jboss, Perl, Powershell, Puppet, Python, Ruby, Tomcat
Reposted 17 Days Ago
In-Office
Seattle, WA, USA
Senior level
Other
Responsible for monitoring, provisioning, and customer interactions, with a focus on maintaining high availability in complex web environments.
Top Skills: .Net, Ansible, Apache, Cfengine, Chef, Dynatrace, Go, Iis, Java, Jboss, Nas, New Relic, Perl, Powershell, Puppet, Python, Raid, Ruby, San, Splunk, Sumo Logic, Tomcat, Windows
Reposted 17 Days Ago
Hybrid
New York City, NY, USA
Senior level
Cloud • Healthtech • Internet of Things • Machine Learning • Software
Lead the design and implementation of scalable and fault-tolerant infrastructure on AWS and Kubernetes, mentor engineers, and drive operational excellence.
Top Skills: AWS, Go, Grafana, Java, Kubernetes, Opentelemetry, Prometheus, Python, Terraform
Reposted 17 Days Ago
Easy Apply
In-Office
Redmond, WA, USA
160K-220K Annually
Senior level
Aerospace • Other
Design, operate, and scale infrastructure for Starlink, developing automation and collaborating with software engineers to enhance product operability and performance.
Top Skills: Ansible, Bash, C, C++, Docker, Go, Kubernetes, Linux, Python, Terraform
Reposted 17 Days Ago
Easy Apply
In-Office
Redmond, WA, USA
160K-220K Annually
Senior level
Aerospace • Other
Design, operate, and scale infrastructure for Starlink's software and network, focusing on Kubernetes and high availability systems.
Top Skills: Ansible, Bash, C++, Go, Kubernetes, Python, Terraform
Reposted 17 Days Ago
In-Office
New York, NY, USA
181K-237K Annually
Senior level
Healthtech • Insurance
The Senior Software Engineer, Cloud Infrastructure is responsible for architecting resilient systems on AWS/GCP, leading projects, mentoring engineers, improving software quality, and ensuring compliance with laws and regulations.
Top Skills: AWS, Ci/Cd, GCP, Github Actions, Grafana, Istio, Kubernetes, Prometheus, Terraform
18 Days Ago
Easy Apply
Remote
United States
172K-215K Annually
Expert/Leader
Aerospace • Big Data • Greentech • Hardware • Social Impact
The Site Reliability Engineer will build, deploy, and operate computing services for satellite imaging, ensuring reliable and scalable infrastructure while collaborating with cross-functional teams.
Top Skills: Alloy, Ansible, Bash, Cloud-Native Infrastructure, Grafana, Helm, K3S, Kubernetes, Kustomize, Opentelemetry, Prometheus, Proxmox, Python, Rke2, Talos, Terraform
18 Days Ago
Remote
United States
160K-250K Annually
Senior level
Artificial Intelligence • Cloud • Machine Learning • Software • Database • App development • Generative AI
As a Site Reliability Engineer at Replit, you'll enhance system reliability through observability, automation, incident management, and performance optimization, serving millions globally.
Top Skills: Ansible, Datadog, Go, Google Cloud Platform, Grafana, Kubernetes, Prometheus, Pulumi, Python, Terraform
18 Days Ago
Remote
United States
220K-325K Annually
Senior level
Artificial Intelligence • Cloud • Machine Learning • Software • Database • App development • Generative AI
As a Staff Site Reliability Engineer at Replit, you will ensure infrastructure reliability, drive automation, lead incident management, and mentor the engineering team while enhancing system performance and observability.
Top Skills: Datadog, Go, Google Cloud Platform, Grafana, Kubernetes, Opentelemetry, Prometheus, Python, Terraform
18 Days Ago
In-Office
Secaucus, NJ, USA
150K-175K Annually
Expert/Leader
Healthtech • Database
Seeking a Principal Site Reliability Engineer to build an SRE practice, enhance reliability, mentor teams, and drive performance engineering to optimize Quest products and services.
Top Skills: Ansible, Aurora, AWS, Azure, Bigtable, Cassandra, Ci/Cd, Cloud Pub/Sub, Cloud Spanner, Cloud Sql, Docker, DynamoDB, Dynatrace, Gitlab, Go, GCP, Java, Jms, Kafka, Kinesis, Kubernetes, Mq, Perl, Python, Rds, Ruby, Shell Scripting, Terraform
18 Days Ago
In-Office
Owings Mills, MD, USA
159K-339K Annually
Senior level
Financial Services
As a Principal Site Reliability Engineer, you'll lead a team focusing on observability and automating solutions for cloud and on-prem infrastructures, enhancing reliability and incident response across T. Rowe Price's tech ecosystem.
Top Skills: .Net Core, Amazon Aws, Ansible, Elastic Stack, Go, Grafana, Java, MySQL, New Relic, Node.js, Postgres, Prometheus, Python, Solarwinds Dpa, Splunk, SQL Server, Terraform, Vagrant, Vault
Reposted 18 Days Ago
Easy Apply
In-Office
Boston, MA, USA
125K-150K Annually
Senior level
Artificial Intelligence • Cloud • Social Impact • Software • Wearables
As a Site Reliability Engineer II, you will develop automation workflows and services, manage cloud operations, participate in incident response, and influence architectural patterns for improved efficiency.
Top Skills: AWS, Aws Cloudformation, Azure, C#, Ci/Cd, Go, Java, Kubernetes, Python, Temporal, Terraform
18 Days Ago
Hybrid
Austin, TX, USA
132K-210K Annually
Senior level
Fintech • Information Technology • Payments
The Staff Site Reliability Engineer designs and builds cloud-native infrastructure on Azure for data services, ensuring reliability, security, and scalability.
Top Skills: Automation, Azure Kubernetes Service, Configuration Management, Container Orchestration, Infrastructure As Code, Azure