Get the job you really want.

Top Remote Site Reliability Engineer Jobs

Reposted 6 Days AgoSaved
Remote
USA
Mid level
Mid level
Software • Analytics
The role involves automating and managing AWS infrastructure, ensuring reliability and scalability of stateful systems, and optimizing deployment processes. You'll also handle incident responses and improve operational tooling.
Top Skills: AWSKubernetesTerraformTerragrunt
6 Days AgoSaved
Easy Apply
Remote
United States
Easy Apply
Senior level
Senior level
Cloud • Information Technology
Lead and mentor the SRE team, manage production infrastructure, oversee operational excellence, and drive strategic initiatives for Backblaze's growth and service delivery.
Top Skills: Change ManagementCloud OperationsData Center OperationsIncident ManagementInfrastructure EngineeringObservability ToolsSre
6 Days AgoSaved
Remote
United States
162K-175K Annually
Junior
162K-175K Annually
Junior
Cloud • Software
The role involves managing cloud-native environments on Azure, developing automation frameworks, troubleshooting incidents, enhancing system reliability, and collaborating with engineering teams for better service stability.
Top Skills: AzureAzure MonitoringBashElkGitGithub ActionsGrafanaJenkinsNosql DatabasesPrometheusPythonTcp/Ip Networking
6 Days AgoSaved
Remote
US
136K-177K Annually
Senior level
136K-177K Annually
Senior level
Big Data • Machine Learning • Software • Analytics
As a Lead Site Reliability Engineer, you will drive the reliability strategy, improve system health, lead incident management, and mentor engineers for a multi-region SaaS platform.
Top Skills: ArgocdC++Ci/CdCloud PlatformsDatadogGitopsGrafanaInfrastructure As CodeJavaJavaScriptKubernetesPython
Reposted 6 Days AgoSaved
Remote
2 Locations
Junior
Junior
Computer Vision • Information Technology • Machine Learning • Natural Language Processing • Real Estate • Software
The SRE will maintain infrastructure for SaaS products on AWS, support developers, manage platform components, and handle IT tasks.
Top Skills: AWSComputer VisionIacLarge Language ModelsNlpTerraform
7 Days AgoSaved
Remote
2 Locations
Entry level
Entry level
Computer Vision • Information Technology • Machine Learning • Natural Language Processing • Real Estate • Software
As an Associate Site Reliability Engineer, you will build and maintain infrastructure on AWS, handle IT tasks, and work on software projects to enhance platform reliability and security.
Top Skills: AWSComputer VisionDatadogLarge Language ModelsNlpTerraform
Reposted 7 Days AgoSaved
Remote
US
250K-300K Annually
Senior level
250K-300K Annually
Senior level
Artificial Intelligence • Information Technology • Machine Learning • Software • Cybersecurity • Generative AI • Data Privacy
Lead global SRE and infrastructure teams to ensure reliability, scalability, and cost-efficiency of production and developer platforms. Define cloud and Kubernetes architecture, IaC, CI/CD, SLOs/SLIs, incident management, and cloud cost optimization while partnering with Security, Product, Finance, and Engineering.
Top Skills: AIAutomationAWSCi/CdCloud-Native SystemsGCPInfrastructure As CodeKubernetesTerraform
Reposted 7 Days AgoSaved
Remote
USA
156K-288K Annually
Mid level
156K-288K Annually
Mid level
Computer Vision • Machine Learning • Software
As a Site Reliability Engineer, ensure the reliability, performance, and scalability of Ditto's cloud infrastructure by developing observability solutions, leading incident management, and collaborating with product engineering teams.
Top Skills: AWSAzureCDatadogGCPGoGrafanaHelmJavaKubernetesPrometheusRustTerraform
Reposted 7 Days AgoSaved
Easy Apply
Remote
United States
Easy Apply
230K-250K Annually
Expert/Leader
230K-250K Annually
Expert/Leader
Artificial Intelligence • Healthtech • Software
The Staff Site Reliability Engineer will lead the reliability of production systems by defining SRE practices, improving observability, and ensuring fault-tolerance in cloud environments.
Top Skills: AWSGoKubernetesPostgresPythonTerraformTypescript
Reposted 8 Days AgoSaved
In-Office or Remote
2 Locations
76K-136K Annually
Junior
76K-136K Annually
Junior
Cloud • Security • Software • Cybersecurity
As a Site Reliability Engineer, you will troubleshoot production issues, automate systems, define database requirements, and collaborate with Dev and QA teams for stability.
Top Skills: AnsibleCassandraChefNoSQLPythonRedis
Reposted 13 Days AgoSaved
Remote or Hybrid
3 Locations
147K-278K Annually
Senior level
147K-278K Annually
Senior level
Cloud • Software
Responsible for maintaining FedRAMP compliant services, designing infrastructure, monitoring systems, and ensuring security for federal regions, while driving automation and collaboration with development teams.
Top Skills: AWSFedrampGoKubernetesPuppetPythonTerraformUnix/Linux
14 Days AgoSaved
Easy Apply
Remote
31 Locations
Easy Apply
130K-140K Annually
Senior level
130K-140K Annually
Senior level
Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
The Senior Site Reliability Engineer will manage system incidents, improve monitoring and logging, optimize database infrastructure, and collaborate on scaling systems efficiently.
Top Skills: AWSClickhouseKubernetesMySQLPostgresRedis
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
Reposted 8 Days AgoSaved
Easy Apply
Remote
United States
Easy Apply
Senior level
Senior level
Edtech
The Lead Software Engineer will lead the SRE team, focusing on reliability, performance optimization, security, and mentoring developers, while improving overall platform resilience.
Top Skills: ActivejobAnsibleAWSAws CloudwatchEc2EcsElasticsearchGitGCPGoogle Cloud StackdriverJenkinsJIRAKubernetesMemcachedMongoDBNew RelicNode.jsPostgresRedisRuby On RailsSidekiqSpinnakerTerraformTerragrunt
Reposted 8 Days AgoSaved
Remote
United States
Senior level
Senior level
Digital Media • Social Media • Software • Sports
Lead the technical architecture and execution of migration to AWS, drive developer enablement, and automate infrastructure using code-first principles.
Top Skills: Aws EksDatadogGithub ActionsGoIstioK6KubernetesNode.jsTerraform
Reposted 8 Days AgoSaved
Remote
United States
175K-275K Annually
Mid level
175K-275K Annually
Mid level
Software
As a Site Reliability Engineer, you'll enhance system reliability, collaborate on production readiness, define SLIs/SLOs, and improve incident response.
Top Skills: AWSDatadogGrafanaKubernetesOpentelemetryPrometheusTypescript
9 Days AgoSaved
Remote
United States
Senior level
Senior level
Healthtech
Develop and implement processes to ensure high availability and reliability of services. Responsibilities include incident management, automation, capacity planning, and risk mitigation.
Top Skills: AWSAzureDatadogDockerGrafanaJavaScriptNew RelicPrometheusPythonRubySplunkTerraform
9 Days AgoSaved
Remote
United States
190K-215K Annually
Senior level
190K-215K Annually
Senior level
Internet of Things • Cybersecurity
The Site Reliability Engineer will manage AWS GovCloud infrastructure, ensuring compliance and high availability while driving automation, security, and incident response best practices.
Top Skills: AnsibleAws GovcloudBashDockerElk StackGitlab Ci/CdGrafanaJenkinsKubernetesPrometheusPythonTerraform
Reposted 15 Days AgoSaved
In-Office or Remote
La Crosse, WI, USA
92K-164K Annually
Mid level
92K-164K Annually
Mid level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
The Senior Observability Engineer maintains monitoring systems, designs log aggregation solutions, automates tasks with scripts, and ensures platform performance.
Top Skills: AnsibleBashDynatraceElasticsearchElkFilebeatFluentbitFluentdGrafanaLinuxLogstashOtelPowershellPrometheusPythonTerraform
Reposted 15 Days AgoSaved
Easy Apply
Remote
USA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves supporting network infrastructure, automating cloud infrastructure, managing CI/CD workflows, and ensuring operational excellence in IT support, including incident response and security practices.
Top Skills: AnsibleAWSBashDockerGitKubernetesPythonRubyTerraform
Reposted 10 Days AgoSaved
Remote
USA
388K-558K Annually
Senior level
388K-558K Annually
Senior level
News + Entertainment
As an Ads Reliability Engineer, you will ensure the reliability of Netflix's Ad Suite by designing scalable infrastructure, collaborating with teams, and implementing automation for monitoring and incident response.
Top Skills: AWSAzureGCPGoJavaKubernetesPythonTerraform
Reposted 10 Days AgoSaved
In-Office or Remote
New York, NY, USA
133K-190K Annually
Mid level
133K-190K Annually
Mid level
Music
As a Site Reliability Engineer, you'll build and maintain cloud infrastructure for Spotify's AI-native developer platform, ensuring reliability and performance, while collaborating with senior engineers.
Top Skills: AWSGCPPythonReactTerraformTypescript
Reposted 10 Days AgoSaved
Remote
United States
Mid level
Mid level
Information Technology • Legal Tech
The role involves maintaining and improving Azure infrastructure, managing Infrastructure as Code with Terraform, enhancing security measures, and operating CI/CD pipelines.
Top Skills: AzureAzure DevopsBashCircleCIDatadogEfkElkGithub ActionsPowershellPythonTerraform
Reposted 10 Days AgoSaved
Remote or Hybrid
4 Locations
148K-249K Annually
Senior level
148K-249K Annually
Senior level
Transportation
Design and develop Waabi's observability stack, optimize performance, build automation tooling, and support application requirements while leading projects and mentoring teams.
Top Skills: AWSC/C++DockerGoGrafanaJavaKubernetesOpentelemetryPythonRust
12 Days AgoSaved
Remote
United States
Senior level
Senior level
Software
As a Site Reliability Engineer, you will enhance system reliability, manage cloud services, respond to incidents, and support network systems.
Top Skills: AutomationCisco RoutingCloud ServicesF5 Load BalancingFortinet FirewallsInfrastructure AutomationMonitoringNetworking
Reposted 12 Days AgoSaved
Remote
Texas, USA
Mid level
Mid level
Blockchain
The Blockchain Site Reliability Engineer is responsible for maintaining blockchain nodes' reliability, monitoring, incident response, and building automation tools to enhance operations.
Top Skills: DockerElkGoGrafanaJavaScriptKubernetesLinuxPrometheusPythonRustShell
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account