Get the job you really want.

Top Remote Site Reliability Engineer Jobs

Reposted 9 Days AgoSaved
Remote
United States
111K-226K Annually
Expert/Leader
111K-226K Annually
Expert/Leader
Cloud • Security • Software • Cybersecurity
The Staff Site Reliability Engineer will enhance AI/ML infrastructure, manage CI/CD pipelines, ensure system reliability, and troubleshoot applications, focusing on cloud-based operations.
Top Skills: AWSAzureBashDockerGitGitGCPGrafanaHuggingface TransformersKubernetesLlmPrometheusPythonPyTorchTensorrtTerraform
Reposted 9 Days AgoSaved
In-Office or Remote
8 Locations
Mid level
Mid level
Sports
Manage and improve the AWS infrastructure, deploy into new regions, monitor releases, and implement new technologies in a fast-paced environment.
Top Skills: AWSDockerGrafanaKubernetesPrometheusPython
Reposted 9 Days AgoSaved
Remote
US
175K-200K Annually
Senior level
175K-200K Annually
Senior level
Blockchain • Software
As a Senior Engineer, SRE/DevOps, you will enhance blockchain infrastructure reliability, automate deployment, and collaborate on CI/CD practices while ensuring security and performance optimization.
Top Skills: AnsibleAWSBashCloudtrailCloudwatchCosmosDockerElk-StackEthereumGCPK8SKubernetesOpsgeniePingdomPythonTerraform
11 Days AgoSaved
Easy Apply
Remote
US
Easy Apply
160K-180K Annually
Senior level
160K-180K Annually
Senior level
Software
The Senior SRE Manager will establish an SRE team, implement best practices, manage incidents, and enhance system reliability, scaling operations effectively.
Top Skills: Cloud InfrastructureDistributed SystemsObservability
Reposted 11 Days AgoSaved
In-Office or Remote
Berkeley, CA, USA
205K-235K Annually
Senior level
205K-235K Annually
Senior level
Financial Services
The Senior Cluster Site Reliability Engineer will enhance the research compute cluster's uptime, reliability, and performance through engineering and operational improvements, ensuring high availability for researchers working on machine learning problems.
Top Skills: AnsibleAWSAWSCephDockerElkGCPGCPGrafanaHorovodHpcInfinibandKubeflowKueueLokiLustreMlflowOpentelemetryPodmanPrometheusPythonRdmaRubyS3SingularitySlurmTerraform
12 Days AgoSaved
In-Office or Remote
Miami, FL, USA
Senior level
Senior level
Software
Seeking a Senior Site Reliability Engineer to enhance reliability and speed for Stream Aligned teams, ensuring ownership of services, building tooling, and improving deployment processes.
Top Skills: AWSJavaKubernetesLinuxPostgresTerraform
Reposted 17 Days AgoSaved
Remote or Hybrid
Oregon, IL, USA
130K-180K Annually
Senior level
130K-180K Annually
Senior level
Information Technology • Productivity • Software • Infrastructure as a Service (IaaS)
The role involves diagnosing infrastructure issues, participating in on-call rotations, improving application availability, and enhancing automation in cloud environments.
Top Skills: AnsibleAWSC++CloudFormationDatadogGoHelmJavaKotlinKubernetesNew RelicPostgresSplunkTerraform
Reposted 17 Days AgoSaved
Easy Apply
In-Office or Remote
Canada, KS, USA
Easy Apply
124K-266K Annually
Senior level
124K-266K Annually
Senior level
Cloud • Security • Software • Cybersecurity • Automation
As a Senior Site Reliability Engineer at GitLab, you will automate and manage the lifecycle of GitLab environments, ensuring reliability and scalability while leading incident responses and architectural decisions.
Top Skills: AnsibleAWSElkGCPGoGrafanaKubernetesPrometheusRubyTerraform
Reposted 17 Days AgoSaved
Easy Apply
Remote
USA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will build and scale identity management tools, automate operations, ensure security, and support AWS, GCP, and Azure environments.
Top Skills: AnsibleAWSAzureC#Cloud Identity ProvidersDockerGCPGoInfrastructure As CodeJavaKubernetesPythonRubyTerraform
Reposted 12 Days AgoSaved
Remote
2 Locations
Senior level
Senior level
Artificial Intelligence • Fintech • Software • Financial Services
Seeking a seasoned SRE to lead reliability for a cloud-native platform, overseeing infrastructure, CI/CD pipelines, observability, and mentoring engineers.
Top Skills: AWSClickhouseGoJavaKafkaKubernetesPulumiTerraform
13 Days AgoSaved
Easy Apply
Remote
USA
Easy Apply
Senior level
Senior level
Fitness
The Staff Site Reliability Engineer will establish SRE best practices, drive observability strategy, implement software solutions, and mentor engineers. Responsibilities include improving platform resilience, managing risks, and participating in incident response processes.
Top Skills: AnsibleAWSAzureBashCloudFormationGCPGoKubernetesPulumiPythonTerraform
13 Days AgoSaved
Remote
US
Mid level
Mid level
Software • Analytics
This SRE role involves deep ownership of production systems, focusing on improving AWS infrastructure, operational tooling, and automation for scaling ClickHouse installations at petabyte scale.
Top Skills: AnsibleAWSClickhouseEc2LinuxTerraform
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 14 Days AgoSaved
Remote or Hybrid
Colorado, USA
125K-150K Annually
Mid level
125K-150K Annually
Mid level
Artificial Intelligence • HR Tech • Legal Tech • Marketing Tech • Software • Conversational AI • Generative AI
As a Site Reliability Engineer, you'll ensure stability and reliability of SaaS solutions by enhancing operational efficiency, managing cloud infrastructure, and automating workflows while mentoring engineers.
Top Skills: AnsibleAWSAzureDatadogDynatraceLinuxNew RelicPuppetTerraformWindows
Reposted 14 Days AgoSaved
Easy Apply
In-Office or Remote
2 Locations
Easy Apply
165K-225K Annually
Senior level
165K-225K Annually
Senior level
Artificial Intelligence • Cloud • Information Technology • Software
Build and operate production-grade AI infrastructure using Kubernetes, ensuring high availability, reliability, and performance. Develop custom operators and implement automation for efficient operations and monitoring.
Top Skills: AnsibleBashElk StackEnterprise Storage SystemsGrafanaHigh-Performance NetworkingKubernetesLinuxNvidia Gpu TechnologiesPrometheusPythonTerraform
Reposted 14 Days AgoSaved
Easy Apply
Remote
USA
Easy Apply
89K-287K Annually
Mid level
89K-287K Annually
Mid level
3D Printing • Artificial Intelligence • Software • Design
The role involves building reliable platforms for 3D/4D content delivery to AR/VR devices, monitoring system health, and improving operational practices in collaboration with the engineering team.
Top Skills: Aws FargateCoreweaveGrafanaKubernetesPrometheusTerraform
Reposted 14 Days AgoSaved
Easy Apply
Remote
USA
Easy Apply
Senior level
Senior level
Information Technology • Cryptocurrency
The Site Reliability Engineer will lead technical initiatives, architect solutions, troubleshoot issues, mentor team members, and improve observability practices.
Top Skills: ArgocdBashElk StackGCPGoGrafanaHelmKubernetesPrometheusPythonTerraform
15 Days AgoSaved
In-Office or Remote
4 Locations
124K-206K Annually
Senior level
124K-206K Annually
Senior level
Cloud • Information Technology
The Site Reliability Engineer will support IaaS services, monitor infrastructure health, perform root cause analysis, automate processes, and collaborate with teams for service reliability.
Top Skills: AnsibleAWSAzureBashGitlab CiJenkinsKubernetesLinuxOpenshiftPythonTerraformVmware Vsphere
Reposted 15 Days AgoSaved
Remote
United States
140K-180K Annually
Senior level
140K-180K Annually
Senior level
Fintech
As a Site Reliability Engineer, you will enhance system reliability through scalable infrastructure, observability practices, automation, and collaboration with engineering teams.
Top Skills: AWSDatadogGoGrafanaJavaKubernetesNode.jsPrometheusPulumiPythonTerraform
Reposted 15 Days AgoSaved
Easy Apply
In-Office or Remote
3 Locations
Easy Apply
Senior level
Senior level
Healthtech
The SRE will design and implement platform solutions, maintain cloud environments, monitor and troubleshoot production issues, and automate tasks to improve efficiency.
Top Skills: AnsibleAWSDockerGCPGitIacLinuxMySQLPHPTerraform
Reposted 15 Days AgoSaved
Remote
Arizona, USA
Senior level
Senior level
Information Technology • Marketing Tech • Social Media
Lead PKI operations and SRE team to ensure operational readiness and compliance of trust-critical services, fostering a culture of excellence.
Top Skills: AnsibleBashCompliance Frameworks (Soc 2ElasticsearchGoGrafanaLinuxNistPci)Pki FrameworksPrometheusPythonSaltTls Protocols
16 Days AgoSaved
Easy Apply
In-Office or Remote
3 Locations
Easy Apply
124K-206K Annually
Senior level
124K-206K Annually
Senior level
Analytics
The Site Reliability Engineer will ensure the reliability and performance of IaaS services, perform incident resolution, and enhance system reliability through automation while supporting mobility across hybrid infrastructures and collaborating extensively with various teams.
Top Skills: AnsibleAWSAzureBashGitlab CiJenkinsKubernetesLinuxOpenshiftPythonTerraformVmware Vsphere
Reposted 21 Days AgoSaved
In-Office or Remote
2 Locations
180K-220K Annually
Senior level
180K-220K Annually
Senior level
Software • Defense
We are hiring a Senior Site Reliability Engineer to ensure deployment stability and service quality, working in on-premise DoD and AWS environments.
Top Skills: AnsibleAWSDockerDod ComplianceHelmKubernetesLinuxTerraformVMware
Reposted 25 Days AgoSaved
In-Office or Remote
2 Locations
150K-250K Annually
Mid level
150K-250K Annually
Mid level
Software
As a Site Reliability Engineer, you'll build and maintain infrastructure for ML models, automate processes, and collaborate cross-functionally.
Top Skills: Circle CiCloudFormationElk StackGithub ActionsGitlab CiGrafanaJenkinsKubernetesOpentelemetryPrometheusPulumiTerraform
Reposted 16 Days AgoSaved
Easy Apply
In-Office or Remote
Lehi, UT, USA
Easy Apply
Senior level
Senior level
Software
As a Site Reliability Engineer at Podium, you'll ensure product stability and scalability, collaborate with engineering teams, handle on-call production issues, and mentor junior engineers.
Top Skills: AnsibleAWSCi/CdDatadogDockerGitGitlabGoHelmHoneycombKubernetesPrometheusPythonRubyStrongdmTerraform
Reposted 17 Days AgoSaved
In-Office or Remote
New York, NY, USA
110K-130K Annually
Senior level
110K-130K Annually
Senior level
News + Entertainment
The Site Reliability Engineer will manage cloud infrastructure, ensure platform reliability, optimize performance, and implement security best practices for Courier Newsroom's digital properties.
Top Skills: AnsibleCi/CdDockerKubernetesPythonSaaSTerraformWordpress
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account