Get the job you really want.

Top Site Reliability Engineer Jobs

15 Days AgoSaved
Remote
USA
Expert/Leader
Expert/Leader
Blockchain
As a Site Reliability Engineer, you'll ensure scalability, performance, and reliability of blockchain applications, tackling operational challenges through automated solutions and proactive system designs.
Top Skills: GoGrafanaHelmKubernetesPulumiPythonRustShell ScriptingTerraform
15 Days AgoSaved
Remote
United States
Mid level
Mid level
Fitness
The Site Reliability Engineer will ensure system reliability and performance, design scalable architectures, improve CI/CD pipelines, maintain infrastructures, and lead incident response efforts.
Top Skills: ArgocdAWSDatadogDockerGithub ActionsGoJavaScriptKubernetesPrometheusPythonTerraform
Reposted 15 Days AgoSaved
In-Office
New York, NY, USA
Senior level
Senior level
Artificial Intelligence
As an Applied AI Engineer, you will onboard customers, deploy AI solutions, work on complex projects, and provide technical guidance. You'll contribute to open-source projects and communicate effectively with stakeholders.
Top Skills: AnsibleAWSAzureDockerGCPKubernetesPythonTerraform
Reposted 15 Days AgoSaved
Remote
2 Locations
Junior
Junior
Computer Vision • Information Technology • Machine Learning • Natural Language Processing • Real Estate • Software
The SRE will maintain infrastructure for SaaS products on AWS, support developers, manage platform components, and handle IT tasks.
Top Skills: AWSComputer VisionIacLarge Language ModelsNlpTerraform
Reposted 15 Days AgoSaved
Easy Apply
Remote or Hybrid
3 Locations
Easy Apply
142K-185K
Senior level
142K-185K
Senior level
Artificial Intelligence • Hardware • Software • Quantum Computing
The Staff Site Reliability Engineer will create, support, and manage infrastructure, ensuring high uptime and performance for IonQ's quantum computing platform, while mentoring junior engineers.
Top Skills: GCPKubernetesLinuxPythonShellTerraformVMware
Reposted 15 Days AgoSaved
In-Office
Pleasant Grove, UT, USA
15-15
Expert/Leader
15-15
Expert/Leader
Hardware • Internet of Things
The Staff Site Reliability Engineer will design and implement infrastructure solutions, optimize system performance, lead incident management, and provide technical mentorship within Pura.
Top Skills: AWSGCPGoKubernetesNode.jsPythonTerraform
Reposted 15 Days AgoSaved
Remote
United States
96K-132K Annually
Junior
96K-132K Annually
Junior
Information Technology • Security • Cybersecurity
The Site Reliability Engineer will manage large-scale SaaS operations, drive automation, ensure uptime, and collaborate with engineering for improved reliability and customer satisfaction.
Top Skills: ArgocdAWSAzureGceGhaGoJavaJenkinsKubernetesMesosNomadPythonRuby
16 Days AgoSaved
In-Office
Headquarters, AZ, USA
37-37
Internship
37-37
Internship
Software • Cybersecurity
The Engineering Intern will support backend, platform, or SRE tasks, learning to design reliable cloud infrastructure and automate processes using scripting languages. Responsibilities include monitoring and improving system reliability, assisting in incident management, and collaborating with engineering teams.
Top Skills: AWSAzureDockerGCPGoKubernetesPython
16 Days AgoSaved
Hybrid
San Francisco, CA, USA
97K-125K Annually
Entry level
97K-125K Annually
Entry level
Fintech
As a Site Reliability Engineer I, you'll enhance the reliability and maintainability of systems, develop applications, manage cloud infrastructure, and contribute to observability practices. You'll also participate in on-call rotations.
Top Skills: BashCloud InfrastructureGenaiInfrastructure As CodeJavaLinuxPythonUnixWindows
16 Days AgoSaved
In-Office
Los Gatos, CA, USA
170K-720K
Mid level
170K-720K
Mid level
News + Entertainment
The Reliability Engineer will maintain a scalable CDN platform by improving resiliency, analyzing data, and providing design assistance to ISP partners while handling production issues.
Top Skills: BgpDnsDockerHttp/SPrestoPythonSpark SqlTcp/IpTlsTrinoUnix/Linux
16 Days AgoSaved
In-Office
Santa Clara, CA, USA
152K-245K Annually
Senior level
152K-245K Annually
Senior level
Cybersecurity
Design, build, and maintain production services for scaling infrastructure, focusing on automation and collaboration with teams for incident resolution and customer engagement.
Top Skills: BgpGoIpv6Nat64OspfPythonTerraform
16 Days AgoSaved
In-Office
Tyson's Corner, VA, USA
136K-170K Annually
Junior
136K-170K Annually
Junior
Fintech
The Site Reliability Engineer will manage and optimize Kubernetes clusters, ensuring reliability and scalability while collaborating with cross-functional teams and implementing security best practices.
Top Skills: Amazon S3AnsibleApache MesosAWSAzureC/C++CephCloudFormationGCPHdfsJavaJavaScriptKubernetesNfsPythonRubyTerraform
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
16 Days AgoSaved
Remote
United States
Senior level
Senior level
Software
EngFlow seeks an experienced Site Reliability Engineer to design, build, and maintain cloud infrastructure for a distributed build acceleration platform, ensuring performance, scalability, and high availability while automating processes and resolving incidents efficiently.
Top Skills: AWSGCPKubernetesTerraform
16 Days AgoSaved
Remote or Hybrid
United States
150K-225K Annually
Senior level
150K-225K Annually
Senior level
Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Business Intelligence
The Staff Site Reliability Engineer will enhance reliability, scalability, and performance by architecting platforms, leading incident responses, mentoring engineers, and implementing SRE practices.
Top Skills: AWSAzureDatadogElkGCPGoGrafanaKubernetesOtelPrometheusPython
Reposted 16 Days AgoSaved
In-Office
3 Locations
96K-159K Annually
Junior
96K-159K Annually
Junior
Insurance
Join Travelers as a Site Reliability Engineer I to perform software engineering tasks across the technology landscape, focusing on automation and observability.
Top Skills: AWSDatadogDockerDynatraceElk StackKubernetesOpentelemetryPrometheusPythonTerraform
Reposted 16 Days AgoSaved
In-Office
New York, NY, USA
225K-253K Annually
Mid level
225K-253K Annually
Mid level
Artificial Intelligence
As a Staff Site Reliability Engineer, you will ensure the reliability and performance of API infrastructure, collaborate with the API and platform teams, and support application deployment.
Top Skills: ArgocdC#GrafanaHaproxyHelmKubernetesPrometheusTerraform
Reposted 16 Days AgoSaved
Hybrid
2 Locations
135K-180K Annually
Senior level
135K-180K Annually
Senior level
Software
Lead the deployment and management of cloud infrastructure, automating processes and ensuring compliance while collaborating with teams to enhance service quality.
Top Skills: AWSAzureCloudwatchDatadogElkGCPGrafanaHelmJavaKubernetesNode.jsOpensearchOpentelemetryPrometheusPythonTerraform
Reposted 16 Days AgoSaved
In-Office
New York City, NY, USA
Senior level
Senior level
Blockchain • Cryptocurrency
The Site Reliability Engineer ensures reliability, scalability, and performance of systems by collaborating to design, implement, and maintain infrastructure solutions in a multi-cloud environment, focusing on automation, incident management, and security.
Top Skills: ArgocdAWSAzureBashGCPGithub ActionsGitlabciGoGrafanaHelmPrometheusPythonTerraformTypescript
Reposted 16 Days AgoSaved
In-Office or Remote
West, TX, USA
Senior level
Senior level
Cloud
The Software Engineer will enhance and optimize MinIO's cloud-native storage platform, focusing on DevOps practices, automation, and performance validation while collaborating with customers and engineers to ensure high-quality deployments.
Top Skills: CC++ContainersGoKubernetesMicroservicesRust
Reposted 16 Days AgoSaved
In-Office
Memphis, TN, USA
Senior level
Senior level
Information Technology
The Site Reliability Engineer at xAI is responsible for maintaining and improving data center reliability, managing monitoring systems, and ensuring high availability for AI workloads.
Top Skills: ArgocdBuildkiteC++GoGrafanaKubernetesPrometheusPulumiRustTerraform
Reposted 16 Days AgoSaved
In-Office or Remote
2 Locations
248K-391K
Expert/Leader
248K-391K
Expert/Leader
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Principal Staff SRE will lead initiatives in building and optimizing core infrastructure services on-prem and cloud, deploying and managing services at scale, and improving performance with automation and monitoring tools.
Top Skills: DhcpDnsEbpfGoLdapLinuxNtpPythonTerraformXdp
Reposted 16 Days AgoSaved
Remote
US
180K-200K
Senior level
180K-200K
Senior level
Artificial Intelligence • Cloud • Healthtech • Information Technology • Software • Business Intelligence
The Site Reliability Engineer I will automate and streamline software delivery, manage cloud infrastructure on AWS, and optimize CI/CD pipelines while collaborating with engineering teams to ensure reliability and performance.
Top Skills: AnsibleAWSAws CodepipelineBashChefConfluenceDockerElk StackGithub ActionsGoGrafanaJaegerJavaJenkinsJIRAKafkaNewrelicNode.jsOpentelemetryPerlPrometheusPuppetPythonTerraformTerragruntZipkin
17 Days AgoSaved
In-Office
San Francisco, CA, USA
Senior level
Senior level
Artificial Intelligence
The Staff/Lead/Senior/Principal Site Reliability Engineer will establish SRE practices, ensure platform reliability, and support infrastructure scaling for enterprise AI workloads.
Top Skills: AWSBetterstackCloudwatchGithub ActionsGrafanaKubernetesMongodbPagerdutyPostgresPrometheusTerraform
Reposted 17 Days AgoSaved
In-Office
New York, NY, USA
163K-263K Annually
Expert/Leader
163K-263K Annually
Expert/Leader
AdTech • Marketing Tech
As SVP, you will lead a global team in overseeing the SRE, DevOps, and infrastructure for Dentsu.Connect, ensuring operational excellence and strategic planning.
Top Skills: Azure
Reposted 22 Days AgoSaved
In-Office
Costa Mesa, CA, USA
166K-220K Annually
Senior level
166K-220K Annually
Senior level
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
You will build, deploy, and maintain critical infrastructure and improve CI/CD pipelines while promoting observability and reliability across teams.
Top Skills: AnsibleAWSAzureBashCloudFormationDockerGoGoogle Cloud PlatformHelmKubernetesPuppetPythonRustTerraform
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account