Get the job you really want.

Top Site Reliability Engineer Jobs

Reposted 21 Hours AgoSaved
Hybrid
San Francisco, CA, USA
204K-247K Annually
Senior level
204K-247K Annually
Senior level
Cloud • Greentech • Other • Energy
The role involves ensuring reliability of AI-optimized cloud services, focusing on design, automation, and performance for AI workloads.
Top Skills: C++GoJavaKubernetesPython
Reposted 21 Hours AgoSaved
In-Office
Palo Alto, CA, USA
106K-199K Annually
Senior level
106K-199K Annually
Senior level
Gaming • Software • Metaverse
The Senior Distributed Storage SRE Engineer manages distributed storage systems, ensuring stability, designing disaster recovery solutions, and optimizing performance. Responsibilities include incident response, tool development, and resource management.
Top Skills: GoLinuxPythonShellTcp/IpUnix
Reposted 21 Hours AgoSaved
In-Office
4 Locations
172K-258K Annually
Expert/Leader
172K-258K Annually
Expert/Leader
Fintech
The Principal Site Reliability Engineer enhances application performance and reliability, manages incidents, designs systems, and mentors teams.
Top Skills: AuroraAWSChefDockerDynamo DbGitGoJavaJenkinsJmsKafkaKubernetesMavenMemcachedOraclePythonRedisSqsSwarm
Reposted 21 Hours AgoSaved
Remote
USA
Mid level
Mid level
Information Technology • Other • Software • Consulting
The Site Reliability Engineer (SRE) will ensure system reliability and performance, automate operations, develop CI/CD pipelines, and manage cloud infrastructure.
Top Skills: AnsibleAWSAzureDatadogDockerEcsJavaKubernetesPythonTerraformTerragrunt
Reposted 21 Hours AgoSaved
In-Office
2 Locations
84K-133K Annually
Junior
84K-133K Annually
Junior
Digital Media • Gaming • Internet of Things • News + Entertainment • Retail • Business Intelligence • Cybersecurity
The Site Reliability Engineer 2 will ensure system reliability, manage infrastructure, optimize performance, automate operations, and resolve technical issues while collaborating with various teams.
Top Skills: AnsibleAWSAws S3AzureCassandraDockerElk StackGCPGoGrafanaHadoopHdfsJavaKafkaKubernetesMySQLNosql DatabasesOciPostgresPrometheusPythonScalaSparkTerraform
Reposted YesterdaySaved
In-Office
St. Louis, MO, USA
Senior level
Senior level
Fintech • Analytics
As a Senior Site Reliability Engineer at LSEG, you'll support critical applications, automate operations, and ensure cloud service health while collaborating across teams.
Top Skills: AWSAzureDatadogDockerGitKubernetesPython
YesterdaySaved
Easy Apply
Hybrid
Austin, TX, USA
Easy Apply
Junior
Junior
Other • Software
As a Junior Site Reliability Engineer, you'll develop infrastructure services, automate workflows, enhance observability, and participate in incident response to improve system reliability.
Top Skills: AWSDatadogDockerGithub ActionsGoJavaScriptJIRAPythonTerraform
YesterdaySaved
In-Office
Santa Clara, CA, USA
150K-250K Annually
Senior level
150K-250K Annually
Senior level
Artificial Intelligence • Machine Learning
As a Senior Site Reliability Engineer, you'll manage and optimize HPC cluster operations, develop automation, troubleshoot issues, and support ML/research teams while ensuring smooth infrastructure operations.
Top Skills: AnsibleAWSAzureBashCephGCPGitopsHpcKubernetesL2/L3 NetworkingNvidia A100Nvidia H100PythonPyTorchTensorFlowTerraform
Reposted 6 Days AgoSaved
In-Office
Chicago, IL, USA
125K-188K Annually
Senior level
125K-188K Annually
Senior level
AdTech • eCommerce • Food • Marketing Tech • Retail
The Senior Site Reliability Engineer ensures the reliability and performance of production systems through automation, incident response, and tool design, while mentoring junior engineers.
Top Skills: AksArgocdBashDatadogDockerElkGithub ActionsGoJavaKafkaKubernetesLinuxPrometheusPythonRedisSpring BootTerraformTomcat
Reposted 6 Days AgoSaved
Hybrid
New York City, NY, USA
205K-225K Annually
Senior level
205K-225K Annually
Senior level
Artificial Intelligence • Fintech • Payments • Social Impact • Analytics • Financial Services • Automation
As a Senior SRE, you'll ensure reliable and scalable systems, develop observability solutions and infrastructure as code, and lead incident response efforts.
Top Skills: AWSCloudFormationDatadogElkPrometheusTerraform
Reposted YesterdaySaved
In-Office
St. Louis, MO, USA
100K-120K Annually
Senior level
100K-120K Annually
Senior level
Fintech • Analytics
As a Senior Site Reliability Engineer, you'll lead incident recovery, enhance production stability, automate processes, and collaborate with development teams to improve operational efficiency.
Top Skills: AWSAzureBigpandaCloud-Native ApplicationsDatadogDnsDockerGitHTTPKubernetesShell ScriptingTcp/IpUnix
Reposted YesterdaySaved
In-Office
Saint Louis, MO, USA
Expert/Leader
Expert/Leader
Fintech • Analytics
Responsible for key application functions, driving remediation and automation, managing service operations, and ensuring continuous improvement in performance and cost-effectiveness.
Top Skills: DatadogItrs
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted YesterdaySaved
Easy Apply
Remote
United States
Easy Apply
171K-190K Annually
Senior level
171K-190K Annually
Senior level
Edtech • Kids + Family • Sales • Social Impact • Software
The role involves designing, building, and maintaining cloud infrastructure on AWS, focusing on scalability, security, and developer efficiency while providing technical leadership to the team.
Top Skills: AWSBuildkiteCi/CdCloudFormationCloudwatchDatadogDockerGithub ActionsJenkinsKubernetesPostgresPrometheusTerraform
Reposted YesterdaySaved
In-Office
Atlanta, GA, USA
Senior level
Senior level
Healthtech • Payments • Software
Looking for an SRE Principal Engineer to lead cloud strategy, optimize infrastructure, ensure reliability and security, and mentor junior team members.
Top Skills: AnsibleAWSAws CloudtrailAws CloudwatchBashCloudFormationConsulDockerGrafanaHashicorp VaultKubernetesLokiNew RelicPrometheusPythonTempoTerraform
Reposted YesterdaySaved
Easy Apply
In-Office
Burlington, MA, USA
Easy Apply
68K-102K Annually
Junior
68K-102K Annually
Junior
On-Demand • Security • Software
The Site Reliability Engineer is responsible for maintaining server and network infrastructure health, monitoring operations, tracking assets, and collaborating with IT and Engineering teams.
Top Skills: Asset Tracking SoftwareDastMonitoring ToolsSastSca
Reposted YesterdaySaved
In-Office
Chevy Chase, MD, USA
110K-260K Annually
Senior level
110K-260K Annually
Senior level
Insurance
As a Senior Staff Engineer, lead SRE initiatives, ensure reliability and performance of systems, mentor engineers, and respond to incidents.
Top Skills: AnsibleAzureDockerGoGrafanaHelmJavaKubernetesNoSQLOpenstackOpentelemetryPrometheusPuppetPythonSpinnakerSQLTerraform
Reposted YesterdaySaved
Easy Apply
Remote
USA
Easy Apply
Senior level
Senior level
Information Technology • Cryptocurrency
The Site Reliability Engineer will lead technical initiatives, architect solutions, troubleshoot issues, mentor team members, and improve observability practices.
Top Skills: ArgocdBashElk StackGCPGoGrafanaHelmKubernetesPrometheusPythonTerraform
Reposted YesterdaySaved
In-Office or Remote
5 Locations
175K-250K Annually
Mid level
175K-250K Annually
Mid level
Software
As a Site Reliability Engineer, you'll ensure platform reliability through scalable systems, incident response, observability, and collaboration with engineering teams.
Top Skills: AWSDatadogGrafanaKubernetesOpentelemetryPrometheusTypescript
Reposted YesterdaySaved
In-Office
Santa Clara, CA, USA
220K-270K Annually
Senior level
220K-270K Annually
Senior level
Cybersecurity
The Sr Principal Site Reliability Engineer will develop automation solutions, empower engineers with self-service tools, uphold reliability standards, and lead incident response efforts using AI for enhanced system reliability.
Top Skills: AWSAzureDockerGCPGoKubernetesPython
Reposted YesterdaySaved
In-Office or Remote
Charleston, SC, USA
Senior level
Senior level
Artificial Intelligence • Cloud • Information Technology • Machine Learning
The Site Reliability Engineer will ensure reliability and scalability of cloud-native microservices, manage CI/CD pipelines, and enhance system performance and security while collaborating with technical teams in healthcare projects.
Top Skills: AWSCloudFormationDynatraceElkFhirGitJenkinsJwtKubernetesOauthPrometheusRestful ApisTerraform
Reposted YesterdaySaved
Easy Apply
Remote
USA
Easy Apply
Senior level
Senior level
Gaming • Mobile • Software
As an SRE Manager, you will lead a team to enhance infrastructure services, manage incidents, and contribute to technical decisions while ensuring high availability and scalability of systems.
Top Skills: Amazon AwsAnsibleArtifactoryCrossplaneDatadogElasticsearchGitlabGoGCPJaegerJenkinsKubernetesAzureMongoDBPackerPostgresPythonRedisTerraformVault
Reposted 2 Days AgoSaved
In-Office
Ward, AR, USA
Mid level
Mid level
Financial Services
The Analyst SRE role focuses on enhancing the resilience and performance of API platforms, involving CI/CD pipeline management, incident resolution, and automation of tasks.
Top Skills: ArtifactoryAWSAzureCloudFormationCloudwatchDockerEc2EcsGitJenkinsKubernetesLinuxVpc
Reposted 2 Days AgoSaved
In-Office
Atlanta, GA, USA
Senior level
Senior level
Healthtech • Payments • Software
The Senior Specialist in Site Reliability Engineering will enhance system reliability, manage incidents, lead technical initiatives, and mentor engineers to improve system performance and scalability.
Top Skills: AWSAzureBashCloudFormationGCPGoGrafanaKubernetesPowershellPrometheusPythonSplunkTerraform
Reposted 2 Days AgoSaved
In-Office
Lake Oswego, OR, USA
70K-91K Annually
Mid level
70K-91K Annually
Mid level
Hardware • Information Technology • Other • Software • Analytics
The Site Reliability Engineer will optimize system performance, manage cloud infrastructure, and automate processes while ensuring seamless operations.
Top Skills: Automation ToolsCloud InfrastructureMonitoring ToolsSite Reliability EngineeringSystem Administration
7 Days AgoSaved
Easy Apply
Hybrid
Somerville, MA, USA
Easy Apply
150K-185K Annually
Senior level
150K-185K Annually
Senior level
Enterprise Web • Hardware • Internet of Things • Software
The Senior Site Reliability Engineer will mentor teams on observability practices, architect systems for growth, automate developer tasks, and debug production issues.
Top Skills: GoKubernetesLgtm StackOpentelemetryPrometheusTypescript
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account