Get the job you really want.

Top Site Reliability Engineer Jobs

11 Days AgoSaved
Hybrid
New York, NY, USA
110K-145K Annually
Mid level
110K-145K Annually
Mid level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
As a Site Reliability Engineer, you'll support NBCU's broadcasting and streaming infrastructure, ensuring system reliability, deployment automation, and incident response for critical systems.
Top Skills: AnsibleArgocdAWSBashCloudFormationCloudwatchDockerEksElkGithub ActionsGrafanaJenkinsKubernetesPythonSplunkTerraform
Reposted 11 Days AgoSaved
In-Office
Quincy, MA, USA
147K-220K Annually
Senior level
147K-220K Annually
Senior level
AdTech • eCommerce • Food • Marketing Tech • Retail
The Principal Site Reliability Engineer will design and lead site reliability practices, ensuring system resiliency and operational excellence, while mentoring junior staff and driving large-scale reliability initiatives.
Top Skills: AksArgocdDatadogDockerGithub ActionsJavaKubernetesPythonRedisSpring BootTerraformTomcat
Reposted 11 Days AgoSaved
In-Office
Salisbury, NC, USA
147K-220K Annually
Senior level
147K-220K Annually
Senior level
AdTech • eCommerce • Food • Marketing Tech • Retail
The Principal Site Reliability Engineer will lead operational excellence initiatives, overseeing high-availability systems and scalability strategies while mentoring teams.
Top Skills: AksArgocdBashDatadogDockerGithub ActionsGoJavaKubernetesPythonRedisSpring BootTerraformTomcat
Reposted 11 Days AgoSaved
Hybrid
Fort Worth, TX, USA
Senior level
Senior level
Financial Services
Lead the Site Reliability Engineering team, enhancing application reliability, scalability, and performance. Mentor engineers, manage incidents, and implement best practices.
Top Skills: DatadogDockerDynatraceEcsGit LabGrafanaJava Spring BootJenkinsKubernetesMicroservicesPrometheusPythonSplunkTerraform
Reposted 11 Days AgoSaved
In-Office or Remote
52 Locations
125K-135K Annually
Senior level
125K-135K Annually
Senior level
Artificial Intelligence • Computer Vision • HR Tech • Machine Learning • Software
The Site Reliability Engineer II will manage and ensure the reliability and efficiency of SaaS application platforms, leveraging tools for automation, monitoring, and incident response while collaborating with various teams.
Top Skills: AnsibleArgocdAWSAzureCisDockerElasticsearchFips 140-2Fips 140-3GCPGoGrafanaHelmIptablesJavaJenkinsKubernetesLinuxMongoDBMssqlMySQLPostgresPrometheusPythonSelinuxSolrStigTerraform
12 Days AgoSaved
In-Office
San Mateo, CA, USA
130K-280K Annually
Junior
130K-280K Annually
Junior
Cloud • Hardware • Security • Software
Manage and enhance infrastructure, automate processes, define roadmaps, and support engineering teams while ensuring uptime and efficiency.
Top Skills: ArgocdAWSKubernetesPythonTerraform
12 Days AgoSaved
Hybrid
McLean, VA, USA
193K-221K Annually
Senior level
193K-221K Annually
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
Lead a team to create cloud-based solutions, drive tech transformations, and mentor engineers while collaborating with product managers.
Top Skills: AnsibleAWSDockerGoJavaKubernetesPythonRubySQLTerraform
Reposted 19 Days AgoSaved
Easy Apply
In-Office
2 Locations
Easy Apply
160K-300K
Senior level
160K-300K
Senior level
Artificial Intelligence • Legal Tech • Machine Learning • Natural Language Processing • Software • Financial Services • Generative AI
As a Site Reliability Engineer, you will ensure system uptime, manage CI/CD pipelines, and enhance security and observability while troubleshooting issues in a collaborative environment.
Top Skills: AWSAzureCloudFormationDatadogDockerGCPGrafanaKubernetesPrometheusTerraform
Reposted 12 Days AgoSaved
Hybrid
Jersey City, NJ, USA
Senior level
Senior level
Financial Services
Lead Site Reliability Engineer responsible for implementing site reliability principles, mentoring engineers, and enhancing system observability and stability through AI-driven solutions.
Top Skills: .NetAws CloudDatadogDockerDynatraceEcsGitlabGrafanaJava Spring BootJenkinsKubernetesPrometheusPythonSplunkTerraform
Reposted 12 Days AgoSaved
Hybrid
Chicago, IL, USA
150K-190K Annually
Junior
150K-190K Annually
Junior
Fintech • Software
The Site Reliability Engineer will optimize development infrastructure, manage source control systems, and support production incidents while mentoring junior staff.
Top Skills: AWSDockerGitlab CiGoJenkinsKubernetesLinuxPythonShell ScriptingTeamcityUnix
Reposted 12 Days AgoSaved
Easy Apply
Remote
United States
Easy Apply
200K-275K
Senior level
200K-275K
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services
This role involves setting technical strategies, collaborating across teams, managing operations and availability, and fostering a culture of quality and ownership within the Site Reliability Engineering team.
Top Skills: AWSKotlinKubernetesMySQLPythonSpark
Reposted 12 Days AgoSaved
Easy Apply
Hybrid
New York City, NY, USA
Easy Apply
111K-218K Annually
Mid level
111K-218K Annually
Mid level
Big Data • Cloud • Software • Database
The Site Reliability Engineer will design and build cloud infrastructure for MongoDB Atlas, optimize performance, and automate services worldwide.
Top Skills: AWSDnsGCPHTTPKubernetesLinuxAzureProgramming LanguagesTls
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 12 Days AgoSaved
Easy Apply
Hybrid
Los Angeles, CA, USA
Easy Apply
160K-220K
Senior level
160K-220K
Senior level
Cloud • Mobile • Software
The Director of Engineering, DevOps will lead DevOps & SRE functions, ensuring infrastructure reliability and driving innovation across the organization.
Top Skills: AnsibleAWSAzureGCPGrafanaJenkinsKubernetesPrometheusTerraform
Reposted 12 Days AgoSaved
In-Office or Remote
2 Locations
150K-250K Annually
Mid level
150K-250K Annually
Mid level
Mobile • Software
Site Reliability Engineers will work on production infrastructure, focusing on AWS and Kubernetes while ensuring high availability and customer satisfaction.
Top Skills: AirflowAWSCircleCICloudwatchEksGrafanaMongoDBPagerdutyPingdomRustScala SparkTerraformTypescript
Reposted 12 Days AgoSaved
Easy Apply
Hybrid
Raleigh, NC, USA
Easy Apply
Senior level
Senior level
Cloud • Mobile • Software
The Director of Engineering, DevOps leads DevOps and Site Reliability Engineering, ensuring technical infrastructure reliability, scalability, and security while fostering automation and optimizing processes.
Top Skills: AnsibleAWSAzureGCPGrafanaJenkinsKubernetesPrometheusTerraform
13 Days AgoSaved
Hybrid
Jersey City, NJ, USA
50K-150K
Senior level
50K-150K
Senior level
Financial Services
Lead site reliability engineering efforts, mentor team members, advocate for reliability principles, and enhance operational efficiency through AI and automation.
Top Skills: AIAWSDatadogDynatraceElkGCPGrafanaKubernetesOpentelemetryPrometheusSplunkTerraform
13 Days AgoSaved
Hybrid
Fort Worth, TX, USA
Mid level
Mid level
Financial Services
The Site Reliability Engineer III collaborates on building reliable applications and infrastructure, implements CI/CD, and optimizes operational performance.
Top Skills: .NetDatadogDockerDynatraceEcsGitlabGrafanaJava/Spring BootJenkinsKubernetesPrometheusPythonSplunkTerraform
13 Days AgoSaved
Hybrid
New York, NY, USA
Mid level
Mid level
Financial Services
As a Software Engineer III, you will design and deliver technology products, ensuring secure and scalable solutions while solving complex problems and improving software applications.
Top Skills: Cloud TechnologiesDatabase Querying LanguagesFront-End TechnologiesModern Programming Languages
13 Days AgoSaved
Hybrid
Fort Worth, TX, USA
Senior level
Senior level
Financial Services
As a Senior Lead Site Reliability Engineer, you will design reliability frameworks, mentor engineers, lead observability implementations, and ensure effective service levels for applications and platforms.
Top Skills: DatadogDynatraceGrafanaPrometheusSplunk
13 Days AgoSaved
Remote or Hybrid
Orlando, FL, USA
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Staff Production Service Engineer will maintain cloud infrastructure, drive reliability improvements, troubleshoot issues, mentor team members, and utilize software development and systems engineering skills.
Top Skills: AnsibleAWSAzureBashDockerGCPGrafanaJavaJavaScriptKafkaKubernetesLinuxMariadbMySQLNginxOpenstackOraclePostgresPrometheusPuppetPythonSplunkTerraform
Reposted 13 Days AgoSaved
Hybrid
Fort Worth, TX, USA
Mid level
Mid level
Financial Services
Responsible for maintaining system reliability, coding solutions, incident resolution, and implementing observability practices in a financial institution.
Top Skills: CloudDatadogDynatraceGitlabGrafanaJenkinsLinuxPrometheusSplunkTerraformWindows
Reposted 13 Days AgoSaved
Hybrid
Austin, TX, USA
Mid level
Mid level
Cloud • Information Technology • Security • Software • Cybersecurity
Join a talented team as a Systems Reliability Engineer to enhance the Cloudflare platform's availability and performance using automation and monitoring tools.
Top Skills: AnsibleApache AirflowChefConsulDockerGoGrafanaLinuxNginxNomadPostgresPrometheusPuppetPythonRustSaltstackSQLTemporal
Reposted 13 Days AgoSaved
In-Office
Seattle, WA, USA
250K-375K Annually
Senior level
250K-375K Annually
Senior level
Artificial Intelligence • Digital Media • eCommerce • Marketing Tech • Software • Automation
Lead the Production Engineering team, ensuring the reliability of cloud infrastructure, managing talent development, and driving strategic initiatives.
Top Skills: AWSEksGCPKafkaPulsarSnsSqs
14 Days AgoSaved
Easy Apply
Hybrid
2 Locations
Easy Apply
170K-220K
Senior level
170K-220K
Senior level
Artificial Intelligence • Machine Learning • Software
As a Staff Site Reliability Engineer, you will enhance the reliability, scalability, and performance of production services by applying SRE principles, implementing observability practices, automating processes, and collaborating with engineering teams.
Top Skills: AWSAzureCloudFormationDatadogDockerElk StackGCPGoGrafanaJaegerKubernetesOpentelemetryOpentofuPrometheusPythonTerraform
Reposted 14 Days AgoSaved
Hybrid
Fort Worth, TX, USA
Mid level
Mid level
Financial Services
The Site Reliability Engineer III will optimize and maintain applications and infrastructure, implement automated solutions, and support SRE practices, collaborating across teams.
Top Skills: .NetAWSAzureCockroachdbDatadogDockerDynatraceEcsGitlabGrafanaJavaJenkinsKubernetesMySQLOraclePrometheusPythonSplunkSpring BootTerraform
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account