Top Site Reliability Engineer Jobs

Reposted 6 Days Ago
Easy Apply
In-Office
2 Locations
160K-300K Annually
Senior level
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Financial Services • Generative AI
As a Site Reliability Engineer, you will ensure system uptime, manage CI/CD pipelines, and enhance security and observability while troubleshooting issues in a collaborative environment.
Top Skills: AWS, Azure, CloudFormation, Datadog, Docker, GCP, Grafana, Kubernetes, Prometheus, Terraform
Reposted 15 Hours Ago
Easy Apply
Remote
United States
200K-275K Annually
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services
The Staff Software Engineer in SRE is responsible for setting technical strategy, ensuring system availability, guiding incident management, and fostering talent within the team to enhance overall system reliability.
Top Skills: AWS, Bash, Kotlin, Kubernetes, MySQL, Python, Spark
Reposted 15 Hours Ago
In-Office
San Mateo, CA, USA
130K-280K Annually
Junior
Cloud • Hardware • Security • Software
Manage and enhance infrastructure, automate processes, define roadmaps, and support engineering teams while ensuring uptime and efficiency.
Top Skills: Argo CD, AWS, Kubernetes, Python, Terraform
Reposted 15 Hours Ago
Hybrid
Jersey City, NJ, USA
Senior level
Financial Services
The Lead Site Reliability Engineer will design solutions to enhance AI/ML platform reliability and scalability, mentor engineers, and ensure high-performance systems.
Top Skills: Ansible, AWS, Azure, Dynatrace, GCP, Grafana, OpenTelemetry, Terraform
Reposted 15 Hours Ago
Hybrid
Jersey City, NJ, USA
Mid level
Financial Services
The Site Reliability Engineer III will enhance system reliability and performance through coding, infrastructure management, and collaboration, making significant contributions to team solutions and practices.
Top Skills: .NET, Datadog, Docker, Dynatrace, ECS, GitLab, Grafana, Java, Jenkins, Kubernetes, Prometheus, Python, Splunk, Spring Boot, Terraform
Reposted 15 Hours Ago
Hybrid
Jersey City, NJ, USA
Senior level
Financial Services
Lead the operations of technology support for production applications, ensuring stability and availability while troubleshooting issues and collaborating across teams.
Top Skills: Automation Tools, Kubernetes, Observability Tools, Private Cloud, Public Cloud, Scripting Language, ServiceNow
Reposted 15 Hours Ago
Hybrid
2 Locations
162K-198K Annually
Mid level
Cloud • Information Technology • Security • Software • Cybersecurity
Join a talented team as a Systems Reliability Engineer to enhance the Cloudflare platform's availability and performance using automation and monitoring tools.
Top Skills: Ansible, Apache Airflow, Chef, Consul, Docker, Go, Grafana, Linux, Nginx, Nomad, Postgres, Prometheus, Puppet, Python, Rust, SaltStack, SQL, Temporal
Yesterday
Remote or Hybrid
Chapin, SC, USA
174K-272K Annually
Senior level
Artificial Intelligence • Big Data • Cloud • Information Technology • Machine Learning • Software
The Platform Site Reliability Engineer will design and maintain infrastructure, manage cloud systems, enhance incident responses, and embed reliability in services for a SaaS platform.
Top Skills: AWS, Bash, CI/CD, Datadog, Go, Grafana, Kubernetes, Linux, Prometheus, Python, Terraform
2 Days Ago
Easy Apply
Remote
USA
181K-212K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Seeking a Senior Site Reliability Engineer to enhance software reliability, automate systems, and mentor engineering teams in reliability practices. Requires strong skills in system design, coding, and observability, along with at least 6 years of software engineering experience.
Top Skills: AWS, Azure, Datadog, Docker, EC2, GCP, Go, Kibana, Kubernetes, Ruby, Terraform
2 Days Ago
Easy Apply
Hybrid
San Francisco, CA, USA
214K-260K Annually
Senior level
Artificial Intelligence • Information Technology • Machine Learning • Natural Language Processing • Productivity • Software • Generative AI
As a Site Reliability Engineer, you'll build software to ensure system reliability, scale infrastructure, and deploy ML systems while collaborating with cross-functional teams.
Top Skills: AWS, Azure, Docker, GCP, Java, Kubernetes, Linux, Terraform
2 Days Ago
Hybrid
Jersey City, NJ, USA
Senior level
Financial Services
Lead technical efforts in site reliability, mentor engineers, design resilient systems, and utilize AI for operational efficiency. Oversee incident management and stakeholder engagement to meet service objectives.
Top Skills: .NET, AWS, Datadog, Docker, Dynatrace, ECS, GitLab, Grafana, Java Spring Boot, Jenkins, Kubernetes, Prometheus, Python, Splunk, Terraform
Reposted 2 Days Ago
Hybrid
Jersey City, NJ, USA
Senior level
Financial Services
As a Senior Lead Site Reliability Engineer, you'll oversee network reliability, collaborate with teams, drive best practices, manage incidents, and foster team development.
Top Skills: Arista, AWS, Azure, BGP, Broadcom, Cisco, F5, GitLab, Grafana, HTTPS, Jenkins, Juniper, Kibana, Palo Alto, Prometheus, SD-WAN, SevOne, Splunk, TCP/IP, Terraform, ThousandEyes
Reposted 2 Days Ago
Hybrid
2 Locations
119K-170K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
The Staff Site Reliability Engineer will manage operational duties for FedRAMP cloud products, enhance monitoring systems, handle incident management, and drive automation in cloud environments.
Top Skills: Ansible, AWS ECS, AWS GovCloud, Kubernetes, Linux, Python, Terraform
Reposted 2 Days Ago
Hybrid
Austin, TX, USA
Senior level
Big Data • Real Estate • Software
As a Staff SRE Engineer, you'll enhance reliability and observability of platform infrastructure, mentor engineers, and drive architectural improvements across critical systems.
Top Skills: Argo CD, AWS, CircleCI, CloudFormation, CloudWatch, Datadog, EC2, EKS, Go, Grafana, IAM, Java, Jenkins, Kubernetes, New Relic, Prometheus, Python, RDS, S3, Splunk, Terraform
3 Days Ago
Easy Apply
Hybrid
Austin, TX, USA
95K-152K Annually
Mid level
Artificial Intelligence • Cloud • Information Technology • Machine Learning • Software
As a Site Reliability Engineer, you'll ensure uptime of systems, automate operational tasks, and work closely with development to enhance performance and reliability.
Top Skills: Ansible, Chef, Docker, Go, Kubernetes, Linux, Puppet, Python, Shell
3 Days Ago
Hybrid
2 Locations
209K-262K Annually
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
Lead diverse technology projects, manage a team of developers, collaborate on cloud solutions, mentor engineers, and stay updated on tech trends.
Top Skills: AWS, Docker, Go, GCP, HTML/CSS, Java, JavaScript, Kubernetes, Azure, NoSQL, Open Source RDBMS, Python, SQL, TypeScript
3 Days Ago
Easy Apply
In-Office
2 Locations
Mid level
AdTech
As a Site Reliability Engineer, you'll maintain the infrastructure for systems, ensure efficiency, automate processes, monitor databases, and participate in architecture discussions.
Top Skills: Amazon Kinesis, AWS Lambda, AWS SNS, BigQuery, Docker, GCP (Google Cloud Platform), GitLab, Google Cloud Functions, Google Cloud Run, Google Pub/Sub, Grafana, Istio, Kafka, Kubernetes, MySQL, Prometheus, Spanner, SQL, Terraform
Reposted 3 Days Ago
Remote or Hybrid
United States
Expert/Leader
Fintech • Software
The Principal Site Reliability Engineer is responsible for maintaining cloud infrastructure, ensuring application performance, and implementing automated solutions in a SaaS environment, while collaborating with security and software engineering teams.
Top Skills: .NET, Ansible, AppDynamics, AWS, Azure, Azure DevOps, C#, Datadog, Dynatrace, Harness, Java, Jenkins, Kubernetes, New Relic, Terraform
Reposted 3 Days Ago
Hybrid
Chicago, IL, USA
100K-157K Annually
Mid level
AdTech • Digital Media • Marketing Tech
The Site Reliability Engineer 3 will ensure system reliability, automate operations, optimize performance, and collaborate across teams for product implementation.
Top Skills: Ansible, AWS, AWS S3, Azure, Cassandra, Docker, ELK Stack, GCP, Go, Grafana, Hadoop, HDFS, Java, Kafka, Kubernetes, MySQL, NoSQL Databases, OCI, Postgres, Prometheus, Python, Scala, Spark, Terraform
Reposted 3 Days Ago
Hybrid
Chicago, IL, USA
100K-157K Annually
Mid level
AdTech • Digital Media • Marketing Tech
The SRE 3 will ensure the reliability and performance of FreeWheel systems, manage infrastructure, automate operations, and support cross-team collaboration. Responsibilities include system monitoring, incident response, performance optimization, and ensuring security compliance.
Top Skills: Ansible, AWS, AWS S3, Azure, Cassandra, Docker, ELK Stack, GCP, Go, Grafana, Hadoop, HDFS, Java, Kafka, Kubernetes, MySQL, OCI, Postgres, Prometheus, Python, Scala, Spark, Terraform
Reposted 3 Days Ago
In-Office or Remote
8 Locations
185K-327K Annually
Senior level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The Embedded Site Reliability Engineer will develop and maintain software applications for Bitcoin mining, focusing on embedded systems and cloud observability. Responsibilities include software testing, bug triage, and collaboration with engineering teams to optimize performance and reliability.
Top Skills: C, C++, Datadog, Elastic, Go, Grafana, JavaScript, Linux, Python, Rust, Splunk, SQL, TypeScript
4 Days Ago
Hybrid
Fort Worth, TX, USA
Senior level
Financial Services
As a Lead Site Reliability Engineer, you will oversee team operations, mentor engineers, implement SRE principles, and enhance system reliability through AI automation.
Top Skills: AWS, Datadog, Dynatrace, ELK, GCP, Grafana, Kubernetes, OpenTelemetry, Prometheus, Splunk, Terraform
4 Days Ago
Hybrid
Houston, TX, USA
Mid level
Financial Services
The Site Reliability Engineer III will enhance application reliability through code and cloud infrastructure, focusing on monitoring, optimizing, and collaborating across engineering teams to solve complex issues.
Top Skills: .NET, Datadog, Docker, Dynatrace, ECS, GitLab, Grafana, Java, Jenkins, Kubernetes, Prometheus, Python, Splunk, Spring Boot, Terraform
Reposted 4 Days Ago
Hybrid
Jersey City, NJ, USA
Mid level
Financial Services
As a Site Reliability Engineer III, you'll solve complex problems using code and cloud infrastructure, ensure reliability, and optimize application performance while collaborating with teams.
Top Skills: AWS, Azure, Datadog, Docker, Dynatrace, ECS, GCP, GitLab, Grafana, Java, Jenkins, Kafka, Kubernetes, Prometheus, Python, Splunk, Spring Boot, Terraform
Reposted 4 Days Ago
Easy Apply
Remote
USA
140K-165K Annually
Junior
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Site Reliability Engineer will enhance reliability and observability, automate processes, support engineering teams, and promote a culture of reliability at Coinbase.
Top Skills: AWS, Azure, Docker, EC2, GCP, Go, Kubernetes, Ruby, Terraform