Top Site Reliability Engineer Jobs

Reposted Yesterday
In-Office
Houston, TX, USA
Senior level
Other • Energy
Lead SRE practices for GCP-based data platforms, automate workflows, design reliable architectures, mentor engineers, and improve operational processes.
Top Skills: BigQuery, CI/CD, Cloud Logging, Cloud Monitoring, Cloud Storage, Compute Engine, Dataflow, Datastream, GitHub Actions, GitLab CI, GKE, Google Cloud Platform, IAM, Kubernetes, Pub/Sub, Python, Terraform
Reposted Yesterday
In-Office
San Francisco, CA, USA
238K-290K Annually
Expert/Leader
Artificial Intelligence • Legal Tech • Professional Services • Software
As a Staff Software Engineer in Site Reliability, you'll manage infrastructure for reliability and scalability, lead incident management, and automate operational tasks.
Top Skills: AWS, Azure, Bash, CloudFormation, Datadog, GCP, Go, incident.io, PagerDuty, Pulumi, Python, Sentry, Terraform
Reposted Yesterday
In-Office
San Francisco, CA, USA
200K-260K Annually
Mid level
Artificial Intelligence • Legal Tech • Professional Services • Software
As a Software Engineer in Site Reliability, you will ensure the reliability and performance of our AI platform through automation and strategic infrastructure management.
Top Skills: AWS, Azure, Bash, CloudFormation, Datadog, GCP, Go, Kubernetes, PagerDuty, Python, Sentry, Terraform
Reposted Yesterday
In-Office
30005, Alpharetta, GA, USA
Mid level
Fintech • Consulting
The Site Reliability Engineer at Equifax manages system uptime across cloud-native and hybrid architectures, builds infrastructure as code, develops CI/CD pipelines, and enhances service reliability through automated tooling and troubleshooting.
Top Skills: Ansible, AWS, Bash, Chef, Docker, GCP, Go, Java, JavaScript, Jenkins, Kubernetes, Node.js, Python, Terraform
Reposted Yesterday
Hybrid
San Francisco, CA, USA
190K-220K Annually
Senior level
Artificial Intelligence • Big Data • Software
You will manage the infrastructure for the Data Replication team, focusing on Kubernetes, reliability standards, and integrating product features with infrastructure. You'll enhance observability and tooling using AI, ensuring engineers can effectively manage their stack.
Top Skills: AI, AWS, CI/CD, Datadog, GCP, Grafana, Kubernetes, Prometheus, Terraform
Reposted Yesterday
In-Office
Park, MI, USA
Senior level
Fintech • Financial Services
The role involves designing CI/CD pipelines, managing cloud infrastructure, ensuring platform stability, and leading automation efforts while mentoring junior engineers.
Top Skills: ARM, AWS, Azure, Azure DevOps, Bash, CloudFormation, Docker, Dynatrace, ELK, GCP, GitHub Actions, Grafana, Jenkins, Kubernetes, New Relic, Oracle, Prometheus, Python, Splunk, SQL, Terraform
Reposted Yesterday
In-Office or Remote
11 Locations
160K-179K Annually
Senior level
Fintech • Payments
The Senior Staff SRE leads reliability engineering initiatives, drives operational excellence, mentors staff, and influences architecture to enhance system reliability and performance.
Top Skills: AI/ML, AWS, Azure, Docker, ELK Stack, GCP, Grafana, Kubernetes, MySQL, NoSQL, Postgres, Splunk
2 Days Ago
Remote
USA
180K-210K Annually
Senior level
Artificial Intelligence • Insurance • Software • Automation
The Staff Site Reliability Engineer will build and scale infrastructure for Assured's platform, automate delivery, enhance observability, and lead mentoring initiatives.
Top Skills: AWS, Kubernetes, Postgres, Terraform
2 Days Ago
In-Office or Remote
5 Locations
73K-133K Annually
Mid level
Information Technology • Software
The Site Reliability Engineer will enhance continuous integration and delivery processes, manage multi-cloud environments, and mentor teams in deploying microservices solutions.
Top Skills: Ansible, Argo CD, AWS, Azure, Bash Scripting, Chef, CloudFormation, Docker, Fleet, Flux CD, Git, Grafana, Helm, Istio, Kubernetes, MySQL, Oracle, Postgres, Prometheus, Puppet, SQL Server, Terraform, VMware
Reposted 2 Days Ago
In-Office
Secaucus, NJ, USA
150K-170K Annually
Expert/Leader
Healthtech • Database
Seeking a Principal Site Reliability Engineer to build an SRE practice, enhance reliability, mentor teams, and drive performance engineering to optimize Quest products and services.
Top Skills: Ansible, Aurora, AWS, Azure, Bigtable, Cassandra, CI/CD, Cloud Pub/Sub, Cloud Spanner, Cloud SQL, Docker, DynamoDB, Dynatrace, GitLab, Go, GCP, Java, JMS, Kafka, Kinesis, Kubernetes, MQ, Perl, Python, RDS, Ruby, Shell Scripting, Terraform
2 Days Ago
In-Office
Atlanta, GA, USA
99K-124K Annually
Senior level
Fintech • Insurance • Financial Services
The Senior Site Reliability Engineer will design and maintain scalable infrastructure, develop software for reliability, implement CI/CD pipelines, monitor performance, collaborate on AI/ML workloads, and lead incident response efforts.
Top Skills: Ansible, AWS, Azure, Dynatrace, Git, Java, Python, Terraform
2 Days Ago
Hybrid
Arvada, CO, USA
160K-200K Annually
Mid level
Aerospace • Cloud • Software • Defense • Automation
Design and automate cloud systems for U.S. Government, focusing on DevSecOps, reliability, deployment automation, and observability. Participate in on-call rotations, supporting production environments and improving system resilience.
Top Skills: AWS EKS, Datadog, GitLab, Grafana, Kubernetes, Linux/Unix, Python, Terraform
2 Days Ago
In-Office
Los Angeles, CA, USA
130K-145K Annually
Mid level
Events
The Site Reliability Engineer II designs and maintains scalable systems, focusing on automation, monitoring, incident response, and collaboration with developers to enhance operational practices and efficiency.
Top Skills: Bash, Cloud Service Operations, Containers, Continuous Delivery, Continuous Integration, Go, Infrastructure as Code, Orchestration Platforms, Python
Reposted 2 Days Ago
In-Office
San Francisco, CA, USA
Senior level
Artificial Intelligence • Software
The Site Reliability Engineer ensures the reliability and performance of products Devin and Windsurf, managing incident response, CI/CD pipelines, infrastructure as code, and fostering a reliability culture within the engineering team.
Top Skills: AWS, Azure, CI/CD, GCP, Kubernetes, Terraform
Reposted 2 Days Ago
In-Office
Overland Park, KS, USA
Senior level
Healthtech • Professional Services • Software
The Sr Software Engineer leads complex software development, ensuring solution scalability, collaborating with teams, solving technical problems, and advocating for high-quality software solutions.
Top Skills: Angular, Argo CD, Azure DevOps, CI/CD, Google Cloud Platform, Kubernetes, New Relic, OpenTelemetry, Ruby on Rails, Terraform
Reposted 2 Days Ago
In-Office
New York, NY, USA
177K-265K Annually
Senior level
Fintech • Financial Services
The Site Reliability Engineer Lead oversees daily operations and architectural resilience, driving SRE principles for application performance and efficiency, and fostering a culture of technical excellence.
Top Skills: Ansible, AppDynamics, Go, Grafana, Java, Kubernetes, Loki, Mimir, OpenShift, Prometheus, Python, Tempo, Terraform
Reposted 2 Days Ago
Remote
United States
170K-200K Annually
Senior level
Software
Lead SRE to define SRE strategy, architecture, and roadmap; design and operate containerized, compliant cloud environments; build observability, incident management, automation, and developer platform capabilities; mentor SRE team and collaborate with security, compliance, and product teams to ensure reliability at scale.
Top Skills: AWS, AWS Marketplace, Azure, Azure Marketplace, GCP, Google Cloud Marketplace, Grafana, Kubernetes, Prometheus, Terraform
Reposted 2 Days Ago
In-Office
Irving, TX, USA
88K-137K Annually
Senior level
Consulting
As a Site Reliability Engineer, you'll enhance system performance and reliability through automation, monitor service levels, manage incidents, and improve application stability while collaborating with agile teams.
Top Skills: .NET Core, API Gateway, AppDynamics, AWS, C#, Datadog, Docker, Dynatrace, EC2, EKS, Hibernate, J2EE, JavaScript, JDBC, Jenkins, jQuery, Kubernetes, Lambda, New Relic, Node.js, React, Splunk, Spring, Tomcat
Reposted 2 Days Ago
In-Office
Washington, DC, USA
180K-225K Annually
Expert/Leader
Artificial Intelligence • Cloud • Social Impact • Software • Wearables
As a Staff Site Reliability Engineer at Axon, you will design and implement the Zero Touch platform while enhancing security and identity management across cloud systems, collaborating with various teams to improve overall infrastructure and reliability.
Top Skills: CDK, CloudFormation, Go, Java, Python, Terraform, TypeScript
Reposted 2 Days Ago
In-Office
Boston, MA, USA
180K-225K Annually
Senior level
Artificial Intelligence • Cloud • Social Impact • Software • Wearables
As a Staff Site Reliability Engineer, you will design and implement Axon's core platforms, focusing on automation, security, and compliance, while collaborating across teams to enhance cloud operations.
Top Skills: CDK, CloudFormation, Go, Java, Python, Terraform, TypeScript
Reposted 2 Days Ago
Remote
United States
Mid level
Software • Consulting
As a Senior Application Support Engineer, you will ensure application reliability, manage incidents, and collaborate with teams to enhance performance and support processes.
Top Skills: AppDynamics, AWS, Datadog, Linux, MuleSoft, OpenTelemetry, Python, Splunk
Reposted 2 Days Ago
In-Office
6 Locations
90K-122K Annually
Mid level
Fintech • Analytics
The Site Reliability Engineer will manage production monitoring, incident response, and enhance automation using various tools. They will ensure observability and participate in SRE process improvements.
Top Skills: AWS, Cucumber, Datadog APM, Datadog DBM, DynamoDB, EC2, ECS, ELK, Java, Jenkins, PagerDuty, Playwright, RDS, S3, Secrets Manager, Selenium, ServiceNow, Splunk, Spring Boot
Reposted 3 Days Ago
In-Office
St. Petersburg, FL, USA
86K-109K Annually
Senior level
Information Technology • Consulting
The Site Reliability Engineer will drive the observability roadmap, standardize monitoring practices, optimize alerting tools, and collaborate with teams to enhance operational efficiency and system reliability.
Top Skills: .NET, ASP.NET Core, AWS, Azure, C#, Datadog, Docker, GCP, Grafana, Kubernetes, New Relic, PowerShell, Prometheus, React, Splunk, Web APIs
3 Days Ago
Remote
USA
Mid level
Information Technology • Software
As a DevOps/Site Reliability Engineer, you will manage cloud infrastructure, CI/CD pipelines, and improve system reliability and performance while supporting AI data pipelines.
Top Skills: AWS, Datadog, EC2, EKS, GitHub Actions, Go, Grafana, IAM, Kubernetes, Prometheus, Python, RDS, S3, Terraform
3 Days Ago
In-Office
Norfolk, VA, USA
92K-167K Annually
Mid level
Information Technology • Software
The SRE Product Owner leads the SRE team, manages product strategy, engages stakeholders, optimizes reliability, and enhances automation processes.
Top Skills: Ansible, Atlassian Products, DoD 8570.01 IAT Level II, PowerShell, Python