Get the job you really want.

Top Site Reliability Engineer Jobs

Reposted 9 Days AgoSaved
Easy Apply
Hybrid
Chicago, IL, USA
Easy Apply
129K-232K Annually
Senior level
129K-232K Annually
Senior level
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you will ensure the reliability of internal services, improving automation and infrastructure, and collaborating with engineering teams to resolve issues and enhance product performance.
Top Skills: DockerGoKafkaKubernetesMongoDBPostgresRedisRubyTerraform
Reposted 9 Days AgoSaved
Easy Apply
Hybrid
San Francisco, CA, USA
Easy Apply
129K-232K Annually
Senior level
129K-232K Annually
Senior level
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer at Braze, you'll ensure uptime for internal services, improve automation, and develop infrastructure tools, collaborating across teams to enhance reliability and scalability.
Top Skills: ChefDockerKafkaKubernetesMongoDBRedisRuby On RailsTerraform
4 Days AgoSaved
Easy Apply
Remote
US
Easy Apply
110K-175K Annually
Senior level
110K-175K Annually
Senior level
Cloud • Software
In this role, you'll support large-scale applications, improve observability, mentor team members, and ensure reliability by collaborating on deployments and writing automation scripts while providing 24/7 support.
Top Skills: AnsibleAWSBashConfluenceDockerElk StackGCPGitlab CicdGrafanaJenkinsJIRAKubernetesLinuxMongoDBMySQLNagiosOciPerlPostgresPrometheusPuppetPythonTerraform
Reposted 4 Days AgoSaved
In-Office
Boulder, CO, USA
103K-172K Annually
Senior level
103K-172K Annually
Senior level
Cloud • Information Technology
Operate and maintain high-performance, mission-critical visualization services; implement observability and instrumentation; optimize hardware usage; lead incident diagnosis and root-cause analysis; and develop deployment automation for on-prem and production integrations.
Top Skills: Ci/CdDockerGrafanaHelmKubernetesNatsOpenshiftPrometheusPythonRhelTempoUnreal Engine
Reposted 4 Days AgoSaved
Easy Apply
In-Office
San Francisco, CA, USA
Easy Apply
130K-175K Annually
Junior
130K-175K Annually
Junior
Energy
The Site Reliability Engineer will design and implement systems, drive automation, coordinate between teams, support deployed systems, and ensure scalability for rapid growth.
Top Skills: Active DirectoryAnsibleAWSAzureChefJSONLinuxPuppetPythonRestVMwareWindows ServerYaml
Reposted 4 Days AgoSaved
Easy Apply
Remote
United States
Easy Apply
170K-200K Annually
Senior level
170K-200K Annually
Senior level
Software
Lead SRE to define SRE strategy, architecture, and roadmap; design and operate containerized, compliant cloud environments; build observability, incident management, automation, and developer platform capabilities; mentor SRE team and collaborate with security, compliance, and product teams to ensure reliability at scale.
Top Skills: AWSAws MarketplaceAzureAzure MarketplaceGCPGoogle Cloud MarketplaceGrafanaKubernetesPrometheusTerraform
Reposted 4 Days AgoSaved
Easy Apply
In-Office
Boulder, CO, USA
Easy Apply
103K-172K Annually
Senior level
103K-172K Annually
Senior level
Analytics
Operate and maintain high-performance visualization and mission systems; implement observability and metrics; optimize hardware usage; lead problem diagnosis and root-cause analysis; develop deployment automation and integrate visualization software with development and production infrastructures.
Top Skills: Ci/CdDockerGrafanaHelmKubernetesNatsOpenshiftPrometheusPythonRhelTempoUnreal Engine
Reposted 5 Days AgoSaved
In-Office
San Francisco, CA, USA
Senior level
Senior level
Artificial Intelligence • Software
As a Site Reliability Engineer at Mercor, you will ensure production reliability, develop SRE function, and collaborate with engineering teams to maintain system performance.
Top Skills: AWSKubernetesSpaceliftTerraform
Reposted 5 Days AgoSaved
Remote
2 Locations
Junior
Junior
Computer Vision • Information Technology • Machine Learning • Natural Language Processing • Real Estate • Software
The SRE will maintain infrastructure for SaaS products on AWS, support developers, manage platform components, and handle IT tasks.
Top Skills: AWSComputer VisionIacLarge Language ModelsNlpTerraform
Reposted 5 Days AgoSaved
In-Office
New York, NY, USA
129K-168K Annually
Senior level
129K-168K Annually
Senior level
Healthtech
Build and harden AWS cloud environments and CI/CD pipelines, manage IaC and container platforms, own observability and incident response, enforce security and HA/DR, and automate operational tasks to support a regulated medical-imaging platform.
Top Skills: Apache AirflowAWSAws Cloudwatch InsightsBashCdkCloudFormationDicomDirect ConnectDockerEcsEksGitGrafanaHl7IamKmsKubernetesPrivatelinkPrometheusPythonSbomTerraformVpcVpn
Reposted 5 Days AgoSaved
Hybrid
Atlanta, GA, USA
Mid level
Mid level
Fintech • Payments • Financial Services
Build, operate, and scale AWS-based infrastructure using IaC (Terraform), manage EKS and serverless environments, create CI/CD pipelines, implement observability (OpenTelemetry/Prometheus/New Relic), support Postgres/RDS (Aurora), lead incident response and define SRE practices (SLIs/SLOs/error budgets).
Top Skills: AuroraAWSAws RdsAzureCloudFormationEcsEksGithub ActionsGitlabGoGCPJavaKubernetesNew RelicOpentelemetryOpentofuPostgresPrometheusPythonRubyServerlessTerraformTerragrunt
Reposted 5 Days AgoSaved
In-Office
Redwood City, CA, USA
156K-261K Annually
Senior level
156K-261K Annually
Senior level
Consumer Web • eCommerce • Fashion • Retail
Seeking a Staff Software Engineer for the SRE team to enhance CI/CD systems, optimize infrastructure, and improve developer productivity. Responsibilities include architecting solutions, mentoring engineers, and driving technical initiatives to elevate operational excellence.
Top Skills: AnsibleBashGithub ActionsGoHelmJenkinsKubernetesPythonRubySpinnakerTerraform
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 5 Days AgoSaved
In-Office
Sunnyvale, CA, USA
37-37 Hourly
Internship
37-37 Hourly
Internship
Software • Cybersecurity
The Engineering Intern will support backend, platform, or SRE tasks, learning to design reliable cloud infrastructure and automate processes using scripting languages. Responsibilities include monitoring and improving system reliability, assisting in incident management, and collaborating with engineering teams.
Top Skills: AWSAzureDockerGCPGoKubernetesPython
Reposted 11 Days AgoSaved
Hybrid
Austin, TX, USA
Senior level
Senior level
Big Data • Real Estate • Software
Seeking a Senior Site Reliability Engineer to enhance platform reliability, observability, and operational excellence using AWS, Kubernetes, and various monitoring tools in a collaborative environment.
Top Skills: Argo CdAWSCircleCICloudFormationDatadogEksFargateGoGrafanaJavaJenkinsKubernetesNewrelicPrometheusPythonSplunkTerraform
Reposted 11 Days AgoSaved
In-Office
Costa Mesa, CA, USA
143K-191K Annually
Senior level
143K-191K Annually
Senior level
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
As a Senior Site Reliability Engineer, you'll deploy and maintain Anduril's system hardware and software, ensuring mission-critical capabilities and troubleshooting complex issues while working collaboratively with engineering teams and customers.
Top Skills: C++GoPythonRust
6 Days AgoSaved
In-Office or Remote
3 Locations
195K-300K Annually
Expert/Leader
195K-300K Annually
Expert/Leader
Cloud • Information Technology • Security • Software
As a DevOps Architect, you'll lead automation efforts for SaaS services, mentor junior team members, and set strategic directions for CI/CD and monitoring solutions.
Top Skills: AIAWSAzureFluxGCPGoGrafanaJavaJenkinsKubernetesOciProgramming Languages: C/C++PrometheusPython
6 Days AgoSaved
Easy Apply
In-Office
Burlington, MA, USA
Easy Apply
68K-102K Annually
Junior
68K-102K Annually
Junior
On-Demand • Security • Software
The Site Reliability Engineer will maintain server and network health, manage hardware lifecycle, assist with network cabling, coordinate with vendors, and collaborate with IT and Engineering teams.
Top Skills: Asset Tracking SoftwareMonitoring ToolsNetwork InfrastructureServer Infrastructure
Reposted 6 Days AgoSaved
Remote
US
250K-300K Annually
Senior level
250K-300K Annually
Senior level
Artificial Intelligence • Information Technology • Machine Learning • Software • Cybersecurity • Generative AI • Data Privacy
Lead global SRE and infrastructure teams to ensure reliability, scalability, and cost-efficiency of production and developer platforms. Define cloud and Kubernetes architecture, IaC, CI/CD, SLOs/SLIs, incident management, and cloud cost optimization while partnering with Security, Product, Finance, and Engineering.
Top Skills: AIAutomationAWSCi/CdCloud-Native SystemsGCPInfrastructure As CodeKubernetesTerraform
7 Days AgoSaved
In-Office
3 Locations
53K-90K Annually
Junior
53K-90K Annually
Junior
Healthtech • Financial Services
As a Site Reliability Engineer I, you will support critical web applications, solve technical puzzles, collaborate across teams, and manage escalations while ensuring high service standards.
Top Skills: AWSAzureC#JavaPostgresPythonSQL
7 Days AgoSaved
Remote
Location, WV, USA
150K-200K Annually
Senior level
150K-200K Annually
Senior level
Insurance • Cybersecurity
Lead AI enablement across engineering by designing and developing tools for AI-assisted development, driving tooling adoption, and ensuring infrastructure reliability in production environments.
Top Skills: Ai-Assisted Development ToolsAWSDatadogEcsGithub ActionsGoKubernetesPythonTerraform
Reposted 7 Days AgoSaved
In-Office
McLean, VA, USA
78K-176K Annually
Junior
78K-176K Annually
Junior
Information Technology
Design, develop, and secure container platforms and CI/CD pipelines; automate builds, testing, and deployments; troubleshoot pipeline and cloud issues; recommend container adoption strategies; support cloud-native SDLC and APM monitoring; collaborate to solve complex platform problems.
Top Skills: AnsibleAWSAzureBash (Linux Shell Script)Ci/CdContainersDatadogDockerDynatraceGithub ActionsGitlabJenkinsKubernetesNew RelicOpenidPythonSAMLSingle-Sign-OnTerraform
Reposted 7 Days AgoSaved
In-Office
2 Locations
99K-225K Annually
Mid level
99K-225K Annually
Mid level
Information Technology
As a Site Reliability Engineer, you will enhance system resilience, manage cloud infrastructures, automate tasks, and document technical procedures for the Intelligence Community.
Top Skills: AWSC#CiscoCitrixDevOpsJavaJavaScriptJenkinsLinuxOracle CloudPythonVMwareWindows
7 Days AgoSaved
Hybrid
Pittsburgh, PA, USA
Senior level
Senior level
Fintech • Financial Services
The SRE will enhance software and systems reliability, automate operations, and improve customer experiences while managing scalability and performance.
Top Skills: .NetAIAmazon S3AngularAnsibleApp DynamicsAWSDockerHazelcastJavaKafkaKubernetesLinuxAzureOraclePythonSplunkSQLSybaseTerraform
7 Days AgoSaved
Remote
USA
156K-288K Annually
Mid level
156K-288K Annually
Mid level
Computer Vision • Machine Learning • Software
As a Site Reliability Engineer, ensure the reliability, performance, and scalability of Ditto's cloud infrastructure by developing observability solutions, leading incident management, and collaborating with product engineering teams.
Top Skills: AWSAzureCDatadogGCPGoGrafanaHelmJavaKubernetesPrometheusRustTerraform
7 Days AgoSaved
Remote
United States
120K-150K Annually
Senior level
120K-150K Annually
Senior level
Information Technology
The Lead Site Reliability Engineer will ensure platform reliability and performance, guiding SRE principles, managing incidents, and fostering collaboration across teams while leveraging cloud technologies and automation.
Top Skills: AWSAzureAzure DevopsBashBicepCloudFormationGithub ActionsGoJenkinsPowershellPythonTerraform
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account