Top Site Reliability Engineer Jobs

Reposted 17 days ago
In-Office
San Mateo, CA, USA
130K-200K Annually
Senior level
Edtech
The Senior Site Reliability Engineer will ensure product reliability and performance, develop monitoring and alerting systems, and propose architectural changes.
Top Skills: AWS, Bash, C, C++, Docker, GCP, Java, Kubernetes, Perl, Python
18 days ago
In-Office
Fairfax, VA, USA
Senior level
Artificial Intelligence • Cloud • Information Technology • Security • Software
Responsibilities include defining, implementing, and growing the SRE practice, ensuring reliability and performance of production environments, and collaborating with cross-functional teams.
Top Skills: AWS GovCloud, Bash, Docker, Elastic, Grafana, Kubernetes, Prometheus, Python, Splunk, TypeScript
Reposted 22 days ago
Remote
United States
150K-200K Annually
Mid level
Software
As a Senior Site Reliability Engineer at Regrello, you'll shape the developer platform, collaborate with customers, and ensure the reliability and security of infrastructure and applications.
Top Skills: AWS, Azure, CircleCI, GCP, GitHub Actions, GitLab CI, Go, Kubernetes, Terraform
18 days ago
Remote
USA
Senior level
Database
The Senior Site Reliability Engineer at Niche will manage cloud infrastructure, oversee incident responses, mentor team members, and promote best practices to ensure reliability across distributed systems and applications.
Top Skills: AWS, Bash, Docker, GCP, Git, Go, Grafana, Kafka, Kubernetes, Prometheus, Python, SQL, Sumo Logic, Terraform
Reposted 19 days ago
In-Office
Austin, TX, USA
111K-172K Annually
Mid level
Fintech • Information Technology • Payments
Seeking a Site Reliability Engineer to manage application reliability, incident resolution, and project delivery while collaborating with teams and automating tasks.
Top Skills: .NET, Ansible, Apache, AWS, Azure, Bash Scripting, C#, Docker, Java, Kubernetes, Linux, MongoDB, MSSQL, PowerShell, Python, Splunk, Terraform, Tomcat, Unix
Reposted 19 days ago
Remote or Hybrid
2 Locations
154K-199K Annually
Senior level
3D Printing • Aerospace • Hardware • Robotics • Software
Lead the reliability and scalability of BRINC's production systems, building secure cloud infrastructure and improving incident response. Collaborate with teams for optimal system performance.
Top Skills: AWS, Infrastructure as Code, JavaScript, Node.js, Python
Reposted 19 days ago
Easy Apply
In-Office
Bozeman, MT, USA
120K-170K Annually
Senior level
Artificial Intelligence
As a Senior Site Reliability Engineer, you will design automation for infrastructure, lead initiatives, ensure reliability and security, and collaborate across teams to enhance system scaling and efficiency.
Top Skills: Ansible, AWS, GCP, Go, Java, Python, Ruby, Spinnaker, Terraform
Reposted 19 days ago
In-Office
Vienna, VA, USA
82K-138K Annually
Senior level
Other • Software • Analytics
The Sr. Site Reliability Engineer will manage SaaS capabilities, implement monitoring systems, automate operational tasks, and provide on-call support.
Top Skills: AWS, Bash, Docker, EKS, ELK, Git, Java, Kubernetes, Prometheus, Python, Terraform
Reposted 19 days ago
In-Office
Charlotte, NC, USA
82K-138K Annually
Senior level
Other • Software • Analytics
The role involves deploying and managing SaaS solutions, automating infrastructure processes, troubleshooting system issues, and collaborating with a team of engineers.
Top Skills: ArcGIS Velocity, ArcGIS Workflow Manager, AWS, AWS Lambda, Bash, Docker, EKS, ELK, Git, Kafka, Kubernetes, OpenSearch, Prometheus, Python, Terraform
Reposted 19 days ago
In-Office
St. Louis, MO, USA
82K-138K Annually
Senior level
Other • Software • Analytics
As a Sr. Site Reliability Engineer, you will manage cloud-based SaaS products, automate infrastructure, troubleshoot issues, and provide technical support while collaborating with a team of engineers.
Top Skills: AWS, AWS Lambda, Bash, Docker, ECS, EKS, ELK, Git, Java, Kafka, Kubernetes, OpenSearch, Prometheus, Python, Security Groups, Terraform, VPC
Reposted 19 days ago
In-Office
Redlands, CA, USA
82K-138K Annually
Senior level
Other • Software • Analytics
The role involves deploying and managing SaaS capabilities on AWS, including monitoring systems, automation solutions, and troubleshooting incidents. Collaboration with SRE engineers is key to operational success across multiple regions.
Top Skills: AWS, AWS Lambda, Bash, Docker, ECS, ELK, Git, Kafka, Kubernetes, OpenSearch, Prometheus, Python, Terraform
21 days ago
In-Office
Bellevue, WA, USA
147K-202K Annually
Senior level
Cloud
The Senior Site Reliability Engineer will enhance the Splunk ecosystem and develop an Observability Platform by automating infrastructure and managing complex distributed systems, while optimizing log collection and incident response.
Top Skills: AWS, GCP, Go, Kubernetes, Linux, OpenTelemetry, Python, Ruby, Splunk, Terraform
Reposted 21 days ago
In-Office
San Jose, CA, USA
99K-229K Annually
Senior level
Artificial Intelligence • Information Technology • Software
Design, build, and operate large-scale Kubernetes platforms while developing Kubernetes Operators, enhancing reliability and scalability, and troubleshooting production issues.
Top Skills: CI/CD, Crossplane, Go, Helm, Kubernetes, Terraform
21 days ago
Remote
U.S.
Senior level
Cloud • Security
As a Senior Site Reliability Engineer, you'll ensure the reliability of production SaaS applications in Azure, automate operational processes, maintain compliance in a FedRAMP environment, and address incident responses.
Top Skills: Azure, Azure Application Insights, Azure DevOps, Datadog, Kubernetes, Log Analytics, PowerShell, Python, Terraform
Reposted 21 days ago
Easy Apply
Remote
US
Senior level
Software
As a Site Reliability Engineer at Podium, you'll ensure product stability and scalability, collaborate with engineering teams, handle on-call production issues, and mentor junior engineers.
Top Skills: Ansible, AWS, CI/CD, Datadog, Docker, Git, GitLab, Go, Helm, Honeycomb, Kubernetes, Prometheus, Python, Ruby, StrongDM, Terraform
Reposted 21 days ago
In-Office or Remote
Menlo Park, CA, USA
196K-225K Annually
Senior level
Cloud • Fintech • Other • Software
The Senior Site Reliability Engineer will support development teams, enhancing service reliability and automation, while ensuring compliance and security commitments.
Top Skills: AWS, Bash, Datadog, ECS, Go, Python, Terraform
Reposted 22 days ago
In-Office
Arlington, VA, USA
Senior level
Information Technology • Real Estate • Analytics
The Senior Site Reliability Engineer will design observability frameworks, own reliability for large-scale systems, and mentor others. Responsibilities include collaborating with teams, optimizing performance, and ensuring the infrastructure's resilience and scalability.
Top Skills: AWS, Azure, Bash, C#, CloudFormation, Docker, GCP, Java, Kubernetes, Node.js, Python, Terraform
22 days ago
In-Office
2 Locations
Senior level
Financial Services
Own reliability and scalability of on-prem observability platforms (ELK, Grafana); handle production escalations, capacity planning, SLOs, onboarding, automation, IaC (Terraform/Helm/Ansible), upgrades, security hardening, and platform modernization.
Top Skills: Elasticsearch, Logstash, Kibana, ELK Stack, Grafana, New Relic, SolarWinds, Prometheus, Terraform, Helm, Ansible, Chef, Puppet, Python, Shell Scripting/Linux Shell, Bash, Linux, Beats, Fluentd, Fluent Bit, OpenTelemetry, APM Instrumentation
Reposted 22 days ago
In-Office
Atlanta, GA, USA
Senior level
Healthtech • Payments • Software
Lead and mentor a team of client integration engineers, manage production and integration efforts for large-scale distributed systems, gather client requirements, drive SDLC activities, provide datacenter and customer support, and continuously improve system reliability and operations.
Top Skills: Java, Object-Oriented Programming
Reposted 23 days ago
Remote or Hybrid
2 Locations
230K-275K Annually
Senior level
Artificial Intelligence • Software
As a Senior SRE, you'll enhance data infrastructure, optimize performance, build reliability, automate processes, and manage incident responses while supporting enterprise clients' uptime requirements.
Top Skills: ClickHouse, Go, Postgres, Python, TypeScript
Reposted 23 days ago
Hybrid
New York, NY, USA
165K-225K Annually
Senior level
Blockchain • Software • Cryptocurrency • Web3
As a Senior Site Reliability Engineer, you will oversee AWS/GCP infrastructure, ensure system reliability, deploy applications, and enhance automation in a collaborative team environment.
Top Skills: Ansible, AWS, CI/CD, ELK, GCP, Jenkins, Kubernetes, Linux, Prometheus, Puppet, Terraform
Reposted 23 days ago
Hybrid
San Francisco, CA, USA
165K-225K Annually
Senior level
Blockchain • Software • Cryptocurrency • Web3
Responsible for ensuring reliability and scalability of systems, maintaining AWS/GCP infrastructure, deploying applications, and improving operational processes.
Top Skills: Ansible, AWS, DNS, ELK, GCP, HTTP, HTTPS, Jenkins, Kubernetes, Linux, Prometheus, Puppet, TCP, Terraform, UDP
Reposted 23 days ago
Easy Apply
In-Office
2 Locations
200K-240K Annually
Senior level
Artificial Intelligence • Healthtech • Software
Design, build, and maintain secure and scalable infrastructure for critical healthcare applications, lead incident responses, and support engineering teams.
Top Skills: Bash, GCP, Go, Grafana, Helm, Kubernetes, Prometheus, Python, Terraform
Reposted 23 days ago
Easy Apply
Remote
United States
Senior level
Automotive
Design and implement scalable cloud infrastructure, monitor performance, automate processes, ensure security and compliance, and lead a DevOps team.
Top Skills: AWS, Bash, CI/CD, Docker, ELK Stack, GCP, Grafana, Kubernetes, Prometheus, Python, Terraform
24 days ago
Hybrid
San Francisco, CA, USA
Senior level
Artificial Intelligence • Software • Generative AI
Lead reliability and performance for Plaud.ai's AI products. Design and operate scalable cloud-native systems, run on-call/incident response, build observability and reliability automation, define SLOs/SLIs, and drive postmortems and platform reliability improvements.
Top Skills: AWS, GCP, Azure, Kubernetes, Distributed Systems, Go, Python, Java, Observability (Metrics, Logs, Tracing)