Get the job you really want

Top Site Reliability Engineer Jobs

411+ Job Results
10 Days Ago
United States
Remote
410 Employees
Mid level
410 Employees
Mid level
Software
As a Site Reliability Engineer, you will manage production infrastructure on AWS and Azure, ensuring high availability and performance. You'll automate alerts, collaborate with R&D for scalable solutions, and document processes for repeatability. Your responsibilities include troubleshooting incidents, monitoring system observability, and conducting on-call duties.
10 Days Ago
United States
Remote
197 Employees
Senior level
197 Employees
Senior level
Healthtech
As a Senior Site Reliability Engineer at Sword Health, you will maintain service health, develop automation tools, optimize system performance, ensure security compliance, manage databases, and share knowledge within the team.
10 Days Ago
Rocklin, CA, USA
2,000 Employees
Entry level
2,000 Employees
Entry level
Information Technology • Machine Learning • Software • Analytics • Business Intelligence • App development • Generative AI
As a Site Reliability Engineer at Nisum, you will provide Level 2/3 support for eCommerce applications, analyze root causes of production issues, collaborate with teams to ensure application stability, monitor application performance, document support activities, and participate in on-call support.
10 Days Ago
Austin, TX, USA
1,500 Employees
120K-297K Annually
Junior
1,500 Employees
120K-297K Annually
Junior
Social Media • Software
As a Site Reliability Engineer, you will be responsible for maintaining and enhancing the performance and reliability of large-scale HPC and AI/ML systems, managing clusters, automating deployments, troubleshooting issues, and collaborating with cross-functional teams to support infrastructure.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+37 More
10 Days Ago
Denver, CO, USA
7,000 Employees
107K-153K Annually
Senior level
7,000 Employees
107K-153K Annually
Senior level
Artificial Intelligence • Cloud • Events • Productivity • Software • Business Intelligence • Conversational AI
The Site Reliability Engineer (SRE) at RingCentral is responsible for maintaining and improving service reliability and availability. Duties include integrating monitoring solutions, implementing failover mechanisms, conducting risk assessments, and responding to incidents in a collaborative environment. Experience with observability platforms, containerization, and programming is essential.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+47 More
11 Days Ago
Broomfield, CO, USA
Hybrid
205 Employees
Senior level
205 Employees
Senior level
AdTech • Big Data • Marketing Tech • Software
As a Site Reliability Engineer, you will automate software delivery, support cloud-based solutions on AWS, and improve delivery processes. Responsibilities include monitoring systems, collaborating with development teams, managing infrastructures, and participating in deployment and release processes. You will leverage your knowledge in Linux, scripting, and orchestration tools to maintain high availability of services.
Top Benefits:
401-K
401-K Matching
Commuter Benefits
+32 More
11 Days Ago
USA
Remote
43 Employees
Entry level
43 Employees
Entry level
Artificial Intelligence
As a Site Reliability Engineer at Phaidra, you will work in the Infrastructure Engineering team to build and maintain infrastructure that supports AI-powered control systems for industrial automation. You will leverage cloud platforms like AWS, GCP, or Azure, along with Kubernetes and CI/CD practices, while ensuring observability and reliability in the systems you oversee.
11 Days Ago
Raleigh, NC, USA
2,736 Employees
Mid level
2,736 Employees
Mid level
Information Technology • Security • Cybersecurity
The Site Reliability Engineer will co-develop and enhance cloud platform services, improve reliability and performance, automate deployment processes, support production readiness, lead incident responses, and drive improvements in operational efficiencies. The role demands expertise in programming, systems design, and incident management.

Featured Jobs

11 Days Ago
Greenwich, CT, USA
223,850 Employees
177K-265K Annually
223,850 Employees
177K-265K Annually
Not Specified
Fintech
An Engineering Manager role within the Site Reliability Engineering (SRE) team, responsible for improving team productivity, driving operational excellence, and fostering collaboration. Requires technical expertise, leadership abilities, and organisational skills.
11 Days Ago
USA
Remote
1,557 Employees
117K-126K Annually
Junior
1,557 Employees
117K-126K Annually
Junior
Healthtech • Software
As an Intermediate Site Reliability Engineer, you will ensure system reliability, scalability, and efficiency by maintaining uptime and performance, automating processes, and collaborating with teams to enhance system architecture. You'll also implement best practices in site reliability and system administration.
11 Days Ago
Alpharetta, GA, USA
4,172 Employees
150K-214K Annually
Mid level
4,172 Employees
150K-214K Annually
Mid level
Cloud • Security • Software • Cybersecurity
As a Team Lead, DevOps/SRE, you will influence team objectives, design and maintain scalable infrastructure on Microsoft Azure and other cloud platforms, automate deployments, and support service levels while improving system reliability and performance.
Top Benefits:
401-K
Commuter Benefits
Dental Insurance
+18 More
12 Days Ago
Philadelphia, PA, USA
Hybrid
259 Employees
Mid level
259 Employees
Mid level
Payments
The Site Reliability Engineer will work to ensure high availability and resiliency of the FreedomPay Commerce Platform. Responsibilities include implementing observability strategies, managing incident response, troubleshooting issues, and collaborating with teams to reduce manual toil. The role requires a tech-savvy individual with strong problem-solving skills and experience in high throughput web environments.
Top Benefits:
401-K
Commuter Benefits
Company Outings
+13 More
14 Days Ago
MO, USA
Remote
19,002 Employees
99K-183K Annually
Senior level
19,002 Employees
99K-183K Annually
Senior level
Healthtech
The Lead Site Reliability Engineer is responsible for managing and maintaining platform infrastructure performance, reliability, and security by utilizing SRE practices. They design Kubernetes clusters, implement Infrastructure as Code, manage container orchestration, and ensure compliance and security. Responsibilities also include monitoring, performance optimization, and mentoring junior team members.
Top Benefits:
401-K
Commuter Benefits
Company Outings
+16 More
14 Days Ago
USA
Remote
67 Employees
120K-135K Annually
Mid level
67 Employees
120K-135K Annually
Mid level
Cloud • Information Technology
The Staff Site Reliability Engineer will automate processes, collaborate with teams to implement an observability stack, design cloud solutions, improve system resilience, and enhance customer experiences. Responsibilities include resolving technical challenges and creating documentation for reliability issues.
14 Days Ago
New York, NY, USA
Remote
Hybrid
86 Employees
120K-250K Annually
Senior level
86 Employees
120K-250K Annually
Senior level
Big Data • Fintech • Machine Learning • Real Estate • Database
Cherre is seeking a Senior DevOps and Site Reliability Engineer to build and support its data management platform. Responsibilities include implementing integrations, deploying updates, developing scripts for automation, and improving customer experience through enhanced workflows. Candidates should have extensive experience in CI/CD, infrastructure management automation, and cloud systems architecture.
Top Benefits:
401-K
Commuter Benefits
Company Equity
+22 More
14 Days Ago
USA
Remote
2,355 Employees
Senior level
2,355 Employees
Senior level
Social Impact
As a Staff Site Reliability Engineer, you will lead and mentor a team, ensuring the reliability, scalability, and security of the platform. Responsibilities include designing AWS infrastructure, collaborating with developers for performance optimization, automating tasks, and developing monitoring systems to handle incidents efficiently.
Top Benefits:
Health Insurance
Conferences Training
Performance Bonus
+1 More
20 Days Ago
U.S.
Remote
Senior level
Senior level
eCommerce • Software • Design • SEO
As a Senior Site Reliability Engineer, you will enhance the reliability of Webflow's applications, maintain monitoring tools, optimize resource allocation in Kubernetes, collaborate across teams, and improve incident response processes. Your role focuses on ensuring the stability and scalability of customer-facing infrastructure for millions of users.
Top Benefits:
401-K
Commuter Benefits
Company Equity
+50 More
20 Days Ago
Texas, USA
Remote
460 Employees
Senior level
460 Employees
Senior level
Food • Logistics • Mobile • On-Demand • App development
As a Senior Site Reliability Engineer, you will drive cloud and configuration management, ensure system reliability and performance, and mentor team members. Your role includes service disruption troubleshooting, maintaining monitoring systems, and reducing operational toil. You'll work closely with various engineering teams to deploy and operate products at scale while advocating for best practices in production systems management.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+36 More
15 Days Ago
Oakland, CA, USA
Hybrid
1,200 Employees
187K-233K Annually
Senior level
1,200 Employees
187K-233K Annually
Senior level
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer at Fivetran, you will be responsible for ensuring the reliability and robustness of the production infrastructure, improving incident response, and managing the deployment pipeline while engaging with various teams to maintain high availability of services.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+57 More
15 Days Ago
Oakland, CA, USA
Hybrid
1,200 Employees
187K-233K Annually
Senior level
1,200 Employees
187K-233K Annually
Senior level
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer, you will ensure the reliability and performance of Fivetran's production infrastructure, improve systems reliability, manage incident responses, and collaborate with engineering on deployment and automation scripts.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+57 More
15 Days Ago
Oakland, CA, USA
Hybrid
1,200 Employees
187K-233K Annually
Senior level
1,200 Employees
187K-233K Annually
Senior level
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer at Fivetran, you will ensure the reliability and robustness of the production infrastructure, handle incident responses, and drive improvements in system performance while collaborating with various teams.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+57 More
15 Days Ago
Oakland, CA, USA
Hybrid
1,200 Employees
187K-233K Annually
Senior level
1,200 Employees
187K-233K Annually
Senior level
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer at Fivetran, you will ensure the reliability and performance of the infrastructure by monitoring systems, managing incident responses, and collaborating with engineering teams to enhance deployment processes. You will own the scalability and stability of the infrastructure while integrating reliability into the product roadmap.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+57 More
15 Days Ago
Oakland, CA, USA
Hybrid
1,200 Employees
187K-233K Annually
Senior level
1,200 Employees
187K-233K Annually
Senior level
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer at Fivetran, you'll ensure the performance and reliability of its infrastructure, drive incident response efforts, and collaborate with engineering teams to enhance the product's reliability and stability. You'll also manage monitoring, deployment pipelines, and work closely with security to mitigate infrastructure vulnerabilities.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+57 More
15 Days Ago
United States
Remote
126 Employees
100K-130K Annually
Junior
126 Employees
100K-130K Annually
Junior
Security
Multiple Site Reliability Engineer positions available at a leading cybersecurity company with a focus on automating infrastructure operations and scaling for future growth. Responsibilities include managing production services, automating common actions, implementing monitoring strategies, collaborating with teams, and improving software development processes. Qualifications include a BS or MS in Computer Science, 1 year of industry experience, excellent communication skills, problem-solving abilities, and UNIX/Linux system administration background.
16 Days Ago
USA
Remote
660 Employees
Senior level
660 Employees
Senior level
Blockchain • Fintech • Cryptocurrency
As a Principal Site Reliability Engineer at Gemini, you will lead engineering teams in modern DevOps practices, enhance service reliability and performance, provide architectural guidance, and implement best practices in monitoring and automation. You'll also evaluate systems pre-launch and educate teams on reliability and resiliency methods.
Top Benefits:
401-K
401-K Matching
Commuter Benefits
+30 More
All Filters
Date Posted
Job Category
Experience
Industry
Company Name
Company Size