Get the job you really want

Top Site Reliability Engineer Jobs

307+ Job Results
8 Days Ago
2 Locations
Remote
200 Employees
Senior level
200 Employees
Senior level
Software
As a Senior Platform Engineer at Mux, you will design and operate the infrastructure for Mux's platforms, focusing on scalable systems and CI/CD processes. You'll improve platform usability via automation, lead cross-functional projects, debug production issues, and promote engineering standards and best practices.
Top Benefits:
401-K
Commuter Benefits
Company Outings
+24 More
8 Days Ago
TX, USA
Remote
1,637 Employees
Senior level
1,637 Employees
Senior level
Cloud • Information Technology • Other • Software
The Site Reliability Engineer III is responsible for designing, developing, and optimizing systems for reliability and performance. This role involves implementing tools to measure system health, guiding engineering teams in observability practices, and improving operational processes. The engineer will proactively address production issues and provide technical leadership, mentoring other staff as needed.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+59 More
8 Days Ago
8 Locations
Remote
4,900 Employees
66K-87K Annually
Entry level
4,900 Employees
66K-87K Annually
Entry level
Fintech • Payments
As an entry-level Site Reliability Engineer, you will learn SRE principles, assist in system design and incident response, and support automation and tooling with a focus on improving system reliability. You will collaborate with development teams, engage in capacity planning, and understand security compliance while gaining practical experience in a technical role.
Top Benefits:
401-K
Adoption Assistance
Company Equity
+18 More
8 Days Ago
8 Locations
Remote
4,900 Employees
156K-208K Annually
Senior level
4,900 Employees
156K-208K Annually
Senior level
Fintech • Payments
The Senior Staff Site Reliability Engineer will lead and mentor SRE teams, design and implement scalable systems, optimize performance, manage incident responses, and ensure compliance and security within the organization. They will also focus on automation tools and collaborate closely with software development teams.
Top Benefits:
401-K
Adoption Assistance
Company Equity
+18 More
8 Days Ago
United States
Remote
259 Employees
165K-210K Annually
Senior level
259 Employees
165K-210K Annually
Senior level
Cloud • Payments
As a Staff Site Reliability Engineer at VGS, you will architect and maintain scalable cloud infrastructure, lead incident management, optimize performance, and collaborate with cross-functional teams to enhance system reliability. You will also advocate for best practices and mentor junior engineers while driving continuous improvement efforts.
8 Days Ago
Portland, OR, USA
35 Employees
Mid level
35 Employees
Mid level
Software
As a Site Reliability Engineer, you will manage and enhance AWS infrastructure, optimize Kubernetes clusters, develop Infrastructure as Code with Terraform, improve CI/CD pipelines, and ensure system security and performance monitoring. You will collaborate with teams to resolve issues and improve application reliability.
8 Days Ago
Dallas, TX, USA
2,760 Employees
Junior
2,760 Employees
Junior
Cloud • Information Technology
The Site Reliability Engineer ensures high availability and performance of OVHcloud products, manages infrastructure, diagnoses errors, automates tasks with scripting, and participates in software development and monitoring. They collaborate in on-call rotations and provide support for newly developed products and services.
8 Days Ago
United States
Remote
Mid level
Mid level
Software
As a Site Reliability Engineer at Vercel, you'll enhance Edge infrastructure, manage incident responses, and integrate SRE practices into engineering processes. You'll focus on improving reliability, performance, and efficiency while developing automated systems for software delivery and capacity management.
9 Days Ago
United States
Remote
410 Employees
Mid level
410 Employees
Mid level
Software
As a Site Reliability Engineer, you will manage production infrastructure on AWS and Azure, ensuring high availability and performance. You'll automate alerts, collaborate with R&D for scalable solutions, and document processes for repeatability. Your responsibilities include troubleshooting incidents, monitoring system observability, and conducting on-call duties.
9 Days Ago
United States
Remote
197 Employees
Senior level
197 Employees
Senior level
Healthtech
As a Senior Site Reliability Engineer at Sword Health, you will maintain service health, develop automation tools, optimize system performance, ensure security compliance, manage databases, and share knowledge within the team.
9 Days Ago
Rocklin, CA, USA
2,000 Employees
Entry level
2,000 Employees
Entry level
Information Technology • Machine Learning • Software • Analytics • Business Intelligence • App development • Generative AI
As a Site Reliability Engineer at Nisum, you will provide Level 2/3 support for eCommerce applications, analyze root causes of production issues, collaborate with teams to ensure application stability, monitor application performance, document support activities, and participate in on-call support.
9 Days Ago
Austin, TX, USA
1,500 Employees
120K-297K Annually
Junior
1,500 Employees
120K-297K Annually
Junior
Social Media • Software
As a Site Reliability Engineer, you will be responsible for maintaining and enhancing the performance and reliability of large-scale HPC and AI/ML systems, managing clusters, automating deployments, troubleshooting issues, and collaborating with cross-functional teams to support infrastructure.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+37 More
9 Days Ago
Denver, CO, USA
7,000 Employees
107K-153K Annually
Senior level
7,000 Employees
107K-153K Annually
Senior level
Artificial Intelligence • Cloud • Events • Productivity • Software • Business Intelligence • Conversational AI
The Site Reliability Engineer (SRE) at RingCentral is responsible for maintaining and improving service reliability and availability. Duties include integrating monitoring solutions, implementing failover mechanisms, conducting risk assessments, and responding to incidents in a collaborative environment. Experience with observability platforms, containerization, and programming is essential.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+47 More
9 Days Ago
Broomfield, CO, USA
Hybrid
205 Employees
Senior level
205 Employees
Senior level
AdTech • Big Data • Marketing Tech • Software
As a Site Reliability Engineer, you will automate software delivery, support cloud-based solutions on AWS, and improve delivery processes. Responsibilities include monitoring systems, collaborating with development teams, managing infrastructures, and participating in deployment and release processes. You will leverage your knowledge in Linux, scripting, and orchestration tools to maintain high availability of services.
Top Benefits:
401-K
401-K Matching
Commuter Benefits
+32 More
10 Days Ago
USA
Remote
43 Employees
Entry level
43 Employees
Entry level
Artificial Intelligence
As a Site Reliability Engineer at Phaidra, you will work in the Infrastructure Engineering team to build and maintain infrastructure that supports AI-powered control systems for industrial automation. You will leverage cloud platforms like AWS, GCP, or Azure, along with Kubernetes and CI/CD practices, while ensuring observability and reliability in the systems you oversee.
9 Days Ago
Raleigh, NC, USA
2,736 Employees
Mid level
2,736 Employees
Mid level
Information Technology • Security • Cybersecurity
The Site Reliability Engineer will co-develop and enhance cloud platform services, improve reliability and performance, automate deployment processes, support production readiness, lead incident responses, and drive improvements in operational efficiencies. The role demands expertise in programming, systems design, and incident management.
9 Days Ago
Greenwich, CT, USA
223,850 Employees
177K-265K Annually
223,850 Employees
177K-265K Annually
Not Specified
Fintech
An Engineering Manager role within the Site Reliability Engineering (SRE) team, responsible for improving team productivity, driving operational excellence, and fostering collaboration. Requires technical expertise, leadership abilities, and organisational skills.
9 Days Ago
USA
Remote
1,557 Employees
117K-126K Annually
Junior
1,557 Employees
117K-126K Annually
Junior
Healthtech • Software
As an Intermediate Site Reliability Engineer, you will ensure system reliability, scalability, and efficiency by maintaining uptime and performance, automating processes, and collaborating with teams to enhance system architecture. You'll also implement best practices in site reliability and system administration.
10 Days Ago
Alpharetta, GA, USA
4,172 Employees
150K-214K Annually
Mid level
4,172 Employees
150K-214K Annually
Mid level
Cloud • Security • Software • Cybersecurity
As a Team Lead, DevOps/SRE, you will influence team objectives, design and maintain scalable infrastructure on Microsoft Azure and other cloud platforms, automate deployments, and support service levels while improving system reliability and performance.
Top Benefits:
401-K
Commuter Benefits
Dental Insurance
+18 More
10 Days Ago
United States
Remote
264 Employees
195K-220K Annually
Mid level
264 Employees
195K-220K Annually
Mid level
Software
The Staff Software Engineer, SRE at Fieldwire will enhance the platform's cloud infrastructure, influence design decisions, lead monitoring and troubleshooting efforts, provide mentorship, and ensure compliance with company standards. They will work collaboratively with engineering teams to scale and improve Fieldwire’s services.
10 Days Ago
Philadelphia, PA, USA
Hybrid
259 Employees
Mid level
259 Employees
Mid level
Payments
The Site Reliability Engineer will work to ensure high availability and resiliency of the FreedomPay Commerce Platform. Responsibilities include implementing observability strategies, managing incident response, troubleshooting issues, and collaborating with teams to reduce manual toil. The role requires a tech-savvy individual with strong problem-solving skills and experience in high throughput web environments.
Top Benefits:
401-K
Commuter Benefits
Company Outings
+13 More
12 Days Ago
MO, USA
Remote
19,002 Employees
99K-183K Annually
Senior level
19,002 Employees
99K-183K Annually
Senior level
Healthtech
The Lead Site Reliability Engineer is responsible for managing and maintaining platform infrastructure performance, reliability, and security by utilizing SRE practices. They design Kubernetes clusters, implement Infrastructure as Code, manage container orchestration, and ensure compliance and security. Responsibilities also include monitoring, performance optimization, and mentoring junior team members.
Top Benefits:
401-K
Commuter Benefits
Company Outings
+16 More
12 Days Ago
USA
Remote
67 Employees
120K-135K Annually
Mid level
67 Employees
120K-135K Annually
Mid level
Cloud • Information Technology
The Staff Site Reliability Engineer will automate processes, collaborate with teams to implement an observability stack, design cloud solutions, improve system resilience, and enhance customer experiences. Responsibilities include resolving technical challenges and creating documentation for reliability issues.
12 Days Ago
New York, NY, USA
Remote
Hybrid
86 Employees
120K-250K Annually
Senior level
86 Employees
120K-250K Annually
Senior level
Big Data • Fintech • Machine Learning • Real Estate • Database
Cherre is seeking a Senior DevOps and Site Reliability Engineer to build and support its data management platform. Responsibilities include implementing integrations, deploying updates, developing scripts for automation, and improving customer experience through enhanced workflows. Candidates should have extensive experience in CI/CD, infrastructure management automation, and cloud systems architecture.
Top Benefits:
401-K
Commuter Benefits
Company Equity
+22 More
12 Days Ago
USA
Remote
2,355 Employees
Senior level
2,355 Employees
Senior level
Social Impact
As a Staff Site Reliability Engineer, you will lead and mentor a team, ensuring the reliability, scalability, and security of the platform. Responsibilities include designing AWS infrastructure, collaborating with developers for performance optimization, automating tasks, and developing monitoring systems to handle incidents efficiently.
Top Benefits:
Health Insurance
Conferences Training
Performance Bonus
+1 More
All Filters
Date Posted
Job Category
Experience
Industry
Company Name
Company Size