Get the job you really want.
Top Site Reliability Engineer Jobs
The Platform SRE II role involves architecting and developing software frameworks for Grubhub's cloud platform, ensuring they are testable and fault tolerant. You'll work with engineers to enable API calls, data storage, and job queues, while providing tier-3 support and consulting on infrastructure design.
The Site Reliability Engineer will join the SRE team to manage and improve the caching infrastructure and automation for Atlassian's cloud products. Responsibilities include ensuring high-availability systems, managing public cloud services, developing and debugging code, and automating tasks.
The Principal Site Reliability Engineer will enhance service reliability and performance by collaborating with various teams to implement reliability practices. The role requires deep expertise in cloud infrastructure and operational experience while mentoring other engineers and driving large-scale initiatives.
As a Senior Software Engineer focusing on Site Reliability Engineering, you'll design, develop, and maintain software solutions addressing complex business needs and enhance application reliability and observability using tools such as Dynatrace and BigPanda.
In this role, you will ensure the reliability and performance of eCommerce platform systems. Responsibilities include monitoring and troubleshooting systems, participating in on-call rotations, designing automated processes, collaborating with teams on operational practices, and ensuring compliance with security requirements.
The Lead Site Reliability Engineer will lead and improve the reliability of applications and platforms, conduct resiliency design reviews, mentor team members, and provide technical guidance on issues impacting the team's performance. This role involves collaborating on service level objectives and managing major incidents effectively.
The Infrastructure Engineer at Kustomer will build and maintain software infrastructure and automation capabilities, focusing on deployment, testing, performance, and scalability. This role involves conducting reviews, providing support in systems architecture design, and leading initiatives for application performance monitoring and continuous integration.
Oversee daily activities of your software engineering team, align priorities with team goals, educate stakeholders on service-level objectives, manage issues, and promote a safety culture while undertaking technical projects in the area of site reliability.
Featured Jobs
The Site Reliability Engineer in the Infrastructure team will manage and enhance the company’s infrastructure, focusing on automation, technical support, and defining the infrastructure roadmap. Responsibilities include maintaining system uptime, optimizing costs, enhancing security measures, and implementing monitoring solutions.
The Director of Software Engineering will lead teams in Cloud Governance and Site Reliability Engineering (SRE), focusing on managing cloud infrastructure, enhancing operational excellence, and delivering innovative technology solutions. Responsibilities include mentoring engineers, influencing stakeholders, defining strategies, and continuously improving software engineering practices.
The Site Reliability Engineer will design and develop software systems to solve operations problems using AI and ML. They will work within the SDLC to build cloud solutions, implement SRE frameworks, and support production in cloud environments. Responsibilities include failure analysis, maintaining technical documentation, and acting as a quality control for engineering deliverables, while participating in on-call rotations.
As a Principal Site Reliability Engineer in the Datastores team, you will ensure the reliability and scalability of mission-critical datastores, drive automation for operational excellence, and collaborate with cross-functional teams to shape architecture and strategy. This role involves mentoring team members and designing systems optimized for availability and performance.
As a Staff Site Reliability Engineer, you will ensure the reliability of data infrastructure, collaborate on automation, monitoring, and compliance, while leading efforts to maintain data performance and availability.
The Site Reliability Engineer will improve the quality of pager alerts, maintain operational stability, keep infrastructure updated, and enhance operational security while collaborating effectively with teams. Responsibilities also include monitoring engineering initiatives and operating scalable applications and databases.
The role involves supporting and diagnosing issues in a distributed environment, capacity planning, and application migrations. Applicants must be knowledgeable in UNIX/Linux and have experience in Python and SQL. Strong communication skills and the ability to work under pressure are essential.
The Site Reliability Engineer at Citadel will focus on ensuring the reliability and performance of applications, automating tasks, resolving systemic issues, and collaborating with various engineering teams. Responsibilities also include incident management, improving systems operationally, and promoting the SRE mindset across teams.
As a Senior Software Engineer focusing on DevOps/SRE, you will collaborate within Agile teams to design, develop, and support robust technical solutions. Your role involves leveraging various programming languages and cloud technologies to deliver experiences that enhance financial empowerment for users.
As a Senior Software Engineer in DevOps, you'll collaborate with Agile teams to design and implement full-stack technical solutions, mentor peers, and deliver cloud-based services that empower customers. The role emphasizes staying updated with tech trends and utilizing diverse tools and languages in a cloud-centric environment.
The Site Reliability Engineer will scale cloud services, manage caching infrastructure, and improve service reliability and performance. Responsibilities include building monitoring into the code, defining alerts, and automating tasks. Programming expertise, particularly in backend languages, and strong communication skills are essential as the role involves collaborating with both technical and non-technical audiences.
The Site Reliability Engineer will support first line problem diagnosis in real-time distributed environments, oversee large-scale application capacity planning and deployment, and assist in application migrations and upgrades, collaborating effectively with teammates and maintaining clear communication in a fast-paced environment.
As a Senior Lead Software Engineer at Capital One, you will lead a portfolio of technology projects, drive collaboration with digital product managers, and develop cloud-based solutions while mentoring fellow engineers. You will work with various programming languages and technologies to create innovative solutions to meet regulatory needs.
Lead a team of developers on diverse technology projects that create solutions for regulatory compliance. Stay updated on tech trends, mentor other engineers, and collaborate with product managers to deliver cloud-based solutions. Utilize a wide range of programming languages and tools in developing innovative software systems.
As a Principal Site Reliability Engineer at BAE Systems, you will ensure seamless service delivery, tackle complex service requests, support infrastructure for large-scale software applications, and measure performance indicators. In a collaborative team, you will innovate and implement service methodologies while contributing to the company's technological evolution.
As a Site Reliability Engineer at BAE Systems, you will work on deploying and monitoring IaaS, PaaS, and SaaS solutions, while collaborating across teams to ensure seamless service delivery and tackle complex service requests to improve operational support for large-scale software applications.
As an SRE Specialist at Capco, you will enhance system reliability and performance by implementing SRE principles, automating operations, and ensuring security compliance using DevOps best practices. You will oversee CloudOps strategies and lead the implementation of ServiceNow modules, optimizing workflows and service delivery.
Top Companies Hiring Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs
.NET Developer Jobs
Aerospace Thermal Engineering Jobs
AI Engineer Jobs
Android Developer Jobs
Automation Engineer Jobs
Backend Developer Jobs
Blockchain Developer Jobs
C# Jobs
C++ Jobs
Cloud Architect Jobs
Cloud Engineer Jobs
Design Engineer Jobs
DevOps Engineer Jobs
Director Of Engineering Jobs
Electrical Engineering Jobs
Embedded Software Engineer Jobs
Engineering Jobs
Engineering Manager Jobs
Environmental Engineering Jobs
Field Engineer Jobs
Front End Developer Jobs
Full Stack Developer Jobs
Game Developer Jobs
Golang Jobs
Hardware Engineer Jobs
Industrial Engineering Jobs
iOS Developer Jobs
Java Developer Jobs
Javascript Developer Jobs
Linux Jobs
Manufacturing Engineer Jobs
Mechanical Engineering Jobs
Network Engineer Jobs
PHP Developer Jobs
Process Engineer Jobs
Project Engineer Jobs
Prompt Engineering Jobs
Python Jobs
QA Jobs
Robotics Engineer Jobs
Ruby on Rails Jobs
Salesforce Administrator Jobs
Salesforce Developer Jobs
Scala Jobs
Sharepoint Developer Jobs
Site Reliability Engineer Jobs
Software Engineering Manager Jobs
Solutions Architect Jobs
SQL Developer Jobs
Structural Engineer Jobs
System Engineer Jobs
Test Engineer Jobs
Web Developer Jobs
All Filters
No Results
No Results