Senior Customer Site Reliability Engineer - OpenShift Managed Cloud Services

Sorry, this job was removed at 12:19 a.m. (CST) on Saturday, Feb 28, 2026
Hiring Remotely in Ireland, IRL
Remote
Cloud • Information Technology • Internet of Things • Software • Consulting • Infrastructure as a Service (IaaS) • Automation
Creating better technology the open source way
The Role

Job Summary:

Red Hat is looking for a Senior Customer Site Reliability Engineer (CSRE) to join our OpenShift Managed Cloud Services (MCS) team. The Senior CSRE plays a crucial role in ensuring the availability, reliability, and performance of critical services at scale. This role is responsible for independently managing complex systems and solving intricate problems that have a significant impact on service quality and stability.
 

A Senior CSRE has a customer-first mindset and acts as a technical lead for customer escalations, applying expert troubleshooting to ensure timely and effective resolutions that maintain trust and confidence. They leverage extensive experience in software and systems engineering to automate operations, reduce toil, and drive continuous improvement across the service lifecycle. They work autonomously, demonstrating strong judgment and decision-making capabilities while managing non-routine assignments.
 

Collaboration is essential, as you will partner with Technical Account Managers, Services, Fleet SRE, DevOps, and infrastructure teams to address customer-specific and fleet-wide issues, ensuring the stability and functionality of our cloud-based systems.
 

As a champion of Knowledge-Centered Support (KCS), you will document resolutions, root causes, and best practices to enrich the knowledge base and promote self-service solutions. Additionally, you will mentor team members, fostering a collaborative culture of continuous learning that equips them to manage complex challenges.
 

This role is ideal for a highly skilled and motivated individual who thrives in a fast-paced, collaborative environment and is passionate about driving reliability, scalability, and customer satisfaction.
 

Responsibilities

  • Manage large-scale, distributed systems, focusing on minimizing downtime and improving system resilience.

  • Maintain customer trust and confidence by ensuring stability and functionality of services.

  • Drive continuous enhancement of processes, tools, and methodologies to support the evolving needs of the service.

  • Lead the development of code and automation scripts to optimize the scalability, reliability, and performance of services.

  • Lead and participate in high-priority customer escalations, adopting a customer-first mindset.

  • Coordinate and execute complex incident response procedures, ensuring timely resolution and thorough postmortems.

  • Collaborate with cross-functional teams to enhance system robustness.

  • Demonstrate a proactive mindset to help preempt escalations and ensure reliable operations.

  • Document resolutions, root causes, and best practices to enrich the knowledge base and promote self-service solutions.

  • Mentor and coach team members, fostering a culture of continuous learning, knowledge sharing and collaboration.

  • Participate in on-call rotation and provide leadership during critical incidents.

  • Collaborate on strategic AI and automation projects designed to increase the efficiency of fleet operations and troubleshooting, ultimately delivering a better product experience for customers.

  • Given the customer-facing nature of this SRE role, exceptional communication skills are essential. You must demonstrate the ability to articulate complex technical solutions and lead critical incident calls with confidence, even in high-pressure environments.

Required Skills

  • Advanced experience with OpenShift/Kubernetes container platform support or administration.

  • Proficient with container-based technologies on Linux.

  • Proficient in managing Linux-based systems in a public cloud such as AWS, Azure, or GCP.

  • Advanced experience with enterprise systems monitoring; knowledge of Prometheus is preferred.

  • Advanced experience with enterprise configuration management tools such as Ansible or Terraform.

  • Software engineering experience using object-oriented languages; Go is preferred.

  • Superior communication skills and experience working directly with and presenting to customers.

  • Ability to quickly learn new technologies and follow industry trends.

  • Demonstrated ability to quickly and accurately troubleshoot systems issues.

  • Solid understanding of standard TCP/IP networking and common protocols.

About Red Hat

Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. Spread across 40+ countries, our associates work flexibly across work environments, from in-office, to office-flex, to fully remote, depending on the requirements of their role. Red Hatters are encouraged to bring their best ideas, no matter their title or tenure. We're a leader in open source because of our open and inclusive environment. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact.

Inclusion at Red Hat
Red Hat’s culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. When this is realized, it empowers people from different backgrounds, perspectives, and experiences to come together to share ideas, challenge the status quo, and drive innovation. Our aspiration is that everyone experiences this culture with equal opportunity and access, and that all voices are not only heard but also celebrated. We hope you will join our celebration, and we welcome and encourage applicants from all the beautiful dimensions that compose our global village.

Equal Opportunity Policy (EEO)
Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.


Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.

Red Hat supports individuals with disabilities and provides reasonable accommodations to job applicants. If you need assistance completing our online job application, email [email protected]. General inquiries, such as those regarding the status of a job application, will not receive a reply.

The Company
HQ: Raleigh, NC
20,000 Employees
Year Founded: 1993

What We Do

At Red Hat, we connect an innovative community of customers, partners, and contributors to deliver an open source stack of trusted, high-performing solutions. We offer cloud, Linux, middleware, storage, and virtualization technologies, together with award-winning global customer support, consulting, and implementation services. Red Hat is a rapidly growing company supporting more than 90% of Fortune 500 companies.

Why Work With Us

Red Hatters freely exchange different viewpoints, contribute ideas, and solve problems together. Our love of collaboration, accountability, a sense of community, and a measure of autonomy combine to create a powerful force that fosters innovation and makes Red Hat a great place to work.
