Manager, Site Reliability Engineering

Posted 13 Days Ago
New York, NY
In-Office
96K-160K Annually
Senior level
News + Entertainment
The Role
The Manager of Site Reliability Engineering will lead and mentor a team focused on maintaining AWS-based infrastructure, ensuring platform stability, scalability, performance, and security for digital sports streaming applications.
Summary Generated by Built In

Madison Square Garden Entertainment Corp. (MSG Entertainment) is a leader in live entertainment, delivering unforgettable experiences while forging deep connections with diverse and passionate audiences. The Company’s portfolio includes a collection of world-renowned venues (New York’s Madison Square Garden, The Theater at Madison Square Garden, Radio City Music Hall, and Beacon Theatre, as well as The Chicago Theatre) that showcase a broad array of sporting events, concerts, family shows, and special events for millions of guests annually. In addition, the Company features the original production, the Christmas Spectacular Starring the Radio City Rockettes, which has been a holiday tradition for 90 years. More information is available at www.msgentertainment.com.

Who are we hiring?  

The Manager, Site Reliability Engineering will lead the platform stability, scalability, and security efforts for our digital sports streaming application. This is a hands-on technical leadership role focused on maintaining the reliability of our AWS-based infrastructure, enhancing observability and automation, and ensuring the performance and security of systems that power live and on-demand video streaming. This role will be central to triaging video playback issues, guiding cloud architecture, and reducing mean time to recovery (MTTR).

What will you do?  

  • Own the reliability, performance, and security of the platform infrastructure that supports our live and on-demand video streaming app.
  • Lead and grow a small technical team (SRE, VideoOps) and act as a hands-on mentor and contributor.  
  • Design and maintain robust monitoring, logging, and alerting systems, using tools such as CloudWatch, Datadog, and Conviva, to ensure visibility into platform health, fast incident response, and high availability across our video streaming infrastructure.  
  • Define and enforce operational best practices including disaster recovery, redundancy, backup, and failover strategies.  
  • Investigate and resolve complex issues across the application stack, from infrastructure and APIs to video delivery and playback.  
  • Lead incident response efforts and participate in an on-call rotation during peak traffic events (typically evenings EST).  
  • Collaborate with Product and Engineering teams to guide architectural decisions that prioritize platform resilience, scalability, and security. 
  • Partner with L1 Operations and Customer Care teams to triage issues, drive incident resolution, and close the loop on recurring or systemic problems.
  • Own the implementation and continuous strengthening of platform security, including identity management, secrets handling, IAM policies, and AWS-level hardening. 
  • Evaluate and introduce new tools, technologies, and architectural patterns to improve the reliability of the system.  
  • Track and improve SLAs, SLOs, and operational KPIs related to uptime, latency, video playback quality, and security posture.  

What do you need to succeed? 

  • 5+ years of experience in SRE, DevOps, or platform infrastructure roles, with 2+ years in a team lead or manager capacity.  
  • Experience operating and scaling production environments in AWS, including services like CloudFront, Lambda, S3, API Gateway, and CloudWatch. 
  • AWS Certification (Solutions Architect, DevOps Engineer, or similar) or equivalent deep hands-on experience.  
  • Strong background in system observability, with experience using tools like Conviva, CloudWatch, and Datadog for monitoring, distributed tracing, and alerting.  
  • Deep understanding of video streaming architecture including HLS/DASH, CDNs, DRM, SSAI, and multi-platform delivery (mobile, web, CTV).  
  • Expertise in scripting and automation using Python, Bash, or similar, with infrastructure-as-code tools like Terraform or CloudFormation.
  • Proven ability to lead platform security initiatives, including IAM policy management, token handling, and securing service architecture. 
  • Experience collaborating with engineering teams to improve CI/CD pipelines, automate infrastructure changes, and support safe production releases.  
  • Strong analytical and troubleshooting skills across application, network, and video delivery layers. 
  • Excellent communication skills with the ability to drive cross-functional alignment and manage vendor relationships.
  • Participation in an after-hours on-call rotation is expected, particularly during live sporting events and high-traffic periods.

Pay Range
$96,000-$160,000 USD

At MSG, we recognize the importance of upskilling employees’ talents and strengths so they can drive their careers forward. We are proud to offer a robust set of tools and resources to help employees understand their interests and purpose, harness their talents and obtain the skills they need to reach the next step in their careers. Growth and longevity for our employees are top priorities here.

We value diversity and are looking for extraordinary employees of all backgrounds! MSG is an Equal Opportunity Employer and provides equal employment opportunities to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, gender identity or expression, sexual and reproductive health choices, national origin, citizenship, age, genetic information, disability, or veteran status. In addition to federal law mandates, MSG complies with all applicable state and local laws governing nondiscrimination in all locations and will consider requests for reasonable accommodations as required.

Top Skills

API Gateway
AWS
Bash
CloudFormation
CloudFront
CloudWatch
Conviva
Datadog
Lambda
Python
S3
Terraform

The Company
New York, New York
1,651 Employees

What We Do

MSG Entertainment is a world leader in live entertainment, comprising world-renowned venues and marquee brands. Utilizing our powerful assets and expertise, we produce, present, or host a variety of entertainment and sports events, delivering unforgettable experiences for millions of fans each year. Across our venues and brands, MSG Entertainment sets the standard for excellence and innovation while forging deep connections with diverse and passionate audiences.

Our Company includes our portfolio of iconic venues: New York’s Madison Square Garden, The Theater at Madison Square Garden, Radio City Music Hall, and Beacon Theatre, as well as The Chicago Theatre. Each is a prominent destination for unforgettable experiences and events. With flexible seating capacities and configurations that range from 2,800 to 21,000, our venues enable us to showcase a broad array of compelling sporting events, concerts, family shows, and special events that cover a wide spectrum of genres. This includes Madison Square Garden, known as “The World’s Most Famous Arena,” which serves as home to the New York Knicks and New York Rangers, two of the most recognized franchises in professional sports, and perennially hosts the biggest names in music and entertainment.

MSG Entertainment also features the wholly-owned original production, the Christmas Spectacular Starring the Radio City Rockettes, which has been a holiday tradition for generations of fans at Radio City Music Hall since 1933. The show’s enduring popularity is driven by the world-famous Radio City Rockettes, the longest-running precision dance company in America.

More information is available at www.msgentertainment.com
