Advertisement

Senior Site Reliability Engineer

Sorry, this job was removed at 9:10 a.m. (CST) on Wednesday, December 22, 2021
Find out who's hiring in San Francisco, CA.
See all Developer + Engineer jobs in San Francisco, CA
Apply
By clicking Apply Now you agree to share your profile information with the hiring company.

Job Summary:

Do you want to be part of a team that makes streaming magic through one of the most reliable streaming services in the World? Our SREs provide expert engineering services in cloud automation, and reliability engineering to all of our services that power streaming for Disney+, ESPN+, and more, home to 100 million+ subscribers and ESPN fight nights. We are passionate about our services running with maximum uptime and minimum latency so that our subscribers have the best streaming experience of all our content.

As a Senior engineer, you are looked at by your fellow team members as a ‘go to’ individual; you are someone who has a clear understanding of, and can thoroughly elaborate on SRE principles and best practices to a given audience.To be successful in this role you will continuously uphold and improve all the relevant reliability aspects for our services, with an increased focus on SLIs and SLOs, while raising the reliability of a variety of large scale user facing and internal services.

Teams are located in New York, San Francisco, Manchester UK, Poland, Amsterdam and more.

Responsibilities:

  • Deploy and manage innovative modern cloud technologies using infrastructure-as-code, self-healing, and security automation patterns;
  • Develop useful telemetry, alerts, and response to reduce Mean Time To Repair (MTTR);
  • Collaborate and provide technical excellence within and across teams;
  • Consult on best practices and develop tools to enable smooth adoptions of good service reliability practices and methods;
  • Identify areas of improvement in reliability, efficiency, and operations;
  • Build tools to help your SRE team quickly pinpoint, isolate and resolve issues related to infrastructure, platform services and applications;
  • Continuously refine monitoring processes, configurations, and thresholds;
  • Practice and promote sustainable incident response and blameless postmortems
  • Develop runbooks and tools to streamline processes and shorten problem resolution time;
  • Write code that improves scalability, performance, maintainability, and security;
  • Add, tune and maintain alert configurations and documentation as needed;
  • Operate in the high-pressure environment and troubleshoot complex issues across distributed applications quickly, while successfully handling multiple priorities;
  • Cultivate full-team participation in high quality, thoughtful software;
  • Develop and improve CI/CD processes to improve release cadence and success;
  • Use Chaos Engineering principles and methodologies to test what you build under real-world conditions;
  • Mentor SREs in technical and non-technical SRE responsibilities;
  • Take primary responsibility for large (multi-person) efforts, including planning, execution, and training

Basic Qualifications:

  • Creative and innovative outside the box thinking
  • 5-7 years of experience in SRE, devops, technical operations, systems engineering, software engineering or related discipline
  • Proficient, collaborative, & experienced in building reliable, scalable, enterprise systems
  • Excellent communication skills, both verbal and written
  • Passionate and curious about ways to leverage technology while continually learning
  • Ability to identify root-cause sources of instability in a high-traffic, large-scale distributed systems
  • Experience in designing, building, and operating large-scale production systems
  • Efficiently skilled with the use of containers in enterprise production environments (e.g. Docker, Kubernetes, LXC, AWS ECS and EKS)
  • Configuration management and orchestration (e.g. Terraform, Cloud Formation, Ansible)
  • Comfortable in one or more of the following languages (Python, Java, Scala, Go, Rust, Ruby, or similar)
  • Scripting languages like Ruby, Bash, PowerShell or Python;
  • Skilled in Cloud/PaaS/SaaS Environments (e.g. AWS, Azure, Google Cloud Compute)
  • Hands-on experience using source control (Git, GitHub) and feature branching strategies
  • Experience with continuous integration tools (e.g. Jenkins, Gitlab CI/CD, AWS CodeBuild, CodeDeploy, CodePipeline, Azure DevOps, Spinnaker)
  • Knowledge of best practices and IT operations in an always-up, always-available service;
  • Possess expertise in scalable testing, automation, continuous integration frameworks and best practices;
  • Experience in SDLC, distributed systems, networking, hardware, logistics and operations or capacity planning;
  • UNIX/Linux administration, troubleshooting, performance tuning, and security

Preferred:

  • Experience with DevOps methodologies and/or SRE
  • Experience with container orchestration systems, such as AWS ECS or Kubernetes
  • Experience with monitoring and observability tooling such as Datadog, Prometheus, Grafana
  • Experience with automating infrastructure, deployment and testing using tools like Cloudformation, Ansible or Terraform, and can explain the Infrastructure as Code paradigm
  • Experience with Service Level Objectives and Error Budgets
  • Experience with configuration management, such as Puppet and Ansible
  • Understanding of the principles and methodologies behind Chaos Engineering
  • Experience with software development in Java, Scala, etc
  • BS Degree in Computer Science, Electrical & Computer Engineering or Mathematics;
Read Full Job Description
Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.

Technology we use

  • Engineering
    • C++Languages
    • JavascriptLanguages
    • PHPLanguages
    • PythonLanguages
    • ScalaLanguages
    • SqlLanguages
    • SwiftLanguages
    • Backbone.jsFrameworks
    • DjangoFrameworks
    • HadoopFrameworks
    • JSFFrameworks
    • MeteorFrameworks
    • Node.jsFrameworks
    • Ruby on RailsFrameworks

An Insider's view of The Walt Disney Company

How does the company support your career growth?

Over my 13 years with the company, I’ve had passionate leaders and colleagues with diverse backgrounds who have taught me and given me opportunities to expand into areas I never thought possible. You have the freedom to take career risks and apply your previous experience in ways you may not anticipate.

Chase

Product Management Director

What is your vision for the company?

Disney has always been at the heart of the evolution of the media industry, and technology is an essential part of that. The way that we tell and consume stories in the future is going to be completely different than it is today, and The Walt Disney Company is uniquely positioned to shape and create that future.

Jamie

SVP/Chief Technology Officer, The Walt Disney Studios

What are The Walt Disney Company Perks + Benefits

The Walt Disney Company Benefits Overview

Because our employees and cast members are at the heart of everything we do, Disney offers a competitive total rewards package that includes pay, health and savings benefits, time-off programs, educational opportunities and more. Together, these rewards make up a comprehensive package that help you live your best life, grow personally and professionally and take advantage of the special extras that only Disney can provide.

Eligibility for certain reward programs will vary based on your job status, work location and/or the terms of any applicable collective bargaining agreement.

Culture
Volunteer in local community
Partners with nonprofits
Diversity
Dedicated diversity and inclusion staff
Diversity employee resource groups
Hiring practices that promote diversity
Health Insurance + Wellness
Dental insurance
Vision insurance
Health insurance
Life insurance
Mental health benefits
Financial & Retirement
401(K)
401(K) matching
Charitable contribution matching
Child Care & Parental Leave
Childcare benefits
Generous parental leave
Vacation + Time Off
Generous PTO
Paid holidays
Paid sick days

More Jobs at The Walt Disney Company

Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.
Learn more about The Walt Disney CompanyFind similar jobs like this