Sr. Engineering Manager, Tooling and Reliability Platforms

Posted 14 Days Ago
Hiring Remotely in United States of America
Remote
136K-284K Annually
Senior level
AdTech • Digital Media • Information Technology • Other
The Role
Lead the Tooling & Reliability Platforms team to enhance reliability and efficiency using AI-driven strategies. Manage incident management tools and foster a culture of innovation in engineering practices.
Summary Generated by Built In
It takes powerful technology to connect our brands and partners with an audience of hundreds of millions of people. Whether you’re looking to write mobile app code, engineer the servers behind our massive ad tech stacks, or develop algorithms to help us process trillions of data points a day, what you do here will have a huge impact on our business—and the world.

A Little About Us

Our Tooling and Reliability Platforms team operates as a foundational pillar of the Central Technology Organization. We provide the "paved road" for Yahoo's diverse verticals, enabling them to ship world-class products at a global scale. Our mission is to build modern, secure, and highly efficient platforms that power all of Yahoo's brands, with a relentless focus on Engineered Resilience.

A Lot About You

We are looking for a strategic Senior Engineering Manager (M4) to lead our Tooling & Reliability Platforms team. You are a Product Lead for the "paved road" of reliability at Yahoo, managing a large squad of engineers responsible for our incident management ecosystem while evolving these tools into a comprehensive, AI-augmented Reliability Platform.

You are strategic about the north star of Engineered Resilience, owning the roadmap for automated diagnostics and chaos engineering. You foster a culture of high-trust and continuous experimentation, where engineers are empowered to use modern tools to solve complex reliability challenges. You understand that in a modern engineering org, reliability is achieved through a mix of elite software engineering and intelligent automation.

Key Responsibilities

  • Engineering Leadership & Productivity: Manage and grow a high-performing team . Identify and implement AI-driven efficiencies in the product lifecycle to accelerate platform delivery and engineering productivity.

  • Product & Workflow Ownership: Treat the reliability stack as a product. Define the roadmap for the Incident Management platform, ensuring these tools reduce cognitive load for hundreds of service teams by replacing manual investigation steps with AI-assisted workflows.

  • AIOps & Governance: Drive the integration of GenAI and SRE Agents into production environments. Establish frameworks for validating AI-generated incident summaries and hypothesis generation to ensure accuracy and prevent automated hallucinations.

  • Resilience Engineering: Define the vision for the next generation of Resilience Engineering, focusing on building services that make products inherently resilient through automated alert diagnostics and self-healing systems.

  • Vendor Advocacy: Act as a high-leverage partner to our key vendors, holding them accountable for roadmap delivery and ensuring their features align with our team vision.

Who You Are

  • A Builder & A Leader: Experience managing manager-level or senior IC reports in a high-scale environment, with a track record of building internal platforms.

  • Product-Minded: You don’t just "install" tools; you architect a "paved road" that engineers want to use, focusing on reducing friction through intelligent automation.

  • AI-Forward: You possess a commitment to combining SRE with LLMs and have the expertise to convert AI potential into effective, real-world automation and structured prompt interaction with AI tools.

  • Strategic & Adaptive: Ability to manage day-to-day operations while pivoting strategy to account for emerging AI-driven reliability trends.

Basic Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

  • 5+ years of experience leading SRE or DevOps teams in a high-scale, cloud-native environment.

  • Strong background in Software Engineering (Python, Go, or Java) and Infrastructure-as-Code.

  • Deep familiarity with incident management and AIOps tools (e.g., Rootly, PagerDuty, BigPanda).

  • Experience evaluating and refining AI-generated outputs in a technical or operational context.

  • Proven ability to collaborate with SaaS partners to influence a collective product vision.

  • Comfort operating in an evolving, AI-augmented environment with a focus on continuous learning.

Preferred Qualifications

  • East coast timezone preference

  • Experience with BCP/DR planning or Chaos Engineering.

  • Previous experience implementing large-scale AIOps or "Self-Healing" infrastructure initiatives.


The material job duties and responsibilities of this role include those listed above as well as adhering to Yahoo policies; exercising sound judgment; working effectively, safely and inclusively with others; exhibiting trustworthiness and meeting expectations; and safeguarding business operations and brand integrity.

At Yahoo, we offer flexible hybrid work options that our employees love! While most roles don’t require regular office attendance, you may occasionally be asked to attend in-person events or team sessions. You’ll always get notice to make arrangements. Your recruiter will let you know if a specific job requires regular attendance at a Yahoo office or facility. If you have any questions about how this applies to the role, just ask the recruiter!

Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo will consider for employment qualified applicants with criminal histories in a manner consistent with applicable law. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form (www.yahooinc.com/careers/contact-us.html) or call +1.866.772.3182. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.

We believe that a diverse and inclusive workplace strengthens Yahoo and deepens our relationships. When you support everyone to be their best selves, they spark discovery, innovation and creativity. Among other efforts, our 11 employee resource groups (ERGs) enhance a culture of belonging with programs, events and fellowship that help educate, support and create a workplace where all feel welcome.

The compensation for this position ranges from $136,125.00 - $283,750.00/yr and will vary depending on factors such as your location, skills and experience.The compensation package may also include incentive compensation opportunities in the form of discretionary annual bonus or commissions. Our comprehensive benefits include healthcare, a great 401k, backup childcare, education stipends and much (much) more.

Currently work for Yahoo? Please apply on our internal career site.

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Sunnyvale, CA
10,001 Employees

What We Do

Yahoo is a global media and tech company that connects people to their passions. We reach nearly 900 million people around the world, bringing them closer to what they love—from finance and sports, to shopping, gaming and news—with the trusted products, content and tech that fuel their day. For partners, we provide a full-stack platform for businesses to amplify growth and drive more meaningful connections across advertising, search and media.

Similar Jobs

Optum Logo Optum

Care Manager RN - LA Market - Remote

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office or Remote
Metairie Terrace, LA, USA
160000 Employees
60K-107K Annually

CDW Logo CDW

Delivery Mgr

Information Technology
Remote or Hybrid
CO, USA
15100 Employees
121K-145K Annually

CDW Logo CDW

Administrative Assistant

Information Technology
Remote or Hybrid
US
15100 Employees
22-30 Hourly

CDW Logo CDW

Senior Business Analyst

Information Technology
Remote or Hybrid
US
15100 Employees
102K-143K Annually

Similar Companies Hiring

Milestone Systems Thumbnail
Artificial Intelligence • Other • Security • Software • Analytics • Big Data Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Hardware • Other • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account