Lead Site Reliability Engineer

Posted 10 Hours Ago
Hiring Remotely in US
Remote
136K-177K Annually
Senior level
Big Data • Machine Learning • Software • Analytics
We are a leader in Analytic Process Automation.
The Role
As a Lead Site Reliability Engineer, you will drive the reliability strategy, improve system health, lead incident management, and mentor engineers for a multi-region SaaS platform.
Summary Generated by Built In

Meet the Moment with Alteryx


We're living through a once-in-a-generation shift in how work gets done. Data, automation, and AI are quickly becoming the center of every business decision - and Alteryx is leading the transformation.


You'll be working on the challenges that sit at the heart of modern business. No matter your role, the work you do will help organizations move faster, see more clearly, and tackle questions that used to feel impossible.


If you're ready to meet the moment with innovation, curiosity, and excellence, there's a place for you here.

Why work for just any analytics company? At Alteryx, Inc., we are explorers, dreamers and innovators. We’re on a journey to build the best analytics platform in the world, but we can’t do it without people like you, leading the way. Forget the stereotypical tech companies of the past. Embrace the unconventional, exercise your imagination and help alter the future with Alteryx.

We’re looking for a Lead SRE to own reliability outcomes for a modern split-plane, multi-region SaaS platform serving enterprise customers. This is a hands-on technical leadership role focused on system design, reliability strategy, and cross-team execution.
You’ll lead efforts that directly impact SLO attainment, MTTR reduction, and cost efficiency, while shaping how reliability is engineered, measured, and scaled across the platform.


 

What You’ll Do
Define and drive reliability strategy across control-plane and data-plane systems, including multi-region resilience, BCDR, and failover design
Establish and operationalize SLOs, SLAs, and error budgets, ensuring they inform planning and engineering tradeoffs
Lead initiatives that measurably improve MTTR, incident prevention, and overall service health
Own incident management end-to-end, driving systemic fixes and long-term reliability improvements beyond immediate response
Lead architecture and design reviews to ensure systems meet scalability, reliability, and cost efficiency goals
Champion automation and modernization, including AI-driven reliability improvements
Establish and enforce code quality and review standards
Lead cross-functional initiatives and align engineering with product priorities
Mentor senior engineers and act as a technical leader across teams


 

What You Bring
6+ years leading delivery of complex, distributed systems or SaaS platforms
Strong experience with multi-region, split-plane architectures (control-plane / data-plane)
Proven track record improving SLOs, MTTR, and system reliability at scale
Proficiency in languages like Python, Java, C++, or JavaScript
Deep experience with:
  Kubernetes (multi-cluster), CI/CD, and GitOps (ArgoCD)
  SLO/SLA design, observability, and incident management
  Infrastructure as Code and cloud platforms
  Disaster recovery, resilience, and security best practices
Strong leadership skills with experience mentoring senior engineers and influencing cross-team decisions


 

Nice to Have
Experience with chaos engineering and large-scale reliability automation
Background in enterprise SaaS platforms or split-plane architectures
Expertise in navigating, understanding, and leveraging modern observability platforms (Datadog, Grafana, etc.)


Compensation:

Alteryx is committed to fair, equitable, and transparent compensation. Final compensation will be determined by various factors such as your relevant work experience, education, certifications, skills, and geographic location. 

The salary range for this role in the United States is $136,000 - $177,000.

Employees may also be eligible for a wide range of other benefits, such as a bonus or commission, medical, retirement, financial, wellness, time off, employee discounts, and others.

Interested? Learn more and apply today at alteryx.com/careers!

Find yourself checking a lot of these boxes but doubting whether you should apply? At Alteryx, we support a growth mindset for our associates through all stages of their careers. If you meet some of the requirements and you share our values, we encourage you to apply. As part of our ongoing commitment to a diverse, equitable, and inclusive workplace, we’re invested in building teams with a wide variety of backgrounds, identities, and experiences.

Benefits & Perks:

Alteryx has amazing benefits for all Associates which can be viewed here.

For roles in San Francisco and Los Angeles: Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Alteryx will consider for employment qualified applicants with arrest and conviction records.

This position involves access to software/technology that is subject to U.S. export controls. Any job offer made will be contingent upon the applicant’s capacity to serve in compliance with U.S. export controls.

Top Skills

ArgoCD
C++
CI/CD
Cloud Platforms
Datadog
GitOps
Grafana
Infrastructure as Code
Java
JavaScript
Kubernetes
Python

The Company
HQ: Irvine, CA
1,786 Employees
Year Founded: 1997

What We Do

Alteryx is a leader in Analytic Process Automation (APA). The Alteryx APA platform unifies analytics, data science and business process automation in one easy-to-use platform to accelerate digital transformation. Every data worker, regardless of technical acumen, is empowered to be curious and solve problems.

Why Work With Us

Alteryx’s mission is to deliver breakthroughs. We promise customers our technology will help them deliver breakthrough outcomes. We make a similar commitment to employees: Working at Alteryx will be your breakthrough. Whether you are looking to make a change in your career or your life, Alteryx is a place where you will make it happen.


