Lead Site Reliability Engineer

Reposted 6 Days Ago
Hiring Remotely in US
Remote
136K-177K Annually
Senior level
Big Data • Machine Learning • Software • Analytics
We are a leader in Analytic Process Automation.
The Role
As a Lead Site Reliability Engineer, you will drive the reliability strategy, improve system health, lead incident management, and mentor engineers for a multi-region SaaS platform.
Summary Generated by Built In

Meet the Moment with Alteryx


We're living through a once-in-a-generation shift in how work gets done. Data, automation, and AI are quickly becoming the center of every business decision - and Alteryx is leading the transformation.


You'll be working on the challenges that sit at the heart of modern business. No matter your role, the work you do will help organizations move faster, see more clearly, and tackle questions that used to feel impossible.


If you're ready to meet the moment with innovation, curiosity, and excellence, there's a place for you here.

Why work for just any analytics company? At Alteryx, Inc., we are explorers, dreamers and innovators. We’re on a journey to build the best analytics platform in the world, but we can’t do it without people like you, leading the way. Forget the stereotypical tech companies of the past. Embrace the unconventional, exercise your imagination and help alter the future with Alteryx.

We’re looking for a Lead SRE to own reliability outcomes for a modern split-plane, multi-region SaaS platform serving enterprise customers. This is a hands-on technical leadership role focused on system design, reliability strategy, and cross-team execution.
You’ll lead efforts that directly impact SLO attainment, MTTR reduction, and cost efficiency, while shaping how reliability is engineered, measured, and scaled across the platform.


 

What You’ll Do
Define and drive reliability strategy across control-plane and data-plane systems, including multi-region resilience, BCDR, and failover design
Establish and operationalize SLOs, SLAs, and error budgets, ensuring they inform planning and engineering tradeoffs
Lead initiatives that measurably improve MTTR, incident prevention, and overall service health
Own incident management end-to-end, driving systemic fixes and long-term reliability improvements beyond immediate response
Lead architecture and design reviews to ensure systems meet scalability, reliability, and cost efficiency goals
Champion automation and modernization, including AI-driven reliability improvements
Establish and enforce code quality and review standards
Lead cross-functional initiatives and align engineering with product priorities
Mentor senior engineers and act as a technical leader across teams


 

What You Bring
6+ years leading delivery of complex, distributed systems or SaaS platforms
Strong experience with multi-region, split-plane architectures (control-plane / data-plane)
Proven track record improving SLOs, MTTR, and system reliability at scale
Proficiency in languages like Python, Java, C++, or JavaScript
Deep experience with:
Kubernetes (multi-cluster), CI/CD, and GitOps (ArgoCD)
SLO/SLA design, observability, and incident management
Infrastructure as Code and cloud platforms
Disaster recovery, resilience, and security best practices
Strong leadership skills with experience mentoring senior engineers and influencing cross-team decisions


 

Nice to Have
Experience with chaos engineering and large-scale reliability automation
Background in enterprise SaaS platforms or split-plane architectures
Expertise in navigating, understanding and leveraging modern Observability platfroms (Datadog, Grafana, etc)

 (edited) 

Compensation:

Alteryx is committed to fair, equitable, and transparent compensation. Final compensation will be determined by various factors such as your relevant work experience, education, certifications, skills, and geographic location. 

The salary range for this role in the United States is $136,000 - $177,000.

Employees may also be eligible for a wide range of other benefits, such as a bonus or commission, medical, retirement, financial, wellness, time off, employee discounts, and others.

Interested? Learn more and apply today at alteryx.com/careers!

Find yourself checking a lot of these boxes but doubting whether you should apply? At Alteryx, we support a growth mindset for our associates through all stages of their careers. If you meet some of the requirements and you share our values, we encourage you to apply. As part of our ongoing commitment to a diverse, equitable, and inclusive workplace, we’re invested in building teams with a wide variety of backgrounds, identities, and experiences.

Benefits & Perks:

Alteryx has amazing benefits for all Associates which can be viewed here.

For roles in San Francisco and Los Angeles: Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Alteryx will consider for employment qualified applicants with arrest and conviction records.

This position involves access to software/technology that is subject to U.S. export controls. Any job offer made will be contingent upon the applicant’s capacity to serve in compliance with U.S. export controls.

Skills Required

  • 6+ years leading delivery of complex distributed systems or SaaS platforms
  • Strong experience with multi-region split-plane architectures
  • Proven track record improving SLOs and system reliability
  • Proficiency in languages like Python, Java, C++, or JavaScript
  • Deep experience with Kubernetes, CI/CD, and GitOps

Alteryx Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Alteryx and has not been reviewed or approved by Alteryx.

  • Fair & Transparent Compensation Pay is frequently characterized as fair and competitive for the role, with total compensation (including stock/equity and benefits) described as a point of strong satisfaction. Competitive salaries and commissions are positioned as a tool to attract and retain talent.
  • Healthcare Strength Medical, dental, and vision coverage is described as comprehensive, including employer-paid coverage for employees alongside life and disability insurance, FSAs, and mental health support. Wellness programming and related supports (such as fitness reimbursements and organized workouts) reinforce the perceived strength of health-related offerings.
  • Wellbeing & Lifestyle Benefits Perks and lifestyle supports are described as broad, spanning items like health club reimbursement, home-office stipends for remote work, commuter support, and office amenities such as meals and snacks. Flexibility-oriented benefits are also present through hybrid/virtual eligibility and volunteer time.

Alteryx Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Irvine, CA
1,786 Employees
Year Founded: 1997

What We Do

Alteryx is a leader in Analytic Process Automation (APA). The Alteryx APA platform unifies analytics, data science and business process automation in one easy-to-use platform to accelerate digital transformation. Every data worker, regardless of technical acumen, is empowered to be curious and solve problems.

Why Work With Us

Alteryx’s mission is to deliver breakthroughs. We promise customers our technology will help them deliver breakthrough outcomes. We make a similar commitment to employees: Working at Alteryx will be your breakthrough. Whether you are looking to make a change in your career or your life, Alteryx is a place where you will make it happen.

Gallery

Gallery

Similar Jobs

Airwallex Logo Airwallex

Senior Site Reliability Engineer

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
Remote or Hybrid
San Francisco, CA, USA
2200 Employees
Remote
United States
1233 Employees

Launch Potato Logo Launch Potato

Site Reliability Engineer

AdTech • Big Data • Consumer Web • Digital Media • Marketing Tech
Remote
United States
160 Employees

Launch Potato Logo Launch Potato

Site Reliability Engineer

AdTech • Big Data • Consumer Web • Digital Media • Marketing Tech
Remote
United States
160 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account