Staff/Principal Site Reliability Engineer

Reposted 3 Days Ago
Easy Apply
Hiring Remotely in USA
Remote
184K-240K Annually
Senior level
Information Technology • Security • Cybersecurity
The Role
The Staff/Principal Site Reliability Engineer leads infrastructure initiatives, architects solutions for cloud and SaaS, and collaborates cross-functionally to enhance reliability and innovation.
Summary Generated by Built In

We are seeking an exceptional Staff/Principal Site Reliability Engineer to lead critical infrastructure initiatives and drive innovation across our organization. You'll architect scalable solutions, navigate complex technical challenges independently, and deliver results under tight deadlines in a fast-paced environment. You'll work cross-functionally alongside builders who have helped shape the success of companies such as Google, Okta, AWS, and Snowflake.

We are building the next generation identity security platform for the multi-cloud era - will you join us?

You will:

Strategic Leadership & Technical Execution

  • Lead enterprise-wide reliability and infrastructure projects across multiple teams with high autonomy
  • Navigate ambiguous problem spaces and deliver innovative solutions under tight deadlines
  • Architect and deploy solutions for Cloud Prem and SaaS customers at scale
  • Drive technical innovation and establish SRE best practices across the organization
  • Respond to critical incidents, lead root cause analysis, and implement long-term resolutions
  • Develop automation solutions to streamline operations and reduce manual workload
  • Participate in on-call rotation and ensure effective incident handoff and documentation

Cross-Functional Collaboration & Communication

  • Partner with Engineering, Product, and Customer Success teams to align reliability goals with business objectives
  • Communicate complex technical concepts effectively to technical and non-technical audiences, including executives
  • Influence technical decisions across teams through thought leadership and demonstrated expertise
  • Build consensus and drive adoption of new tools, processes, and architectural patterns

Customer-Facing Technical Leadership

  • Provide tier 2/3 technical support to enterprise customers for complex troubleshooting
  • Work directly with customer technical teams to resolve deployment, configuration, and integration challenges
  • Conduct technical onboarding and provide expert guidance on platform architecture and best practices
  • Create customer-facing documentation, troubleshooting guides, and run-books
  • Lead customer calls and technical discussions as a trusted advisor

Team Development

  • Mentor SRE and engineering team members, elevating technical capabilities
  • Foster a culture of reliability, operational excellence, and continuous improvement
You have:

Required Experience

  • BS degree in Computer Science or related field (or equivalent practical experience)
  • 7+ years in Site Reliability Engineering, DevOps, or Infrastructure Engineering
  • Proven track record leading large-scale, cross-team infrastructure projects from conception to production
  • Demonstrated ability to work autonomously on ambiguous projects with tight deadlines

Technical Expertise

  • 5+ years with AWS (VPC, EC2, RDS, EKS, CloudFormation) and cloud automation
  • Expert-level experience with Kubernetes, Helm, Linux, and Terraform
  • Strong experience with GitOps model, distributed version control, and CI/CD pipelines
  • Proficiency with monitoring tools (Prometheus, Grafana, DataDog)
  • Strong programming/scripting skills (Python, Go, Bash) for automation
  • Deep understanding of distributed systems, microservices, and reliability patterns
  • Experience with Bazel and CueLang a plus

Leadership & Communication

  • Exceptional ability to articulate complex technical concepts to diverse audiences
  • Track record of driving technical change across organizational boundaries
  • Successfully delivered multiple complex projects under tight deadlines
  • Strong customer service orientation with patience and empathy

Work Style

  • Thrives in ambiguous environments and makes progress without perfect information
  • Hands-on, "can do" attitude with bias for action
  • Low ego and high intellectual curiosity
  • Comfortable working across time zones
  • Self-motivated with strong ownership mentality

The compensation for this role depends on several factors such as the candidate's skills, qualifications, experience, and work location. For candidates offered a position at the posted job level, the provided range is the expected base salary. This does not include any additional variable compensation, such as commission.

Compensation Disclosure
$184,000$240,000 USD

Our Culture 

We’re driven to build a strong company culture and are looking for individuals with solid alignment with the following:

  • Ownership Mindset
  • Act with Integrity
  • Guardians of our Customers
  • Opinionated Humility
  • Build Trust, Earn Trust

At Veza, your base pay is one part of your total compensation package. For this position, the reasonably expected pay range can be discussed with your recruiter for the level at which this job has been scoped. Your base pay will depend on several factors, including your experience, qualifications, education, location, and skills. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for equity and a competitive benefits package.

Veza is proud to be an equal opportunity employer. We are committed to equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other applicable legally protected characteristics. We also consider qualified applicants according to applicable federal, state, and local laws. If a candidate with a disability requires an accommodation during the recruitment process, please email [email protected]

About Veza

Veza is the identity security company. Identity and security teams use Veza to secure identity access across SaaS apps, on-prem apps, data systems, and cloud infrastructure. Veza solves the blind spots of traditional identity tools with its unique ability to ingest and organize permissions metadata in the Veza Authorization Graph. Global enterprises like Blackstone, Wynn Resorts, and Expedia trust Veza to visualize access permissions, monitor permissions activity, automate access reviews, and remediate privilege violations. Founded in 2020, Veza is headquartered in Redwood City, California, and is funded by Accel, Bain Capital, Ballistic Ventures, GV, Norwest Venture Partners, and True Ventures. Visit us at veza.com and follow us on LinkedIn, Twitter, and YouTube.

Top Skills

AWS
Bash
Bazel
Cuelang
Datadog
Gitops
Go
Grafana
Helm
Kubernetes
Linux
Prometheus
Python
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
160 Employees
Year Founded: 2020

What We Do

Veza is the authorization platform for data security. Designed for hybrid, multi-cloud environments, Veza enables organizations to easily understand, manage and control who can and should take what action on what data. We empower customers to leverage the power of authorization for an identity-first approach to security, addressing critical business needs tied to managing access governance, data lake security, cloud entitlements, privileged access, and more. Global enterprises like Blackstone, ASAPP, Barracuda Networks, Choice Hotels, and a number of Fortune 500 and emerging organizations trust Veza to secure their enterprise data. Founded in 2020, Veza is headquartered in Los Gatos, California and is funded by Accel, Bain Capital, Ballistic Ventures, GV, Norwest Venture Partners, and True Ventures. To learn more please visit us at www.veza.com.

Similar Jobs

Circle Logo Circle

Senior Site Reliability Engineer

Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
Remote
United States of America
1050 Employees
148K-195K Annually

Zillow Logo Zillow

Senior Site Reliability Engineer

Other • Real Estate • PropTech
Remote
USA
7863 Employees
153K-257K Annually
In-Office or Remote
Andover, MA, USA
651 Employees
120K-140K Annually

NVIDIA Logo NVIDIA

Site Reliability Engineer

Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
In-Office or Remote
2 Locations
21960 Employees
248K-391K Annually

Similar Companies Hiring

Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account