Principal Site Reliability Engineer

Posted 8 Days Ago
Be an Early Applicant
Vancouver, BC, CAN
Hybrid
260K-275K Annually
Expert/Leader
Software
The Role
Design, build, and operate shared, reusable platform infrastructure and services (Kubernetes, CI/CD, messaging, databases, service mesh) across multi-cloud, multi-region environments. Develop automation, observability, and APIs; support internal teams and participate in on-call rotations to ensure reliability, scalability, and performance.
Summary Generated by Built In
Why Join Saviynt
 
•        Work on a mission-critical SaaS platform used by global enterprises
•        Solve complex reliability challenges at scale
•        Influence architecture and engineering culture at a company level
•        Competitive compensation, benefits, and growth opportunities
 
 
Security & Compliance
 
This role requires compliance with Saviynt’s information security and privacy policies, including annual security training

What You Will Be Doing

    In this pivotal role, you will be instrumental in designing, building, and maintaining the shared infrastructure services and platforms that our product and application teams will depend on
     
    You will focus on creating reusable, reliable, and scalable solutions that abstract away complexity, enabling other teams to focus on their core business logic and deliver features faster in a multi-cloud environment
     
    Design and build core platform components and shared infrastructure services that other development teams will integrate with and leverage to deploy and operate their applications
     
    Architect, implement, and manage highly available and scalable Kubernetes platforms as a service for internal consumers
     
    Develop robust, internal-facing tools and automation for infrastructure provisioning and management primarily using Go (Golang)
     
    Architect and optimize foundational solutions within Cloud environments (AWS, Azure, etc.), focusing on creating reusable patterns and modules for other teams
     
    Design and implement shared Event-Driven Architecture components and messaging platforms using technologies like Kafka or Google Pub/Sub that product teams can easily utilize
     
    Develop and maintain robust CI/CD pipelines (e.g., GitLab CI and ArgoCD) as a service, providing standardized and automated deployment workflows for various development teams
     
    Design and build resilient Distributed Systems components that serve as building blocks for other applications, focusing on reliability, fault tolerance, and performance
     
    Manage and optimize our shared infrastructure across Multi-Region Cloud Environments, ensuring that platform services are globally available and performant for all consumers
     
    Establish and enhance centralized Observability and Monitoring platforms and tools that provide self-service insights for consuming teams
     
    Define and implement clear, well-documented RESTful API designs for the infrastructure services you build, ensuring ease of integration for internal clients
     
    Implement and manage Service Mesh (e.g., Envoy, Istio) capabilities, providing traffic management, security, and policy enforcement as a shared platform for services
     
    Design, implement, and optimize highly available Relational Database services or shared data platforms for broad organizational use
     
    Collaborate closely with product development teams to understand their infrastructure needs and pain points, providing technical guidance and support
     
    Participate in on-call rotations to support the critical shared infrastructure you build

What You Bring

    9+ years of experience in an Infrastructure Development, Platform Engineering, or Site Reliability Engineering role, with a strong focus on building tools and services for other engineers
     
    Deep expertise with Kubernetes in production environments, particularly in providing it as a platform(i.e single tenant and multi-tenant deployment architectures)
     
    Strong programming skills in Go (Golang) and Python, with experience building robust, maintainable backend services and automation
     
    Extensive hands-on experience with at least one major Cloud Provider (AWS, GCP, or Azure); multi-cloud experience is a strong plus, especially in building abstractions over them
     
    Proven experience designing and implementing Event-Driven Architecture and message queuing systems (e.g., Kafka, RMQ, NATS) as shared services
     
    Solid understanding and practical experience with CI/CD pipeline tools (especially GitLab CI) and experience establishing automated delivery processes for other teams
     
    Demonstrable experience designing and operating Distributed Systems, with an understanding of patterns for creating reliable, shared components
     
    Familiarity with Multi-Region Cloud Environments and strategies for building globally distributed and highly available platform
     
    Proficiency in establishing and utilizing comprehensive Observability and Monitoring platforms (e.g., Prometheus, Grafana, ELK stack, Datadog) for shared infrastructure
     
    Strong experience with RESTful API design principles and building well-documented, consumable APIs
     
    Knowledge of Service Mesh concepts and practical experience with solutions like Istio in a platform context
     
    Hands-on experience with Relational Databases (e.g., MySQL, PostgresSQL), ideally in managing them as a service
     
    Excellent communication skills and the ability to clearly articulate complex technical concepts to both technical and non-technical audiences
     
    A strong customer-centric mindset, treating internal development teams as your primary customers
     
    Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience or equivalent military experience required

Skills Required

  • 9+ years in Infrastructure Development, Platform Engineering, or Site Reliability Engineering
  • Deep expertise with Kubernetes in production (single-tenant and multi-tenant architectures)
  • Strong programming skills in Go (Golang) and Python
  • Hands-on experience with at least one major cloud provider (AWS, GCP, or Azure)
  • Multi-cloud experience and building abstractions over cloud providers
  • Experience designing and implementing Event-Driven Architectures and message systems (Kafka, Google Pub/Sub, RabbitMQ, NATS)
  • Experience with CI/CD pipeline tools (especially GitLab CI) and GitOps tools like ArgoCD
  • Demonstrable experience designing and operating distributed systems for reliability and performance
  • Familiarity with Multi-Region cloud strategies and globally distributed platform design
  • Proficiency with observability and monitoring platforms (Prometheus, Grafana, ELK stack, Datadog)
  • RESTful API design and building well-documented, consumable APIs
  • Knowledge and practical experience with service mesh technologies (Envoy, Istio)
  • Hands-on experience with relational databases (MySQL, PostgreSQL)
  • Experience managing databases as a shared service (DBaaS)
  • Willingness and ability to participate in on-call rotations supporting critical shared infrastructure
  • Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical/military experience
  • Excellent communication skills and strong internal customer focus

Saviynt Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Saviynt and has not been reviewed or approved by Saviynt.

  • Leave & Time Off Breadth Time off is described as flexible, with policies including flexible time off and mentions of unlimited PTO. This breadth can make time away easier to take alongside company holidays.
  • Wellbeing & Lifestyle Benefits In‑office amenities such as catered food, drinks, and snacks, plus social events like birthday celebrations and team outings, are highlighted. These lifestyle perks add day‑to‑day convenience and connection.
  • Career-Linked Recognition & Rewards Employee recognition is emphasized, with programs to celebrate those who go above and beyond. Regular recognition activities are cited alongside team bonding initiatives.

Saviynt Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
El Segundo, CA
0 Employees
Year Founded: 2010

What We Do

Saviynt’s Enterprise Identity Cloud helps modern enterprises scale cloud initiatives and solve the toughest security and compliance challenges in record time. The company brings together identity governance (IGA), granular application access, cloud security, and privileged access to secure the entire business ecosystem and provide a frictionless user experience.

Similar Jobs

Samsara Logo Samsara

Senior Data Engineer

Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
Canada
4000 Employees
119K-154K Annually

Square Logo Square

Merchant and Network Compliance Manager

eCommerce • Fintech • Hardware • Payments • Software • Financial Services
Remote or Hybrid
8 Locations
12000 Employees
103K-194K Annually

Block Logo Block

Merchant and Network Compliance Manager

Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
In-Office or Remote
8 Locations
12000 Employees
103K-194K Annually

Tapestry - Coach and Kate Spade Logo Tapestry - Coach and Kate Spade

Temporary Sales Associate-1

eCommerce • Fashion • Retail • Sales • Wearables • Design
Hybrid
Richmond, BC, CAN
16000 Employees
18-21 Hourly

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account