Principal Engineer, Cloud Platforms

Posted 3 Days Ago
Be an Early Applicant
2 Locations
Hybrid
Expert/Leader
Software
The Role
Lead design and implementation of shared, highly available cloud platform services: Kubernetes platforms, CI/CD, event-driven messaging, observability, service mesh, relational DB as-a-service, automation in Go/Python, multi-region cloud optimizations, and on-call support.
Summary Generated by Built In
About Saviynt

Saviynt is a leader in identity security, delivering an AI-powered platform that governs and secures access to applications, data, and business processes for some of the world’s largest enterprises and government institutions. Built for the AI era, Saviynt enables organizations to move faster—securely and compliantly.
 

Why This Role Matters

Saviynt’s platform is mission-critical for our customers. As we scale globally, reliability, availability, and performance are not optional—they are core product features.

As a Principal  Engineer, you will define and drive the reliability strategy for our SaaS platform. This is a high-impact, hands-on engineering role with broad influence across infrastructure, platform, and application teams. You will shape how Saviynt designs, operates, and measures reliability at scale.

This role is ideal for engineers who want to work on hard reliability problems, influence architecture across teams, and leave a lasting mark on a growing SaaS platform.
 

What You’ll Do

In this pivotal role, you will be instrumental in designing, building, and maintaining the shared infrastructure services and platforms that our product and application teams will depend on

You will focus on creating reusable, reliable, and scalable solutions that abstract away complexity, enabling other teams to focus on their core business logic and deliver features faster in a multi-cloud environment

Design and build core platform components and shared infrastructure services that other development teams will integrate with and leverage to deploy and operate their applications

Architect, implement, and manage highly available and scalable Kubernetes platforms as a service for internal consumers

Develop robust, internal-facing tools and automation for infrastructure provisioning and management primarily using Go (Golang)

Architect and optimize foundational solutions within Cloud environments (AWS, Azure, etc.), focusing on creating reusable patterns and modules for other teams

Design and implement shared Event-Driven Architecture components and messaging platforms using technologies like Kafka or Google Pub/Sub that product teams can easily utilize

Develop and maintain robust CI/CD pipelines (e.g., GitLab CI and ArgoCD) as a service, providing standardized and automated deployment workflows for various development teams

Design and build resilient Distributed Systems components that serve as building blocks for other applications, focusing on reliability, fault tolerance, and performance

Manage and optimize our shared infrastructure across Multi-Region Cloud Environments, ensuring that platform services are globally available and performant for all consumers

Establish and enhance centralized Observability and Monitoring platforms and tools that provide self-service insights for consuming teams

Define and implement clear, well-documented RESTful API designs for the infrastructure services you build, ensuring ease of integration for internal clients

Implement and manage Service Mesh (e.g., Envoy, Istio) capabilities, providing traffic management, security, and policy enforcement as a shared platform for services

Design, implement, and optimize highly available Relational Database services or shared data platforms for broad organizational use

Collaborate closely with product development teams to understand their infrastructure needs and pain points, providing technical guidance and support

Participate in on-call rotations to support the critical shared infrastructure you build
 

What We’re Looking For

9+ years of experience in an Infrastructure Development, Platform Engineering, or Site Reliability Engineering role, with a strong focus on building tools and services for other engineers

Deep expertise with Kubernetes in production environments, particularly in providing it as a platform(i.e single tenant and multi-tenant deployment architectures)

Strong programming skills in Go (Golang) and Python, with experience building robust, maintainable backend services and automation

Extensive hands-on experience with at least one major Cloud Provider (AWS, GCP, or Azure); multi-cloud experience is a strong plus, especially in building abstractions over them

Proven experience designing and implementing Event-Driven Architecture and message queuing systems (e.g., Kafka, RMQ, NATS) as shared services

Solid understanding and practical experience with CI/CD pipeline tools (especially GitLab CI) and experience establishing automated delivery processes for other teams

Demonstrable experience designing and operating Distributed Systems, with an understanding of patterns for creating reliable, shared components

Familiarity with Multi-Region Cloud Environments and strategies for building globally distributed and highly available platform

Proficiency in establishing and utilizing comprehensive Observability and Monitoring platforms (e.g., Prometheus, Grafana, ELK stack, Datadog) for shared infrastructure

Strong experience with RESTful API design principles and building well-documented, consumable APIs

Knowledge of Service Mesh concepts and practical experience with solutions like Istio in a platform context

Hands-on experience with Relational Databases (e.g., MySQL, PostgresSQL), ideally in managing them as a service

Excellent communication skills and the ability to clearly articulate complex technical concepts to both technical and non-technical audiences

A strong customer-centric mindset, treating internal development teams as your primary customers

Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience or equivalent military experience required
 

Why Join Saviynt

•        Work on a mission-critical SaaS platform used by global enterprises
•        Solve complex reliability challenges at scale
•        Influence architecture and engineering culture at a company level
•        Competitive compensation, benefits, and growth opportunities
 

Security & Compliance

This role requires compliance with Saviynt’s information security and privacy policies, including annual security training

Top Skills

Argocd
AWS
Azure
Datadog
Elk Stack
Envoy
GCP
Gitlab Ci
Go (Golang)
Google Pub/Sub
Grafana
Istio
Kafka
Kubernetes
MySQL
Nats
Postgres
Prometheus
Python
Rabbitmq (Rmq)
Restful Apis
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
El Segundo, CA
0 Employees
Year Founded: 2010

What We Do

Saviynt’s Enterprise Identity Cloud helps modern enterprises scale cloud initiatives and solve the toughest security and compliance challenges in record time. The company brings together identity governance (IGA), granular application access, cloud security, and privileged access to secure the entire business ecosystem and provide a frictionless user experience.

Similar Jobs

Optum Logo Optum

License EAP Counselor - Remote (Early 1st Shift & Weekends with Differentials)

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office or Remote
Atlanta, GA, USA
160000 Employees
29-52 Hourly

Optum Logo Optum

Medical Director - Post-Acute Care Management - Care Transitions - Remote anywhere in US

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office or Remote
10 Locations
160000 Employees
249K-373K Annually

CDW Logo CDW

Architect

Information Technology
Remote or Hybrid
US
15100 Employees
147K-211K Annually

CDW Logo CDW

Channel Manager

Information Technology
Remote or Hybrid
US
15100 Employees

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account