Principal Software Engineer, AI Platform Engineering

Posted Yesterday
Be an Early Applicant
El Segundo, CA, USA
Hybrid
Senior level
Software
The Role
The Principal Software Engineer leads the architectural design of a data platform, ensuring isolated and compliant data flows, overseeing AI data lakes, batch and streaming pipelines, and developing service APIs.
Summary Generated by Built In

ABOUT SAVIYNT

Saviynt is a leader in identity security, delivering an AI-powered platform that governs and secures access to applications, data, and business processes for global enterprises and government institutions. Built for the AI era, Saviynt helps organizations move faster — securely and compliantly.

 

ABOUT THE ROLE

You set the architectural direction for how training data flows, evolves, and is governed across the AI Platform. You define the standards ML engineers and scientists build on, and ensure every training signal is tenant-isolated, PII-free, and traceable from source to model.

 

WHAT YOU'LL OWN

  • AI Data Lake on GCS: bucket layout, raw → silver → gold tier separation, CMEK encryption, lifecycle rules

  • Batch pipelines: Spark on Dataproc for TB-scale feature backfills, Iceberg compaction, and daily S3→GCS incremental sync

  • Streaming pipelines: Apache Beam on Dataflow for sub-5-min CDC ingestion with exactly-once semantics and PII assertion gates

  • Schema registry: Avro / Protobuf schema versioning, compatibility modes, and migration playbooks for safe schema evolution

  • Orchestration: Flyte as primary DAG layer — task authoring standards, domain isolation, retry policies, DataCatalog memoization; evaluate Kubeflow Pipelines where relevant

  • Multi-tenancy: strict per-tenant GCS prefix isolation, quota policies, and cross-tenant contamination validation

  • Data Anonymizer and Data Labeler microservices: strip PII and attach ML labels before signals leave each customer environment

  • Feature store: Feast offline (GCS Parquet) and online (Redis) with point-in-time correctness and < 0.1% consistency SLA

  • Vector database: operate Pgvector (Cloud SQL) for POC and Qdrant on GKE for production-scale embedding storage; design index strategies (IVFFlat, HNSW) and manage ANN query latency SLAs

  • RAG data pipeline: build embedding generation pipelines that chunk, encode, and upsert document embeddings into the vector store; own the data refresh cadence and staleness SLAs for retrieval context

  • Service APIs: expose data platform services (feature serving, embedding upsert, schema validation) over HTTPS with mTLS and gRPC where low-latency streaming is required

  • Synthetic data pipelines for dev/staging where real customer data is not permitted

  • Data quality gates: Great Expectations / dbt checks as Flyte tasks, blocking on schema and PII-absence failures

 

YOU'LL THRIVE HERE IF YOU HAVE

  • 8+ years of data engineering at production scale across multiple companies

  • Demonstrated principal impact: platform standards you defined adopted org-wide, or major cross-team pipeline/schema migrations you led

  • Data lake ownership (essential): you have designed and operated a production data lake end-to-end — storage layout, partitioning strategy, tiered retention (hot/warm/cold), table format (Iceberg or Delta Lake), compaction, and access control; not just consumed one

  • Deep Spark (PySpark / Scala): executor tuning, shuffle diagnosis, Iceberg table maintenance

  • Hands-on Beam / Dataflow: windowing, exactly-once, side inputs, autoscaling

  • Schema registry experience: Protobuf / Avro compatibility rules, breaking-change migrations in production

  • Orchestration at scale: Flyte, Kubeflow Pipelines, Airflow, or Prefect — operated in production, ideally benchmarked two

  • Multi-tenant data architecture: per-tenant isolation as a hard requirement, not a post-hoc concern

  • Feature store operations: Feast or Tecton, point-in-time joins, online/offline consistency

  • Vector databases: Pgvector or Qdrant in production — index tuning, ANN search, embedding upsert pipelines

  • RAG data fundamentals: chunking strategies, embedding model selection, retrieval quality evaluation, and context freshness management

  • API transport: gRPC and HTTPS/mTLS for service-to-service communication; comfortable defining proto contracts and managing certificate lifecycle

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience or equivalent military experience

 

NICE TO HAVE

  • Differential privacy or k-anonymity for ML training datasets

  • Open source contributions: Feast, Great Expectations, Apache Beam, or dbt

  • Familiarity with IAM / access governance data: entitlements, provisioning events, access graphs

  • Iceberg or Delta Lake at petabyte scale

 

WHY JOIN SAVIYNT

  • Work on a large-scale, Kubernetes-based SaaS platform

  • Solve challenging cloud and reliability problems at scale

  • Collaborate with strong engineers in a reliability-focused culture

  • Competitive compensation, benefits, and growth opportunities

 

SECURITY & COMPLIANCE

This role requires adherence to Saviynt's information security and privacy policies, including annual security training.

Skills Required

  • 8+ years of data engineering experience at production scale
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • Deep experience with Spark (PySpark / Scala)
  • Hands-on experience with Apache Beam / Dataflow
  • Experience with schema registry (Protobuf / Avro)
  • Experience with orchestration tools like Flyte or Kubeflow Pipelines
  • Experience with data lake operations and architecture
  • Experience managing vector databases like Pgvector or Qdrant
  • Experience with API transport using gRPC and HTTPS/mTLS

Saviynt Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Saviynt and has not been reviewed or approved by Saviynt.

  • Leave & Time Off Breadth Time off is described as flexible, with policies including flexible time off and mentions of unlimited PTO. This breadth can make time away easier to take alongside company holidays.
  • Wellbeing & Lifestyle Benefits In‑office amenities such as catered food, drinks, and snacks, plus social events like birthday celebrations and team outings, are highlighted. These lifestyle perks add day‑to‑day convenience and connection.
  • Career-Linked Recognition & Rewards Employee recognition is emphasized, with programs to celebrate those who go above and beyond. Regular recognition activities are cited alongside team bonding initiatives.

Saviynt Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
El Segundo, CA
0 Employees
Year Founded: 2010

What We Do

Saviynt’s Enterprise Identity Cloud helps modern enterprises scale cloud initiatives and solve the toughest security and compliance challenges in record time. The company brings together identity governance (IGA), granular application access, cloud security, and privileged access to secure the entire business ecosystem and provide a frictionless user experience.

Similar Jobs

ServiceNow Logo ServiceNow

Principal Software Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Santa Clara, CA, USA
29000 Employees
218K-381K Annually

Samsara Logo Samsara

Sales Engineer

Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
United States
4000 Employees
96K-145K Annually

Samsara Logo Samsara

Specialist Seller - Enterprise

Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
United States
4000 Employees
350K-350K Annually

Samsara Logo Samsara

Senior Business Value Strategist

Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
United States
4000 Employees
145K-196K Annually

Similar Companies Hiring

Fairly Even Thumbnail
Hardware • Other • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account