Sr. Director, Platform Engineering

Reposted 4 Hours Ago
Be an Early Applicant
3 Locations
In-Office
Senior level
eCommerce • Fashion
The Role
The Senior Director of Platform Engineering will lead the strategy and operations for the enterprise Cloud Platform across Azure and GCP, drive CI/CD practices, manage infrastructure as code using Terraform, and oversee developer experience and FinOps for cloud spend optimization.
Summary Generated by Built In
About the RoleThe Senior Director, Platform Engineering leads the strategy, development, and operations of our enterprise Cloud Platform across Azure and GCP. This role owns the Continuous Delivery Platform built on Kubernetes, drives FinOps discipline to optimize cloud spend, and ensures teams can build, ship, and run software with speed and confidence. You will shape the modern DevOps tech stack and champion infrastructure automation through Terraform, enabling engineering teams across the organization to deliver at scale. Your leadership will directly impact developer productivity, system reliability, and the pace of innovation.What You'll DoCLOUD PLATFORM STRATEGY & ENGINEERING
  • Define and execute the multi-cloud platform strategy across Azure and GCP, ensuring architectural consistency, security, and scalability.
  • Lead the design and evolution of shared platform services — networking, identity, compute, storage, and observability — as self-service capabilities for product engineering teams.
  • Own the API gateway and service mesh layer (Istio), enabling secure, observable, and resilient service-to-service communication across the platform.
  • Evaluate emerging cloud-native technologies and make build-vs-buy decisions that balance innovation with operational sustainability.
  • Partner with Security, Architecture, and Compliance teams to embed governance and policy-as-code into every layer of the platform.
CONTINUOUS DELIVERY & DEVELOPER EXPERIENCE
  • Own and evolve the enterprise Continuous Delivery Platform built on Kubernetes, enabling teams to build, test, ship, and schedule workloads reliably.
  • Drive adoption of modern CI/CD pipelines, container orchestration, GitOps workflows, and progressive delivery practices (canary, blue-green, feature flags).
  • Champion developer experience (DevEx) as a first-class product — reducing friction from code commit to production through self-service tooling, golden paths, and internal developer portals.
  • Establish and track platform adoption metrics (deployment frequency, lead time, change failure rate, MTTR) aligned with DORA benchmarks.
INFRASTRUCTURE AUTOMATION & DEVOPS EXCELLENCE
  • Lead the infrastructure-as-code practice using Terraform, ensuring all cloud resources are provisioned, versioned, and managed through automated, repeatable pipelines.
  • Drive the DevOps culture and toolchain strategy — standardizing on modern practices for configuration management, secrets management, service mesh, and policy enforcement.
  • Build and maintain reusable Terraform modules, landing zones, and account/project vending solutions that accelerate onboarding of new workloads and teams.
  • Ensure infrastructure changes flow through the same CI/CD rigor as application code, with automated testing, drift detection, and compliance checks.
OBSERVABILITY, MONITORING & EVENT STREAMING
  • Define and own the enterprise observability strategy — ensuring comprehensive monitoring, logging, tracing, and alerting across all platform services and application workloads.
  • Lead the implementation and standardization of monitoring toolchains (e.g., Prometheus, Grafana, Datadog, Azure Monitor, Google Cloud Operations Suite) to provide real-time visibility into system health and performance.
  • Own the platform's event streaming and messaging infrastructure built on Apache Kafka, enabling reliable, high-throughput, real-time data pipelines across the organization.
  • Establish SLIs, SLOs, and error budgets as the foundation for reliability decisions, partnering with product engineering teams to drive a culture of proactive incident prevention.
  • Ensure distributed tracing and service dependency mapping are in place across the Istio service mesh, enabling rapid root cause analysis during incidents.
API GATEWAY & SERVICE MESH MANAGEMENT
  • Lead the strategy and operations of the Istio service mesh and API gateway layer, providing traffic management, mutual TLS, rate limiting, and fine-grained access control across microservices.
  • Define and enforce API lifecycle management standards — versioning, deprecation policies, schema governance, and developer documentation.
  • Partner with application teams to optimize service-to-service communication patterns, latency, and resilience through circuit breaking, retries, and intelligent routing.
  • Ensure API gateway and mesh configurations are managed as code, fully integrated into the CI/CD pipeline with automated canary analysis and rollback.
FINOPS & CLOUD FINANCIAL MANAGEMENT
  • Establish and lead the FinOps practice, creating visibility, accountability, and optimization of cloud spend across Azure and GCP.
  • Implement cost allocation frameworks (tagging, showback/chargeback) that tie cloud consumption to business units, products, and teams.
  • Partner with Finance, Procurement, and Engineering leadership to forecast cloud budgets, manage committed-use agreements, and identify savings opportunities.
  • Build dashboards and reporting cadences that keep cloud costs transparent from engineering teams to executive leadership.
TEAM LEADERSHIP & ORGANIZATIONAL DEVELOPMENT
  • Build, mentor, and scale a high-performing platform engineering organization spanning SRE, DevOps, cloud infrastructure, observability, and FinOps disciplines.
  • Foster a culture of ownership, continuous improvement, and blameless incident learning.
  • Attract and retain top engineering talent by creating an environment of technical excellence, psychological safety, and meaningful career growth.
  • Establish clear team charters, on-call rotations, SLOs, and operational readiness standards.
CROSS-FUNCTIONAL PARTNERSHIP & STAKEHOLDER MANAGEMENT
  • Serve as the primary technology leader for cloud platform and delivery infrastructure, communicating strategy, roadmaps, and trade-offs to executive leadership.
  • Partner with product engineering, data engineering, security, and enterprise architecture teams to align platform capabilities with business priorities.
  • Coordinate transparent, timely communications on platform health, incidents, capacity, and upcoming changes.
  • Prepare and deliver executive summaries covering platform performance, cost trends, reliability metrics, and strategic initiatives.
Who You Are
  • 15+ years of progressive experience in software engineering, infrastructure, or platform engineering, with at least 5 years in senior leadership roles managing managers.
  • Deep hands-on background in cloud platforms (Azure and/or GCP required; multi-cloud experience strongly preferred).
  • Proven experience building and operating Kubernetes-based platforms at enterprise scale, including container orchestration, Istio service mesh, and workload scheduling.
  • Strong expertise in API gateway architecture, service mesh (Istio), traffic management, and microservices communication patterns.
  • Demonstrated experience with enterprise observability and monitoring platforms (Prometheus, Grafana, Datadog, or equivalent), including SLI/SLO frameworks and incident management.
  • Hands-on knowledge of event streaming and messaging platforms, particularly Apache Kafka, for building real-time data pipelines at scale.
  • Strong command of infrastructure-as-code practices with Terraform and modern DevOps toolchains (CI/CD, GitOps, observability, secrets management).
  • Demonstrated success standing up or maturing a FinOps function, including cost optimization, tagging strategies, and cloud financial governance.
  • Track record of building high-performing engineering teams with a culture of ownership, operational excellence, and continuous delivery.
  • Excellent executive communication skills — able to translate complex technical strategies into business outcomes for non-technical stakeholders.
  • Experience driving platform adoption and developer experience as an internal product, using metrics and feedback loops to guide investment.
  • Strong knowledge of DORA metrics, SRE principles, and modern reliability practices.
PREFERRED QUALIFICATIONS
  • Kubernetes certifications (CKA, CKAD) or Terraform certifications (HashiCorp Certified).
  • Cloud certifications in Azure (Solutions Architect Expert) and/or GCP (Professional Cloud Architect).
  • FinOps Certified Practitioner or equivalent experience with the FinOps Foundation framework.
  • Experience with platform engineering tooling such as Backstage, Crossplane, ArgoCD, or Flux.
  • Certified Kafka expertise or significant production experience operating Kafka/Confluent at scale.
  • Experience with Istio certification or advanced service mesh architectures in multi-cluster environments.
  • Background in retail, e-commerce, or high-transaction-volume environments.

Skills Required

  • 15+ years of experience in software engineering, infrastructure or platform engineering
  • 5+ years in senior leadership roles managing managers
  • Deep hands-on background in Azure and/or GCP
  • Experience with Kubernetes-based platforms at enterprise scale
  • Expertise in API gateway architecture and service mesh
  • Knowledge of infrastructure-as-code practices with Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Bristol
11,000 Employees
Year Founded: 1969

What We Do

In 1969, Don and Doris Fisher opened the first Gap store on Ocean Avenue in San Francisco. They wanted to make it easier to find a great pair of jeans, and they did. Their denim and records store was a hit, and it grew to become one of the world’s most iconic brands. Today we’re represented in more than 1400 stores in over 40 countries, and online. We have headquarters in New York, London, Shanghai, Tokyo, and, of course, San Francisco. Our unique aesthetic is optimistic cool, elevated American style. Our clothes are crafted with care, with focused attention to thoughtful design. We believe in staying true to our heritage while creating what’s next. Don and Doris Fisher always wanted to “do more than sell clothes.” They wanted to support the people who ran their company, to be active in their communities, and to have a positive impact on the world. Their vision helped transform retail, and we’re still following their lead. We stand for freedom and possibility for all; we champion diverse ideas that transcend generations, geographies and genders.

Similar Jobs

Crunchyroll Logo Crunchyroll

Senior Director, Platform Engineering - Enterprise Technology

Digital Media • eCommerce • Gaming • Mobile • News + Entertainment
Hybrid
Dallas, TX, USA
1300 Employees
Hybrid
Austin, TX, USA
897 Employees
210K-263K Annually

PwC Logo PwC

(DO NOT APPLY) PTT Test 6/18

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Remote or Hybrid
8 Locations
370000 Employees
77K-214K Annually

PwC Logo PwC

Legal Process & Technology Consulting Manager

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Hybrid
8 Locations
370000 Employees
99K-232K Annually

Similar Companies Hiring

PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Artificial Intelligence • eCommerce • Fintech • Payments • Retail • Software • Analytics
US
35 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account