Senior Platform Engineer

Posted 11 Days Ago
Be an Early Applicant
San Jose, CA, USA
In-Office
150K-180K Annually
Senior level
Cloud • Information Technology
The Role
Design, operate, and optimize cloud infrastructure (primarily AWS) with an emphasis on cost efficiency (FinOps). Build automation, observability, CI/CD integration, and tooling; participate in on-call rotations and incident escalation; drive cost visibility, rightsizing, and commitment strategy automation across engineering teams.
Summary Generated by Built In

Senior Platform Engineer

Location: US (Remote)

About Platform9

Platform9 is the leader in enterprise Private Cloud. Founded by VMware cloud veterans, we build Private Cloud Director — software that turns your existing hardware into a full-featured, future-ready private cloud. We stay focused on one thing: exceptional customer outcomes.

Enterprises choose Platform9 to replace legacy virtualization because it removes operational complexity without forcing a rip-and-replace. Private Cloud Director gives infrastructure teams a familiar GUI for managing VMs and containers, seamless integration with existing hardware and third-party storage, and critical enterprise features like HA/DR, scale, and reliability — all while unlocking robust API control and vendor independence.

With over 30,000 nodes in production at companies like Cloudera, EBSCO, Juniper Networks, and Rackspace, Platform9 is the proven path to a modern, open private cloud. We are backed by prominent investors and supported by a partner ecosystem of resellers, SIs, MSPs, and technology vendors. Our values — innovation, customer obsession, ownership, radical candor, and excellence — guide every decision.

About the Role

We’re looking for a Senior Platform Engineer who treats cloud spend as a first-class engineering concern alongside reliability and performance. You’ll design, operate, and continuously improve our cloud infrastructure — and you’ll own the FinOps discipline that keeps it cost-efficient at scale.

Day to day, that means partnering with engineering, SRE and sales to build cost visibility, right-size resources, optimize commitment strategies, and automate everything from provisioning to deployment. You’ll also be a key escalation point for production incidents and a force multiplier for the broader engineering org through tooling and process improvements.

Responsibilities
  • Design, implement, and maintain cloud infrastructure across multiple hyperscalers (primarily AWS), including Kubernetes clusters, OpenStack environments, and supporting services.
  • Own cloud cost optimization end-to-end: analyze spend, eliminate waste, right-size resources, and manage commitment strategies (Reserved Instances, Savings Plans) to reduce total infrastructure cost.
  • Establish and evolve FinOps practices across the org cost allocation, chargeback/showback models, tagging policies, and spend forecasting — so engineering teams can make financially informed infrastructure decisions.
  • Automate infrastructure provisioning, configuration management, and application deployments using Terraform, Flux, and similar tools.
  • Build and maintain observability for both system health and cost efficiency using Prometheus, Grafana, Loki, and related tools; surface spending trends to engineering and leadership through clear dashboards and regular reporting.
  • Develop internal tooling and scripts that reduce toil and improve operational leverage.
  • Collaborate with engineering teams to design and maintain CI/CD pipelines.
  • Participate in on-call rotation and serve as a senior escalation point for infrastructure, application, and performance incidents.
  • Stay current on trends in cloud computing, DevOps, and cloud financial management, and bring relevant ideas back to the team.
Qualifications
  • 5+ years in a DevOps or SRE role with deep experience in cloud infrastructure and operations.
  • Demonstrated experience with FinOps principles — commitment management, rightsizing, waste reduction, cost allocation, and translating infrastructure decisions into financial impact for non-technical stakeholders.
  • Extensive Kubernetes experience: cluster administration, deployment strategies, and production troubleshooting.
  • Proficiency in infrastructure-as-code (Terraform, Ansible, or similar).
  • Strong scripting skills in Python or equivalent; strong systems programming in Go or equivalent.
  • Solid configuration management experience with Salt, Chef, or similar.
  • Hands-on experience with observability tooling: Prometheus, Cortex, Grafana, Loki.
  • Familiarity with CI/CD tools and best practices.
  • Strong Linux administration and debugging skills.
  • Excellent communication skills — you can explain an infrastructure trade-off or a cost anomaly to an engineer and a finance lead in the same conversation.
  • Proven incident management experience.
  • OpenStack experience is a plus, not a hard requirement.
Bonus Points
  • EKS (Elastic Kubernetes Service) experience.
  • Experience managing on-premise infrastructure.
  • FinOps Foundation certification (FOCP) or equivalent.
  • Experience with cloud cost management platforms (Archera, CloudHealth, Apptio Cloudability, AWS Cost Explorer).
  • Familiarity with OpenTelemetry and AI-powered observability tools.
  • Experience in a fast-paced startup environment.
Benefits and Perks

Employees today are looking for companies that truly care and recognise their whole person. Platform 9's benefits and perks have been carefully designed to ensure that we take care of an employee's emotions and physical well-being. Many of our benefits extend to families, who form a significant part of our well-being at work. Please note that benefits change by country.

  • Competitive Compensation and Equity
  • Medical Healthcare for you and your family
  • Hybrid Work Model
  • Wellness Benefits
  • Professional Development/Global certifications
  • Reward and Recognition Programs
  • Team Building Activities
  • Our benefits have been carefully selected, keeping in mind employees requirements and personal situations now and for the future 
 

(Salary Range: $150-180K/year)

Skills Required

  • 5+ years in a DevOps or SRE role with deep experience in cloud infrastructure and operations.
  • Demonstrated experience with FinOps principles (commitment management, rightsizing, waste reduction, cost allocation).
  • Extensive Kubernetes experience: cluster administration, deployment strategies, production troubleshooting.
  • Proficiency in infrastructure-as-code (Terraform, Ansible, or similar).
  • Strong scripting skills in Python (or equivalent).
  • Strong systems programming skills in Go (or equivalent).
  • Configuration management experience with Salt, Chef, or similar.
  • Hands-on experience with observability tooling: Prometheus, Cortex, Grafana, Loki.
  • Familiarity with CI/CD tools and best practices.
  • Strong Linux administration and debugging skills.
  • Excellent communication skills (technical and non-technical stakeholders).
  • Proven incident management experience.
  • OpenStack experience.
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Mountain View, CA
122 Employees
Year Founded: 2013

What We Do

Platform9 is the open distributed cloud company, offering the power of the public cloud on infrastructure of customers’ choice—powered by Kubernetes and cloud-native technologies. Public clouds are walled gardens, and DIY is difficult and time-consuming. Platform9 offers a third option—an open and faster option—enabling a better way to go cloud-native. Platform9’s service powers 40K+ nodes across private, public and edge clouds. Innovative enterprises like Juniper, Kingfisher Plc, Mavenir, Redfin and Cloudera achieve 4x faster time-to-market, up to 90% reduction in operational costs, and 99.9% uptime. Platform9 is an inclusive, globally distributed company, backed by leading investors.

Similar Jobs

True Anomaly Logo True Anomaly

Senior Platform Engineer

Aerospace • Artificial Intelligence • Hardware • Machine Learning • Software • Defense • Manufacturing
In-Office
2 Locations
300 Employees
170K-275K Annually

Drata Logo Drata

Senior Platform Engineer

Security • Software • Cybersecurity • Automation
Hybrid
San Francisco, CA, USA
600 Employees
151K-205K Annually

Applied Systems Logo Applied Systems

Cloud Platform Engineer

Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
Remote or Hybrid
2 Locations
3040 Employees
100K-160K Annually

Grow Therapy Logo Grow Therapy

Reliability Engineer

Healthtech • Social Impact • Software
Hybrid
3 Locations
460 Employees
182K-250K Annually

Similar Companies Hiring

Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees
Standard Template Labs Thumbnail
Artificial Intelligence • Information Technology • Software
New York, NY
25 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account