The Role
Design and implement SkyPilot's commercial multicloud platform: architect control and data plane separation, tenant/user management, scaling, monitoring, and alerting. Build production-grade cloud-native platform services and APIs using Go, Kubernetes, gRPC, PostgreSQL, and Terraform, prioritizing reliability, security, and great user experience.
Summary Generated by Built In
SkyPilot is building the future of multicloud AI infra. We are the Berkeley founding team commercializing SkyPilot (9.5K+ GitHub stars, 200+contributors), to enable AI to run on different cloud infrastructures in a portable, cost-optimizing, and highly available way.
SkyPilot is deployed at 100s of companies, including Fortune 500s and top AI-natives (Shopify, Redis, Abridge, Hippocratic, Applied Compute, etc.). In 2025, adoption grew >600%, now launching more GPUs per month than the biggest neocloud’s fleet. Currently in stealth, SkyPilot is founded in 2024 by UC Berkeley PhDs and professors (incl. Databricks cofounders). We’re building a top-tier engineering team, with current talent from Databricks, Google, Crusoe, ByteDance, and PingCap.
What You’ll Do
You’ll play an instrumental role in designing and implementing SkyPilot’s commercial cloud platform, which will power a reimagined multicloud AI experience:
- Architect SkyPilot’s commercial cloud platform from the ground up: Control plane and data plane separation, tenant/user management, control plane scaling, monitoring, alerting.
- Building core, production-grade platform services: Designing and implementing APIs and services in a cloud-native stack (e.g., Go, Kubernetes, microservices), balancing reliability, security, and simplicity.
Ideal Candidates
You are a seasoned engineer with experience building SaaS/cloud platforms from zero to one.
- 6+ years of experience in building SaaS platforms at startups: You have 6+ years of experience building SaaS platforms at startups, from inception to launch to scaling. You are intimately familiar with the best-in-class tools/vendors needed for a SaaS platform.
- SaaS platform expertise: You have hands-on experience building user and organization management, authentication and RBAC, API gateway, usage metering and billing integration, CI/CD pipelines, and other core platform services — using technologies like gRPC, Go, Kubernetes, PostgreSQL, Terraform.
- Great product taste: You believe great products must deliver both a solid platform foundation and a great user experience.
What We Offer
- Competitive equity, compensation, and health benefits.
- Chance to work with some of the best minds in cloud, distributed, and AI systems, with significant autonomy and ownership.
- Front-row seat at the latest open-source infra startup from Berkeley.
Skills Required
- 6+ years of experience building SaaS platforms at startups
- Hands-on experience designing and implementing APIs and services in a cloud-native stack (e.g., Go, Kubernetes, microservices)
- Experience with gRPC
- Experience with PostgreSQL
- Experience with Terraform
- Experience building user and organization management, authentication and RBAC, API gateway, usage metering and billing integration, and CI/CD pipelines
- Great product taste and focus on delivering strong user experience
- Familiarity with best-in-class tools and vendors for SaaS platforms
Am I A Good Fit?
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.
Success! Refresh the page to see how your skills align with this role.
The Company
What We Do
SkyPilot is an open-source framework designed to run, manage, and scale AI, machine learning, and data science workloads on any AI infrastructure. It provides teams with a unified interface to launch jobs across various clouds and regions, automating compute selection and cost optimization to reduce expenses and simplify the management of complex cloud resources without requiring deep infrastructure expertise.









