Head of Infrastructure

Reposted 18 Days Ago
Be an Early Applicant
San Francisco, CA
In-Office
Expert/Leader
Artificial Intelligence • Information Technology
The Role
The Head of Infrastructure will lead the design, evolution, and reliability of a globally distributed GPU cloud, managing infrastructure roadmaps and guiding engineering teams to achieve strategic goals.
Summary Generated by Built In
Who We Are

Hyperbolic Labs is on a mission to democratize AI by breaking down the barriers to computing power with our Open-Access AI Cloud. By making better use of idle computing resources across the globe, we offer an innovative GPU marketplace and AI inference service that promise affordability and accessibility for all. As pioneers at the intersection of AI and open-source technology, we believe in an open future where AI innovation is limited only by imagination, not by access to resources. We're looking for forward-thinking individuals who share our passion for making AI universally accessible, secure, and affordable. Join us in building a platform that empowers innovators everywhere to turn their visionary AI projects into reality.

As we prepare for growth after our Series A, backed by industry leaders, our team — led by co-founders with PhDs in AI, Math, and Computer Science — is poised to redefine computing.

About the Role

We are hiring a Head of Infrastructure to lead the design, evolution, and reliability of Hyperbolic’s globally distributed GPU cloud. This role sits at the center of our mission: you will architect and scale the systems that power our peer-to-peer GPU marketplace, inference fabric, and core platform primitives.

You’ll own the infrastructure roadmap end-to-end—from distributed systems design and resource orchestration to networking, security, and global capacity strategy. You’ll grow and mentor a world-class engineering organization, establish engineering excellence standards, and partner closely with Product, Security, Platform, and GTM leadership to translate future AI workloads into infrastructure reality.

Who You Are

You are an infrastructure leader with a track record of scaling complex systems, guiding high-impact teams, and making deeply technical decisions in environments where reliability and performance are existential.

Leadership & Strategic Execution

  • 10+ years in infrastructure, systems engineering, or distributed systems, including 5+ years leading managers and senior ICs.

  • Proven ability to own multi-year infrastructure roadmaps, align stakeholders, and translate ambiguous requirements into crisp technical direction.

  • Experience building, scaling, and mentoring high-performing engineering orgs across infrastructure, platform, and SRE disciplines.

  • Exceptional judgment in balancing velocity with reliability, cost, and security.

  • Comfortable working in fast-moving, high-stakes environments where infrastructure is the product.

Technical Depth & System Design

  • Deep expertise in distributed systems, operating systems internals, networking, and resource orchestration.

  • Hands-on experience with container orchestration systems (Kubernetes, Nomad, SLURM, custom schedulers) at global scale.

  • Strong engineering background with the ability to read and write production code (Go, Rust, Python, or similar).

  • Experience architecting multi-cloud + on-prem + edge topologies, including GPU-centric workloads.

  • Expert-level understanding of infrastructure-as-code, automation frameworks, and GitOps workflows.

  • Expertise in designing observability systems (metrics, tracing, logging, alerting) and building operational excellence.

Operational Excellence & Security

  • A track record of owning 99.9–99.99% uptime targets, incident response processes, and resilience engineering.

  • Passionate about security-first infrastructure, including workload isolation, network security, IAM, hardening, and compliance.

  • Experience leading major capacity planning, load forecasting, and cost optimization initiatives.

Bonus Experience

  • Contributions to open-source infra tools, kernels, schedulers, or distributed systems libraries.

  • Familiarity with service mesh, mTLS, RPC frameworks, or low-latency communication patterns.

Why You Should Join Us
  • High impact: your work affects the entire stack and enables all engineering teams

  • Ownership: you’ll own production systems and have real autonomy

  • Learning: exposure to new infrastructure challenges and the chance to grow

  • List of perks & benefits: e.g. equity, health, remote policy, hardware budget, offsites, etc.

  • Inclusive culture: we strive to build a diverse, supportive team

Hyperbolic is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Top Skills

Go
Kubernetes
Nomad
Python
Rust
Slurm
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
30 Employees

What We Do

At Hyperbolic, we’re building the leading open-access AI cloud.

Access inference and compute at a fraction of the cost, and build AI applications without relying on centralized infrastructures.

We invite builders, compute providers, researchers, and individuals to join us on this journey.

Similar Jobs

OpenAI Logo OpenAI

Head of Infrastructure Communications

Artificial Intelligence • Machine Learning • Generative AI
In-Office
San Francisco, CA, USA
224 Employees
12-12 Annually

Anthropic Logo Anthropic

Head of Infrastructure Accounting

Artificial Intelligence • Natural Language Processing • Generative AI
Easy Apply
In-Office
San Francisco, CA, USA
57 Employees
300K-360K Annually

Airwallex Logo Airwallex

Talent Acquisition Specialist

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
Remote or Hybrid
San Francisco, CA, USA
2000 Employees

Snap Inc. Logo Snap Inc.

Staff Desense Engineer

Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Hybrid
2 Locations
5000 Employees
195K-343K Annually

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account