AI Capacity Planning & Procurement Manager

Reposted 5 Days Ago
Be an Early Applicant
Redwood City, CA
In-Office
150K-250K Annually
Senior level
Software • Generative AI
The Role
The Capacity & Supply Procurement Manager will oversee GPU and cloud capacity procurement, negotiate contracts, and strategize capacity planning to support operations efficiently.
Summary Generated by Built In
About Us:

Here at Fireworks, we’re building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference. We’ve been independently benchmarked to have the fastest LLM inference and have been getting great traction with innovative research projects, like our own function calling and multi-modal models. Fireworks is funded by top investors, like Benchmark and Sequoia, and we’re an ambitious, fun team composed primarily of veterans from Pytorch and Google Vertex AI.

The Role:

We are an AI company building reliable, high-performance model serving infrastructure. Our customers run mission-critical workloads and expect consistency, cost transparency, and predictable scaling. Capacity — GPU, network, and cloud economics — is existential for us. You will be one of the earliest hires focused on making sure we always have the right compute, at the right price, ahead of demand.

Key Responsibilities:
  • Architect the Multi-Year Capacity Strategy: Develop and own the strategic, multi-year capacity plan by synthesizing inputs from the company strategy, business forecast, Industry trend and Engineering’s product roadmap.
  • Infrastructure Cost Management: Directly manage the biggest cost on our P&L. Translate market insights, technology advancement, and forward-looking plans into rigorous financial models to enable fast business growth while minimizing Total Cost of Ownership (TCO) per unit of performance (e.g., TFLOP). Partner with infra, finance, and GTM to codify capacity strategy into budgets and KPIs
  • Build market intelligence with advanced compute technology roadmap, lead time, and pricing for relevant SKUs (H200/B200/B300/GB200/GB300/MI355, networking, storage)
  • Own end-to-end procurement of cloud capacity (GPUs, storage, networking, etc) across multiple vendors (cloud, bare-metal, colocation, integrators, brokers) including RFQs/RFPs, commercial Negotiation. 
  • Define Capacity Management discipline with processes and tools: inventory, allocation, unit economics, cost attribution and optimization
  • Stand up processes for disciplined renewals, rev-share allocations, and hedging against supply shocks
Minimum Qualifications:
  • 5–10+ years of experience in capacity management, sourcing, data center supply, cloud procurement, infrastructure operations, or a related field (e.g., hyperscaler, colocation provider, OEM, or LLM infrastructure startup)
  • Demonstrated ability to develop and communicate multi-year capacity plans that align business, product, and financial objectives
  • Strong knowledge of GPU and server SKUs, networking topologies, power and space constraints, and a high-level understanding of the global AI compute supply and demand landscape
  • Proven experience negotiating contracts valued at seven figures or higher, with a solid grasp of key terms such as commit profiles, flexibility clauses, drawdowns, credits, and SLA remedies
  • Proficiency in unit economics and scenario modeling, including total cost of ownership (TCO) and TFLOP-month analysis
  • Ability to operate effectively in a fast-paced, low-process environment and contribute to company-defining initiatives
Preferred Qualifications:
  • Established network across major capacity suppliers, including NVIDIA partners, cloud providers, brokers, integrators, and colocation vendors
  • Experience connecting capacity planning with real-world model serving workload patterns and performance requirements
  • Background in AI infrastructure environments, such as hyperscaler clouds or AI infrastructure startups

Total compensation for this role also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. The listed salary range is intended as a guideline and may be adjusted.

Base Pay Range (Plus Equity)
$150,000$250,000 USD
Why Fireworks AI?
  • Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.
  • Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.
  • Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.
  • Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.

Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.

Top Skills

Capacity Planning
Cloud Procurement
Data Center Supply
Financial Modeling
Gpu
Infrastructure Operations
Networking
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Redwood City, CA
63 Employees
Year Founded: 2022

What We Do

Fireworks.ai offers generative AI platform as a service. We optimize for rapid product iteration building on top of gen AI as well as minimizing cost to serve.

https://fireworks.ai/careers

Similar Jobs

Anduril Logo Anduril

Staff Optical Engineer, Space

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
In-Office
Costa Mesa, CA, USA
6000 Employees
220K-292K Annually

Anduril Logo Anduril

Manager of Proposal and Capture

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
In-Office
Costa Mesa, CA, USA
6000 Employees
146K-194K Annually

Anduril Logo Anduril

Hardware Engineer

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
In-Office
Costa Mesa, CA, USA
6000 Employees
191K-253K Annually

Samsara Logo Samsara

Director, Performance Marketing

Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
6 Locations
2800 Employees
140K-250K Annually

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account