Sr. Manager, Capacity Planning

San Francisco, CA, USA
In-Office
230K-260K Annually
Senior level
Artificial Intelligence • Information Technology
The Role
Lead demand forecasting and supply planning for GPU compute capacity: build forecasting models, set allocation decisions, create tooling and dashboards, and align engineering, finance, and GTM to optimize fleet utilization and revenue per GPU.
About the Role

This role owns the intersection of customer demand and compute supply: forecasting how much capacity we need, deciding which products/services get allocated capacity, and building the systems to make those decisions repeatable. You will work across GTM, Strategic Finance, and Infrastructure Engineering to turn demand planning into a structured, data-driven function, directly impacting revenue per GPU and how efficiently we scale our compute fleet.

Responsibilities
  • Run demand forecasting in partnership with GTM and Strategic Finance, translating customer pipeline and usage data into the capacity signals that drive supply planning decisions
  • Create a demand planning model based on existing utilization and growth metrics
  • Define the supply plan to deliver on the demand forecast
  • Partner with Strategic Finance on capital allocation inputs, providing the demand and utilization data that informs capacity investment decisions and revenue modeling
  • Own capacity allocation decisions, matching demand to available compute resources and making trade-off recommendations 
  • Design and build allocation tooling and dashboards: define requirements, then work with engineering or use low-code tools to automate tracking of customer commitments, capacity utilization, and reallocation workflows
  • Define and maintain capacity health metrics (fleet utilization, revenue per GPU, committed vs. available capacity) and reporting that gives leadership and GTM visibility into allocation status and risks
Requirements
  • 7+ years of experience in capacity planning, demand planning, revenue operations, supply chain, or infrastructure strategy within cloud, AI/ML, or a high-growth technology environment, with a track record of building or scaling a planning function
  • Experience partnering with GTM teams to translate pipeline and usage data into capacity or supply planning decisions
  • Strong quantitative skills: able to build and own forecasting models, utilization analyses, and scenario planning frameworks in SQL and spreadsheets; Python is a plus
  • Experience designing or improving operational workflows such as allocation systems, intake processes, or cross-functional planning cadences
  • Ability to synthesize competing inputs (customer commitments, infrastructure timelines, financial targets) into clear allocation recommendations under constraints
  • Ability to drive alignment across engineering, finance, and go-to-market stakeholders
  • Strong written and verbal communication skills, with the ability to present complex trade-offs clearly to senior leadership
Nice to Have
  • Experience at a cloud provider, AI infrastructure company, or hyperscaler working on capacity allocation, fleet planning, or compute economics
  • Experience running or contributing to a deal desk process, including capacity checks, pricing approvals, or custom configuration reviews
  • Familiarity with GPU workload characteristics (training vs. inference, model size, throughput/latency trade-offs) and how they influence resource planning
  • Background in building dashboards or lightweight tooling to operationalize planning workflows
About Together AI

Together AI is an AI-native cloud company building the infrastructure to make AI faster, cheaper, and more accessible. We're rapidly scaling our GPU footprint: signing our own data center leases, building large-scale clusters, and expanding toward a global owned-infrastructure presence. Our research team has contributed to breakthroughs like FlashAttention, Hyena, and RedPajama, and we co-design across software, hardware, and algorithms to push the frontier of AI efficiency.

Compensation

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is $230K to $260K, plus equity and benefits. Our salary ranges are determined by location, level, and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Equal Opportunity

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our Privacy Policy at https://www.together.ai/privacy

Top Skills

Python
Spreadsheets
SQL

The Company
San Francisco, California
84 Employees
Year Founded: 2022

What We Do

Together AI is a research-driven artificial intelligence company. We contribute leading open-source research, models, and datasets to advance the frontier of AI. Our decentralized cloud services empower developers and researchers at organizations of all sizes to train, fine-tune, and deploy generative AI models. We believe open and transparent AI systems will drive innovation and create the best outcomes for society.
