Project Manager, Hardware & Business Operations

Posted 5 Days Ago
San Francisco, CA
Mid level
Artificial Intelligence • Information Technology
The Role
The Project Manager will optimize and scale decentralized GPU resources, manage GPU hardware inventory, implement tracking systems for GPU performance, and improve operational processes. Responsibilities include system monitoring, reporting, cross-functional collaboration with engineering and operations, and maintaining supplier relationships.
Summary Generated by Built In

Location: San Francisco, CA (Hybrid)

Role: As the first Project Manager for hardware at a pioneering AI infrastructure company, you will be at the core of optimizing and scaling our decentralized GPU resources. Your role is crucial in ensuring that the backbone of our AI models—thousands of GPUs distributed across multiple data centers—operates efficiently and reliably, enabling cutting-edge AI advancements that democratize access to AI technology globally. You’ll have the opportunity to shape the future of AI infrastructure, working alongside top engineers and innovators to power the next generation of AI-driven solutions.


Responsibilities:

  • Monitor and manage GPU hardware inventory across multiple decentralized data centers; track the lifecycle of GPUs, including acquisition, deployment, usage, maintenance, and decommissioning
  • Develop and maintain a system to log and track all GPU outages and malfunctions, including root cause analysis, downtime duration, and replacement cycles; generate reports on utilization, availability, and performance trends, and recommend improvements
  • Continuously seek opportunities to improve GPU tracking processes and systems, including the implementation of automation and data analytics dashboards
  • Work with engineering, customer success, and operations to resolve outages, documenting resolutions and lessons learned for continuous improvement
  • Develop and maintain strong relationships with GPU suppliers to ensure favorable terms and timely deliveries
  • Prepare clear billing summaries, breaking down costs to justify charges based on usage, and serve as the primary contact for billing inquiries from customers

Requirements:

  • Bachelor's degree in business, information technology, engineering, or a related field
  • At least 3 years of experience in technical program management, inventory management, data center operations, and/or project management
  • Proficiency with inventory management and/or project management systems and tools
  • Experience with data analytics and report generation for performance monitoring
  • Experience working cross-functionally with engineering, operations, and external vendors
  • Excellent communication skills for handling customer inquiries
  • Strong attention to detail for maintaining accurate records and documentation
  • Strong problem-solving skills and ability to work in a fast-paced environment

Nice to Have:

  • Experience with cloud computing platforms or decentralized cloud infrastructure
  • Certifications in inventory management or data center operations
  • Proven experience in tracking and managing the lifecycle of GPUs or similar hardware, including acquisition, deployment, maintenance, and decommissioning

About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers on our journey to build the next generation of AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $111,000 - $165,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.


Equal Opportunity

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.


Please see our Privacy Policy at https://www.together.ai/privacy

The Company
San Francisco, California
84 Employees
On-site Workplace
Year Founded: 2022

What We Do

Together AI is a research-driven artificial intelligence company. We contribute leading open-source research, models, and datasets to advance the frontier of AI. Our decentralized cloud services empower developers and researchers at organizations of all sizes to train, fine-tune, and deploy generative AI models. We believe open and transparent AI systems will drive innovation and create the best outcomes for society.

