Capacity Operations and Analytics Manager

Posted Yesterday
Be an Early Applicant
2 Locations
In-Office or Remote
200K-322K
Expert/Leader
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Role
Manage and optimize GPU capacity and compute resources, develop data models and reporting systems, collaborate across teams to improve efficiency and meet business needs.
Summary Generated by Built In

Our technology has no boundaries! NVIDIA is building the world’s most groundbreaking and pioneering computing platforms. Because of our work, scientists, researchers, and engineers can advance their ideas. At its core, our visual computing technology not only enables an outstanding computing experience but it is also energy efficient! We pioneered a supercharged form of computing loved by the most fast-paced computer users in the world - scientists, designers, artists, and gamers. It’s not just technology, though! It is our people, some of the brightest in the world, and our company makes NVIDIA one of the most fun, innovative, and dynamic places to work! At the center of NVIDIA are our core values, like innovation, excellence, determination, and team, that guide us to be the best we can be.

What you will be doing:

  • Manage and optimize GPU capacity and other compute resources across various cloud service providers to meet growing demands and ensure efficient utilization.

  • Build, develop, and maintain data models, reporting systems, data automation systems, dashboards, and performance metrics that support NVIDIA Infrastructure governance programs and strategic capacity decisions.

  • Analyze the technical and business needs for GPU capacity and other compute resources from various internal and external teams.

  • Identify performance bottlenecks in day-to-day usage of compute resources and collaborate with relevant infrastructure teams to resolve them.

  • Drive infrastructure resource efficiency initiatives in partnership with engineering, finance, and product teams.

  • Develop and enhance tooling for our cloud infrastructure and analytics platform to optimize resource usage and performance for NVIDIA and its customers. This includes crafting and developing tools for automating workflows and potentially leveraging AI techniques to extract useful signals and insights from generated data.

  • Partner and cross-collaborate with Finance, Product, Service Owners, and Infrastructure Engineering teams to align cloud capacity management with company goals and develop Infrastructure and Service Level Key Performance Indicators (KPIs) to match Customer satisfaction.

  • Lead multi-year budget-based compute resource planning with engineering.

What we need to see:

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field, or equivalent experience.

  • 12+ years of overall experience in cloud computing, specifically in managing or sourcing GPU capacity with cloud service providers. A proven track record of large-scale computing operations and planning is a plus.

  • Strong technical proficiency in cloud architecture, development and deployment, and managing large data sets.

  • Deep understanding of cloud service models (IaaS, PaaS, SaaS) and cloud infrastructure technologies. Experience with Cloud Service Providers such as AWS, Azure, GCP, and OCI is required.

  • Demonstrated experience in leveraging AI tools and techniques to extract useful signals and insights from data, specifically to improve resource usage and automation

  • Strong understanding and practical application of statistical modeling and machine learning methodologies for improving operational efficiency and informing strategic capacity decisions

  • Proficiency with data analytics, visualization, and monitoring tools such as Kibana, Grafana, Splunk, Prometheus, Tableau, Plotly.

  • Knowledge of analytics, statistical modeling, and machine learning methodologies.

  • Excellent communication and interpersonal skills, with the ability to collaborate effectively with various departments and influence strategic decisions.

  • Ability to operate effectively amidst uncertainty and rapidly changing business conditions, with an agile mindset and a commitment to ongoing improvement.

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence. NVIDIA is widely considered one of the technology world’s most desirable employers. Some of the world's most forward-thinking and hardworking people are working for us. If you're creative and autonomous, we want to hear from you!

#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 200,000 USD - 322,000 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until October 25, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Ai Tools
AWS
Azure
Cloud Architecture
Cloud Computing
Data Analytics
GCP
Gpu Capacity
Grafana
Kibana
Machine Learning
Oci
Plotly
Prometheus
Splunk
Statistical Modeling
Tableau
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Santa Clara, CA
21,960 Employees
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”

Similar Jobs

Flatfile Logo Flatfile

Product Engineer

Artificial Intelligence • Software • Database
Remote
United States
55 Employees
220K-300K Annually

Flatfile Logo Flatfile

Product Engineer

Artificial Intelligence • Software • Database
Remote
United States
55 Employees
200K-250K Annually

Headway Logo Headway

Program Manager

Consumer Web • Healthtech • Professional Services • Social Impact • Software
Easy Apply
Remote
USA
819 Employees
122K-179K

Cohere Health Logo Cohere Health

RN Reviewer

Healthtech • Software
Easy Apply
Remote
United States
900 Employees
32-35

Similar Companies Hiring

Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account