NVIDIA's AI Software Platforms team is building the next generation of agentic AI infrastructure that lets coding agents synthesize, optimize, and deploy GPU kernels automatically. This job focuses on crafting AI kernels that connect data pipelines, evaluation suites, and GPU-accelerated runtimes. This helps developers safely release faster, better-performing inference and training solutions.
As Product Managers at NVIDIA, we enable developers to be successful on the NVIDIA platform and push the boundaries of what is possible with AI deployments. In this role, you will act as the internal champion for AI agents and LLM-based coding workflows that generate optimized kernels. You'll partner closely with engineering, research, and customers to define strategy, develop roadmaps, and build products that span the entire agent lifecycle — from data collection and synthetic data generation to evaluation, deployment, and continuous improvement.
What you'll be doing:
We architect agent-focused products that let coding agents generate, refactor, and optimize CUDA kernels and graph-level execution plans across diverse GPU architectures.
Define the end-to-end data lifecycle for agent training and evaluation, including dataset curation, artificial data creation, and benchmark suites for correctness, latency, and adaptability.
Partner with CUDA, kernel, and compiler engineering teams to integrate agents with compilers, profilers, execution sandboxes, and runtimes in a safe, observable way.
We collaborate with internal and external developers, NVIDIA leaders, and ecosystem partners to drive multi-agent orchestration, prioritize features, and deliver launches and messaging for agentic AI kernel generation.
What we need to see:
7+ years of technical product management or closely related experience shipping developer or platform products in AI, ML infrastructure, or high-performance computing; we care deeply about end-to-end ownership and impact.
Proven experience in the AI agent or LLM space, including developing or productizing coding agents. Experience with multi-agent orchestration and self-healing or code loops that improve over time is required. Candidates should also have worked on connecting agents to compilers or execution environments.
Proven record of crafting and releasing automated testing or evaluation suites. These suites measure agents on non-subjective metrics such as correctness, performance, and latency. We rely on data to guide both development and iteration.
BS or MS in Computer Engineering, Computer Science, or a related technical field, or equivalent experience in parallel computing architectures and systems.
Ways to stand out from the crowd:
PhD or equivalent experience in Computer Engineering, Computer Science, or another technical specialty.
Track record building or launching coding-agent platforms or copilots used by development teams at scale, and contributions to performance-critical open-source projects (e.g., Triton, TVM, FlashAttention, kernel libraries, agent frameworks) with clear community adoption and impact.
Research experience in GPU kernel optimization, collective or group communication algorithms, multi-agent systems, or ML model serving / inference architectures that shows how you think about systems end-to-end.
Experience crafting cost-per-inference or cost-per-token models that incorporate hardware utilization, energy efficiency, and cluster scaling, and using those models to guide product strategy and tradeoffs.
You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.Skills Required
- 7+ years of technical product management experience in AI, ML infrastructure, or high-performance computing
- Proven experience in AI agent or LLM space with multi-agent orchestration
- Experience crafting automated testing or evaluation suites
- BS or MS in Computer Engineering, Computer Science, or related field
NVIDIA Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about NVIDIA and has not been reviewed or approved by NVIDIA.
-
Equity Value & Accessibility — Equity awards and a discounted ESPP are highlighted as core parts of total compensation, enabling employees to share in the company’s success. Stock-based compensation and the two-year lookback ESPP are consistently described as especially valuable.
-
Healthcare Strength — Health coverage is portrayed as robust, with comprehensive medical, dental, and vision options alongside mental health support and on-site care resources. Employer HSA contributions and wellness perks reinforce the depth of the offering.
-
Retirement Support — Retirement programs are depicted as strong, featuring a meaningful 401(k) match with Roth options and support for Mega Backdoor Roth contributions. These elements position long-term savings as a notable advantage of the total rewards package.
NVIDIA Insights
What We Do
NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”








