Senior GPU Kernel Performance Lead

Sorry, this job was removed at 06:07 p.m. (CST) on Tuesday, Jan 13, 2026
Santa Clara, CA
In-Office
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Role

We're now looking for a Senior GPU Kernel Performance Lead. Do you enjoy analyzing and reporting on GPU kernel performance? If so, consider applying for the role of Senior GPU Kernel Performance Analysis Lead! Our team delivers high-performance GPU math kernels to NVIDIA’s cuDNN, cuBLAS, and TensorRT libraries to accelerate deep learning models. The team is proud to play an integral part in enabling breakthroughs in domains such as image classification, speech recognition, natural language processing, and large language models. We’re always striving for peak performance and energy efficiency on current and future-generation GPUs.

As a kernel performance analysis lead, you will oversee all efforts pertaining to the performance of our kernels. Join the team that is building the underlying software used across the world to power the revolution in artificial intelligence! To get a sense of the code we write, check out our CUTLASS open-source project showcasing performant matrix multiply on NVIDIA’s Tensor Cores with CUDA. While there will be opportunities for hands-on development, this position is specifically to lead a team that validates the performance of the kernels.

What you’ll be doing:

  • Specify test cases, derived from Deep Learning workloads, to provide adequate directed and use-case coverage across all kernels on both simulation and silicon targets

  • Determine performance theory through the development and use of analytical models

  • Track and report on kernel performance throughout the development lifecycle by using and expanding upon current infrastructure

  • Provide feedback to the kernel developers by identifying performance regressions and opportunities to reach the achievable peak performance

What we need to see:

  • PhD in Computer Science, Computer Engineering, Applied Math, or related field (or equivalent experience) with 8+ years of relevant industry experience

  • Demonstrated strong C++ programming and software design skills, including debugging, performance analysis, and test design

  • Experience leading or managing a team relating to the performance of CPUs, GPUs, or other DL accelerators

Ways to stand out from the crowd:

  • Experience with analytical models and cycle-accurate HW simulators

  • Knowledgeable about performance tools like Nsight or VTune

  • Programming experience beyond C++ including assembly, MLIR/LLVM, Python, and CUDA/OpenCL

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you a creative and collaborative software leader seeking new challenges? If so, we want to hear from you! Come, join our DL Architecture team and help build the real-time, cost-effective AI computing platform driving our success in this exciting and quickly growing field.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 5, and 272,000 USD - 431,250 USD for Level 6.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until January 13, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

The Company
HQ: Santa Clara, CA
21,960 Employees
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”
