Senior Software Engineer - DGX Cloud API Services

Posted 20 Days Ago
3 Locations
In-Office or Remote
168K-322K
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Role
The role involves building GPU-accelerated Kubernetes clusters, enhancing APIs for the DGX Cloud Platform, and ensuring user experience. Requires strong skills in Go and Kubernetes services.
Summary Generated by Built In

Join NVIDIA's DGX Cloud Kubernetes API Services team and be at the forefront of building GPU-accelerated Kubernetes clusters supporting NVIDIA AI, robotics, and scientific computing projects. As an API Services Software Engineer, you will work across the stack with partner teams to bring NVIDIA's GPUs to life in a cloud or on-prem environment, ensuring end-to-end performance across compute, storage, and networking.

An API Services engineer is above all responsible for ensuring a good customer and developer experience on the DGX Cloud Kubernetes Platform, working with our Runtime and Cluster Architecture teams and more to be the voice of the customer. We serve both customers looking to access their GPU compute via Kubernetes for whatever workloads they wish, as well as developers looking to use our API automation to bring their own services such as node health monitoring to all of NVIDIA.

What you will be doing:

  • Help build out and scale customer-facing APIs and systems for the DGX Cloud Kubernetes Platform

  • Work with the Runtime and Cluster Architecture teams to provide a complete GPU-accelerated Kubernetes clusters to a wide variety of NVIDIA initiatives

  • Be the voice of our customers to ensure they have a smooth experience to access the compute they need for the workloads they want

  • Build platform services for other NVIDIA developers to bring their services to NVIDIA Kubernetes clusters

What we need to see:

  • BS/MS in Computer Science or related field (or equivalent experience).

  • 8+ years of relevant work experience.

  • Experience in building foundational SaaS systems at scale, such as API design, user management, or authentication and authorization flows

  • Proficiency in Go and building Go services at scale

  • Experience with deploying and maintaining services atop Kubernetes

  • Experience writing automation with Kubernetes (i.e. Controllers, CustomResourceDefinitions, etc.)

  • Background with AWS or GCP and related technologies like S3, GCS, RDS, etc.

  • Ability to solve issues across multiple layers: infrastructure, Kubernetes, application runtime

  • Communicate effectively across a big organization, both within and outside the Kubernetes Platform organization

Ways to stand out from the crowd:

  • Experience working on internal tools and services for large engineering organizations

  • Experience working across multiple layers of cloud infrastructure such as CSP APIs, Terraform, Kubernetes, and custom controllers and automation atop

  • Experience working deeply in and with the upstream Kubernetes apiserver code

  • Background with user-facing APIs with a focus on customer and/or developer experience

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're a creative, curious, and driven technical leader, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 264,500 USD for Level 4, and 200,000 USD - 322,000 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until September 30, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

AWS
GCP
Gcs
Go
Kubernetes
Rds
S3
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Santa Clara, CA
21,960 Employees
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”

Similar Jobs

Remote
United States
100K-120K Annually

monday.com Logo monday.com

Senior Business Analyst

Productivity • Sales • Software
Remote or Hybrid
New York, NY, USA
145K-180K Annually

monday.com Logo monday.com

Sales Development Representative

Productivity • Sales • Software
Remote or Hybrid
New York, NY, USA
70K-85K

Chamberlain Group Logo Chamberlain Group

Retail Sales Operations Lead

Automotive • Hardware • Internet of Things • Mobile • Software • App development • PropTech
Remote or Hybrid
2 Locations
113K-186K Annually

Similar Companies Hiring

Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account