NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 30 years. Today, we're at the forefront of AI innovation powering breakthroughs in research, autonomous vehicles, robotics, and more. The DGX Cloud team builds and operates the AI infrastructure that fuels this progress.
We’re looking for a Senior Full-Stack Software Engineer to join the AI Hub team within the DGX Cloud AI Infrastructure organization. The AI Hub team accelerates AI research by ensuring NVIDIA’s AI infrastructure is used efficiently, transparently, and at scale. Our primary goal is to build a unified, self-service “single pane of glass” portal that enables AI researchers to efficiently manage, monitor, and optimize their use of Managed AI research Superclusters.
What You’ll Be Doing:
Lead the architecture and delivery of high-scale web products across frontend, backend services, and data layers, with clear availability and latency targets (SLOs/SLAs).
Own multi-team initiatives end to end: problem discovery, RFCs/design reviews, phased rollouts, and success metrics tied to product and business outcomes.
Drive reliability, performance, and observability improvements to meet exascale standards.
Establish engineering standards and reusable platforms/design systems to reduce complexity, support load and long-term tech debt.
Collaborate with NVIDIA AI Research teams to identify pain points and deliver the next generation user experience that accelerates their work.
Mentor and sponsor engineers; improve code quality, testing, security, and observability through reviews, pairing, and coaching.
Stay ahead of AI/ML infrastructure trends and drive adoption of best practices within the team.
What We Need To See:
12+ years of software engineering experience delivering production web systems.
Bachelor’s degree or higher in Computer Science or a related technical field (or equivalent experience).
Strong cross-functional collaboration skills, including active listening, translating complex use cases into clear technical requirements, and designing data models aligned with business logic and outcomes.
Deep cloud expertise (AWS, GCP, or Azure), infrastructure as code, containers, and orchestration (Docker, Kubernetes), along with mature CI/CD and safe deployment practices.
Full-stack depth: modern SPA frameworks (React/Next.js or Vue/Nuxt), JavaScript/TypeScript, and one or more backend languages (Node.js, Python, and/or Golang).
Familiarity with observability stacks such as OpenSearch, Prometheus, Grafana, or Loki.
Proficiency in API design (REST), schema evolution, and integration patterns, with a strong commitment to automated testing.
Experience building machine learning platforms or self-service internal infrastructure tools focused on efficiency, resiliency, and observability.
Clear written and verbal communication skills, strong problem-solving ability, and a growth mindset.
Experience leveraging AI-assisted development tools (e.g., Cursor).
Ways to Stand Out from the Crowd:
Hands-on ML platform depth (MLE experience or strong familiarity with DL frameworks such as PyTorch, TensorFlow, JAX; distributed training ecosystems like Ray).
Datacenter-scale operational experience, including GPU cluster debugging, performance triage, and root-cause analysis across complex distributed systems.
At NVIDIA, you’ll be immersed in a diverse, supportive environment where you’re empowered to do your best work. The DGX Cloud AI Infrastructure team is at the core of NVIDIA’s AI efforts building the software that makes scalable research possible. Join us and help power the next wave of innovation. NVIDIA provides competitive salaries and a comprehensive benefits package. Our engineering teams are expanding rapidly due to exceptional growth.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD.You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.Skills Required
- 12+ years of software engineering experience delivering production web systems
- Bachelor's degree or higher in Computer Science or a related technical field
- Deep cloud expertise (AWS, GCP, or Azure) and infrastructure as code
- Full-stack depth: modern SPA frameworks (React/Next.js or Vue/Nuxt) and one or more backend languages (Node.js, Python, or Golang)
- Familiarity with observability stacks (OpenSearch, Prometheus, Grafana)
NVIDIA Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about NVIDIA and has not been reviewed or approved by NVIDIA.
-
Equity Value & Accessibility — Equity awards and a discounted ESPP are highlighted as core parts of total compensation, enabling employees to share in the company’s success. Stock-based compensation and the two-year lookback ESPP are consistently described as especially valuable.
-
Healthcare Strength — Health coverage is portrayed as robust, with comprehensive medical, dental, and vision options alongside mental health support and on-site care resources. Employer HSA contributions and wellness perks reinforce the depth of the offering.
-
Retirement Support — Retirement programs are depicted as strong, featuring a meaningful 401(k) match with Roth options and support for Mega Backdoor Roth contributions. These elements position long-term savings as a notable advantage of the total rewards package.
NVIDIA Insights
What We Do
NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”









