Solutions Architect, DGX Cloud

Reposted 21 Days Ago
2 Locations
In-Office or Remote
148K-236K
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Role
As a Solutions Architect, you will enable and support DGX Cloud Partners, enhancing their onboarding experience while providing technical expertise and fostering customer satisfaction with NVIDIA's AI platform.
Summary Generated by Built In

Do you want to be part of the team that brings Artificial Intelligence (AI) emerging technology to the field? We are looking for a hardworking Solution Architect (SA) to join the DGX Cloud SA Segment Team. The mission of the DGX Cloud Segment team is to guide and enable the successful adoption at scale of DGX Cloud and NVIDIA AI Enterprise Software in production.

NVIDIA DGX Cloud is an AI platform for developers, researchers, and enterprises, optimized for the demands of Generative AI. The DGX Cloud SA team is dedicated to shaping the future of DGX Cloud by actively gathering and incorporating partner feedback and product requirements. Our team will help optimize the onboarding process for NVIDIA Cloud Partners, ensuring fast time to insights and exceptional user experience. Additionally, we will collaborate with internal teams to scale expertise and knowledge through training and the creation of repeatable guides. Our focus on building reliable infrastructure, partner qualifications, and assets will streamline onboarding, ultimately increasing adoption of DGX Cloud.

What you’ll be doing:

Work closely with DGX Cloud Partners, become their trusted technical advisor, advocate for their needs, and ensure they are successful in accomplishing their business goals with the platform.

  • Accelerate NVIDIA Cloud Partner onboarding time, cluster manageability and reliability.

  • Scale knowledge, reach, and opportunities by building and educating vertical teams and communities on DGX Cloud and NVIDIA Reference Architectures.

  • Communicate to our Reference Architecture teams findings gathered from the field.

  • Provide technical education and facilitate field product feedback to improve DGX Cloud.

  • Enable partners to participate in the DGX Cloud Ecosystem with the goal of end-user satisfaction and increased sales.

What we need to see:

  • Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science (or equivalent experience)

  • 5+ years of proven experience with one or more Cloud Service Providers (AWS, Azure, GCP or OCI), NVIDIA Cloud Partners (CoreWeave, Lambda Labs, Crusoe, etc) and cloud-native architectures and software.

  • Demonstrated experience in technical leadership, strong understanding of NVIDIA technologies, and success in working with customers.

  • Expertise with parallel filesystems (e.g. Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects (InfiniBand, Omni Path, RoCE, and Gig-E).

  • Strong coding and debugging skills, and demonstrated expertise in one or more of the following areas: Machine Learning, Deep Learning, Slurm, Kubernetes, MPI, MLOps, LLMOps, Ansible, Terraform, and other high-performance AI cluster solutions.

  • Proficient in deploying GPU applications in Slurm, Kubernetes, docker, helm, registries

  • Linux-based configuration management and monitoring solutions, system administration, OS installation, configuration, and troubleshooting

  • Networking technologies (e.g. router, firewall, load balancer, DNS, VPN) for complex infrastructure configuration

Ways to stand out from the crowd:

  • Experience using DGX Cloud, NVIDIA AI Enterprise AI Software including Base Command Manager, NeMo, and NVIDIA's Inference Microservices.

  • Experience with AI application development and deployment

  • Background with deploying and configuring observability tooling including Grafana, Prometheus, W&B, Nagios, Zabbix

  • Experience with high performance or large-scale computing environments.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until September 5, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Ansible
AWS
Azure
Beegfs
Deep Learning
Docker
GCP
Gig-E
Gpfs
Grafana
Helm
Infiniband
Kubernetes
Llmops
Lustre
Machine Learning
Mlops
Mpi
Nagios
Oci
Omni Path
Prometheus
Roce
Slurm
Terraform
W&B
Wekaio
Zabbix
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Santa Clara, CA
21,960 Employees
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”

Similar Jobs

In-Office or Remote
San Francisco, CA, USA
40K-80K Annually

ServiceNow Logo ServiceNow

Staff Software Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
San Diego, CA, USA
156K-273K Annually

ServiceNow Logo ServiceNow

Consultant

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Austin, TX, USA

ServiceNow Logo ServiceNow

Product Manager

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
San Diego, CA, USA
147K-258K Annually

Similar Companies Hiring

Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account