Senior Software Engineer, AI Platform - Robotics

Reposted 2 Hours Ago
Be an Early Applicant
Santa Clara, CA
In-Office
148K-288K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Role
Engineer cloud-native backend services for NVIDIA's robotics platform, focusing on Kubernetes orchestration, microservices development, and scalable ML workflows.
Summary Generated by Built In

We’re building the infrastructure that powers GR00T, NVIDIA’s general-purpose humanoid robotics platform. This is not a typical DevOps job. You’ll help engineer the cloud-native backend that drives simulation, synthetic data generation, multi-stage model training, and robotic deployment—all at massive scale. Our orchestration system, NVIDIA OSMO, is built to handle real-time robotics workflows in cloud environments across thousands of GPUs. We’re looking for a pragmatic Kubernetes-native backend and infrastructure engineer who excels in solving complex orchestration problems in distributed AI/ML systems.

What you’ll be doing:

  • Architect, develop, and deploy backend services supporting NVIDIA GR00T using Kubernetes and cloud-native technologies.

  • Collaborate with ML, simulation, and robotics engineers to deploy scalable, reproducible, and observable multi-node training and inference workflows.

  • Extend and maintain OSMO’s orchestration layers to support heterogeneous compute backends and robotic data pipelines.

  • Develop Helm charts, controllers, CRDs, and service mesh integrations to support secure and fault-tolerant system operation.

  • Implement microservices written in Go or Python that power GR00T task execution, metadata tracking, and artifact delivery.

  • Optimize job scheduling, storage access, and networking across hybrid and multi-cloud Kubernetes environments (e.g., OCI, Azure, on-prem).

  • Build tooling that simplifies deployment, debugging, and scaling of robotics workloads.

What we need to see:

  • BS, MS, or PhD degree in Computer Science, Electrical Engineering, Computer Engineering, or related field (or equivalent experience)

  • 12+ years of work experience in DevOps, backend, or cloud infrastructure engineering.

  • Hands-on experience building and deploying microservices in Kubernetes-native environments.

  • Proficiency in Golang or Python, especially for backend systems and operators.

  • Experience with Helm, or other Kubernetes templating and config management tools.

  • Familiarity with GitOps workflows, observability stacks (e.g., Prometheus, Grafana), and container CI/CD pipelines.

  • Strong understanding of container networking, storage (e.g., PVCs, ephemeral), and scheduling.

Ways to stand out from the crowd:

  • Experience with ML training workflows, distributed job orchestration (e.g., MPI, Ray, Triton Inference Server).

  • Knowledge of robotics frameworks (e.g., ROS2) or simulation tools (e.g., Isaac Sim, Omniverse).

  • Background with GPU cluster management and scheduling across cloud providers.

  • Contributions to open-source Kubernetes projects or custom operators/controllers.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you are creative and autonomous, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 5, and 272,000 USD - 425,500 USD for Level 6.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until December 7, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Ci/Cd
Cloud-Native Technologies
Docker
Gitops
Go
Grafana
Helm
Kubernetes
Prometheus
Python
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Santa Clara, CA
21,960 Employees
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”

Similar Jobs

Mochi Health Logo Mochi Health

Software Engineer

Healthtech • Telehealth
Easy Apply
In-Office
San Francisco, CA, USA
70 Employees
140K-180K Annually
Hybrid
San Francisco, CA, USA
1100 Employees
189K-351K Annually

Notion Logo Notion

Business Systems Analyst

Artificial Intelligence • Productivity • Software
Hybrid
2 Locations
1000 Employees
150K-170K Annually

Snap Inc. Logo Snap Inc.

Senior Client Partner - Restaurants

Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Hybrid
5 Locations
5000 Employees
121K-214K Annually

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account