Senior Software Engineer, Metropolis AI NIM

Posted 16 Hours Ago
Santa Clara, CA
Senior level
Artificial Intelligence • Hardware • Robotics • Software • Metaverse
The Role
The Senior Software Engineer will develop and optimize AI models as NVIDIA Inference Microservices, collaborating with partners to implement advanced computer vision and language models. Responsibilities include design and development of streaming AI pipelines, performance optimization, and adherence to quality standards. Candidates should have strong problem-solving skills and extensive experience in AI and deep learning methodologies.
Summary Generated by Built In

We are seeking a senior software engineer for Metropolis AI NIM to develop and deliver state-of-the-art AI models to the world in the form of NVIDIA Inference Microservices (NIM). You will collaborate across the organization to bring the latest flagship models (both CV and Vision-Language Models) from our community and partners, such as VILA and Florence-2, to life as optimized NIMs. This role offers an outstanding opportunity to craft the future of AI at a fast-growing company at the forefront of the AI revolution. Join our team of world-class software engineers and partners to deliver the most advanced models with lightning-fast inference.

In this role, you will develop hardware-accelerated solutions that enable rapid creation and deployment using the latest deep learning, artificial intelligence, and computer vision technologies. You will collaborate within a worldwide matrixed software team focused on core technologies for Multi-Modal and Streaming AI applications, including CV and Vision-Language Model (VLM) inference pipelines and Omniverse-based simulation technologies, and you will have broad impact within our highly dynamic, technology-focused company.

What you’ll be doing:

  • Collaborate closely with our partners and the open-source community to deliver their flagship models as highly optimized NVIDIA Inference Microservices (NIM).

  • Research and develop innovative deep learning methodologies to accurately evaluate new model families across diverse domains.

  • Analyze, influence, and enhance AI/DL libraries, frameworks, and APIs, ensuring consistency with best engineering practices.

  • Design and develop accelerated streaming AI pipelines using CV and VLM models, and lead technical design discussions.

  • Profile and optimize the AI pipelines to ensure scalability, reliability, and efficiency.

  • Take on complex system-level optimization and resource utilization challenges.

  • Participate in a product development lifecycle that values high standards for clear requirements, software quality and performance.

  • Write code in Python and C++.

What we need to see:

  • BS, MS, or PhD in Computer Science, AI, Applied Math, or a related field, or equivalent experience, with 5+ years of industry experience.

  • 3+ years of hands-on experience in AI for computer vision (CV) and large language models (LLMs).

  • Experience in complex system design and development using Python, C++14/17/20, and object-oriented programming.

  • Strong problem-solving, debugging, performance analysis, test design, and documentation skills.

  • Solid mathematical foundations and expertise in AI/DL algorithms.

  • Excellent written and verbal communication skills, with the ability to work both independently and collaboratively in a fast-paced environment.

  • Passion for expanding your technical knowledge into new areas.

  • Ability to excel in a multinational, multi-time-zone environment: excellent verbal and written communication skills, strong collaboration, and a commitment to representing our core values.

Ways to stand out from the crowd:

  • Demonstrated implementation of computer vision / machine learning applications, microservices, container and cloud-native application development.

  • Experience with cloud-native architecture involving Docker, Kubernetes (k8s), and microservices.

  • Hands-on experience with inference and deployment environments like TensorRT, ONNX, Triton, or vLLM.

  • Experience working with high-availability environments.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and passionate people in the world working for us. Are you a creative problem solver with a passion for solving real-world problems with AI? If so, we want to hear from you.

The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

C++
Python
The Company
HQ: Santa Clara, CA
21,960 Employees
On-site Workplace
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”
