AI Network System Architect

Posted 2 Days Ago
3 Locations
Mid level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Role
As a Senior AI Network System Architect, you will investigate technologies in ML/AI, optimize communication libraries, and conceptualize innovative networking products to enhance AI workloads.
Summary Generated by Built In

Our technology has no boundaries! NVIDIA is building the world’s most groundbreaking and state-of-the-art accelerated computing platforms. Because of our work, scientists, researchers, and engineers can advance their ideas. We pioneered a supercharged form of computing loved by the fastest-paced computer users in the world - scientists, designers, artists, and gamers.

We seek a highly motivated Senior AI Network System Architect to join our team of experts and help shape the future of high-performance and ML/AI computing. Our next-generation InfiniBand, NVLink, and Ethernet systems will be at the forefront of connecting and powering the world's most advanced AI clusters. As an AI system architect at NVIDIA, you will have the opportunity to work on some of the most cutting-edge technology and help drive the innovation of our next-generation networks that top researchers and engineers worldwide will use.

What You’ll Be Doing:

  • Investigating emerging technologies and methodologies in ML and AI to discern their interactions with network infrastructure.

  • Executing workloads on AI systems, conducting profiling, and analyzing bottlenecks and possible enhancements.

  • Conducting research and implementing optimizations for communication libraries like NCCL and UCX.

  • Spearheading the conceptualization of next-generation networking products tailored to support and accelerate state-of-the-art ML workloads.

  • Developing simulation models, analyzing simulation results, and developing optimization algorithms.

  • Collaborating with multi-functional teams, including other architecture teams, logic design, system software, firmware, and ML research teams, to ensure the successful execution of the project.

What We Need To See:

  • M.Sc. or Ph.D. degree in Computer Science, Computer Engineering, or Electrical Engineering.

  • At least 2 years of industry or research experience in computer networks.

  • Extensive expertise in ML/AI workloads, particularly in distributed training.

  • Excellent understanding of large-scale network behavior and the effect of distributed computing workloads on the network.

  • Experience in the development of simulation environments.

  • Great problem-solving and critical-thinking skills.

  • Ability to thrive in a fast-paced and dynamic environment.

  • Ability to work concurrently with multiple groups in the organization.

Ways To Stand Out From The Crowd:

  • Knowledge of communication libraries such as NCCL, UCX, and UCC.

  • Good knowledge of network protocols, such as InfiniBand, IP, TCP, and RoCE, and of network topologies.

  • Experience with Python, C++, and Docker.

  • Expertise in system engineering, operations research, and intricate hardware-software integrated systems.

  • Demonstrated experience with DLRM, LLM, or other generative AI workloads.

NVIDIA has some of the most forward-thinking and hardworking people in the world working for us, and due to unprecedented growth, our world-class engineering teams are growing fast. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

We are committed to fostering a diverse work environment and are proud to be an equal-opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, perform essential job functions, and receive other benefits and privileges of employment. Please contact us to request accommodation.

Top Skills

C++
InfiniBand
IP
NCCL
Python
RoCE
TCP
UCC
UCX

The Company
HQ: Santa Clara, CA
21,960 Employees
On-site Workplace
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”
