Networking Software Expert, RDMA/RoCE – Next-Generation AI Infrastructure Storage Platform

Posted 15 Days Ago
Be an Early Applicant
Center District, VA, USA
Hybrid
Senior level
Hardware • Information Technology • Semiconductor • Manufacturing
The Role
The Networking Software Expert will design low-latency software architecture for a next-gen AI storage platform, focusing on high-throughput data paths, performance optimization, and collaboration across teams.
Summary Generated by Built In
Company Description

Sandisk understands how people and businesses consume data and we relentlessly innovate to deliver solutions that enable today’s needs and tomorrow’s next big ideas. With a rich history of groundbreaking innovations in Flash and advanced memory technologies, our solutions have become the beating heart of the digital world we’re living in and that we have the power to shape.

Sandisk meets people and businesses at the intersection of their aspirations and the moment, enabling them to keep moving and pushing possibility forward. We do this through the balance of our powerhouse manufacturing capabilities and our industry-leading portfolio of products that are recognized globally for innovation, performance and quality.

Sandisk has two facilities recognized by the World Economic Forum as part of the Global Lighthouse Network for advanced 4IR innovations. These facilities were also recognized as Sustainability Lighthouses for breakthroughs in efficient operations. With our global reach, we ensure the global supply chain has access to the Flash memory it needs to keep our world moving forward.

Job Description

We are building a next-generation storage platform for AI infrastructure that combines high-performance flash, accelerator technologies, and advanced storage software, with the goal of delivering a breakthrough step-function improvement in cost, power efficiency, density, and scalability for AI-era data-center storage.

We are seeking a Networking Software Expert, RDMA/RoCE to define and drive the low-latency network data path of this platform, enabling efficient scaling from node-level communication up to large-scale infrastructure deployment.

This role will work at the intersection of storage, networking, and system architecture, shaping how low-latency networking, advanced NIC technologies, software services, and hardware capabilities come together in a tightly optimized system. The role will also help strengthen Sandisk's broader data-center networking and distributed infrastructure architecture knowledge across the Architecture organization.

Responsibilities:

  • Drive the low-latency networking software architecture and implementation for a groundbreaking storage platform targeting step-change improvements in cost, power, density, and scalability
  • Define and implement high-throughput, low-latency data paths across storage nodes and larger-scale deployments 
  • Own the software architecture around advanced NIC integration, queueing models, completion handling, memory registration strategy, and zero-copy data movement 
  • Analyze and optimize end-to-end network behavior, including latency, throughput, CPU efficiency,  congestion sensitivity, and tail behavior under scale 
  • Help define how data, metadata, and control-plane traffic are partitioned across the platform 
  • Work closely with architecture, hardware, firmware, software, and silicon teams on HW/SW partitioning decisions and opportunities for networking acceleration
  • Drive performance bring-up and debugging on real high-speed Ethernet environments
  • Contribute to transport-level design decisions for multi-node communication and large-scale system scaling
  • Build and optimize robust software for error handling, recovery, observability, and production readiness in RDMA environments 
  • Contribute to technical direction, coding standards, and architectural quality across the networking and storage stack

Qualifications

Required:

  • 8+ years of experience in high-performance networking, distributed systems, or low-latency infrastructure software
  • Strong hands-on experience with RDMA/RoCE software development in Linux environments
  • Deep experience with C/C++ and Linux systems programming 
  • Strong understanding of RoCE/RDMA concepts including queue pairs, completion queues, low-latency transport, memory registration, and zero-copy data movement
  • Experience with high-speed Ethernet environments and performance tuning of low-latency data paths
  • Demonstrated expertise in performance analysis, bottleneck identification, profiling, and optimization under multi-node or large-scale conditions
  • Ability to debug complex issues across software, firmware, NIC/RNIC, and system-level integration boundaries 
  • Strong collaborator with the ability to work across architecture, hardware, firmware, silicon, and software teams 
  • Fluent English with the technical clarity and credibility required to work with technical stakeholders and partners

Preferred:

  • Experience with storage networking, NVMe-oF, or distributed infrastructure systems
  • Experience with DPU/SmartNIC environments and modern data-center NIC architectures
  • Experience with congestion-control tuning, telemetry, and large-scale network bring-up
  • Background in AI infrastructure, hyperscaler environments, or startup environments
  • Experience bringing complex infrastructure software from concept to production

Additional Information

Sandisk thrives on the power and potential of diversity. As a global company, we believe the most effective way to embrace the diversity of our customers and communities is to mirror it from within. We believe the fusion of various perspectives results in the best outcomes for our employees, our company, our customers, and the world around us. We are committed to an inclusive environment where every individual can thrive through a sense of belonging, respect and contribution.

Sandisk is committed to offering opportunities to applicants with disabilities and ensuring all candidates can successfully navigate our careers website and our hiring process. Please contact us at [email protected] to advise us of your accommodation request. In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.

Skills Required

  • 8+ years of experience in high-performance networking, distributed systems, or low-latency infrastructure software
  • Strong hands-on experience with RDMA/RoCE software development in Linux environments
  • Deep experience with C/C++ and Linux systems programming
  • Strong understanding of RoCE/RDMA concepts including queue pairs and zero-copy data movement
  • Experience with high-speed Ethernet environments and performance tuning
  • Demonstrated expertise in performance analysis and optimization
  • Ability to debug complex issues across software and firmware integration boundaries
  • Strong collaboration skills with cross-functional teams
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
11,000 Employees
Year Founded: 1988

What We Do

Sandisk is a leading developer, manufacturer, and provider of data storage devices and solutions based on NAND flash technology, including memory cards, USB flash drives, and solid-state drives (SSDs).

Similar Jobs

MetLife Logo MetLife

Customer Care Advocate Disability Service- Omaha NE 7.20.26

Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Remote or Hybrid
United States
43000 Employees
42K-42K Annually

General Motors Logo General Motors

District Manager, OnStar Fleet & Commercial - SCR

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Remote or Hybrid
United States
165000 Employees

General Motors Logo General Motors

Chevrolet District Manager Parts and Service

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Remote or Hybrid
United States
165000 Employees
81K-109K Annually

Superhuman Logo Superhuman

Senior ABX Manager

Artificial Intelligence • Information Technology • Machine Learning • Natural Language Processing • Productivity • Software • Generative AI
Remote or Hybrid
United States
1500 Employees
137K-209K Annually

Similar Companies Hiring

Fairly Even Thumbnail
Hardware • Other • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Amalgamated Sugar Thumbnail
Food • Greentech • Agriculture • Industrial • Manufacturing
Boise, Idaho
768 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account