Technical Product Manager – Storage

Posted Yesterday
Be an Early Applicant
27 Locations
In-Office or Remote
Senior level
Artificial Intelligence • Information Technology • Consulting
The Role
Own the vision and backlog for Nebius storage services (block, file, parallel/HPC, object). Define technical requirements (data paths, consistency, replication, erasure coding), make performance/durability trade-offs, coordinate cross-team initiatives, and ensure storage meets durability, availability, scalability, and cost targets for AI/ML and HPC workloads.
Summary Generated by Built In

About Nebius:

Nebius is leading a new era in cloud infrastructure for the global AI economy. We are building a full-stack AI cloud platform that supports developers and enterprises from data and model training through to production deployment, without the cost and complexity of building large in-house AI/ML infrastructure.

Built by engineers, for engineers. From large-scale GPU orchestration to inference optimization, we own the hard problems across compute, storage, networking and applied AI.

Listed on Nasdaq (NBIS) and headquartered in Amsterdam, we have a global footprint with R&D hubs across Europe, the UK, North America and Israel. Our team of 1,500+ includes hundreds of engineers with deep expertise across hardware, software and AI R&D.

The Role

Nebius is looking for a deeply technical Technical Product Manager – Storage to join the team. In this role, the candidate will own the vision, roadmap, and priorities for storage services in Nebius Cloud, including block storage, file and parallel file systems, object storage, and the storage capabilities that underpin AI/ML and HPC workloads.

The candidate will be responsible for shaping and managing backlogs for storage service teams and leading company-wide initiatives related to storage. This is a hands-on, technically demanding role: the candidate is expected to reason about storage internals, data paths, and performance trade-offs directly with engineers. It requires strong technical depth combined with the ability to coordinate across engineering, development, product, technical support, and go-to-market teams.

Responsibilities
  • Own and manage the product backlog for storage service teams (block, file, parallel/HPC, and object storage)
  • Lead and coordinate cross-company initiatives involving storage, including data durability, performance, capacity, and cost initiatives
  • Work closely with engineering and architecture teams to define product requirements at the level of data paths, consistency models, replication and erasure coding, and performance characteristics, and deliver new storage features
  • Make informed technical trade-offs on durability, availability, latency, throughput, and cost, and defend them with data
  • Partner with product marketing and technical pre-sales/post-sales teams on technical publications, go-to-market activities, customer engagement, acquisition, and retention related to storage
  • Ensure the delivery of storage services that meet high standards for durability, availability, performance, scalability, and cost efficiency, including storage for AI/ML training and inference and HPC
Requirements
  • Hands-on experience building storage products in the cloud is the single most important requirement: the candidate must have designed, built, or operated cloud storage services (not just used or integrated them), in senior engineering, architecture, or technical leadership roles at hyperscalers, cloud providers, storage vendors, or other advanced technology companies
  • Deep understanding of storage systems internals: block, file, and object storage architectures, distributed storage, replication and erasure coding, consistency and durability models, and the read/write data path
  • Hands-on familiarity with technologies and interfaces such as NVMe/NVMe-oF, iSCSI, POSIX and parallel file systems (e.g., Lustre, GPFS/Spectrum Scale, BeeGFS), Ceph, S3-compatible object storage, and RDMA-based data paths
  • Ability to reason quantitatively about IOPS, throughput, latency (including tail latency), and cost per usable TB, and to read and interpret benchmarks
  • Strong technical expertise in at least two of the following areas:
    • Block storage and virtualization (volumes, snapshots, replication, NVMe-oF)
    • File and parallel file systems for HPC/AI (Lustre, GPFS, BeeGFS, NFS)
    • Distributed storage internals (replication, erasure coding, consistency, repair)
    • Storage performance engineering and benchmarking
    • Object storage at scale (S3-compatible APIs, metadata, multipart, lifecycle)
  • Proven track record of delivering complex technical initiatives requiring coordination across multiple teams or stakeholders
  • Technical leadership experience is a strong plus
  • Product management experience is not required, but a strong willingness to learn and grow into the role is essential
Nice to Have
  • Experience with storage for AI/ML training pipelines and large-scale HPC (checkpointing, data loading at scale, high-throughput sequential and random access patterns)
  • Experience creating technical documentation, guides, tutorials, or reference architectures for storage products
Ideal Candidate

The ideal candidate is a technically strong professional with a background in cloud or cloud storage products at hyperscalers (AWS, GCP, Azure), other public cloud providers, storage vendors, their partners, or highly digitalized enterprises.

The candidate may come from a background in storage engineering, distributed systems, architecture, SRE, or technical leadership, with hands-on experience building cloud storage services, distributed storage systems, or high-performance storage for ML and HPC. Experience in technical product or platform ownership or technical leadership is highly valued, even if the candidate has not held a formal manager title.

Relevant experience in ML and HPC environments is strongly desirable given the storage demands of these workloads, though the candidate is expected to adapt quickly and learn new domains.

About Nebius

Nebius AI is an AI cloud platform with one of the largest GPU capacities in Europe. Launched in November 2023, the Nebius AI platform provides high-end, training-optimized infrastructure for AI practitioners. As an NVIDIA preferred cloud service provider, Nebius AI offers a variety of NVIDIA GPUs for training and inference, as well as a set of tools for efficient multi-node training. 

Nebius AI owns a data center in Finland, built from the ground up by the company’s R&D team and showcasing our commitment to sustainability. The data center is home to ISEG, the most powerful commercially available supercomputer in Europe and the 16th most powerful globally (Top 500 list, November 2023).  

Nebius’s headquarters are in Amsterdam, Netherlands, with teams working out of R&D hubs across Europe and the Middle East. 

Nebius AI is built with the talent of more than 500 highly skilled engineers with a proven track record in developing sophisticated cloud and ML solutions and designing cutting-edge hardware. This allows all the layers of the Nebius AI cloud – from hardware to UI – to be built in-house, distictly differentiating Nebius AI from the majority of specialized clouds: Nebius customers get a true hyperscaler-cloud experience tailored for AI practitioners. We’re growing and expanding our products every day. 

If you’re up to the challenge and are excited about AI and ML as much as we are, join us!

Benefits & Perks:

  • Competitive compensation
  • Career growth and learning opportunities
  • Flexibility and ownership
  • Collaborative and innovative culture
  • Opportunity to work on impactful AI projects
  • International environment and talented teams

What's it like to work at Nebius:

Fast moving - Bold thinking - Constant growth - Meaningful impact - Trust and real ownership - Opportunity to shape the future of AI 

Equal Opportunity Statement:

Nebius is an equal opportunity employer. We are committed to fostering an inclusive and diverse workplace and to providing equal employment opportunities in all aspects of employment. We do not discriminate on the basis of race, color, religion, sex (including pregnancy), national origin, ancestry, age, disability, genetic information, marital status, veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by applicable law.

Applicants must be authorized to work in the country in which they apply and will be required to provide proof of employment eligibility as a condition of hire. 

If you need accommodations during the application process, please let us know.

Skills Required

  • Hands-on experience designing, building, or operating cloud storage services (cloud providers, hyperscalers, storage vendors, or advanced tech companies)
  • Deep understanding of storage systems internals: block, file, object architectures, distributed storage, replication, erasure coding, consistency and durability models, and read/write data paths
  • Hands-on familiarity with NVMe/NVMe-oF, iSCSI, POSIX, NFS, parallel file systems (Lustre, GPFS/Spectrum Scale, BeeGFS), Ceph, S3-compatible object storage, and RDMA-based data paths
  • Ability to reason quantitatively about IOPS, throughput, latency (including tail latency), and cost per usable TB and interpret benchmarks
  • Strong technical expertise in at least two areas: block storage/virtualization, file/parallel file systems, distributed storage internals, storage performance engineering/benchmarking, object storage at scale
  • Proven track record delivering complex technical initiatives across multiple teams or stakeholders
  • Willingness to learn and grow into product management (product management experience not required)
  • Technical leadership experience
  • Experience with storage for AI/ML training pipelines and large-scale HPC (checkpointing, data loading at scale)
  • Experience creating technical documentation, guides, tutorials, or reference architectures for storage products
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Amsterdam
473 Employees

What We Do

Cloud platform specifically designed to train AI models

Similar Jobs

Tulip Logo Tulip

Marketing Manager

Enterprise Web • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
27 Locations
310 Employees

Mondelēz International Logo Mondelēz International

Change Manager o9 MEU, Demand Planning

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote or Hybrid
9 Locations
90000 Employees

Mondelēz International Logo Mondelēz International

Global Process Owner - Accounts Payable (fixed-term contract)

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote or Hybrid
4 Locations
90000 Employees

Pfizer Logo Pfizer

Director, AI Engineering--Clinical Development and Operations (CD&O)

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
In-Office or Remote
31 Locations
121990 Employees
177K-294K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account