Principal HPC Network Engineer (remote in the EU)

Hiring Remotely in Barcelona, Cataluña, ESP
In-Office or Remote
Senior level
Software
The Role
Design, deploy, and maintain high-performance HPC network fabrics (InfiniBand/Ethernet); troubleshoot routing, latency, and throughput; manage Fortinet security appliances; perform performance tuning, capacity planning, automation scripting, documentation, and on-call escalation for HPC/AI infrastructure.
Company Description

Mirantis is the Kubernetes-native AI infrastructure company, enabling organizations to build and operate scalable, secure, and sovereign infrastructure for modern AI, machine learning, and data-intensive applications. By combining open source innovation with deep expertise in Kubernetes orchestration, Mirantis empowers platform engineering teams to deliver composable, production-ready developer platforms across any environment—on-premises, in the cloud, at the edge, or in sovereign data centers. As enterprises navigate the growing complexity of AI-driven workloads, Mirantis delivers the automation, GPU orchestration, and policy-driven control needed to manage infrastructure with confidence and agility. Committed to open standards and freedom from lock-in, Mirantis ensures that customers retain full control of their infrastructure strategy.

Mirantis serves many of the world’s leading enterprises, including Adobe, DocuSign, Liberty Mutual, PayPal, Reliance Jio, Societe Generale, Splunk, and Volkswagen. Learn more at www.mirantis.com.

Job Description

Role Overview:
We are seeking a highly skilled Senior HPC Networking Engineer to design, deploy, manage, and troubleshoot high-performance networking environments. The ideal candidate will have deep expertise in InfiniBand technologies, strong general networking knowledge, and hands-on experience with Fortinet solutions. You will play a critical role in ensuring the performance, reliability, and scalability of HPC infrastructure.

Key Responsibilities:

  • Design, deploy, and maintain high-performance network infrastructures for HPC environments, with a strong focus on InfiniBand fabrics.

  • Troubleshoot complex network issues across InfiniBand and Ethernet environments, ensuring minimal downtime and optimal performance.

  • Manage and optimize InfiniBand components, including switches, HCAs, subnet managers, and fabric configurations.

  • Perform performance tuning, monitoring, and capacity planning for HPC networking systems.

  • Implement and maintain network security using Fortinet solutions (FortiGate, FortiManager, FortiAnalyzer).

  • Diagnose and resolve issues related to routing, switching, latency, and throughput across hybrid network environments.

  • Collaborate with compute, storage, and platform teams to support HPC workloads and cluster operations.

  • Develop and maintain documentation for network architecture, configurations, and operational procedures.

  • Participate in on-call rotations and provide escalation support for critical incidents.

  • Lead or contribute to network upgrades, migrations, and new deployments.

Qualifications

Required:

  • 5+ years of experience in network engineering, with a focus on HPC or data center environments.

  • Strong hands-on experience with InfiniBand technologies (e.g., Mellanox/NVIDIA).

  • Solid understanding of networking fundamentals: TCP/IP, routing protocols (BGP, OSPF), VLANs, QoS, and network design.

  • Proven experience deploying and troubleshooting Fortinet solutions (FortiGate, FortiManager, VPNs, firewall policies).

  • Experience with network performance analysis and troubleshooting tools.

  • Familiarity with Linux systems and scripting for automation (e.g., Bash, Python).

  • Strong analytical and problem-solving skills.

Preferred:

  • Experience with large-scale HPC clusters or AI/ML infrastructure.

  • Knowledge of RDMA, MPI, and low-latency networking concepts.

  • Certifications such as FCSS/FCNSP (Fortinet), CCNP/CCIE, or equivalent.

  • Experience with automation and Infrastructure as Code tools (e.g., Ansible, Terraform).

Soft Skills:

  • Strong communication and collaboration skills.

  • Ability to work independently and handle complex technical challenges.

  • Detail-oriented with a proactive approach to problem-solving.

Additional Information

We offer the opportunity to:

  • Operate some of the most advanced AI infrastructure environments in production today.
  • Work with the latest NVIDIA GPU technologies, Kubernetes platforms, and high-performance networking environments.
  • Help define operational standards and reliability practices for next-generation AI infrastructure services.
  • Influence the adoption of AI-powered operational capabilities through k0rdent AI.
  • Work alongside highly skilled engineers solving complex infrastructure and platform challenges at scale.
  • Join a growing organisation investing heavily in AI infrastructure, platform services, and operational innovation.

We are a Leader for Container Management in G2 (#2 after AWS)!

The Company
HQ: Campbell, CA
729 Employees
Year Founded: 1999

What We Do

We are dedicated to helping organizations increase developer productivity and ship code faster on public and private clouds. We provide a ZeroOps experience to remove the stress of managing cloud native infrastructure by combining software and automation tools with our cloud native expertise to deliver the industry's leading secure cloud platforms. Our capabilities allow us to provide a secure and reliable cloud native platform that includes validated FIPS-140-2 Encryption and DISA STIG ready capabilities.

Who do we serve? We serve a wide range of industries, building on our extensive customer experience to provide distinct value in specific verticals including Financial Services, Government & Education, Healthcare, Manufacturing, and Telecommunications.

Mirantis serves many of the world’s leading enterprises, including Adobe, DocuSign, Inmarsat, PayPal, Reliance Jio, Societe Generale, Splunk, and S&P Global. Learn more at www.mirantis.com.
