Kubernetes DevOps/Platform Engineer

Posted 3 Days Ago
Be an Early Applicant
Hiring Remotely in Poznań, Województwo wielkopolskie
In-Office or Remote
Senior level
Software
The Role
As a Kubernetes DevOps/Platform Engineer, you will integrate the k0rdent-ai platform, manage Kubernetes clusters, automate pipeline creation, and provide operational support for complex system issues.
Summary Generated by Built In
Company Description

About Mirantis

Mirantis is the Kubernetes-native AI infrastructure company, enabling organizations to build and operate scalable, secure, and sovereign infrastructure for modern AI, machine learning, and data-intensive applications. By combining open source innovation with deep expertise in Kubernetes orchestration, Mirantis empowers platform engineering teams to deliver composable, production-ready developer platforms across any environment—on-premises, in the cloud, at the edge, or in sovereign data centers. As enterprises navigate the growing complexity of AI-driven workloads, Mirantis delivers the automation, GPU orchestration, and policy-driven control needed to manage infrastructure with confidence and agility. Committed to open standards and freedom from lock-in, Mirantis ensures that customers retain full control of their infrastructure strategy.

We serve global leaders including Adobe, PayPal, Liberty Mutual, Splunk, and Volkswagen.  Learn more at www.mirantis.com.

Job Description

We are looking for a skilled Kubernetes DevOps/Platform Engineer to drive end-to-end custom integration across our k0rdent-ai platform. You will collaborate with our engineering teams to design and deliver scalable GPU infrastructure orchestration based on Kubernetes stack.

Mirantis k0rdent AI empowers platform architects and MLOps engineers with open, composable infrastructure management for AI workloads and scalable inference application hosting at scale. 

It allows rapid deployment and execution of models alongside core application components and Mirantis-validated foundation services. Deployment can be performed on any cloud or infrastructure with zero lock-in, all built on Kubernetes standards.

It also enables automated observation, scaling, and management to ensure optimal performance, GPU utilization, and cost efficiency.

https://www.mirantis.com/software/mirantis-k0rdent-ai/

https://www.mirantis.com/resources/mirantis-ai-factory-reference-architecture/

Main Responsibilities

  • Build comprehensive automation pipelines for infrastructure provisioning and service deployments

  • Provide operational support by diagnosing, triaging, and resolving complex system issues

  • Design and deploy bare metal Kubernetes clusters for GPU/AI workloads in customer datacenters

  • Design and implement datacenter networking with Nvidia Bluefield 3 DPUs

  • Configure and troubleshoot Infiniband fabrics for high-performance GPU interconnects

  • Implement Metal3-based bare metal provisioning pipelines for physical server infrastructure

  • Configure and integrate Kubevirt for VM-based workloads on Kubernetes

  • Deploy and manage k0rdent (Cluster API-based) tooling for Kubernetes cluster lifecycle management for tenant clusters

  • Implement GPU workload onboarding systems for training and inference

  • Build automation using GitHub CI for product integration testing

  • Work directly with product teams to collect and drive the requirements for future features / fixes

Qualifications

​​​​​​​

Advanced Kubernetes expertise - Hands-on experience operating production clusters, including:

  • Deep understanding of Kubernetes architecture, controllers, and operators

  • Experience with Cluster API lifecycle management and upgrades

  • Troubleshooting complex multi-tenant environments

  • Custom Resource Definitions (CRDs) and operator patterns

Bare metal infrastructure management - Direct experience provisioning and managing physical servers, BIOS/firmware management, and hardware lifecycle automation

Virtualization technologies - Practical experience with KVM, LibVirt, and VM management on Linux

Software Defined Networking (SDN) - Understanding of overlay networks, network policies, and SDN controllers in Kubernetes and VM environments

Golang proficiency - Ability to read, debug, and contribute to Kubernetes operator code and controllers

CI/CD automation - Strong scripting and automation skills (Bash, Python, Ansible, Terraform) and experience building infrastructure-as-code pipelines

GitOps practices - Experience with declarative infrastructure management and Git-based workflows

Will be a strong Plus:

InfiniBand networking experience

Cluster API framework experience

Nvidia GPU infrastructure (NVLink)

SmartNIC experience (Nvidia Bluefield or similar)

OVN (Open Virtual Network) or other SDN platforms

Metal3 or similar baremetal provisioning tools

Storage networking (NVMe-oF, Ceph)

GitHub Actions/CI or similar automation platforms

Additional Information

What does Mirantis offer you?
- Work with an established Silicon Valley leader in the cloud infrastructure industry;
- Work with exceptionally passionate, talented and engaging colleagues, helping Fortune 500 and Global 2000 customers implement next-generation cloud technologies;
- Be a part of cutting-edge, open-source innovation;
- Thrive in the high-energy environment of a young company where openness, collaboration, risk-taking, and continuous growth are valued;
- Professional development and training;
- Attend conferences and working groups;
- Company outings, happy hours, hackathons, and tech talks;
- Receive a competitive compensation package with a strong benefits plan.

It is understood that Mirantis, Inc. may use automated decision-making technology (ADMT) for specific employment-related decisions. Opting out of ADMT use is requested for decisions about evaluation and review connected with the specific employment decision for the position applied for. You also have the right to appeal any decisions made by ADMT by sending your request to [email protected]

By submitting your resume, you consent to the processing and storage of your personal data in accordance with applicable data protection laws, for the purposes of considering your application for current and future job opportunities.

We are a Leader for Container Management in G2 (#2 after AWS)!

Top Skills

Ansible
Bash
Ceph
Go
Gpu
Infiniband
Kubernetes
Kvm
Libvirt
Metal3
Nvme
Python
Sdn
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Campbell, CA
729 Employees
Year Founded: 1999

What We Do

We are dedicated to helping organizations increase developer productivity and ship code faster on public and private clouds. We provide a ZeroOps experience to remove the stress of managing cloud native infrastructure by combining software and automation tools with our cloud native expertise to deliver the industry's leading secure cloud platforms. Our capabilities allow us to provide a secure and reliable cloud native platform that includes validated FIPS-140-2 Encryption and DISA STIG ready capabilities.


Who do we serve?
We serve a wide range of industries, building on our extensive customer experience to provide distinct value in specific verticals including Financial Services, Government & Education, Healthcare, Manufacturing, and Telecommunications.

Mirantis serves many of the world’s leading enterprises, including Adobe, DocuSign, Inmarsat, PayPal, Reliance Jio, Societe Generale, Splunk, and S&P Global. Learn more at www.mirantis.com.

Similar Jobs

MacPaw Logo MacPaw

Director of Data & Analytics

Information Technology • Security • Software • Cybersecurity • App development • Data Privacy
Remote or Hybrid
28 Locations

Atlassian Logo Atlassian

Operations Specialist

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
Poland

MacPaw Logo MacPaw

Chief Revenue Officer

Information Technology • Security • Software • Cybersecurity • App development • Data Privacy
Remote or Hybrid
28 Locations

GitLab Logo GitLab

Senior Manager, Talent Development

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
29 Locations
112K-240K Annually

Similar Companies Hiring

Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account