Senior Infrastructure Engineer (Openstack)

Posted 3 Days Ago
Be an Early Applicant
Hiring Remotely in GBR
Remote
Senior level
Artificial Intelligence • Blockchain • Cloud • Software
The AI Factory. Accelerating the Future.
The Role
The Senior Infrastructure Engineer will design and deploy GPU-optimized OpenStack and Kubernetes clusters, automate deployment, manage GPU workloads, and ensure performance and security of the infrastructure.
Summary Generated by Built In

NexGen Cloud is a rapidly growing IaaS company focused on providing innovative cloud solutions and infrastructure services. Our GPU cloud infrastructure solutions accelerate development in industries such as Artificial Intelligence & Machine Learning, VFX & Rendering, Data Science & IoT, and Computer Aided Engineering & MDO.

We are dedicated to helping our clients navigate the complexities of the digital world and achieve success through cutting-edge, scalable, secure and affordable solutions.

At the company's heart stands a group of very talented, experienced, and motivated individuals who want to make a positive change and a lasting impact on the tech world.

Position Summary:

We’re looking for a Senior Infrastructure Engineer with deep OpenStack and strong Kubernetes expertise to join our Infrastructure Engineering team. You’ll play a key role in shaping and scaling our GPUaaS offering, combining the flexibility of OpenStack with the automation and developer-centric capabilities of Kubernetes.

In this role, you'll design GPU-optimized Kubernetes clusters, build multi-tenant GPU infrastructure, and contribute to automation, observability, and CI/CD tooling across the platform.

Key Responsibilities:

    • Cloud & Container Platform Design
      Architect and deploy OpenStack and Kubernetes clusters designed for GPU scheduling, high performance, and multi-tenant workloads.
    • Infrastructure Automation
      Automate deployment pipelines for cloud infrastructure using Terraform, Ansible, Helm, and Kubernetes Operators.
    • GPU Workload Enablement
      Build and manage GPU-ready container runtimes, NVIDIA device plugins, and Kubernetes-native GPU provisioning frameworks.
    • Cluster Operations & Observability
      Ensure high availability and performance of OpenStack and Kubernetes clusters using tools such as PrometheusGrafanaLoki, and Thanos.
    • Security, Policy & Governance
      Implement secure namespace isolation, RBAC, and network policies across OpenStack and Kubernetes layers.
    • Collaboration & Mentorship
      Work cross-functionally with DevOps, AI, Support, and Product teams to align infrastructure services with platform goals. Provide guidance on Kubernetes best practices.

Qualifications and Skills:

  • 5+ years of experience with OpenStack in production environments.
  • 3+ years of experience managing production-grade Kubernetes clusters, including bare-metal or private cloud environments.
  • Strong hands-on expertise with:
    • Kubernetes operators, Helm, and custom resource definitions (CRDs)
    • GPU orchestration in Kubernetes using NVIDIA tools
    • Multi-cluster or federated Kubernetes
  • Proficiency in Linux, Ceph, networking (Calico/Cilium), and infrastructure scripting (Python, Bash).
  • Strong knowledge of cloud-native security, policy frameworks, and service meshes.
  • Experience with CI/CD pipelinesGitOps, and infrastructure-as-code tooling (Terraform, Ansible, ArgoCD).

Good to have:

    • Experience integrating Kubernetes with OpenStack.
    • Prior contributions to Kubernetes SIGs or CNCF projects.
    • Knowledge of GPU metering, billing, and quota enforcement.
    • Familiarity with HPC environments, InfiniBand/ROCEv2 networking, or Slurm integration.

What We Offer:

  • Competitive salary
  • 100% home-office 
  • Full-time permanent contract.
  • Opportunity to work with a diverse team of talented professionals who are passionate about technology and innovation.
  • A collaborative and supportive work environment that encourages professional growth and development.
  • Exposure to cutting-edge technologies and the opportunity to make a significant impact on the future of cloud computing.
  • Possibility to participate on international events

We encourage applications from candidates of all backgrounds and experiences. Our commitment to diversity and inclusion drives our success as a company and reflects our dedication to fostering a diverse and innovative workforce.

Join our team and become a part of the NexGen Cloud Team, where innovation, collaboration, and growth are at the heart of everything we do. If you are a passionate, talented, and motivated individual looking to make a difference, apply now!


Top Skills

Ansible
Bash
Calico
Ceph
Cilium
Grafana
Helm
Kubernetes
Linux
Loki
Nvidia Device Plugins
Openstack
Prometheus
Python
Terraform
Thanos
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: London
92 Employees
Year Founded: 2020

What We Do

NexGen Cloud, founded in 2020, is a global leader in sustainable AI Cloud solutions, offering Data Sovereignty to its clients.

Powered by 100% renewable energy, NexGen Cloud's expertise is rooted in the deployment and management of advanced AI infrastructure and cloud services.

NexGen Cloud’s solutions are tailored to meet the diverse needs of AI enterprises and practitioners through a suite of specialised products and services, including the AI Supercloud for large-scale bespoke environments, and Hyperstack, a service for on-demand enterprise GPU access.

Similar Jobs

Atlassian Logo Atlassian

Consultant

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
London, Greater London, England, GBR
11000 Employees

Atlassian Logo Atlassian

Senior Enterprise Deal Manager

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
London, Greater London, England, GBR
11000 Employees

Atlassian Logo Atlassian

Senior Principal - Customer Success, Strategic Accounts

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
London, Greater London, England, GBR
11000 Employees

Atlassian Logo Atlassian

Account Executive

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
London, Greater London, England, GBR
11000 Employees

Similar Companies Hiring

PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Rain Thumbnail
Web3 • Payments • Infrastructure as a Service (IaaS) • Fintech • Financial Services • Cryptocurrency • Blockchain
New York, NY
40 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account