Lead Infrastructure Engineer

Reposted 5 Days Ago
Be an Early Applicant
Hiring Remotely in UK
Remote
Senior level
Artificial Intelligence • Blockchain • Cloud • Software
The AI Factory. Accelerating the Future.
The Role
Lead the design, operation, and evolution of OpenStack and Kubernetes for GPU workloads, managing a team of engineers and ensuring platform reliability.
Summary Generated by Built In

Lead Infrastructure Engineer (OpenStack)

Location: UK (Remote)

Department: Infrastructure

Reporting to: Head of Infrastructure

ABOUT NEXGEN CLOUD:

NexGen Cloud is the company behind Hyperstack, a full-stack AI cloud serving tens of thousands of customers from AI researchers to enterprises running the world's most compute-intensive workloads. We deliver on-demand and private GPU infrastructure to teams who treat performance as a requirement, not a feature.

We're a tight-knit, fast-moving team working at the cutting edge of AI cloud infrastructure. We practice what we preach, equipping our people with AI at every level so we can solve harder problems, ship faster, and keep raising the bar for what enterprise GPU infrastructure looks like.

THE ROLE: Lead Infrastructure Engineer (OpenStack)

This role exists because we are scaling Hyperstack rapidly to meet global demand for GPU-powered infrastructure across AI, ML, and HPC workloads. As our OpenStack and Kubernetes environments grow in complexity, we need strong technical leadership to keep everything stable, scalable, and moving forward. You'll have direct ownership over the performance, reliability, and evolution of our OpenStack and Kubernetes platforms in-region, while leading a small, high-calibre team of engineers.

This is a role for someone who leads from the front — hands-on when needed, but equally comfortable setting direction, making decisions, and holding the bar high.

WHAT YOU'LL BE DOING:

Rather than a long checklist, here's what success in this role looks like:

  • Own and drive the design, deployment, and operation of OpenStack and Kubernetes clusters optimised for GPU workloads
  • Lead and develop a team of 4–5 infrastructure engineers, setting clear direction and standards
  • Build and improve infrastructure through automation — IaC, GitOps, and CI/CD pipelines
  • Ensure platform reliability through strong monitoring, observability, and incident management practices
  • Collaborate closely with DevOps, Product, and Support teams to align infrastructure with real-world customer needs
  • Take ownership of operational governance including incident, problem, and change management
  • Identify opportunities to simplify, standardise, and scale systems as the platform grows
  • Communicate clearly with leadership on platform performance, risks, and improvements
ABOUT YOU:

We're more interested in how you think and work than in a perfect CV. You'll likely bring a combination of the following:

Essential
  • Strong hands-on experience operating OpenStack in production environments
  • Experience running production-grade Kubernetes clusters — ideally bare-metal or private cloud
  • Solid Linux, networking, and storage fundamentals with a pragmatic troubleshooting approach
  • Experience with infrastructure automation, CI/CD, and Git-based workflows
  • Proven leadership or mentoring experience within infrastructure or platform teams
  • Experience managing incidents and coordinating response during critical service events
  • Strong communication skills, particularly translating technical issues for non-technical stakeholders
Nice to Have
  • Experience integrating Kubernetes with OpenStack
  • Exposure to GPU infrastructure, HPC, or large-scale compute platforms
  • Familiarity with advanced networking or cloud-native ecosystems
  • Contributions to open-source or cloud-native communities
WHAT WE OFFER:
  • Competitive salary and annual discretionary bonus scheme
  • Employee wellbeing benefits
  • 25 days of holiday, plus public holidays
  • Flexible working arrangements (remote or hybrid, depending on role and location)
  • Real ownership and autonomy, with the trust to take initiative and experiment
  • The opportunity to make a visible, meaningful impact as we scale
  • Clear career progression and growth opportunities in a fast-growing company
  • A collaborative, international culture built on trust, transparency, and ownership
  • The chance to help shape NexGen Cloud's team, culture, and future alongside ambitious, mission-driven colleagues
MORE INFORMATION

Head over to our NexGen Cloud careers page to view current openings and follow us on LinkedIn and X to learn more about our journey, newest releases and hear exciting news in the neocloud space.

 
 
 
 
 

Skills Required

  • Strong hands-on experience operating OpenStack in production environments
  • Experience running production-grade Kubernetes clusters
  • Solid Linux, networking, and storage fundamentals
  • Experience with infrastructure automation, CI/CD, and Git-based workflows
  • Proven leadership or mentoring experience within infrastructure/platform teams
  • Experience managing incidents and coordinating response during critical service events
  • Strong communication skills to non-technical stakeholders
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: London
92 Employees
Year Founded: 2020

What We Do

NexGen Cloud, founded in 2020, is a global leader in sustainable AI Cloud solutions, offering Data Sovereignty to its clients. Powered by 100% renewable energy, NexGen Cloud's expertise is rooted in the deployment and management of advanced AI infrastructure and cloud services. NexGen Cloud’s solutions are tailored to meet the diverse needs of AI enterprises and practitioners through a suite of specialised products and services, including the AI Supercloud for large-scale bespoke environments, and Hyperstack, a service for on-demand enterprise GPU access.

Similar Jobs

ServiceNow Logo ServiceNow

Sr Enterprise Account Exec - Utilities

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Staines, Surrey, England, GBR
29000 Employees

ServiceNow Logo ServiceNow

Architect

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Staines, Surrey, England, GBR
29000 Employees

Zscaler Logo Zscaler

Director, Solutions Consulting - UKI

Cloud • Information Technology • Security • Software • Cybersecurity
Easy Apply
Remote or Hybrid
UK
8697 Employees

Teya Logo Teya

Business Development Manager

Fintech • Payments • Financial Services
In-Office or Remote
Birmingham, West Midlands, England, GBR
1000 Employees

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Software
US
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account