Cloud Infrastructure Engineer

Posted 7 Days Ago
Be an Early Applicant
London, Greater London, England, GBR
In-Office
60K-65K Annually
Mid level
Edtech
The Role
Ensure availability, reliability and scalability of AWS-based services. Manage IaC, CI/CD for containerised and serverless apps, monitor with Prometheus/Grafana/CloudWatch, respond to incidents, drive cost optimisation and platform improvements.
Summary Generated by Built In

What we do

At Perlego, we are working hard to make education accessible to all. In this digital age, we believe that anyone should be able to learn anything at any time. Knowledge should be more accessible, not locked behind sky-high price tags.

Over the past 9 years, our goal has been to support students across the UK & Europe to access quality books. Our ambition is to expand our support to students globally, specifically looking at the US, and build a product that goes beyond the book, a platform that helps students study smarter and more effectively.

What we're looking for:

We are looking for an experienced Cloud Infrastructure Engineer with a strong background in AWS services and monitoring tools. In this role, you will ensure the availability and reliability of our services. You will be integral to swiftly addressing issues, resolving incidents independently, and thriving in a fast-paced environment.

What you’ll do:

As a Cloud Infrastructure Engineer, your main focus will be to ensure our services remain highly available and performant. Key responsibilities include:

Cloud Infrastructure Management:

  • Manage and support AWS infrastructure, focusing on scalability, security, and reliability.
  • Handle deployments, managing CI/CD pipelines for both containerised (Docker/ECS) and serverless (AWS Lambda) applications.
  • Own infrastructure as code — provisioning resources declaratively so environments are reproducible, version-controlled, and safe to change.
  • Ensure effective backup, recovery, and disaster recovery strategies to minimise downtime.
  • Manage operational and analytical data stores (Aurora MySQL, DynamoDB)
  • Drive cost optimization across the infrastructure — monitoring spend, eliminating waste, and rightsizing resources to balance performance and cost.

Monitoring & Incident Management:

  • Monitor and manage platform activity using tools like PrometheusGrafana, or AWS CloudWatch
  • Respond quickly to alerts and incidents, independently resolving issues and ensuring service uptime.
  • Conduct post-incident reviews and help improve system resiliency through automation and monitoring enhancements.
  • Review network activity with AWS Security Hub and Cloudflare

Collaboration & Communication:

  • Collaborate with cross-functional teams to implement platform improvements.
  • Work independently and make swift decisions when managing service incidents outside core business hours.
  • Assist in platform security, ensuring adherence to best practices for cloud security and compliance.

Continuous Improvement:

  • Automate manual processes to reduce human error and improve efficiency.
  • Continuously enhance monitoring systems, ensuring robust early detection and resolution capabilities.
  • Identify potential performance bottlenecks and contribute to overall platform optimisation.

Requirements

This role is ideal for you if you possess:

  • Experience in Cloud Infrastructure Engineering, DevOps, or a similar field.
  • Strong experience with AWS services and containerised applications
  • Strong experience operating operational data stores (Aurora MySQL, DynamoDB).
  • Expertise in using monitoring tools (e.g. Prometheus, Grafana, CloudWatch) for real-time platform performance insights.
  • Strong understanding of network security and Cloudflare , VPC and networking fundamentals, with a clear grasp of how traffic and infrastructure components flow together end to end.
  • Hands-on experience with CI/CD pipeline management for deploying containerised (Docker) and serverless applications, preferably with GitHub Actions
  • Proficiency in Linux-based operating systems and shell scripting.
  • Familiarity with Infrastructure as Code tools (Terraform, CloudFormation).
  • Experience with incident management, troubleshooting, and platform recovery in high-pressure environments.
  • Strong communication skills with a proven ability to work both independently and collaboratively

⭐️ It’s a plus if you have:

  • Experience working in a global, distributed team providing off-hours support.
  • Previous experience with SecOps and cloud security best practices.
  • Familiarity with scaling highly available systems in a fast-paced, growth-oriented environment.

Benefits

✨ Compensation

The salary available for this role is £60,000-65,000 dependent upon experience.

🏠 Flexible

We operate a flexible hybrid working environment. However we would be open to a remote role for the right candidate.

🧠 L&D Budget

We value continuous learning and you will have a personal L&D budget for online courses, subscriptions, or books not on Perlego.

🤓 Learning Time

All employees have dedicated Learning Time to focus on new skills, projects, or interests outside their day-to-day role, including Hackathons.

🌴 Work-Life Balance

22 days annual leave + 1 additional day per year of service

❄️ Office Reset

The days between Boxing Day and New Year off, additional to annual leave.

🛐 Flexi Bank Holidays

Flexibility to swap local bank holidays for religious or cultural days.

🗺 Work from overseas

Flexible short-period remote working overseas, as long as you remain a UK tax resident.

🏖 Sabbatical

1-month unpaid sabbatical after 3 years; 1-month paid sabbatical after 5 years.

💛 Personal Days

1 additional day per year for life events.

🍏 Health & Wellbeing

Private medical, optical and dental insurance via Vitality.

🚲 Cycle to Work Scheme

🎉 Social

Regular social events and activities for everyone.

🍼 Family time

Competitive matched parental leave and phased return to work.

👼 Workplace Nursery Benefit

Skills Required

  • Experience in Cloud Infrastructure Engineering, DevOps, or similar field.
  • Strong experience with AWS services and containerised applications.
  • Experience operating operational data stores (Aurora MySQL, DynamoDB).
  • Expertise using monitoring tools (Prometheus, Grafana, CloudWatch).
  • Strong understanding of network security, Cloudflare, VPC and networking fundamentals.
  • Hands-on experience with CI/CD pipeline management for deploying containerised and serverless applications.
  • Familiarity with GitHub Actions for CI/CD (preferable).
  • Proficiency in Linux-based operating systems and shell scripting.
  • Familiarity with Infrastructure as Code tools (Terraform, CloudFormation).
  • Experience with incident management, troubleshooting, and platform recovery under pressure.
  • Strong communication skills and ability to work independently and collaboratively.
  • Experience working in a global, distributed team providing off-hours support.
  • Previous experience with SecOps and cloud security best practices.
  • Familiarity with scaling highly available systems in fast-paced, growth-oriented environments.
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: London
136 Employees
Year Founded: 2017

What We Do

Perlego was born to provide an affordable (and sustainable) textbook solution for learners around the world, by partnering with publishers and removing the costs of print, distribution, and retail markup.

Similar Jobs

Ordnance Survey Logo Ordnance Survey

Infrastructure Engineer

Information Technology • Consulting
In-Office
Southampton, Hampshire, England, GBR
1421 Employees
44K-51K Annually

Lloyds Banking Group Logo Lloyds Banking Group

Infrastructure Engineer

Fintech • Software • Financial Services
In-Office
2 Locations
60287 Employees
73K-81K Annually

n8n Logo n8n

Senior Cloud Engineer

Artificial Intelligence • Software • Automation
In-Office or Remote
34 Locations
61 Employees
In-Office
Solihull, Birmingham, West Midlands, England, GBR
1675 Employees

Similar Companies Hiring

ReUp Education Thumbnail
Social Impact • Edtech
Austin, TX
180 Employees
Learneo Thumbnail
Software • Machine Learning • Edtech • Artificial Intelligence
NL
397 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account