Site Reliability Engineer

Posted 3 Days Ago
Hiring Remotely in Philippines
Remote
Mid level
Software
The Role
The Site Reliability Engineer ensures the reliability and scalability of CloudBlue's SaaS platforms, focusing on monitoring, incident response, and automation. They collaborate with teams to improve system stability and performance, conducting capacity planning and managing operational processes.

Position Summary: 

With team members and customers in 39 countries, HostPapa is one of the fastest-growing web hosting companies and offers a wide range of products. At our core, we provide individuals and small and medium-sized businesses with tools and services critical to their online success, including a Website Builder service that makes website creation easy for anyone. Our award-winning customer support, email, and cloud-based solutions are tailored to each user's needs and keep HostPapa at the cutting edge of the web hosting industry by putting our customers first.

This role focuses on CloudBlue, a HostPapa business that powers cloud commerce for many of the world’s largest service providers, including major Telcos, distributors, and MSPs. CloudBlue enables partners to monetize and manage cloud services and subscriptions at scale, combining the agility of a high-growth business with the backing of a global organization.

As the Site Reliability Engineer, you will help ensure the reliability, scalability, and observability of CloudBlue’s multi-tenant SaaS platforms used by service providers worldwide. You will focus on improving system stability and performance through monitoring, high availability, and incident response, while working closely with DevOps, Platform, and Engineering teams to build and operate resilient production systems.

What you’ll do

  • Define and implement SLIs, SLOs, and error budgets for critical CloudBlue services to ensure reliability and performance
  • Influence system architecture with a strong focus on reliability, scalability, and operability, designing systems for fault tolerance, graceful degradation, and self-healing
  • Reduce operational toil by identifying opportunities for automation and process improvement
  • Design and operate CloudBlue’s observability stack across metrics, logs, and traces using tools such as Datadog, Grafana, and Elastic Stack
  • Develop actionable alerting strategies and dashboards that provide clear insight into platform and business health
  • Design and maintain high-availability architectures, implementing redundancy, failover, and disaster recovery strategies across regions and availability zones
  • Conduct capacity planning, load testing, and performance optimization to ensure platform stability and scalability
  • Act as a senior responder during production incidents, leading incident coordination, communication, and service restoration
  • Own blameless postmortems and drive improvements that reduce incident frequency, MTTR, and customer impact
  • Improve reliability of Kubernetes-based platforms through health checks, autoscaling strategies, rollout safety, and resilience testing
  • Partner with engineering and DevOps teams to improve deployment safety, rollback strategies, and platform reliability
  • Maintain runbooks and operational documentation, and promote SRE best practices across engineering teams
  • Support other tasks or projects as assigned to meet team and business needs

About you

  • 3+ years of experience as an SRE, DevOps Engineer, or Production Engineer, with strong ownership of production systems
  • Proven experience operating highly available, enterprise-grade, multi-tenant SaaS platforms
  • Hands-on experience with observability and monitoring tools such as Datadog, Grafana, and Elasticsearch/Kibana
  • Solid understanding of Linux, networking, and distributed systems fundamentals
  • Experience working with containerized environments such as Docker and Kubernetes
  • Strong scripting and automation skills using Python and/or Bash
  • Experience participating in on-call rotations and incident response in production environments
  • Strong written and spoken English
  • Experience defining SLIs/SLOs and managing error budgets at scale will be considered a plus
  • Exposure to hyperscale or service-provider-grade platforms is an advantage
  • Cloud experience, preferably with Azure; experience with AWS and/or GCP will also be valued
  • Experience working with hybrid or on-premises integrations is beneficial
  • Familiarity with chaos engineering and resilience testing will be considered an asset

What We Offer:

  • Work from anywhere - this is a remote opportunity
  • A competitive salary that values you and your unique skill sets
  • Career advancement & professional development opportunities to help you reach your full potential
  • Flexible work arrangements to support work/life balance

About Us:

At HostPapa, we’ve been committed to providing a complete array of enterprise-grade cloud services to every business owner since 2006. These services, traditionally out of reach for smaller businesses, are offered in a one-stop shop, making it quick and easy for customers to select the services they need to grow. We back these offerings with 24/7 award-winning customer support in four languages.

Our HostPapa team values diversity and inclusion. We have a friendly company culture built on trust and respect. Having brought several acquired companies into our product portfolio, we’re growing at an incredible rate and have ample opportunities for career growth.

Come join our talented team of enthusiastic, hard-working, passionate, driven people engaged in meaningful, innovative work. We can’t wait to meet you!

HostPapa is an equal-opportunity employer committed to diversity and inclusion. As a multicultural organization, we encourage individual achievement and recognize the strength of our diverse team. 

HostPapa is committed to providing accommodations for people with disabilities. If you require accommodation, please let us know, and we will work with you to meet your needs. Accommodation may be provided in all parts of the hiring process.

It is anticipated that this position will be performed outside of Ontario.
 

Top Skills

AWS
Azure
Bash
Chaos Engineering
Datadog
Distributed Systems
Docker
Elastic Stack
GCP
Grafana
Hybrid Integrations
Kubernetes
Linux
Python

The Company
HQ: Burlington, Ontario
247 Employees
Year Founded: 2006

What We Do

About HostPapa

HostPapa is a privately owned company headquartered in Burlington, Ontario, with locations in 11 other countries around the world. At HostPapa, we consider every one of our customers to be a part of our family. That's why our motto is "Let Papa take care of you!"

We understand that our customers' websites are important and that they need to be able to count on us to ensure that their service is not interrupted.

We have established a solid foundation to offer hosting solutions and cloud services for small and medium-sized businesses that are reliable, easy-to-use, and customer service-oriented, all for a low cost.
At HostPapa, we value our customers and recognize their need for outstanding customer service. We are not satisfied until our customers are!

With HostPapa you get:

Feature-rich hosting packages
Money-back guarantee
FREE domain registration
Uptime guarantees
Online knowledgebase / support
Help with using the tools / getting set up
Ecommerce capabilities
Free apps

With HostPapa you can depend on:

Dedicated customer service
Quality equipment
Maximum guaranteed uptime
Highly functional tools for administration
The leading feature set available
Secure and reliable backups
A solid and honest business partner

Careers at HostPapa

A career at HostPapa is fun, laid-back, rewarding, and challenging. We offer continuous learning and opportunities for career advancement. If you think you have the skill set to work with us and love to work in a fast-paced environment, then HostPapa is definitely the place for you!

It is our goal at HostPapa to help our employees advance in their careers. By adopting an ongoing, hands-on learning environment, our employees are continuously growing and expanding their knowledge. We believe that employee education and training are essential, not only for our staff but also for our customers.
