Site Reliability Engineer

Posted 8 Days Ago
Be an Early Applicant
Hiring Remotely in Sandton, City of Johannesburg, Gauteng, ZAF
In-Office or Remote
Mid level
Artificial Intelligence • Machine Learning • Software • Analytics
The Role
As a Site Reliability Engineer, you'll ensure production environment reliability on GCP, implement monitoring and automation, and troubleshoot incidents.
Summary Generated by Built In

CloudSmiths is looking for a proactive Intermediate Site Reliability Engineer (GCP) to join our Managed Services team.

In this role, you will be a key player in ensuring the reliability, scalability, and performance of production environments for our diverse range of clients. You will bridge the gap between development and operations by implementing robust monitoring, automation, and DevOps practices specifically within the Google Cloud Platform.

Key Responsibilities:

  • Act as a technical resource for SRE practices on GCP, ensuring consistent uptime and performance across various environments.
    Champion DevOps best practices by applying Infrastructure as Code (IaC) principles using tools like Terraform, Ansible, or Deployment Manager.
  • Drive monitoring initiatives using tools such as Grafana, Prometheus, and Stackdriver to ensure deep visibility into system health.
  • Design, maintain, and optimize CI/CD pipelines using GCP-native tools and industry standards.
  • Troubleshoot complex production incidents, perform root cause analysis, and foster a proactive, blameless post-mortem culture.
  • Manage your workload effectively while maintaining clear communication with internal and external stakeholders regarding project progress.

Requirements:

  • 3–5+ years of hands-on experience in a Site Reliability, DevOps, or Cloud Engineering role.
  • Strong experience working directly with GCP infrastructure, services, and security/cost optimization.Containerization: Proven experience with Kubernetes (GKE), Docker, and container orchestration at scale.

Technical Skills:

  • Expertise in UNIX/Linux administration.
  • Strong scripting skills in Python, Bash, or Shell.
  • Familiarity with configuration management tools like Chef, Puppet, or Ansible.
  • A Degree or Diploma in IT, Computer Science, or equivalent experience.
  • Google Cloud Professional certifications (DevOps Engineer or Cloud Architect) are highly advantageous.

Why Join Us?

We are 100% remote. Enjoy the flexibility of working from anywhere. 

You’ll drive monitoring, observability, and CI/CD initiatives using the latest GCP-native tools.

We value innovative thinkers who aren't afraid to challenge the status quo to drive excellence

Skills Required

  • 3-5+ years of hands-on experience in a Site Reliability, DevOps, or Cloud Engineering role
  • Strong experience working directly with GCP infrastructure, services, and security/cost optimization
  • Proven experience with Kubernetes (GKE), Docker, and container orchestration at scale
  • Expertise in UNIX/Linux administration
  • Strong scripting skills in Python, Bash, or Shell
  • Familiarity with configuration management tools like Chef, Puppet, or Ansible
  • A Degree or Diploma in IT, Computer Science, or equivalent experience
  • Google Cloud Professional certifications (DevOps Engineer or Cloud Architect) are highly advantageous
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Gauteng
128 Employees
Year Founded: 2014

What We Do

CloudSmiths is a technology consultancy specialising in data analytics, machine learning, software development, AI and business reporting in the cloud.

Similar Jobs

Circle (circle.so) Logo Circle (circle.so)

Senior Site Reliability Engineer

Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
Easy Apply
Remote
31 Locations
250 Employees
130K-140K Annually

Inspired Testing Logo Inspired Testing

Devops Engineer

Software • Consulting
Remote
South Africa
238 Employees

Deimos Logo Deimos

Site Reliability Engineer

Information Technology • Software
In-Office or Remote
Johannesburg, City of Johannesburg, Gauteng, ZAF
88 Employees

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account