Senior Cloud & AWS Support Engineer

Posted Yesterday
Be an Early Applicant
Hiring Remotely in Gauteng, ZAF
Remote
Senior level
Artificial Intelligence • Big Data • Software • Business Intelligence
Transforming businesses with innovative custom software development, AI and data engineering services.
The Role
Provide AWS-focused operational support and incident response for a client-embedded environment while building and automating cloud infrastructure. Own alert triage, on-call duties, runbooks, CI/CD and IaC (Terraform), networking, monitoring, security/compliance, cost optimisation, and collaborate with engineering teams. Mentor juniors, engage stakeholders, run post-incident reviews, and contribute to solution design and pre-sales.
Summary Generated by Built In

DVT is one of the top software development companies on the continent, consulting on cutting-edge applications for leading enterprises in South Africa and globally. We are committed to continuously developing our people, with a strong culture of learning, internal speaking, and sponsored technical events across the AWS ecosystem.

We are looking for a Senior Cloud & AWS Support Engineer to join our cloud team on a client-embedded engagement. This is a dual role: alongside building and maintaining cloud infrastructure and automation, you will provide day-to-day AWS operational support — owning incident, alert, and request triage, on-call response, and the operational health of the client's AWS estate. You sit within the client's Operations ("keeping the lights on") team, working closely with their platform and engineering teams to keep production stable while improving it.

DUTIES AND RESPONSIBILITIESAWS Support & Operations
  • Act as first responder for the client's AWS environment — triage, diagnose, and resolve incidents and service requests within agreed SLAs

  • Own alert handling across CloudWatch, GuardDuty, Security Hub, and AWS Health, including the email/notification alerting pipeline, routing, and noise reduction

  • Participate in the on-call rotation (incl. cross-timezone cover where client launches require it) and lead structured incident response and post-incident reviews

  • Manage the operational ticket queue (incident, request, problem, and change), maintaining clear status, ownership, and communication to client stakeholders

  • Build and maintain runbooks, playbooks, and knowledge-base articles to standardise response and enable faster, repeatable resolution

  • Drive problem management — identify recurring issues, perform root-cause analysis, and convert findings into permanent fixes and automation

  • Support patching, backup/restore verification, and routine operational maintenance under tagging- and SCP-based governance

Infrastructure & Automation
  • Design, implement, and maintain scalable CI/CD pipelines (e.g. GitHub Actions, AWS CodePipeline, GitLab CI) for automated testing, deployment, and provisioning

  • Manage infrastructure-as-code with Terraform (primary), CloudFormation and CDK — remote state, modular refactoring, and multi-environment deployments

  • Develop ephemeral feature environments for isolated testing (Terraform workspaces / Terragrunt) and automate provisioning, routing (Route 53, ALB/NLB), and teardown

  • Apply scheduled shut-downs to drive cost management; use tagging, Kubernetes taints/tolerations, and tag-driven patching schedules

  • Design and implement AWS networking — VPC architecture, security groups, NACLs, Transit Gateway, and hybrid connectivity

Deployment & Operations
  • Collaborate with software teams to integrate and deploy backend services (Java, .NET, Python, Node.js) and containerised applications

  • Implement blue/green or canary deployment strategies with safe, traceable rollouts and automated rollback

  • Integrate unit and functional/API testing (pytest, Postman/Newman, LocalStack) into the CI pipeline

  • Establish conventions for test execution, image tagging, versioning, and reusable CI/CD components

  • Manage production deployments, change windows, and release coordination across environment

Monitoring, Security & Compliance

  • Implement comprehensive monitoring, logging, and observability (CloudWatch, X-Ray, third-party APM)

  • Monitor and optimise system performance, deployment efficiency, resource utilisation, and cloud costs

  • Maintain security best practices — Well-Architected & OWASP, secrets management (Secrets Manager, Parameter Store), IAM policies, SAST/DAST scanning, and compliance controls

  • Configure alerting and incident-response workflows and lead post-incident reviews

  • Ensure compliance with relevant standards (ISO 27001, SOC 2, POPIA / GDPR) per client requirements

Leadership & Client Engagement

  • Provide technical leadership and mentorship to junior engineers and client development teams

  • Engage directly with client stakeholders to understand requirements, advise, and present solutions

  • Participate in architectural reviews, retrospectives, and planning to continuously improve tooling and process

  • Conduct training and create documentation to embed DevOps and operational best practices

  • Contribute to pre-sales — solution design, effort estimation, and technical proposals

Required Experience and Skills

  • 7+ years in Cloud DevOps, SRE, or AWS operational support, with strong CI/CD and infrastructure automation experience

  • Hands-on experience running AWS managed support / operations — incident management, alert triage, on-call, and SLA-bound resolution

  • Deep AWS proficiency: EC2, ECS/EKS, Lambda, S3, DynamoDB, RDS, VPC, Route 53, CloudFront, IAM (Identity Center), CloudWatch, X-Ray

  • Expert IaC (Terraform required; CloudFormation beneficial) — remote state, modules, multi-environment

  • Strong CI/CD platform experience (GitHub Actions, CodePipeline, GitLab CI, or Jenkins)

  • Advanced scripting in Bash or Python for automation and tooling

  • Containerisation and orchestration (Docker, Kubernetes/EKS, Helm beneficial)

  • Familiarity with AWS GuardDuty, Security Hub, and AWS Health alert handling and triage

  • Experience with ITSM / ticketing workflows and structured incident, problem, and change management

  • Monitoring/observability tooling (CloudWatch, Prometheus, Grafana, Datadog, or similar)

  • AWS networking — VPC design, security groups, load balancers, DNS, hybrid connectivity

  • Secrets management (Secrets Manager, Parameter Store, HashiCorp Vault)

  • DevSecOps practices — SCA/SAST/DAST, vulnerability and compliance automation

  • AWS cost optimisation, tagging strategies, and FinOps practices

  • Excellent communication; proven mentorship; consulting / professional-services delivery across concurrent client engagements

Preferred Qualifications

  • AWS certifications — Solutions Architect Professional, DevOps Engineer Professional, Security Specialty

  • AWS Certified SysOps Administrator (operations-focused)

  • Certified Kubernetes Administrator (CKA) / Application Developer (CKAD)

  • Multi-cloud (AWS, Azure, GCP) or hybrid cloud architecture experience

  • Experience in regulated industries (financial services, gaming/lottery, healthcare, government)

  • GitOps tooling (ArgoCD, Flux); serverless and event-driven design

  • Disaster recovery, backup strategies, and multi-region deployments

  • Configuration management (Ansible, Chef, Puppet); public speaking / open-source contributions

Minimum Requirements

  • Matric (Grade 12) certificate

  • Bachelor's degree in Computer Science, IT, Engineering, or related field (or equivalent practical experience)

  • At least one relevant professional certification (AWS DevOps Engineer, Solutions Architect, SysOps Administrator, CKA, or equivalent)

  • South African work authorisation or valid work permit

  • Willingness to participate in an on-call rotation for production AWS support

  • Ability to travel occasionally for client meetings or site visits

Who we are:

Skills Required

  • 7+ years in Cloud DevOps, SRE, or AWS operational support
  • Hands-on experience running AWS managed support/operations (incident management, alert triage, on-call, SLA-bound resolution)
  • Deep AWS proficiency (EC2, ECS/EKS, Lambda, S3, DynamoDB, RDS, VPC, Route 53, CloudFront, IAM, CloudWatch, X-Ray)
  • Expert Infrastructure as Code with Terraform (required)
  • CloudFormation and CDK (beneficial)
  • CI/CD platform experience (GitHub Actions, AWS CodePipeline, GitLab CI, or Jenkins)
  • Advanced scripting in Bash or Python for automation
  • Containerisation and orchestration (Docker, Kubernetes/EKS); Helm beneficial
  • Familiarity with AWS GuardDuty, Security Hub, and AWS Health alert handling and triage
  • Experience with ITSM/ticketing workflows and structured incident, problem, and change management
  • Monitoring and observability tooling experience (CloudWatch, Prometheus, Grafana, Datadog or similar)
  • AWS networking skills (VPC design, security groups, NACLs, Transit Gateway, load balancers, DNS, hybrid connectivity)
  • Secrets management experience (AWS Secrets Manager, Parameter Store, HashiCorp Vault)
  • DevSecOps practices (SCA/SAST/DAST, vulnerability and compliance automation)
  • AWS cost optimisation, tagging strategies, and FinOps practices
  • Ability to design and maintain CI/CD pipelines and ephemeral feature environments (Terragrunt/Terraform workspaces)
  • Experience integrating testing into CI (pytest, Postman/Newman, LocalStack)
  • Excellent communication, mentorship, and consulting/professional-services delivery experience
  • Matric (Grade 12) certificate
  • Bachelor's degree in Computer Science, IT, Engineering, or related field (or equivalent practical experience)
  • At least one relevant professional certification (AWS DevOps Engineer, Solutions Architect, SysOps Administrator, CKA, or equivalent)
  • South African work authorisation or valid work permit
  • Willingness to participate in an on-call rotation for production AWS support
  • Ability to travel occasionally for client meetings or site visits
  • Experience supporting production deployments, release coordination, and change windows
  • Experience driving problem management, root-cause analysis, and converting findings into permanent fixes/automation
  • Familiarity with blue/green or canary deployment strategies and automated rollback
  • Preferred: AWS certifications (Solutions Architect Professional, DevOps Engineer Professional, Security Specialty)
  • Preferred: AWS Certified SysOps Administrator (operations-focused)
  • Preferred: Certified Kubernetes Administrator (CKA) / CKAD
  • Preferred: Multi-cloud (AWS, Azure, GCP) or hybrid cloud architecture experience
  • Preferred: Experience in regulated industries (financial services, gaming/lottery, healthcare, government)
  • Preferred: GitOps tooling (ArgoCD, Flux) and serverless/event-driven design experience
  • Preferred: Disaster recovery, backup strategies, and multi-region deployments
  • Preferred: Configuration management (Ansible, Chef, Puppet)
  • Preferred: Public speaking or open-source contributions
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
689 Employees
Year Founded: 1999

What We Do

Custom Software, AI and Data Engineering services. Global Scale. Local Engagement. DVT is a proudly South African founded and operated technology services provider. As a BEE level 2 company, we are a committed local partner to South African clients, providing experienced software technology teams or staff to build, enhance or maintain your AI, data and software solutions.

Similar Jobs

Circle (circle.so) Logo Circle (circle.so)

Lead Product Designer

Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
Easy Apply
Remote
31 Locations
250 Employees
140K-170K Annually

Clipboard Logo Clipboard

Client Support Specialist

Edtech • Healthtech • Information Technology • Hospitality
Remote
7 Locations
700 Employees

ServiceNow Logo ServiceNow

Enterprise Account Executive

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Johannesburg, Gauteng, ZAF
29000 Employees

Circle (circle.so) Logo Circle (circle.so)

Designer

Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
Easy Apply
Remote
31 Locations
250 Employees
100K-120K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account