Senior DevOps Engineer

Posted Yesterday
Be an Early Applicant
Riyadh, SAU
In-Office
Senior level
Gaming • Information Technology • Software
The Role
Design, operate, and secure infrastructure for generative AI products: IaC, CI/CD, container platforms, GPU model serving, networking, secrets, databases, observability, and incident response. Set operational standards for reliability, cost, and security.
Summary Generated by Built In

This role builds and runs the infrastructure our Generative AI products depend on: the pipelines that ship code, the platforms that run services and models, and the controls that keep all of it secure and reliable. AI workloads bring their own demands. GPUs, model serving, inference autoscaling, and token cost all shape the work, and you have run workloads like these before. You should be comfortable owning infrastructure as code, CI/CD, observability, and security on AWS, and ready to set the operational standards a growing team will lean on.

WHAT YOU WILL DO

  • Write and maintain infrastructure as code so environments are reproducible, reviewable, and quick to recover.
  • Own CI/CD: the pipelines that build, test, scan, and deploy applications, agents, and model-serving services.
  • Run the container platform (EKS, ECS, or Fargate) and the deployment workflows on top of it, including GitOps where it fits
  • Stand up the runtime for AI workloads: GPU capacity, model serving such as vLLM, Triton, or TGI, inference autoscaling, and the gateways and caching that sit in front of the models.
  • Manage API gateways, networking, load balancing, DNS, and certificates so services are exposed safely and predictably.
  • Own secrets, identity, and least-privilege access across every environment.
  • Run databases in production: clustering, replication, failover, backups, and recovery.
  • Build monitoring into everything, including token usage and GPU utilisation, with alerting and clear service objectives.
  • Lead reliability and security practice: incident response, policy as code, vulnerability and container scanning, and cost discipline, which matters once GPUs are in the mix.

Requirements
  • Eight or more years in DevOps, SRE, or infrastructure engineering overall. That includes hands-on experience supporting AI or ML workloads in production, which can be a more recent part of your backgroun.
  • Strong infrastructure as code with Terraform or OpenTofu, including module design and remote state. Experience
  • Strong infrastructure as code with Terraform or OpenTofu, including module design and remote state. Experience with HCP Terraform (formerly Terraform Cloud) is a plus.
  • Configuration management with Ansible.
  • Solid AWS experience across compute, networking (VPC, subnets, security groups, load balancers, Route 53), IAM, and storage
  • Strong CI/CD with GitHub Actions, including reusable workflows and careful handling of credentials.
  • Containers and orchestration: Docker with Kubernetes (EKS preferred), Helm, and a registry such as ECR.
  • API gateway experience with Kong or Amazon API Gateway, including auth, rate limiting, and routing.
  • Database operations including clustering and high availability, with RDS or Aurora, PostgreSQL, and a cache such as Redis or ElastiCache
  • Secrets management with HashiCorp Vault, AWS Secrets Manager, or Parameter Store.
  • Observability with Prometheus, Grafana, CloudWatch, and OpenTelemetry, or close equivalents.
  • Comfort in Linux and scripting with Bash and Python.

Skills Required

  • Eight or more years in DevOps, SRE, or infrastructure engineering
  • Hands-on experience supporting AI or ML workloads in production (GPUs, model serving, inference autoscaling)
  • Infrastructure as code with Terraform or OpenTofu, including module design and remote state
  • Experience with HCP Terraform (Terraform Cloud)
  • Configuration management with Ansible
  • Strong AWS experience across compute, networking (VPC, subnets, security groups, load balancers, Route 53), IAM, and storage
  • CI/CD with GitHub Actions, including reusable workflows and secure credential handling
  • Containers and orchestration: Docker, Kubernetes (EKS preferred), Helm, and container registry (ECR)
  • API gateway experience with Kong or Amazon API Gateway (auth, rate limiting, routing)
  • Database operations: RDS/Aurora, PostgreSQL, clustering, replication, backups, and Redis/ElastiCache
  • Secrets management with HashiCorp Vault, AWS Secrets Manager, or Parameter Store
  • Observability with Prometheus, Grafana, CloudWatch, and OpenTelemetry (or equivalents)
  • Comfortable in Linux and scripting with Bash and Python
  • Experience with model-serving frameworks such as vLLM, Triton, or TGI
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Boston
73 Employees
Year Founded: 2024

What We Do

Mirai is a Riyadh-based video games studio where technology and creative talent combine with industry experts to learn, develop, and shape the future of games.

Similar Jobs

Devsinc Logo Devsinc

Senior Devops Engineer

Information Technology • Software
In-Office
Riyadh, SAU
1934 Employees

Adree Logo Adree

Devops Engineer

Information Technology • Software
In-Office
Riyadh, SAU
18 Employees

Capco Logo Capco

Senior Manager/Director - Data Lead

Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Remote or Hybrid
10 Locations
6000 Employees

Capco Logo Capco

Information Technology Business Analyst

Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Remote or Hybrid
10 Locations
6000 Employees

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account