DevOps Engineer SE II - GCP & AI

Reposted 9 Days Ago
Pune, Mahārāshtra, IND
In-Office
Senior level
Digital Media • Gaming • Software
The Role
The DevOps Engineer will manage GCP infrastructure, build AI deployment pipelines, implement security measures, and optimize costs while ensuring system observability.
Responsibilities:
  • Infrastructure Ownership: Own Helpshift production services, ensure complete monitoring coverage, and troubleshoot and fix production issues.
  • Infrastructure as Code (IaC): Design and maintain scalable GCP infrastructure using Terraform o
  • AI Orchestration & LLMOps: Build deployment pipelines for AI agents, managing vector databases (e.g., Vertex AI Search, Pinecone, Weaviate, ElasticSearch) and model endpoints.
  • Security (DevSecOps): Implement "Security-by-Design," including IAM least-privilege access, secret management (Secret Manager), and automated vulnerability scanning for AI workloads.
  • CI/CD Excellence: Architect high-velocity pipelines for both traditional microservices and AI model prompts/configurations. Design, implement, and maintain secure CI/CD pipelines for automating deployment, configuration, and testing processes.
  • Observability: Set up comprehensive monitoring for system health and LLM-specific metrics (latency, token usage, and cost).
  • Cloud Governance: Optimise GCP costs and manage resource quotas, especially for GPU/TPU-intensive AI tasks.
  • Cross-Cloud Deployment: Establish and optimise connectivity between apps deployed in different cloud environments (AWS <> GCP).

Requirements:
  • 6+ years of relevant experience
  • Expert-level Google Cloud Platform (GCP) administration skills: GKE, Cloud Run, Vertex AI, GCS, NEG, etc.
  • Experience deploying Vector Databases (Pinecone, Weaviate, ElasticSearch or Vertex Search) and managing API rate limits/throttling for LLM providers.
  • Experience setting up Cloud Monitoring/Logging specifically for AI metrics: token consumption, inference latency, and model error rates.
  • In-depth knowledge of running/managing UNIX-like operating systems (we use Ubuntu)
  • Strong knowledge of networking protocols, security architectures, and identity and access management (IAM) principles.
  • Experience with containerisation technologies (e.g., Docker, Kubernetes) and securing containerised environments.
  • Proficiency in Python and Bash
  • Experience in designing and building solutions that are highly scalable, fault tolerant and cost-effective
  • Experience with IaC tools such as Ansible and Terraform.
  • Ability to analyse architectural bottlenecks and debug issues quickly through to resolution
  • An automation mindset and the ability to reason about and work with complex systems
  • Excellent communication and documentation skills
  • Quick learner and good mentor for junior team members

The Company
HQ: Dublin, Dublin
4,788 Employees
Year Founded: 1998

What We Do

Keywords Studios is an international technical and creative services provider to the global video games industry and beyond. We bring to life digital content that entertains, connects, challenges and educates people worldwide. Established in 1998, and now with more than 65 facilities in 22 countries strategically located in Asia, the Americas, Australia and Europe, we provide integrated art creation, marketing services, software engineering, testing, localization, audio and customer care services across more than 50 languages and 16 games platforms to a blue-chip client base of more than 950 clients across the globe.
