Site Reliability Engineer III (DevOps+SRE+Platform Eng)

Bengaluru, Bengaluru Urban, Karnataka, IND
In-Office
Senior level
Information Technology • Productivity • Software
The Role
Design, provision, and maintain cloud-native infrastructure and large-scale Kubernetes clusters. Build self-service developer platform tools (primarily in Go), automate guardrails with Kyverno/OPA, optimize GitHub Actions and ArgoCD GitOps pipelines, improve observability, enforce cost-efficiency (FinOps), and embed AI-assisted engineering to streamline delivery and reliability.
Join Vonage and help us innovate cloud communications for businesses worldwide!
Why this role matters:

As a Platform Engineer, you are a force multiplier for our engineering teams. You don’t just manage servers; you build and maintain the cloud-native foundations that make shipping code at Vonage fast, safe, and easy. Your mission is to manage robust infrastructure while eliminating developer friction through simple-to-use automation, ensuring our production APIs remain highly available, scalable, and cost-efficient.

Your key responsibilities:
  • Infrastructure Management: Take ownership of our cloud-native footprint. You will lead the design, provisioning, and maintenance of resilient infrastructure, ensuring our environments are stable and meet high-availability requirements.
  • Build Self-Service Tools: Help build and evolve our internal developer platform. You’ll use Go (predominantly) to create workflows that automate repetitive tasks, such as provisioning infrastructure, restarting deployments, or managing service health.
  • Kubernetes Operations: Act as a key technical resource for our large-scale Kubernetes clusters. You’ll dive deep into the stack to troubleshoot complex networking or performance bottlenecks that others can't solve.
  • Automate Guardrails: Use Kyverno to bake security and best practices directly into the cluster. You’ll ensure that as we scale, our clusters stay compliant and healthy without slowing down development.
  • Streamline Delivery: Own the "path to production" by optimizing our GitHub Actions and ArgoCD pipelines, ensuring global teams can deploy code safely through GitOps.
  • Embed AI-assisted engineering into daily practice to accelerate delivery and improve outcomes.
  • Proactively adopt emerging AI capabilities to improve workflows, and share best practices with the team.
What you'll bring:
  • Experience: 5+ years in DevOps, Platform Engineering, or SRE roles with a heavy focus on cloud-native architecture.
  • Kubernetes Knowledge: Practical experience managing and troubleshooting production EKS or GKE clusters. You should be comfortable "under the hood" of a cluster.
  • Programming Skills: Strong proficiency in Go (preferred) or Python. We are looking for someone who can build tools, APIs, or custom controllers to automate complex logic.
  • CI/CD & GitOps: Hands-on experience with ArgoCD and GitHub Actions (specifically creating reusable workflows and custom actions).
  • Policy & Config: Experience using Helm for packaging and Kyverno (or OPA) for policy enforcement.
  • Infrastructure as Code: Mastery of Terraform, with an interest in moving toward Kubernetes-native management (like Crossplane).
  • System Reliability: A deep commitment to infrastructure stability and the ability to manage complex cloud environments at scale.
  • The Troubleshooting Mindset: You enjoy solving "hard" problems and can navigate complex distributed systems to find root causes.
  • Observability: Experience using tools like Prometheus or Grafana to monitor system health and performance.
  • Efficiency: A "FinOps" mindset - you care about right-sizing resources so we aren't over-provisioning or wasting budget.
How you’ll benefit:
  • Attractive Discretionary Time Off
  • Private Medical Insurance with optional dependent coverage
  • Educational Assistance Reimbursement Program
  • Opportunities for reimbursement for conferences, trainings, and other personal development events
  • Maternity and Paternity Leave
  • Ask your recruiter for country-specific information
  • Additional benefits and perks will be shared and discussed with you by the recruiter during the interview process

There’s no perfect candidate. You don't need all the preferred qualifications to make a valuable impact on our team. Our employees and customers come from diverse backgrounds, so if you're passionate about what you could achieve at Vonage, we'd love to hear from you.

To learn how we process your personal data during the recruitment process, please refer to our Privacy Notice

Who we are:

Vonage is a global cloud communications leader. Your talent will help brands such as Airbnb, Viber, WhatsApp, and Snapchat accelerate their digital transformation through our fully programmable unified communications, contact center solutions, and communications APIs. Ready to innovate? Then join us today.

Note: The purpose of this profile is to provide a general summary of essential responsibilities for the position and is not meant as an exhaustive list. Assignments may differ for individuals within the same role based on business conditions, departmental need or geographic location. 

Skills Required

  • 5+ years in DevOps, Platform Engineering, or SRE roles
  • Production experience managing and troubleshooting EKS or GKE Kubernetes clusters
  • Strong proficiency in Go (preferred) or Python for building automation, tools, APIs, or controllers
  • Hands-on experience with ArgoCD and GitHub Actions (creating reusable workflows and custom actions)
  • Experience using Helm for packaging
  • Experience with Kyverno or OPA for policy enforcement
  • Mastery of Terraform for infrastructure as code
  • Interest or experience with Crossplane or Kubernetes-native management
  • Observability experience with Prometheus or Grafana
  • Demonstrated troubleshooting skills for complex distributed systems and networking/performance bottlenecks
  • FinOps mindset: experience or focus on cost efficiency and right-sizing cloud resources
  • Experience building internal developer platforms and automation to reduce developer friction

The Company
HQ: Holmdel, NJ
2,500 Employees
Year Founded: 2001

What We Do

We’re making communications more flexible, intelligent, and personal, to help enterprises the world over stay ahead. We provide unified communications, contact centers and programmable communications APIs, built on the world's most flexible cloud communications platform.
