Site Reliability Engineer (SRE)

Reposted 22 Days Ago
Be an Early Applicant
Hyderabad, Telangana, IND
In-Office
Senior level
Artificial Intelligence • Natural Language Processing
The Role
As a Site Reliability Engineer, you'll design and manage Kubernetes infrastructures, implement advanced autoscaling solutions, and enhance observability while collaborating with engineering teams to ensure reliability and performance of services.
Summary Generated by Built In

Kore.ai is a globally recognized leader in the conversational and generative AI space helping enterprises deliver extraordinary experiences for their customers, employees, and contact center agents. Kore.ai’s goal is to empower businesses with effective, simple and responsible AI solutions that create engaging interactions. sectors serving over 100M of consumers and 500,000+ employees worldwide. With billions of interactions automated using our AI-powered technology, we have been able to save over $500M for these companies. 


Kore.ai is one of the fastest growing AI companies globally. We are recognized as a leader by the leading technology and industry analysts like Gartner, Forrester, IDC, ISG, Everest, and others. 


Founded in 2014 by serial successful entrepreneur, Raj Koneru, Kore.ai supports customers globally across offices in Orlando, Hyderabad, New York, London, Germany, Dubai Frankfurt, Tokyo and Seoul.


We’re reshaping the way companies harness the power of AI, simplifying and enhancing accessibility. Work alongside some of the brightest minds in the industry to pioneer safe, reliable solutions. Join the Kore.ai team and help companies of all sizes simplify the adoption of advanced AI solutions responsibly.

JD – Site Reliability Engineer (SRE)

Kore.ai is a pioneering force in enterprise AI transformation, empowering organizations through our

comprehensive agentic AI platform. With innovative offerings across "AI for Service," "AI for Work," and

"AI for Process," we're enabling over 400+ Global 2000 companies to fundamentally reimagine their

operations, customer experiences, and employee productivity.

Our end-to-end platform enables enterprises to build, deploy, manage, monitor, and continuously

improve agentic applications at scale. We've automated over 1 billion interactions every year with voice

and digital AI in customer service and transformed employee experiences for tens of thousands of

employees through productivity and AI-driven workflow automation.

Recognized as a leader by Gartner, Forrester, IDC, ISG, and Everest, Kore.ai has secured Series D

funding of $150M, including strategic investment from NVIDIA to drive Enterprise AI innovation.

Founded in 2014 and headquartered in Florida, we maintain a global presence with offices in India, UK,

Germany, Korea, and Japan.

About the Role

  • We are seeking an experienced Site Reliability Engineer (SRE) with a strong focus on Kubernetes
  • ecosystems to join our growing team. You will play a critical role in designing, operating and scaling our
  • cloud-native infrastructure, ensuring high availability, performance and resilience of our production
  • services.
  • The ideal candidate has deep hands-on expertise in Kubernetes orchestration, advanced autoscaling
  • strategies, GitOps workflows, infrastructure-as-code provisioning and modern observability practices.
  • You will work closely with engineering and product support teams to embed reliability into every layer of
  • our stack.

RESPONSIBILITIES

  • Design, manage, and optimize large-scale Kubernetes clusters (EKS/AKS/GKE or selfmanaged)
  • for reliability, security and cost efficiency.
  • Implement and maintain advanced autoscaling solutions using HPA, VPA and event-driven
  • scaling with KEDA.
  • Provision and manage cloud infrastructure and Kubernetes resources declaratively using
  • Crossplane for multi-cloud/hybrid environments.
  • Drive GitOps practices by owning and enhancing Argo CD deployments, application sets, and
  • progressive delivery workflows (canary, blue-green).
  • Define, monitor, and enforce Service Level Objectives (SLOs), Service Level Indicators (SLIs),
  • and error budgets using observability data.
  • Build and maintain comprehensive observability pipelines with tools using OpenTelemetry or
  • eBPF.
  • Participate in on-call rotations, lead incident response, perform root cause analysis and
  • facilitate blameless postmortems.
  • Collaborate on capacity planning, chaos engineering experiments and disaster recovery
  • strategies.

EXPERIENCE REQUIRED

  • Bachelor's degree in Computer Science, Engineering, or equivalent practical experience.
  • 8+ years of experience in SRE, Platform Engineering or DevOps roles with a heavy Kubernetes
  • focus.
  • Expert-level knowledge of Kubernetes, including custom resource definitions, operators,
  • networking (CNI), storage (CSI), and security (Pod Security Standards, OPA/Gatekeeper).
  • Good production experience with:
  • Autoscaling: HPA (metrics-based), VPA, and KEDA (event-driven scaling for queues,
  • databases, etc.).
  • Crossplane for provisioning cloud resources and composing control planes.
  • Argo CD for declarative GitOps deployments, multi-cluster management, and application
  • lifecycle.
  • Strong hands-on experience with observability platforms, particularly distributed tracing and
  • performance analytics or eBPF-based full-stack observability.
  • Proficiency in Infrastructure as Code tools (Terraform, Helm, Jsonnet/Kustomize).
  • Programming skills in Python, Go, or similar for automation and tooling.
  • Solid understanding of CI/CD pipelines (GitHub Actions, GitLab CI, Argo Workflows).

PREFERRED SKILLS

  • Experience with multi-region/multi-cluster Kubernetes architectures and service meshes
  • (Istio/Linkerd).
  • Contributions to or deep usage of chaos engineering tools (Chaos Mesh, Litmus).
  • Familiarity with cost optimization tools (Kubecost, CloudZero) and FinOps practices.
  • Relevant certifications (CKA/CKS, Google Professional Cloud Architect, etc.).
  • Experience implementing SLO-driven development and reliability budgeting.

EDUCATION QUALIFICATION

Bachelor’s degree in computer science, Engineering, or equivalent practical experience.

Skills Required

  • Bachelor's degree in Computer Science, Engineering, or equivalent practical experience
  • 8+ years of experience in SRE, Platform Engineering or DevOps roles with Kubernetes focus
  • Expert knowledge of Kubernetes orchestration, operators, and security standards
  • Production experience with HPA, VPA, and KEDA
  • Proficiency in Infrastructure as Code tools (Terraform, Helm)
  • Programming skills in Python, Go, or similar for automation

Kore.ai Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Kore.ai and has not been reviewed or approved by Kore.ai.

  • Fair & Transparent Compensation Pay is characterized as fair or competitive in certain roles, with particular strength noted in senior U.S. go-to-market positions. Pay-and-benefits sentiment also trends toward the middle-to-okay range rather than uniformly negative.
  • Healthcare Strength Health insurance is portrayed as strong in the limited U.S. benefits snapshots available. Core medical coverage is presented as a clear bright spot where details are provided.
  • Parental & Family Support Paid parental leave is presented as comparatively strong, with maternity and paternity leave described as meaningful. Family-related time-off support is one of the more consistently specified benefits elements.

Kore.ai Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Orlando, FL
654 Employees
Year Founded: 2013

What We Do

Digital future is headed where convenience and speed coexist with personal and human touch. People should be able to communicate with companies, systems and smart machines in the same way they’d talk to friends and colleagues. Kore.ai strives to achieve this with its unsurpassed innovation in Natural Language Understanding (NLU) to deliver the next generation of human-to-machine interactions, in the form of virtual assistants, for the greater good of our customers and their employees. We are constantly in search of people who share this vision and are fired by values that we believe are at the core of everything we do: an obsession to achieve client success through understanding and empathy, turning every challenge as an opportunity to innovate, a culture of openness, the willingness to embrace bold ideas; and being a trailblazer when it comes to trying, failing (fast) and learning to succeed. A reason why reputed analyst firms like Gartner, Forrester, IDC, ISG, Everest Group, Celent, and more have recognized as a Market Leader. It is not worth living if you are not having some fun – Kore.ai has everything you need to make your career successful, purposeful, and happy. Because digital experiences of today will eventually manifest as conversational interactions and transform the way enterprises interact with their customers, partners and employees, we have every reason to believe you can make a difference.

Similar Jobs

In-Office
Hyderabad, Telangana, IND
3062 Employees

Zeta Logo Zeta

Site Reliability Engineer

Cloud • Fintech • Financial Services
In-Office
Hyderabad, Telangana, IND
1834 Employees
In-Office
Hyderabad, Telangana, IND
505 Employees
In-Office
Hyderabad, Telangana, IND
3062 Employees

Similar Companies Hiring

Legora Thumbnail
Artificial Intelligence • Legal Tech • Software
Chicago, Illinois
700 Employees
Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account