Site Reliability Engineer III

Posted 18 Days Ago
Bangalore, Bengaluru Urban, Karnataka, IND
In-Office
Senior level
Fintech • Financial Services
The Role
Design, build, and operate reliable cloud-native banking platforms using SRE principles. Implement automation, monitoring, incident response, scalability, and cost optimization. Mentor teams, support production escalations, and drive DevOps/SRE best practices.

Candescent is a forward-thinking technology company transforming how financial institutions deliver Intelligent Banking experiences. We unite digital banking, account opening, and branch solutions, creating seamless engagement across digital, remote, and in-person channels.

Our Experience-Led, Intelligence-Driven approach combines human-centered design with data, automation, and cloud-based innovation. Built on an API-first architecture, our extensible ecosystem enables institutions to adapt quickly, integrate easily, and unlock new opportunities for growth—turning every customer interaction into a moment of clarity, confidence, and connection.

Position: Site Reliability Engineer III

Experience: 5-8 Years

Location: Bangalore (Ecospace)

The mission of Candescent's Site Reliability Engineering (SRE) team is to proactively ensure the reliability, availability, and performance of our Digital First banking applications. As a member of the SRE team, you will focus on building and operating highly reliable application platforms by applying SRE principles such as automation, observability, resilience, and continuous improvement.

You will partner closely with application and platform teams to define reliability standards, implement monitoring, alerting and incident response practices and embed scalability and performance considerations into application design and delivery. Through tooling, automation, and best practices, you will help development teams build and operate services that meet agreed reliability objectives. 

As a senior engineer in the organization, you will also provide mentorship within the SRE team and across peer engineering teams, helping elevate operational maturity, drive adoption of SRE practices, and strengthen reliability culture across our core initiatives.

Responsibilities:

  • Partner with cloud architects to build, test, and revise proposed architectures and solutions.
  • Assist in building tools and automation to streamline existing processes.
  • Work with Development, Security, and Business Unit teams to deliver a world-class cloud platform.
  • Build automation scripts and frameworks to improve operational processes and procedures.
  • Learn, deploy, and document newer technologies for the potential deployment of services following a development and release life cycle.
  • Support production escalations as needed.
  • Drive ongoing improvements and efficiencies in operational practices, tools, and processes.
  • Identify areas for improvement or gaps in our systems, whether related to scaling, reliability, automation, or cost optimization.

Required Skills/Experience:

  • Building and supporting production-level Kubernetes clusters; optimizing containerized workloads.
  • Experience with cloud networking: configuring VPCs, firewalls, ingress/egress, and CDNs.
  • Experience with AWS services.
  • Hands-on experience with EKS, RDS, Lambda, CloudWatch, and storage solutions (EFS, S3, EBS, etc.).
  • Experience with Terraform/Terragrunt.
  • Experience with cloud migrations.
  • BS in Computer Science or related field, or equivalent experience.
  • Must have high initiative and be a clear communicator.
  • Must be good at setting up and troubleshooting environments.
  • Extensive experience with Prometheus, Dynatrace, or other monitoring and logging tools.
  • Strong knowledge of and experience with application and infrastructure delivery automation, orchestration, and configuration management.
  • Experience operating within cloud environments.
  • Continued establishment of best-in-class DevOps development, automation, and deployment practices, policies, and standards.

Desired Skill Set:

  • Container build/management and Kubernetes
  • Amazon Web Services
  • Cloud migrations (Google/AWS)
  • IaC – Terraform
  • Scripting – Python
  • CI/CD – GitHub
  • Version control – Git, GitOps

Statement to Third Party Agencies
To ALL recruitment agencies: Candescent only accepts resumes from agencies on the preferred supplier list. Please do not forward resumes to our applicant tracking system, Candescent employees, or any Candescent facility. Candescent is not responsible for any fees or charges associated with unsolicited resumes.

Skills Required

  • Production Kubernetes clusters and container workload optimization
  • Cloud networking (VPC, firewalls, ingress/egress, CDN)
  • Experience with AWS services
  • Hands-on experience with EKS, RDS, Lambda, CloudWatch, EFS, S3, EBS
  • Experience with Terraform and Terragrunt
  • Experience in cloud migrations
  • BS in Computer Science or related field, or equivalent experience
  • High initiative and clear communication skills
  • Ability to set up and troubleshoot environments
  • Extensive experience with Prometheus, Dynatrace, or other monitoring/logging tools
  • Application and infrastructure automation, orchestration and configuration management expertise
  • Experience operating within cloud environments
  • Drive DevOps development, automation and deployment practices, policies and standards
  • Container build and management
  • Cloud migrations experience with Google Cloud (GCP)
  • Scripting with Python
  • CI/CD with GitHub
  • Version control with Git and GitOps practices

The Company
1,030 Employees
