Candescent

Site Reliability Engineer III

Posted 18 Days Ago

Bangalore, Bengaluru Urban, Karnataka, IND

In-Office

Senior level

Fintech • Financial Services

The Role

Design, build, and operate reliable cloud-native banking platforms using SRE principles. Implement automation, monitoring, incident response, scalability, and cost optimization. Mentor teams, support production escalations, and drive DevOps/SRE best practices.

Candescent is a forward-thinking technology company transforming how financial institutions deliver Intelligent Banking experiences. We unite and connect digital banking, account opening, and branch solutions, creating seamless engagement across digital, remote, and in-person channels.

Our Experience-Led, Intelligence-Driven approach combines human-centered design with data, automation, and cloud-based innovation. Built on an API-first architecture, our extensible ecosystem enables institutions to adapt quickly, integrate easily, and unlock new opportunities for growth—turning every customer interaction into a moment of clarity, confidence, and connection.

Position: Site Reliability Engineer III

Experience: 5-8 Years

Location: Bangalore (Ecospace)

The mission of Candescent Site Reliability Engineering (SRE) is to proactively ensure the reliability, availability, and performance of our Digital First banking applications. As a member of the SRE team, you will focus on building and operating highly reliable application platforms by applying SRE principles such as automation, observability, resilience, and continuous improvement.

You will partner closely with application and platform teams to define reliability standards; implement monitoring, alerting, and incident response practices; and embed scalability and performance considerations into application design and delivery. Through tooling, automation, and best practices, you will help development teams build and operate services that meet agreed reliability objectives.

As a senior engineer in the organization, you will also provide mentorship within the SRE team and across peer engineering teams, helping elevate operational maturity, drive adoption of SRE practices, and strengthen reliability culture across our core initiatives.

Responsibilities:

Partner with cloud architects to build, test, and revise proposed architectures and solutions.
Assist in building tools and automation to streamline existing processes.
Work with Development, Security, and Business Unit teams to deliver a world-class cloud platform.
Build automation scripts and frameworks to improve operational processes and procedures.
Learn, deploy, and document newer technologies for potential service deployments, following a development and release life cycle.
Support production escalations as needed.
Drive ongoing improvements and efficiencies in operational practices, tools, and processes.
Identify areas for improvement or gaps in our systems, whether related to scaling, reliability, automation, or cost optimization.

Required Skills/Experience:

Experience building and supporting production-level Kubernetes clusters and optimizing containerized workloads.
Experience with cloud networking: configuring VPCs, firewalls, ingress/egress, and CDN.
Experience with AWS services.
Hands-on experience with EKS, RDS, Lambda, CloudWatch, and storage solutions (EFS, S3, EBS, etc.).
Experience with Terraform/Terragrunt.
Experience with cloud migrations.
BS in Computer Science or related field, or equivalent experience.
Must have high initiative and be a clear communicator.
Must be good at setting up and troubleshooting environments.
Extensive experience with Prometheus, Dynatrace, or other monitoring/logging tools.
Strong knowledge of and experience with application and infrastructure delivery automation, orchestration, and configuration management.
Experience operating within cloud environments.
Ability to continue establishing best-in-class DevOps development, automation, and deployment practices, policies, and standards.

Desired Skill Set:

Container build/management and Kubernetes
Amazon Web Services
Cloud migrations (Google Cloud/AWS)
IaC – Terraform
Scripting – Python
CI/CD – GitHub
Version control – Git, GitOps

Statement to Third-Party Agencies
To ALL recruitment agencies: Candescent only accepts resumes from agencies on the preferred supplier list. Please do not forward resumes to our applicant tracking system, Candescent employees, or any Candescent facility. Candescent is not responsible for any fees or charges associated with unsolicited resumes.
