Site Reliability Engineer IV

Posted 17 Days Ago
Hyderabad, Telangana, IND
In-Office
Expert/Leader
Fintech • Financial Services
The Role
As a Site Reliability Engineer IV, you'll ensure the reliability and performance of digital banking applications, partner with development teams, and drive SRE practices. Responsibilities include application monitoring, incident response, automation of tasks, and mentoring peers.
Summary Generated by Built In

Candescent is the leading cloud-based digital banking solutions provider for financial institutions. We are transforming digital banking with intelligent, cloud-powered solutions that connect account opening, digital banking, and branch experiences for financial institutions. Our advanced technology and developer tools enable seamless, differentiated customer journeys that elevate trust, service, and innovation. Success here requires flexibility in a fast-paced environment, a client-first mindset, and a commitment to delivering consistent, reliable results as part of a performance-driven, values-led team. With team members around the world, Candescent is an equal opportunity employer.

Position: Site Reliability Engineer IV

Experience: 9-12 Years

Location: Bangalore (Ecospace)

The mission of Candescent Site Reliability Engineering (SRE) is to proactively ensure the reliability, availability, and performance of our Digital First banking applications. As a member of the SRE team, you will focus on building and operating highly reliable application platforms by applying SRE principles such as automation, observability, resilience, and continuous improvement.

You will partner closely with application and platform teams to define reliability standards, implement monitoring, alerting, and incident response practices, and embed scalability and performance considerations into application design and delivery. Through tooling, automation, and best practices, you will help development teams build and operate services that meet agreed reliability objectives.

As a senior engineer in the organization, you will also provide mentorship within the SRE team and across peer engineering teams, helping elevate operational maturity, drive adoption of SRE practices, and strengthen reliability culture across our core initiatives.

Responsibilities 

  • Support and operate production applications running on Kubernetes and AWS

  • Troubleshoot application-level issues using logs, metrics, traces, and runtime signals

  • Participate in incident response, root cause analysis, and post-incident reviews

  • Work closely with development teams to understand application architecture, dependencies, and data flows

  • Improve application observability by defining meaningful alerts, dashboards, and SLOs

  • Automate repetitive operational tasks to reduce toil

  • Support application deployments, rollbacks, and runtime configuration changes

  • Identify reliability, performance, and scalability gaps in application behavior

  • Drive continuous improvements in operational readiness, runbooks, and on-call practices

  • Influence application teams to adopt shift-left reliability practices

Must-Have Skills & Experience 

  • Hands-on experience supporting Java applications in production

  • Strong understanding of JVM fundamentals (heap/memory management, garbage collection, OOM issues, thread analysis)

  • Proven experience with SRE practices, including:

    • Incident response and on-call support

    • Root cause analysis and postmortems

    • SLIs, SLOs, and reliability-driven operations

  • Strong experience troubleshooting using application logs, metrics, and monitoring tools

  • Experience operating Java applications on Kubernetes (EKS) from an application/runtime perspective

  • Experience with deployment strategies (rolling, blue/green, canary)

  • Ability to write automation and scripts (in Python or any other language) to reduce operational toil

  • Solid understanding of application architecture and service dependencies (databases, messaging systems, external APIs)

  • Strong collaboration and communication skills; ability to work closely with development teams

  • Demonstrated accountability and sound judgment when responding to high-pressure incidents

Good-to-Have Skills & Experience 

  • Exposure to platform or infrastructure concepts supporting application workloads

  • Experience with AWS services such as EKS, RDS/Aurora, S3, EFS, and CloudWatch

  • CI/CD pipeline experience (GitHub Actions, Jenkins)

  • Familiarity with GitOps practices

  • Experience with cloud migrations or modernization efforts

Statement to Third Party Agencies
To ALL recruitment agencies: Candescent only accepts resumes from agencies on the preferred supplier list. Please do not forward resumes to our applicant tracking system, Candescent employees, or any Candescent facility. Candescent is not responsible for any fees or charges associated with unsolicited resumes.

Top Skills

AWS
CI/CD
CloudWatch
GitHub Actions
Java
Jenkins
Kubernetes
Python

The Company
HQ: Atlanta, Georgia
1,030 Employees
Year Founded: 2024

What We Do

Candescent brings together the transformative technologies that power and connect account opening, digital banking and branch solutions for banks and credit unions of all sizes. And we’re here to help you extend, differentiate and illuminate your digital-first banking experiences. Our industry-leading products and services, cloud architecture and on-demand developer tools give you the power to differentiate and deliver seamless customer journeys.
