Site Reliability Engineer IV

Hyderabad, Telangana, IND
In-Office
Expert/Leader
Fintech • Financial Services
The Role
As a Site Reliability Engineer IV, you'll ensure the reliability and performance of digital banking applications, partner with development teams, and drive SRE practices. Responsibilities include application monitoring, incident response, automation of tasks, and mentoring peers.
Summary Generated by Built In

Candescent is the leading cloud-based digital banking solutions provider for financial institutions. We are transforming digital banking with intelligent, cloud-powered solutions that connect account opening, digital banking, and branch experiences for financial institutions. Our advanced technology and developer tools enable seamless, differentiated customer journeys that elevate trust, service, and innovation. Success here requires flexibility in a fast-paced environment, a client-first mindset, and a commitment to delivering consistent, reliable results as part of a performance-driven, values-led team. With team members around the world, Candescent is an equal opportunity employer.

Position: Site Reliability Engineer IV

Experience: 9-12 Years

Location: Bangalore (Ecospace)

Candescent's Site Reliability Engineering (SRE) mission is to proactively ensure the reliability, availability, and performance of our Digital First banking applications. As a member of the SRE team, you will focus on building and operating highly reliable application platforms by applying SRE principles such as automation, observability, resilience, and continuous improvement.

You will partner closely with application and platform teams to define reliability standards; implement monitoring, alerting, and incident response practices; and embed scalability and performance considerations into application design and delivery. Through tooling, automation, and best practices, you will help development teams build and operate services that meet agreed reliability objectives.

As a senior engineer in the organization, you will also provide mentorship within the SRE team and across peer engineering teams, helping elevate operational maturity, drive adoption of SRE practices, and strengthen reliability culture across our core initiatives.

Responsibilities 

  • Support and operate production applications running on Kubernetes and AWS

  • Troubleshoot application-level issues using logs, metrics, traces, and runtime signals

  • Participate in incident response, root cause analysis, and post-incident reviews

  • Work closely with development teams to understand application architecture, dependencies, and data flows

  • Improve application observability by defining meaningful alerts, dashboards, and SLOs

  • Automate repetitive operational tasks to reduce toil

  • Support application deployments, rollbacks, and runtime configuration changes

  • Identify reliability, performance, and scalability gaps in application behavior

  • Drive continuous improvements in operational readiness, runbooks, and on-call practices

  • Influence application teams to adopt shift-left reliability practices

Must-Have Skills & Experience 

  • Hands-on experience supporting Java applications in production

  • Strong understanding of JVM fundamentals (heap/memory management, garbage collection, OOM issues, thread analysis)

  • Proven experience with SRE practices, including:

    • Incident response and on-call support

    • Root cause analysis and postmortems

    • SLIs, SLOs, and reliability-driven operations

  • Strong experience troubleshooting using application logs, metrics, and monitoring tools

  • Experience operating Java applications on Kubernetes (EKS) from an application/runtime perspective

  • Experience with deployment strategies (rolling, blue/green, canary)

  • Ability to write automation and scripts (Python or any other language) to reduce operational toil

  • Solid understanding of application architecture and service dependencies (databases, messaging systems, external APIs)

  • Strong collaboration and communication skills; ability to work closely with development teams

  • Accountability and sound judgment when responding to high-pressure incidents

Good-to-Have Skills & Experience 

  • Exposure to platform or infrastructure concepts supporting application workloads

  • Experience with AWS services such as EKS, RDS/Aurora, S3, EFS, and CloudWatch

  • CI/CD pipeline experience (GitHub Actions, Jenkins)

  • Familiarity with GitOps practices

  • Experience with cloud migrations or modernization efforts

Statement to Third Party Agencies
To ALL recruitment agencies: Candescent only accepts resumes from agencies on the preferred supplier list. Please do not forward resumes to our applicant tracking system, Candescent employees, or any Candescent facility. Candescent is not responsible for any fees or charges associated with unsolicited resumes.

Candescent Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Candescent and has not been reviewed or approved by Candescent.

  • Leave & Time Off Breadth: Policies include unlimited vacation for full-time exempt staff, tenure-based accrual for non-exempt staff, plus floating holidays and sick leave. This breadth of time off suggests flexibility across employment classifications.
  • Wellbeing & Lifestyle Benefits: A discount program is cited that provides access to deals at over 250 retailers. This perk adds everyday savings beyond core benefits.

The Company
HQ: Atlanta, Georgia
1,030 Employees
Year Founded: 2024

What We Do

Candescent brings together the transformative technologies that power and connect account opening, digital banking and branch solutions for banks and credit unions of all sizes. And we’re here to help you extend, differentiate and illuminate your digital-first banking experiences. Our industry-leading products and services, cloud architecture and on-demand developer tools give you the power to differentiate and deliver seamless customer journeys.
