Software Engineering Manager - SRE

US Bank

Chennai, Tamil Nadu, IND

In-Office

Expert/Leader

Fintech

The Role

Lead SRE teams to own reliability for distributed systems, APIs, microservices and data pipelines. Define SLOs/SLIs, run incident response, capacity planning, resiliency testing, and automation to reduce toil. Manage people, on-call operations, cross-functional collaboration, platform enablement, and operational governance for production readiness and continuous reliability improvement.

Summary Generated by Built In

At U.S. Bancorp India, we’re on a journey to do our best. We believe it takes all of us to bring our shared ambition to life, and each person is unique in their potential. A career with U.S. Bancorp India gives you a wide, ever-growing range of opportunities to discover what makes you thrive at every stage of your career. Try new things, learn new skills and discover what you excel at—all from Day One.

Job Description

Key Responsibilities

Reliability Engineering & Service Operations

Own reliability outcomes for distributed systems, APIs, microservices, data pipelines, and critical production platforms, with accountability for availability, latency, throughput, and saturation

Define and operationalize SLOs, SLIs, error budgets, alert thresholds, and service health indicators to improve customer experience and engineering accountability

Lead production readiness reviews, capacity planning, performance testing, failover validation, chaos/resiliency testing, and disaster recovery preparedness

Drive standardization of monitoring, telemetry, distributed tracing, logging, synthetic checks, and runbook practices across services and platforms

Partner with engineering teams to reduce operational toil through automation, self-healing workflows, auto-remediation, and reliability-focused platform improvements

Incident Management & Operational Excellence

Lead major incident response, escalation management, and technical triage for high-severity production events, ensuring rapid mitigation, stakeholder communication, and service recovery in high-pressure, time-critical environments

Establish strong practices for root cause analysis, problem management, failure mode analysis, and durable corrective/preventive actions

Drive operational governance using metrics such as MTTR, MTTD, change failure rate, incident recurrence, alert noise, and service error budget consumption

Partner with infrastructure, application, database, and network teams to proactively identify scaling risks, dependency bottlenecks, and single points of failure

People & Team Management

Directly manage multiple SRE leads and senior reliability engineers, providing technical coaching, operational guidance, and performance leadership across teams

Build and scale high-performing SRE teams in the GCC environment with strong focus on production ownership, operational engineering, and systems thinking

Drive team capability in on-call operations, debugging, incident command, automation development, and platform diagnostics

Foster a culture of blameless incident analysis, engineering accountability, and continuous reliability improvement

Manage staffing, on-call coverage, skill distribution, and hiring aligned to platform complexity, production demand, and business criticality

Cross-Functional Collaboration

Partner with software engineering, infrastructure, database, network, security, and platform teams to improve system stability, deployment safety, and operational readiness

Translate business criticality and customer impact into technical reliability priorities, architecture guardrails, recovery objectives, and measurable engineering outcomes

Work effectively in a distributed/global operating model, ensuring seamless coordination with onshore engineers, command centers, platform owners, and leadership teams during both steady-state and incident scenarios

Automation & Platform Enablement

Promote and govern infrastructure as code, configuration management, CI/CD reliability, release guardrails, policy enforcement, and automated rollback/recovery patterns

Enable engineering teams with standardized tooling for observability, deployment validation, incident response, debugging, performance diagnostics, and service dependency analysis

Drive platform modernization through reusable automation, reliability frameworks, production diagnostics, and engineering patterns that improve resiliency and reduce mean time to recovery

Basic Qualifications

Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience

12+ years of experience in software engineering, site reliability engineering, production operations, platform engineering, or infrastructure engineering

5+ years of experience leading reliability or production engineering teams, including managing leads or senior engineers in technically complex environments

Preferred Skills & Experience

Proven experience managing multiple SRE, production engineering, or platform operations teams in a matrix/global setup

Strong experience working in offshore/onshore operating models, preferably within Banking and Financial Services

Hands-on knowledge of cloud platforms, Kubernetes, Linux systems, networking fundamentals, distributed systems, and infrastructure automation

Experience with observability platforms, telemetry pipelines, incident tooling, configuration management, and service governance

Strong understanding of CI/CD pipelines, scripting languages, infrastructure as code, release engineering, and automated operational workflows

Ability to review architecture and operational designs for scalability, fault tolerance, recovery, and performance bottlenecks

Strong stakeholder management and communication skills across global engineering teams and senior leadership

Leadership Competencies

Ability to balance technical depth, operational discipline, architecture awareness, and people leadership

Strong decision-making skills with a focus on business continuity, failure risk, service reliability, and engineering trade-offs, with the ability to remain calm and effective in high-pressure operational situations

Proven ability to influence engineering design and operational practices without direct authority across platform and application teams

High ownership mindset with focus on resilience engineering, operational excellence, and predictable service behavior in production

If there’s anything we can do to accommodate a disability during any portion of the application or hiring process, please refer to our disability accommodations for applicants.

Posting may be closed earlier due to high volume of applicants.

This is a U.S. Bancorp India posting. U.S. Bancorp India is a part of the U.S. Bank family.

The Company

HQ: Minneapolis, MN

What We Do

We believe in putting people first, and our dedication to making ethical decisions and doing the right thing is at the heart of what we do. We're proud to be named by Ethisphere as a 2018 World's Most Ethical Company.