Software Engineering Manager - SRE

Reposted 3 Days Ago
Be an Early Applicant
Chennai, Tamil Nadu, IND
In-Office
Expert/Leader
Fintech
The Role
Lead SRE teams to own reliability for distributed systems, APIs, microservices and data pipelines. Define SLOs/SLIs, run incident response, capacity planning, resiliency testing, and automation to reduce toil. Manage people, on-call operations, cross-functional collaboration, platform enablement, and operational governance for production readiness and continuous reliability improvement.
Summary Generated by Built In

At U.S. Bancorp India, we’re on a journey to do our best. We believe it takes all of us to bring our shared ambition to life, and each person is unique in their potential. A career with U.S. Bancorp India gives you a wide, ever-growing range of opportunities to discover what makes you thrive at every stage of your career. Try new things, learn new skills and discover what you excel at—all from Day One.

Job Description

Key Responsibilities 

Reliability Engineering & Service Operations 

  • Own reliability outcomes for distributed systems, APIs, microservices, data pipelines, and critical production platforms, with accountability for availability, latency, throughput, and saturation 

  • Define and operationalize SLOs, SLIs, error budgets, alert thresholds, and service health indicators to improve customer experience and engineering accountability 

  • Lead production readiness reviews, capacity planning, performance testing, failover validation, chaos/resiliency testing, and disaster recovery preparedness 

  • Drive standardization of monitoring, telemetry, distributed tracing, logging, synthetic checks, and runbook practices across services and platforms 

  • Partner with engineering teams to reduce operational toil through automation, self-healing workflows, auto-remediation, and reliability-focused platform improvements 

Incident Management & Operational Excellence 

  • Lead major incident response, escalation management, and technical triage for high-severity production events, ensuring rapid mitigation, stakeholder communication, and service recovery in high-pressure, time-critical environments 

  • Establish strong practices for root cause analysis, problem management, failure mode analysis, and durable corrective/preventive actions 

  • Drive operational governance using metrics such as MTTR, MTTD, change failure rate, incident recurrence, alert noise, and service error budget consumption 

  • Partner with infrastructure, application, database, and network teams to proactively identify scaling risks, dependency bottlenecks, and single points of failure 

People & Team Management 

  • Directly manage multiple SRE leads and senior reliability engineers, providing technical coaching, operational guidance, and performance leadership across teams 

  • Build and scale high-performing SRE teams in the GCC environment with strong focus on production ownership, operational engineering, and systems thinking 

  • Drive team capability in on-call operations, debugging, incident command, automation development, and platform diagnostics 

  • Foster a culture of blameless incident analysis, engineering accountability, and continuous reliability improvement 

  • Manage staffing, on-call coverage, skill distribution, and hiring aligned to platform complexity, production demand, and business criticality 

Cross-Functional Collaboration 

  • Partner with software engineering, infrastructure, database, network, security, and platform teams to improve system stability, deployment safety, and operational readiness 

  • Translate business criticality and customer impact into technical reliability priorities, architecture guardrails, recovery objectives, and measurable engineering outcomes 

  • Work effectively in a distributed/global operating model, ensuring seamless coordination with onshore engineers, command centers, platform owners, and leadership teams during both steady-state and incident scenarios 

Automation & Platform Enablement 

  • Promote and govern infrastructure as code, configuration management, CI/CD reliability, release guardrails, policy enforcement, and automated rollback/recovery patterns 

  • Enable engineering teams with standardized tooling for observability, deployment validation, incident response, debugging, performance diagnostics, and service dependency analysis 

  • Drive platform modernization through reusable automation, reliability frameworks, production diagnostics, and engineering patterns that improve resiliency and reduce mean time to recovery 

 

Basic Qualifications 

  • Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience 

  • 12+ years of experience in software engineering, site reliability engineering, production operations, platform engineering, or infrastructure engineering 

  • 5+ years of experience leading reliability or production engineering teams, including managing leads or senior engineers in technically complex environments 

 

Preferred Skills & Experience 

  • Proven experience managing multiple SRE, production engineering, or platform operations teams in a matrix/global setup 

  • Strong experience working in offshore/onshore operating models, preferably within Banking and Financial Services 

  • Hands-on knowledge of cloud platforms, Kubernetes, Linux systems, networking fundamentals, distributed systems, and infrastructure automation 

  • Experience with observability platforms, telemetry pipelines, incident tooling, configuration management, and service governance 

  • Strong understanding of CI/CD pipelines, scripting languages, infrastructure as code, release engineering, and automated operational workflows 

  • Ability to review architecture and operational designs for scalability, fault tolerance, recovery, and performance bottlenecks 

  • Strong stakeholder management and communication skills across global engineering teams and senior leadership 

 

Leadership Competencies 

  • Ability to balance technical depth, operational discipline, architecture awareness, and people leadership 

  • Strong decision-making skills with a focus on business continuity, failure risk, service reliability, and engineering trade-offs, with the ability to remain calm and effective in high-pressure operational situations 

  • Proven ability to influence engineering design and operational practices without direct authority across platform and application teams 

  • High ownership mindset with focus on resilience engineering, operational excellence, and predictable service behavior in production 

If there’s anything we can do to accommodate a disability during any portion of the application or hiring process, please refer to our disability accommodations for applicants.

Posting may be closed earlier due to high volume of applicants.

This is an U.S. Bancorp India posting. U.S. Bancorp India is a part of the U.S. Bank family.

Skills Required

  • Bachelor's degree in Computer Science, Engineering, or equivalent practical experience
  • 12+ years experience in software engineering, site reliability, production operations, platform engineering, or infrastructure engineering
  • 5+ years experience leading reliability or production engineering teams, including managing leads or senior engineers
  • Proven experience managing multiple SRE, production engineering, or platform operations teams in a matrix/global setup
  • Experience with offshore/onshore operating models, preferably in Banking and Financial Services
  • Hands-on knowledge of cloud platforms, Kubernetes, Linux systems, networking fundamentals, distributed systems, and infrastructure automation
  • Experience with observability platforms, telemetry pipelines, incident tooling, configuration management, and service governance
  • Strong understanding of CI/CD pipelines, scripting languages, infrastructure as code, release engineering, and automated operational workflows
  • Ability to review architecture and operational designs for scalability, fault tolerance, recovery, and performance bottlenecks
  • Strong stakeholder management and communication skills across global engineering teams and senior leadership

US Bank Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about US Bank and has not been reviewed or approved by US Bank.

  • Retirement Support The package pairs a pension with a matched 401(k), strengthening long‑term financial security. Retirement programs and other financial safeguards are presented as comprehensive.
  • Leave & Time Off Breadth Paid vacation, sick time, numerous holidays, and dedicated volunteer hours provide meaningful time away. Additional time off with tenure and options to expand PTO bolster flexibility.
  • Healthcare Strength Medical, dental, and vision coverage with HSA/FSA options and wellness resources are described as robust. Health insurance is characterized as top‑notch in multiple descriptions.

US Bank Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Minneapolis, MN

What We Do

We believe in putting people first, and our dedication to making ethical decisions and doing the right thing is at the heart of what we do. We're proud to be named by Ethisphere as a 2018 World's Most Ethical Company.

Similar Jobs

Comcast Logo Comcast

Specialist 3 - Functional Systems & Technology

Digital Media • Information Technology • News + Entertainment
Hybrid
Chennai, Tamil Nadu, IND
115000 Employees

Comcast Logo Comcast

Engineer 2 - Machine Learning

Digital Media • Information Technology • News + Entertainment
Hybrid
Chennai, Tamil Nadu, IND
115000 Employees

Comcast Logo Comcast

Engineer 2 - Cyber Security Operations

Digital Media • Information Technology • News + Entertainment
Hybrid
Chennai, Tamil Nadu, IND
115000 Employees

Comcast Logo Comcast

Specialist 3, Functional Systems & Technology

Digital Media • Information Technology • News + Entertainment
Hybrid
Chennai, Tamil Nadu, IND
115000 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account