Principal SRE / Cross-Cluster SRE Lead

Reposted Yesterday
Be an Early Applicant
Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, MYS
In-Office
Expert/Leader
Fintech • Payments • Software • Financial Services
The Role
Lead cross-cluster Site Reliability Engineering efforts, enforce standards, drive incident reduction, define metrics, and enhance automation across systems.
Summary Generated by Built In

ABOUT US

We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium. We are the way the world moves value – across borders, through cities and overseas. No other organisation can address the scale, precision, pace and trust that this demands, and we’re proud to support the global economy. 

We’re unique too. We were established to find a better way for the global financial community to move value – a reliable, safe and secure approach that the community can trust, completely. We’re always striving to be better and are constantly evolving in an ever-changing landscape, without undermining that trust. Five decades on, our vibrant community reflects the complexity and diversity of the financial ecosystem. We innovate diligently, test exhaustively, then implement fast. In a connected and exciting era, our mission has never been more relevant. Swift now has a presence in 200+ countries and legal territories to serve a community of more than 12,000 banks and financial institutions.   

Key Responsibilities

Cross-Cluster Standardization

  • Define and enforce incident management practices
  • Standardize alerting, monitoring, and request handling
  • Align workflows across ServiceNow and Jira
  • Ensure consistency across all clusters

Reliability Engineering

  • Define SLO, SLA, MTTR, MTRS standards
  • Identify systemic reliability gaps across clusters
  • Drive incident reduction and prevention strategies
  • Establish reliability as a measurable discipline

Automation Strategy

  • Identify cross-cluster automation opportunities
  • Define reusable automation patterns and frameworks
  • Eliminate duplicated operational solutions
  • Drive reduction of manual toil

Architecture Alignment

  • Partner with Solution Architects across clusters
  • Ensure operability is built into system design
  • Align monitoring, alerting, and failover strategies
  • Prevent conflicting tooling or architectural decisions

Governance and Reviews

  • Lead cross-cluster SRE reviews
  • Track adoption of standards and practices
  • Drive accountability across clusters
  • Highlight systemic risks and gaps

Technical Leadership

  • Guide SRE leads across clusters
  • Raise technical standards within SRE
  • Mentor engineers on reliability practices
  • Influence engineering teams on operability

Minimum Requirements:

Experience

  • 15+ years in software engineering, platform engineering, or SRE
  • Experience operating production systems at scale
  • Experience across multiple systems or domains

Technical Depth

Strong in at least two areas:

  • Distributed systems
  • Observability and monitoring
  • Infrastructure and cloud platforms
  • Automation and software engineering

Capabilities

  • Strong debugging and incident analysis skills
  • Ability to design automation solutions
  • Strong systems thinking across complex environments
  • Ability to influence without direct authority

Success Indicators

  • Standardized SRE practices across all clusters
  • Reduced incident recurrence
  • Improved MTTR and operational efficiency
  • Increased automation coverage
  • Reduced duplication across teams

What we offer

We give you the freedom to be yourself. We are creating an environment of unique individuals – like you – with different perspectives on the financial industry and the world. A diverse and inclusive environment in which everyone’s voice counts and where you can reach your full potential.

We are committed to an inclusive and accessible recruitment process. If you require a reasonable accommodation related to accessibility during your application or interview, please contact [email protected] or indicate this in your application.

Please note that this mailbox is not monitored for general recruitment enquiries and should only be used for accessibility or accommodation-related requests (for example related to vision, hearing or neurodiversity).

All requests are confidential and will not affect your candidacy.

Don’t meet every single requirement? At Swift, we are dedicated to building a workplace where people can bring their full selves and ideas to the team, so if you are excited about this role, we encourage you to apply even if you do not meet every single qualification.

Skills Required

  • 15+ years in software engineering, platform engineering, or SRE
  • Experience operating production systems at scale
  • Experience across multiple systems or domains
  • Strong in at least two areas: Distributed systems, Observability and monitoring, Infrastructure and cloud platforms, Automation and software engineering
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York, NY
4,765 Employees
Year Founded: 1973

What We Do

SWIFT is a global member-owned cooperative and the world’s leading provider of secure financial messaging services. We provide our community with a platform for messaging and standards for communicating, and we offer products and services to facilitate access and integration, identification, analysis and regulatory compliance. Our messaging platform, products and services connect more than 11,000 banking and securities organisations, market infrastructures and corporate customers in more than 200 countries and territories. SWIFT also brings the financial community together – at global, regional and local levels – to shape market practice, define standards and debate issues of mutual interest or concern. For more information, visit www.swift.com or follow us on Twitter: @swiftcommunity

Similar Jobs

MongoDB Logo MongoDB

Senior Solutions Architect

Big Data • Cloud • Software • Database
Easy Apply
Remote or Hybrid
Malaysia
5550 Employees

Capco Logo Capco

Consultant

Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Hybrid
Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, MYS
6000 Employees

Capco Logo Capco

Business Analyst

Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Hybrid
Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, MYS
6000 Employees

Airwallex Logo Airwallex

Financial Crime Operation Senior Analyst

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
In-Office or Remote
Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, MYS
2200 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account