Senior Site Reliability Engineer - ELK

Reposted 16 Days Ago
Be an Early Applicant
Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, MYS
In-Office
Senior level
Fintech • Payments • Software • Financial Services
The Role
The Senior Site Reliability Engineer will enhance systems reliability and operability using ELK and Kafka, develop automation tools, and support integration efforts. They will document processes, train junior members, and collaborate with teams for ongoing system improvements.
Summary Generated by Built In

ABOUT US

We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium. We are the way the world moves value – across borders, through cities and overseas. No other organisation can address the scale, precision, pace and trust that this demands, and we’re proud to support the global economy. 

We’re unique too. We were established to find a better way for the global financial community to move value – a reliable, safe and secure approach that the community can trust, completely. We’re always striving to be better and are constantly evolving in an ever-changing landscape, without undermining that trust. Five decades on, our vibrant community reflects the complexity and diversity of the financial ecosystem. We innovate diligently, test exhaustively, then implement fast. In a connected and exciting era, our mission has never been more relevant. Swift now has a presence in 200+ countries and legal territories to serve a community of more than 12,000 banks and financial institutions.   

Join our central DevOps Engineering Services organization at Swift, committed to reshaping the developer experience. As a Site Reliability Engineer, you'll be pivotal in crafting end-to-end delivery pipelines, ensuring seamless integration, deployment of infrastructure and software, and providing essential maintenance and support to our developer community. With a strong focus on real zero-trust strategies, problem-solving capabilities, and customer-oriented approaches, join us on our transformative journey.

What to expect:

  • Contribute to deployment phases with a focus on scalability, reliability, and operability of ELK and Kafka solutions. Ensure that production readiness is considered at every stage of the software lifecycle.
  • Develop automation scripts, infrastructure as code, and tooling using industry best practices to improve system reliability, reduce manual effort, and enable self-service.
  • Analyze production issues, identify root causes, and implement long-term reliability improvements through automation, alerting, monitoring, and architectural enhancements.
  • Work collaboratively with other team members and provide guidance to more junior team members.
  • Organize an efficient handover through high quality documentation and training.
  • Automate the deployment and operation of multi-tenant infrastructure, handling tasks that ensure system resilience and availability.
  • Develop and maintain monitoring tools, dashboards, and self-healing mechanisms.
  • Participate in on-call rotations, weekend deployment duty, conduct blameless postmortems, and drive continuous learning.
  • Work closely with developers, product teams, and engineering stakeholders to troubleshoot issues, improve systems, and integrate reliability improvements
  • Collaborate with technical teams on operational concerns of integration solutions on ELK platform.

What will make you successful?

  • Bachelor’s/master’s degree in engineering, Computer Science, IT, or equivalent experience.
  • Minimum 8 years of SRE/Software development experience in an (preferably) international setting.
  • Familiarity or experience with data ingestion with big data technologies (Elastic Search, Logstash, Kibana and kafka).
  • Experience with CICD development & deployment tools such as Maven, Jenkins, Nexus, Git, and Docker.
  • Proficiency in Linux OS
  • Proficiency in scripting and automation (e.g. Python, PowerShell, YAML) with the ability to develop tools and infrastructure as code (Preferably Ansible, Terraform, Kubernetes, OpenShift).
  • Understanding of distributed systems and microservices architectures, including REST and SOAP APIs.
  • Hands-on experience with ITIL processes, including Incident, Problem, and Continual Improvement, is an advantage.
  • Experience working within an Agile-driven environment.
  • Practical experience in building metrics for data-driven reporting.
  • Strong interpersonal skills with a customer-centric mindset and ability to work effectively across diverse cultures.
  • Proven ability to collaborate with both local and remote teams across different time zones.
  • Familiarity with or experience in managing VM hosts using vCenter is an advantage

What we offer

We give you a competitive package

We help you perform at your best

We help you make a difference

We give you the freedom to be yourself

We give you the freedom to be yourself. We are creating an environment of unique individuals – like you – with different perspectives on the financial industry and the world. A diverse and inclusive environment in which everyone’s voice counts and where you can reach your full potential.

We are committed to an inclusive and accessible recruitment process. If you require a reasonable accommodation related to accessibility during your application or interview, please contact [email protected] or indicate this in your application.

Please note that this mailbox is not monitored for general recruitment enquiries and should only be used for accessibility or accommodation-related requests (for example related to vision, hearing or neurodiversity).

All requests are confidential and will not affect your candidacy.

Don’t meet every single requirement? At Swift, we are dedicated to building a workplace where people can bring their full selves and ideas to the team, so if you are excited about this role, we encourage you to apply even if you do not meet every single qualification.

Top Skills

Ansible
Docker
Elastic Search
Git
Jenkins
Kafka
Kibana
Kubernetes
Linux
Logstash
Maven
Nexus
Openshift
Powershell
Python
Terraform
Yaml
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York, NY
4,765 Employees
Year Founded: 1973

What We Do

SWIFT is a global member-owned cooperative and the world’s leading provider of secure financial messaging services. We provide our community with a platform for messaging and standards for communicating, and we offer products and services to facilitate access and integration, identification, analysis and regulatory compliance. Our messaging platform, products and services connect more than 11,000 banking and securities organisations, market infrastructures and corporate customers in more than 200 countries and territories. SWIFT also brings the financial community together – at global, regional and local levels – to shape market practice, define standards and debate issues of mutual interest or concern. For more information, visit www.swift.com or follow us on Twitter: @swiftcommunity

Similar Jobs

Pfizer Logo Pfizer

Health Representative (Central)

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Hybrid
Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, MYS
121990 Employees

CrowdStrike Logo CrowdStrike

Regional Sales Manager

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
Malaysia
10000 Employees

Wise Logo Wise

FinCrime Reporting Specialist - Indonesian Speaker

Fintech • Mobile • Payments • Software • Financial Services
Hybrid
Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, MYS
8000 Employees

Airwallex Logo Airwallex

Finance Manager

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
In-Office
Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, MYS
2000 Employees

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account