Want to build a worldwide brand from Taiwan, and to communicate our brand story to millions of users worldwide?
Want to be based in Taiwan but work in a silicon-valley-like environment, and to build world-class brand and products?
Want to participate in the global fintech and blockchain movement, and work at an English-speaking workplace?
Come change the world with us! Join this fast-growing startup founded by software veterans and funded by top VCs, Skype co-founders, and the Taiwanese government (NDF)!
We’re hiring for an experienced SRE Manager. The exact mix of other skills does not matter, so long as your tool chest includes a mix of abilities. Be willing to attack anything that comes your way, learn on the fly and get things done.
Come talk to us if you want to push your skillset in a dynamic fast-paced environment.
Responsibilities:
- Lead and manage the SRE team to ensure high availability, scalability, and reliability of production systems
- Own AWS cloud infrastructure operations, monitoring, security, resource management, and cost optimization in a 24/7 environment
- Lead incident management, troubleshooting, RCA, and post-incident improvements
- Ensure infrastructure, cloud environments, and operational processes comply with security, audit, and regulatory requirements (e.g. MAS TRM, ISO 27001)
- Drive SRE best practices including observability, alerting, SLA/SLO/SLI, capacity planning, disaster recovery, and high availability
- Improve system performance, reliability, and operational efficiency through automation and architecture optimization
- Build and maintain CI/CD, IaC, and GitOps workflows to improve deployment efficiency and system consistency
- Manage Kubernetes / EKS platforms and containerized infrastructure
- Collaborate closely with Backend, Data, Security, and Product teams on architecture design and operational improvements
- Build and improve monitoring and observability platforms such as Grafana, ELK, CloudWatch, Zabbix, and Nagios
- Mentor team members, support technical growth, and drive cross-functional collaboration
- Maintain operational documentation, SOPs, and incident reports
- Participate in and improve on-call and incident response processes
Requirements:
- 8+ years of Linux system administration and large-scale infrastructure experience
- 2+ years of team management or Tech Lead experience
- Hands-on experience operating high-traffic, high-availability cloud platforms in a 24/7 environment
- Strong experience with AWS services, including:
- EC2, API Gateway, AppSync
- VPC, IAM, Networking
- Lambda, Aurora, ElastiCache (Redis)
- CloudFront, CloudWatch, EKS
- Security Services, SNS, Parameter Store, Secrets Manager
- Strong Kubernetes and container infrastructure experience, including EKS administration and troubleshooting
- Experience with Infrastructure as Code and configuration management tools such as Terraform, Helm, and Kustomize
- Experience with CI/CD and GitOps tools such as Jenkins, GitHub Actions, Argo Workflow, and ArgoCD
- Familiar with observability and monitoring tools including Grafana, ELK, Zabbix, and Nagios
- Experience managing distributed systems and related technologies such as MongoDB, Kafka, Load Balancers, and HA architecture
- Strong understanding of SRE / DevOps practices, including Incident Management, Capacity Planning, Disaster Recovery, and SLA/SLO/SLI
- Proficient in scripting or programming languages such as Bash, Python, or Golang
- Knowledge of cloud security, infrastructure security, and technical risk management
- Strong communication, collaboration, and problem-solving skills in fast-paced environments
- Experience in FinTech, Crypto, or high-availability platforms is a plus
- Familiar with compliance and security frameworks such as MAS TRM and ISO 27001 is a plus
Location: Taipei (check it out on Google Maps!)
About XREX
Regarding our culture
Skills Required
- 8+ years of Linux system administration and large-scale infrastructure experience
- 2+ years of team management or Tech Lead experience
- Hands-on experience operating high-traffic, high-availability cloud platforms in a 24/7 environment
- Strong experience with AWS services
- Strong Kubernetes and container infrastructure experience
- Experience with Infrastructure as Code and configuration management tools
- Experience with CI/CD and GitOps tools
- Familiar with observability and monitoring tools
- Strong understanding of SRE/DevOps practices
- Proficient in scripting or programming languages
What We Do
XREX is a blockchain-enabled financial institution working with banks, regulators, and users to redefine banking together. We provide enterprise-grade banking services to small to medium-sized businesses (SMBs) in or dealing with emerging markets, and novice-friendly financial services to individuals worldwide. Founded in 2018 and operating globally under multiple licenses, XREX offers a full suite of services such as digital asset custody, wallet, cross-border payment, fiat-crypto conversion, cryptocurrency exchange, asset management, and fiat currency on-off ramps. Sharing the social responsibility of financial inclusion, XREX leverages blockchain technologies to further financial participation, access, and education
.jpeg)







