SRE Manager

Posted 25 Days Ago
Be an Early Applicant
Taipei City, TWN
In-Office
Senior level
Blockchain • Fintech • Financial Services
The Role
Lead and manage the SRE team, ensuring high availability and reliability of production systems, while optimizing AWS infrastructure and implementing SRE best practices.
Summary Generated by Built In
About 

Want to build a worldwide brand from Taiwan, and to communicate our brand story to millions of users worldwide?

Want to be based in Taiwan but work in a silicon-valley-like environment, and to build world-class brand and products?

Want to participate in the global fintech and blockchain movement, and work at an English-speaking workplace?

Come change the world with us! Join this fast-growing startup founded by software veterans and funded by top VCs, Skype co-founders, and the Taiwanese government (NDF)!

We’re hiring for an experienced SRE Manager. The exact mix of other skills does not matter, so long as your tool chest includes a mix of abilities. Be willing to attack anything that comes your way, learn on the fly and get things done. 

Come talk to us if you want to push your skillset in a dynamic fast-paced environment.

Responsibilities:

  • Lead and manage the SRE team to ensure high availability, scalability, and reliability of production systems
  • Own AWS cloud infrastructure operations, monitoring, security, resource management, and cost optimization in a 24/7 environment
  • Lead incident management, troubleshooting, RCA, and post-incident improvements
  • Ensure infrastructure, cloud environments, and operational processes comply with security, audit, and regulatory requirements (e.g. MAS TRM, ISO 27001)
  • Drive SRE best practices including observability, alerting, SLA/SLO/SLI, capacity planning, disaster recovery, and high availability
  • Improve system performance, reliability, and operational efficiency through automation and architecture optimization
  • Build and maintain CI/CD, IaC, and GitOps workflows to improve deployment efficiency and system consistency
  • Manage Kubernetes / EKS platforms and containerized infrastructure
  • Collaborate closely with Backend, Data, Security, and Product teams on architecture design and operational improvements
  • Build and improve monitoring and observability platforms such as Grafana, ELK, CloudWatch, Zabbix, and Nagios
  • Mentor team members, support technical growth, and drive cross-functional collaboration
  • Maintain operational documentation, SOPs, and incident reports
  • Participate in and improve on-call and incident response processes

Requirements:

  • 8+ years of Linux system administration and large-scale infrastructure experience
  • 2+ years of team management or Tech Lead experience
  • Hands-on experience operating high-traffic, high-availability cloud platforms in a 24/7 environment
  • Strong experience with AWS services, including:
    • EC2, API Gateway, AppSync
    • VPC, IAM, Networking
    • Lambda, Aurora, ElastiCache (Redis)
    • CloudFront, CloudWatch, EKS
    • Security Services, SNS, Parameter Store, Secrets Manager
  • Strong Kubernetes and container infrastructure experience, including EKS administration and troubleshooting
  • Experience with Infrastructure as Code and configuration management tools such as Terraform, Helm, and Kustomize
  • Experience with CI/CD and GitOps tools such as Jenkins, GitHub Actions, Argo Workflow, and ArgoCD
  • Familiar with observability and monitoring tools including Grafana, ELK, Zabbix, and Nagios
  • Experience managing distributed systems and related technologies such as MongoDB, Kafka, Load Balancers, and HA architecture
  • Strong understanding of SRE / DevOps practices, including Incident Management, Capacity Planning, Disaster Recovery, and SLA/SLO/SLI
  • Proficient in scripting or programming languages such as Bash, Python, or Golang
  • Knowledge of cloud security, infrastructure security, and technical risk management
  • Strong communication, collaboration, and problem-solving skills in fast-paced environments
  • Experience in FinTech, Crypto, or high-availability platforms is a plus
  • Familiar with compliance and security frameworks such as MAS TRM and ISO 27001 is a plus

    Location: Taipei (check it out on Google Maps!)

    About XREX

    Regarding our culture

    Skills Required

    • 8+ years of Linux system administration and large-scale infrastructure experience
    • 2+ years of team management or Tech Lead experience
    • Hands-on experience operating high-traffic, high-availability cloud platforms in a 24/7 environment
    • Strong experience with AWS services
    • Strong Kubernetes and container infrastructure experience
    • Experience with Infrastructure as Code and configuration management tools
    • Experience with CI/CD and GitOps tools
    • Familiar with observability and monitoring tools
    • Strong understanding of SRE/DevOps practices
    • Proficient in scripting or programming languages
    Am I A Good Fit?
    beta
    Get Personalized Job Insights.
    Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

    The Company
    Taipei, Songshan Dist
    88 Employees
    Year Founded: 2018

    What We Do

    XREX is a blockchain-enabled financial institution working with banks, regulators, and users to redefine banking together. We provide enterprise-grade banking services to small to medium-sized businesses (SMBs) in or dealing with emerging markets, and novice-friendly financial services to individuals worldwide. Founded in 2018 and operating globally under multiple licenses, XREX offers a full suite of services such as digital asset custody, wallet, cross-border payment, fiat-crypto conversion, cryptocurrency exchange, asset management, and fiat currency on-off ramps. Sharing the social responsibility of financial inclusion, XREX leverages blockchain technologies to further financial participation, access, and education

    Similar Jobs

    Micron Technology Logo Micron Technology

    Intern - GRC, EIS

    Artificial Intelligence • Hardware • Information Technology • Machine Learning
    In-Office
    Taipei City, TWN
    45000 Employees

    Ericsson Logo Ericsson

    Network Engineer

    Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
    In-Office
    Taipei City, TWN
    88000 Employees

    Tapestry - Coach and Kate Spade Logo Tapestry - Coach and Kate Spade

    Sales Associate

    eCommerce • Fashion • Retail • Sales • Wearables • Design
    Hybrid
    Taipei City, TWN
    16000 Employees

    Tapestry - Coach and Kate Spade Logo Tapestry - Coach and Kate Spade

    Sales Associate

    eCommerce • Fashion • Retail • Sales • Wearables • Design
    Hybrid
    Xin Yi Qu, Taipei City, TWN
    16000 Employees

    Similar Companies Hiring

    Hanover Park Thumbnail
    Artificial Intelligence • Fintech • Software • Financial Services
    New York, New York
    42 Employees
    Kepler  Thumbnail
    Fintech • Software
    New York, New York
    6 Employees
    Onshore Thumbnail
    Artificial Intelligence • Fintech • Software • Financial Services
    New York, New York
    60 Employees

    Sign up now Access later

    Create Free Account

    Please log in or sign up to report this job.

    Create Free Account