Responsibilities:
- Establish an SRE site and help build an effective, inclusive SRE team.
- Provide technical leadership for the local team and work closely with partner team technical leads and cloud leadership.
- Guide other team members on managing the availability and performance of mission-critical services, building automation to prevent problem recurrence, and building automated responses for non-exceptional service conditions.
- Manage execution of project priorities, deadlines, and deliverables.
- Lead incident management during incidents.
- Drive MTTR in line with the incident SLA.
- Ensure 100% alert coverage across application, infrastructure, security, flows, etc.
Skills:
- 6-10 years of experience in one or more of the following: distributed systems, storage systems, or databases; algorithms and data structures; Unix/Linux systems internals (e.g., filesystems, system calls) and administration.
- Experience designing, analyzing, and troubleshooting large-scale distributed systems.
- Experience with MySQL or PostgreSQL databases.
- Hands-on experience operating Kubernetes (k8s) on any cloud platform.
- Excellent communication skills and a sense of ownership, with a systematic problem-solving approach.
What We Do
Founded in 2015, Zeta provides a next-gen credit card processing platform. Zeta's cloud-native, fully API-enabled stack offers a comprehensive range of capabilities, including processing, issuing, lending, core banking, fraud detection, and loyalty programs. With a strong focus on technology, Zeta has 1,700+ employees and contractors, more than 70% of them in technology roles. Operating across the US, UK, Middle East, and Asia, Zeta has served a global customer base of 35+ clients who have issued over 15 million cards on Zeta's platform to date. Backed by prominent investors such as SoftBank Vision Fund 2 and Mastercard, Zeta has raised $280 million at a valuation of $1.5 billion.






