Senior Software Engineer - SRE Focused

Posted 5 Days Ago
Be an Early Applicant
Bengaluru, Bengaluru Urban, Karnataka
In-Office
300K-300K Annually
Senior level
Artificial Intelligence • Fintech • Software • Financial Services
The Role
As a Senior Software Engineer focused on SRE, you'll lead production incident responses, debug and fix issues, and drive reliability improvements across services, ensuring platform integrity and performance.
Summary Generated by Built In

We are a B2B WealthTech startup based in Abu Dhabi and backed by BNY Mellon (America’s oldest bank and first company to list on NYSE) and Lunate (a new $50B AUM alternative asset management firm based in Abu Dhabi, UAE). The company has raised $300M to build a state of the art wealth technology platform.

Our mission is to power and grow our clients’ Wealth franchises through differentiated experiences, financial solutions, and insights. Our digital wealth management platform- will enable banks and other financial institutions in the Middle East to grow and further penetrate affluent, HNW and UHNW investor segments.

While still leveraging the capabilities and knowledge of large organizations, our fintech is a startup with truly cross-functional and agile teams.

For more information, please visit www.alpheya.com

Role

We're building a team that owns production incident response, deep debugging, and permanent fixes across application, data, and deployment layers.

This is not a tickets-only ops role. You will write code, ship fixes safely, and harden the platform so issues don't repeat.

Note: This is a software engineering role with real production ownership. You’ll combine engineering and operations to own outcomes end-to-end: investigate incidents, ship code fixes, and prevent repeat issues through tests, observability, and hardening.

  • Lead and execute production incident response: triage, mitigation, stakeholder communication, and coordination across teams
  • Debug and fix issues across Go services (mandatory) and the broader stack (Node.js services where relevant)
  • Work across service boundaries: GraphQL/RPC, distributed tracing, dependency failures, performance bottlenecks, and safe degradation patterns
  • Troubleshoot Kubernetes workloads and deployments
  • Diagnose PostgreSQL/CNPG issues
  • Handle production bugs that span application + data pipelines (ETL/Snowflake mappings), including backfills/replays and data-quality validation
  • Build prevention: add regression tests, improve observability , and maintain runbooks/service passports
  • Drive reliability improvements: SLOs/SLIs, alert quality, release readiness checks, and operational standards across teams

Requirements
  • 7+ years in SRE / Production Engineering / Platform Engineering (reliability-focused)
  • Strong Go (mandatory): ability to read, debug, and ship production fixes in Go codebases
  • Proven experience debugging distributed systems in production (latency, error rates, timeouts, retries, cascading failures)
  • Strong hands-on experience with Kubernetes in production environments
  • Experience with Helm and GitOps workflows (FluxCD preferred; ArgoCD acceptable)
  • Solid PostgreSQL troubleshooting experience (performance, incident patterns, migrations)
  • Observability experience (metrics/logging/tracing; Datadog/Grafana/Tempo/Loki experience is a plus)
  • Strong incident leadership: calm under pressure, clear communication, structured problem-solving
  • Engineering hygiene: PR discipline, reviews, testing mindset, safe rollouts/rollbacks
  • Comfortable with IAM/security fundamentals in real production systems: OAuth2/OIDC basics, RBAC/least privilege, and safe secrets handling

Good to Have

  • Node.js backend experience in production
  • Experience in FinTech / regulated environments / high-availability systems (auditability, change control, incident rigor)
  • Data reliability experience: ETL monitoring, reconciliation, Snowflake operations, schema/mapping drift handling
  • Reliability patterns common to trading/fintech platforms: correctness and data integrity mindset (idempotency, reconciliation), resilient partner integrations, and strong observability for critical user journeys

Top Skills

Datadog
Gitops
Go
Grafana
Helm
Kubernetes
Loki
Postgres
Tempo
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Abu Dhabi
31 Employees
Year Founded: 2023

What We Do

Alpheya offers a full suite of cloud-native, AI-powered wealthtech solutions designed to strip away the complexities of wealth management and deliver differentiated experiences, insights, and new opportunities for your business.

Similar Jobs

CrowdStrike Logo CrowdStrike

Senior Software Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
KA, IND
10000 Employees

PayPal Logo PayPal

Manager, Data Science

Fintech • Payments
In-Office or Remote
2 Locations
34450 Employees

Kong Logo Kong

Engineering Manager

Artificial Intelligence • Cloud • Information Technology • Software • Big Data Analytics
In-Office
2 Locations
800 Employees

Unisys Logo Unisys

Solutions Architect

Information Technology
In-Office or Remote
10 Locations
22588 Employees

Similar Companies Hiring

Granted Thumbnail
Insurance • Healthtech • Financial Services • Artificial Intelligence
New York, New York
23 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account