GoKwik

Senior DevOps Engineer

Reposted 4 Days Ago

Be an Early Applicant

Gurugram, Haryana, IND

In-Office

Senior level

eCommerce

The Role

Lead SRE-focused reliability for production systems: run on-call and incident response, debug distributed systems, design self-healing scalable infrastructure, build observability (metrics, logs, traces, SLIs/SLOs, APM), support Terraform/Kubernetes/CI-CD automation, mentor engineers, conduct blameless postmortems, and drive resiliency improvements.

Summary Generated by Built In

About GoKwik

GoKwik is a growth operating system designed to power D2C and eCommerce brands—from checkout optimization and reducing return-to-origin (RTO), to payments, retention, and post-purchase engagement. Today, GoKwik enables 12,000+ merchants worldwide, processes around $2B in GMV, and is strengthening its AI-powered infrastructure. Backed by RTP Global, Z47, Peak XV, and Think Investments, and bolstered by a $13M growth round in June 2025 (total funding: $68M), GoKwik is scaling aggressively across India, the UK, Europe, and the US.

Why This Role Matters

We are looking for an SRE-focused Engineer to join our DevOps team. This role is 80% Site Reliability Engineering and 20% DevOps enablement, with observability, resilience, and incident management at its core. You will lead on-call operations, build world-class observability systems, and drive reliability engineering practices across the organization. Alongside this, you will collaborate on automation and CI/CD improvements to ensure services are built and operated for scale. We are an engineering-first team that continuously invests in tools, tests, processes, and technology. We consider our people our biggest asset and strive to build a culture of continuous learning and growth.

What You’ll Own

Lead SRE practices for reliability, scaling, and performance of production systems
Lead on-call operations and incident response, ensuring fast resolution and minimal customer impact
Perform deep debugging of production issues across infrastructure, services, and databases
Design and automate self-healing, scalable infrastructure
Architect and implement advanced observability (metrics, logs, traces, SLIs/SLOs, APM) to detect, debug, and prevent outages
Support CI/CD and infrastructure automation (Terraform, Kubernetes, pipelines) as part of DevOps responsibilities (20%)
Implement and mature observability practices including SLIs/SLOs, distributed tracing, and APM
Mentor junior engineers in incident management and DevOps best practices
Partner with engineering teams on resilient architecture reviews and reliability improvements
Drive adoption of new tools and best practices to enhance infrastructure reliability
Conduct blameless postmortems, improve incident playbooks, and build a strong prevention culture

Who You Are

5–8 years of experience in SRE / Production Engineering, with some DevOps exposure
Strong expertise in incident management, debugging distributed systems, and on-call operations
Strong background in observability platforms such as Prometheus, Grafana, Datadog, OpenTelemetry, or similar
Deep knowledge of cloud infrastructure (AWS/GCP) including networking, scaling, and HA/DR setups
Hands-on experience with Kubernetes, Terraform, and CI/CD pipelines
Experience with incident frameworks, blameless postmortems, chaos engineering, and resiliency testing
Ability to balance short-term firefighting with long-term reliability engineering
Strong scripting skills (Shell, Python, or Go preferred)

Why GoKwik

At GoKwik, we aren’t just building tools—we’re rewriting the playbook for eCommerce in India. We exist to solve some of the most complex challenges faced by digital-first brands: low conversion rates, high RTO, and poor post-purchase experience. Our checkout and conversion stack powers 500+ leading D2C brands and marketplaces—and we’re just getting started.

Skills Required

5-8 years of experience in SRE / Production Engineering
Strong expertise in incident management, debugging distributed systems, and on-call operations
Experience with observability platforms such as Prometheus, Grafana, Datadog, OpenTelemetry
Deep knowledge of cloud infrastructure (AWS or GCP) including networking, scaling, and HA/DR
Hands-on experience with Kubernetes, Terraform, and CI/CD pipelines
Experience with incident frameworks, blameless postmortems, chaos engineering, and resiliency testing
Strong scripting skills (Shell, Python, or Go)

View all jobs at GoKwik

View GoKwik Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

New Delhi, Delhi

265 Employees

Year Founded: 2020

What We Do

GoKwik is a data & technology led enabler, building a full-stack solution suite for eCommerce and D2C brands to help them unlock business growth. Embarked on a mission to democratise the shopping experience, GoKwik enables eCommerce brands to deliver superlative customer experience across the shopping funnel thereby boosting conversion rates and revenue growth. It also solves for other critical pain points of the industry such as COD RTO (Return to Origin) and helps brands manage the RTO problem while offering COD as a payment channel. With its recent addition of a third product: KwikChat, GoKwik is solving for low ROIs on marketing campaigns through 30+ Whatsapp use cases such as abandoned cart recovery, click to whatsapp ad campaigns & headless checkout. 1 in 3 shoppers is already shopping on the GoKwik network that has helped 500+ brands scale their businesses with higher GMV realisation & profit margins. It is helmed by Chirag Taneja (Co-Founder and Chief Executive Officer), Vivek Bajpai (Co-Founder and Chief Technology Officer), and Ankush Talwar (Co-Founder and Chief Data Scientist). GoKwik is backed by investors such as Sequoia Capital, Matrix Partners India, RTP Global & Think Investments. GoKwik's team has deep knowledge in the space of eCommerce with people having previous experience in Flipkart, Razorpay, Swiggy, Myntra, Nykaa, and more.