Lead Site Reliability Engineer

Posted Yesterday
Bengaluru, Bengaluru Urban, Karnataka
In-Office
Senior level
HR Tech • Information Technology • Software
The Role
Lead and scale observability, incident response, and reliability practices. Build monitoring, logging, tracing, SLIs/SLOs, run chaos engineering, lead postmortems, mentor SREs, and collaborate across teams to improve uptime and performance.
About us

The global hiring revolution is shaping a future where talent can thrive everywhere, driving innovation and progress on a global scale.

Multiplier is at the forefront of this change. By removing barriers and simplifying global hiring, we’re creating a level playing field where businesses and individuals (like you) can compete, grow, and succeed, regardless of geography.

Multiplier empowers companies to hire, onboard, manage, and pay talent in 150+ countries, quickly and compliantly. Our mission is to build a world without limits, where ambitious businesses can look beyond borders to build their global dream teams. Our unified employment platform, complete with world-class EOR, AOR and Global Payroll products, means it has never been easier to seize the global hiring opportunity.

We’re backed by some of the best in the business (Sequoia, DST, and Tiger Global), led by industry-leading experts, scaling fast, and seeking brilliant, like-minded enthusiasts to join our team. The future is borderless. Let’s build it together.

About the Role

Multiplier is seeking a highly skilled Lead Site Reliability Engineer (SRE) to join our engineering organization. This role is critical to scaling and hardening our infrastructure and ensuring the availability, reliability, and performance of our systems. The ideal candidate is an experienced engineer with a strong programming background, hands-on experience with modern observability stacks, and a deep understanding of incident management and system reliability practices.

What You Will Do / Key Responsibilities
  • Design, build, and evolve our observability and telemetry stack using tools such as Sentry, ELK, Coralogix, New Relic, Squadcast, and APM platforms

  • Implement and maintain logging, monitoring, alerting, and tracing infrastructure across services

  • Lead efforts in incident response, including coordination, resolution, and root cause analysis (RCA)

  • Define, monitor, and maintain SLIs, SLOs, and SLAs to ensure service reliability and performance

  • Drive chaos engineering practices to proactively uncover system weaknesses and improve resilience

  • Conduct and lead postmortems and reliability reviews, focusing on continuous learning and improvement

  • Build proactive monitoring solutions to detect and remediate potential issues before they impact customers

  • Collaborate with engineering, security, and IT teams to ensure end-to-end system reliability

  • Mentor junior SREs / CREs and contribute to defining best practices across the organization

Required Qualifications
  • 8+ years of experience in SRE, Production Engineering, or backend engineering roles

  • Must have: proficiency in at least one modern programming language (e.g., Go, Python, Java)

  • Deep understanding of observability principles (Logging → Metrics → Traces) and hands-on experience with Sentry, ELK, Coralogix, New Relic, or equivalent tools

  • Experience designing and operationalizing SLIs/SLOs/SLAs

  • Strong knowledge of incident management frameworks and experience leading high-severity incident response

  • Experience conducting post-incident reviews and driving reliability improvements

  • Familiarity with chaos engineering tools and practices (e.g., Gremlin, Chaos Mesh, Chaos Monkey)

  • Proven track record of improving system uptime, reliability, and performance

Preferred Qualifications
  • Experience with containerization / ECS, cloud-native infrastructure (AWS), and service mesh technologies. Kubernetes experience is a plus.

  • Prior experience in a high-scale production environment

  • Certifications in SRE, DevOps, or cloud platforms (e.g., AWS Certified DevOps Engineer)

What We Offer:
  • High-impact position with the chance to play a key role in a rapidly growing company.

  • Full autonomy in your role, with the flexibility to work in a hybrid environment.

  • Work with a passionate, energetic, and diverse team.

  • Competitive benefits, recognition programs, and career development opportunities.

  • Attractive ESOPs, giving you a stake in the company’s success.

  • Comprehensive health insurance coverage for you and your family’s well-being.

  • Generous holiday policy.

  • A company that genuinely invests in your professional success.

Equal Employment Opportunity: Multiplier is an equal opportunity employer. We value diversity. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Top Skills

APM Platforms
AWS
Chaos Mesh
Chaos Monkey
Containerization
Coralogix
ECS
ELK
Go
Gremlin
Java
Kubernetes
New Relic
Python
Sentry
Service Mesh
Squadcast

The Company
HQ: New York, New York
563 Employees
Year Founded: 2020

What We Do

Multiplier is a leading global employment platform that makes it easy for companies to employ teams internationally. Its proprietary technology simplifies the employment process by managing the complexities of local compliance, labor contracts, payroll, benefits, and taxes.
We enable companies to manage their distributed teams via a simple dashboard while taking responsibility for local labor law compliance on their behalf. We are passionate about creating a world where people can get a job they love, without having to leave the people they love.
