Senior Software Engineer, Reliability

Reposted 23 Days Ago
Bangalore, Bengaluru Urban, Karnataka, IND
In-Office
Senior level
Cloud • Security • Software • Cybersecurity
The Role
The Senior Software Engineer, Reliability will lead SRE initiatives, mentor teams, design reliable systems, enhance observability, and ensure operational excellence across Veeam's platform.
Summary Generated by Built In

Veeam is the Data and AI Trust Company, specializing in helping organizations ensure their data and AI are fully understood, secured, and resilient to enable the acceleration of safe AI at scale. As the market leader in both data resilience and data security posture management, Veeam is built for the convergence of identity, data, security, and AI risk. Headquartered in Seattle with offices in more than 30 countries, Veeam protects over 550,000 customers worldwide, who trust Veeam to keep their businesses running. Join us as we go fearlessly forward together, growing, learning, and making a real impact for some of the world’s biggest brands.

As a Senior Software Engineer, Reliability, you will serve as a hands-on technical leader within the SRE team, guiding senior engineers, influencing product development teams, and ensuring the systems we operate are built to be reliable, scalable, and observable from the ground up.

You will drive strategic initiatives, mentor others in the practice of SRE, and help define architectural best practices across our platform. This role is pivotal in aligning teams, enforcing high standards, and scaling SRE principles globally within Veeam.

Your tasks will include:

Reliability Engineering & Resilience

  • Design and evolve infrastructure to be highly available, fault tolerant, and scalable across public clouds (initially Azure, with future expansion plans to other providers).

  • Establish and maintain SLIs, SLOs, and error budgets that define and enforce reliability objectives.

  • Lead incident response, analysis, blameless postmortems, and sharing sessions to maximize learning across our entire engineering team and drive changes to the entire socio-technical engineering system.

Observability & Operational Excellence

  • Drive adoption of deep observability practices, ensuring telemetry, logs, metrics, and tracing are comprehensive and actionable.

  • Develop automation and self-healing tools to reduce toil and support Veeam’s fleet management strategy.

  • Participate in on-call rotations and lead operational excellence across the stack.

Engineering at Scale

  • Contribute to infrastructure as code (IaC), CI/CD systems, deployment automation, and scalable config management.

  • Integrate and extend monitoring and chaos engineering tools to validate reliability assumptions under load and failure conditions.

  • Implement testing strategies, canary deployments, and release validation pipelines to protect production environments and allow teams to safely deliver new features as quickly as possible.

Collaboration & Culture

  • Embed within product and platform teams to champion reliability from design through delivery.

  • Contribute to a learning culture focused on continuous improvement and proactive risk management.

  • Mentor engineers and advocate for DevOps/SRE best practices across global teams.

What we expect from you:

  • 5+ years of hands-on experience in a Software Engineering role with at least 2 years in Site Reliability, Platform Engineering, or similar.

  • Deep experience building systems on public cloud providers (Azure preferred).

  • Strong programming skills in JS, Node, TypeScript, Go, Java, C#, or similar.

  • Proven track record in delivering monitoring, alerting, and observability tooling (e.g., Prometheus, Grafana, OpenTelemetry).

  • Experience with IaC tools like Terraform/Pulumi, and container orchestration (e.g., Kubernetes).

  • Solid understanding of distributed systems, cloud networking, and cloud-native system design.

  • Excellent communication and collaboration skills across geographies and disciplines.

Will be an added advantage:

  • Experience working on large-scale B2B SaaS platforms.

  • Background in chaos engineering, resilience testing, performance testing, load testing, or incident learning programs.

  • Familiarity with compliance frameworks (e.g., ISO, SOC 2, GDPR, FedRAMP/CMMC).

We offer:
  • 18 paid vacation days, plus 4 extra global VeeaMe Days for self-care and 24 paid volunteer hours annually through Veeam Cares
  • Private medical coverage for you and up to four dependents
  • Life, accident, and disability insurance with enhanced coverage
  • Annual flexible wellbeing allowance for physical and mental wellness
  • Free confidential counselling and coaching via Employee Assistance Program (EAP), including legal and financial advice
  • Meal, fuel, and transportation benefits based on work arrangement
  • Daycare reimbursement and safe cab facility for eligible employees
  • Opportunities to learn and grow through on-demand libraries (LinkedIn Learning, O’Reilly), mentoring, workshops, and learning events like our annual Global Day of Learning

Please note: If the applicant is permanently located outside India, Veeam reserves the right to decline the application.


#Hybrid

Veeam Software is an equal opportunity employer and does not tolerate discrimination in any form on the basis of race, color, religion, gender, age, national origin, citizenship, disability, veteran status or any other classification protected by federal, state or local law. All your information will be kept confidential.

Personal data collected during the recruitment process will be processed in accordance with our Recruiting Privacy Notice, which explains how your information is collected, used, and handled in connection with hiring activities. By applying for this position, you consent to this processing. 

By submitting your application, you confirm that the information provided, including any supporting documents, is complete and accurate to the best of your knowledge. Any misrepresentation, omission, or falsification may result in disqualification from consideration or, if discovered after employment begins, termination of employment.

Skills Required

  • 5+ years of hands-on experience in a Software Engineering role
  • At least 2 years in Site Reliability, Platform Engineering, or similar
  • Deep experience building systems on public cloud providers
  • Strong programming skills in JS, Node, TypeScript, Go, Java, or C#
  • Proven track record in delivering monitoring and observability tooling
  • Experience with IaC tools like Terraform/Pulumi
  • Solid understanding of distributed systems and cloud networking
  • Excellent communication and collaboration skills

Veeam Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Veeam and has not been reviewed or approved by Veeam.

  • Healthcare Strength: Healthcare coverage is comprehensive with options that include employee-only no-cost tiers, plus mental-health support through an assistance program. Feedback suggests these offerings compare well in tech.
  • Leave & Time Off Breadth: Time off includes unlimited PTO in the U.S., paid company holidays, quarterly company-wide recharge days, and paid volunteer time. Feedback suggests team norms influence how fully this flexibility is utilized.
  • Strong & Reliable Incentives: Sales and pre-sales roles feature meaningful on-target earnings with competitive base and variable structures. Feedback suggests these plans provide strong upside for high performers.

The Company
Alpharetta, GA
4,172 Employees
Year Founded: 2006

What We Do

Veeam provides a single platform for modernizing backup, accelerating hybrid cloud and securing data. Veeam has 400,000+ customers worldwide, including 82% of the Fortune 500 and 69% of the Global 2,000. Veeam’s 100% channel ecosystem includes global partners, as well as HPE, NetApp, Cisco and Lenovo as exclusive resellers, and boasts more than 35K transacting partners worldwide.
