GOV Site Reliability Engineer

Posted Yesterday
Hiring Remotely in United States
Remote
152K-253K Annually
Mid level
Cloud • Security • Software • Cybersecurity
The Role
Join the GOV/Sovereign Cloud SRE team to maintain and improve reliability for the Veeam Data Cloud. Responsibilities include incident response, SLIs/SLOs, observability (monitoring, alerting, dashboards), runbooks and documentation, IaC and CI/CD work in compliance-restricted environments, and participation in on-call rotations. Collaborate with engineering, security, and compliance teams to implement high availability and automation.
Summary Generated by Built In

Veeam is the Data and AI Trust Company, specializing in helping organizations ensure their data and AI are fully understood, secured, and resilient to enable the acceleration of safe AI at scale. As the market leader in both data resilience and data security posture management, Veeam is built for the convergence of identity, data, security, and AI risk. Headquartered in Seattle with offices in more than 30 countries, Veeam protects over 550,000 customers worldwide, who trust Veeam to keep their businesses running. Join us as we go fearlessly forward together, growing, learning, and making a real impact for some of the world’s biggest brands.

Site Reliability Engineer — Government & Sovereign Cloud

Veeam is building a global SRE function to support the Veeam Data Cloud, our SaaS platform. This role is part of the team supporting our Government and Sovereign Cloud environment.

Success here requires a self-starter mindset — you'll need to be comfortable building your own context and tracking down information across a large, distributed engineering organization.

You'll work alongside senior engineers to execute on reliability work, close observability gaps, respond to incidents, and help maintain the operational foundation the team runs on.

What You Will DoDiscovery & Documentation
  • Get up to speed on VDC workloads, dependencies, and operational workflows by reading code, docs, and working with SMEs.

  • Write and maintain runbooks, incident guides, and operational documentation.

  • Support knowledge transfer and contribute to onboarding materials for the team.

Reliability & Incident Response
  • Participate in incident response including triage, investigation, mitigation, and postmortems.

  • Help implement and maintain SLIs, SLOs, and error budgets defined by the team.

  • Identify reliability issues during incidents or reviews and propose concrete improvements.

  • Support high availability and fault tolerance work on Azure, including Azure Government.

Observability
  • Close monitoring gaps by implementing instrumentation, alerting, and dashboards based on team standards.

  • Contribute to toil reduction through automation and tooling improvements.

  • Participate in on-call rotations.

Infrastructure & Delivery
  • Work with IaC, CI/CD pipelines, and deployment tooling in compliance-restricted environments.

  • Support testing, canary deployments, and release validation workflows.

  • Implement changes to infrastructure and configuration following established patterns and review processes.

Collaboration
  • Work with engineering, security, compliance, and operations teams to execute on reliability improvements.

  • Communicate clearly about system behavior, risk, and status — in writing and in meetings.

  • Raise blockers and gaps proactively; don't wait for problems to escalate.

What We Are Looking ForRequired
  • 3+ years in Software Engineering, with at least 1 year in SRE, Platform Engineering, or DevOps working on cloud-hosted services.

  • Experience with cloud infrastructure on Azure or a comparable cloud provider.

  • Familiarity with regulated or compliance-oriented environments such as government (FedRAMP, CMMC), financial (PCI-DSS), or healthcare (HIPAA). You understand that compliance shapes what you can and can't do operationally.

  • Able to read and understand code well enough to investigate system behavior without always having someone walk you through it.

  • Experience with monitoring and observability tools (e.g., Prometheus, Grafana, OpenTelemetry, ELK stack).

  • Experience with IaC tools (Terraform, Terragrunt, or Pulumi) and container orchestration (Kubernetes).

  • Experience with CI/CD tooling such as GitHub Actions, Azure DevOps, GitLab CI, or ArgoCD.

  • Strong programming skills in one or more of: TypeScript/JS, Go, Java, C#, or similar.

  • Solid understanding of distributed systems fundamentals and networking basics.

  • Clear written and verbal communication skills.

Preferred
  • Experience in Government or Sovereign Cloud environments (e.g., Azure Government, AWS GovCloud).

  • Background in SaaS platforms or multi-tenant systems.

  • Familiarity with chaos engineering, resilience testing, or load testing.

  • Exposure to building or improving reliability practices on a team.

  • Familiar with AI-first development workflows using LLM-powered tools for automation, code generation, or documentation.

Why Join?
  • Work on a high-impact reliability practice for a growing GOV/Sovereign Cloud platform.

  • Learn from senior engineers while owning real work end-to-end.

  • Collaborate with strong teams across product, cloud engineering, security, and compliance.

  • Professional development resources including mentorship, training, and volunteer days.

  • Competitive compensation and benefits.

What you'll get

  • Unlimited paid time off, 12 paid holidays including 4 global VeeaMe Days for self-care and 24 paid volunteer hours annually through Veeam Cares
  • Paid parental leave: 8 weeks for all parents, 16 weeks for birthing parents
  • Medical, dental, and vision coverage starting on your first day
  • Mental health support, therapy sessions, and digital wellness tools via our Employee Assistance Program
  • 401(k) retirement plan with company matching contributions
  • Fertility, adoption, and surrogacy support through Maven, plus paid volunteer time
  • AirVet: 24/7 virtual veterinary care at no cost
  • Legal services, identity protection, and supplemental health insurance options
  • Tax-advantaged spending accounts for healthcare, dependent care, and commuting
  • Opportunities to learn and grow through on-demand libraries (LinkedIn Learning, O’Reilly), mentoring, workshops, and learning events like our annual Global Day of Learning

Compensation Transparency

Veeam is committed to pay transparency and equitable compensation. For this role, the compensation range below reflects the expected total target compensation (TTC), inclusive of base pay and a competitive performance-based bonus. For roles with a commission plan, the compensation range represents On Target Earnings (OTE), which includes base salary plus variable commission. When determining compensation, Veeam takes into consideration factors such as experience, education, skills, and geographic zone. Offers are typically made below the midpoint of the range.

In addition to compensation, Veeam provides a comprehensive benefits package, including health coverage, retirement plans, and unlimited time off.

U.S. Geographic Zones & Compensation Ranges (TTC / OTE)
Zone 1: San Francisco Bay Area, New York City Boroughs
$151,500$252,500 USD
Zone 2: Washington, California (excluding San Francisco Bay Area)
$138,900$231,400 USD
Zone 3: Texas, Illinois, North Carolina, Colorado, Massachusetts, Pennsylvania, Virginia, Oregon, Nevada, Hawaii, New York (excluding NYC boroughs); Sales roles located in Georgia, Ohio, and Arizona
$126,300$210,400 USD
Zone 4: All other US locations
$109,800$183,000 USD

Veeam Software is an equal opportunity employer and does not tolerate discrimination in any form on the basis of race, color, religion, gender, age, national origin, citizenship, disability, veteran status or any other classification protected by federal, state or local law. All your information will be kept confidential.

Personal data collected during the recruitment process will be processed in accordance with our Recruiting Privacy Notice, which explains how your information is collected, used, and handled in connection with hiring activities. By applying for this position, you consent to this processing. 

By submitting your application, you confirm that the information provided, including any supporting documents, is complete and accurate to the best of your knowledge. Any misrepresentation, omission, or falsification may result in disqualification from consideration or, if discovered after employment begins, termination of employment.

Skills Required

  • 3+ years in Software Engineering with at least 1 year in SRE, Platform Engineering, or DevOps working on cloud-hosted services.
  • Experience with cloud infrastructure on Azure or a comparable cloud provider.
  • Familiarity with regulated or compliance-oriented environments (FedRAMP, CMMC, PCI-DSS, HIPAA).
  • Ability to read and understand code to investigate system behavior.
  • Experience with monitoring and observability tools (Prometheus, Grafana, OpenTelemetry, ELK stack).
  • Experience with IaC tools (Terraform, Terragrunt, or Pulumi) and container orchestration (Kubernetes).
  • Experience with CI/CD tooling such as GitHub Actions, Azure DevOps, GitLab CI, or ArgoCD.
  • Strong programming skills in one or more of: TypeScript/JavaScript, Go, Java, C#.
  • Solid understanding of distributed systems fundamentals and networking basics.
  • Clear written and verbal communication skills.
  • Experience in Government or Sovereign Cloud environments (e.g., Azure Government, AWS GovCloud).
  • Background in SaaS platforms or multi-tenant systems.
  • Familiarity with chaos engineering, resilience testing, or load testing.
  • Exposure to building or improving reliability practices on a team.
  • Familiar with AI-first development workflows using LLM-powered tools for automation or code generation.

Veeam Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Veeam and has not been reviewed or approved by Veeam.

  • Healthcare Strength Healthcare coverage is comprehensive with options that include employee-only no-cost tiers, plus mental-health support through an assistance program. Feedback suggests these offerings compare well in tech.
  • Leave & Time Off Breadth Time off includes unlimited PTO in the U.S., paid company holidays, quarterly company-wide recharge days, and paid volunteer time. Feedback suggests team norms influence how fully this flexibility is utilized.
  • Strong & Reliable Incentives Sales and pre-sales roles feature meaningful on-target earnings with competitive base and variable structures. Feedback suggests these plans provide strong upside for high performers.

Veeam Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Alpharetta, GA
4,172 Employees
Year Founded: 2006

What We Do

Veeam provides a single platform for modernizing backup, accelerating hybrid cloud and securing data. Veeam has 400,000+ customers worldwide, including 82% of the Fortune 500 and 69% of the Global 2,000. Veeam’s 100% channel ecosystem includes global partners, as well as HPE, NetApp, Cisco and Lenovo as exclusive resellers, and boasts more than 35K transacting partners worldwide.

Similar Jobs

Easy Apply
Remote
United States
55 Employees
110K-150K Annually

Zeta Global Logo Zeta Global

Associate Vice President, Paid Search - EDU

AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
Easy Apply
Remote or Hybrid
United States
2429 Employees
140K-160K Annually

UL Solutions Logo UL Solutions

Senior Project Lead, Energy Storage Advisory

Automotive • Professional Services • Software • Consulting • Energy • Chemical • Renewable Energy
Remote or Hybrid
Chicago, IL, USA
15000 Employees
60K-170K Annually

Wipfli Logo Wipfli

Audit Manager, Health Care Industry

Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
Remote or Hybrid
Irvine, CA, USA
3000 Employees
97K-131K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account