Senior Software Engineer - Reliability Engineering (Remote)

Posted 9 Days Ago
Hiring Remotely in Georgia, USA
Remote
90K-180K Annually
Mid level
Retail
The Role
The Senior Software Engineer for Site Reliability enhances product reliability through automation, proactive monitoring, and collaboration with teams. They lead post-mortems, mentor junior engineers, and drive improvements in scalability and performance while utilizing AI and automation tools.
Summary Generated by Built In

With a career at The Home Depot, you can be yourself and also be part of something bigger.

Position Purpose:

The Senior Software Engineer for Site Reliability drives the platform's stability, scalability, and performance. This role enhances product reliability by engineering automated solutions for complex infrastructure and operational challenges, including leveraging AI-assisted tooling and prompt engineering to accelerate incident diagnosis, automate remediation workflows, and generate actionable insights from operational data. Key responsibilities include championing application availability and efficiency through proactive monitoring, performance tuning, and strategic improvements. The engineer will lead post-mortems, create automation to reduce operational toil—applying AI agents and large language models where they deliver measurable efficiency gains—and partner with product owners and developers to enable the deployment of reliable, high-performing services. This position participates in tool selection, assists with capacity planning, and builds the monitoring and alerting to meet business-defined Service Level Objectives (SLOs). Within a collaborative team, the role also involves mentoring less experienced engineers to foster a culture of operational excellence and practical AI fluency.

Key Responsibilities:

  • 50% Delivery and Execution - Develops, tests, deploys, and maintains software, with a clear understanding of the value the software is to provide; Takes on new opportunities and tough challenges with a sense of urgency, high energy and enthusiasm; Consistently achieves results, even under tough circumstances; Develops test suites (functional, destructive, etc.) to enable successful, rapid deployment of code to production; Takes a broad view when approaching issues, using a global lens
  • 20% Learns and Grows - Learns through successful and failed experiments when tackling new problems; Actively seeks ways to grow and be challenged using both formal and informal development channels
  • 20% Plans and Aligns - Collaborates with other team members in agile processes; Creates new and better ways for the organization to be successful; Works with the Product Team to ensure user stories are valuable, developer ready, easy to understand and testable; Delivers multi-mode communications that convey a clear understanding of the unique needs of different audiences; Adapts approach and demeanor in real time to match the shifting demands of different situations; Relates openly and comfortably with diverse groups of people
  • 10% Supports and Enables - Helps grow junior engineers by providing guidance on modern software development frameworks and leading technical discussions

Direct Manager/Direct Reports:

  • This position typically reports to Software Engineer Manager or Sr. Manager
  • This position has 0 Direct Reports

Travel Requirements:

  • No travel required.

Physical Requirements:

  • Most of the time is spent sitting in a comfortable position and there is frequent opportunity to move about. On rare occasions there may be a need to move or lift light articles.

Working Conditions:

  • Located in a comfortable indoor area. Any unpleasant conditions would be infrequent and not objectionable.

Minimum Qualifications:

  • Must be eighteen years of age or older.
  • Must be legally permitted to work in the United States.

Preferred Qualifications:

  • GCP Cloud Infrastructure — BigQuery analytics, ADC auth, cloud-native services
  • Observability — Grafana, Prometheus, Kibana/Elasticsearch (WES logs), OCP Health Dashboards
  • Terraform Enterprise — Infrastructure as Code
  • GitHub — SCM
  • GH Copilot + AI Agents — AI-accelerated incident analysis, automated remediation workflows, prompt-engineered operational tooling
  • SRE Practices — Production Readiness Review, Capacity Planning, Change Validation, Prod Support, Post-Mortems, SLO Definition & Tracking
  • ServiceNow — Incident, Problem, and Change management; trend analysis; RCA grouping
  • BigQuery — Incident analytics, problem candidate identification, operational reporting
  • PagerDuty — On-call scheduling, escalation paths, push-button paging
  • Rundeck — Self-heal automation, push-button remediation jobs
  • Atlassian (Jira/Confluence) — RCA documentation, runbooks, architecture diagrams, onboarding
  • CyberArk — Privileged access for WMS/DFC log pulls and node access
  • Manhattan WMS — Warehouse Management System operations, RF/UI/LM node support
  • Python Automation — Operational scripting, BQ pipelines, alert correlation, report generation

Minimum Education:

  • The knowledge, skills and abilities typically acquired through the completion of a bachelor's degree program or equivalent degree in a field of study related to the job.

Preferred Education:

  • No additional education

Minimum Years of Work Experience:

  • 3

Preferred Years of Work Experience:

  • No additional years of experience

Minimum Leadership Experience:

  • None

Preferred Leadership Experience:

  • None

Certifications:

  • None

Competencies:

  • Global Perspective
  • Manages Ambiguity
  • Nimble Learning
  • Self-Development
  • Collaborates
  • Cultivates Innovation
  • Situational Adaptability
  • Communicates Effectively
  • Drives Results
  • Interpersonal Savvy

For California, Colorado, Connecticut, Rhode Island, Nevada, New York City, Ithaca (NY), Westchester County (NY), and Washington residents:
 

The pay range for this position is between $90,000.00 and $180,000.00.

Skills Required

  • 3 years of work experience in software engineering
  • Bachelor's degree in a related field

The Home Depot Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about The Home Depot and has not been reviewed or approved by The Home Depot.

  • Retirement Support: A 401(k) plan with company matching supports long-term savings alongside core pay. Retirement programs are consistently positioned as a meaningful part of total compensation.
  • Equity Value & Accessibility: An Employee Stock Purchase Plan enables discounted stock ownership as a core element of compensation. Equity opportunities complement wages and are accessible beyond full-time salaried roles.
  • Strong & Reliable Incentives: Profit-sharing and store-performance bonuses offer additional earnings opportunities beyond base pay. Incentive programs are described as recurring and tied to store results.

The Company
Atlanta, GA
129,974 Employees
Year Founded: 1977

What We Do

The Home Depot, the world’s largest home improvement specialty retailer, values and rewards dedicated, knowledgeable and experienced professionals. We operate over 2,200 retail stores in all 50 states, the District of Columbia, Puerto Rico, the U.S. Virgin Islands, Guam, Canada and Mexico. All of our associates have one thing in mind — helping our customers build and improve upon their homes. Join The Home Depot team today and see for yourself why we are consistently ranked as a top Fortune 500 company.
