Senior Site Reliability Engineer

Posted 25 Days Ago
Bangalore, Bengaluru Urban, Karnataka
In-Office
Senior level
Information Technology • Security • Software
The Role
The Senior Site Reliability Engineer will design, build, and maintain infrastructure operations using AWS and GCP, implement reliability practices, and support automation initiatives alongside engineering teams. They will manage incidents, contribute to postmortems, and support continuous improvement in a collaborative environment.
Summary Generated by Built In

At SolarWinds, we’re a people-first company. Our purpose is to enrich the lives of the people we serve—including our employees, customers, shareholders, partners, and communities. Join us in our mission to help customers accelerate business transformation with simple, powerful, and secure solutions.

The ideal candidate thrives in an innovative, fast-paced environment and is collaborative, accountable, ready, and empathetic. We’re looking for individuals who believe they can accomplish more as a team and create lasting growth for themselves and others. We hire based on attitude, competency, and commitment. Solarians are ready to advance our world-class solutions in a fast-paced environment and accept the challenge to lead with purpose. If you’re looking to build your career with an exceptional team, you’ve come to the right place. Join SolarWinds and grow with us!


Your Role:

We are seeking a Senior Site Reliability Engineer (Infrastructure & Site Reliability Engineering) with experience in AWS, GCP, Kubernetes, and GitOps to work with our Site Reliability Engineering (SRE) team. The successful candidate will understand SRE practices and have a track record of implementing high-quality site reliability engineering practices (SLAs, SLOs, Proactive Alert Management, Incident Response/Review, Postmortems, etc.).

In this role, you will work with our SRE and cross-functional engineering teams to develop and operate our development and production infrastructure.

Responsibilities:

  • Work collaboratively with software engineering teams to define infrastructure and deployment requirements.
  • Contribute actively to our automation and observability initiatives.
  • Learn, develop, and maintain operational tools for deployment, monitoring, and analysis of cloud (AWS & GCP) infrastructure and systems.
  • Work closely with team members to lead the response to production incidents, conduct postmortems, and drive continuous improvement as part of a 24/7 on-call rotation, gaining exposure to critical issue resolution.
  • Contribute to on-call documentation and incident response playbooks.
  • Establish and drive operations performance through SLOs.
  • Embrace and adhere to development best practices, including continuous integration/deployment and code review.
  • Demonstrate a strong commitment to continuous learning and professional development by seeking opportunities for mentorship and learning within the team.
  • Follow the team's practices for maximizing development velocity, including but not limited to continuous integration/deployment and code review via GitHub pull requests.

Ideal Attributes:

  • Strong customer orientation
  • Excellent interpersonal and organizational skills
  • Attention to detail and focus on quality
  • Strong communication skills to effectively liaise with both technical and non-technical staff
  • Ability to act decisively and work well under pressure
  • Must be a collaborative problem solver
  • Strong bias for ownership and action

Qualifications:

  • 5+ years of experience designing, building, and maintaining SaaS environments
  • 4+ years of experience designing, building, and maintaining AWS/GCP infrastructure with Terraform
  • Experience building and running Kubernetes clusters
  • Experience with observability (monitoring, logging, tracing, metrics)
  • Experience with GitOps CI/CD processes
  • Experience scripting in Python, Go (Golang), Bash, or PowerShell, and using AWS CLI tools
  • Experience with security operations – security policies, infrastructure, key management, setup of encryption at rest and in transit

SolarWinds is an Equal Employment Opportunity Employer. SolarWinds will consider all qualified applicants for employment without regard to race, color, religion, sex, age, national origin, sexual orientation, gender identity, marital status, disability, veteran status or any other characteristic protected by law.

All applications are treated in accordance with the SolarWinds Privacy Notice: https://www.solarwinds.com/applicant-privacy-notice

Top Skills

AWS
Bash
GCP
Go
Kubernetes
PowerShell
Python
Terraform

The Company
Austin, TX
2,299 Employees
Year Founded: 1999

What We Do

SolarWinds is a leading provider of powerful and affordable IT management software. Our products give organizations worldwide—regardless of type, size, or complexity—the power to monitor and manage their IT services, infrastructures, and applications; whether on-premises, in the cloud, or via hybrid models. We continuously engage with technology professionals—IT service and operations professionals, DevOps professionals, and managed services providers (MSPs)—to understand the challenges they face in maintaining high-performing and highly available IT infrastructures and applications. The insights we gain from them, in places like our THWACK® community, allow us to solve well-understood IT management challenges in the ways technology professionals want them solved. Our focus on the user and commitment to excellence in end-to-end hybrid IT management has established SolarWinds as a worldwide leader in solutions for network and IT service management, application performance, and managed services.
