Sr. Site Reliability Engineer - HCP Terraform

Posted 2 Hours Ago
Hiring Remotely in United States
Remote
177K-208K Annually
Senior level
Cloud • Information Technology • Security • Software
The Role
Seeking a Senior Site Reliability Engineer to join the Production Engineering team, which is responsible for building and maintaining the core services of the HCP Terraform architecture. Responsibilities include troubleshooting production incidents, developing SRE best practices, and collaborating across teams to enhance system security and performance.
Summary Generated by Built In

Location: US 100% remote

About the Team

The Terraform Platform Engineering group is composed of Site Reliability Engineers and distributed systems engineers working on the HCP Terraform hosted service. Our group ensures that the platform’s underlying infrastructure, data stores, and core foundational services are reliable, performant, and robust. We work closely with the engineering teams that ship features for both HCP Terraform and the Terraform Enterprise on-premises product.

We’re seeking a Senior Site Reliability Engineer to join the Production Engineering team. This team is responsible for building and maintaining the “core” services that make up the HCP Terraform architecture and form the foundational layer upon which user-facing services are built. In addition to developing and maintaining our own services, the team fosters operational maturity efforts in conjunction with the product engineering teams working on HCP Terraform. These efforts include documentation, training, and tooling related to service ownership, service-level objective (SLO) definitions, and monitoring and observability.

If this sounds like an interesting opportunity, we’d love to meet you! We have a large footprint and a quickly growing user base, with lots of interesting problems and plenty of opportunities for growth and development.

In this role, you can expect to:

  • Dive into problems with an eye to both immediate remediation and the follow-through changes and automation that will prevent future occurrences
  • Troubleshoot production incidents that often span multiple teams, services, and codebases
  • Help develop and evangelize SRE best practices, techniques, and tools to the engineers building our services
  • Model our incident response process, leading by example during incidents and in blameless retrospectives
  • Maintain day-to-day vigilance with regard to operational security while helping to enhance the intrinsic security of the overall production system
  • Collaborate across teams to improve our tools based on experience gained from running our own software in production
  • Participate in a 24/7 on-call rotation that supports our production services

You’re a great fit if you have:

  • 5+ years of production experience at scale working on any of the following: backend applications written in Ruby on Rails, databases, observability, or services written in Go
  • Experience working closely with teams building Ruby/Rails and Go services
  • Experience building and supporting the production services for a large-scale SaaS application
  • Experience building and scaling distributed, highly available systems
  • Informed opinions from experience about service ownership best practices, incident response and resolution, and platform resiliency
  • A drive for quality through maintainable code and comprehensive testing from development to deployment
  • The ability to communicate clearly while remaining empathetic and kind
  • An eagerness to learn through humility and reflection
  • Experience debugging performance bottlenecks for live services and systems
  • Working knowledge of industry best practices related to information security

How We Work

At HashiCorp, we operate according to a strong set of company principles, many of which are described in The Tao of HashiCorp. We value top-notch collaboration and communication skills, both among internal teams and in how we interact with our users. We take care to balance and respond to the needs of our open source community as well as our enterprise-level customers.

Engineering at HashiCorp is largely a remote team, and this role is no exception. While prior experience working remotely isn't required, we are looking for team members who perform well given a high level of independence and autonomy.

About the Application Process

Communication is a critical aspect of how we work, and a cover letter is a great way to provide a sample of how you communicate. In your cover letter, describe why you're interested in working at HashiCorp and what draws you to this role in particular. Specifics of your past experience are great to include, too.

At HashiCorp, we are committed to hiring and cultivating a diverse team. If you are on the fence about whether you meet our requirements, please apply anyway!

Individual pay within the range will be determined based on job-related factors such as skills, experience, and education or training.

The base pay range for this role in the SF Bay Area / NYC area is:

$176,500 - $207,600 USD

The base pay range for this role in Seattle Metro, Denver / Boulder Metro, New York (excluding NYC), Washington D.C., or California (excluding SF Bay Area) is:

$161,800 - $190,300 USD

The base pay range for this role in Colorado (excluding Denver / Boulder Metro) and Washington (excluding Seattle Metro) is:

$147,100 - $173,000 USD

Top Skills

Go
Ruby On Rails
The Company
HQ: San Francisco, CA
1,200 Employees
Hybrid Workplace
Year Founded: 2012

What We Do

HashiCorp was founded by Mitchell Hashimoto and Armon Dadgar in 2012 with the goal of revolutionizing datacenter management: application development, delivery, and maintenance. The datacenter of today is very different from the datacenter of yesterday, and we think the datacenter of tomorrow is just around the corner.
