Intermediate Site Reliability Engineer, Environment Automation

Posted 11 Days Ago
28 Locations
Remote
Junior
Cloud • Security • Software • Cybersecurity • Automation
GitLab is the most comprehensive AI-powered DevSecOps platform.
The Role
As an SRE, you'll automate environments, debug production issues, contribute to CI/CD workflows, and enhance observability while collaborating across teams.

GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating human progress. Our platform unites teams and organizations, breaking down barriers and redefining what's possible in software development. Thanks to products like Duo Enterprise and Duo Agent Platform, customers get AI benefits at every stage of the SDLC. 

The same principles built into our products are reflected in how our team works: we embrace AI as a core productivity multiplier, with all team members expected to incorporate AI into their daily workflows to drive efficiency, innovation, and impact. GitLab is where careers accelerate, innovation flourishes, and every voice is valued. Our high-performance culture is driven by our values and continuous knowledge exchange, enabling our team members to reach their full potential while collaborating with industry leaders to solve complex problems. Co-create the future with us as we build technology that transforms how the world develops software.

An overview of this role

As a Site Reliability Engineer at GitLab, you’ll keep our user-facing services and production systems running smoothly by blending software engineering with infrastructure expertise. Our SREs are pragmatic operators and skilled developers who bring sound engineering principles, operational discipline, and thoughtful automation to everything they touch. The ideal candidate is equally comfortable debugging Go applications and designing scalable Terraform automation across hundreds of environments. You're the go-to for complex production issues, combining deep technical investigation with a developer’s mindset and an operator’s precision.

In the Environment Automation specialization, your focus is on operating and automating hundreds of GitLab environments, from initial provisioning to day-to-day maintenance tasks. Unlike other SRE roles, this position centers on automating the lifecycle of many tenant environments, ensuring they remain secure, consistent, and reliable at scale.

Some examples of the projects you could work on:

  • Designing infrastructure automation that provisions and operates GitLab environments using Terraform, Ansible, and Kubernetes
  • Creating and maintaining deployment packages for GitLab, such as Helm Charts and omnibus-gitlab
  • Building and operating Dedicated GitLab instances integrated with cloud-native services (e.g., GCP, AWS)
  • Developing tools to orchestrate infrastructure-as-code workflows across multiple tenants
  • Deploying and managing microservices on Kubernetes clusters at scale
  • Enhancing GitLab’s observability stack (e.g., Prometheus, ELK) to support proactive monitoring and incident response
  • Integrating with and operating infrastructure in cloud provider ecosystems (e.g., IAM, networking, storage)
  • Championing and implementing cloud security best practices across automated infrastructure

What You'll Do

  • Support Environment Automation at Scale: Contribute to automating the provisioning, configuration, and management of GitLab environments using Terraform, Ansible, and Kubernetes. Follow best practices to support infrastructure across many tenants with guidance from senior team members.
  • Assist in Debugging Production Issues: Investigate and troubleshoot issues in Kubernetes clusters and GitLab services. Help resolve common problems such as failed deployments, pod crashes, and scheduling conflicts using tools like kubectl.
  • Contribute to IaC and CI/CD Workflows: Write and maintain Terraform modules and scripts to automate routine operations. Participate in improving CI/CD pipelines for safe and repeatable infrastructure changes.
  • Participate in Monitoring and Maintenance: Help monitor environment health using tools like Prometheus, ELK, and Grafana. Assist in improving observability and capacity tracking for tenant environments.
  • Respond to Incidents and Alerts: Take part in the incident response process, helping triage alerts, document issues, and support resolution efforts under the guidance of senior engineers.
  • Collaborate Across Teams: Work with Infrastructure and Development teams to contribute to solutions that improve platform reliability and operational efficiency.

What You'll Bring

  • Experience with Infrastructure as Code: Familiarity with Terraform and Ansible to manage cloud infrastructure. Able to work with modules and understand the basics of state and variable use.
  • Kubernetes Fundamentals: Experience using kubectl, Helm, or Kustomize to interact with Kubernetes clusters. Understands core concepts such as pods, deployments, and rollouts.
  • Basic Programming Skills: Able to read and modify infrastructure tooling written in Go, Ruby, or similar languages.
  • Exposure to Multi-Environment Operations: Experience working with multiple environments or customer setups, even if not at full scale. Understands the challenges of managing consistency and isolation.
  • Monitoring and Troubleshooting Skills: Familiar with basic observability tools and logs. Can identify service issues using dashboards or metrics and escalate appropriately.
  • Collaborative Mindset: Works well in cross-functional teams. Eager to learn from others, share knowledge, and contribute to team success.
  • On-Call Experience: Has participated in on-call rotations for production systems and is comfortable responding to alerts, triaging incidents, and collaborating during recovery efforts.

About the team

GitLab’s Dedicated team, where the SRE Environment Automation role sits, is on a mission to deliver a fully managed, single-tenant GitLab experience through the GitLab Dedicated platform. Our goal is to eliminate manual operations across the entire lifecycle of customer environments, including provisioning, upgrades, security, and monitoring, so customers can focus on unlocking the full potential of The One DevOps Platform without managing the underlying infrastructure. We build scalable, automated systems that ensure each GitLab Dedicated instance is secure, consistent, and production-ready—whether we're managing 10 environments or hundreds.

How GitLab will support you
  • Benefits to support your health, finances, and well-being
  • Flexible Paid Time Off 
  • Team Member Resource Groups
  • Equity Compensation & Employee Stock Purchase Plan
  • Growth and Development Fund
  • Parental leave 
  • Home office support

Please note that we welcome interest from candidates with varying levels of experience; many successful candidates do not meet every single requirement. Additionally, studies have shown that people from underrepresented groups are less likely to apply to a job unless they meet every single qualification. If you're excited about this role, please apply and allow our recruiters to assess your application.

Country Hiring Guidelines: GitLab hires new team members in countries around the world. All of our roles are remote; however, some roles may carry specific location-based eligibility requirements. Our Talent Acquisition team can help answer any questions about location once you have started the recruiting process.

Privacy Policy: Please review our Recruitment Privacy Policy. Your privacy is important to us.

GitLab is proud to be an equal opportunity workplace and is an affirmative action employer. GitLab’s policies and practices relating to recruitment, employment, career development and advancement, promotion, and retirement are based solely on merit, regardless of race, color, religion, ancestry, sex (including pregnancy, lactation, sexual orientation, gender identity, or gender expression), national origin, age, citizenship, marital status, mental or physical disability, genetic information (including family medical history), discharge status from the military, protected veteran status (which includes disabled veterans, recently separated veterans, active duty wartime or campaign badge veterans, and Armed Forces service medal veterans), or any other basis protected by law. GitLab will not tolerate discrimination or harassment based on any of these characteristics. See also GitLab’s EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know during the recruiting process.

Top Skills

AI
Ansible
Devsecops
Elk
Gitlab
Go
Grafana
Kubernetes
Prometheus
Ruby
Terraform

The Company
San Francisco, CA
2,500 Employees
Year Founded: 2014

What We Do

GitLab is an open-core software company that develops the most comprehensive DevSecOps Platform, used by more than 100,000 organizations. Our mission makes it clear that we believe in a world where everyone can contribute. We make that possible at GitLab by running our operations on our product and staying aligned with our values.

We strive to create a transparent environment where all team members around the world feel that their voices are heard and welcomed. We also aim to be a place where people can show up as their full selves each day and contribute their best.

Why Work With Us

We have big ambitions to make GitLab the most comprehensive AI-powered DevSecOps platform, and we need skilled contributors to get us there. At GitLab, your contributions shape the future of software development at a time when AI is changing the way software is built.

GitLab Offices

Remote Workspace

Employees work remotely.

All-remote means that each individual in the organization is empowered to work and live where they are most fulfilled; it makes it clear that every team member is equal. No one, not even the executive team, meets in person on a daily basis.

Typical time on-site: None
San Francisco, CA
