Site Reliability Engineer

Posted 5 Days Ago
Hiring Remotely in United States
Remote
Senior level
Software
The Role
EngFlow seeks an experienced Site Reliability Engineer to design, build, and maintain cloud infrastructure for a distributed build acceleration platform, ensuring performance, scalability, and high availability while automating processes and resolving incidents efficiently.
Summary Generated by Built In
About EngFlow

At EngFlow, we help developers save time by accelerating software builds and tests. Our cloud-based, distributed service optimizes developer workflows through remote execution and caching, improving efficiency, productivity, and product quality.

Backed by top investors, EngFlow is redefining how companies build software and ship well-tested products. Our solutions speed up builds by a factor of 10 or more, while our observability platform provides actionable insights for optimization. Founded by key contributors to Bazel, we build tools that empower engineering teams—from startups to Fortune 500 companies—to enhance developer velocity and improve build performance.

Learn more about our mission, culture, and team: EngFlow | Video

We’re looking for an experienced SRE to join our engineering team. You’ll be at the intersection of software engineering and systems operations — ensuring our distributed infrastructure is highly available, performant, and scalable while enabling our engineers to move quickly and confidently.

Key Responsibilities
  • Design, build, and maintain cloud infrastructure for our distributed build acceleration platform
  • Automate everything: from deployment pipelines to monitoring and recovery
  • Manage scalability and reliability for high-throughput, low-latency systems
  • Implement and maintain observability: logging, metrics, tracing, and alerting
  • Work closely with product and engineering teams to embed reliability into every feature
  • Diagnose and resolve production incidents quickly, and feed learnings back into systems design
  • Optimize cost, performance, and resilience across multi-cloud environments

Requirements
  • 4+ years in SRE, DevOps, or Production Engineering roles
  • Experience managing Kubernetes in production
  • Strong background in cloud infrastructure (GCP or AWS) and IaC (Terraform preferred)
  • Solid knowledge of networking, security, and distributed systems
  • Track record of improving system availability and developer productivity
  • A knack for debugging complex, cross-system issues under pressure

Benefits

We offer comprehensive medical, dental, and vision benefits, 401k/pension, parental leave, and generous vacation. The team is fully remote, but we meet in person several times a year at exciting destinations around the world. We value getting the work done and having fun while doing it, and our team events have included chocolate, whisky, and tea tastings, monthly team games, and escape rooms.

Top Skills

AWS
GCP
Kubernetes
Terraform

The Company
HQ: Austin, TX
18 Employees
Year Founded: 2020

What We Do

EngFlow is a SaaS company that is redefining how companies build software and ship well-tested products. Its remote execution service speeds up software builds by a factor of 10 or more, and its observability platform provides insights to optimize builds and tests. Created by the engineer who led the development of Bazel, Google's open source build system, EngFlow builds tools and connects experts in the Bazel and build ecosystem. EngFlow products are used by engineers from startups to Fortune 500 companies to accelerate developer productivity and positively impact engineering culture.
