Site Reliability Engineer

Sorry, this job was removed at 07:51 p.m. (CST) on Monday, Jun 23, 2025
Hiring Remotely in United States
Remote
Software
The Role
🚀 About Us

We’re a product-focused startup with a tight-knit team of 14 engineers building tools that help teams make better decisions through great research. We're pragmatic, fast-moving, and obsessed with product quality.

As we grow, our infrastructure needs to grow with us. That means better observability, stronger systems, faster deploys—and smarter decisions about cloud spend. We’re hiring someone who can take ownership of this and lay the foundation for long-term platform health.

🎯 What You’ll Do

You’ll be the first dedicated DevOps/Infra hire with end-to-end ownership of platform health, reliability, and scalability. You’ll partner directly with our engineering team to improve our systems, reduce toil, and make infra a product in its own right.

Your scope will include:

  • Observability, Reliability, Availability

    • Define and maintain service SLOs, dashboards, and alerts

    • Improve incident detection and response

    • Lead incident postmortems, share learnings, and manage follow-up actions

  • Infrastructure

    • Maintain and improve Terraform-managed infrastructure

    • Lead our migration of staging infrastructure to AWS

    • Optimize our use of tools like Datadog, Sentry, and others

  • Capacity Planning & Performance Optimization

    • Identify current and potential future bottlenecks

    • Collaborate with engineers to fine-tune application and infrastructure performance

    • Implement automated and semi-automated scaling strategies to handle growth and evolving workloads

  • Developer Experience & CI/CD

    • Increase pipeline reliability and performance

    • Design & implement load testing strategies as we scale

  • Security & Compliance

    • Work with the CTO to own and implement SOC 2 compliance protocols and requirements

    • Help foster a security-first culture by promoting best practices and secure-by-default tooling

    • Implement guardrails and additional security tools as needed

  • Cloud Cost Management

    • Monitor and optimize cloud spend

    • Build visibility and tooling to help teams make cost-aware decisions

💡 You Might Be a Great Fit If You...
  • Have 4–8+ years of experience in DevOps, SRE, or Infrastructure roles

  • Have hands-on AWS experience (EC2, RDS, VPCs, etc.)

  • Are confident with Terraform, GitHub Actions, Docker, and PostgreSQL

  • Have a track record of improving observability and reducing incident response times

  • Have worked in high-autonomy, high-ownership environments

  • Are cost-conscious and can identify waste in infra and cloud spend

  • Love building leverage tools for engineers—infra as a product

📈 Growth Path

This is a foundational hire. Today, the role is a pure individual contributor (IC) position, but there’s a clear runway to grow into:

  • Platform leadership (tech lead or manager)

  • Head of Infra/SRE if we expand the team

  • Principal engineer focused on scale, reliability, and platform strategy

You’ll have support and visibility from leadership, and the freedom to chart your path as the company grows.

⚙️ Our Stack
  • Cloud: AWS

  • Infra-as-code: Terraform

  • CI/CD: GitHub Actions

  • Containers: Docker, lightweight Kubernetes

  • Monitoring: Datadog, Sentry

  • Database: PostgreSQL, Redis

  • App: Rails, React, Sidekiq

✨ Why This Role?
  • Impact: You’ll shape the systems and culture of how we build and run software.

  • Trust: High autonomy and low process—make smart decisions, move fast.

  • People: No egos, just a team that values thoughtfulness, speed, and care.

  • Growth: Opportunity to grow with the company in whichever direction excites you.



The Company
HQ: Oakland, California
44 Employees
Year Founded: 2021

What We Do

Great Question is the all-in-one UX research platform trusted by customer-centric teams at Figma, Gusto, and Brex. Recruit participants and run your favorite methods — from user interviews and focus groups to surveys and prototype tests. Then analyze and store all of your research data — from recordings and transcripts to highlights, reels, and insight reports — in one enterprise-grade repository.
