Sr. Site Reliability Engineer

Standard Template Labs

Reposted 7 Days Ago

New York, NY, USA

In-Office

160K-230K Annually

Senior level

Artificial Intelligence • Information Technology • Software

IT Service Management is about to be redefined. We're building the engine for the next generation of enterprise IT technology.

The Role

The role involves designing and managing multi-cloud infrastructure, implementing CI/CD pipelines, and ensuring platform reliability, scalability, and security while optimizing performance for a SaaS platform used by enterprise customers.

Summary Generated by Built In

Standard Template Labs is an AI-native startup reimagining the future of IT Service and Configuration Management. Backed by leading investors, we're leveraging AI to transform how enterprises manage and engage with their IT ecosystems.

About the Role

We’re looking for a Senior Site Reliability Engineer (SRE) to own the reliability, performance, and scalability of our AI-native platform. You’ll operate at the intersection of software engineering and infrastructure, building systems that keep our platform highly available, observable, and resilient in production.

This is a hands-on engineering role where you’ll write production code (primarily in Python) while also owning on-call operations and incident response.

Responsibilities

Reliability & Production Ownership

Own the availability, latency, and performance of critical production systems
Participate in and improve a 24/7 on-call rotation, responding to incidents and driving resolution
Lead incident response, root cause analysis (RCA), and postmortems
Design systems that fail gracefully and recover automatically

Automation & Engineering (Python-heavy)

Write production-grade Python code to:
- Automate infrastructure workflows
- Build internal reliability tools
- Improve deployment, rollback, and recovery systems
Eliminate manual operational work through automation and self-healing systems

Observability & Monitoring

Design and implement:
- Metrics, logging, tracing
- Alerting systems (reduce noise, improve signal)
Build dashboards and tooling to give real-time visibility into system health

Infrastructure & Scalability

Operate and improve systems running on:
- Cloud platforms (AWS/GCP/Azure)
- Containers (Docker, Kubernetes)
Scale systems to handle enterprise workloads and high-throughput traffic
Improve deployment pipelines, CI/CD, and infrastructure-as-code

Reliability Engineering & Resilience

Define and enforce:
- SLAs / SLOs / error budgets
Conduct:
- Load testing
- Chaos testing
Build resilient systems that can tolerate failure

Collaboration

Partner with product and backend engineers to:
- Improve system reliability
- Embed observability into services
Help teams design production-ready systems from day one

Qualifications

Core Requirements

Strong software engineering background (not just ops)
Proficiency in Python (required) for building tools and services
Experience operating production systems at scale

Infrastructure & Systems

Experience with:
- Kubernetes / Docker
- Cloud platforms (AWS/GCP/Azure)
- Distributed systems

Reliability & Operations

Experience with:
- On-call rotations and incident response
- Monitoring tools (Grafana, Prometheus, etc.)
- Debugging production issues under pressure

Nice to Have

Experience with:
- AI/ML systems or data pipelines
- Event-driven architectures
- High-availability systems

What We Offer

Build foundational product features for an AI-first enterprise platform
Take ownership of critical systems that scale to millions of users
A culture that values craftsmanship, autonomy, and technical excellence
Competitive compensation, equity, and benefits package
Work from our Flatiron District, Manhattan office, where you’ll be side-by-side with the founding team in a supportive, collaborative setting. Our team works on-site five days a week, growing and building together, and the location is easy to reach with plenty of public transportation options.

As an equal opportunity employer, we don’t tolerate discrimination or harassment of any kind, whether based on race, ethnicity, age, gender identity, citizenship, religion, sexual orientation, disability, pregnancy, veteran status, or any other protected characteristic as outlined by federal, state, or local laws. The reasonably estimated yearly salary for this role is $160,000 to $250,000 USD.

Skills Required

5+ years as a Platform, Infrastructure, or DevOps Engineer in SaaS environments
Strong in Python, Go, or similar programming languages
Proficient with Terraform, Kubernetes, and Docker
Deep experience with AWS, GCP, or Azure
Hands-on with CI/CD tools like GitHub Actions, Argo
Expert in Datadog for observability and monitoring

Standard Template Labs Compensation & Benefits Highlights

Healthcare Strength — Core health coverage includes medical, dental, and vision insurance as part of the package. Public materials repeatedly position this as a solid core benefit for employees.
Leave & Time Off Breadth — Paid holidays and an unlimited PTO policy are offered to support flexibility and recharging. Descriptions emphasize flexibility in time off as part of total compensation.
Equity Value & Accessibility — A generous options package and company equity are included alongside a competitive base salary. Materials frame equity as a core component of total rewards at the startup.

The Company

HQ: New York, NY

25 Employees

Year Founded: 2025

What We Do

We’re on a mission to reinvent IT Service Management from the ground up. In an industry dominated by outdated, bloated tools, we believe it’s time for a clean slate — modern, efficient, and built with the end-user in mind. Our goal is to create software that teams actually want to use — tools that accelerate productivity instead of slowing it down. With deep product expertise, strong backing, and a bold vision, we’re building a next-generation ITSM platform designed for the way companies work today and tomorrow. No legacy baggage. No compromises. Just smart, scalable, high-performance software that redefines what IT service can be.

Why Work With Us

Backed by leading investors, we're leveraging AI, graph-based architecture, and exceptional design to transform how enterprises manage and engage with their technology ecosystems.

OnSite Workspace

Work from our Midtown Manhattan office alongside the founding team in a focused, collaborative environment. We're in-office five days a week, moving quickly and building together—with easy access to public transportation.

HQ: New York, NY

Flatiron District