Staff Site Reliability Engineer

Posted 15 Days Ago
2 Locations
Remote
181K-259K Annually
Expert/Leader
Beauty • Cloud • Fintech • Marketing Tech • Payments • Productivity • Software
We empower the appointment-based, self-care industry to give their clients more of the magical moments that matter most.
The Role
As a Staff Site Reliability Engineer, you'll lead reliability strategies, improve system resilience, and mentor teams on best practices in a hands-on role.
Summary Generated by Built In

Who is Boulevard?

Boulevard provides the first and only client experience platform for appointment-based, self-care businesses. We empower our customers to give their clients more of the magical moments that matter most.

Before launching in 2016, our founders spent months interviewing salon managers and working behind front desks to understand their pain points so we could design a modern, user-friendly platform that meets the unique needs of their business. Our roots may be in hair salons, but we are built for the broader self-care industry, including many types of salons, spas, medspa, barbershops, and more. Our technology not only helps our customers survive but thrive. Take a look at how we (and YOU) can make that happen. 

We have an insatiable curiosity and embrace experimentation. We believe that simple solutions require the most sophistication, and we design each and every detail to maximize potential, power, and impact. Do our values match? Read through our story and what we value the most.

Our team values and celebrates our diverse backgrounds. Being open about who we are and what we do allows us to do the best work of our lives. We believe in equal opportunity for all, and you should too.

Come do the best work of your life at Boulevard.

We’re hiring a Staff Site Reliability Engineer to shape the foundation of Site Reliability Engineering at Boulevard

Here you will not just build infrastructure or tooling, but improve systems at scale, influence reliability across engineering, and drive a reliability strategy. You’ll help teams establish SLOs and build repeatable practices for how teams observe, debug, and improve their services.

Reporting to the Director of Cloud & Reliability, this hands-on technical leadership role will uplevel reliability practices and build resilient approaches. You’ll help teams adopt best practices, define what “good” looks like, and partner with teams to get there.

  1. Reliable Infrastructure - a foundation of stability, and security.
  2. Developer Productivity - empowering builders to do the right things.
  3. Clear ownership - accountability aligned with ownership. Collaboration, not silos.
  4. Long-term Focus - we engineer for tomorrow.
Key Projects & Initiatives
  • Golden Paths to Production: Establish and evolve paved paths that make production-readiness the default for every service at Boulevard. Build shared tooling, templates, and deployment workflows that encode best practices for observability, testing, and resilience.
  • Shared Systems & Production Tooling: Develop core libraries, shared services, and self-service tooling that improve reliability, resilience, and developer efficiency.
  • Reliability & Fault Tolerance Improvements: Lead initiatives that make the platform more robust, fault-tolerant, and self-healing.
  • Observability & Operational Insight: Enhance Boulevard’s observability stack to turn data into action and insight into reliability. Expand metrics, logging, and tracing coverage across critical systems, ensuring full visibility into production health.
  • Platform Performance Optimization: Drive continuous improvement in system and application performance, ensuring services remain fast, reliable, and cost-efficient. Use observability data to identify bottlenecks and improve service efficiency across compute, network, and storage layers.
What You’ll Do Here
  • Define Boulevard’s Reliability Strategy: Lead the development and evolution of our reliability vision, establishing SLOs, SLIs, and error budgets that balance reliability, performance, and delivery speed. Partner with engineering and product teams to embed reliability as a measurable, shared responsibility.
  • Architect and Scale Resilient Systems: Partner with engineering teams to design, build, and operate scalable, fault-tolerant, and secure distributed systems that power Boulevard’s continued growth and customer trust.
  • Develop Production Tooling and Shared Systems: Create and maintain production-grade tooling, shared libraries, and services that improve system resilience and developer productivity. Build the foundations that make our platform more robust, and make reliability the default for every service.
  • Drive Observability and Operational Excellence: Elevate our observability stack, enhancing metrics, logging, tracing, and alerting to enable actionable insights, faster incident resolution, and proactive reliability improvements. Leverage this data to prioritize and remediate performance bottlenecks.
  • Automate Everything: Champion automation to eliminate manual toil, streamline operational workflows, and build self-service tooling that empowers developers and embeds reliability into daily development practices. You can navigate the tradeoffs between toil and automation.
  • Mentor and Influence Across Engineering: Act as a technical leader and mentor, guiding engineers in scalable system design, capacity planning, and operational excellence to help foster a culture where reliability is everyone’s responsibility.
What You’ll Need to Thrive
  • Deep Systems Expertise: 8–10+ years of experience in systems, infrastructure, or backend engineering, with a track record of building and operating distributed systems at scale. You have a deep understanding of reliability, scalability, and performance in complex, production-grade environments.
  • Reliability Engineering Mindset: Proven experience defining and delivering reliability outcomes through SLOs, SLIs, error budgets, and mature observability practices. You approach reliability as an engineering discipline, not an afterthought.
  • Automation-First Philosophy: Strong background in infrastructure-as-code, scripting, and automation (e.g., Terraform, Python, Go, or similar). You believe in eliminating manual toil and codifying operational excellence into reusable tools and systems.
  • Incident Management Mastery: Experienced in detecting, diagnosing, and mitigating production incidents in high-availability systems. You drive blameless postmortems and translate lessons learned into sustainable reliability improvements.
  • Collaboration & Influence: Exceptional communication and stakeholder management skills. You’re adept at aligning diverse teams, advocating for reliability practices, and influencing without authority, raising the operational bar across engineering.
  • Technical Leadership & Mentorship: Demonstrated ability to mentor engineers, set technical standards, and scale your impact through influence. You thrive on enabling others and shaping a reliability-first culture across the organization.
  • Comfort with Ambiguity: Thrives in dynamic, fast-moving environments. You excel at navigating uncertainty, setting direction where none exists, and iterating quickly toward meaningful impact.

Bonus:

  • Experience with Elixir, Phoenix, Ruby, or Rails.
  • Hands-on experience identifying and improving database performance.

How we’ll take care of you:  

Your starting total cash compensation for this role is between $181,125 and $258,750 depending on your current skills, experience, training, and overall market demands. This salary range is subject to change, and there is always room for growth and advancement

In addition to the wonderful people you’ll get to work with and challenging projects that’ll push you - Boulevard is here to make sure you’re always at the top of your game emotionally, mentally, and physically. 

  • ✨ We’ve got you covered with a 401(k) match plus dental, medical, vision, and life insurance. 

  • 🏝 Take a break whenever you need with our flexible vacation day policy. 

  • 🖥 Fully remote so you can choose where you want to work. You’ll receive a work from home stipend every month. 

  • 💚 Family planning resources and specialized support programs. 

  • 🔮 Equity: get ahead on the ground floor and grow with Boulevard. 

  • 💅 Boulevard Bucks Learning and Development program allows employees to explore businesses in the market we serve.


📲 We recommend following our official LinkedIn page to stay up to date on all things Boulevard life!

Boulevard Labs, Inc. is an Equal Opportunity Employer committed to hiring a diverse workforce and sustaining an inclusive culture. All employment decisions at Boulevard Labs, Inc. are based on business needs, job requirements, and individual qualifications, without regard to race, color, religion, marital status, age, national origin, ancestry, physical or mental disability, medical condition, pregnancy, gender, sexual orientation, gender identity or expression, veteran status, or any other status protected under federal, state, or local law.

Top Skills

Elixir
Go
Python
Ruby on Rails
Ruby
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Los Angeles, CA
260 Employees
Year Founded: 2016

What We Do

Boulevard provides a Client Experience Platform (CXP) that is purpose-built for appointment-based, self-care businesses. Our customers thrive on developing deep relationships that go beyond any single visit or transaction, and our technology extends their ability to deliver personalized, enjoyable experiences with online appointment scheduling, messaging and payments that are simple, elegant, and reliable.

Successfully serving the self-care industry with a differentiated brand is critical to Boulevard's long-term vision. As we reinforce and grow our position as a trusted partner to self-care businesses, we seek to evolve into a marketplace where consumers can easily find and access our customers. We aspire to become a lifestyle brand that connects consumers instantly and easily with an entire marketplace of self-care businesses that help them look and feel their best.

Gallery

Gallery

Similar Jobs

P2P.org Logo P2P.org

Site Reliability Engineer

Information Technology
In-Office or Remote
35 Locations
179 Employees

Alpaca Logo Alpaca

Site Reliability Engineer

Fintech • Information Technology
Remote
2 Locations
132 Employees

Veeva Logo Veeva

Senior Software Engineer

Big Data • Cloud • Healthtech • Software • Big Data Analytics
In-Office or Remote
Toronto, ON, CAN
6000 Employees
110K-270K Annually

Anaplan Logo Anaplan

Site Reliability Engineer

Information Technology
Remote
Canada
2194 Employees

Similar Companies Hiring

PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Rain Thumbnail
Web3 • Payments • Infrastructure as a Service (IaaS) • Fintech • Financial Services • Cryptocurrency • Blockchain
New York, NY
40 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account