Senior Site Reliability Engineer

Posted Yesterday
El Segundo, CA
In-Office
183K-235K Annually
Senior level
Artificial Intelligence • Machine Learning • Security • Software
HiveWatch makes it easier for companies to keep their people, assets, and brands safe.
The Role
The Senior Site Reliability Engineer will ensure the reliability of systems, debug issues, maintain CI/CD pipelines, improve system reliability, and mentor engineers.

About Us:

HiveWatch is a tech-forward, inclusive organization fostering the evolution of the physical security industry. We are a diverse team of forward thinkers who empower each other to find creative and collaborative solutions in an industry ripe for modernization. We are passionate about the problems we’re solving for our customers and equally passionate about the company we’re building.  

HiveWatch is here to help security teams pivot from chasing threats to preventing them. We protect organizations, people, and property through the intelligent orchestration of physical security programs. With better communication, more insights, and less “noise”, we are modernizing what it means for businesses and their employees to truly feel safe.

POSITION OVERVIEW:

HiveWatch is seeking a Senior Site Reliability Engineer to join our Platform Team, where you'll architect and maintain mission-critical edge infrastructure that connects our SaaS platform to customer systems. You'll ensure exceptional performance, reliability, and observability across our distributed environment while providing technical leadership to our growing engineering team. This role reports directly to our VP of Engineering.

WHAT YOU'LL DO:

  • Own the reliability of mission-critical systems including production monitoring, alerting, and capacity planning
  • Debug and resolve complex production issues across the full stack, from infrastructure to application code
  • Participate in a regular on-call rotation to provide 24/7 coverage for critical systems
  • Perform root cause analysis requiring deep code-level investigation and implement preventive measures
  • Build automation and tooling to reduce operational toil and improve system reliability
  • Maintain CI/CD pipelines and observability infrastructure, and optimize database performance
  • Increase the resiliency, scalability, and maintainability of production environments
  • Establish on-call procedures and disaster recovery processes
  • Provide technical leadership and mentorship to foster engineering excellence and reliability culture

TECH STACK:

  • Languages: Kotlin, Rust, TypeScript, and Python
  • Deployments: GitHub Actions, Terraform, Terragrunt, and Helm
  • Infrastructure: AWS (Kinesis, Serverless, RDS, EKS), Kubernetes, Docker, Postgres, IoT Edge

PREFERRED QUALIFICATIONS:

  • Experience with our tech stack: Kotlin, Rust, TypeScript, Python
  • Expertise in AWS architecture and services
  • Experience in physical security, IoT, or edge computing environments
  • Expertise with advanced AWS services (Kinesis, Lambda, EKS, RDS)
  • Experience with Terraform and Terragrunt specifically
  • Background in high-availability, multi-tenant SaaS environments
  • Experience establishing SRE practices and culture from the ground up
  • Track record of leading incident response and post-mortem processes
  • Experience mentoring and developing junior engineers
  • Knowledge of security best practices and compliance requirements
  • Experience with edge computing and distributed system architectures
  • Previous experience in a startup or high-growth environment (50-200 employees)

MINIMUM QUALIFICATIONS:

  • 7+ years of software engineering experience with strong coding skills in production environments
  • 5+ years of SRE, DevOps, or production operations experience
  • Expertise with cloud platforms (AWS preferred) and containerized applications (Docker, Kubernetes)
  • Experience with Infrastructure as Code (Terraform, CloudFormation, or similar)
  • Proficiency in at least one object-oriented programming language relevant to our tech stack (Java, Kotlin, or Python)
  • Hands-on experience with relational databases and SQL performance optimization
  • Experience with monitoring and observability tools (Prometheus, Grafana, DataDog, or equivalent)
  • Strong debugging skills across distributed systems and microservices architectures
  • Bachelor's degree in Computer Science, Engineering, or equivalent practical experience

ADDITIONAL INFO:

  • Salary range for this position: $183,000 to $235,000 per year
  • Eligible to participate in HiveWatch Equity Incentive Plan

*Final offer will be at the company's sole discretion and determined by multiple factors, including years and depth of relevant experience and expertise, location, and other business considerations.

Benefits & Culture:

At HiveWatch, we’re passionate about taking care of our people — and it shows in the benefits we offer. Our team enjoys:

  • Comprehensive health coverage: medical, dental, vision, and life insurance
  • Cutting-edge work in an emerging field with huge growth potential
  • Competitive compensation packages designed to reward top talent
  • A modern, newly renovated HQ right on Main Street in El Segundo, CA
  • 401(k) with a 4% company match to help you invest in your future (match launches in 2026)
  • Flexible paid time off so you can recharge when you need it
  • Additional benefits include ClassPass credits and a discount on pet insurance
  • A family-friendly, compassionate culture that values balance and belonging

We encourage you to challenge the status quo, share your perspective, and leave fear at the (access-controlled) door.

Our EEO Statement:

HiveWatch is an equal opportunity employer and we are committed to cultivating a work environment that supports, inspires, and respects all individuals. Our hiring practices are merit-based, and we do not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity/expression, marital status, age, disability, medical condition, genetic information, national origin, ancestry, military or veteran status, or any other protected characteristic.

Top Skills

AWS
Docker
GitHub Actions
Helm
IoT Edge
Kotlin
Kubernetes
Postgres
Python
Rust
Terraform
Terragrunt
TypeScript

The Company
HQ: El Segundo, CA
50 Employees
Year Founded: 2020

What We Do

Born from the frustration of experienced security professionals who understood the industry's biggest challenges, HiveWatch was created to solve what legacy systems couldn't. Security teams are drowning in video feeds and alarms while struggling with staff turnover and missed incidents. They're inundated with repetitive and mundane tasks that tie them down with low-impact, time-consuming work. This type of activity creates intense frustration, leads to complacency, burnout, and high turnover, and is ultimately a waste of company funds and resources.

As physical security grows and transforms, enabled by advancing AI and technology, it's critical for the industry to embrace this new era to improve the important work the people in physical security are doing. HiveWatch emerged as the antidote: a cloud-native platform that brings together disparate systems, reduces false alarms, and delivers actionable intelligence to security leaders so they can make better decisions. By removing monotonous and laborious tasks, security personnel can be upskilled and enabled to do more impactful, meaningful, and proactive work.

Why Work With Us

HiveWatch is a tech-forward, innovative, and inclusive organization leading the evolution of the physical security industry. We are a data-driven, tech-geek, and fun (at least we like to think so) group of people looking for more diverse perspectives to join us in defining what “keeping people safe” really means.
