Senior Site Reliability Engineer

Posted Yesterday
Hiring Remotely in United States
Remote
183K Annually
Senior level
Artificial Intelligence • Real Estate
The Role
As a Senior Site Reliability Engineer, you will enhance platform reliability and observability, streamline incident response, improve cloud infrastructure, and collaborate across teams to drive operational excellence.
Summary Generated by Built In

TL;DR

As an SRE at Snappt, you will:

  • Own the reliability, scalability, and observability of our platform
  • Build out monitoring, alerting, and incident response processes so we catch issues before our customers do
  • Standardize infrastructure provisioning (Terraform, Atlantis) and drive cloud cost visibility
  • Partner with product engineering to improve developer productivity, CI/CD reliability, and cloud infrastructure setup
  • Lead initiatives across security, compliance, and cross-functional engineering culture

Who we are

We are a Series A, well-funded tech startup that is kicking ass and taking names. In just two short years, we have captured nearly 10% market share… because our fraud detection technology is saving our customers millions, soon to be billions, of dollars.

We are an entrepreneur-led organization, passionate about building a company with awesome people and a relentless focus on the customer. And as cliché as it may sound, we walk the walk and stand behind our Core Values and how we treat one another.

We are a team of 90, soon to be 100+. If you’re like us and think you have what it takes to join us, keep reading!

What we do

Snappt is on a mission to bring trust back to renting. As a fast-growing PropTech company, we help multifamily operators stop fraud before it happens and approve renters with confidence. Our Applicant Trust Platform™ combines advanced AI and human expertise to make leasing fair, transparent, and secure. Trusted by many of the top names in multifamily, Snappt is redefining how the industry protects communities and makes smarter, safer leasing decisions.

At Snappt, our values guide how we work and how we win together. We aim to Be Kind in every interaction, knowing collaboration thrives on respect. We Live Curiously, asking questions and experimenting fearlessly to drive innovation. We Embrace Play, because creativity flows when we make space for joy. And we always Give a Sh!t, holding ourselves to the highest standards and caring deeply about the outcomes we deliver. These values fuel our culture and make Snappt a place where big challenges are met with bold ideas and real impact.

Who you are

  • You are passionate; you throw your energy and conviction into the work you do.
  • You are naturally curious, with an innate desire to understand people, uncover opportunities for improvement, and proactively solve challenges that drive meaningful business outcomes.
  • You are collaborative and pragmatic; you thrive in environments where people work together to create winning solutions.
  • You have a commitment to self-development and personal growth; you pursue your personal and professional interests with energy and enthusiasm.
  • You assume positive intent; you believe your teammates and customers come from a place of good intention.
  • You thrive in ambiguity, turning unstructured problems into scalable systems and processes.

What you will do

  • Infrastructure & DevOps Tooling
    • Standardize Terraform/Atlantis usage and improve integration and documentation.
    • Streamline new-service bootstrapping to reduce time-to-first-deploy.
    • Manage overall cloud infrastructure health and availability as an enabler for product engineering teams.
    • Improve CI/CD reliability and developer experience across key platforms (e.g., fraud-platform).
  • Monitoring, Alerting & Incident Response
    • Build Datadog monitoring with meaningful thresholds, tags, and alerting logic.
    • Define and implement SLIs, SLOs, and SLAs across services.
    • Build platform health dashboards for leadership and team use.
    • Establish on-call rotations, severity levels, and incident response protocols.
    • Lead post-incident reviews and drive continuous improvement.
    • Coach product engineering teams to adopt incident response processes and procedures based on your best practices.
  • FinOps & Cloud Efficiency
    • Implement AWS cost tagging, allocation dashboards, and anomaly alerting.
    • Partner with teams to review spend and drive optimizations.
    • Support cost controls in Databricks and production ML workloads (including SageMaker integrations).
  • Security & Compliance Enablement
    • Collaborate on and drive initiatives to ensure compliance with the frameworks important to Snappt, including SOC 2, so that our practices maintain a high level of security and compliance.
    • Build alerting, monitoring, and operational processes to strengthen compliance, automating wherever practical.
  • Cross-Functional Leadership & Culture
    • Host workshops (on-call readiness, Terraform/Atlantis, incident response).
    • Lead monthly security/infra reviews with eng leaders.
    • Collaborate with EMs and engineers to prioritize and plan SRE initiatives.

What you bring

  • Required
    • Bachelor’s or Master’s in Computer Science, Computer Systems, Management Information Systems, or related field.
    • 5+ years’ experience in SRE, DevOps, or Infrastructure Engineering.
    • Strong skills in observability (Datadog, Prometheus, Grafana, or equivalent).
    • Experience defining/implementing SLIs/SLOs and incident management processes.
    • Proficiency with Terraform and CI/CD (Atlantis, GitHub Actions, CircleCI, etc.).
    • Hands-on AWS expertise (ECS, S3, RDS, Lambda, VPC networking).
    • Strong communication/documentation skills for cross-team collaboration.
  • Preferred
    • Experience with Databricks, SageMaker, or ML pipeline reliability.
    • Cloud cost optimization (AWS Budgets, Cost Explorer, anomaly detection).
    • Experience in high-compliance environments (SOC 2, HIPAA, PCI).
    • Background in developer enablement, DevEx metrics, or FinOps.
    • Familiarity with incident response tooling (PagerDuty, Opsgenie).

Where you’ll work

Snappt is a remote-first organization.

We actually don’t even have an office. So if you’re accountable and do high-quality work...do it from anywhere. You can also expect to see your teammates throughout the year IRL, at company-supported retreats and events.

We are an Equal Opportunity Employer

We believe the best ideas come from working with people of different backgrounds and unique perspectives.

We prohibit discrimination and harassment of any kind based on race, color, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other protected characteristic as outlined by federal, state, or local laws.

This policy applies to all employment practices within our organization, including hiring, recruiting, promotion, termination, layoff, recall, leave of absence, compensation, benefits, training, and apprenticeship. Snappt makes hiring decisions based solely on qualifications, merit, and business needs at the time.

Annual Base Salary: $183,000

Top Skills

AWS
CircleCI
Datadog
GitHub Actions
Grafana
Prometheus
Terraform

The Company
HQ: Los Angeles, CA
84 Employees
Year Founded: 2016

What We Do

Half of evictions are due to fraud. At $7,000 or more per eviction, this has become a significant problem for landlords. The best way to avoid evictions is to adequately screen applicants, yet the growing availability of online tools makes it easy for applicants to forge their financial documentation. These forged documents are often impossible to spot, and none of the current tools landlords use to check applicants can detect fraudulent financial documentation.

To address this need, Snappt provides a quick and inexpensive service that can accurately spot fraudulent documentation.
