Senior Site Reliability Engineer

Posted 16 Days Ago
Santa Clara, CA, USA
In-Office
140K-180K Annually
Senior level
Software
The Role
Lead the modernization of AWS cloud infrastructure, implement automation, ensure system reliability, and manage performance with a focus on security and incident response.
Summary Generated by Built In

LeanData helps the world’s fastest-growing companies automate, simplify, and accelerate revenue.

We are looking for a Senior Site Reliability Engineer to lead the strategic evolution of our cloud infrastructure. Reporting directly to the SVP of Engineering, this role is designed for a builder - someone who wants to move beyond maintenance and into the realm of architectural transformation.

You will have the autonomy to evaluate our existing AWS footprint and lead the charge in modernizing our environment. Your mission is to take a high-velocity system and implement the best practices, guardrails, and automated architectures that will support our next 10x of scale. You will be the primary authority on reliability, performance, and infrastructure security.

Please note: This is a hybrid role based in our Santa Clara, CA office, with an in-office schedule of two days per week – Monday and Wednesday.

Key Responsibilities
  • Architectural Modernization: Lead the design and implementation of a scalable, "Cloud-First" AWS architecture. You will drive the transition toward fully automated, state-of-the-art Infrastructure as Code (Terraform).

  • High Availability & Resilience: Design and implement robust Disaster Recovery (DR) and Business Continuity plans, moving our services toward a zero-downtime deployment model.

  • Performance & Capacity Engineering: Own the strategy for capacity planning and autoscaling. You will optimize our compute resources (EC2, Lambda) to handle bursty traffic patterns with precision and cost-efficiency.

  • Advanced Observability: Define our monitoring and alerting philosophy using New Relic for deep APM and system insights. Partner this with IncidentIO to ensure we catch and resolve issues before they impact customers.

  • Streamlined CI/CD: Partner with feature teams to refine Change Management and CI/CD pipelines, ensuring code moves from "commit" to "production" safely and predictably.

  • Cloud Security: Harden our network architecture and application security posture, including WAF management and secure service-to-service communication.

The Tech Stack
  • Cloud Infrastructure: AWS (EC2, Lambda, SQS, SNS, ALB, API Gateway, S3, WAF).

  • Observability & Incident Response: New Relic (APM/Infrastructure), IncidentIO.

  • Automation & Tools: Terraform, Redis/Elasticache, Shell Scripting, NPM/PM2.

  • Application Ecosystem: NodeJS, Python, C#, Angular, Apex.

  • Integration: Salesforce Managed Packages, MSFT Dynamics365.

Who You Are
  • Experienced Architect: 5+ years of experience in SRE, DevOps, or Systems Engineering, with a proven track record of managing complex AWS environments.

  • Proven Incident Commander: You demonstrate calm, decisive leadership during high-pressure outages. You have extensive experience running blameless postmortems and, crucially, driving the remediation work needed to prevent recurrence.

  • Observability Pro: You have deep experience configuring New Relic (or similar platforms) to create meaningful dashboards, SLIs, and SLOs.

  • Automation Advocate: You believe that manual intervention is a bug. You have deep experience with Terraform and a "Code-First" approach to infrastructure.

  • Strategic Problem Solver: You can look at a complex, "needs-based" architecture and formulate a clear, prioritized roadmap to move it toward industry best practices.

  • Collaborative Leader: You enjoy working with feature engineers to help them build "reliability-by-design" into their services.

  • Education: A Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent professional experience).

Why work at LeanData:

  • LeanData covers employee insurance premiums up to 90%

  • Stock options in LeanData for all full-time employees

  • Flexible PTO

  • 401K plan

Top Skills

Angular
Apex
AWS
C#
Elasticache
New Relic
Node.js
Npm
Pm2
Python
Redis
Shell Scripting
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Santa Clara, CA
240 Employees
Year Founded: 2012

What We Do

Today’s growth leaders are powering their B2B selling with LeanData, the gold standard in modern revenue orchestration and an essential element of the modern RevTech stack. The LeanData Revenue Orchestration Platform, powered by No-Code Automation, simplifies and accelerates the coordination of all the plays, people, and processes needed to transform buyer signals into buying decisions. With LeanData, revenue teams operate with precision and alignment, taking every change in stride and driving operational excellence that fuels compelling buyer experiences.

Similar Jobs

Crexi Logo Crexi

Senior Site Reliability Engineer

Real Estate • Sales • Software • PropTech
Easy Apply
Hybrid
Los Angeles, CA, USA
400 Employees
160K-214K Annually
Hybrid
3 Locations
1100 Employees
147K-278K Annually

MongoDB Logo MongoDB

Senior Site Reliability Engineer

Big Data • Cloud • Software • Database
Easy Apply
Remote or Hybrid
9 Locations
5550 Employees
127K-249K Annually
Remote or Hybrid
United States
1750 Employees

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account