Site Reliability Engineer II

Posted 17 Days Ago
Hiring Remotely in USA
Remote
147K-173K Annually
3-5 Years Experience
Security • Cybersecurity
The Role
The Site Reliability Engineer II at Abnormal Security is responsible for ensuring the prevention, detection, efficient remediation, and quick recovery from outages that impact the Abnormal Security Platform. They will build tools and processes for deployment operations, identify gaps in existing processes for incident prevention, establish metrics for system detection, define incident severity classification guidelines for remediation, and design tools for incident recovery. The ideal candidate should have a bachelor's degree in Computer Science or equivalent professional experience, at least 1 year of experience as a Site Reliability Engineer, and experience with public cloud providers, observability stack, and incident management tools.
Summary Generated by Built In

About The Role

Enterprises of all sizes trust Abnormal Security’s cloud products to stop cybercrime. These products must scale with the growth of our customers, and ensure reliability and availability by being resilient. This is where our SRE fits in, ensuring the prevention, detection, efficient remediation, and quick recovery from outages that impact the Abnormal Security Platform. 

Come empower the rest of engineering to stop cybercrime as we expand our offerings across both clouds and regions. 

There are a lot of opportunities for growth and career advancement – it’s up to you to own your career here. Some potential career paths for this role include: 

  • Positioning yourself to be a founding member of a team that will have an outsized impact on the rest of the company.
  • Growing into a Senior technical leadership role. 

What You Will Do 

  • Deployment Operations
    • Build tools and processes to standardize deployment of Abnormal Security product suite in a multi-datacenter setup.
    • Partner with R&D teams to develop pre and post deployment checklists, canary test environments and workflows, and safe rollback processes.
  • Incident Prevention
    • Identify gaps in existing processes and advocate for necessary changes to improve overall system stability and availability.
    • Lead the Production Readiness Review process to ensure the resilience of systems before customer deployment.
    • Oversee the Critical Change Management Review process for the safe application of changes to critical services.
    • Develop and enforce architecture guidelines to minimize downtime and ensure high system availability.
  • Detection
    • Establish consistent definition of metrics for “Is this product working”.
    • Define and monitor SLAs/SLOs for critical systems, actively tracking deviations and triggering alerts when necessary.
  • Remediation
    • Define incident severity classification guidelines and implement incident response protocols to promptly address issues and reduce downtime.
    • Facilitate effective communication between Engineering and Customer Success teams during incidents.
  • Incident Recovery
    • Design and implement tools to expedite system recovery and minimize the impact of incidents.
    • Develop guidelines for Post Mortems after incidents to prevent recurrence.

Must Have

  • Bachelor’s in Computer Science, Computer Engineering, or equivalent professional experience
  • 1+ experience as a Site Reliability Engineer, responsible for the reliability of shared services
  • Experience with a public cloud provider (AWS, Azure, GCP), observability stack (Prometheus, Grafana), and incident management tools (PagerDuty, Sentry, Slack integration).

Nice To Have 

  • Experience with defining and implementing SRE practices such as Change Management, Production Readiness Review, and Incident Post Mortems.
  • Experience with container orchestration, preferably Kubernetes and Helm.
  • Experience developing Infrastructure as Code (IaC) modules and building automation, preferably Terraform.


#LI-NT1


At Abnormal Security certain roles are eligible for a bonus, restricted stock units (RSUs), and benefits. Individual compensation packages are based on factors unique to each candidate, including their skills, experience, qualifications and other job-related reasons. We know that benefits are also an important piece of your total compensation package. Learn more about our Compensation and Equity Philosophy on our Benefits & Perks page.

Base salary range:

$147,200$173,200 USD

Top Skills

AWS
Azure
GCP
The Company
San Francisco, CA
175 Employees
On-site Workplace
Year Founded: 2018

What We Do

The Abnormal Security platform protects enterprises from targeted email attacks. Abnormal Behavior Technology (ABX) models the identity of both employees and external senders, profiles relationships and analyzes email content to stop attacks that lead to account takeover, financial damage and organizational mistrust. Though one-click, API-based Office 365 and G Suite integration, Abnormal sets up in minutes and does not disrupt email flow.
Abnormal Security was founded in 2018 by CEO Evan Reiser, CTO Sanjay Jeyakumar, Head of Machine Learning Jeshua Bratman, and Founding Engineers Abhijit Bagri and Dmitry Chechik. The team previously built behavioral profiling and machine learning technologies at Twitter, Google and Pinterest that are being applied to solve a problem that costs organizations $1 billion per year, according to the FBI. The Abnormal Security platform stops targeted phishing, business email compromise and account takeover attacks that have never been seen before.

Jobs at Similar Companies

MacPaw Logo MacPaw

SMM Specialist for Setapp

Information Technology • Security • Software • Cybersecurity • App development • Data Privacy
Remote
Hybrid
Kyiv, Kiev, UKR
550 Employees

Silverfort Logo Silverfort

Head of Global Channel & Field Marketing

Information Technology • Sales • Security • Cybersecurity • Automation
Remote
United States
357 Employees

Invoice Home Logo Invoice Home

Senior Ruby On Rails Software Developer

Fintech • Information Technology • Mobile • Software • Financial Services • Cybersecurity • SEO
Austin, TX, USA
20 Employees
120K-150K Annually

Similar Companies Hiring

Invoice Home Thumbnail
Software • SEO • Mobile • Information Technology • Fintech • Financial Services • Cybersecurity
Austin, TX
20 Employees
MacPaw Thumbnail
Software • Security • Information Technology • Data Privacy • Cybersecurity • App development
Cambridge, MA
550 Employees
Silverfort Thumbnail
Security • Sales • Information Technology • Cybersecurity • Automation
GB
357 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account