Site Reliability Engineer

Posted 8 Days Ago
Be an Early Applicant
Toronto, ON, CAN
In-Office
115K-130K Annually
Mid level
Information Technology • Software
The Role
Manage reliability of production systems, define and enforce SLOs, lead incident responses, automate deployment and infrastructure provisioning, and improve observability and scalability of cloud services.
Summary Generated by Built In

About Kaseya

Kaseya is the leading provider of AI-powered IT management and cybersecurity software, serving Managed Service Providers (MSPs) and internal IT organizations worldwide. Our comprehensive platform helps organizations efficiently manage, secure, and automate their IT environments, driving operational efficiency and long-term business success.

Backed by Insight Partners, a leading global software investor, Kaseya has experienced sustained double-digit growth and continues to expand its global footprint. Today, Kaseya supports customers in more than 20 countries and manages over 15 million endpoints worldwide.

Founded in 2000, Kaseya has built a culture centered around innovation, accountability, and results. We are a high-growth, high-performance organization that values individuals who are driven, adaptable, and committed to delivering exceptional outcomes for our customers and teammates alike.

At Kaseya, success comes from embracing challenges, moving with urgency, and continuously raising the bar. 

Kaseya is hiring a Site Reliability Engineer to keep our production systems healthy as we scale. You'll own the reliability of services that thousands of MSPs depend on every day. That means defining the SLOs we hold ourselves to, leading incidents when they happen, and building the automation that keeps things stable as we ship. The work is hands on, the on call rotation is real, and the environment runs heavily on AWS. If you treat reliability as a product instead of a chore, you'll fit in well here.

What You'll Do

  • Set, monitor, and enforce SLOs, SLIs, and error budgets that keep our systems reliable
  • Lead incident response, troubleshooting, and blameless postmortems that produce real fixes
  • Build and maintain automated deployment, configuration management, and infrastructure provisioning using Infrastructure as Code
  • Manage cloud and hybrid infrastructure with Terraform or CloudFormation, balancing cost, scalability, and resilience
  • Improve observability across systems through proactive monitoring, alerting, and dashboards that surface issues early
  • Partner with development teams to bake reliability into the SDLC, including deployment automation, capacity planning, and chaos engineering
  • Cut operational toil through automation, systems that recover themselves, and engineering solutions that scale
  • Support containerized and serverless workloads so they stay highly available and fault tolerant in production
  • Stay current on SRE, cloud, and observability practices and bring what works back to the team

Required Qualifications

  • 4 to 5 years of AWS production experience
  • IaC ownership with Terraform or CloudFormation, including state management
  • AWS ECS production experience (or strong Kubernetes background willing to ramp)
  • Active on call rotation with incidents led and postmortems written
  • Working fluency with SLOs, SLIs, and error budgets in production

Preferred Qualifications

  • Kubernetes production experience
  • Broader observability tooling (Datadog, Dynatrace, CloudWatch, Elasticsearch/Kibana)
  • Chaos engineering
  • AWS Lambda or serverless workloads
  • Ansible, Chef, or Puppet
  • DevSecOps work (vulnerability scanning, secrets management, SOC2 or ISO 27001)
  • Production database support (RDS, PostgreSQL, MySQL)
  • Open source contributions or public technical portfolio

The expected annual base salary for this role is CAD $115,000 to CAD $130,000. Final offer will depend on experience, skills, and internal equity. This posting is for an existing vacancy.

 

Additional information
Kaseya provides equal employment opportunity to all employees and applicants without regard to race, religion, age, ancestry, gender, sex, sexual orientation, national origin, citizenship status, physical or mental disability, veteran status, marital status, or any other characteristic protected by applicable law.

Skills Required

  • 4 to 5 years of AWS production experience
  • IaC ownership with Terraform or CloudFormation
  • AWS ECS production experience or strong Kubernetes background
  • Active on call rotation experience
  • Working fluency with SLOs, SLIs, and error budgets

Kaseya Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Kaseya and has not been reviewed or approved by Kaseya.

  • Leave & Time Off Breadth PTO is commonly described around 20–21 days per year plus standard holidays. Some indicate they can fully disconnect while on leave.
  • Equity Value & Accessibility Equity or option grants are available to many roles, offering potential upside beyond base pay. This exposure is presented as a meaningful component of total compensation for some roles.
  • Affordable Benefits The high‑deductible medical plan is described as having low or employer‑covered employee‑only premiums in some cases. This can reduce out‑of‑pocket costs for those who select the HDHP.

Kaseya Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Miami, FL
5,000 Employees
Year Founded: 2000

What We Do

Kaseya is a premier provider of unified IT management and security software for managed service providers (MSPs) and small to medium-sized businesses (SMBS). Through its customer-centric approach, Kaseya delivers best-in-breed technologies that allow organizations to efficiently manage, secure and backup IT. Kaseya offers a broad array of IT management solutions, including well-known names: Kaseya, IT Glue, RapidFire Tools, Spanning Cloud Apps, ID Agent, Graphus, RocketCyber, TruMethods and Unitrends. These solutions empower businesses to command all of IT centrally, easily manage remote and distributed environments, simplify backup and disaster recovery, safeguard against cybersecurity attacks, effectively manage compliance and network assets, streamline IT documentation and automate across IT management functions. Headquartered in Miami, Florida, Kaseya is privately held with a presence in over 20 countries.

Gallery

Gallery

Similar Jobs

Manulife Logo Manulife

Site Reliability Engineer

Fintech • Insurance • Financial Services
In-Office
Toronto, ON, CAN
32427 Employees
113K-210K Annually

TMS LLC Logo TMS LLC

Site Reliability Engineer

Information Technology • Internet of Things
In-Office
Toronto, ON, CAN
65 Employees

GitLab Logo GitLab

Site Reliability Engineer

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
In-Office or Remote
2 Locations
2500 Employees
104K-222K Annually
Hybrid
Toronto, ON, CAN
832 Employees

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account