Senior Cloud Site Reliability Engineer

Posted 11 Days Ago
Be an Early Applicant
Ottawa, ON
In-Office
Senior level
Information Technology
The Role
The Senior Cloud Site Reliability Engineer will ensure the health of Solace Cloud services, manage production incidents, optimize operations, and implement infrastructure tooling across multiple cloud platforms.
Summary Generated by Built In

Solace helps companies connect and integrate all of their assets through the power of event-driven architecture. Our technology makes it easy to unlock data silos and capture events occurring across large enterprises; stream information about those events everywhere it needs to be in real-time; and give the apps, AI agents and people who receive it the power to immediately react with decisive actions and smart decisions. 

  

Many of the world’s biggest companies trust Solace to modernize their IT infrastructure by embracing trends like AI, cloud and IoT so they can create awesome experiences for their customers, partners and employees. 

  

So, the next time you drive a car, order furniture online, fly in a plane, check your bank balance on your phone, your positive experience could be a direct result of our technology—and your hard work 
 

Overview 

This position is for a Senior Cloud Site Reliability Engineer. You will be responsible for the daily operations of
Solace Cloud, our market-leading SaaS offering, across leading cloud providers and platforms such as Amazon Web Services, Microsoft Azure, Google Cloud Platform, Kubernetes, etc. 

What You Will Do: 

  • Ensuring that the Solace Cloud Services are healthy and reliable, and that SLAs are being met 
  • Design and implement our infrastructure tooling, observability, and automation 
  • Contribute to making the production operations more efficient, less error-prone, etc. 
  • Expert-level knowledge in handling production Incidents in production-grade multi-cloud environments according to industry-standard Incident management process 
  • Process handling service requests and provisioning by the customers. 
  • Proven ability to manage customer escalations and drive resolution in mission-critical, high-impact production environments 
  • Work directly with customers to identify, troubleshoot, and resolve operational issues. 
  • Expert debugging knowledge in Linux and Kubernetes to detect operational issues. 
  • Be on-call rotation and provide 24x7 off-hours support 

 

Ideally, You Will Be: 

  • Highly technical, excited by technology, and eager to stay up to date in a rapidly evolving environment. 
  • Expert-level knowledge in Cloud Networking Solutions 
  • Knowledgeable in demonstrating the ability to debug at a system level and resolve incidents in complex cloud-based environments 
  • Expert in Site reliability engineering and Incident response 
  • A strong communicator who can articulate complex technical issues clearly and concisely & get on the phone with customers. 
  • Experienced in SaaS operations and customer-facing technical support 

 

Required Skills: 

  • Proven expertise with public cloud providers (AWS, Azure, GCP) services & features
  • Proven expertise with cloud Kubernetes infrastructure platforms such as AWS Elastic Kubernetes Service, Azure Kubernetes Service, Google Kubernetes Service 
  • Hands-on experience with Monitoring tools like Datadog, Kibana, Prometheus etc. 
  • Hands-on experience with Infrastructure Automation using Terraform, Cloud Formation 
  • Hands-on expertise in debugging production alerts  
  • Expert-level understanding of Linux Operating Systems 
  • Programmer in languages such as Groovy, Python, and Go 
  • Certified Kubernetes Administrator 
  • Certified Cloud Administrator (AWS, Azure, or GCP) 

 

Why You’ll Want to Join Us at Solace 

  • We have an awesome team! You’ll get to work with some of the smartest individuals in the business. 
  • We believe in work-life balance, and that it’s important to love what you do. 
  • We have adopted a hybrid work model to create an inclusive environment for everyone. 
  • We live by our values every day: craftsmanship, trust, courage, freedom, momentum, humility, and human experience.  
  • Our training programs are top-notch. 
  • We like to brag about our stellar customer lineup! 
  • We are social – we like to keep things simple and fun! 
  • We are one of the top-ranked employers on Glassdoor. 
  • We have a sense of humour and make cool videos on cool topics like MITT and this! 

  

We understand that experience takes on various shapes and sizes. Not sure you meet all the requirements? We still want to hear from you! Your unique experience could be exactly what we are looking for. 

  

At Solace, we believe that diversity and inclusion drive innovation and growth, both in business and in life. We strive to create an enriching and safe workplace where you can be who you are. If you want to do the best work of your career and feel supported every step of the way, we encourage you to join us! 

  

Accommodations are available upon request for anyone taking part in the hiring process. Let us know how we can help! We thank all candidates for their interest, however, only those selected to continue in the selection process will be contacted. 

 

Top Skills

AWS
Azure
Cloud Formation
Datadog
GCP
Go
Groovy
Kibana
Kubernetes
Prometheus
Python
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Ottawa, Ontario
603 Employees
Year Founded: 2001

What We Do

Solace helps large enterprises become modern and real-time by giving them everything they need to make their business operations and customer interactions event-driven. With PubSub+, the market’s first and only event management platform, the company provides a comprehensive way to create, document, discover and stream events from where they are produced to where they need to be consumed – securely, reliably, quickly, and guaranteed. Behind Solace technology is the world’s leading group of data movement experts, with over 20 years of experience helping global enterprises solve some of the most demanding challenges in a variety of industries – from capital markets, retail, and gaming to space, aviation, and automotive. Established enterprises such as SAP, Barclays and the Royal Bank of Canada, multinational automobile manufacturers such as Groupe Renault and Groupe PSA, and industry disruptors such as Jio use Solace’s advanced event broker technologies to modernize legacy applications, deploy modern microservices, and build an event mesh to support their hybrid cloud, multi-cloud and IoT architectures. Learn more at solace.com.

Similar Jobs

Gusto Logo Gusto

Staff Product Designer

Fintech • HR Tech
Easy Apply
Remote or Hybrid
6 Locations
146K-222K

CrowdStrike Logo CrowdStrike

Operations Analyst

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
6 Locations

Sonatus Logo Sonatus

Test Automation Engineer

Automotive • Software
Easy Apply
In-Office
Toronto, ON, CAN

Kraft Heinz Logo Kraft Heinz

Brand Manager

Big Data • Cloud • Food • Machine Learning • Software • Database • Analytics
Hybrid
Toronto, ON, CAN
144K-180K Annually

Similar Companies Hiring

Axle Health Thumbnail
Logistics • Information Technology • Healthtech • Artificial Intelligence
Santa Monica, CA
15 Employees
Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account