Site Reliability Engineer III (SRE III)

Reposted 11 Days Ago
Be an Early Applicant
Toronto, ON, CAN
Hybrid
Senior level
HR Tech
The Role
The Site Reliability Engineer III is responsible for ensuring system reliability, performance, and automation in cloud infrastructure while mentoring junior engineers and driving operational excellence.
Summary Generated by Built In
Who We Are:

At Emburse, you’ll not just imagine the future – you’ll build it. As a leader in travel and expense solutions, we are creating a future where technology drives business value and inspires extraordinary results. Our AI-powered platform helps organizations modernize financial operations, increase visibility, and optimize spend across the enterprise.

The Site Reliability Engineer III (SRE III) plays a critical role in ensuring Emburse’s systems are highly available, scalable, and performant. This role blends deep technical expertise with strong collaboration and leadership skills to drive operational excellence across distributed systems. The ideal candidate is passionate about automation, cloud infrastructure, observability, and continuous improvement, while mentoring junior engineers and driving reliability culture across the organization

What you will do :

    Service Reliability & Performance

    • Proactively identify, evaluate, and implement preventative measures to reduce customer impact.
    • Ensure all services are designed and operated with 24/7 availability, scalability, and resilience in mind.
    • Monitor, troubleshoot, and provide visibility to improve site latency, performance, and uptime.
    • Engineering Excellence & Automation

      • Design, develop, and automate reliable cloud infrastructure and platform services.
      • Apply Infrastructure-as-Code (IaC) principles to manage large-scale distributed systems.
      • Write and maintain scripts, tools, and automation frameworks to support operational efficiency.
      • Partner with engineering leadership to develop solutions enabling developer productivity and remove cross functional dependencies.
      • Collaboration & Process Development

        • Collaborate with Platform Engineering  teams on project definitions, requirements, backlog grooming, and planning processes.
        • Align operational goals with product and engineering roadmaps to ensure reliability requirements are met early in the lifecycle.
        • Define non-functional requirements (NFRs) and influence standards for scalability, observability, and fault tolerance.
        • Lead cross-functional troubleshooting of complex issues spanning applications, infrastructure, databases, and networks.
        • Leadership & Mentorship

          • Serve as a technical mentor to SRE I and II engineers, guiding them in best practices for reliability, automation, and incident management.
          • Lead root cause analysis and postmortem reviews, driving continuous improvement initiatives.
          • Support offshore and distributed teams, promoting effective collaboration and communication.
          • Participate in design and architecture reviews, offering technical recommendations and documentation for key stakeholders

What we are looking for :

    Education:

    • Required: Bachelor’s degree in Computer Science or a STEM field
    •  

      Experience:

      • Minimum 6 years of experience in an engineering or operations role with a focus on reliability, scalability, and automation.
      • Certifications:

        • Preferred: Certified Kubernetes Administrator (CKA) and/or AWS Certification
        •  

          Additional Eligibility Qualifications

          Required Skills:

          • Strong proficiency in Linux-based distributed environments (up to 70% hands-on work).
          • Deep experience with cloud platforms (AWS or Azure) and Infrastructure-as-Code (Terraform).
          • Excellent scripting skills (Python, Bash, Powershell); object-oriented programming experience is a plus.
          • Demonstrated ability to develop and maintain internal tools and automation solutions.
          • Excellent written and verbal communication skills in English.
          • Strong project management and organizational abilities with a bias for action.
          • Experience collaborating with offshore or globally distributed teams.
          • Expertise in containerization and orchestration technologies (Docker, Kubernetes).
          • Experience with Kubernetes scaling tooling (Karpenter, KEDA).
          • Strong understanding of DevOps principles and modern CI/CD pipelines.
          • Experience with observability stacks (Prometheus, Grafana, OpenTelemetry).
          • Familiarity with self-healing systems, and site reliability best practices.
          • Background in SaaS environments or large-scale distributed applications.
          • Analytical thinker with a focus on root-cause problem solving.
          • Self-starter with a strong ownership mentality and accountability.
          • Mentor and collaborator who uplifts teams and promotes learning culture.
          • Committed to operational excellence and continuous improvement.

Why Emburse?

Finance is changing—and at Emburse, we’re leading the way. Our AI-powered solutions help organizations eliminate inefficiencies, gain real-time visibility, and optimize spend—so they can focus on what’s next, not what’s slowing them down.
A Company with Momentum – We serve 12M+ users across 120 countries, helping businesses modernize
 their finance operations.
A Team That Innovates – Work alongside some of the brightest minds in finance, tech, and AI to solve real-
 world challenges.
A Culture That Empowers – Competitive pay, flexible work, and an inclusive, collaborative environment that
 supports your success.
A Career That Matters – Your work here drives efficiency, innovation, and smarter financial decision-making
 for businesses everywhere. 

Shape your future & find what’s next at Emburse. 

Emburse provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability or genetics. In addition to federal law requirements, Emburse complies with applicable state and local laws governing nondiscrimination in employment in every location where the company has facilities. This policy applies to all terms and conditions of employment.

Skills Required

  • Bachelor's degree in Computer Science or a STEM field
  • Minimum 6 years of experience in an engineering or operations role focusing on reliability, scalability, and automation
  • Certified Kubernetes Administrator (CKA) and/or AWS Certification
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Portland, ME
832 Employees
Year Founded: 2020

What We Do

Emburse humanizes work by empowering business travelers, finance professionals and CFOs to eliminate manual, time-consuming tasks so they can focus on what matters most. Emburse brings together some of the world’s most powerful and trusted expense and AP automation solutions, including Abacus, Captio, Certify, Chrome River, Nexonia and Tallie. The company’s innovative offerings, which are uniquely tailored for specific industries, company sizes, and geographies, are trusted by more than 4.5 million users in more than 120 countries. Over 14,000 customers, from start-ups to global enterprises, including Boot Barn, Grant Thornton, Telefónica, Lufthansa Systems, and Toyota rely on Emburse to make faster, smarter decisions, empower business travelers to recapture lost nights and weekends spent doing tedious expense management, and help make users’ lives -- and their businesses -- better.

Similar Jobs

McCain Foods Logo McCain Foods

Senior Site Reliability Engineer

Food • Retail • Agriculture • Manufacturing
In-Office
Toronto, ON, CAN
20000 Employees
103K-137K Annually

Sectigo Logo Sectigo

Site Reliability Engineer

Information Technology • Internet of Things • Machine Learning • Software
Hybrid
Ottawa, ON, CAN
406 Employees
80K-100K Annually
In-Office
Toronto, ON, CAN
6000 Employees
136K-187K Annually

Newton.co Logo Newton.co

Site Reliability Engineer

Blockchain • Financial Services • Cryptocurrency • Web3
In-Office or Remote
Toronto, ON, CAN
77 Employees

Similar Companies Hiring

RethinkFirst Thumbnail
Telehealth • Software • Professional Services • Information Technology • HR Tech • Healthtech • Edtech
New York, NY
300 Employees
Empathy Thumbnail
Fintech • Healthtech • HR Tech • Information Technology • Financial Services • Telehealth
New York, NY
180 Employees
Compa Thumbnail
Artificial Intelligence • HR Tech • Software • Business Intelligence
Irvine, CA
75 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account