AI DevOps & Reliability Engineer

Posted 3 Days Ago
Be an Early Applicant
Hiring Remotely in Vancouver, BC, CAN
In-Office or Remote
123K-160K Annually
Senior level
Mobile • Software • Analytics
The Role
Own and operate the delivery platform and embed with an engineering team to improve deployment automation, CI/CD, GitOps, IaC, and production reliability. Lead AI-augmented ops (runbooks, alerting, incident response), build environments, set standards, tune streaming and datastore infrastructure, and mentor teams while tracking DORA and reliability metrics.
Summary Generated by Built In

At Branch, we power every touchpoint with links that work and insights that prove it. From click to conversion, we make growth measurable. Our unparalleled attribution, backed by AI-enhanced linking, is trusted to deliver seamless experiences that increase ROI, decrease wasted spend, and eliminate siloed attribution.

We bring the same rigor to how we build our team, by empowering our people to move fast, own outcomes, and build something that matters. We take pride in making meaningful investments in our team’s health, wealth, and growth so individuals can thrive as we scale. Our culture values smart, humble, and collaborative teammates who take accountability and drive results in an environment where their work truly moves the business forward.

We are innovative, scaling with purpose, and led by seasoned leaders who know how to build enduring companies. Trusted by brands like Instacart, Western Union, NBCUniversal, ZocDoc, and Sephora, we’re big enough to matter, small enough for you to make a real impact. If you’re excited by the grit of building, rapid learning, and shaping the future of customer growth, you’ll find your place here.

About The Group

We're hiring an AI DevOps & Reliability Engineer to own how software ships and runs at Branch. The role has two areas: half central platform and standards work, half embedded with an engineering team. Centrally, you'll build and operate the delivery platform (CI/CD pipelines, deployment automation, environments) so teams can release safely, frequently, and on demand. Embedded, you'll work hands-on with an engineering team day-to-day on their infrastructure, deployment, and operational practices, mentoring them and building their capability over time.

You'll also lead the adoption of AI in DevOps and SRE work at Branch. Bringing modern AI tooling (Claude Code, agentic workflows) into runbook generation, alerting, incident response, and operational tooling is a core part of this role, not a side project. It's a strategic direction we're committed to.

As a lead, you'll work directly with engineering leadership to shape the operations and delivery roadmap across multiple milestones.

What You'll DoDelivery & Release Engineering
  • Design and expand deployment automation, advancing the org toward on-demand and continuous production releases.
  • Establish release practices and standards: progressive delivery, rollback, release tracking, deployment inventory teams can trust.
  • Extend automation deeper into production paths, reducing manual steps and release toil.
  • Enable verification through automation: quality gates as code, build engineering supports our efforts.
Pipelines & Guardrails
  • Own CI/CD standards across teams: quality gates, automated checks, guardrails that catch problems before production.
  • Build pipeline tooling that makes the safe path the easy path for engineers.
Environments
  • Design and build out dev, staging, and on-demand (ephemeral) environments that mirror production and spin up on request.
  • Treat environment provisioning as a product: fast, reproducible, self-service.
AI-Embedded Ops
  • Bring AI tooling into operations: automated runbook generation, intelligent alerting, AI-assisted incident response, operational tooling.
  • Help build an org-wide, AI-augmented ops practice and share patterns across teams.
  • This is a core part of the role, aligned with Branch's broader AI direction.
Infrastructure & GitOps
  • Champion Infrastructure as Code (Terraform / CloudFormation) for provisioning, configuration, and lifecycle management.
  • Drive GitOps-based delivery with Argo CD for secure, repeatable, scalable deployments across Kubernetes.
Operational Reliability
  • Bring a strong reliability foundation: alerting practices, on-call, runbooks, SLI/SLO definition, incident response.
  • Partner with engineering teams on the operational practices that keep their services healthy at high volume.
  • Operate and tune high-volume data infrastructure: streaming pipelines (Kafka) and SQL/NoSQL datastores under heavy production load.
  • Strengthen team-level runbooks, operational readiness, and production hygiene; feed improvements back into the platform.
Embedded Team Work
  • Embed with an assigned engineering team day-to-day, working hands-on with them on infrastructure, deployment, and reliability work.
  • Mentor team engineers on operational best practices, observability, and reliability.
  • Help build the team's capability over time so good practices stick.
Engineering Metrics
  • Stand up DORA metrics (lead time, deployment frequency, change failure rate, MTTR) and use them to target real improvements.
  • Make delivery and reliability health visible to teams and leadership.
Leadership & Partnership
  • Work with engineering leadership on the operations and delivery roadmap.
  • Drive cross-team adoption of standards and tooling through collaboration and influence.
What We're Looking For
  • Hands-on experience adopting AI into DevOps and SRE practices (Claude Code, Cursor, agents, or similar) to improve automation, debugging, and operational efficiency.
  • 7+ years in DevOps, platform, infrastructure, or related engineering roles, ideally in fast-scaling environments.
  • Strong hands-on Kubernetes and AWS experience.
  • Deep IaC experience (Terraform and/or CloudFormation) and the ability to set IaC standards for other teams.
  • Proven CI/CD architecture experience: pipelines, quality gates, release automation.
  • GitOps experience with Argo CD (or Flux) for Kubernetes delivery.
  • Hands-on experience operating streaming infrastructure (Kafka) in production.
  • Experience managing SQL and NoSQL datastores at high volume: performance, scaling, operational health.
  • Solid scripting/automation skills (Python, Bash, or similar).
  • Working knowledge of observability stacks: Prometheus, Grafana, PagerDuty (Loki / Alertmanager a plus).
  • Familiarity with on-call, incident response, SLI/SLO definition, and runbooks, and the operational practices that support them.
  • Strong collaborator and communicator. Comfortable working across teams, mentoring engineers, and driving alignment without authority.
Nice to Have
  • Progressive delivery (canary, blue/green) and feature-flag-driven release experience.
  • Cost / efficiency awareness in cloud infrastructure.
  • Broader data / streaming ecosystem exposure (Spark, schema management, CDC, etc.).
What Success Looks Like
  • Teams ship on demand — merge to prod in hours, no tickets, no waiting on you. Deploy frequency up a tier.
  • Faster without breaking — lead time and MTTR down while change-failure rate holds flat.
  • Platform does the work — safe path is the easy path; manual release steps trending to zero; envs self-service in minutes.
  • AI is in the ops loop — runbooks, alerting, incident response AI-assisted; patterns other teams reuse unprompted.
  • Capability sticks — embedded team owns its own deploy and reliability work after you rotate off.
  • Health is visible — DORA metrics instrumented for teams and leadership; roadmap driven by data.

This role is 100% remote in Canada. This role does not qualify for relocation or visa sponsorship. 

In accordance with applicable law, the following represents a reasonable estimated compensation range for this role: the estimated pay range for this role, if based in Canada is 123,000 CAD to 160,000 CAD. Please note that this information is provided for those hired in Canada only. Compensation for candidates outside of Canada will be based on the candidate’s specific work location. Actual compensation will be determined based on skills, experience, and geographic location and may be more or less than the amount shown above. This role additionally includes a 10% annual bonus tied to company goals. 

The salary range provided represents base compensation and does not include potential equity, which is available for qualifying positions. At Branch, we are committed to the well-being of our team by offering a comprehensive benefits package. From health and wellness programs to paid time off and retirement planning options, we provide a range of benefits for qualified employees. For detailed information on the benefits specific to your position, please consult with your recruiter.

Branch is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.

If you think you'd be a good fit for this role, we'd love for you to apply! At Branch, we strive to create an inclusive culture that encourages people from all walks of life to bring their unique, diverse perspectives to work. We aim every day to build an environment that empowers us all to do the best work of our careers, and we can't wait to show you what we have to offer!

A little bit about us: 

Branch is the leading provider of engagement and performance mobile SaaS solutions for growth-focused teams, trusted to maximize the value of their evolving digital strategies. The Branch platform provides a seamless experience across paid and organic, on all channels and platforms, online and offline, to eliminate friction and drive valuable action at the moments of highest intent. With Branch, businesses gain accurate mobile measurement and insights into user interactions, enabling them to drive conversions, engagement, and more intelligent marketing spend.

Branch is an award-winning employer headquartered in Mountain View, CA. World-class brands like Instacart, Western Union, NBCUniversal, Zocdoc and Sephora acquire users, retain customers and drive more conversions with Branch.

Candidate Privacy Information:
For more information on the data that Branch will collect through your application, and how we use, share, delete, and retain that information as part of our recruitment and employment efforts, please see our HR Privacy Policy.

Skills Required

  • Hands-on experience adopting AI into DevOps and SRE practices (Claude Code, Cursor, agents, or similar).
  • 7+ years in DevOps, platform, infrastructure, or related engineering roles.
  • Strong hands-on Kubernetes experience.
  • Strong hands-on AWS experience.
  • Deep Infrastructure as Code experience (Terraform and/or CloudFormation).
  • Proven CI/CD architecture experience: pipelines, quality gates, release automation.
  • GitOps experience with Argo CD (or Flux).
  • Hands-on experience operating streaming infrastructure (Kafka) in production.
  • Experience managing SQL and NoSQL datastores at high volume.
  • Solid scripting/automation skills (Python, Bash, or similar).
  • Working knowledge of observability stacks: Prometheus, Grafana, PagerDuty (Loki / Alertmanager a plus).
  • Familiarity with on-call, incident response, SLI/SLO definition, and runbooks.
  • Strong collaborator and communicator, comfortable mentoring and driving alignment.
  • Progressive delivery (canary, blue/green) and feature-flag-driven release experience.
  • Cost / efficiency awareness in cloud infrastructure.
  • Broader data / streaming ecosystem exposure (Spark, schema management, CDC).

Branch (branch.io) Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Branch (branch.io) and has not been reviewed or approved by Branch (branch.io).

  • Fair & Transparent Compensation Pay is considered competitive for U.S. tech markets, with public recognition specifically for compensation and descriptions of strong total packages including base, equity, and benefits. Role- and location-based salary datapoints cited align with market-level offers for key technical roles.
  • Healthcare Strength Health coverage is described as comprehensive, including medical, dental, and vision, with employer-funded contributions and access to services such as One Medical and an Employee Assistance Program. Some public listings note employer-paid premiums for employees, reinforcing robustness of coverage.
  • Leave & Time Off Breadth Time off is often framed as unlimited or flexible PTO with paid holidays and sick time, supporting work–life balance. Parental leave is also present as part of the broader time-off framework.

Branch (branch.io) Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Mountain View, California
520 Employees
Year Founded: 2014

What We Do

Branch is on a mission: to power impactful experiences in the connected world. We build and provide software as a service for enterprise businesses to acquire, retain and engage their users, delivering linking and measurement solutions across all digital environments for flawless user journeys and foolproof campaign insights. Branch is an award-winning employer headquartered in Mountain View, CA. World-class brands like Instacart, Western Union, NBCUniversal, Zocdoc and Sephora acquire users, retain customers and drive more conversions with our solutions. Our people are our lifeblood, and every Branch employee strives to exemplify our core values: 1) Take your shot: Boldly take smart risks and seize opportunities to stay ahead. 2) Hustle with heart: Prioritize impact over activity and own meaningful outcomes. 3) Crush it together: Empathize with customers and deliver value for mutual success.

Similar Jobs

Coinbase Logo Coinbase

Senior Software Engineer

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Easy Apply
Remote
Canada
4700 Employees
191K-191K Annually

Coinbase Logo Coinbase

Senior Software Engineer

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Easy Apply
Remote
Canada
4700 Employees
191K-191K Annually

Coinbase Logo Coinbase

Staff Software Engineer

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Easy Apply
Remote
Canada
4700 Employees
218K-218K Annually

DraftKings Logo DraftKings

Senior Machine Learning Engineer

Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
Remote or Hybrid
Canada
6400 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account