Senior Site Reliability Engineer (Azure)

Posted Yesterday
5 Locations
Remote
150K-200K Annually
Senior level
Artificial Intelligence • Blockchain • Information Technology • Consulting
The Role
Lead design and build of production-grade Azure infrastructure using Terraform, ensuring scalable, secure, and repeatable deployments. Provide technical leadership, platform enhancements, observability and incident response improvements, and Tier 2 infrastructure support while collaborating with engineering, security, and product teams to meet enterprise readiness and feature parity goals.
Summary Generated by Built In
Senior Site Reliability Engineer (Enterprise Platform)

Location: US - Open to Europe if happy to overlap with EST

Remote | Full-time

Compensation: $150K - $200K

Our client is seeking a Senior Site Reliability Engineer (Azure) to architect and scale a robust infrastructure foundation for a high-growth distributed systems platform. This position is critical for ensuring that the platform operates as a secure, scalable, and production-ready environment capable of supporting complex enterprise use cases and high reliability standards.

The successful candidate will take a lead role in designing infrastructure from first principles, bridging the gap between product requirements and technical execution. This is a high-impact opportunity for a seasoned engineer to build greenfield Azure environments and establish operational excellence across a global ecosystem.

Key Responsibilities

  • Infrastructure Design: Architect and deploy secure, scalable Azure infrastructure tailored for production-grade distributed systems.
  • Automation & IaC: Develop and maintain Terraform-based infrastructure as code to enable repeatable, automated deployments across various environments.
  • Technical Leadership: Translate ambiguous product and customer requirements into structured technical architecture and actionable execution plans.
  • Platform Enhancement: Build and optimize platform services, APIs, and integrations to extend core system capabilities.
  • Cross-Functional Collaboration: Partner with engineering, security, and product teams to deliver enterprise-ready infrastructure solutions.
  • Operational Excellence: Drive improvements in reliability, observability, and incident response while providing Tier 2 infrastructure support for customer deployments.

Performance Goals: What Success Looks Like

In the first 6–12 months of this role, the following milestones are expected to be achieved:

  • Production Readiness: The Azure environment is established as a fully production-ready deployment setting for the platform.
  • Scalable Deployments: All customer deployments are verified as repeatable, scalable, and secure.
  • Feature Parity: Azure achieves full feature parity with all other supported cloud environments within the organization’s ecosystem.

Interview Process

  1. Recruiter & Technical Screening: Initial HR call followed by an introductory technical interview covering foundational questions.
  2. Hiring Manager Interview: A deeper dive into experience, alignment, and role-specific expectations.
  3. Technical Interview: A comprehensive evaluation of architectural and technical execution skills.
  4. Final Leadership Interview: A concluding session with the VP of Engineering.

Requirements
  • Proven Track Record: Extensive experience designing and building production-grade systems specifically on the Azure stack.
  • Problem Solving: Ability to transform high-level requirements into scalable, delivered systems.
  • Communication: Strong technical communication skills with the ability to interface with both engineering teams and non-technical stakeholders.
  • Mindset: A high-ownership approach with a strong bias for action and accountability.

Functional Expertise

  • Azure Services: Deep knowledge of Azure networking, compute, identity, security, and storage.
  • Infrastructure as Code: Advanced proficiency with Terraform at production scale.
  • Programming: Professional experience in Go and/or Python.
  • Systems Engineering: Background in distributed systems, high-availability architectures, or platform engineering.
  • CI/CD: Experience with automation tooling for the entire infrastructure lifecycle.

Preferred Qualifications

  • Hands-on experience with Kubernetes and container orchestration.
  • Familiarity with observability tools such as Prometheus and Grafana.
  • Experience with workflow/orchestration platforms like Argo or Spacelift.

Benefits

Our client offers a competitive compensation package designed to reward high-impact contributors, including:

  • Equity & Tokens: Participation in the long-term growth of the project.
  • Performance Bonuses: Annual incentives based on individual and company milestones.
  • Health & Retirement: Comprehensive health insurance and 401k plans (available for US-based employees).

Due to the high volume of applications we anticipate, we regret that we are unable to provide individual feedback to all candidates. If you do not hear back from us within 4 weeks of your application, please assume that you have not been successful on this occasion. We genuinely appreciate your interest and wish you the best in your job search.

Commitment to Equality and Accessibility:

At MLabs, we are committed to offer equal opportunities to all candidates. We ensure no discrimination, accessible job adverts, and providing information in accessible formats. Our goal is to foster a diverse, inclusive workplace with equal opportunities for all. If you need any reasonable adjustments during any part of the hiring process or you would like to see the job-advert in an accessible format please let us know at the earliest opportunity by emailing [email protected].

MLabs Ltd collects and processes the personal information you provide such as your contact details, work history, resume, and other relevant data for recruitment purposes only. This information is managed securely in accordance with MLabs Ltd’s Privacy Policy and Information Security Policy, and in compliance with applicable data protection laws. Your data may be shared only with clients and trusted partners where necessary for recruitment purposes. You may request the deletion of your data or withdraw your consent at any time by contacting [email protected].

Skills Required

  • Extensive experience designing and building production-grade systems on the Azure stack
  • Advanced proficiency with Terraform at production scale
  • Professional experience in Go and/or Python
  • Deep knowledge of Azure networking, compute, identity, security, and storage
  • Background in distributed systems, high-availability architectures, or platform engineering
  • Experience with CI/CD and automation tooling for the infrastructure lifecycle
  • Ability to translate high-level requirements into scalable technical systems
  • Strong technical communication and cross-functional collaboration skills
  • High-ownership mindset with bias for action and accountability
  • Hands-on experience with Kubernetes and container orchestration
  • Familiarity with observability tools such as Prometheus and Grafana
  • Experience with workflow/orchestration platforms like Argo or Spacelift
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
0 Employees
Year Founded: 2018

What We Do

MLabs is a premier consultancy specializing in functional programming, particularly Haskell and Rust, as well as blockchain, AI, and full-stack development. They provide expert staff augmentation, project specification, implementation, and maintenance for mission-critical software development. The company bridges the gap between traditional finance and digital assets, focusing on enterprise blockchain solutions and simplifying multi-chain DeFi for financial institutions and fintechs through a unified API.

Similar Jobs

360Learning Logo 360Learning

Client Success Partner - Enterprise

Artificial Intelligence • Cloud • Edtech • HR Tech • Sales • Software • Generative AI
Easy Apply
Remote
Spain
400 Employees

CrowdStrike Logo CrowdStrike

Regional Sales Manager

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
Spain
10000 Employees

Nexthink Logo Nexthink

Architect

Artificial Intelligence • Big Data • Cloud • Information Technology • Machine Learning • Software
Remote or Hybrid
Madrid, Comunidad de Madrid, ESP
1200 Employees

ServiceNow Logo ServiceNow

Architect

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Madrid, Comunidad de Madrid, ESP
29000 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account