Weekday, Inc.

Site Leader

Hiring Remotely in Poland

Remote

Expert/Leader

Artificial Intelligence • HR Tech • Professional Services • Software

The Role

Lead SRE and Infrastructure teams to design, deploy, and maintain scalable, highly available cloud systems. Define the reliability strategy and SLAs/SLOs/SLIs; drive incident management, automation, and observability; and manage capacity planning and cost optimization. Mentor engineering leaders while ensuring adherence to cloud, DevOps, and security best practices.

Summary Generated by Built In

This role is for one of Weekday's clients.
Min Experience: 10 years

Location: Poland (Remote)

Job Type: Full-time

We are seeking a highly experienced and driven Site Leader with a strong background in Site Reliability Engineering (SRE) and Infrastructure to lead and scale our engineering operations. This role is ideal for a seasoned Engineering Manager who thrives at the intersection of leadership, system reliability, and large-scale infrastructure management. As a Site Leader, you will be responsible for building resilient systems, managing high-performing teams, and ensuring the availability, scalability, and performance of mission-critical platforms.

Requirements

Key Responsibilities

Lead and manage SRE and Infrastructure teams, driving operational excellence and fostering a culture of reliability and accountability.
Define and execute the overall infrastructure and reliability strategy aligned with business goals.
Oversee the design, deployment, and maintenance of scalable, highly available, and secure systems.
Establish and monitor SLAs, SLOs, and SLIs, ensuring consistent service performance and uptime.
Drive incident management processes, including root cause analysis, postmortems, and continuous improvement initiatives.
Collaborate with product and engineering teams to embed reliability and scalability into the development lifecycle.
Champion automation, observability, and proactive monitoring to minimize downtime and improve system health.
Manage infrastructure costs, capacity planning, and resource optimization.
Mentor and develop engineering managers and senior engineers, building a strong leadership pipeline.
Ensure adherence to best practices in cloud infrastructure, DevOps, and security compliance.

Required Skills & Qualifications

10–15 years of experience in software engineering, infrastructure, or SRE, with at least 3–5 years in an Engineering Manager or leadership role.
Proven expertise in Site Reliability Engineering (SRE) principles, including reliability, scalability, and fault tolerance.
Strong experience with cloud platforms (such as AWS, GCP, or Azure) and modern infrastructure architectures.
Deep understanding of infrastructure as code (Terraform, CloudFormation), CI/CD pipelines, and containerization technologies (Docker, Kubernetes).
Demonstrated ability to lead and scale distributed engineering teams.
Strong problem-solving skills with a focus on system-level thinking and root cause analysis.
Experience with monitoring and observability tools such as Prometheus, Grafana, ELK stack, or similar.
Excellent stakeholder management and communication skills, with the ability to influence cross-functional teams.

Preferred Qualifications

Experience managing large-scale, high-traffic production systems.
Background in DevOps transformation and cloud-native architecture.
Familiarity with security best practices and compliance frameworks.

The Company

Year Founded: 2021

What We Do

Weekday is an AI-powered recruitment platform that helps startups hire top-tier engineering and product talent. By leveraging a massive database of white-collar professionals and advanced outreach tools, the company streamlines the hiring process through automated sourcing, AI-driven resume screening, and white-glove contingency services. Their mission is to modernize recruitment by enabling companies to discover and engage passive candidates efficiently, ensuring high-quality hires for critical roles.