Reliability Engineer

Sorry, this job was removed at 06:22 p.m. (CST) on Thursday, Jul 17, 2025
Be an Early Applicant
Wartha, Ząbkowicki, Województwo dolnośląskie
Hybrid
Productivity • Sales • Software
The Work OS that gives everyone the power to build and improve the way their organization runs.
The Role
Description

We are monday.com, a global software company transforming how businesses run. Our product suite can adapt to the needs of diverse industries and use cases within one powerful platform, empowering ~245,000 customers worldwide to reimagine how work gets done, drive greater efficiency, and scale like never before.

With over 2,500 employees across the globe, we grow by prioritizing transparency and knowledge sharing. We care about the impact you make, not the hours you clock, so we encourage initiative, ownership, and fresh thinking. We back our people with flexible work, wellness and mental health support, and a work environment built on collaboration.

The R&D Team is passionate about building innovative and lovable products, while tackling complex engineering problems at a great scale. We’re accountable for bringing the company’s vision to life by navigating our progress into flawless execution and encouraging full ownership and independence in all projects. Our reliability engineering team will be focused on building tools to ensure the robustness and reliability of our production environment together with high paced development cycles and fast engineering growth.


About The Role
  • Maintain a comprehensive understanding of our service architecture and its dependencies.
  • Identify and mitigate risks associated with tightly coupled services and complex interconnections.
  • Lead service re-architecture initiatives to improve reliability and scalability.
  • Review new services and ensure they meet our reliability standards.
  • Advocate for Chaos Engineering, collaborate with R&D teams, build tools/envs, and improve system resilience
  • Manage the full lifecycle of reliability tools and services, adhering to the comprehensive architectural guidelines
  • Collaborate with teams to define and monitor Service Level Indicators (SLIs) and Service Level Objectives (SLOs) that align with business goals and user expectations
  • Our Stack: Kubernetes, Datadog, Chaos Mesh, AWS, Terraform, CDKTF

Requirements
  • Proven k8s and Linux admin/internals experience.
  • Proven experience with microservice architectures and reliability engineering.
  • Deep understanding of reliability concepts (eg, SLOs, SLIs, and service interconnections).
  • Strong background in incident response and resilience efforts.
  • Ability to collaborate across teams to drive reliability improvements.
  • Proficiency in a programming language (e.g., Node.js, TypeScript, Go) with the ability to design and implement reliability tooling, including microservices and/or microfrontends.
  • (Nice-to-have): Prior knowledge with chaos engineering.

#LI-DNI


Social Title

Reliability Engineer


Social Description

monday.com is looking for a Reliability engineer to build new infrastructure-related features and directly impact the future of our product from UI to database. You’ll join our Infrastructure Group in our Warsaw office.

Our Production Observability team will be focused on building exceptional self service Logging and Monitoring tools to support our high paced development cycles and fast engineering growth.



Our Team

The R&D Team is passionate about building innovative and lovable products, while tackling complex engineering problems at a great scale. We’re accountable for bringing the company’s vision to life by navigating our progress into flawless execution and encouraging full ownership and independence in all projects. The Infra Team is a crucial piece as our company scales and user-base grows, conquering all aspects of product and infrastructure challenges. We are focused around development flow productivity, building application infrastructure and production resilience. We have huge challenges related to hyper growth of engineering, application and data scale.


Position Type
None

What the Team is Saying

Ruchita
Nate
Kyle
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York, NY
3,049 Employees
Year Founded: 2012

What We Do

monday.com is a work operating system that transforms the way teams work together. We’ve created a solution that connects people to workplace processes promoting a culture of transparency & empowerment. We're obsessed with building an excellent product. Our goal is to create a work operating system that people will love to use—one that’s fast, beautiful & responsive.

Why Work With Us

At monday.com we believe in transparency, accountability, and impact. Together, those values have lent themselves to create a strong culture of professional and creative autonomy where every team member is encouraged to share ideas and help bring them to life!

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

monday.com Teams

Team
Customer Experience
About our Teams

monday.com Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

monday.com embraces a flexible work environment with our hybrid model!

Typical time on-site: 3 days a week
HQNew York, NY
HQTel Aviv-Yafo, IL
Denver, CO
London, GB
Melbourne, VIC
São Paulo, BR
Sydney, NSW
Warsaw, PL
Learn more

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account