We are monday.com, a global software company transforming how businesses run. Our product suite can adapt to the needs of diverse industries and use cases within one powerful platform, empowering ~245,000 customers worldwide to reimagine how work gets done, drive greater efficiency, and scale like never before.
With over 2,500 employees across the globe, we grow by prioritizing transparency and knowledge sharing. We care about the impact you make, not the hours you clock, so we encourage initiative, ownership, and fresh thinking. We back our people with flexible work, wellness and mental health support, and a work environment built on collaboration.
The R&D Team is passionate about building innovative and lovable products while tackling complex engineering problems at scale. We’re accountable for bringing the company’s vision to life by turning our plans into flawless execution and encouraging full ownership and independence in all projects. Our reliability engineering team will focus on building tools that ensure the robustness and reliability of our production environment while supporting high-paced development cycles and fast engineering growth.
About The Role
- Maintain a comprehensive understanding of our service architecture and its dependencies.
- Identify and mitigate risks associated with tightly coupled services and complex interconnections.
- Lead service re-architecture initiatives to improve reliability and scalability.
- Review new services and ensure they meet our reliability standards.
- Advocate for Chaos Engineering: collaborate with R&D teams, build tools and environments, and improve system resilience.
- Manage the full lifecycle of reliability tools and services, adhering to our architectural guidelines.
- Collaborate with teams to define and monitor Service Level Indicators (SLIs) and Service Level Objectives (SLOs) that align with business goals and user expectations.
- Our Stack: Kubernetes, Datadog, Chaos Mesh, AWS, Terraform, CDKTF
Requirements
- Proven experience with Kubernetes (k8s) and Linux administration and internals.
- Proven experience with microservice architectures and reliability engineering.
- Deep understanding of reliability concepts (e.g., SLOs, SLIs, and service interconnections).
- Strong background in incident response and resilience efforts.
- Ability to collaborate across teams to drive reliability improvements.
- Proficiency in a programming language (e.g., Node.js, TypeScript, Go) with the ability to design and implement reliability tooling, including microservices and/or microfrontends.
- (Nice-to-have): Prior experience with chaos engineering.
Social Title
Reliability Engineer
Social Description
monday.com is looking for a Reliability Engineer to build new infrastructure-related features and directly impact the future of our product, from UI to database. You’ll join our Infrastructure Group in our Warsaw office.
Our Production Observability team will focus on building exceptional self-service logging and monitoring tools to support our high-paced development cycles and fast engineering growth.
Our Team
The Infra Team plays a crucial role as our company scales and our user base grows, taking on all aspects of product and infrastructure challenges. We focus on development-flow productivity, application infrastructure, and production resilience. We face huge challenges related to the hyper-growth of engineering, application, and data scale.
What We Do
monday.com is a work operating system that transforms the way teams work together. We’ve created a solution that connects people to workplace processes and promotes a culture of transparency & empowerment. We're obsessed with building an excellent product. Our goal is to create a work operating system that people will love to use: one that’s fast, beautiful & responsive.
Why Work With Us
At monday.com, we believe in transparency, accountability, and impact. Together, those values have created a strong culture of professional and creative autonomy where every team member is encouraged to share ideas and help bring them to life!
Hybrid Workspace
Employees engage in a combination of remote and on-site work.
monday.com embraces a flexible work environment with our hybrid model!






