As a Staff Site Reliability Engineer within the Core Reliability & Observability team, you will play a pivotal role in shaping the company’s observability strategy and ensuring our platform remains reliable, debuggable, and scalable. This role sits at the intersection of infrastructure, developer experience, and product engineering, with a particular focus on building and evolving the foundations of logging, metrics, tracing, and alerting across the organization.
You’ll act as a technical leader and strategic partner to SREs, software engineers, and product teams, guiding decisions, mentoring engineers, and driving cross-cutting initiatives that elevate our operational maturity.
What you will do- Lead the observability strategy across the platform, with an emphasis on building scalable, developer-friendly logging and tracing capabilities.
- Identify and lead large-scale cross-cutting reliability initiatives, including improvements to our incident detection, response, and postmortem analysis capabilities.
- Take part in the on-call rotation, and actively contribute to improving our on-call experience by refining alerting, reducing noise, and ensuring actionable telemetry.
- Serve as a mentor and technical coach to senior engineers, helping elevate the craft of reliability engineering across the company.
- Influence strategic decisions by providing technical guidance to leadership and representing the observability discipline in architectural reviews and platform discussions.
If you don’t meet all the requirements below but believe this opportunity matches your expectations and experience, we still encourage you to apply!
- Extensive experience (8+ years) in SRE, platform engineering, or infrastructure roles within cloud-native environments (preferably AWS, GCP, or Kubernetes-based).
- Deep expertise in observability tooling and architecture, such as:
- Logging: Fluent Bit, OpenTelemetry, Loki, Elasticsearch, Logstash, Vector
- Tracing: OpenTelemetry or proprietary APMs
- Metrics: Prometheus, Thanos, Datadog, or equivalent
- Strong systems engineering background with fluency in at least one backend programming language (e.g., Go, Python, Ruby).
- Proven ability to lead through influence: setting technical direction, driving consensus, and mentoring engineers across teams.
- Experience designing and operating high-scale telemetry pipelines and working with developers to improve instrumentation quality.
- Comfortable balancing long-term architecture work with fast, iterative improvements.
- Clear, concise communication skills—both written and verbal—with the ability to drive alignment in ambiguous environments.
- Additional health plan scheme with our partner Allianz
- A dedicated onboarding program - the Doctolib Academy
- Mental health and wellbeing offer in partnership with moka.care
- The Doctolib Parent Care Program, including extended parental leave, meet-ups and inspiring conferences
- A sports and wellness provider offering classes for all
- Subsidy for lunch
- A flexible workplace policy offering both hybrid and office-based mode
- Flexibility days allowing to work in EU countries and the UK 10 days per year
- 30min Phone screen with a Tech Recruiter
- 1h30 Technical interview (SRE)
- 1h30 System design interview
- 1h15 Manager interview
Similar Jobs
What We Do
Since Doctolib's creation in 2013, we have had one purpose: strive for a healthier world. 1. We aim to improve the daily lives of care teams by providing them with a new generation of technologies and services. 2. We aim to improve health for all, by offering a fast and frictionless journey for all care episodes, creating new ways for people to receive care and empowering them to become actors of their health. At Doctolib, we are honored to work in the healthcare field and we believe that innovation in healthcare should be handled differently. We apply 4 guiding principles in everything we do: 1. We create helpful solutions for care teams and people. 2. We serve everyone equally and create well-designed and accessible technologies. 3. We team up with our users to strive for a healthier world and act as one team. 4. We protect our users' privacy. It’s their health, their data. To achieve our purpose, we are assembling a team dedicated to improving healthcare, with a human-centric approach and an entrepreneurial mindset. www.doctolib.com
.png)
.jpg)






