Platform Reliability & Observability Lead (SRE)

Posted Yesterday
Be an Early Applicant
Hiring Remotely in Bangalore, Bengaluru Urban, Karnataka, IND
In-Office or Remote
Expert/Leader
Biotech
The Role
The Platform Reliability & Observability Lead (SRE) enhances operational excellence by ensuring reliability, managing observability strategies, automation, and incident management across cloud environments.
Summary Generated by Built In

Job Overview:

The Platform Reliability & Observability Lead (SRE) will own and elevate the reliability, availability, and operational excellence of its hosting and platform services. This is an engineering led role, accountable for measurable reliability outcomes across cloud and hybrid environments supporting regulated clinical workloads. The role leads observability strategy, SLO and error budget programs, incident automation, and root cause engineering, ensuring platforms are resilient, predictable, compliant, and scalable. This position is critical to enabling Operational Excellence, Embedded Quality, Financial Discipline, and Customer Trust.

Summary of Responsibilities:

  • Engineer reliability into hosting and platform services through design reviews, resilience patterns, and readiness assessments.
  • Define and enforce standards for availability, latency, durability, recoverability, and scalability.
  • Own end‑to‑end observability strategy, including metrics, logs, traces, alerting, dashboards, and service health reporting.
  • Establish and operationalize SLIs, SLOs, and error budgets to guide prioritization, release readiness, and risk decisions.
  • Design and automate incident detection, triage, mitigation, rollback, and diagnostics to improve MTTD and MTTR.
  • Lead blameless post‑incident reviews, identify systemic issues, and drive remediation to closure.
  • Reduce operational toil through automation, engineering rigor, and self‑service tooling.
  • Partner with cloud, hosting, IaC, and application teams to embed reliability into the SDLC.

Qualifications (Minimum Required):

  • Bachelor’s degree in computer science, Computer Engineering, or a related field.
  • Excellent communication and public speaking skills, with the ability to present complex architectural concepts to senior leadership, technical teams, and non‑technical stakeholders.
  • Fortrea may consider relevant and equivalent experience in lieu of educational requirements.

Required skills (Minimum Required):

  • 9+ years in Site Reliability Engineering, Platform Engineering, or Production Engineering.
  • Proven ownership of production reliability in cloud or hybrid platforms.
  • Strong foundations in distributed systems, Linux, networking, and system internals.
  • Hands‑on experience with observability architectures and alerting best practices.
  • Strong expertise in SLIs, SLOs, SLAs, and error budgets.
  • Proficiency in Python, Go, Java, or equivalent, with a strong automation mindset.
  • Experience with Azure (preferred), AWS, or GCP
  • Experience with Kubernetes and Infrastructure as Code (Terraform, Bicep, ARM, etc.)

Preferred Qualifications Include:

  • Regulated or GxP environments.
  • Open Telemetry, distributed tracing, and service dependency mapping.
  • Chaos engineering, DR testing, or resilience validation.
  • FinOps and cost‑aware reliability engineering.
  • Building shared reliability or observability platforms.

Physical Demands / Work Environment:

  • Remote-Based, as requested by the line manager

  • Work Timings: 2:00 PM IST to 11.00 PM IST

Learn more about our EEO & Accommodations request here.

Top Skills

Arm
AWS
Azure
Bicep
GCP
Go
Java
Kubernetes
Python
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Research Triangle Park, NC
10,811 Employees

What We Do

Fortrea (Nasdaq: FTRE) is a leading global provider of clinical development and patient access solutions to the life sciences industry. We partner with emerging and large biopharmaceutical, medical device and diagnostic companies to drive healthcare innovation that accelerates life changing therapies to patients in need. Fortrea provides phase I-IV clinical trial management, clinical pharmacology, differentiated technology-enabled trial solutions and post-approval services. Fortrea’s solutions leverage three decades of experience spanning more than 20 therapeutic areas, a passion for scientific rigor, exceptional insights and a strong investigator site network. Our talented and diverse team working in more than 90 countries is scaled to deliver focused and agile solutions to customers globally. Learn more about how Fortrea is becoming a transformative force from pipeline to patient at Fortrea.com.

Similar Jobs

GitLab Logo GitLab

Back-end Engineer

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
India
2500 Employees

GitLab Logo GitLab

Senior Back-end Engineer

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
India
2500 Employees

GitLab Logo GitLab

Back-end Engineer

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
India
2500 Employees

GitLab Logo GitLab

Senior Back-end Engineer

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
India
2500 Employees

Similar Companies Hiring

Formation Bio Thumbnail
Pharmaceutical • Healthtech • Biotech • Big Data • Artificial Intelligence
New York, NY
140 Employees
SOPHiA GENETICS Thumbnail
Software • Healthtech • Biotech • Big Data • Artificial Intelligence
Boston, MA
450 Employees
Pfizer Thumbnail
Pharmaceutical • Natural Language Processing • Machine Learning • Healthtech • Biotech • Artificial Intelligence
New York, NY
121990 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account