Senior Site Reliability Engineer (SRE) – Datadog Observability

Posted 19 Days Ago
5 Locations
In-Office
Senior level
Information Technology • Consulting
The Role
The role involves leading SRE initiatives focusing on Datadog observability, ensuring system reliability and scalability, and implementing best practices for incident management and automation.
Summary Generated by Built In

Job Title: Senior Site Reliability Engineer (SRE) – Datadog Observability
Experience Required: 8+ years overall in SRE and Infrastructure Operations, including at least 3 years of hands-on experience with Datadog
Location: Hyderabad preferred; Pune and remote are also options
Job Summary:
We are seeking an experienced Site Reliability Engineer (SRE) to lead end-to-end SRE implementation initiatives with a strong focus on Datadog Observability. The ideal candidate will bring deep technical expertise in building reliable, scalable, and observable systems, with hands-on experience in integrating enterprise applications and middleware.
Key Responsibilities:

 

  • Drive end-to-end SRE implementation, ensuring system reliability, scalability, and performance.
  • Design, configure, and manage Datadog dashboards, monitors, alerts, and APM for proactive issue detection and resolution.
  • Utilize the Datadog Roles API to create and manage user roles, global permissions, and access controls for various teams.
  • Collaborate with product managers, engineering teams, and business stakeholders to identify observability gaps and design solutions using Datadog.
  • Implement automation for alerting, incident response, and ticket creation to improve operational efficiency.
  • Work closely with business and IT teams to support critical Financial Month-End, Quarter-End, and Year-End closures.
  • Leverage Datadog AI
  • Provide technical leadership in observability, reliability, and performance engineering practices.

Required Skills and Experience:
 

  • 8+ years of experience in Site Reliability Engineering and Observability.
  • At least 3 years of hands-on experience with Datadog (dashboards, APM, alerting, log management, Roles API, and monitoring setup).
  • Proven experience implementing SRE best practices: incident management, postmortems, automation, and reliability metrics.
  • Excellent stakeholder management and communication skills; experience collaborating with business and IT teams.
  • Strong problem-solving mindset and ability to work in high-pressure production support environments.

Preferred Qualifications:
 

  • Certification in Datadog or related observability platforms.
  • Knowledge of CI/CD tools and automation frameworks.
  • Experience in cloud platforms (AWS, Azure, or OCI).
  • Exposure to ITIL-based production support processes.

Top Skills

Automation Frameworks
AWS
Azure
CI/CD Tools
Datadog
OCI
The Company
Salt Lake City, Kolkata
1,794 Employees
Year Founded: 2003

What We Do

A Trusted Partner for Every Digital Enterprise Bringing Value.

Jade Global is a global IT consulting company with two decades of industry experience that helps the world’s leading businesses and organizations build their digital core, optimize their operations, and accelerate revenue growth. Headquartered in San Jose, California, Jade Global has offices in 13 locations across North America, the UK, and Asia.

Renowned as a trusted "partner of choice" for businesses in Healthcare & Life Sciences, Hi-tech, Retail, Manufacturing, and Financial Industries, Jade Global has innovated 30+ industry-specific solutions.

Whether your focus is harnessing or expanding Gen-AI, AI, and digital capabilities, transforming operating models, or accelerating insightful decision-making, we’re here to help you gain and maintain a competitive edge with efficient, sustainable models.

At Jade Global, it’s all about outcomes, specifically your outcomes, and delivering the results you desire, tailored to your unique requirements.
