Senior Software Engineer - Observability Visibility

Posted 5 Hours Ago
Easy Apply
New York, NY, USA
Hybrid
175K-240K Annually
Senior level
Artificial Intelligence • Cloud • Security • Software • Cybersecurity
We are building the monitoring and security platform for developers, IT ops teams and business users in the cloud age.
The Role
Define and implement observability and resilience baselines, measure compliance and risk, design scalable automation and AI-enabled tooling, and provide technical leadership and coaching to improve service reliability and engineering effectiveness across teams.
Summary Generated by Built In

The Observability Visibility SRE Team is part of the Observability and Resilience Enablement group within the SRE/Security organization. Observability and Resilience Enablement focuses on closing the loop between how Datadog engineers detect and respond to issues and incidents and how those learnings translate into measurable risk reduction and lower customer impact. The Observability Visibility team carries the organization's 100% visibility priority, defining observability and reliability baselines and ensuring services consistently meet them by default through scalable, automated, and sustainable solutions.

As a Senior Software Engineer on this team, you will help define, implement and evolve observability and resilience standards across Datadog's engineering organization. You will build systems, tooling, libraries, and automation that make observability and reliability the default experience for service owners, reducing operational risk while driving adoption and consistency. This role combines software engineering and site reliability engineering to drive measurable improvements in engineering effectiveness and service resilience. You will work closely with SRE, platform and product teams to identify gaps, deliver scalable solutions and ensure long-term coverage and compliance with established standards.

At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them.

What You'll Do:
  • Define and evolve observability and resilience baselines, ensuring alignment with measurable risk reduction goals across Datadog services.
  • Measure service compliance against established standards, assess risk and remediation complexity and drive sustainable solutions to close identified gaps.
  • Design and deliver scalable observability and reliability capabilities across the software development lifecycle, leveraging automation and AI-driven solutions where appropriate to enable service owners to meet established standards by default while partnering closely with platform, SRE, product and engineering teams to ensure adoption and sustained coverage.
  • Provide technical leadership and day-to-day coaching to team members, accelerating their growth through design reviews, collaborative problem-solving and operational excellence best practices.
Who You Are:
  • You have 5+ years of experience in software engineering, site reliability engineering, or a related discipline supporting production systems at scale.
  • You have hands-on experience with observability and resilience practices, including expertise in identifying, analyzing, and mitigating service and system failure modes.
  • You have strong programming skills in Go and/or Python and can design and build reliable, maintainable systems.
  • You are comfortable navigating complex technical challenges and proposing efficient, scalable, and easy-to-adopt solutions.
  • You have experience delivering AI-enabled software features end-to-end, including design, evaluation, deployment and monitoring and can articulate when AI is the appropriate solution and when it is not.
  • You have strong communication, collaboration, and mentorship skills with experience influencing technical direction across multiple engineering teams.

Datadog values people from all walks of life. We know not everyone will meet all the above qualifications on day one. That’s okay. If you’re passionate about technology and want to grow your experience, we encourage you to apply.

Benefits and Growth:
  • New hire stock equity (RSUs) and employee stock purchase plan (ESPP)
  • Continuous professional development, product training, and career pathing
  • Intradepartmental mentor and buddy program for in-house networking
  • An inclusive company culture and opportunities to participate in Community Guilds (Datadog employee resource groups)
  • Access to Inclusion Talks and internal learning opportunities
  • Free, global mental health benefits for employees and dependents age 6+
  • Competitive global benefits

Benefits and Growth listed above may vary based on the country of your employment and the nature of your employment with Datadog.
To conform to US export control regulations, candidates should be eligible for any required authorizations from the US government. This job is available in various departments within our company; to conform to US export control regulations, some of these roles may require candidates to be eligible for any required authorizations from the US government.

#LI-Hybrid

Datadog offers a competitive salary and equity package, and may include variable compensation. Actual compensation is based on factors such as the candidate's skills, qualifications, and experience. In addition, Datadog offers a wide range of best in class, comprehensive and inclusive employee benefits for this role including healthcare, dental, parental planning, and mental health benefits, a 401(k) plan and match, paid time off, fitness reimbursements, and a discounted employee stock purchase plan.

The reasonably estimated yearly salary for this role at Datadog is:
$175,000$240,000 USD

About Datadog: 

Datadog is the leading observability and security platform for the AI era, providing businesses with unified visibility across the technology stack to manage complexity at scale. It brings applications, infrastructure, data, models, and security into one place, using AI to detect and resolve issues before they impact customers. Trusted globally by Fortune 500 companies and high-growth AI leaders, Datadog enables businesses to move faster with clarity and confidence. Learn more about #DatadogLife on Instagram, LinkedIn, and Datadog Learning Center.

Equal Opportunity at Datadog:

Datadog is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and other characteristics protected by law. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. Here are our Candidate Legal Notices for your reference. 

Datadog endeavors to make our Careers Page accessible to all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please complete this form. This form is for accommodation requests only and cannot be used to inquire about the status of applications. 

Privacy and AI Guidelines:

Any information you submit to Datadog as part of your application will be processed in accordance with Datadog’s Applicant and Candidate Privacy Notice. For information on our AI policy, please visit Interviewing at Datadog AI Guidelines.

Skills Required

  • 5+ years experience in software engineering, site reliability engineering, or related discipline supporting production systems at scale.
  • Hands-on experience with observability and resilience practices, including identifying, analyzing, and mitigating failure modes.
  • Strong programming skills in Go and/or Python and ability to design and build reliable, maintainable systems.
  • Experience delivering AI-enabled software features end-to-end, including design, evaluation, deployment, and monitoring.
  • Strong communication, collaboration, and mentorship skills with experience influencing technical direction across multiple engineering teams.
  • Eligibility for any required authorizations from the US government to comply with export control regulations.

What the Team is Saying

Othmane
Angel
Emu
Tay
Norma
Sarah
LJ
Tammy
Olivia

Datadog Compensation & Benefits Highlights

  • Healthcare Strength Medical, dental, and vision coverage paired with dedicated mental‑health access (including free annual sessions for employees and dependents) and gender‑affirming care indicates robust healthcare support. Fitness reimbursements further reinforce preventative wellness.
  • Parental & Family Support Fully paid, gender‑neutral parental leave alongside family‑forming support (adoption, fertility, preservation, surrogacy) and childcare assistance shows strong backing for growing families. Pet‑related assistance in eligible offices expands family‑oriented offerings.
  • Equity Value & Accessibility An employee stock purchase plan with a discount and RSUs for many roles broaden access to ownership. This equity mix enhances total rewards beyond base pay.

Datadog Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York, NY
6,500 Employees
Year Founded: 2010

What We Do

Datadog (NASDAQ: DDOG) is a global SaaS business, delivering a rare combination of growth and profitability. We are on a mission to break down silos and solve complexity in the cloud age by enabling digital transformation, cloud migration, and infrastructure monitoring of our customers' entire technology stacks. Built by engineers, for engineers, Datadog is used by organizations of all sizes across a wide range of industries. Together, we champion professional development, diversity of thought, innovation, and work excellence to empower continuous growth. Join the pack and become part of a collaborative, pragmatic, and thoughtful people-first community where we solve tough problems, take smart risks, and celebrate one another.

Why Work With Us

At Datadog, we learn from and celebrate each other daily - each win is a team win. Datadogs solve tough problems, innovate pragmatically, and grow together. We promote from within, provide mentorship and opportunities for career development, and support our colleagues in the process. Best of all? We truly love what we do.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Datadog Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them and their team.

Typical time on-site: 3 days a week
HQNew York, NY
New South Wales
Company Office Image
MX
Amsterdam, NL
Bengaluru, IN
Company Office Image
Boston, MA
Denver, CO
Dublin, IE
Hanyang, KR
Lisbon, PT
United Kingdom
Madrid, ES
Company Office Image
Paris Office
San Francisco, CA
Singapore Office
Tokyo, JP
Learn more

Similar Jobs

Datadog Logo Datadog

Architect

Artificial Intelligence • Cloud • Security • Software • Cybersecurity
Easy Apply
Hybrid
New York, NY, USA
6500 Employees
146K-213K Annually

Datadog Logo Datadog

Senior Product Designer

Artificial Intelligence • Cloud • Security • Software • Cybersecurity
Easy Apply
Hybrid
New York, NY, USA
6500 Employees
161K-202K Annually

Datadog Logo Datadog

Account Executive

Artificial Intelligence • Cloud • Security • Software • Cybersecurity
Easy Apply
Hybrid
New York, NY, USA
6500 Employees
135K-150K Annually

Datadog Logo Datadog

Senior Sales Engineer

Artificial Intelligence • Cloud • Security • Software • Cybersecurity
Easy Apply
Remote or Hybrid
3 Locations
6500 Employees
149K-198K Annually

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account