DoubleVerify

Sr. Site Reliability Engineer I

Reposted Yesterday

New York, NY, USA

In-Office

89K-178K Annually

Senior level

AdTech • Marketing Tech

The Role

The role involves enhancing the reliability and performance of media measurement platforms, managing incidents, implementing observability practices, automating processes, and ensuring high availability of cloud and on-premises infrastructures.

Summary Generated by Built In

Hybrid (3 days per week in office)

Who We Are

DV is the leader in digital performance solutions, helping our advertiser and agency partners Verify the quality of their digital campaigns, Optimise to improve performance and Prove that they’re achieving their business outcomes, through unbiased 3rd party data and analytics. DV’s mission is to be the definitive source of transparency and data-driven insights into the quality and effectiveness of digital advertising for the world’s largest brands, agencies, publishers, and digital ad platforms. Since 2008, DV has helped hundreds of Fortune 500 companies gain the most from their media spend by delivering best-in-class solutions across the digital advertising ecosystem, helping to build a better industry. Learn more at www.doubleverify.com.

What You’ll Do

Build and maintain the reliability, scalability, and performance of our digital media measurement platforms
Implement observability best practices, including metrics collection, dashboarding, and alerting strategies that support proactive reliability improvements
Reduce MTTR for critical incidents through automation, improved observability, and proactive monitoring
Respond to incidents and drive them to resolution, managing Sev1/Sev2 situations
Monitor and maintain high availability infrastructure and services across GCP, AWS, OCI, and on-premises environments
Lead technical projects from planning through deployment, ensuring proper stakeholder communication and team enablement.
Build and deploy automations to eliminate operational toil and improve efficiency across deployment workflows, validation scripts, and self-service capabilities
Leverage AI-assisted development tools to accelerate automation development and problem resolution
Build custom integrations and MCP servers for monitoring platforms to enable programmatic access and AI-driven analysis
Implement Infrastructure-as-Code using Terraform, Helm charts, Python and scripts, and configuration management tools to ensure repeatable, version-controlled infrastructure deployments
Develop production automations for routine operational tasks, reducing manual intervention and accelerating task completion
Create and maintain documentation, runbooks, and SOPs in Confluence to ensure consistent incident response across the team
Participate in on-call rotations and post-incident reviews to minimize downtime and prevent recurrence

Required Experience & Skills

4+ years in Site Reliability Engineering, DevOps, or related operational roles with proven experience in Linux/Unix systems administration
proficiency in scripting and programming languages such as Python, Bash, or Go for automation and tool development
Strong experience with cloud infrastructure and services across GCP, AWS, and OCI, as well as container orchestration tools like Kubernetes
Expertise in monitoring and observability tools such as Prometheus, Grafana, Splunk, Nagios,
Hands-on experience with Infrastructure-as-Code tools like Terraform, Ansible, or Helm
Proven ability to develop and track SLIs, SLOs, and SLAs to drive reliability improvements

Technical Knowledge

Deep understanding of networking, DNS, load balancing, and CDN technologies
Familiarity with databases (SQL, NoSQL, Vertica, MongoDB, Snowflake) and data pipeline technologies
Knowledge of CI/CD pipelines, GitLab, and deployment automation
Experience with workflow automation platforms is a strong plus

Soft Skills & Mindset

Exceptional communication skills with the ability to collaborate across teams and explain technical concepts clearly
Proactive problem-solving approach with a focus on automation and continuous improvement
Ownership mentality — you take full responsibility for complex challenges and reliably deliver outcomes
Trailblazing spirit — innovative use of AI, automation, and new technologies to solve problems and drive improvements
Passion for mentorship and knowledge sharing, elevating the capabilities of the entire team

Preferred Qualifications

Bachelor's or Master's degree in Computer Science, Engineering, or related field
Industry certifications such as
- AWS Certified DevOps Engineer
- Google Professional Cloud DevOps Engineer
- Certified Kubernetes Administrator (CKA), or Terraform/Grafana certifications
Experience with AI-assisted development using tools like ChatGPT, Cursor, Glean, or Copilot
Familiarity with security best practices in cloud and containerized environments

The successful candidate’s starting salary will be determined by a number of non-discriminatory factors, including qualifications for the role, level, skills, experience, location, and internal equity relative to peers at DV. The estimated salary range for this role, based on the qualifications set forth in the job description, is between $89,000.00 - $178,000.00. This role will also be eligible for bonus/commission (as applicable), equity, and benefits.

The range above is for the expectations as laid out in the job description; however, we are often open to a wide variety of profiles and recognize that the person we hire may be more or less experienced than this job description as posted.

Not-so-fun fact: Research shows that while men apply to jobs when they meet an average of 60% of job criteria, women and other marginalized groups tend to only apply when they check every box. So if you think you have what it takes but you’re not sure that you check every box, apply anyway!

Skills Required

4+ years in Site Reliability Engineering, DevOps, or related operational roles
Proficiency in scripting and programming languages such as Python, Bash, or Go
Strong experience with cloud infrastructure and services across GCP, AWS, and OCI
Expertise in monitoring and observability tools such as Prometheus, Grafana, Splunk, Nagios
Hands-on experience with Infrastructure-as-Code tools like Terraform, Ansible, or Helm

DoubleVerify Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about DoubleVerify and has not been reviewed or approved by DoubleVerify.

Fair & Transparent Compensation — Salary ranges are posted for U.S. roles and annual pay‑equity analyses are conducted, signaling structured and transparent pay practices. Pay for many technical and product roles is considered competitive with clear bands visible on postings.
Healthcare Strength — Health coverage is described as comprehensive, with medical, dental, vision, and global mental‑health resources. Wellness support includes designated mental wellness days and related activities.
Leave & Time Off Breadth — Self‑directed (unlimited) PTO expands flexibility beyond standard accruals. Quarterly wellness or recharge days further reinforce planned time away.

Learn more about DoubleVerify's Compensation & Benefits →

DoubleVerify Insights

What's It Like to Work at DoubleVerify? DoubleVerify Culture & Values DoubleVerify Career Growth & Development What's the Work-Life Balance Like at DoubleVerify? DoubleVerify Leadership & Management DoubleVerify Company Growth, Stability & Outlook

View all jobs at DoubleVerify

View DoubleVerify Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

HQ: New York, NY

721 Employees

Year Founded: 2008

What We Do

DV is powering the new standard of marketing performance, giving advertisers clarity and confidence in their digital investment. Built on best practices, DV solutions create value for media buyers and sellers by bringing transparency and accountability to the market, ensuring ad viewability, brand safety, fraud protection, accurate impression delivery and audience quality across campaigns to drive performance. Since 2008, DV has helped hundreds of Fortune 500 companies gain the most value out of their media spend by delivering best in class solutions across the digital ecosystem that help build a better industry. Learn more at doubleverify.com.