Sr. Site Reliability Engineer

Dallas, TX
Hybrid
Senior level
AdTech • Big Data • Marketing Tech • Software
Global leader in measurement & optimization integrating technology, data science, and services.
The Role
The Sr. Site Reliability Engineer will manage the Internal Developer Platform, focusing on reliability, automation, and development optimization, while mentoring engineers and improving service delivery.
Analytic Partners is a global leader in commercial measurement and optimization, turning data into expertise for the world’s largest brands for almost 25 years.
 
Our holistic approach to decisioning is powered by our industry-leading platform and team of experts, who help leaders make better decisions, faster – unlocking business growth and creating powerful customer connections.
 
With clients in 50+ countries and global offices across New York City, Miami, Dallas, Dublin, London, Paris, Singapore, Shanghai, Munich, Poznan, Sydney, Melbourne, Charlottesville and Denver, we’re growing fast. And we’re looking for top talent to join us in shaping the future of analytics.
 
To learn more about what we do, visit analyticpartners.com – and see why we’re recognized as a Leader in the industry by independent research firms Forrester and Gartner.

What You’ll Be Doing

  • Own the Internal Developer Platform (IDP) as a product, treating engineering teams as customers and optimizing for reliability, usability, and delivery velocity.
  • Define and execute a platform roadmap aligned with business priorities, developer needs, and long-term scalability.
  • Design, build, and evolve paved roads for application delivery, including CI/CD pipelines, infrastructure templates, service scaffolding, and standardized deployment patterns.
  • Build self-service capabilities that enable teams to provision, deploy, observe, and operate services with minimal friction.
  • Create and maintain reusable platform abstractions across AWS and Azure that standardize security, reliability, networking, and observability.
  • Reduce developer cognitive load by abstracting unnecessary complexity while enforcing clear guardrails for security, cost, and compliance.
  • Partner closely with application, product, and security teams to embed reliability, scalability, and security by design.
  • Establish and evolve platform standards for logging, monitoring, alerting, tracing, and incident response workflows.
  • Define, measure, and manage SLIs, SLOs, and error budgets for shared platform services.
  • Drive the reduction of operational toil through automation, standardization, and platform-first solutions.
  • Ensure shared platform services meet high standards for availability, performance, resilience, and scalability.
  • Own system-to-system integration and messaging patterns used across the platform.
  • Lead capacity planning, demand forecasting, and performance tuning for platform services.
  • Plan and execute zero-downtime upgrades, migrations, and releases of platform components.
  • Lead platform-level incident response workflows and post-incident reviews, and drive systemic improvements rather than one-off fixes.
  • Evaluate incoming platform requests and translate them into scalable, productized capabilities.
  • Mentor engineers and drive platform adoption through documentation, enablement, and technical evangelism.
  • Participate in a 24x7 on-call rotation as an escalation point for platform reliability and availability issues.
  • Operate effectively in ambiguous problem spaces, making sound architectural and product decisions with limited guidance.

What We Look For In You:

  • Bachelor’s degree in Computer Science or equivalent practical experience.
  • 6+ years of experience in Platform Engineering, Site Reliability Engineering, DevOps, or Systems Engineering roles.
  • Strong expertise in Linux and Windows operating systems.
  • Advanced automation and scripting skills using Python, Bash, and/or PowerShell.
  • Deep, hands-on experience designing and operating AWS and Azure platforms at scale.
  • Strong experience building and operating CI/CD platforms (Jenkins, GitHub Actions or equivalent).
  • Strong experience with Infrastructure as Code and configuration management (Terraform, CloudFormation, ARM, or similar).
  • Production experience with container and orchestration platforms such as Docker and Kubernetes.
  • In-depth experience with the HashiCorp ecosystem (Nomad, Consul, Vault).
  • Strong understanding of distributed systems, cloud-native architectures, and reliability patterns.
  • Experience designing and operating observability platforms (e.g., Splunk, Sumo Logic, or similar).
  • Familiarity with security and compliance practices, including vulnerability scanning and enterprise security tooling.
  • Strong understanding of the software delivery lifecycle, release engineering, and platform lifecycle management.
  • Experience working in Agile / DevOps environments with a strong product mindset.
  • Demonstrated ability to influence without authority, set standards, and drive adoption across teams.
  • Excellent communication skills, able to translate platform capabilities into clear developer value.
  • Strong problem-solving skills with a bias toward durable, scalable solutions over short-term fixes.
  • A mindset of continuous improvement, curiosity, and learning.
  • Comfortable supporting a global, follow-the-sun operation when needed.

How We Measure Success:

  • Strong developer adoption and satisfaction with the platform (DX).
  • Reduced deployment friction, lead time, and operational toil.
  • Platform reliability and performance meeting or exceeding defined SLOs.
  • Consistent, high-quality service delivery across engineering teams.
  • Reduced incident frequency and severity driven by systemic platform improvements.
  • Increased standardization, automation, and self-service adoption across the organization.

Our differentiator is our people! We hire the brightest talent and develop them into leaders. We foster a culture of PEOPLE, PASSION and GROWTH.
People: We value our people, customers, and partners
Passion: We love what we do
Growth: Unlimited growth means unlimited potential
 
AP is a customer-focused, team-oriented organization where innovation and results are rewarded, and individuals can chart the course of their own careers.
 
Being a woman-founded and woman-led company has meant supporting a meritocracy where everyone has opportunities to achieve their best, and ensuring we foster an environment of diversity, equity, and inclusion. In practice this means we will not only work to recruit a diverse workforce, but also maximize the full potential of all of our people. You can read more about our commitment to DEI here.

Top Skills

ARM
AWS
Azure
Bash
CloudFormation
Consul
Docker
GitHub Actions
HashiCorp Nomad
Jenkins
Kubernetes
Linux
PowerShell
Python
Splunk
Sumo Logic
Terraform
Vault
Windows

The Company
HQ: New York, NY
205 Employees
Year Founded: 2000

What We Do

Analytic Partners is a proven global leader in measurement and optimization. Our adaptive solutions integrate proprietary technology powered by the latest data science delivered through our platform and high-touch consulting. We enable deeper business understanding to support better, faster decisions.

In "The Forrester Wave™: Marketing Measurement and Optimization Solutions, Q1 2020", Analytic Partners was named a Leader and was top ranked in Strategy and Current Offering among all evaluated vendors. In addition, Analytic Partners received the highest score for Technology Platform.

Founded in 2000 by President and CEO Nancy Smith, Analytic Partners is fully independent and a Certified Women's Business Enterprise. We are fast growing with global operations across our full-service offices in New York City, Denver, Miami, Charlottesville, Dublin, Paris, Sydney, Hong Kong, Singapore, and Shanghai. Analytic Partners serves clients in more than 50 countries, providing world-class expertise and client support, along with powerful integrated technology – GPS Enterprise.

Why Work With Us

We foster a culture of PEOPLE, PASSION and GROWTH.
• People: We value our people, clients, and partners
• Passion: We love what we do
• Growth: Unlimited growth means unlimited potential

Analytic Partners is a Certified Women-Owned Business. We foster diversity and inclusion at all levels including the executive management team.
