Site Reliability Engineer

Reposted 3 Days Ago
St. Petersburg, FL, USA
In-Office
86K-109K Annually
Senior level
Information Technology • Consulting
The Role
The Site Reliability Engineer will drive the observability roadmap, standardize monitoring practices, optimize alerting tools, and collaborate with teams to enhance operational efficiency and system reliability.
Summary Generated by Built In

At Zelis, we Get Stuff Done. So, let’s get to it! 

  

A Little About Us 

Zelis is modernizing the healthcare financial experience across payers, providers, and healthcare consumers. We serve more than 750 payers, including the top five national health plans, regional health plans, TPAs and millions of healthcare providers and consumers across our platform of solutions. Zelis sees across the system to identify, optimize, and solve problems holistically with technology built by healthcare experts – driving real, measurable results for clients.

At Zelis, AI is woven into the fabric of how we work. Every associate is expected - and empowered - to partner with AI to challenge the status quo, accelerate innovation, and amplify their impact. This is a place for builders with a growth mindset who act with agility, embrace change, and use modern technology to shape smarter solutions, exceptional experiences, and the future of our industry for our clients, customers, and our culture.

  

A Little About You 

You bring a unique blend of personality and professional expertise to your work, inspiring others with your passion and dedication. Your career is a testament to your diverse experiences, community involvement, and the valuable lessons you've learned along the way. You are more than just your resume; you are a reflection of your achievements, the knowledge you've gained, and the personal interests that shape who you are.

Position Overview

We are seeking a strategic and results-oriented Site Reliability Engineer (Golden Signals Lead) to define and drive the observability roadmap across all platforms.

Job Title: Site Reliability Engineer

Location: Remote, In-office, or Hybrid
Department: IT Operations
Reports To: Manager of Observability & Reliability
Job Type: Full-Time Employee (FTE)

Job Summary:

This role is responsible for establishing a consistent and scalable approach to monitoring and alerting, leveraging golden signals to enhance system reliability and operational efficiency. The successful candidate will collaborate closely with the ZEIT SRE team, engineering leads, and India-based resources to build a unified observability strategy aligned with organizational goals.

Key Responsibilities:

Observability Roadmap Development:

  • Define a unified vision for observability across all platforms, with golden signals as the foundation for monitoring and alerting.
  • Develop and maintain a comprehensive roadmap to improve observability, reduce tool redundancy, and standardize practices across platforms.
  • Establish and track key performance indicators (KPIs) to measure progress and ensure accountability for roadmap milestones.

Collaboration and Alignment:

  • Partner with the ZEIT SRE team and engineering leads to break down silos and promote consistent observability practices.
  • Drive cross-platform collaboration to reduce operational inconsistencies and define a 'north star' approach for observability.
  • Facilitate knowledge sharing to ensure alignment on current and future observability initiatives.

Monitoring and Alerting:

  • Standardize the implementation of golden signals across applications to improve system reliability and incident detection.
  • Optimize alerting tools and reduce redundant or ineffective monitoring interfaces ('panes of glass').
  • Lead efforts to enhance observability while minimizing operational overhead for platform teams.
  • Maintain and enhance observability dashboards, delivering actionable insights into application health and performance.

Operational Support and Improvement:

  • Identify and address gaps in existing observability practices, prioritizing long-term scalability and reliability.
  • Collaborate with India-based resources to execute observability build-outs efficiently and with high quality.
  • Reduce client, provider, and print facility-raised issues through proactive monitoring and early detection.

Reporting and Continuous Improvement:

  • Measure and report on observability success metrics, including actionable alert volume and reduced issue escalations.
  • Continuously evaluate and refine observability strategies based on stakeholder feedback and evolving organizational needs.

Qualifications:

Educational Background:

  • Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent experience).

Experience:

  • Minimum of 5 years of experience in Site Reliability Engineering, DevOps, or a related role with a strong focus on observability.
  • 5+ years of hands-on experience with .NET (C#), including advanced knowledge of ASP.NET Core, Web APIs, and performance optimization.
  • Demonstrated success in designing and implementing monitoring and alerting solutions across complex IT environments.

Technical Skills:

  • Deep understanding of SRE principles and golden signals for system monitoring.
  • Proficiency with observability tools such as Prometheus, Grafana, Splunk, New Relic, or Datadog.
  • Familiarity with cloud platforms (AWS, Azure, GCP) and containerization technologies (Docker, Kubernetes).
  • Advanced proficiency in scripting languages such as PowerShell.
  • Experience in front-end development using React.js.
  • Advanced knowledge of .NET

Soft Skills:

  • Strong leadership and collaboration abilities, with a proven ability to align diverse teams toward common goals.
  • Excellent analytical and problem-solving skills, with a proactive approach to identifying and resolving issues.
  • Clear and effective communication skills, capable of conveying technical concepts to stakeholders at all levels.

Preferred Qualifications:

  • Experience with building observability roadmaps and scaling solutions in enterprise environments.
  • Certifications in cloud or DevOps-related disciplines (e.g., AWS Certified DevOps Engineer, Kubernetes Administrator).

Please note at this time we are unable to proceed with candidates who require visa sponsorship now or in the future.

Location and Workplace Flexibility

We have offices in Atlanta GA, Boston MA, Morristown NJ, Plano TX, St. Louis MO, St. Petersburg FL, and Hyderabad, India. We foster a hybrid and remote friendly culture, and all our employee's work locations are based on the needs of the position and determined by the Leadership team. In-office work and activities, if applicable, vary based on the work and team objectives in accordance with Company policies.

  

Equal Employment Opportunity  
Zelis is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. 
 
We welcome applicants from all backgrounds and encourage you to apply even if you don’t meet 100% of the qualifications for the role. We believe in the value of diverse perspectives and experiences and are committed to building an inclusive workplace for all. 


Accessibility Support 
We are dedicated to ensuring our application process is accessible to all candidates. If you are a qualified individual with a disability or a disabled veteran and require a reasonable accommodation with any part of the application and/or interview process, please email [email protected]

  

Disclaimer 

The above statements are intended to describe the general nature and level of work being performed by people assigned to this classification. They are not to be construed as an exhaustive list of all responsibilities, duties, and skills required of personnel so classified. All personnel may be required to perform duties outside of their normal responsibilities, duties, and skills from time to time. 

Skills Required

  • Bachelor's degree in Computer Science or related field or equivalent experience
  • Minimum of 5 years of experience in Site Reliability Engineering or DevOps
  • 5+ years of experience with .NET (C#)
  • Hands-on experience with observability tools
  • Advanced proficiency in scripting languages such as PowerShell
  • Experience in front-end development using React.js
  • Certifications in cloud or DevOps-related disciplines

Zelis Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Zelis and has not been reviewed or approved by Zelis.

  • Healthcare Strength Feedback suggests comprehensive medical, dental, and vision coverage is paired with mental health resources, telehealth, women’s health, and gender-affirming care. An Employee Assistance Program and a Lifestyle Spending Account reinforce depth of health support.
  • Retirement Support Feedback suggests a 401(k) with company match and HSA with employer contributions strengthen financial security. Financial planning resources and tools further bolster this area.
  • Flexible Benefits Feedback suggests flexible time off, hybrid/remote options, and meeting-free Wednesday afternoons support varied needs and work styles. Home office setup support adds practical flexibility.

Zelis Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Bedminster, NJ
924 Employees
Year Founded: 2016

What We Do

As a leading healthcare payments company, we price, explain and pay for care on behalf of payers, providers, and healthcare consumers. Zelis was founded on a belief there is a better way to determine the cost of a healthcare claim, manage payment-related data, and make the payment because more affordable and transparent care is good for all of us. We partner with over 700 payers, 1.5 million providers, and millions of members -- enabling the healthcare industry to pay for care, with care. Zelis brings adaptive technology, a deeply ingrained service culture, and an integrated pre-payment through payments platform to manage the complete payment process.

Similar Jobs

Applied Systems Logo Applied Systems

Site Reliability Engineer

Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
Remote or Hybrid
2 Locations
3040 Employees
65K-135K Annually

TransUnion Logo TransUnion

Site Reliability Engineer

Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Hybrid
6 Locations
13000 Employees
113K-188K Annually

MongoDB Logo MongoDB

Site Reliability Engineer

Big Data • Cloud • Software • Database
Easy Apply
Remote or Hybrid
10 Locations
5550 Employees
127K-249K Annually

MongoDB Logo MongoDB

Site Reliability Engineer

Big Data • Cloud • Software • Database
Easy Apply
Remote or Hybrid
6 Locations
5550 Employees
126K-248K Annually

Similar Companies Hiring

Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees
Standard Template Labs Thumbnail
Artificial Intelligence • Information Technology • Software
New York, NY
25 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account