Principal Site Reliability Engineer - Remote

Posted An Hour Ago
Be an Early Applicant
Hiring Remotely in Minnetonka, MN, USA
In-Office or Remote
Expert/Leader
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
The Role
Define and scale SRE standards across teams, implement SLOs/SLIs/error budgets, build observability and resiliency patterns, drive automation and AIOps, improve reliability for large-scale Azure cloud systems, and influence engineering and platform teams.
Summary Generated by Built In
Requisition Number: 2371904
Optum Tech is a global leader in health care innovation. Our teams develop cutting-edge solutions that help people live healthier lives and help make the health system work better for everyone. From advanced data analytics and AI to cybersecurity, we use innovative approaches to solve some of health care's most complex challenges. Your contributions here have the potential to change lives. Ready to build the next breakthrough? Join us to start Caring. Connecting. Growing together.
We are seeking a Principal Site Reliability Engineer (SRE) to define and scale reliability practices across large-scale cloud platforms.
This is a senior individual contributor role focused on setting SRE standards, influencing engineering teams, and driving reliability through automation and AI-enabled operations.
This is a remote role with preference for candidates located in MN.
You'll enjoy the flexibility to work remotely * from anywhere within the U.S. as you take on some tough challenges. For all hires in the Minneapolis or Washington, D.C. area, you will be required to work in the office a minimum of four days per week.
What Makes This Role Unique:
  • Define and influence SRE best practices across multiple platforms and teams
  • Drive adoption of AI-enabled reliability and operational innovation (AIOps)
  • Work on mission-critical healthcare systems at enterprise scale
  • Blend hands-on technical depth with strategic influence
  • Partner across engineering, platform, and security teams to elevate reliability standards

Primary Responsibilities:
  • Define and drive SRE standards across teams
  • Lead implementation of:
    • SLOs, SLIs, error budgets
    • Observability (metrics, logs, tracing)
    • Resiliency patterns (failover, self-healing)
  • Improve reliability through automation and proactive risk mitigation
  • Drive reliability practices in Azure environments
  • Apply AIOps (anomaly detection, intelligent alerting, automation)
  • Influence engineering teams without direct authority

What Success Looks Like:
  • Established consistent SRE practices and standards across teams
  • Improved system reliability, observability, and incident response maturity
  • Delivered measurable gains in uptime, performance, and operational efficiency
  • Enabled AI-driven improvements in reliability and operations

Why Join Optum?
You'll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role as well as provide development for other roles you may be interested in.
Required Qualifications:
  • Bachelor's Degree in Computer Science, Information Technology, or a related field, or equivalent practical experience
  • 10+ years of experience in Site Reliability Engineering, Software Engineering, or Cloud Engineering
  • Experience influencing multiple teams or platforms without direct ownership
  • Demonstrated experience improving reliability through automation, tooling, or AI-enabled approaches
  • Proven hands-on expertise in:
    • Reliability engineering (SLOs, SLIs, incident management, observability)
    • Distributed systems in cloud environments (Azure preferred)
  • Solid understanding of system design, performance, scalability, and failure modes

Preferred Qualifications:
  • Experience implementing AI/ML or AIOps solutions in production environments (e.g., anomaly detection, alert optimization, automation)
  • Experience standardizing observability frameworks (e.g., OpenTelemetry or similar)
  • Experience working in complex enterprise or regulated environments
  • Background supporting large-scale, mission-critical systems
  • Proven ability to influence senior technical stakeholders
  • Location: MN

*All employees working remotely will be required to adhere to UnitedHealth Group's Telecommuter Policy.
Pay is based on several factors including but not limited to local labor markets, education, work experience, certifications, etc. In addition to your salary, we offer benefits such as, a comprehensive benefits package, incentive and recognition programs, equity stock purchase and 401k contribution (all benefits are subject to eligibility requirements). No matter where or when you begin a career with us, you'll find a far-reaching choice of benefits and incentives. The salary for this role will range from $xx,xxx to $xx,xxx annually based on full-time employment. We comply with all minimum wage laws as applicable.
Application Deadline: This will be posted for a minimum of 2 business days or until a sufficient candidate pool has been collected. Job posting may come down early due to volume of applicants.
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.
UnitedHealth Group is an Equal Employment Opportunity employer under applicable law and qualified applicants will receive consideration for employment without regard to race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, or protected veteran status, or any other characteristic protected by local, state, or federal laws, rules, or regulations.
UnitedHealth Group is a drug - free workplace. Candidates are required to pass a drug test before beginning employment.

Skills Required

  • Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent practical experience
  • 10+ years of experience in Site Reliability Engineering, Software Engineering, or Cloud Engineering
  • Experience influencing multiple teams or platforms without direct ownership
  • Experience improving reliability through automation, tooling, or AI-enabled approaches
  • Proven hands-on expertise in reliability engineering (SLOs, SLIs, incident management, observability)
  • Experience with distributed systems in cloud environments
  • Solid understanding of system design, performance, scalability, and failure modes
  • Experience with Azure
  • Experience implementing AI/ML or AIOps solutions in production
  • Experience standardizing observability frameworks (e.g., OpenTelemetry)
  • Experience working in complex enterprise or regulated environments supporting large-scale mission-critical systems
  • Proven ability to influence senior technical stakeholders

What the Team is Saying

Optum Compensation & Benefits Highlights

  • Healthcare Strength Health coverage offers copay and HSA medical options with dental, vision, company‑paid life and disability, and free or low‑cost virtual visits. Feedback suggests the offering is comprehensive and competitive on paper.
  • Parental & Family Support Time off and family supports include PTO, eight paid holidays plus a floating day, six weeks paid parental leave, up to two weeks paid caregiver leave, Bright Horizons back‑up care, and adoption assistance up to $10,000. Feedback suggests these resources are meaningful for caregivers and family needs.
  • Retirement Support Savings programs include a 401(k) with employer match (after one year, vesting after two) and a 10%‑discount Employee Stock Purchase Plan. These programs bolster long‑term financial security when combined with other savings resources.

Optum Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Eden Prairie, MN
160,000 Employees
Year Founded: 2011

What We Do

Optum, part of the UnitedHealth Group family of businesses, is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together. At Optum, we support your well-being with an understanding team, extensive benefits and rewarding opportunities. By joining us, you’ll have the resources to drive system transformation while we help you take care of your future. We recognize the power of connection to drive change, improve efficiency and make a difference in health care. Join a team where your skills and ideas can make an impact and where collaboration is key to creating technology that produces healthier outcomes.

Gallery

Gallery
Gallery
Gallery

Optum Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Optum has three workplace models that balance the needs of the business and the responsibilities of each role. These models, core on‑site (5 days/week), hybrid (4 days/week) and telecommute or fully remote, vary by country, role and location.

Typical time on-site: Not Specified
HQEden Prairie, MN
Metro Manila, Philippines
Cebu, Philippines
Davao, Philippines
Ann Arbor, MI
Atlanta, GA
Baltimore, MD
Bengaluru, India
Chennai, India
Dallas, TX
Detroit, MI
Dublin, Ireland
Hartford, CT
Houston, TX
Hyderabad, India
Jacksonville, FL
Las Vegas, NV
Letterkenny, Ireland
Louisville, KY
Madison, WI
Minneapolis, MN
Nashville, TN
New Delhi, India
Philadelphia, PA
Phoenix, AZ
Pune, India
Raleigh, NC
San Diego, CA
Washington, DC
Learn more

Similar Jobs

Optum Logo Optum

Epic Ambulatory Manager - Remote

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office or Remote
Eden Prairie, MN, USA
160000 Employees
113K-193K Annually

Optum Logo Optum

Manager UIUX Design Advisory Services - Remote

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office or Remote
Eden Prairie, MN, USA
160000 Employees
113K-193K Annually

Optum Logo Optum

Director, Brand Analytics and Agentic Enablement - Remote

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office or Remote
Minnetonka, MN, USA
160000 Employees
135K-231K Annually

Optum Logo Optum

Product Owner

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office or Remote
Minnetonka, MN, USA
160000 Employees
92K-164K Annually

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account