We’re determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals – and to help others accomplish theirs, too. Join our team as we help shape the future.
Position Overview
We are seeking an experienced and highly motivated Sr Staff Reliability Engineer. The Sr Staff Reliability Engineer will have end-to-end accountability for the reliability of IT services within a defined application portfolio. A prerequisite to the role will be a “build-to-manage”, problem-solving and innovative mindset applied to the design, build, test, deploy, change and maintenance of services drawing from deep engineering expertise.
Key measures of success will include service stability, effective delivery and environment instrumentation, deployment quality, technical debt reduction, asset resiliency, risk/security compliance, cost efficiency, proactive and preventative maintenance mechanisms, top quartile operating norms.
The Sr Staff Reliability Engineer will actively contribute to sustained advancement of the RE practice within and beyond a given area of responsibility.
Key Responsibilities
Guide the use of best-in-class software engineering standards and design practices for instrumenting code/application technology stack to enable the generation of relevant metrics on overall technology health - availability, performance, quality, currency and resiliency.
Serve as key liaison between the architecture and software engineering teams to influence the technical strategy for the organization, keeping in mind its cross-functional impacts, integration across the organization, and architecture rationalization.
Function as the go-to technical leader for the applications supported, requiring depth and breadth of knowledge in technologies, applications, integration, interfaces and business domain.
DevSecOps Solution Responsibilities:
Design, build, and maintain scalable and reliable systems for production environments.
Automate infrastructure provisioning, CI/CD pipelines, and incident response process.
Identify and mitigate risks to system reliability, security, and performance.
Develop effective tooling, alerts, and response mechanisms to identify and address reliability risks leveraging automation to support problem prevention, detection, mitigation, and resolution.
Enhance the delivery flow by engineering the appropriate solutions to increase delivery speed while adhering to technology standards for sustained reliability.
Progressively implement preventative controls and drive increased automation and self-healing capabilities. Continue to improve cost efficiency baselines
Promote and implement innovative solutions.
IT Ops Responsibilities:
Ensure operational excellence. Independently drive the triaging and service restoration of all high impact incidents in order to minimize the mean time to service restoration and impact to the business. Demonstrate end-to-end ownership.
Partner with infrastructure teams to design and implement intelligent incident routing, enhanced monitoring/alerting capabilities and automated service restoration processes. Take proactive measures to prevent high impactful incidents.
Achieve and maintain the continuity of Hartford and third-party assets that support a business function. Accountable for keeping the IT application and infrastructure metadata repositories current.
Required Skills & Experience
System Thinking end-to-end - Broad understanding of enterprise architectures and complex (backend) systems (understand more than the component itself)
Highly collaborative, partners with peers, stakeholders with a passion about delighting customers.
Expert experience with Performance and Observability tools such as DynaTrace, Splunk, TrueSight, CloudWatch, CloudTrail, and related tools.
Strong solution architecture orientation to enable expedient troubleshooting, issue-resolution and root-cause removal in a hybrid cloud environment.
Experience with continuous integration and DevOps methodologies, preferred tools such as GitHub, Jenkins, Nexus, Rally, SonarQube etc..
Experience with cloud platforms (AW, GCP, or Azure)
Deep understanding of Linux systems, containers (Docker), and orchestration tools (Kubernetes)
Expertise with Infrastructure as Code (Terraform, CloudFormation).
Knowledge of complex traditional and modern enterprise architectures and systems (understand more than the component itself).
Strong hybrid cloud experience (private and public) across various service delivery models – IaaS, PaaS, SaaS.
Strong communication (verbally and written) / collaboration / negotiation skill, working in a diverse team cross business units
Preferred Qualifications
Understanding FinOps or cost-optimization practices in the cloud.
Experience with API gateways, and network-level observability.
Experience in regulated environments (Insurance)
AWS Solutions Architect certification
Keeps abreast with new market technologies and adept at learning and adopting new models. Promotes and applies continuous learning.
What We Offer
Opportunity to work on cutting-edge automation technologies including GenAI in testing.
Collaborative and innovative work culture.
Competitive compensation and benefits.
Continuous learning and growth opportunities.
About Us | Our Culture | What It’s Like to Work Here
Skills Required
- Expert experience with Performance and Observability tools
- Strong solution architecture orientation
- Experience with cloud platforms (AWS, GCP or Azure)
- Deep understanding of Linux systems, containers, and orchestration tools
- Expertise with Infrastructure as Code (Terraform, CloudFormation)
The Hartford Financial Services Group, Inc. Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about The Hartford Financial Services Group, Inc. and has not been reviewed or approved by The Hartford Financial Services Group, Inc..
-
Retirement Support — The retirement savings plan pairs matching with an additional company contribution and guidance, strengthening long‑term financial security. Consistent 401(k) generosity elevates perceived total compensation across roles.
-
Leave & Time Off Breadth — Paid time off, holidays, and paid leaves are described as generous and accessible, supporting work‑life balance. The ability to take meaningful time away adds value beyond base pay.
-
Healthcare Strength — Health, dental, and vision options are comprehensive, with supplemental coverages that help manage out‑of‑pocket costs. Mental health resources, EAP access, and wellness programs further reinforce overall benefits value.
The Hartford Financial Services Group, Inc. Insights
What We Do
Human achievement is at the heart of what we do. We put our belief into action by not only ensuring individuals and businesses are well protected, but by going even further – making an impact in ways that go beyond an insurance policy







