IND - Staff Engineer, Reliability

Reposted 12 Days Ago
Be an Early Applicant
Puppalagunda, Manikonda, Rangareddy, Telangana, IND
In-Office
Mid level
Fintech • Payments • Financial Services
The Role
The Staff Engineer for Reliability will implement reliability and resilience requirements, develop frameworks, collaborate with stakeholders, and drive operational efforts for the platform using best practices.
Summary Generated by Built In
IND - Staff Engineer, Reliability - GCC070

We’re determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals – and to help others accomplish theirs, too. Join our team as we help shape the future.

Key Responsibilities 

  • Partner with Enterprise governors to ascertain key reliability, security, and resilience requirements set by The Hartford and bring those requirements into the Platform team for implementation 

  • Patternize resilience capabilities into useful tools, services, and products to be used by customers to ensure users of the Platform build fault-tolerant systems 

  • Develop governing frameworks to ensure each release is compliant with the standards we expect 

  • Liaise with key business and technical customers to understand predictive applications and their infrastructure. Through this consultation, you would be working with them to build resilience and reliability capabilities into their application. 

  • Drive IM, cloud ops, and RE efforts across the platform by applying industry best practices and maturing existing the problem management lifecycle by building standards and contributing to runbooks, standard operating procedures, and incident management lifecycle 

  • Performance engineering of deployed analytics and AI solutions across the portfolio to ascertain enhancement opportunities 

Required Skills & Experience 

  • 4+ years of experience programming in Python to build automation tools, operational scripts, and platform support capabilities, including infrastructure and reliability automation. 

  • 4+ years of experience using Infrastructure as Code to provision and manage cloud environments, including Terraform and/or CloudFormation, with a focus on repeatability, security, and scalability. 

  • 2–3 years of experience deploying and operating systems on public cloud platforms such as AWS and/or Google Cloud Platform, including familiarity with serverless architectures, multi-region deployments, and recoverability strategies. 

  • 4+ years of experience designing and operationalizing resilience, reliability, and disaster recovery capabilities for distributed systems and ML/AI platforms, including performance engineering and fault-tolerant system design. 

  • 2+ years of experience building and maintaining CI/CD pipelines using tools such as GitHub and Jenkins, including embedding security checks, compliance gates, and automated validation into deployment workflows. 

  • 4+ years of experience applying core reliability engineering concepts, including authoring runbooks, operational guides, and automation to support resilient platform operations. 

  • 3+ years of experience designing observability solutions, including logging, monitoring, and alerting using tools such as Splunk, with dashboards and metrics that surface service health, SLO/SLA adherence, and early-warning signals for ML and data workloads. 

  • 4+ years of hands-on experience with incident and problem management practices, including ITIL-based processes, postmortems, and blameless root cause analysis, as well as disaster recovery planning, failover testing, and resilience frameworks such as FMEA. 

  • Foundational knowledge of networking fundamentals and operations architecture to support IT service management (ITSM) automation and distributed system reliability. 

  • Familiarity with relational databases such as Snowflake or other RDBMS platforms, with an understanding of data reliability, availability, and consistency requirements in analytics and ML environments. 

About Us | Our Culture | What It’s Like to Work Here

Skills Required

  • 4+ years of experience programming in Python
  • 4+ years of experience using Infrastructure as Code with Terraform and/or CloudFormation
  • 2-3 years of experience deploying systems on public cloud platforms like AWS or Google Cloud Platform
  • 4+ years of experience designing resilience and reliability capabilities for distributed systems
  • 2+ years of experience building CI/CD pipelines using GitHub and Jenkins
  • 4+ years of experience applying reliability engineering concepts and authoring operational guides
  • 3+ years of experience designing observability solutions using tools like Splunk
  • 4+ years of experience with incident and problem management practices
  • Foundational knowledge of networking fundamentals and operations architecture
  • Familiarity with relational databases such as Snowflake

The Hartford Financial Services Group, Inc. Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about The Hartford Financial Services Group, Inc. and has not been reviewed or approved by The Hartford Financial Services Group, Inc..

  • Retirement Support The retirement savings plan pairs matching with an additional company contribution and guidance, strengthening long‑term financial security. Consistent 401(k) generosity elevates perceived total compensation across roles.
  • Leave & Time Off Breadth Paid time off, holidays, and paid leaves are described as generous and accessible, supporting work‑life balance. The ability to take meaningful time away adds value beyond base pay.
  • Healthcare Strength Health, dental, and vision options are comprehensive, with supplemental coverages that help manage out‑of‑pocket costs. Mental health resources, EAP access, and wellness programs further reinforce overall benefits value.

The Hartford Financial Services Group, Inc. Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Hartford, Connecticut
20,002 Employees
Year Founded: 1810

What We Do

Human achievement is at the heart of what we do. We put our belief into action by not only ensuring individuals and businesses are well protected, but by going even further – making an impact in ways that go beyond an insurance policy

Similar Jobs

The Hartford Financial Services Group, Inc. Logo The Hartford Financial Services Group, Inc.

Reliability Engineer

Fintech • Payments • Financial Services
In-Office
Puppalagunda, Manikonda, Rangareddy, Telangana, IND
20002 Employees

The Hartford Financial Services Group, Inc. Logo The Hartford Financial Services Group, Inc.

Staff Engineer

Fintech • Payments • Financial Services
In-Office
Puppalagunda, Manikonda, Rangareddy, Telangana, IND
20002 Employees

The Hartford Financial Services Group, Inc. Logo The Hartford Financial Services Group, Inc.

Staff Engineer

Fintech • Payments • Financial Services
In-Office
Puppalagunda, Manikonda, Rangareddy, Telangana, IND
20002 Employees

The Hartford Financial Services Group, Inc. Logo The Hartford Financial Services Group, Inc.

Staff Engineer

Fintech • Payments • Financial Services
In-Office
Puppalagunda, Manikonda, Rangareddy, Telangana, IND
20002 Employees

Similar Companies Hiring

Scotch Thumbnail
Artificial Intelligence • eCommerce • Fintech • Payments • Retail • Software • Analytics
US
35 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account