Group 1001 is a consumer-centric, technology-driven family of insurance companies on a mission to deliver outstanding value and operational performance by combining financial strength and stability with deep insurance expertise and a can-do culture. Group1001’s culture emphasizes the importance of collaboration, communication, core business focus, risk management, and striving for outcomes. This goal extends to how we hire and onboard our most valuable assets – our employees.
Why This Role Matters:
The Platform Engineering Services team at Group 1001 is building a Site Reliability Engineering practice with a network scope. We're hiring an Sr. Network Reliability Engineer who embodies Innovation and Excellence, and will apply SRE principles — code-as-source-of-truth, SLOs and error budgets, alerting on symptoms rather than causes, failure-mode-first design, and the elimination of toil — to the firm's network platform from carrier edge through cloud fabric to Kubernetes pod boundary. This is not a "keep the lights on" role. You will systematically engineer the lights-on work out of existence, build the abstractions that let other engineering teams express network intent in code, and treat the network as a single engineered system rather than a collection of vendor consoles. You will operate inside a DevSecOps practice spanning multi-cloud, multi-region environments, and you will partner closely with Cloud and Data Platforms, the NOC/SOC, and Cyber Security to extend reliability practice across the firm.
How You’ll Contribute:
- Treat reliability as an engineered property. Define SLOs and error budgets for the network platform — DNS resolution, edge availability, mesh ingress success, cross-region path health — and use them to gate changes, not just to color dashboards. Lead postmortems with a focus on permanent remediation, not pattern-recognition. Alert on symptoms users feel, not on causes that may or may not produce impact.
- Move network state into code. Use Terraform (or Pulumi), Ansible, and Python to replace CLI-driven configuration with declarative, version-controlled, peer-reviewed change running through Infra CI/CD. This applies equally to the edge tier (Cloudflare), security platforms (Zscaler ZIA/ZPA, ZTNA policies, next-gen firewalls), the cloud network fabric (Transit Gateway, Cloud WAN, VPCs, Route53, IPAM), and increasingly the Kubernetes and service-mesh layer.
- Build network policy as intent, not rule lists. Express what flows are permitted, what segments are isolated, what egress is inspected, what zones share DNS — and engineer the compilers that turn that intent into per-vendor configuration. Use Policy as Code (OPA/Rego, Sentinel, Cilium NetworkPolicy) to catch invariant violations at plan time, not apply time.
- Infrastructure as Code (IaC): Design, deploy, and manage network infrastructure using Terraform or Ansible, moving the firm away from manual configuration to a code-first approach.
- Engineer the cloud network platform. Operate and extend our multi-account AWS Landing Zone — Cloud WAN segmentation, Transit Gateway peering, IPAM-driven CIDR allocation, shared private DNS, cross-account telemetry pipelines. Build the platform abstractions that make a new account or service land correctly with policy and connectivity composed from declarative inputs.
- Extend platform thinking into the container tier. Kubernetes networking, service mesh (Istio, Linkerd, Consul Connect), eBPF-based observability and policy (Cilium, Hubble), and the integration points where mesh-level authz meets cloud-tier identity. Recognize that an "internal" service is one logical hop on a chain of policy enforcement points and engineer for that explicitly.
- Improve telemetry and observability with intent. Build alerts as structured payloads with runbook links, suspected blast radius, and dependency-aware suppression. Author both system-health dashboards for operators and end-user monitoring dashboards that reflect actual user experience. Use Grafana, Elastic, Open Telemetry where each fits.
- Mentor and grow the team. Provide technical guidance to junior engineers, foster a culture of learning, and work out loud across Platform Engineering so the patterns you build cross-pollinate to adjacent domains.
- Handle hardware when required. Provide maintenance and configuration support for routers, switches, and firewalls at data centers and offices when needed — bringing code-first practices to physical hardware where possible (templating, change validation, zero-touch provisioning) and direct hands-on competence where it isn't.
- Incident Response: Serve as an escalation point for network issues, some complex and some basic but not yet covered by runbooks. Troubleshooting with a focus on root cause analysis and permanent remediation with a documentation-first mindset.
- Reduce toil and hand off cleanly. Repetitive operational tasks are scoped engineering problems with measurable payoff. Author runbooks and SOPs that the NOC can execute confidently; package routine work for L1/L2 handoff so engineering interrupt drops over time. Coordinate across Data Platforms, NOC/SOC, and Cyber Security so reliability practices spread instead of staying siloed.
What We’re Looking For:
- Network Engineering: Deep understanding of TCP/IP, BGP, OSPF, VPNs, and SD-WAN architecture.
- Automation: Proven experience with Terraform (state management, modules) and Ansible (playbooks, roles) – or similar – in a production environment. Proficiency in Python for automation and API interaction, or similar.
- Security Platforms: Hands-on experience with Cloudflare, zScaler, and/or enterprise firewalls.
- Observability: Experience configuring monitoring tools (e.g., Datadog, Prometheus, Grafana) to create meaningful alerts and dashboards.
Nice to Have
- Service mesh experience (Istio, Linkerd, Consul Connect, Cilium).
- eBPF-based observability (Hubble, Pixie).
- AWS Multi-account landing zone tooling experience (AFT, Control Tower, or equivalent).
- Policy as Code experience (OPA/Rego, Sentinel, Cilium NetworkPolicy).
- Professional Attributes
- Documentation First: A strong belief that a job isn't done until the documentation in written.
- Toil Reduction: A mindset that actively seeks to automate repetitive tasks.
- Hybrid Capability: Willingness to handle physical hardware tasks when required while maintaining a software-centric engineering mindset.
Compensation:
Our compensation reflects the cost of labor across several U.S. geographic markets. The base pay for this position ranges from $135,000/year in our lowest geographic market up to $190,000/year in our highest geographic market. Pay is based on factors such as market location, job-related skills, and experience.
Benefits Highlights:
Employees who meet benefit eligibility guidelines and work 30 hours or more weekly, have the ability to enroll in Group 1001’s benefits package. Employees (and their families) are eligible to participate in the Company’s comprehensive health, dental, and vision insurance plan options. Employees are also eligible for Basic and Supplemental Life Insurance, Short and Long-Term Disability. All employees (regardless of hours worked) have immediate access to the Company’s Employee Assistance Program and wellness programs—no enrollment is required. Employees may also participate in the Company’s 401K plan, with matching contributions by the Company.
Group 1001, and its affiliated companies, is strongly committed to providing a supportive work environment where employee differences are valued. Diversity is an essential ingredient in making Group 1001 a welcoming place to work and is fundamental in building a high-performance team. Diversity embodies all the differences that make us unique individuals. All employees share the responsibility for maintaining a workplace culture of dignity, respect, understanding and appreciation of individual and group differences.
#LI-REMOTESkills Required
- Deep understanding of TCP/IP, BGP, OSPF, VPNs, and SD-WAN architecture.
- Proven experience with Terraform and Ansible in a production environment.
- Proficiency in Python for automation and API interaction.
- Hands-on experience with Cloudflare, zScaler, and/or enterprise firewalls.
- Experience configuring monitoring tools like Datadog, Prometheus, Grafana.
Group 1001 Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Group 1001 and has not been reviewed or approved by Group 1001.
-
Fair & Transparent Compensation — Pay is considered competitive for many specialized and senior roles, with employer-posted salary ranges providing clarity for candidates. Job postings indicate market-aligned compensation in several corporate, tech, and sales functions.
-
Leave & Time Off Breadth — Time off is portrayed as generous, including paid holidays and multiple leave types alongside hybrid work options for eligible roles. Some positions also offer remote flexibility that supports overall work–life balance.
-
Retirement Support — Retirement offerings include a 401(k) with company matching and are emphasized as a core part of total rewards. Certain postings reference immediate vesting, reinforcing attention to long-term savings.
Group 1001 Insights
What We Do
Group 1001 Insurance Holdings, LLC (“Group 1001”) is an insurance holding company in the United States, with current combined assets under management of approximately $57.5 billion as of June 30, 2022, and a mission for setting a new standard in the insurance industry by making insurance more useful and intuitive for everyone. Group 1001 is a long-standing, nimble, and tech-driven financial services enterprise established on deep industry expertise and reliable delivery of long-term value through empowering its customers, employees, and communities. Leveraging upon its record of building successful businesses and strong operating fundamentals, Group 1001 powers the next generation of insurance businesses with useful and intuitive solutions and products accessible to everyone. Group 1001 invests in strategic partnerships as part of our mission to transform communities through sports and education. Group 1001 and our subsidiaries have a strong commitment to service and community transformation. Education and sports initiatives, coupled with impactful partnerships, allow Group 1001 to improve lives through positive change in our communities. Learn more at Group1001.com.







