What You’ll Do
- Deploy and maintain observability stack (Grafana, Mimir, Prometheus) across multiple customer clusters and DoD networks
- Build Helm chart abstractions and automation to streamline monitoring deployments for new customers
- Troubleshoot and debug complex Kubernetes issues, networking problems, and monitoring stack failures
- Configure and maintain BigBang charts and DoD Platform One integrations
- Design and implement infrastructure automation using tools like Pulumi, ArgoCD, and Flux
- Work with Istio service mesh and Keycloak for authentication in secure environments
- Monitor and optimize performance of monitoring infrastructure across multiple environments
- Collaborate with security teams to ensure compliance with NIST requirements and DoD standards
- Participate in on-call rotation and incident response for production environments
Skills You’ll Bring to Our Team
- 5+ years of Site Reliability Engineering or DevOps experience
- Deep experience with Kubernetes administration, troubleshooting, and scaling
- Hands-on experience deploying and maintaining observability tools (Prometheus, Grafana, Mimir/Cortex)
- Strong understanding of Helm charts, GitOps practices, and CNCF tooling
- Experience with service mesh technologies (Istio preferred)
- Proven ability to debug complex distributed systems and networking issues
- Understanding of authentication systems and security in regulated environments
- Ability to work independently and collaborate with team members in a remote environment
Preferred Qualifications
- Active security clearance or ability to obtain a Secret-level security clearance
- Previous experience with DoD software deployments and Platform One
- Experience with BigBang charts and Iron Bank containers
- Experience working in national security or highly regulated environments
- Familiarity with compliance frameworks (NIST, FedRAMP, etc.)
- Experience with infrastructure as code (Pulumi, Terraform)
Technologies we Use
- Observability: Grafana stack, Prometheus, custom alerting tools
- Kubernetes: Helm, ArgoCD, Flux, Tekton, BigBang charts
- Security: Istio, Keycloak, Kyverno
- Infrastructure: AWS/GCP/Azure, Pulumi, Git/GitLab
- Languages: YAML, Bash, Go
Second Front Systems Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Second Front Systems and has not been reviewed or approved by Second Front Systems.
-
Healthcare Strength — Health coverage is described as 100% employer-paid for employees and dependents, which is positioned as a standout element of the package. This signals strong protection for families without added premium costs.
-
Leave & Time Off Breadth — Time off policies include flexible PTO, paid parental leave, and recognition of federal holidays, indicating broad leave options. Employer materials cite 11 federal holidays, with flexibility noted across sources.
-
Fair & Transparent Compensation — Publicly posted salary bands and role-based ranges point to market-aware, competitive pay for multiple positions. Aggregated compensation snapshots also indicate strong on‑target earnings for sales and solid total compensation in senior technical roles.
Second Front Systems Insights
Similar Jobs
What We Do
At Second Front Systems, we build software that accelerates delivery of emerging commercial technologies to U.S. warfighters. By harnessing insights and methodologies from the private sector and aligning them with government priorities and processes, we enable defense and national security professionals to effectively engage in long-term, continuous competition for access to emerging technologies.
Gallery









