We are looking for a seasoned Staff DevOps and Platform Engineer to own and evolve the infrastructure that powers Liberate’s real-time AI voice and workflow automation systems. This is a critical technical leadership role. You will inherit and advance a modern AWS-based platform that spans PBX telephony, canary routing, MTLS-based integrations with carriers, secure production environments, CI/CD, and compliance posture.
You will drive reliability, scalability, and operational rigor across our multi-agent runtime. You will also mentor engineers, design forward-looking system improvements, and create the platform foundations that enable Liberate’s rapid product expansion.
Key Responsibilities
- Lead architecture and operation of core AWS infrastructure including PBX systems, EKS, networking, IAM, VPC design, and secure environment isolation.
- Own and improve Canary routing infrastructure for LRA (LLM REST API) via Traefik and GitOps patterns.
- Maintain and optimize CI/CD flows including GitHub-based CodeBuild jobs, artifact pipelines, and environment promotion workflows.
- Manage and evolve MTLS proxy infrastructure used to integrate with carrier systems like Frontline.
- Own HAProxy-based proxy fleet, certificate lifecycle, root CA management, and IP-restricted ingress patterns.
- Ensure secure, audited access to production systems, tokens, and root-level accounts.
- Lead incident response, on-call rotations, and postmortems. Improve reliability metrics (SLA/SLO/SLI) for voice, agent runtime, and workflow systems.
- Maintain and improve non-obvious production infra details including external service dependencies, version pinning, and update cadences.
- Partner with AWS Support to optimize pricing, scaling configs, and resource utilization.
- Modernize developer workflows: streamlined builds, repeatable environments, safe deployment strategies (blue/green, canary, feature flags).
- Build internal tools and abstractions to make engineers productive while enforcing safety, configuration hygiene, and compliance requirements.
- Lead infrastructure-related components of SOC2, pen-testing, and Vanta-driven controls.
- Ensure auditability, traceability, secure storage of credentials, and alignment with enterprise customer expectations.
- Work closely with AI Platform, Forward Deployed Engineering, and Product teams to translate business goals into scalable infrastructure decisions.
- Mentor engineers across DevOps, platform, and backend areas. Help set engineering standards and raise operational maturity across the org.
Required Qualifications
- 8+ years of DevOps, SRE, or platform engineering experience operating production systems at scale.
- Deep hands-on AWS expertise (EKS, IAM, VPC, ALB/NLB, CloudWatch, KMS).
- Strong experience with Kubernetes, container orchestration, and multi-environment management.
- Proficiency with Terraform or other IaC tools and GitOps workflows.
- High proficiency in Python, Go, or Typescript for tooling, automation, and internal platform services.
- Experience with Traefik, HAProxy, or similar load-balancing and routing systems.
- Familiarity with secure network architectures, MTLS, certificate hierarchies, and service-to-service authentication.
- Strong background managing CI/CD systems such as GitHub Actions and CodeBuild.
- Ability to lead incidents, design SLOs, and drive reliability across mission-critical systems.
- Excellent communication and leadership skills in distributed teams.
Preferred Qualifications
- Experience with PBX or telephony systems in AWS, SIP routing, or real-time communication pipelines.
- Experience with voice agents, WebRTC, or low-latency streaming services.
- Prior work in regulated or enterprise environments where compliance is a first-class requirement.
- Experience scaling infra in fast-growth startups.
- Contributions to open source, infrastructure design talks, or technical publications.
Why This Role Matters
Our platform supports real-time, multi-agent reasoning and voice workflows that depend on low latency, reliability, and airtight security. This role is the backbone of that capability. If you're excited to own mission-critical infrastructure in a company where infrastructure is product, we’d love to talk.
Strong preference for Boston or San Francisco based, but open to remote within the U.S.
Top Skills
What We Do
Liberate Innovations Inc. is a software-as-a-service (SaaS) platform for the P&C insurance industry to fully automate claims and underwriting journeys enabling P&C insurers to deliver an exceptional customer experience at the industry’s lowest cost. Insurers use the cloud-based low-code platform to build digital self-serve experiences and orchestrate an ecosystem of solutions providers and core systems to automate complex business processes. For more information, visit www.liberateinc.com.
Why Work With Us
Liberate is a fast-growing, exciting San Francisco-based AI Startup (Series B) that still maintains a people-first culture.
Gallery
.png)






