This role goes beyond infrastructure management; you will build systems that make deployment, observability, scaling, and incident response increasingly automated and self-service for product teams.
A key part of this role is supporting our AI-native engineering strategy: enabling the tools, environments, and workflows that allow our engineers to work effectively with AI coding assistants and agentic workflows at scale.
OUR STACK:
- Cloud: AWS
- Compute: ECS Fargate, Lambda, Event-Driven Services
- Infrastructure as Code: Terraform
- Languages: Python, TypeScript, Bash
- Datastores: Aurora Postgres, DynamoDB, Redis (Elasticache), OpenSearch
- Streaming & Queues: Kinesis, SQS, BullMQ
- CI/CD: GitHub Actions (Enterprise)
- Observability: Datadog, AWS CloudWatch
- Architecture: Containerized APIs, Serverless Processing, Distributed Systems
- AI Infrastructure: SageMaker and/or self-hosted model serving systems
WHAT YOU'LL WORK ON:
- Designing and maintaining Infrastructure as Code using Terraform to provision and manage AWS infrastructure.
- Building and evolving internal developer platforms (IDPs) that establish a golden path. These are opinionated, self-service workflows for deployment, environment provisioning, and operational tooling, so engineers spend less time on undifferentiated infrastructure decisions.
- Supporting self-serve developer environments that enable engineers and AI agents to iterate quickly and independently.
- Enabling and scaling AI-assisted engineering operations, including access management, adoption visibility, and developer productivity measurement across the team’s AI tooling.
- Improving system reliability, scalability, and performance across distributed services and real-time workloads.
- Optimizing CI/CD systems to reduce deployment friction and accelerate engineering feedback loops.
- Operating production systems with strong ownership of uptime, incident response, and recovery automation.
- Partnering with engineering teams to improve service architecture, resiliency, and operational maturity.
- Improving cost efficiency and infrastructure utilization across AWS and other infrastructure.
- Supporting secure and compliant cloud infrastructure aligned with SOC2 and enterprise security practices.
- Participating in on-call rotations.
You will help build and operate Regal’s voice agent platform across containerized services, event-driven systems, and AI workloads running at production scale.
ABOUT YOU:
- 5+ years of experience in DevOps, Platform Engineering, or Cloud Infrastructure roles.
- Deep experience operating production workloads on AWS.
- Strong hands-on experience with Terraform or comparable IaC tooling.
- Experience running containerized and serverless systems at scale.
- Strong understanding of distributed systems reliability and failure modes.
- Experience building CI/CD pipelines and developer automation tooling.
- Experience with observability platforms such as Datadog, Prometheus, or OpenTelemetry.
- Familiarity with incident response, on-call operations, and production debugging.
- Working knowledge of networking, IAM, and cloud security best practices.
- Comfortable collaborating across engineering teams and influencing technical direction.
NICE TO HAVE:
- Experience building or operating Internal Developer Platforms (IDPs) with golden path tooling.
- Experience building or managing self-serve developer environments at scale.
- Exposure to AI coding tools (e.g., code assistants, agentic coding workflows) and the infrastructure required to support them.
- Experience supporting AI/ML or real-time workloads.
- Cost optimization and cloud efficiency initiatives.
- Exposure to compliance frameworks (SOC2, HIPAA, PCI).
BENEFITS/PERKS:
- We care about your health!
- Medical, Dental, and Vision plans - 80% covered by the company
- Flexible PTO & 11 paid holidays/year
- We care about future you!
- 401k Plan
- Paid parental leave
- Pre-tax commuter benefits
- We care about connection!
- In-office breakfast and snacks daily
- Happy hours, team outings, & annual off-sites
- Complete laptop workstation
- & more to come!
Top Skills
What We Do
Founded in 2020, Regal is the AI Agent Platform. Regal gives every company the tools to transform customer communications with delightful AI Agents that are connected to your data, easy to customize and monitor, always-available, and ready to take action. Our Values Customers are Royalty We serve our customers above all else. If we don’t earn their love, someone else will. Fast Execution Wins We work with urgency and speed because by moving forward we learn more about how to solve the problem than by theorizing. Growth Mindset Your learning curve should be as steep as the company’s growth curve. Data Beats Opinion We make decisions based on analysis and data, not anecdotes. Enjoy the Journey We bring our whole selves to work and build meaningful friendships. We appreciate, and are kind to, each other.
Why Work With Us
At Regal, every team member makes an impact. We move fast, solve complex challenges, and build game-changing AI solutions—together. Our culture thrives on fast execution, data-driven decisions, and continuous learning. If you want to grow, be challenged, and work with brilliant, motivated people, Regal is where you want to be.
Gallery
Regal Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.
We're in the office Tues-Thurs & WFH/Office Optional on Mon/Fri. Our HQ is in NYC. Annual offsites in fun spots such as Breckenridge, Miami, CT & more!













.png)