We're looking for a Senior Site Reliability Engineer who genuinely enjoys the craft. Someone who takes pride in a clean Terraform module, cares about observability because they've felt the pain of flying blind, and believes good documentation is an act of kindness for your teammates. You'll be hands-on with our AWS infrastructure, especially EKS, IAM, and RBAC, building things that are secure by default, not as an afterthought. You'll own our CI/CD pipelines in GitHub Actions, set up guardrails that let engineers ship quickly and confidently, and keep Datadog tuned so we know what's happening in our systems before our customers do. On any given week you might be writing Terragrunt modules, building a Python script to eliminate a tedious manual process, writing a runbook that'll save someone's 2am, or digging through a postmortem with the team with a focus on learning, not blame.
We work in an Agile environment with an on-call rotation. We approach our processes with thoughtfulness and the intent to constantly iterate and make it better. You don't need to have all the answers; you just need curiosity, clear communication, and a willingness to own your slice of the system while keeping it accessible and scalable, enabling us to build together.
What You’ll Do
- Design, scale, and operate resilient, cloud-native infrastructure in AWS with a strong emphasis on EKS, IAM, RBAC, and modern security-first practices.
- Build and optimize CI/CD pipelines with GitHub Actions and GitHub Advanced Security, enabling velocity without compromising safety.
- Own observability across the stack using Datadog (metrics, logging, alerting, and tracing).
- Write and maintain Terragrunt, Terraform modules, and infrastructure-as-code (IaC) automation.
- Develop internal tools and scripts in Python to automate operational workflows and reduce manual overhead.
- Document everything from runbooks to standards so teams stay aligned and systems stay stable.
- Actively contribute to Agile workflows using Jira, with clear tracking of work, priorities, and progress.
- Participate in on-call rotations, postmortems, and continuous improvement efforts — always with a blameless, team-first mindset.
What You’ll Bring
- 4+ years in a Senior SRE or DevOps role supporting production cloud infrastructure at scale, preferably in SaaS, PaaS, high-growth, or fast-paced environment.
- Deep experience with AWS (IAM, EKS, VPC, EC2, Secrets Manager, Serverless) and RBAC.
- Knowledge of compliance standards like HIPAA, HITRUST, or SOC 2.
- Hands-on proficiency with Terraform, Terragrunt, Helm, and container orchestration.
- Proven experience building and maintaining GitHub Actions for CI/CD, including GitHub Advanced Security features like secret scanning and code policy enforcement.
- Strong Datadog experience building dashboards, tuning alerts, setting up monitors, and interpreting telemetry.
- Solid Python scripting experience for automation and internal tools.
- You value clear, accurate documentation as a core part of engineering, not an afterthought.
- Comfortable working in Agile/Scrum environments with well-tracked Jira workflows.
- Practical experience with resource analysis and infrastructure optimization.
- AWS DevOps Engineer Professional Certification
- Familiarity with Lambda, Fargate, and serverless infrastructure.
- Experience with multitenant platforms or customer-isolated deployments.
- Experience with Azure or moving from Azure to AWS
Preferred Experience
Skills Required
- 4+ years in a Senior SRE or DevOps role supporting production cloud infrastructure at scale
- Deep experience with AWS (IAM, EKS, VPC, EC2, Secrets Manager, Serverless) and RBAC
- Knowledge of compliance standards like HIPAA, HITRUST, or SOC 2
- Hands-on proficiency with Terraform, Terragrunt, Helm, and container orchestration (Kubernetes/EKS)
- Proven experience building and maintaining GitHub Actions for CI/CD, including GitHub Advanced Security
- Strong Datadog experience (dashboards, alerts, monitors, tracing)
- Solid Python scripting experience for automation and internal tools
- Experience with Agile/Scrum workflows and Jira
- Experience participating in on-call rotations, incident response, and blameless postmortems
- Practical experience with resource analysis and infrastructure optimization
- AWS DevOps Engineer Professional Certification
- Familiarity with Lambda, Fargate, and serverless infrastructure
- Experience with multitenant platforms or customer-isolated deployments
- Experience with Azure or migrating from Azure to AWS
What We Do
DexCare is a data-driven intelligence company making access to healthcare easier and better for everyone. We enable patients to get the care they need faster, easier, more affordably, with better results. We enable providers to see more patients, easily and more effectively through different modalities. We enable health systems to utilize resources more efficiently and effectively, lower costs, grow revenue, and build enduring relationships with patients and providers.
Why Work With Us
DexCare is a series B venture-backed healthtech startup backed by leading healthcare investors. DexCare is led by a customer-focused mission-driven team experienced in building successful, innovative, and transformative solutions to healthcare’s biggest problems. Join our team and lend your talent to continue moving the initiatives forward.
Gallery

.png)





