Key Responsibilities
- Build and maintain observability stacks using Prometheus and Grafana; define SLOs, SLIs, SLAs and error budgets.
- Own incident response: on-call rotation, triage, mitigation, and blameless post-mortems.
- Automate repetitive operational tasks and eliminate toil through scripting and tooling (Python, Bash, Go).
- Design, deploy, and maintain highly available infrastructure on AWS using Terraform and Ansible for infrastructure-as-code workflows.
- Manage and optimize Kubernetes clusters (EKS) and containerized workloads with Docker to support microservices architecture.
- Collaborate with engineering teams during design reviews to embed reliability and scalability requirements.
- Monitor capacity and performance trends; proactively identify and resolve bottlenecks.
- Maintain and improve CI/CD pipelines and deployment automation.
Qualifications Required
- 2–8 years of experience in Site Reliability Engineering, DevOps, or a closely related discipline.
- Working knowledge of monitoring and logging tools like Prometheus, Grafana, Dynatrace or Datadog, OpenSearch and Victoria metrics etc.
- Tracking and monitoring SLAs for all critical services.
- Experience with Linux systems administration.
- Hands-on experience with Kubernetes and Docker in production environments.
- Proficiency with AWS services (EC2, EKS, RDS, S3, VPC, IAM, CloudWatch).
- Experience with Infrastructure-as-Code tools such as Terraform or Ansible.
- Strong scripting skills in Python or Bash.
- Familiarity with CI/CD tools (e.g., GitHub Actions, Jenkins, GitLab CI).
- Familiarity with GitOps workflows (ArgoCD, Rancher etc).
Preferred
- Experience in financial services, FinTech, or other regulated industries.
- Knowledge of service mesh technologies (Istio, Linkerd).
- Familiarity with distributed tracing tools (Jaeger, OpenTelemetry).
- AWS certifications (Solutions Architect, DevOps Engineer, or equivalent).
- Experience with cost optimization strategies in cloud environments.
Clearwater Analytics (CWAN) Compensation & Benefits Highlights
-
Retirement Support — A company 401(k) match with immediate vesting is consistently included alongside tax‑advantaged accounts. This indicates reliable long‑term savings support as part of the package.
-
Equity Value & Accessibility — Equity participation is available through an employee stock purchase plan, with RSUs included for some roles. This adds ownership potential beyond base pay and bonus.
-
Leave & Time Off Breadth — Paid time off is available from day one with a baseline around three weeks, plus company holidays and volunteer time. Flexible elements like work‑from‑home Fridays and limited “work from anywhere” periods broaden practical time‑off utility.
Clearwater Analytics (CWAN) Insights
Similar Jobs
What We Do
CWAN was founded on a simple belief: investment professionals deserve modern technology that actually works for them. Not legacy systems that slow them down. Not fragmented data that creates confusion. But one comprehensive platform that gives you complete visibility and crystal-clear insights. The result? Investment management that works as seamlessly as your investment strategy. Since our founding in 2004, CWAN has been the trusted technology partner powering the world’s leading institutional investors — from insurance companies, asset managers, and hedge funds to asset owners like corporations, endowments, and pension funds managing over $10 trillion in assets.
Why Work With Us
We continue to grow, fueled by a strong foundation, an ambitious vision, and a commitment to delivering exceptional value to our clients, partners, and team members around the world. What started as a bold idea in Boise, Idaho has rapidly transformed into a global presence. We’ve expanded our footprint significantly—now operating out of 24 offices
Gallery
Clearwater Analytics (CWAN) Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.


_1.jpg)





_1.jpg)


