Why Work for Us
- As a reliable and trusted financial solutions provider with expanding reach to 1 in 3 households nationwide, we believe it takes extraordinary people to disrupt decades of legacy financial practices and reimagine solutions that serve customers at scale.
- We are 130+ employees strong and steadily building. Our world-class team, affectionately known as ‘SigFigians,’ is growing with an industry-savvy board and strategic executive team guiding us forward.
- We offer competitive benefits that include flexible PTO, a wellness benefit, a mobile/internet subsidy, employee recognition programs, and more!
- We are a remote-first company! We have regional hubs nationwide and a presence in four countries: United States, Canada, India, and Singapore.
- We believe that one size fits one and embrace a culture that honors and celebrates diversity of backgrounds, approaches, and experiences.
- We are guided by our core values: Customer Delight, Make It Happen, Think Big, and We Over Me. Read more about our core values and how we live them every day on our website.
How You’ll Make an Impact
- Lead a global, distributed SRE/DevOps team operating in a 24/7 production environment.
- Develop and implement automation frameworks for self-healing, auto-remediation, and system optimization.
- Enhance monitoring and observability through tools like Splunk, Prometheus, and AI-powered alerting platforms.
- Improve CI/CD pipelines using Jenkins, GitHub Actions, and ArgoCD, and drive continuous delivery at scale.
- Manage and scale infrastructure using Terraform, Kubernetes, Puppet, and similar tools.
- Act as the first technical escalation point for Level 2/Level 3 troubleshooting of production incidents involving Linux servers, cloud networking, and Kubernetes clusters.
- Lead post-incident reviews, implement automated fixes for root causes, and contribute to a growing incident knowledge base.
- Collaborate cross-functionally with Engineering, Security, and Product to align reliability initiatives with business objectives.
- Establish and enforce SLOs and error budgets to continually raise system reliability standards.
Ideal SigFigian for this Role
- 7+ years of experience in SRE, DevOps, or Technical Operations roles.
- 2+ years in a leadership role managing global, distributed teams in a high-uptime environment.
- Proven experience with AWS, GCP, or Azure, and implementing infrastructure as code at scale.
- Strong scripting skills in Python, Bash, or similar for automation and operational tooling.
- Deep understanding of observability and incident management best practices.
- Experience with CI/CD and deployment orchestration tools.
- Familiarity with containerized and microservices-based architectures.
- Passion for automation, reliability engineering, and continuous improvement.
- Excellent communication and leadership skills to coordinate across global teams.
- Previous experience in fintech or highly regulated environments is a plus.
Perks and Benefits
- Tax-friendly Compensation
- Liberal Leave Policy
- Medical cover for the family, including parents
- Quarterly Wellness Benefit
- WFH Allowance
- Mobile/Internet subsidy (for a smooth WFH experience)
- Employee Referral Program
- Employee Recognition Program
- And more!
What We Do
Using a combination of design, data science, and technology, SigFig builds platforms for financial institutions that delight consumers and empower bankers to have more advice-driven conversations.
For consumer banking, we've built SigFig Atlas, a digital platform that guides consumers through rich needs-discovery journeys, delivers advice, and connects them to the right resources and financial products.
For wealth management, we've built SigFig CoPilot, a modern managed accounts platform that digitizes processes, freeing up full-service financial advisor teams to nurture client relationships and dramatically improve the client experience.