The Role
Lead the development of scalable applications, maintain high-quality code, mentor engineers, and drive project delivery in a dynamic team environment.
About Docsumo:
Docsumo is a Document Workflow platform that converts unstructured documents (like bank statements, financials, policies) into structured, actionable data with the help of Agentic Workflows. We’re backed by Sequoia, Barclays, Fifth Wall, Common Ocean, and Techstars — and trusted by leading banks, insurers, and fintechs worldwide.
The opportunity as Senior DevOps / SRE Engineer:
We’re looking for a Senior SRE (Python) to lead a small team (2 engineers) and own the reliability, deployment, and automation of our AI platform. You’ll work hands-on with Kubernetes, GCP, AWS, and Python (Flask/FastAPI) to ensure our infrastructure and applications run securely, reliably, and at scale.
Key Responsibilities:
- Lead SRE initiatives and mentor 2 junior engineers.
- Own deployments and monitoring across GCP (K8s, Cloud Run, VPC, networking) and AWS (Lambda, SES).
- Debug & fix issues in Python apps (Flask, FastAPI), with occasional Lua for canary deployments.
- Set up automation, infra-as-code, CI/CD pipelines, and incident response.
- Optimize for cost, performance, and reliability across infra and applications.
- Work closely with backend engineers, product, and operations to keep our services running smoothly.
Need to have:
- 4+ years in SRE/DevOps with strong Python scripting & backend debugging skills.
- Hands-on experience with Kubernetes, Docker, and cloud infra (GCP & AWS).
- Experience with MongoDB, Elastic, and monitoring tools (Prometheus, Grafana).
- Strong troubleshooting, debugging, and problem-solving skills.
- Ability to lead small teams and drive reliability culture.
Nice to have:
- Experience with Temporal, Redis, or serverless (Cloud Run, Lambda).
- Exposure to high-traffic SaaS or AI/ML infrastructure.
- Prior team leadership/mentorship experience.
Why join us?
- Lead the SRE charter and shape reliability for our platform.
- Work on modern infra (K8s, Cloud-native, Temporal, serverless).
- High ownership, visible impact — report directly to Engineering leadership.
- Opportunity to grow into Principal Engineer / SRE Manager.
- Fast-paced startup, strong learning curve, and a collaborative culture.
Top Skills
AWS
FastAPI
Flask
Google Cloud Platform
Kubernetes
MongoDB
Python
The Company
What We Do
Docsumo is document AI software whose intelligent OCR technology helps you convert unstructured documents such as pay stubs, invoices, and bank statements into actionable data. It works with documents in any format with minimal setup.