Who We Are & Why Join Us
Avathon is the only physical AI unicorn headquartered in the San Francisco Bay Area. We go beyond models and dashboards: we deploy AI that continuously computes across supply chains, energy systems, and industrial operations. Our Operational Technology platform turns fragmented data into real-time, autonomous decisioning across the systems that power the global economy.
This is not digital AI. This is AI for operational reality, where latency, constraints, and failure have real-world consequences.
Cutting-Edge AI Innovation – Join a team at the forefront of AI, developing groundbreaking solutions that shape the future.
High-Growth Environment – Thrive in a fast-scaling startup where agility, collaboration, and rapid professional growth are the norm.
Meaningful Impact – Work on AI-driven projects that drive real change across industries and improve lives.
Learn more at: Avathon
Position Summary:
We are seeking an experienced DevOps Engineer to join our dynamic team operating in an agile development environment. The ideal candidate will have a solid foundation in DevOps, including building software pipelines, creating repeatable deployments, and working with various cloud infrastructure components. As a Sr. DevOps Engineer, you will work closely with team members across domains who bring deep technical skills and a passion for AI.
You Will:
- Design and continuously improve the infrastructure for cloud-based services and client interfaces
- Manage the day-to-day operations of our build, testing, and continuous integration environment
- Work with internal IT, Software Engineers, Cloud Architects, and fellow DevOps Engineers to build & maintain dev & prod environments
- Implement best practices for always-up, always-available services
- Proactively communicate project & task status to project stakeholders
- Provide occasional on-call support and attend customer meetings, which may involve irregular hours as needed
Skills You'll Have:
- Kubernetes Architecture & Operations: 2+ years of experience managing production-grade clusters (GKE preferred), including expertise in K8s internals, Helm chart creation, Service Mesh (Istio), and autoscaling strategies (HPA/VPA).
- GitOps & CI/CD: Proven ability to design container-native pipelines and manage cluster state using GitOps methodologies (ArgoCD) alongside standard CI tools.
- Infrastructure as Code (IaC): Experience with Terraform for provisioning cloud resources and bootstrapping clusters.
- Observability & Reliability: Hands-on experience with full-stack observability for microservices using Prometheus, Grafana, ELK/Loki, and distributed tracing to ensure system reliability and rapid incident response.
- Core Engineering Foundation: 2+ years in DevOps/Linux Administration with strong scripting proficiency (Python, Bash, or Go) and a track record of automating workflows in fast-paced cloud environments.
A PLUS if you provide a link to your GitHub profile or website.
Avathon is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, pregnancy, genetic information, disability, status as a protected veteran, or any other protected category under applicable federal, state, and local laws.
Avathon is committed to providing reasonable accommodations throughout the recruiting process. If you need a reasonable accommodation, please contact us to discuss how we can assist you.
What We Do
Avathon, a leader in Industrial AI, extends the life of critical infrastructure while advancing the journey toward full autonomy. Avathon’s Industrial AI platform empowers commercial and government customers with scalable, secure, and value-driven solutions that enhance efficiency and resilience across heavy industry.








