Who We Are & Why Join Us
Avathon is the leading Industrial AI autonomy platform, helping customers across heavy industries -- energy, mining, manufacturing, aerospace, defense, and logistics -- accelerate the journey toward autonomous operations. Our platform is built on a Computational Knowledge Graph foundation that contextualizes and connects operational data across siloed systems, bringing together time series, structured, unstructured, and machine vision data to power AI-driven applications in asset performance management, supply chain intelligence, visual AI, and global trade management. With capabilities spanning digital twins, normal behavior modeling, natural language processing, and computer vision, Avathon delivers real-time predictive intelligence and agentic decision-making at industrial scale.
Cutting-Edge AI Innovation -- Join a team at the forefront of AI, developing groundbreaking solutions that shape the future. High-Growth Environment -- Thrive in a fast-scaling startup where agility, collaboration, and rapid professional growth are the norm. Meaningful Impact -- Work on AI-driven projects that drive real change across industries and improve lives.
Learn more at: avathon.com
Position Summary:
We are seeking an experienced DevOps Engineer to join our dynamic team operating in an agile development environment. The ideal candidate will have a solid foundation in DevOps across building software pipelines, creating repeatable deployments, and experience with various infrastructure components in the cloud. As a Sr. DevOps Engineer you will be working closely with team members across domains with deep technical skills and passion for AI.
You Will:
- Design and continuously improve the infrastructure for cloud-based services and client interfaces
- Manage the day-to-day operations of our build, testing, and continuous integration environment
- Work with internal IT, Software Engineers, Cloud Architects, and fellow DevOps Engineers to build & maintain dev & prod environments
- Implement best practices for always-up, always-available services
- Proactively communicate project & task status to project stakeholders
- Occasional on-call support and customer meetings may include irregular hours as needed
You'll Have Skills:
- Kubernetes Architecture & Operations: 2+ years of experience managing production-grade clusters (GKE preferred), including expertise in K8s internals, Helm chart creation, Service Mesh (Istio), and autoscaling strategies (HPA/VPA).
- GitOps & CI/CD: Proven ability to design container-native pipelines and manage cluster state using GitOps methodologies (ArgoCD) alongside standard CI tools.
- Infrastructure as Code (IaC): Experience on Terraform for provisioning cloud resources and bootstrapping clusters.
- Observability & Reliability: Hands-on experience on full-stack observability for microservices using Prometheus, Grafana, ELK/Loki, and distributed tracing to ensure system reliability and rapid incident response.
- Core Engineering Foundation: 2+ years in DevOps/Linux Administration with strong scripting proficiency (Python, Bash, or Go) and a track record of automating workflows in fast-paced cloud environments.
**A PLUS if you provide the link to your GitHub Website
Avathon is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, pregnancy, genetic information, disability, status as a protected veteran, or any other protected category under applicable federal, state, and local laws.
Avathon is committed to providing reasonable accommodations throughout the recruiting process. If you need a reasonable accommodation, please contact us to discuss how we can assist you.
Skills Required
- Bachelor's degree in Computer Science or related field or equivalent experience
- 5+ years of hands-on experience in DevOps
- Strong background in Linux/Unix Administration
- Experience with automation/configuration management tools
- 4+ years of experience with production cloud environments
- Strong experience with at least one programming language
- Experience with continuous integration and automated testing
- Experience with container orchestration systems
What We Do
Avathon, a leader in Industrial AI, extends the life of critical infrastructure while advancing the journey toward full autonomy. Avathon’s Industrial AI platform empowers commercial and government customers with scalable, secure, and value-driven solutions that enhance efficiency and resilience across heavy industry.









