What You’ll Be Doing
- Managing all aspects of the team's day-to-day work, including planning and prioritizing tasks and agendas, while working closely with development teams, support engineers, and R&D leadership.
- Leveraging a highly technical background to take on complex tasks as needed.
- Ensuring the availability and smooth, cost-efficient operation of our production environment, including leading efficient incident response.
- Maintaining and improving the release lifecycle for various products, ensuring frequent and reliable delivery.
- Designing and embedding industry best practices for online services, including disaster recovery, business continuity, and service health measurement.
- Providing development teams with platforms and tools that streamline their work and increase productivity.
- Establishing a robust and reliable monitoring framework for complex distributed systems.
- Driving the professional growth and development of a highly motivated team.
- Constantly learning and exploring the cutting edge of DevOps tools and practices, and leading their implementation.
Requirements:
- 5+ years of experience with DevOps and cloud technologies.
- 2+ years of experience as a team lead.
- Exceptional troubleshooting skills and diagnostic intuition for solving challenging problems.
- Extensive experience with Kubernetes (both operations and deployment) and Docker.
- Strong familiarity with public cloud providers (AWS, GCP, and Azure); proven expertise with more than one is a significant advantage.
- Familiarity with continuous monitoring solutions, including Prometheus, the Grafana stack (Loki, Mimir, Pyroscope), and OpenTelemetry.
- Experience with event-driven microservices and event buses such as RabbitMQ or Kafka.
- Strong knowledge of deployment models, capacity management, and service utilization.
- Expertise in the design, architecture, and operation of complex, large-scale online services.
- Programming and scripting proficiency in Bash, Python, or Golang.
- Familiarity with GitOps principles is a must; strong experience with one or more Argo projects is a plus.
- Proven experience with Infrastructure as Code (IaC) tooling like Terraform; experience with Crossplane is an advantage.
- Experience working with databases (e.g., NoSQL stores such as MongoDB), along with a strong understanding of system and networking concepts and troubleshooting techniques.
- Proven ability to collaborate effectively with cross-functional, global, and remote teams from diverse backgrounds.
Why Join Us?
- Cutting-Edge Technology: Be part of a company that lives and breathes Kubernetes, GitOps, and cloud-native technologies, working on innovative solutions at scale.
- Collaborative Environment: Join a supportive and highly skilled team, where your expertise and ideas will shape the future of our product.
- Professional Growth: Enjoy opportunities for continuous learning, growth, and staying ahead in the fast-evolving DevOps ecosystem.
- Impactful Work: Contribute to building and maintaining a platform that supports thousands of developers worldwide, making a real difference in the industry.
What We Do
Octopus Deploy is one of Australia’s fastest-growing software companies (and we’re taking on the world). After bootstrapping for a decade, in 2021 we quietly closed Australia’s second-largest-ever venture capital raise, accepting a USD 172M minority investment from Insight Partners. In short: Octopus simplifies the most complicated deployments wherever you deploy your software.
Why Work With Us
You'll be joining a high-growth company with numerous opportunities to learn and advance your career. We offer great benefits and value transparency and fairness in every aspect of our business. Octopus is a high-trust environment that values each individual’s contributions while encouraging a work/life balance.