The Observability Patronus team is responsible for building and operating Atlassian's core observability platforms, including Grafana Mimir and self-hosted Sentry (FedRAMP). Our mission is to empower engineering teams with reliable, scalable, and compliant monitoring solutions that ensure the health and performance of Atlassian's products.
The Role
As a Senior Software Engineer in Observability, you'll be building the platforms and defining practices that keep Atlassian's products rock-solid and lightning-fast for millions of users.
We're not talking about your average telemetry here-we're collecting, moving, and ingesting multiple petabytes of data every single day, and making it all available within seconds. It's a massive scale, and it's critical to everything we do. You'll empower hundreds of Atlassian engineering teams with world-class observability solutions, helping them ship reliable, high-quality products our customers love.
This is a platform team that sits at the intersection of infrastructure and software engineering. Some candidates expect all software engineering, others expect all infrastructure-here, you'll need to be passionate and skilled in both. You'll be building and operating ultra-high-scale systems for metrics, including one of the world's largest Mimir clusters (at 10x the certified load!). You'll face architectural and scaling challenges that will stretch your skills and imagination.
If you're excited by big challenges, love working with smart people, and want to make a real impact, you'll fit right in.
What you'll do
- Drive complex projects autonomously, from technical design to launch.
- Manage and improve mission-critical observability platforms, including Grafana Mimir and self-hosted Sentry (FedRAMP)
- Lead initiatives to automate processes, enhance operational excellence, and reduce costs.
- Be a role model for delivering clean, high-quality code, mentoring more junior members in their growth
- Be a promoter of the best Observability standards and practices to hundreds of Atlassian teams
- Deliver work that impacts thousands of engineers and services improving the reliability of our products
Requirements
Sound like an exciting opportunity? We think so too... To set you up for impact on day one, we'll expect you to have this on your first day:
- Have 6+ years experience in building and operating highly-available observability(metrics) platforms at scale.
- Experience with Prometheus, OpenTelemetry, or similar observability tools.
- Proficiency with at least one programming language (e.g., Golang, Python)
- Expert in handling and managing infrastructure with AWS, GCP or other cloud providers.
- Knowledge of high-volume data pipelines (e.g., Kinesis, Kafka).
- Comfortable working in with ambiguous requirements, being able to identify the most valuable problems to solve and pave a path forward
- Demonstrated skills in the Observability domain and working knowledge in multiple tech domains
- Great soft skills, with the ability to collaboratively deliver with other world-class engineers
We'd also see it as a big bonus if you have any of the following skills:
- Expertise in operating Grafana Mimir at scale.
- Hands-on experience with SignalFX, Splunk, and Sentry.
Top Skills
What We Do
Atlassian creates teamwork solutions for high-performing teams. Our portfolio of collaboration and work management software products includes Jira, Confluence, Trello, Loom and Rovo. More than 300,000 businesses worldwide rely on Atlassian’s technology, including 80 percent of Fortune 500 companies. Our solutions support various business teams and they help organizations plan, track, and deliver their biggest ideas together.
Why Work With Us
At Atlassian, we believe we can accomplish so much more together than apart — which is why everything from our tooling — to our distributed workforce — to how our teams are structured is rooted in collaboration. Come join us and help unleash the potential of every team.
Gallery
Atlassian Teams
Atlassian Offices
Remote Workspace
Employees work remotely.
Atlassians have flexibility in where they work to support their family, personal goals, and other priorities. Our approach to distributed work allows us to tap into talent beyond our office locations, and to reimagine how work gets done.














