Senior Site Reliability Engineer

Reposted Yesterday
Be an Early Applicant
4 Locations
In-Office or Remote
Senior level
Cloud • Software
The Role
As a Senior Site Reliability Engineer at Megaport, you will enhance system reliability, engage with stakeholders, and write code while promoting SRE practices and collaboration within the team.
Summary Generated by Built In
About Megaport
We’re not your typical tech company – and we don’t want to be. Megaport is the global leader in Network as a Service (NaaS), and has transformed the way businesses connect to the cloud, data centers, and each other. We’re publicly listed on the Australian Stock Exchange and partnered with the biggest names in tech like Amazon, Microsoft, Google, Oracle, IBM, and more. Headquartered in Brisbane with a crew of over 600 people spread across Asia-Pacific, Europe, and the Americas, our employees enjoy an environment that is collaborative, supportive, and (actually) fun.
 
Our Team Culture
We’re a team of problem solvers, pixel pushers, code slingers, and cloud fanatics. Culture is more than a poster on the wall here – collaboration beats hierarchy, curiosity fuels our growth, and everyone’s voice matters. We take our work seriously, but not ourselves. We work across time zones to execute on our global vision, trust each other to get things done, and never compromise our values for commercial gain. Most importantly, we place our customers at the center of everything we do.
 
We’re committed to increasing representation in the tech industry and welcome applicants from all backgrounds. Don’t meet every requirement? That’s okay. If you’re excited about this role, we encourage you to apply.

The Role 
As a Senior Platform Engineer, you are a champion for DevOps and SRE culture and industry best practice within Megaport. You will work alongside talented team members in multiple timezones ensuring that systems are secure, maintainable and available. External to the team you will be engaging with stakeholders in requirements analysis and demonstrations. Technically you will be very hands on. Continually evolving your skills through a mix of peer reviews and research. Ultimately your obsession is customer success and ensuring company goals are met.

What You Will Be Doing

  • Improving production reliability and system resilience within an SRE scoped team
  • Championing high standards of work and industry best practices
  • Communicating with teams and stakeholders at all stages
  • Bringing fresh ideas to the table and encouraging others
  • Diving into complex technical problems with a can-do attitude
  • Working across numerous technologies in a fast-changing industry
  • Participating in on-call rotation, incident response, and blameless post-incident reviews
  • Writing code, handling alerts, improving solutions, and supporting others
  • Playing a crucial role in the success of your company and team

What We Are Looking For

  • 5+ years administering Linux systems and related infrastructure in production environments
  • A collaborative SRE mindset, with familiarity around SLIs/SLOs/SLAs, error budgets, blast radius, and blameless postmortems
  • A focus on automation, reducing toil, and preventing problem recurrence
  • A track record of writing runbooks that work for the broader team, not just yourself
  • Strong Kubernetes and broader ecosystem fundamentals
  • Cloud infrastructure experience; AWS strongly preferred and bare-metal is a bonus
  • Strong tool development - Bash, plus either Python or Go preferred, or similar
  • Infrastructure-as-code tooling experience - Terraform preferred
  • CI/CD and version control, GitHub preferred
  • Database experience - one of Postgres, Cassandra, or ClickHouse preferred
  • Experience operating a production observability stack (metrics, logs, traces), with an eye for signal over noise
  • Comfortable working on live production infrastructure, with strong troubleshooting instincts and ownership of incident response
  • A history of continual professional development
  • A self-directed style suited to an async, globally distributed team, and comfortable picking up adjacent work when the situation calls for it

What We Offer

  • Flexible working environments
  • Birthday Leave
  • Generous study and training allowance + 5 days paid study leave
  • Creative, fun, and contemporary workspaces
  • Motivated team of industry experts and new talent
  • Celebrated success with ‘Legend’ and ‘Kudos’ Awards
  • Health and wellness program

#LI-DNI


If you have any questions, please reach out to Megaport's Talent Acquisition Team at [email protected]
 
NOTE: All Megaport business correspondence is conducted via our business email accounts (@megaport.com). If you have any concerns, please reach out to Megaport's careers team [email protected] directly and we will verify the legitimacy of any communication. Megaport will not ask you to create an account via Microsoft teams, and does not associate with any email accounts under "@megaportau.com".
 
All applications will be treated in confidence.
Please see Part 2 of our Privacy Policy to see what information Megaport collects from job applicants, why, and how we store and use it. Note that you’re entitled to know what personal data of yours Megaport holds, to request updates, rectification, and in some circumstances restriction or deletion thereof if you object (you being entitled to withdraw your consent to our holding your information at any time). Please see Part 5 of our Privacy Policy for more details on this and how to contact Megaport's data protection officer if you have any further privacy-related questions. Candidates who meet the selection criteria will be invited to attend an interview. Strictly no Recruitment Agencies.

Skills Required

  • 5+ years administering Linux systems in production environments
  • Familiarity with SLIs/SLOs/SLAs and blameless postmortems
  • Strong Kubernetes and cloud infrastructure experience
  • Proficient in Bash, Python or Go, Terraform
  • Experience with CI/CD and GitHub
  • Database experience with Postgres, Cassandra, or ClickHouse
  • Experience with production observability stacks
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
San Francisco, California
340 Employees
Year Founded: 2013

What We Do

We make connectivity easy. Megaport is changing the way people, enterprises, and service providers interconnect globally. Our Software Defined Network (SDN) connects 850+ enabled data centres in 25+ countries across North America, Asia Pacific, and Europe. We enable customers with fast, flexible, secure and on-demand connectivity to leading cloud, network, and managed service providers. Our Network as a Service solution offers greater agility, reduced operating costs, and increased speed to market compared to traditional connectivity options

Similar Jobs

Block Logo Block

Senior Site Reliability Engineer

Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
In-Office or Remote
Melbourne, Victoria, AUS
12000 Employees
Remote
Australia
79 Employees
Remote
Australia
91 Employees
80K-150K Annually

Algolia Logo Algolia

Senior Site Reliability Engineer

Natural Language Processing • Software
Remote
Australia
700 Employees
152K-178K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account