Engineering Director (Platform) - Reliability & Observability
Join a movement in which everyone can win. We started a movement in which everyone can win – shoppers, retailers, society and every person on our team. To play fair, trust people and reward them for doing the right thing. We see and feel the impact of our work as more and more people gain financial freedom and retailers grow across the globe.
Founded five years ago in Sydney, Australia, Afterpay has over 11 million active customers globally and more than 64,000 of the world’s best retailers around the world including Anthropologie, Revolve, DSW, GOAT, Finish Line, Levi’s, Mac Cosmetics, Ray-Ban and many others. Afterpay is on a mission to power an economy in which everyone wins.
Afterpay is completely free for customers who pay on time – helping people spend responsibly without incurring interest, fees or extended debt. Afterpay empowers customers to access the things they want and need, while still allowing them to maintain financial wellness and control, by splitting payments in four, for both online and in-store purchases. Afterpay is deeply committed to delivering positive outcomes for customers. We are focused on supporting our community of shoppers.
We trust in the next generation and share a vision of a more accessible and sustainable world in which people are rewarded for doing the right thing.
The Opportunity
Afterpay is looking for an engineering leader to grow and lead teams building the platform systems that underpin our products. Our mission in Platform Engineering is to empower product teams to deliver value to the end customer in a faster and safer manner by focusing on availability, reliability and scalability.
This role requires expertise in building out mission-critical systems at scale that can evolve quickly with our rapidly growing business. Additionally, we are looking for leaders with a proven record of building world-class engineering teams that focus on scalability, flexibility, and most importantly resilience.
This is an exciting and unique opportunity to join the leadership team within Platform Engineering that powers one of the world’s largest payment networks and which makes a direct, tangible impact on Afterpay’s growth and success.
We are much more than our job descriptions, but here’s where you will begin….
The Engineering Director of Reliability & Observability will be responsible for:
• Building from zero a world class SRE team that will ensure Afterpay customers can rely on the software and systems behind our products to be always available and performant for their critical needs.
• Setting strategy and developing a roadmap for the SRE team with the goal of reducing the operational overhead of keeping Afterpay services reliable, secure and available for our customers.
• Setting strategy and developing a roadmap for the Observability team that are responsible for logging, tracing, metrics, monitoring systems and developer productivity tooling to provide insights through data on the behaviour of services and the underlying platform infrastructure.
• Improving the overall observability for all services and infrastructure operated at Afterpay.
• Championing SLAs, SLOs, SLIs by working with the business to define SLAs for our customers and then tracking SLIs with service teams to determine alignment.
• Improving the on-call incident remediation process.
• Driving proactive reliability testing culture through the use of chaos engineering and simulated load.
• Advocating for and driving the implementation of reliable design patterns.
It is anticipated the Engineer Director of Reliability & Observability will focus on the following initiatives / objectives in FY22:
• Build a new SRE practice that will partner with product teams in design, build and operational phases of the SDLC - You build it, you run it better when partnered with SRE.
• Create a SRE culture that is viewed within the organisation as a center of excellence for reliability practices.
• Collaborate with key stakeholders across Product Engineering, Platform Engineering and Security to improve baseline operational performance to exceed a SLO 99.99% for the checkout critical user journey.
• Build tooling and drive organisational change to ensure measurement of DORA metrics and SLOs/SLIs focused on the signals error rate, latency and throughput.
• Transition mission critical systems to on-call support for teams that are committed to an investment in reliability.
Who are you?
Like us, you’ll be deeply committed to delivering positive outcomes for customers and passionate about shaping the future of Afterpay.
You'll have:
• Experience managing teams that designed and operated critical infrastructure, observability and importantly with an SRE culture that views availability as a software engineering problem.
• Proven track record of improving reliability, availability and performance of complex distributed cloud based systems.
• Ten or more years’ experience in related technology roles, with at least five years building and architecting software systems.
• Three or more years of Engineering leadership experience.
• Experience architecting and executing large-scale, performant systems that are critical to the business.
• The skill and experience necessary to build alignment, drive decision making, and communicate transparently.
• The ability to marry long-term thinking with short-term action.
• An execution mindset and the ability to deliver with cross-functional teams that are globally distributed.
• Excellent judgment for what are the most important tasks and actions that can move the needle on complex organisational objectives.
How we reward you. We have a pay for performance culture so you can expect to be rewarded for high performance. We pride ourselves on fairness and offer a competitive total reward package made up of salary, incentives and benefits including the opportunity to enroll in our share matching plan. We have a strong focus on health and wellbeing at Afterpay as we aim to support you to succeed in both your career and personal lives, such as providing employees with a corporate membership to Headspace. We offer a wide range of insurance programs so you have the flexibility to choose what is best for you. Afterpay covers 100% of the employee cost and 75% of the cost of your dependents. We value diversity and a collaborative and inclusive environment where everyone feels they belong is important to us.
How to Apply: We don’t know what the future holds. That’s the exciting part; we show up and make it happen. If you’re brave, if you’re committed to doing the right thing and excited by this opportunity, click apply now!
Afterpay is continuing to hire for all open roles with all interviewing and on-boarding done virtually due to COVID-19. All new team members, in addition to current staff, will temporarily work from home until it is safe to return to our offices.