Senior Site Reliability Engineer

Posted 7 Days Ago
Nottingham, Nottinghamshire, England, GBR
In-Office
Senior level
Big Data • Marketing Tech • Analytics
The Role
Lead SRE initiatives to improve reliability and performance of production systems on AWS. Define SRE best practices, establish SLIs/SLOs, run incident response and postmortems, implement observability and self-healing automation, mentor SREs, and collaborate with development and senior stakeholders to align reliability with business goals.
Summary Generated by Built In
Company Description

Experian is a global data and technology company, powering opportunities for people and businesses around the world. We help to redefine lending practices, uncover and prevent fraud, simplify healthcare, create marketing solutions, and gain deeper insights into the automotive market, all using our unique combination of data, analytics and software. We also assist millions of people to realize their financial goals and help them save time and money.

We operate across a range of markets, from financial services to healthcare, automotive, agribusiness, insurance, and many more industry segments.

We invest in people and new advanced technologies to unlock the power of data. As a FTSE 100 Index company listed on the London Stock Exchange (EXPN), we have a team of 22,500 people across 32 countries. Our corporate headquarters are in Dublin, Ireland. Learn more at experianplc.com.

Job Description

We are looking for a Site Reliability Engineer to improve the reliability and performance of business-critical systems. Reporting to our Head of SRE, you will focus on AWS cloud infrastructure, DevOps tooling, and core SRE practices within a distributed production environment.

Main Responsibilities:

  • Leadership & Strategy
    • Define and implement SRE best practices across the organization.
    • Proven expertise in production support, engineering, disaster recovery (DCR), automation, and cloud operations.
    • Mentor and guide a team of SREs, fostering growth.
    • Collaborate with senior stakeholders to align reliability goals with business objectives.
  • Reliability & Performance
    • Establish SLIs, SLOs, and SLAs for critical services and ensure adherence.
    • Drive initiatives to improve system resilience and reduce operational toil.
    • Excellent at designing systems that detect and remediate issues without manual intervention (self-healing systems, runbook automation).
    • Exposure to tools like Gremlin, Chaos Monkey, and AWS FIS to simulate outages and improve fault tolerance.
  • Incident Management
    • Act as the primary point of escalation for critical production issues and lead major incident response, root cause analysis, and postmortems.
    • Perform detailed post-incident investigations to identify underlying causes. Document findings and share learnings to prevent recurrence.
    • Implement preventive measures and continuous improvement processes.
  • Observability
    • Champion monitoring, logging, and alerting strategies using tools like Prometheus, Grafana, ELK, and AWS CloudWatch.
    • Build real-time dashboards to visualize system health and reliability metrics.
    • Configure intelligent alerting based on anomaly detection and thresholds.
    • Combine metrics, logs, and traces to enable root cause analysis and reduce Mean Time to Resolution (MTTR).
    • Knowledge of AIOps or ML-based anomaly detection for proactive reliability management.
  • Collaboration
    • Work closely with development teams to integrate reliability into application design and deployment.
    • Promote a culture of shared responsibility for uptime and performance across engineering teams.

Qualifications

  • Deep expertise with various AWS services. Advanced knowledge of monitoring and observability tools.
  • Strong leadership capabilities with a focus on setting clear direction, aligning team efforts with organizational goals, and maintaining high levels of motivation and engagement across the team.
  • Excellent communication skills, with the ability to articulate complex ideas, solutions, and feedback clearly to both technical and non-technical stakeholders. Adept at managing conflict constructively and facilitating consensus.
  • Proven track record of building secure, mission-critical, high-volume transaction web-based software systems, preferably in regulated environments (finance and insurance industries).
  • Hands-on technologist with software development experience, including leading an SRE team.

Additional Information

  • Hybrid working, 2 days a week in our Nottingham office
  • Great compensation package and discretionary bonus
  • Core benefits include pension, Bupa healthcare, Sharesave scheme and more
  • 25 days annual leave with 8 bank holidays and 3 volunteering days. You can purchase additional annual leave.

Our uniqueness is that we celebrate yours. Experian's culture and people are important differentiators. We take our people agenda very seriously and focus on what matters: DEI, work/life balance, development, authenticity, collaboration, wellness, reward & recognition, volunteering... the list goes on. Experian's people-first approach is award-winning: World's Best Workplaces™ 2024 (Fortune Top 25), Great Place To Work™ in 24 countries, and Glassdoor Best Places to Work 2024, to name a few. Check out Experian Life on social or our Careers Site to understand why.

Experian is proud to be an Equal Opportunity and Affirmative Action employer. Innovation is an important part of Experian's DNA and practices, and our diverse workforce drives our success. Everyone can succeed at Experian and bring their whole self to work, irrespective of their gender, ethnicity, religion, colour, sexuality, physical ability or age. If you have a disability or special need that requires accommodation, please let us know at the earliest opportunity.

Experian Careers - Creating a better tomorrow together

Find out what it's like to work for Experian by clicking here

Skills Required

  • Deep expertise with various AWS services
  • Advanced knowledge of monitoring and observability tools (Prometheus, Grafana, ELK, CloudWatch)
  • Proven track record of building secure, mission-critical, high-volume transaction web-based systems
  • Experience in production support, disaster recovery, automation, and cloud operations
  • Experience defining and implementing SLIs, SLOs, and SLAs
  • Strong leadership and people-management skills; mentoring and guiding SRE teams
  • Excellent communication skills for technical and non-technical stakeholders
  • Hands-on software development experience and leading an SRE team
  • Experience designing self-healing systems and runbook automation
  • Exposure to chaos engineering tools (Gremlin, Chaos Monkey, AWS FIS)
  • Knowledge of AIOps or ML-based anomaly detection
  • Experience in regulated environments (finance or insurance)

Experian Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Experian and has not been reviewed or approved by Experian.

  • Healthcare Strength: Medical and dental coverage is described as strong, with expanded mental health resources and telemedicine options. Coverage includes inclusive services such as gender transition and fertility support.
  • Leave & Time Off Breadth: Time-off offerings are generous, including substantial PTO/vacation, paid holidays, and paid volunteer days with options to purchase additional leave. Parental leave is available for birth and non-birth parents alongside flexible working arrangements that support work-life balance.
  • Retirement Support: Retirement programs include a 401(k) with company matching and contributory pension schemes in some regions. These elements complement base pay and bonuses to form a competitive total rewards package.

The Company
HQ: Costa Mesa, CA
16,292 Employees
Year Founded: 1980

What We Do

Experian unlocks the power of data to create opportunities for consumers, businesses and society. During life’s big moments – from buying a home or car, to sending a child to college, to growing a business exponentially by connecting it with new customers – we empower consumers and our clients to manage data with confidence so they can maximize every opportunity. We gather, analyse and process data in ways others can’t. We help individuals take financial control and access financial services, businesses make smarter decisions and thrive, lenders lend more responsibly, and organizations prevent identity fraud and crime. For more than 125 years, we’ve helped consumers and clients prosper, and economies and communities flourish – and we’re not done. Our 20,600 people in 43 countries believe the possibilities for you, and our world, are growing. We’re investing in new technologies, talented people and innovation so we can help create a better tomorrow.

About Experian: Bringing data to life requires creativity, passion, flexibility and expertise. We want you to share in our success. That's why we offer rewards that recognise great performance. Working in a culture of collaboration, achievement and respect, we will give you the support and encouragement you need to develop your skills and talents and progress your career. Every day our people bring enthusiasm, innovation and inspiration to work, and if this sounds like you, connect with us at Experian.
