Please carefully review the position requirements before submitting a potential candidate for consideration.
Location: Cork, Ireland OR Prague, Czech Republic
Hybrid: 3 days per week in the office
As a Lead Site Reliability Engineer, you’ll be at the forefront of building scalable, resilient, and observable systems that power Tricentis SaaS products globally. This is a hands-on engineering leadership role—balancing technical delivery, process ownership, and team mentorship.
You will drive initiatives across multiple products, shape SRE standards, and serve as a trusted partner to both engineering and product leaders. You will be responsible for elevating engineering quality and reliability while enabling scale and speed.
Your Impact 🚀
Lead and deliver cross-cutting initiatives to improve platform scalability, resilience, and cost efficiency.
Architect and implement cloud-native infrastructure that supports multi-region, multi-tenant deployments.
Improve observability strategy across systems and teams—including SLOs, error budgets, and alerting standards.
Coach and mentor engineers, guiding technical design reviews and promoting engineering excellence.
Own post-incident analysis and ensure learning loops are completed with preventive action.
Influence product reliability from early-stage design to production readiness reviews.
Establish and evolve standards for deployments, operational readiness, and incident response.
Serve as a technical advisor for engineering and product managers across the org.
As a valuable member of our SRE team, you'll have the opportunity to 💪
Drive architectural discussions and make decisions that influence the SRE org and wider engineering teams.
Define and evolve technical roadmaps and execution plans aligned with company goals.
Partner with peers in security, infrastructure, and product to drive platform-wide improvements.
Lead incident response for high-impact outages and continuously reduce incident recurrence.
Contribute to SRE hiring through interviews, onboarding, and process refinement.
Guide the adoption of modern tooling and practices across teams (e.g., GitOps, self-service platforms, chaos engineering).
Represent SRE in leadership forums, bringing insights, trade-offs, and forward-looking strategies.
About You 🎯
6+ years of experience in SRE, Infrastructure, or DevOps roles, including technical leadership.
Expertise in building and operating production systems in public cloud (Azure).
Deep understanding of observability principles (SLOs, SLIs, metrics, traces, logs).
Strong experience with infrastructure-as-code, container orchestration, and CI/CD (Terraform, K8s, GitHub Actions).
Proven track record in leading technical projects, influencing architecture, and mentoring engineers.
Excellent communication and cross-functional collaboration skills.
Proactive, ownership-driven mindset with a passion for reliability and continuous improvement.
Our Tech Stack 🌐
Azure, AWS, Terraform, GitHub Actions, Kubernetes, DataDog, Prometheus, Grafana, Betterstack, incident.io, Jira, and more
Our Culture 🦄
We don't just preach our values; we embody them in everything we do. We are committed to creating an environment that empowers, supports, and includes individuals, where trust, transparency, creativity, curiosity, and continuous improvement thrive on a daily basis.
Tricentis Core Values:
Knowing what we need to achieve and how to achieve it is important. Tricentis' core values define our ways of working and the behaviors we model that create an enjoyable and successful Tricentis life.
- Demonstrate Self-Awareness: Own your strengths and limitations.
- Finish What We Start: Do what we say we are going to do.
- Move Fast: Create momentum and efficiency.
- Run Towards Change: Challenge the status quo.
- Serve Our Customers & Communities: Create a positive experience with each interaction.
- Solve Problems Together: We win or lose as one team.
- Think Big & Believe: Set extraordinary goals and believe you can achieve them.
For additional details regarding submission eligibility and payment terms, please refer to your contract. Only submissions from agencies with current service contracts in place will be considered.
Tricentis is proud to be an equal opportunity workplace. Qualified applicants will receive consideration for employment without regard to race, color, ethnicity, gender, religious affiliation, age, sexual orientation, socioeconomic status, physical or mental disability, or any other status protected by law.
Global Sanctions Compliance
We comply with all applicable global sanctions and export control laws. Candidates must not be listed on any government restricted party lists (including OFAC SDN List and U.S. Commerce Department restricted lists) and must certify that their employment would not violate any sanctions or export control regulations. Candidates must notify us of any changes to their status during the application process or subsequent employment.
What We Do
Tricentis is the global leader in enterprise continuous testing, widely credited for reinventing software testing for DevOps, cloud, and enterprise applications. The Tricentis AI-powered, continuous testing platform provides a new and fundamentally different way to perform software testing: an approach that is totally automated, fully codeless, and intelligently driven by AI. It addresses both agile development and complex enterprise apps, enabling enterprises to accelerate their digital transformation by dramatically increasing software release speed, reducing costs, and improving software quality. Tricentis has been widely recognized as the leader by all major industry analysts, including being named the leader in Gartner’s Magic Quadrant five years in a row. Tricentis has more than 1,800 customers, including the largest brands in the world, such as McKesson, Accenture, Nationwide Insurance, Allianz, Telstra, Moet-Hennessy-Louis Vuitton, and Vodafone.







