At Guidewire, we deliver the software that Property and Casualty (P&C) insurance companies rely on to protect their customers during crises, natural disasters, accidents, and cyber risks. Our core applications enable insurers to sell and underwrite policies, settle claims, and bill their customers. We also offer a suite of innovative products for data management, digital portals, and predictive analytics. Hundreds of insurers worldwide use Guidewire's products, running on our cutting-edge Guidewire Cloud Platform, to handle billions of dollars in business. We are dedicated to providing the tools and technology that help insurers protect and support their customers when they need it most.
The Opportunity-
We are seeking a Site Reliability Engineer III who is eager to contribute to the transformation of the insurance industry with our leading cloud platform. As a member of the SRE-Application team, you'll play a critical role in ensuring the reliability, performance, and scalability of applications running on our Guidewire Cloud Platform. This position offers a unique opportunity to apply your skills in automation, software engineering, and operational discipline to support our cloud-based solutions.
What You'll Do
- Work with development teams to troubleshoot and resolve issues, minimizing customer impact.
- Develop and maintain automated runbooks to manage issues proactively.
- Apply engineering principles and automation to enhance our operating environments.
- Monitor and improve the reliability and performance of applications on the Guidewire Cloud Platform.
- Use your software engineering expertise to optimize systems and reduce manual toil.
- Document incidents and develop processes to prevent future occurrences.
- Stay current with industry trends, tools, and best practices in site reliability engineering.
- Foster a culture of innovation, learning, and continuous improvement.
- Participate in on-call rotations to ensure the availability and reliability of our services.
What You'll Bring
- Experience as an SRE or similar role, with a focus on improving system reliability.
- Strong problem-solving skills and the ability to analyze complex systems and devise effective solutions.
- Effective collaboration and communication skills to work cross-functionally and document processes clearly.
- Experience with automation, monitoring, and performance optimization tools and techniques.
- Commitment to maximizing uptime, scalability, and delivering an exceptional end-user experience.
- Passion for technology and a desire to continuously learn and grow your skills.
- Alignment with Guidewire's mission to leverage technology to help protect and support others.
Required Skills:
- Software engineering background with experience in Python, Go, or Java, following best practices (SOLID, DRY, KISS) and writing clean, testable code
- Experience with designing and implementing SLI's, SLO's, and Error Budgets
- Familiarity with application performance monitoring (APM) and telemetry tools to maintain expected service levels for applications
- Experience troubleshooting and debugging distributed systems on cloud infrastructure
- Experience with CICD pipelines within K8S and legacy ecosystems
- Experience creating monitors, dashboards, and synthetic transactions in monitoring tools like Datadog
- Experience deploying and managing scalable infrastructure within AWS and Kubernetes ecosystems using Terraform and other cloud-native approaches
- Experience with infrastructure configuration management using tools such as GitOps, Puppet, or Ansible
- Good understanding of cloud networking, security, and vulnerability management, with the ability to programmatically remediate infrastructure issues
Preferred Skills
- SRE Certification in one or more categories
- AWS Certification in one or more categories
- Experience with SQL, database administration, data pipelines, performance tuning, and schema design
- Familiarity with pipelining tools such as Team City, Bitbucket Pipelines, Jenkins, or GitHub Actions
- Exposure to open-source distributed data processing frameworks such as Hadoop, Apache Spark, AWS RedShift, etc.
- Experience with distributed systems, including microservices and event-driven architectures
Why Guidewire-
This is an opportunity to join a mission-driven company and make a real impact in the lives of people facing challenges. You'll work with cutting-edge technology, collaborate with talented peers, and grow your skills in a culture that values innovation, teamwork, and work-life balance. We offer competitive compensation, comprehensive benefits, and opportunities for career development.If you're a Senior SRE who combines deep technical expertise with a passion for problem-solving and a commitment to reliability, we'd love to hear from you. Join us in building the software that helps insurers care for their customers when they need it most.This position requires participation in mandatory on-call rotations to ensure the availability and reliability of our services. This includes responding to incidents and alerts outside of regular business hours, on weekends, and during holidays, as per the established on-call schedule. Candidates must be willing and able to fulfill this critical responsibility.
About Guidewire
Guidewire is the platform P&C insurers trust to engage, innovate, and grow efficiently. We combine digital, core, analytics, and AI to deliver our platform as a cloud service. More than 540+ insurers in 40 countries, from new ventures to the largest and most complex in the world, run on Guidewire.
As a partner to our customers, we continually evolve to enable their success. We are proud of our unparalleled implementation track record with 1600+ successful projects, supported by the largest R&D team and partner ecosystem in the industry. Our Marketplace provides hundreds of applications that accelerate integration, localization, and innovation.
For more information, please visit www.guidewire.com and follow us on Twitter: @Guidewire_PandC.
Guidewire Software Inc. provides equal employment opportunities to all applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. All offers are contingent upon passing a criminal history and other background checks where it's applicable to the position.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
Top Skills
What We Do
Guidewire is the platform P&C insurers trust to engage, innovate, and grow efficiently. We combine digital, core, analytics, and AI to deliver our platform as a cloud service. More than 540 insurers, from new ventures to the largest and most complex in the world, run on Guidewire.
As a partner to our customers, we continually evolve to enable their success. We are proud of our unparalleled implementation track record, with 1,000+ successful projects, supported by the largest R&D team and partner ecosystem in the industry. Our marketplace provides hundreds of applications that accelerate integration, localization, and innovation.
Why Work With Us
We're focused on each and every employees' personal and professional development, and offer internal career mobility programs and growth opportunities that make Guidewire unique. Other perks like generous PTO, flexible working, our Guidewire Gives Back charitabeland our "Work From Almost Anywhere" program support our employees' work-life balance