Senior Principal Service Reliability Engineer
Discover. A brighter future.
With us, you’ll do meaningful work from Day 1. Our collaborative culture is built on three core behaviors: We Play to Win, We Get Better Every Day & We Succeed Together. And we mean it — we want you to grow and make a difference at one of the world's leading digital banking and payments companies. We value what makes you unique so that you have an opportunity to shine.
Come build your future, while being the reason millions of people find a brighter financial future with Discover.
Job Description
At Discover, be part of a culture where diversity, teamwork and collaboration reign. Join a company that is just as focused on its employees as it is on its customers and is consistently awarded for both. We’re all about people, and our employees are why Discover is a great place to work. Be the reason we help millions of consumers build a brighter financial future and achieve yours along the way with a rewarding career.
The Senior Principal Service Reliability Engineer role will be responsible for working with Discover Network business, internal infrastructure and application teams to evaluate and assess processes, products and partners, and provide technical leadership.
The preferred candidate enjoys working within a team in a fast-paced, entrepreneurial environment, influencing the long-term direction of our solutions while working with a like-minded group of people. Our team operates very much like a startup that just happens to reside within a Fortune 300 Company.
Initiatives typically involve the engagement of resources across multiple business and technical units, so a broad business/technical engagement background is highly desired. This role will require both strategic and tactical cross-functional leadership.
What You’ll Do
- Take responsibility for the operational stability and performance of one or more critical business services used by Discover customers and employees.
How You’ll Do It
Operational stability and performance:
- Work with other members of your assigned value stream to ensure that in-scope applications/platforms are meeting performance and stability requirements. This includes managing major incidents to mitigation/resolution.
Problem management:
- Perform post-incident reviews of all major incidents and determine action items required to avoid similar issues/minimize downtime for future incidents.
Monitors and metrics:
- Work with Application Development to ensure that assigned applications/platforms have appropriate monitoring and metrics in place to measure performance and stability.
Identify functional and non-functional improvements:
- Act as the Operations representative in value stream planning and prioritization sessions to ensure that operational needs of assigned applications/platforms are addressed as needed. Hold quarterly operational performance reviews with value stream management.
Release planning and coordination:
- Work with other members of your assigned value stream to ensure that the production releases for your in-scope applications/platforms are properly planned and coordinated. This includes holding change/release implementation reviews to ensure thorough and appropriate implementation plans.
Review and sign-off/approval of change tickets for the assigned value stream:
- Represent the value stream at Change Advisory Board Meetings.
- Participate in Program Increment Planning Sessions as a liaison for Operations and Infrastructure support.
- Provide information regarding upcoming critical changes to the value stream.
Operational readiness:
- Ensure that applications/platforms in the value stream are operationally ready for production. This includes an annual review of all SOPs/knowledge articles.
- Review monitoring for any new feature launch or other significant change that may impact monitoring.
- Review SOPs/knowledge articles for any new feature launch or other significant change that may impact support documentation.
- Train Command Center and first-level Application Support on new SOPs, knowledge articles, and any other support-related needs.
- Perform monthly capacity analysis of applications/platforms within the value stream. Create and maintain operationally focused ELK dashboards for the value stream.
We’re dedicated to helping people spend smarter, manage debt better and save more for a brighter financial future. Together, we work on challenges to help millions of customers, all while giving back to our communities. Starting with our employees, we create an inclusive environment where everyone has the resources to thrive.
With Discover, we want to inspire you to learn new skills and feel challenged by your work. We are focused on transparency, building expertise through problem solving and experimentation, and getting things done the right way. Here, you’ll make an impact from Day 1 by tackling meaningful work you’re passionate about.
What You’ll Do:
As a Full Stack Senior Principal Software Engineer, you will:
- Develop and maintain full stack solutions in one or more ways:
- Complex front-ends (user interfaces), with a focus on user experience
- Back-end systems (APIs)
- Database and DevOps development
- Create and maintain DevOps processes and application infrastructure while leveraging cloud services (including database systems and models)
- Plan how we build and integrate new systems by leveraging your holistic knowledge of the products in our ecosystem
- Collaborate with our business partners to understand all business requirements
- Innovate on best practices for our team and advocate for improved processes with our internal partners
- Support live systems to ensure business continuity
- Solve complex technical problems and explore new ways to bring value
- Guide our team through implementing new solutions from inception to production
- Mentor and inspire our team members
- Keep up-to-date with new technology trends and innovations
Qualifications You’ll Need
The Basics:
- Bachelor’s degree in Information Technology or related field
- 10+ years of experience in Computer Science, Information Technology or related field
- In lieu of a degree, 12+ years of work experience in Computer Science, Information Technology or related field
Bonus Points If You Have:
- Experience working in a highly complex and scaled global technology organization
- Worked for a financial services or technology company
- 5+ years of development experience in Java, C#, or another object-oriented programming language
- REST Services
- CI with Jenkins or similar; Git/GitHub; Artifactory
- OpenShift and/or Kubernetes
- Jira
- AWS Cloud & Services, e.g. Lambda
- Docker
- JSON/YAML
- Agile development
- Pivotal Cloud Foundry
What are you waiting for? Apply today!
The same way we treat our employees is how we treat all applicants – with respect. Discover Financial Services is an equal opportunity employer (EEO is the law). We thrive on diversity & inclusion. You will be treated fairly throughout our recruiting process and without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status in consideration for a career at Discover.