Manager - Site Reliability Engineering at Truist (Atlanta, GA)

| Atlanta, GA
Sorry, this job was removed at 8:25 p.m. (CST) on Friday, June 17, 2022
Find out who's hiring in Atlanta, GA.
See all Developer + Engineer jobs in Atlanta, GA
Apply Now
By clicking continue you agree to Built In’s Privacy Policy and Terms of Use.
The position is described below. If you want to apply, click the Apply button at the top or bottom of this page. You'll be required to create an account or sign in to an existing one.

Need Help?

If you have a disability and need assistance with the application, you can request a reasonable accommodation. Send an email to Accessibility or call 877-891-2510 (accommodation requests only; other inquiries won't receive a response).

Regular or Temporary:

Language Fluency: English (Required)

Work Shift:
1st shift (United States of America)

Please review the following job description:

*** Remote or hybrid work options available ***

At Truist, we want to inspire and build better lives and communities. With our collective passion, and commitment to innovation, we're creating better financial experience to help our customers achieve more. We're looking for talented people who will put our customers at the center of everything we do. Join our diverse and inclusive team where you'll feel valued and inspired to contribute your unique skills and experience. We are hiring a Site Reliability Engineer (SRE) to build and grow the Consumer Technology CIO Site Reliability Engineering (SRE) practice at Truist.

Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Truist's services have reliability and uptime appropriate to business needs and make rapid improvements and while closely monitoring capacity and performance. SRE uses innovative solutions by leveraging automation and code to improve production stability. Intensive focus on optimizing existing systems, building infrastructure and eliminating work through automation. Responsible for the big picture of how our systems relate to each other. You will ensure applications on-boarded to SRE are instrumented for full-stack observability and continuous testing, introduce continuous improvements, integrate into IT Operations, and share support responsibilities for critical customer journeys, business flows, and applications. You will also help us develop process improvements for change and release compliance.

Following is a summary of the essential functions for this job. Other duties may be performed, both major and minor, which are not mentioned below. Specific activities may change from time to time.
  • Work with internal IT partners in evaluating and gathering requirements for establishing and/or enhancing application monitoring, observability, resiliency, and incident management.
  • Manage change and release compliance using industry best practices.
  • Communicate and document potential solutions, impact analysis, benefits/risks, implementation requirements, and recommended approach.
  • Maintain a high-level of awareness and understanding of existing and emerging technologies, as well as industry and bank issues in order to recommend the utilization of the appropriate technologies to solve for business challenges and help guide Retail Technology in accomplishing goals.
  • Review processes with the Architecture Working Groups (AWG) to identify potential architectural issues early in the development/procurement cycle for the purpose of steering the proposed solution towards a sound architectural conclusion.
  • Provide application architecture consulting services to Retail Technology as requested/needed.
  • Perform approved proof-of-concepts "testing" for application monitoring and resiliency.
  • Participate in chaos testing to document and ensure application performance meets functional and operational standards, and does not impact the business needs of bank personnel and clients.

Required Qualifications:

The requirements listed below are representative of the knowledge, skill and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
  • Bachelor's degree and eight years of experience in development or production support or an equivalent combination of education and work experience.
  • Deep specialized and/or broad functional knowledge.
  • Sound understanding of business and organizational strategies and processes.
  • Ability to interpret internal and external business challenges and recommend best practices.
  • Ability to lead complex projects.
  • Sophisticated analytical skills and the ability to solve complex technical and business problems.
  • Ability to influence others at senior levels to adopt a new perspective.

Preferred Qualifications:
  • Bachelor's degree in Business or IT, or equivalent education and related training
  • 5+ years of demonstrated experience in application development/support
  • Significant knowledge in networking, database, and servers in a medium to large corporation at the enterprise level or similar consulting experience
  • Change and release process understanding
  • Strong analytical skills
  • Strong verbal and written communication skills
  • Significant knowledge of current and emerging application architecture principles, methodologies and tools
  • Ability to interact with all levels of an organization
  • Demonstrated competency in strategic thinking with ability to differentiate feasible from academic solutions
  • Ability to translate high-level planning information into application needs/solutions
  • Ability to grasp the 'big picture' for a solution by considering all potential options and impacted areas
  • Aptitude to understand and adapt to newer technologies
  • Proficient in understanding client service models and customer orientation in a service delivery
  • Demonstrated proficiency in basic computer applications, such as Microsoft Office software products
  • You understand what it takes to solve problems in a complex system of interacting components.
  • You are a Team Player. You enjoy collaborating, learning from and teaching others so we can all become better developers. You assume good intent in others, and actively do your part to make a positive work environment.
  • Engage in and improve the whole lifecycle of services from inception and design, through deployment, operation and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.
  • Ability to travel, occasionally overnight
  • Proven Technical Expertise with one or more of the following
  • ServiceNow
  • Software Development Java, Go, C/C++, Angular, R
  • OS and Platform AWS, Lamda, EMR, PCF, Kubernetes, OpenShift, Linux, Azure, Windows, VMware
  • CI/CD and Automation Jenkins, Gitlab, SonarQube, Artifactory, Ansible, Puppet, Apigee
  • Observability and AIOps using one or more: Dynatrace, DataDog, Grafana, Prometheus, ELK, Elastic, Kibana, Kafka, Splunk, CloudWatch, Jaeger, Zipkin, Kinesis, Apache Airflow, AppDynamics
  • Experience in one or more of the following areas is desired
  • Financial Services experience
  • AIOps Moogsoft, BigPanda, Robotic Process Automation (RPA), UIpath, Artificial Intelligence (AI) and Machine Learning (ML) Frameworks
  • Operations Tools ServiceNow, PagerDuty, Microsoft Teams, Symphony/Slack, Remedy, IBM Netcool
  • Data/Data Structures Oracle, SQL, Mongo, Hadoop, Cloudera, Spark
  • Chaos Engineering and Performance Testing using: Gremlin, Chaos Monkey, Selenium, jmeter, Blazemeter, Performance Center, Quality Center/ALM, DevTest
  • Experience with Agile Scrum (Daily Standup, Sprint Planning and Sprint Retrospective meetings) and Kanban
  • 3+ years of experience with Cloud technologies

Truist supports a diverse workforce and is an Equal Opportunity Employer who does not discriminate against individuals on the basis of race, gender, color, religion, national origin, age, sexual orientation, gender identity, disability, veteran status or other classification protected by law. Drug Free Workplace.

Thank you for your interest in Truist! BB&T and SunTrust have come together in a transformational merger of equals to create Truist, the premier financial organization in the country. You may notice references to our legacy company names, BB&T and SunTrust, in places throughout this site. All such references should be understood to refer to Truist moving forward while we continue to transition to the Truist name.

EEO is the Law Pay Transparency Nondiscrimination Provision E-Verify
More Information on Truist
Truist operates in the Fintech industry. The company is located in Charlotte, NC. Truist was founded in 2019. It has 12339 total employees. It offers perks and benefits such as Flexible Spending Account (FSA), Disability Insurance, Dental Benefits, Vision Benefits, Health Insurance Benefits and Life Insurance. To see all 441 open jobs at Truist, click here.
Read Full Job Description
Apply Now
By clicking continue you agree to Built In’s Privacy Policy and Terms of Use.

Similar Jobs

Apply Now
By clicking continue you agree to Built In’s Privacy Policy and Terms of Use.
Save jobView Truist's full profileFind similar jobs