Engineering Division - BCP Engineering - Associate - Bengaluru

Posted 6 Days Ago
Bengaluru, Bengaluru Urban, Karnataka, IND
In-Office
Mid level
Fintech • Financial Services
The Role
Manage and lead resiliency, disaster recovery, power-down and cyber-attack recovery testing for critical applications and infrastructure. Coordinate cross-functional teams, develop scenario-based test events, validate recovery plans, identify gaps and automation opportunities, oversee execution and incident escalation, and ensure regulatory and audit compliance across regions.
Summary Generated by Built In
YOUR IMPACT

The Engineering Resiliency and Recovery Specialist Engineer manages resiliency and recovery testing for critical applications within GS's regional Engineering division and Business Units. The ETO Recovery Engineering team establishes policies, governance, and standards to ensure business resilience and service continuity are properly verified.

The Resiliency and Recovery Specialist collaborates closely with Technology Infrastructure, Application, Risk Management, and Corporate Services teams to coordinate and ensure the seamless execution of Technology events, including building power-downs, Data Center failover tests, and Disaster Recovery testing. Additionally, this role participates in team projects aimed at enhancing the effectiveness of these programs across the US, Asia, Bengaluru, and EMEA regions.

Resiliency BCP testing ensures essential business functions continue during emergencies of any kind. The Resiliency and Recovery Specialist develops and tests plans to reduce disruptions and protect the firm's operations, reputation, and financial stability.

In this capacity, the individual will develop scenario-based test events and verify recovery plans against them. The Engineering Resiliency and Recovery Specialist will work closely with Technology Infrastructure, Risk Management, Corporate Services and application development teams to coordinate (plan) and ensure the smooth execution of Technology (application and infrastructure) events such as pandemic tests, concentration testing, and Disaster Recovery testing, and will work on projects to improve the effectiveness of such programs across all regions (Americas, Asia, Bengaluru and EMEA).

III. OUR IMPACT Division Description
THE TECHNOLOGY DIVISION

Our team of engineers builds solutions to the most complex problems. We develop cutting-edge systems and processes that form the core of our key business and enable transactions to move in milliseconds. We provide real-time access to critical deal information and crunch billions of data points each day to inform firm-wide market insights and strategies. Team members have the opportunity to work at the forefront of technology innovation alongside industry leaders and make significant contributions to the field.

Team Description:
ENTERPRISE TECHNOLOGY OPERATIONS – ENGINEERING Resiliency and Recovery Specialist

The Engineering Resiliency and Recovery Specialist is responsible for strategic initiatives to reduce risk and improve resiliency throughout the operational organization. BCP ensures effective Engineering recovery plans are in place and comply with the firm’s overall resiliency strategies, so that operations continue during crisis events. BCP provides a platform for Engineering to validate recovery strategies across people, products, platforms and functions. To reduce resiliency risk and validate recovery strategies, BCP drives adoption of various Core and ETO platforms and products to enforce adherence to controls, automate recovery plans, and reduce manual work. Furthermore, the organization identifies applications in scope for resiliency and recovery testing, tracks exemptions and maintains evidence for Business Continuity test credit. BCP addresses the recovery sequencing problem by mapping the topology between applications and infrastructure, from which it derives the recovery order and identifies key dependencies during outages.

IV. HOW YOU WILL FULFILL YOUR POTENTIAL
  • Lead Infrastructure and Application Disaster Recovery testing and Data Center Power-down events
  • Drive adoption of the mandated controls with application teams.
  • Provide guidance to application owners on how they can adapt recovery procedures to adhere to the uplifted controls in place.
  • Scope Disaster Recovery test events to include the interdependencies of shared services, upstream and downstream application dependencies, order of recovery, etc.
  • Cyber attack recovery testing: drive teams to become resilient and able to recover during a cyber attack, and test the cyber attack recovery procedures.
  • For power-down events, establish critical milestones, establish the order of recovery, and verify dependencies among infrastructure components
  • Coordinate and manage regulatory resiliency recovery tests, such as SIFMA's industry-wide exercises, SPOOR-related tests, and those guided by the Monetary Authority of Singapore (MAS), to ensure compliance with industry standards and regulatory requirements. This involves liaising with various internal & external teams, scheduling test activities, monitoring progress, and documenting outcomes to support robust audit and risk management processes

  • Identify gaps in processes and procedures and enhance those processes.
  • Identify opportunities for automation
  • Oversee and manage execution plans
  • Initiate inventory, infrastructure and application ready-for-business checks
  • Manage incidents and escalations related to the activities we perform.
V. SKILLS & EXPERIENCE WE’RE LOOKING FOR
A. Basic Qualifications [required skills & experience that are relevant to the performance of the position]
  • Bachelor's degree
  • Minimum 4-5 years of experience across the technology stack, including infrastructure and applications
  • Experience in managing resiliency testing for On-Prem Database, NAS, Object Storage, Block Storage, etc.
  • Understanding of disaster recovery procedures
  • Understanding of RTO, RPO and how these metrics are calculated
  • Knowledge of the differences between resiliency testing and cyber attack recovery/Repave testing
  • Background in cyber attack recovery
  • Background in disaster recovery
  • Strong analytical, communication, interpersonal, problem solving, organizational and time management skills
  • Basic understanding of Excel and the ability to manipulate data using basic Excel formulas
  • Self-motivated with an ability to work on one’s own with a strong sense of ownership and accountability
  • Highly organized, strong attention to detail and excellent follow-up skills
  • Strong process and project management skills with the ability to improve process efficiency and effectiveness
  • Strong written and verbal communication skills with an ability to summarize complicated technical information to people with less technical knowledge
  • Excellent influencing skills at all levels and the ability to develop and maintain good relationships with senior leadership, colleagues and clients
B. Preferred Qualifications [skills & experience used to identify the most qualified or ideal candidates]
  • 5-7 years of experience in disaster recovery and cyber attack recovery programs
  • Hands-on experience in managing resiliency testing for On-Prem Database, NAS, Object Storage, Block Storage, etc.
  • Hands-on expertise with Cloud platforms (AWS, Azure, GCP) and Kubernetes to support and manage DR activities
  • Key player in building a disaster recovery program, with extensive knowledge of RTO, RTA, RPO, RPA, MTD and other DR metrics
  • Has guided teams in building recovery test plans and understands what should be in disaster recovery plans
  • Solid understanding of core Data Center infrastructure (network appliances, storage technology, Unix/Linux/Windows, IP telephony, etc.) and the order of recovery in case of an incident
  • Strong understanding of the Excel formulas used for data manipulation
  • Project management skills with the ability to coordinate multiple Disaster Recovery tests and/or power-down events simultaneously
  • Understanding of one or more of the following Technology Risk domains: information security, business continuity, technology resilience, controls monitoring, risk assurance and risk governance
  • Prior experience in a System Administrator or Application Support role
  • Ability to perform analysis or troubleshooting when an issue arises and provide possible alternatives to help establish solutions and confirm remediation of the issue

Goldman Sachs Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Goldman Sachs and has not been reviewed or approved by Goldman Sachs.

  • Healthcare Strength: Coverage includes medical, dental, vision, disability, life and accident insurance, with multiple plan options and most premiums subsidized; coverage often starts on day one. Wellness resources, on-site health centers in some locations, and EAP access reinforce the depth of health support.
  • Parental & Family Support: Family care includes on-site childcare in some offices, expectant parent resources, and transitional programs for returning parents. Feedback suggests parental leave is very generous, with reports of around 20 weeks paid leave and stipends for adoption, surrogacy, and fertility-related services.
  • Retirement Support: The firm provides a 401(k) plan with employer matching contributions and broad financial education to help employees plan for retirement. Resources also support saving for education and preparing for unexpected events.

The Company
HQ: New York, NY
67,118 Employees

What We Do

At Goldman Sachs, we believe progress is everyone’s business. That’s why we commit our people, capital and ideas to help our clients, shareholders and the communities we serve to grow. Founded in 1869, Goldman Sachs is a leading global investment banking, securities and investment management firm. Headquartered in New York, we maintain offices in all major financial centers around the world. More about our company can be found at www.goldmansachs.com
