Associate Director Application Support Engineering

Posted 12 Days Ago
Chennai, Tamil Nadu, IND
In-Office
Senior level
Financial Services
The Role
The Associate Director of Application Support Engineering leads reliability and performance of systems, implements monitoring solutions, manages incidents, and collaborates on automation with development teams.

Are you ready to make an impact at DTCC? 
Do you want to work on innovative projects, collaborate with a dynamic and supportive team, and receive investment in your professional development? At DTCC, we are at the forefront of innovation in the financial markets. We are committed to helping our employees grow and succeed. We believe that you have the skills and drive to make a real impact. We foster a thriving internal community and are committed to creating a workplace that looks like the world that we serve. 
The Information Technology group delivers secure, reliable technology solutions that enable DTCC to be the trusted infrastructure of the global capital markets. The team delivers high-quality information through activities that include developing essential infrastructure capabilities to meet client needs and implementing data standards and governance.

Pay and Benefits:

  • Competitive compensation, including base pay and annual incentive
  • Comprehensive health and life insurance and well-being benefits, based on location
  • Pension / Retirement benefits
  • Paid Time Off and Personal/Family Care, and other leaves of absence when needed to support your physical, financial, and emotional well-being.
  • DTCC offers a flexible/hybrid model of 3 days onsite and 2 days remote (onsite Tuesdays, Wednesdays and a third day unique to each team or employee).

The Impact you will have in this role:

The Enterprise Application Support (EAS) team is responsible for providing technical application support for the ITP and ECS lines of business. Within EAS, the Associate Director Application Support Engineering / SRE Lead (Site Reliability Engineer Lead) is a senior technical role responsible for driving the overall reliability, scalability, and performance of critical systems. The role does this by implementing standard methodologies, participating in incident response, automating processes, and collaborating with development teams to ensure system stability and uptime across the organization, often acting as a technical partner in promoting a strong SRE culture within the company. Key responsibilities include designing monitoring systems, capacity planning, and actively identifying and mitigating potential issues before they impact users.

The SRE team works closely with development teams, infrastructure and network partners, security partners, Scrum Masters, and internal / external clients to improve observability, operational supportability, resiliency, and mean time to restore service through driving improvements to support capabilities.

Your Primary Responsibilities:

  • Scrum Participation: Join all project collaborators' planning and design sessions, sprint zero, and stand-ups for all new delivery, to champion SRE requirements that reflect strong observability and resiliency traits.
  • System Reliability Architecture: Drive the design and help implement reliable, resilient, and scalable systems, considering redundancy, fault tolerance, and disaster recovery strategies. Make design recommendations that allow the application to recover without cleanup activities, or create a recovery runbook for the application support team to follow for improved application recovery times.
  • Monitoring and Alerting: Develop comprehensive monitoring systems to identify potential issues proactively, define actionable alerts, and establish SLIs (Service Level Indicators) and SLOs (Service Level Objectives).
  • Incident Management: Lead incident response during critical system outages, facilitate timely problem diagnosis and resolution, and conduct post-mortem analysis to identify root causes and prevent future occurrences.
  • Automation and Tooling: Develop and maintain automation scripts to streamline operational tasks, including self-healing, application deployments, scaling, and infrastructure management.
  • Collaboration with Development Teams: Work closely with development teams to integrate SRE practices into the software development lifecycle, promoting code quality, reliability, and observability.
  • Security Integration: Collaborate with security teams to ensure system resilience against cyber threats, implementing security best practices and monitoring for vulnerabilities.
  • Technical Expertise: Stay updated on emerging technologies and industry trends related to cloud computing, distributed systems, and reliability engineering.
  • Operational Readiness: Attend and present operational readiness with application support (EAS L2) at each project management meeting, raising any operational risks and concerns. Test SRE requirements in UAT environments to validate effectiveness and completeness of operational capabilities.
  • Risk Management: Partner with IT Embedded Risk Managers to identify strategic solutions for risk incidents.
  • Metrics and Reporting: Demonstrate operational improvements through defined KPIs.
  • Capacity Planning: Proactively assess system capacity needs, plan for future growth, and implement scaling strategies to ensure optimal performance under high load.
  • Performance Optimization: Analyze system performance metrics to identify bottlenecks and implement optimization strategies to improve system responsiveness and efficiency.

Qualifications:

  • Minimum of 8 years of related experience
  • Bachelor's degree preferred or equivalent experience

Talents Needed for Success:

  • Strong Programming Skills: Proficiency in one or more programming languages such as Python, Java, or Go, including the use of AI technology (Amazon Q, Kiro), for automation and for developing monitoring and SRE compliance validation tools.
  • System Administration: Expertise in Linux/Unix operating systems, network administration, and cloud platforms (AWS, Azure, GCP). Mainframe experience is a plus.
  • Monitoring and Observability: Deep understanding of monitoring tools (Splunk, Grafana, Dynatrace, ITSI, etc.) and experience in designing robust monitoring systems.
  • Incident Management: Proven track record of participating in incident response teams under pressure and effectively solving complex issues.
  • Experience in the financial services industry is good to have.

Actual salary is determined based on the role, location, individual experience, skills, and other considerations. We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

The Company
HQ: New York, NY
5,075 Employees
Year Founded: 1973

What We Do

With over 45 years of experience, DTCC is the premier post-trade market infrastructure for the global financial services industry. From 21 locations around the world, DTCC, through its subsidiaries, automates, centralizes and standardizes the processing of financial transactions, mitigating risk, increasing transparency and driving efficiency for thousands of broker/dealers, custodian banks and asset managers. Industry owned and governed, the firm simplifies the complexities of clearing, settlement, asset servicing, data management, data reporting and information services across asset classes, bringing increased security and soundness to financial markets. In 2021, DTCC’s subsidiaries processed securities transactions valued at nearly U.S. $2.4 quadrillion. Its depository provides custody and asset servicing for securities issues from 177 countries and territories valued at U.S. $87.1 trillion. DTCC’s Global Trade Repository service, through locally registered, licensed, or approved trade repositories, processes 16 billion messages annually. To learn more, please visit us at www.dtcc.com.
