The Senior Incident & Automation Engineer serves as a critical bridge between the Technology Incident Optimization Program and the core Compute, Virtualization, Cloud Services, and Storage technology domains. This role demands deep technical expertise combined with strategic thinking to drive tactical incident reduction while architecting the future state of intelligent event management and automation.
You will be responsible for building automated incident remediation workflows and achieving measurable incident reduction within your domain through event optimization, correlation, and automation while ensuring comprehensive observability is maintained and enhanced. This position offers the unique opportunity to shape the future of enterprise event management.
Key Responsibilities- Incident & Alert Analysis: Conduct comprehensive analysis of alert and incident patterns to identify top sources of operational noise, determine root causes, and develop data-driven strategies for reduction.
- Intelligent Event Management: Design, implement, and optimize rules for event correlation, de-duplication, and suppression on AIOps and event management platforms. Develop domain-specific correlation logic leveraging configuration management data and infrastructure topology.
- Automation & Self-Healing: Architect and develop automation playbooks for incident data enrichment and create self-healing capabilities for common and recurring infrastructure incident scenarios.
- Observability Enhancement: Assess the current observability footprint across all infrastructure domains to identify gaps and propose enhancements that align with enterprise event management standards.
- Cross-Functional Collaboration: Partner closely with infrastructure operations, engineering, and platform teams to understand incident drivers, validate correlation logic, and provide expert guidance on event management best practices.
- Quality Assurance: Continuously validate the effectiveness of implemented rules and automation to ensure no business-impacting alerts are missed. Monitor and report on alert quality metrics and lead iterative improvements.
- Education: Bachelor's degree in Computer Science, Information Technology, Computer Engineering, or a related technical field.
- Experience: A minimum of 8+ years of hands-on experience in IT operations, infrastructure engineering, or system architecture within large-scale enterprise environments.
- Event Management & Incident Reduction: Proven experience and demonstrated success in leading event management and incident reduction initiatives with quantifiable results. Direct, hands-on experience with modern AIOps and event management platforms is required.
- Technical Expertise:
- Deep understanding of enterprise infrastructure including virtualization architectures, container orchestration, microservices, and various storage architectures (block, file, object).
- Expertise with a broad range of domain-specific monitoring tools for compute, virtualization, storage, and cloud platforms.
- Automation & Orchestration: Hands-on experience developing robust automation solutions using scripting languages and modern automation frameworks.
- Data Analysis: Proficiency in log analysis, pattern recognition, and using query languages for data analysis on log aggregation platforms.
- Problem-Solving & Analytical Skills: Excellent analytical abilities with a systematic approach to troubleshooting complex issues and a holistic view of technology systems.
- Communication & Leadership: Exceptional communication skills with the ability to influence and collaborate effectively across diverse, cross-functional teams and present technical concepts to various audiences.
- An advanced degree (Master's) in a relevant technical field.
- Relevant industry certifications (e.g., Cloud, Virtualization, Automation, ITIL).
- Experience with AIOps, machine learning for IT operations, and Site Reliability Engineering (SRE) practices.
- Knowledge of ITSM platforms, CMDB management, and infrastructure-as-code (IaC) principles.
- Familiarity with financial services regulatory requirements.
------------------------------------------------------
Job Family Group: Technology------------------------------------------------------
Job Family:Infrastructure------------------------------------------------------
Time Type:Full time------------------------------------------------------
Primary Location:Irving Texas United States------------------------------------------------------
Primary Location Full Time Salary Range:$125,760.00 - $188,640.00
In addition to salary, Citi’s offerings may also include, for eligible employees, discretionary and formulaic incentive and retention awards. Citi offers competitive employee benefits, including: medical, dental & vision coverage; 401(k); life, accident, and disability insurance; and wellness programs. Citi also offers paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays. For additional information regarding Citi employee benefits, please visit citibenefits.com. Available offerings may vary by jurisdiction, job level, and date of hire.
------------------------------------------------------
Most Relevant Skills Please see the requirements listed above.------------------------------------------------------
Other Relevant Skills For complementary skills, please see above and/or contact the recruiter.------------------------------------------------------
Anticipated Posting Close Date:May 08, 2026------------------------------------------------------
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.
Skills Required
- Bachelor's degree in Computer Science, Information Technology, Computer Engineering, or a related technical field
- A minimum of 8+ years of hands-on experience in IT operations, infrastructure engineering, or system architecture within large-scale enterprise environments
- Proven experience in event management and incident reduction initiatives with quantifiable results
- Deep understanding of enterprise infrastructure including virtualization architectures, container orchestration, and various storage architectures
- Hands-on experience developing automation solutions using scripting languages and modern automation frameworks
- Proficiency in log analysis, pattern recognition, and query languages for data analysis
Citi Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Citi and has not been reviewed or approved by Citi.
-
Healthcare Strength — Benefits coverage is positioned as comprehensive, including health, dental, and vision insurance plus on-site clinics, prescription drug support, and disability coverage. Family-building support such as fertility assistance is described as a notable differentiator within the overall package.
-
Retirement Support — Retirement benefits are framed as strong, highlighted by a 401(k) with matching and additional plan options like a Roth 401(k). Financial support is reinforced through discounts and broader financial guidance resources tied to the benefits ecosystem.
-
Wellbeing & Lifestyle Benefits — Wellbeing support extends beyond insurance through programs like an Employee Assistance Program, counseling/legal resources, and gym or wellness reimbursement. These offerings increase the perceived total rewards value even when cash compensation sentiment varies by role.
Citi Insights
What We Do
Citi's mission is to serve as a trusted partner to our clients by responsibly providing financial services that enable growth and economic progress. Our core activities are safeguarding assets, lending money, making payments and accessing the capital markets on behalf of our clients. We have 200 years of experience helping our clients meet the world's toughest challenges and embrace its greatest opportunities. We are Citi, the global bank – an institution connecting millions of people across hundreds of countries and cities.








