Abnormal AI is seeking an experienced Engineering Manager to lead our Aggregates Systems team, which owns some of the most mission-critical infrastructure in the company. These systems sit at the core of our detection architecture, transforming raw behavioral signals into high-quality aggregate datasets that directly power security decisions across all Abnormal products.
The Aggregates Systems operate at extreme scale—processing tens of billions of events per day and approaching one million events per second at peak—and correctness, availability, and timeliness are non-negotiable. In this role, you will manage and grow a team of engineers responsible for building, operating, and evolving large-scale batch and real-time aggregation systems that Abnormal’s detection, messaging, and customer trust depend on. You will partner closely with engineering, data science, product, and infrastructure teams to ensure these systems remain reliable, performant, and scalable as the company grows.
What You’ll DoTechnical & Systems Leadership- Own the technical direction, reliability, and long-term evolution of aggregation systems spanning both batch and real-time processing.
- Guide architectural decisions for distributed data processing, storage, and retrieval systems with strict correctness, latency, and availability requirements.
- Ensure aggregation systems consistently meet SLAs for data freshness, accuracy, and uptime across detection and messaging use cases.
- Act as a senior technical steward for systems whose failure or inaccuracy would have direct customer and security impact.
- Manage, mentor, and develop a team of software engineers working across batch and streaming aggregation systems.
- Foster a culture of technical excellence, operational ownership, and continuous improvement.
- Support career growth through coaching, feedback, and clear performance expectations.
- Partner with Detection Engineering, Data Science, Product, and Infrastructure teams to translate detection requirements into scalable aggregation capabilities.
- Collaborate with stakeholders to define roadmaps that balance feature delivery with system stability and long-term maintainability.
- Serve as a technical and organizational point of accountability for aggregation systems.
- Drive strong operational practices, including monitoring, alerting, incident response, and post-incident analysis.
- Improve observability, data quality validation, and resiliency across batch and real-time systems.
- Ensure systems scale efficiently as data volume, customer footprint, and use cases expand.
Aggregates System
- In Online systems, the team operates low-latency aggregation systems designed to process high-throughput event data with strict timeliness guarantees.
- Composed of event producers, a Kafka-based event stream, and a signals stack including ingestion and storage update services.
- Enable fast, flexible retrieval of aggregate values for security and messaging applications.
- In Offline systems, the team operates batch processing systems that transform large volumes of raw behavioral signals (e.g., emails and user activity) into comprehensive aggregate datasets.
- Operate on scheduled cadences and support long lookback windows for historical analysis and trend detection.
- Play a foundational role in determining whether messages are safe or malicious by powering detection logic and models.
- 5+ years of software engineering experience, with 2+ years in an engineering management role.
- Strong experience designing and operating distributed, data-intensive systems.
- Hands-on familiarity with batch processing and/or real-time streaming systems (e.g., Kafka, Spark, Flink, Beam).
- Proven ability to lead complex, high-impact technical initiatives and make sound architectural tradeoffs.
- Excellent communication skills and experience working cross-functionally.
- Experience building or owning mission-critical aggregation or signals systems.
- Familiarity with data correctness, quality guarantees, and observability in production environments.
- Experience in security, messaging, or other high-reliability SaaS domains.
- Track record of scaling systems and teams in fast-growing organizations.
#LI-ML1
At Abnormal AI, certain roles are eligible for a bonus, restricted stock units (RSUs), and benefits. Individual compensation packages are based on factors unique to each candidate, including their skills, experience, qualifications and other job-related reasons.
Abnormal AI is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability, protected veteran status or other characteristics protected by law. For our EEO policy statement please click here. If you would like more information on your EEO rights under the law, please click here.
Top Skills
What We Do
The Abnormal Security platform protects enterprises from targeted email attacks. Abnormal Behavior Technology (ABX) models the identity of both employees and external senders, profiles relationships and analyzes email content to stop attacks that lead to account takeover, financial damage and organizational mistrust. Though one-click, API-based Office 365 and G Suite integration, Abnormal sets up in minutes and does not disrupt email flow.
Abnormal Security was founded in 2018 by CEO Evan Reiser, CTO Sanjay Jeyakumar, Head of Machine Learning Jeshua Bratman, and Founding Engineers Abhijit Bagri and Dmitry Chechik. The team previously built behavioral profiling and machine learning technologies at Twitter, Google and Pinterest that are being applied to solve a problem that costs organizations $1 billion per year, according to the FBI. The Abnormal Security platform stops targeted phishing, business email compromise and account takeover attacks that have never been seen before.









