Staff / Principal Data Engineer

Posted 2 Days Ago
New York, NY, USA
In-Office
180K-270K Annually
Expert/Leader
Information Technology • Software • Cybersecurity
The Role
Own and operate the unified data platform and data lake powering fraud detection. Design, build, and run low-latency batch and streaming ingestion pipelines, ensure data quality/lineage/observability, support embedding/vector pipelines for generative AI, and manage deployment and CI/CD on Kubernetes while partnering with data science and product teams.

About the Role

We are building an AI-native data platform that powers fraud detection and response across 360 Fraud Protection. We are hiring a Staff or Principal Data Engineer to own the data platform and data lake at the heart of that work. You will work hands-on and own the domain end-to-end, alongside a small group of senior engineers, data scientists and product partners.

  • You own the unified data platform and data lake that power detection and response across 360 Fraud Protection.
  • Every detection model and downstream AI capability depends on this data foundation, which makes it one of the highest-leverage engineering roles on the team.
  • Stronger, broader and more reliable fraud signal directly improves detection accuracy, reduces customer losses and protects brand trust.

Key Responsibilities

  • Own the design, build and operation of the data lake and ingestion platform end-to-end, from architecture through production reliability.
  • Build low-latency batch and streaming pipelines that ingest signals from internal and external sources, normalize them to a common schema, enrich them with context and serve model-ready data to the layers above.
  • Make adding a new data source a routine task rather than a project, so our view of risk keeps widening over time.
  • Establish data quality, freshness, completeness, lineage and observability so the platform is trustworthy enough to automate on top of.
  • Build data pipelines that ground generative AI, including unstructured text and threat intelligence processing, embedding generation, vector storage and retrieval.
  • Own deployment, CI/CD and operational reliability of the platform on Kubernetes.
  • Partner with data science, product and architecture to turn the platform into a shared foundation across 360 Fraud Protection.


Required Qualifications

  • Extensive experience building and operating large-scale data platforms and data lakes, with comfort working at high data volumes.
  • Deep, hands-on expertise with Apache Spark, Apache Flink and modern big-data systems.
  • Proven command of best practices for building and maintaining data pipelines in both batch and streaming modes.
  • Strong production engineering skills across the full delivery lifecycle, including Kubernetes and CI/CD tooling, with the ability to ship end-to-end.
  • A track record of owning data infrastructure end-to-end with limited supervision.

Preferred Qualifications

  • Experience with generative AI and embedding models, including embedding pipelines, vector databases and retrieval.
  • A cybersecurity or threat intelligence background, with hands-on exposure to threat types such as phishing, mobile threats and malware.
  • Familiarity with transaction data and transaction fraud signals.

Compensation

  • Base salary range: $180,000 – $270,000 annually
  • Bonus / commission: 15%

Travel

  • Minimal travel expected. This is an on-site role based in New York City, with 3–4 days per week in the office.

AppGate is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, age or any other federally protected class. In furtherance of AppGate's policy regarding affirmative action and equal employment opportunity, AppGate has developed a written affirmative action program. This program is available for review upon request by any applicant or employee during normal business hours by contacting the company's EEO Coordinator.



The Company
HQ: Coral Gables, Florida
374 Employees

What We Do

AppGate secures and protects an organization's most valuable assets with its high-performance Zero Trust Network Access (ZTNA) solution and Cyber Advisory Services. Appgate is the only direct-routed ZTNA solution built for peak performance, superior protection and seamless interoperability. AppGate Cyber Advisory Services harden your security posture and ensure business continuity. Appgate safeguards enterprises and government agencies worldwide. Learn more at appgate.com.
