Data Engineer 1

Posted Yesterday
Hiring Remotely in United States
Remote
108K-150K Annually
Mid level
Security
The Role
Operate and own real-time, petabyte-scale streaming pipelines (Pub/Sub/Kafka, Dataflow/Beam), integrate LLMs/Vertex AI enrichment, build monitoring/alerting, respond to incidents, ensure data integrity, and collaborate with Product and Analysts to improve observability and reliability.
Summary Generated by Built In

Flashpoint is the pioneering leader in threat data and intelligence. We empower commercial enterprises and government agencies to decisively confront complex security challenges, reduce risk, and improve operational resilience amid fast-evolving threats. Through the Flashpoint Ignite platform, we deliver unparalleled depth, breadth and speed of data from highly relevant sources, enriched by human insights. Our solutions span cyber threat intelligence, vulnerability intelligence, geopolitical risk, physical security, fraud and brand protection. The result: our customers safeguard critical assets, avoid financial loss, and protect lives. Discover more at flashpoint.io

Are you a data engineer who actually enjoys being the person who keeps critical systems alive? Flashpoint is looking for a Data Engineer I to own the real-time data infrastructure behind our intelligence platform: the Pub/Sub topics, Dataflow pipelines, and AI enrichment that let our customers spot threats the moment they emerge. This is a depth role, not a learning role. You're joining a small, senior team where the architecture is already solid and proven; your job is to be its operational heartbeat, keeping petabyte-scale pipelines flowing 24/7/365, troubleshooting under pressure, and continuously hardening the systems our customers depend on.

We have a role for you if:

  • You've operated production streaming pipelines at real scale (Pub/Sub, Kafka, or similar), and you know how message queues actually behave when traffic spikes and consumers fall behind.

  • You've debugged and optimized GCP Dataflow (or comparable Beam-based) jobs in production, not just stood them up.

  • You've been first responder for systems with no tolerance for downtime, and you've owned the incident from page to postmortem.

  • You've built the monitoring and alerting that catches failures before customers do, using tools like Prometheus, Grafana, or Stackdriver.

  • You've integrated LLMs into a data pipeline (Vertex AI and Gemini, or strong transferable work with similar platforms) and understand prompt engineering in a data context.

  • You've worked with terabyte-to-petabyte datasets and kept systems responsive while filtering high-volume data.

What you will get to do on our team:

  • Own end-to-end operations of real-time pipelines that ingest, enrich, filter, and route data through Vertex AI and Gemini for risk assessment.

  • Own our Pub/Sub infrastructure end to end: message delivery, consumer groups, and the production incident response when something goes sideways.

  • Scale, monitor, and optimize Dataflow streaming and batch jobs processing petabytes of data, diagnosing failures and shipping the fix.

  • Build and maintain the monitoring, alerting, and incident-response tooling for systems that can't go down.

  • Safeguard data integrity end to end, ensuring data reaches customers accurately and on time.

  • Partner with Product, the broader Data team, and Intelligence Analysts to turn requirements into operational reality.

What you will achieve:

Within 30 days:

  • Onboarded into the GCP environment with full access to pipelines, dashboards, and runbooks; shadowed the on-call rotation.

  • Mapped the end-to-end data flow (Pub/Sub through Dataflow through Vertex AI enrichment) and can explain where it's fragile.

  • Resolved your first production alert with team support.

Within 60 days:

  • Carrying on-call independently and resolving common incidents without escalation.

  • Shipped at least one observability or reliability improvement (new alert, dashboard, or runbook) to the existing stack.

  • Identified and fixed a recurring pipeline pain point.

By 90 days:

  • Operating as the primary owner of the streaming infrastructure, trusted to run it autonomously.

  • Reduced false-positive alerts and/or measurably improved a key SLA (latency, uptime, or throughput).

  • Documented decisions and hardened systems so the next incident is easier for everyone.

  • Acting as the go-to person Product and Analysts come to with pipeline questions.

To be successful in this role, you will need:

  • Hands-on production experience building or operating streaming data pipelines (Pub/Sub, Kafka, or similar).

  • Demonstrated ability to debug and optimize GCP Dataflow (or comparable Beam-based stream processing).

  • A reliability mindset: fluency with SLAs, observability, on-call, and incident response, and the autonomy to dive into logs, metrics, and traces unaided.

  • Proficiency with Python, SQL, and Linux.

  • Real experience integrating Vertex AI / Gemini, or strong transferable experience with similar LLM platforms, into data workflows.

Nice to have (not required): BigQuery, BigTable, Cloud Storage, Cloud Functions; Terraform; data quality / validation frameworks; threat intelligence or security data workflows; experience on small, senior teams.

Base Pay Range: $107,500 - $150,000/yr. base + target bonus

Why Flashpoint is a Great Place to Work:

  • Diversity.  Flashpoint is committed to fostering, cultivating and preserving a culture of diversity, inclusion, belonging, and equity. We recognize that diversity is key to achieving our vision. We believe that every person and their experiences contribute to building a work environment and products and services that will change the world.

  • Culture and Belonging.  Our company’s culture isn’t something you join; it’s something you build and shape, and each person's unique background and experiences contribute to who Flashpoint is and will become.  You will have ample opportunities to connect with coworkers through various communication channels and company-funded virtual events: book clubs, happy hours, committees, DIBE discussion group, Donut mixers, local team member meetups and much more.

  • Perks. Flashpoint understands that personal wellness is one of the keys to a happy, healthy and productive work environment.  That’s why we also prioritize health and wellness perks like gym reimbursements, expensed lunches, cool cultural initiatives and inclusive employee events.

  • Career Growth. Flashpoint is invested in the growth of our team members and understands that frequent, two-way feedback is critical to that growth. We encourage regular one-on-ones with your manager, a regular schedule of performance reviews, learning and development opportunities, and guidance through formalized career paths, whether that be towards being a great manager, being a great individual contributor, or a lateral move to gain breadth of knowledge and experience.

Are you unsure if this role suits you or not? Unsure about the timing? Interested in future opportunities? Stay connected by joining our Talent Network. By doing so, you'll stay updated with Flashpoint news and upcoming career opportunities. Even if you're not ready to apply now, being part of our Talent Network ensures you won't miss out on exciting opportunities in the future.

Skills Required

  • Operate production streaming pipelines (Pub/Sub, Kafka, or similar)
  • Debug and optimize GCP Dataflow or comparable Beam-based jobs in production
  • Hands-on experience integrating LLMs into data pipelines (Vertex AI/Gemini or strong transferable experience)
  • Proven reliability mindset with SLA-driven observability, on-call, incident response, and postmortems
  • Build and maintain monitoring/alerting (Prometheus, Grafana, Stackdriver or similar)
  • Proficiency with Python
  • Proficiency with SQL
  • Proficiency with Linux and debugging in Linux environments
  • Experience working with terabyte-to-petabyte datasets and keeping systems responsive under load
  • Experience with BigQuery, BigTable, Cloud Storage, Cloud Functions
  • Experience with Terraform
  • Familiarity with data quality/validation frameworks
  • Domain knowledge in threat intelligence or security data workflows
  • Experience working on small, senior teams

Flashpoint Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Flashpoint and has not been reviewed or approved by Flashpoint.

  • Fair & Transparent Compensation. Pay is characterized as decent to good overall, with common role benchmarks clustering around low–mid six figures for tech/cyber positions. Sales compensation is portrayed as comparatively strong in on-target earning potential for certain roles.
  • Healthcare Strength. Health coverage is framed as a modern, comprehensive offering, with signals that medical insurance quality is a standout part of the package. The broader health and wellness suite is repeatedly positioned as a core strength.
  • Retirement Support. A retirement plan is consistently included as part of the total rewards package, and 401(k) matching is referenced as a valued component. Retirement support appears to contribute meaningfully to perceived overall benefits strength.

The Company
HQ: New York, NY
312 Employees
Year Founded: 2010

What We Do

Flashpoint delivers actionable intelligence that empowers organizations of all sizes to take rapid, decisive actions to protect against threats. The company's technology, advanced data collections, and human-powered analysis uniquely enable teams to mitigate threats related to cybersecurity, fraud, insider threats, corporate and physical security, executive protection, and third-party risk. For more information, visit https://www.flashpoint-intel.com/ or follow us on Twitter at @FlashpointIntel.
