Market Data Engineer

Posted 5 Hours Ago
Be an Early Applicant
Hiring Remotely in Barcelona, Cataluña, ESP
In-Office or Remote
Senior level
Fintech • Software • Financial Services
The Role
Design, build, and maintain the Market Data Platform, managing data ingestion, storage, tooling, and reliability while collaborating with diverse teams.
Summary Generated by Built In
Company Description

BHFT is a proprietary algorithmic trading firm. Our team manages the full trading cycle, from software development to creating and coding strategies and algorithms.
Our trading operations cover key exchanges. The firm trades across a broad range of asset classes, including equities, equity derivatives, options, commodity futures, rates futures, etc. We employ a diverse and growing array of algorithmic trading strategies, utilizing both High- and Medium-Frequency Trading approaches.

We’re a team of 200+ professionals, with a strong emphasis on technology—70% are technical specialists in development, infrastructure, testing, and analytics spheres. The remaining part of the team supports our business operations, such as Risks, Compliance, Legal, Operations and more.

With a strong focus on innovation and performance, BHFT is actively expanding its presence in traditional financial markets. We value a results-driven culture, emphasizing collaboration, transparency, and constant improvement, all while offering the flexibility of remote work and a globally distributed team.

Job Description

The Data Engineering team is responsible for designing, building, and maintaining the Market Data Platform — a lakehouse infrastructure spanning the full path from raw exchange feeds to reliable, petabyte-scale data for research, backtesting, and real-time trading.

Key Responsibilities

  • Capture & Ingestion. Own the full capture path from wire to lake: decode and normalize raw exchange feeds (pcap, multicast UDP / ITCH / FIX) and vendor sources (OneTick, Refinitiv, Bloomberg, ICE) into a unified canonical model with nanosecond timestamps. Build batch + stream pipelines (Airflow, Spark, dbt) for tick and reference data. Own L2/L3 order-book reconstruction with gap handling. Provide Python and Rust producer SDKs for internal feed handlers.
  • Storage & Modeling — Apache Iceberg. Own the Iceberg-over-S3 lakehouse: design partitioning, sort orders, and row-group layout for fast scans; manage schema evolution, snapshots, time travel, compaction, and TTL. Maintain reference data as slowly-changing tables with point-in-time correctness for backtests. Drive storage cost optimisation via compaction, tiering, and snapshot expiry.
  • Tooling & Libraries. Build libraries for schema management, data contracts, validation, and lineage on top of the Iceberg catalog. Develop shared access services (Spark + Polars) so Research, backtesting, and trading share one normalized data layer, including gap detection and pcap-vs-lake reconciliation.
  • Reliability & Observability. Embed monitoring, alerting, SLAs/SLOs, and CI/CD across capture and pipeline layers on Kubernetes (EKS). Own data-quality dashboards and incident runbooks for the capture fleet.
  • Collaboration. Partner with Quant Research, Data Science, Backend, and DevOps to translate requirements into platform capabilities and champion market-data engineering best practices.

Qualifications

  • 5+ years building production-grade data systems, with proven expertise architecting and launching data lakes / lakehouses from scratch.
  • Hands-on experience with Apache Iceberg (or comparable table formats — Delta / Hudi): partitioning, schema evolution, snapshots, compaction, and catalog operations; familiarity with Apache Arrow for zero-copy, columnar in-memory interchange.
  • Experience with market data and/or network packet capture — decoding pcap, exchange feed protocols (ITCH, FIX/FAST, multicast UDP), order-book reconstruction, and time-series at scale (strong plus; willingness to learn required).
  • Experience normalizing market data from multiple vendors — e.g. OneTick, Refinitiv/Reuters, Bloomberg, ICE — into a unified schema and symbology (strong plus).
  • Expert-level Python (incl. Polars and/or PySpark); Rust a strong plus (relevant for high-performance capture/decoding).
  • Modern orchestration (Airflow) and distributed processing (Apache Spark).
  • Advanced SQL: complex aggregations, window functions, query optimization, partition pruning.
  • Solid fundamentals in Linux, containerization (Docker, Kubernetes / EKS), and cloud object storage (AWS S3).
  • DevOps & observability: CI/CD, infrastructure-as-code (Terraform), GitOps (ArgoCD), and metrics/dashboards/alerting (Grafana, Prometheus).
  • Strong grasp of structured + unstructured / binary data, and storage optimization — partitioning, compression, cost management.
  • English fluency for documentation and collaboration in an international team.

Additional Information

We Offer

  • Work in a modern IT company — no bureaucracy or legacy systems.
  • Real opportunities for professional growth and to make your mark.
  • Fully remote work from anywhere in the world, on a flexible schedule.
  • Compensation for health insurance, sports, professional development, and more.

Skills Required

  • 5+ years building production-grade data systems
  • Experience architecting and launching data lakes
  • Hands-on experience with Apache Iceberg or comparable table formats
  • Expert-level Python
  • DevOps & observability experience
  • Strong grasp of structured and unstructured data
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Dubai
58 Employees

What We Do

In today's complex financial markets, effective strategies are essential for success. At BHFT, we excel in creating these strategies, distinguishing ourselves in the automated trading platform arena with our unique approach. Our advanced fintech solutions are the product of extensive research and development by a team with more than 25 years of trading experience. With the help of 100+ professionals engaged we succeed in balancing high alpha and low-risk arbitrage. We emphasize the importance of a holistic approach in achieving undeniable results in trading, integrating it into every aspect from business structure to hardware. This comprehensive strategy enables us to adeptly handle any market condition. Beyond automated trading strategies, we focus on in-depth research into market microstructure and liquidity dynamics, quickly incorporating these insights into our trading logic. Our industry-standard backtesting infrastructure provides eliminating errors. Hence, combined with our bespoke high-frequency trading (HFT) strategies, our trading platform offers unmatched flexibility, speed, and performance.

Similar Jobs

Zscaler Logo Zscaler

Senior Sales Engineer

Cloud • Information Technology • Security • Software • Cybersecurity
Easy Apply
Remote or Hybrid
Spain
8697 Employees
Remote
26 Locations
393 Employees
179K-179K Annually

Dynatrace Logo Dynatrace

Sr. Customer Success Engineer (French speaking)

Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
Remote or Hybrid
Barcelona, Cataluña, ESP
5600 Employees

Dynatrace Logo Dynatrace

Partner Marketing Executive - EMEA Central (German Speaking)

Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
Remote or Hybrid
Barcelona, Cataluña, ESP
5600 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account