Senior Data Engineer

Posted 2 Days Ago
Warsaw, Warszawa, Mazowieckie, POL
Hybrid
Senior level
Artificial Intelligence • HR Tech • Software • Automation
The Role
Design and build scalable, multi-tenant data infrastructure and ETL pipelines, implement real-time ingestion and OLAP optimizations, enforce RLS/RBAC and session-based access controls, ensure data quality and compliance, and integrate analytical databases with embedded BI for enterprise-scale analytics.
Job Description

We are seeking a Senior Data Engineer to architect and build the foundation of our next-generation multi-tenant influencer marketing analytics platform. In this role, you’ll play a key part in shaping scalable, reliable data infrastructure that gives our enterprise partners accessible, meaningful analytics. You’ll work with modern analytical databases and BI platforms to help deliver clear, real-time insights at significant scale.

Location: Poland (Hybrid / Warsaw)

Working hours: 12 pm - 9 pm CET


Key Responsibilities

Build Scalable Data Infrastructure

  • Design and implement ETL pipelines migrating from transactional databases to analytical data warehouses

  • Create real-time data ingestion systems processing campaign data, user metrics, and business intelligence

  • Build multi-tenant data models with proper partitioning strategies for enterprise-scale clients

  • Develop data quality frameworks with comprehensive validation, monitoring, and alerting

Architect Multi-Tenant Security

  • Implement Row-Level Security (RLS) and Role-Based Access Control (RBAC) in analytical databases

  • Design dynamic permission models supporting organization-level and division-level data access

  • Build session-based context management for secure multi-tenant queries

  • Create comprehensive audit trails and access logging for compliance requirements

Optimize Performance at Scale

  • Design database schemas with advanced partitioning and indexing strategies

  • Build materialized views and aggregated tables for real-time analytics

  • Implement query optimization, data skipping, and compression techniques

  • Handle high-concurrency embedded dashboard usage with sub-second query performance

Integrate Analytics Platform

  • Build dashboard data sources with optimized SQL transformations

  • Handle complex data structures and parsing requirements

  • Create flat, denormalized tables optimized for embedded analytics consumption

  • Implement custom field handling for tenant-specific metadata requirements

Required Qualifications
  • 5+ years of data engineering experience with production-scale systems

  • Expert-level SQL skills with analytical databases (columnar databases preferred)

  • Strong Python programming with data libraries: pandas, NumPy, PyArrow

  • Experience with ETL orchestration tools: Apache Airflow, Prefect, dbt, or similar

  • Deep understanding of analytical databases, partitioning strategies, and OLAP optimization

  • Experience building SaaS data platforms with tenant isolation requirements

  • Knowledge of Row-Level Security (RLS) implementation in analytical databases

  • Understanding of RBAC patterns and session-based access control

  • Experience with authentication flows in data systems

  • Familiarity with compliance requirements (SOC 2, GDPR) for multi-tenant data

Nice-to-Have Experience

Database Experience

  • Experience with columnar databases: ClickHouse, BigQuery, Redshift, Snowflake

  • Knowledge of streaming data platforms: Apache Kafka, Pulsar, Kinesis

  • Understanding of distributed systems and database replication

Analytics & BI Experience

  • Experience with embedded analytics platforms and dashboard frameworks

  • Knowledge of semantic modeling and metrics layers

  • Familiarity with data visualization and self-service analytics

Advanced Technical Skills

  • Experience with real-time data processing and event streaming

  • Knowledge of data mesh or data fabric architectures

  • Understanding of machine learning pipeline integration

  • Experience with data lineage and data catalog tools

Required Skills
  • Advanced SQL

  • Analytical Database Design

  • ETL Pipeline Development

  • Real Time Data Ingestion

  • Python Data Processing

  • Pandas

  • NumPy

  • PyArrow

  • Airflow Orchestration

  • dbt

  • OLAP Performance Tuning

  • Partitioning Strategy Design

  • Multi Tenant Data Modeling

  • Row Level Security

  • RBAC Access Control

  • Session Based Access Control

  • Data Quality Monitoring

  • Data Validation Frameworks

  • Audit Logging

  • Authentication Flows

  • SOC 2 Compliance

  • GDPR Compliance

Benefits and Perks

Employment contract (umowa o pracę)

Languages
  • English: fluent

Job Overview

Location: Warsaw

Seniority Level: Senior

Salary Range: Not specified


The Company
9 Employees
Year Founded: 2025

What We Do

CareerSwift is an AI-powered recruitment and job search automation platform designed for both professionals and growing companies. It streamlines the hiring process through AI-driven resume optimization, intelligent job discovery, and automated candidate screening and interviewing. By utilizing context-aware AI agents, CareerSwift helps job seekers land roles faster and enables companies to identify high-quality talent more efficiently.
