Main Responsibilities
- Design, build, and manage scalable data pipelines using Python, SQL, and PySpark.
- Develop and maintain lakehouse architectures, with hands-on use of Apache Hudi for data versioning, upserts, and compaction.
- Implement efficient ETL/ELT processes for both batch and real-time data ingestion.
- Optimize data storage and query performance across large datasets (partitioning, indexing, compaction).
- Ensure data quality, governance, and lineage, integrating validation and monitoring into pipelines.
- Work with cloud-native services (preferably AWS – S3, Athena, EMR) to support modern data workflows.
- Collaborate closely with data scientists, analysts, and platform engineers to deliver reliable data infrastructure.
Core Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related field.
- 2+ years of experience as a Data Engineer, working with large-scale distributed systems.
- Proven expertise in lakehouse architecture and Apache Hudi in production environments.
- Experience with Airflow, Kafka, or streaming data pipelines.
- Strong programming skills in Python and PySpark.
- Comfortable working in a cloud-based environment (preferably AWS).
- Strong communication and collaboration skills.
- Knowledge of CI/CD and Infrastructure as Code (IaC) tools such as Terraform.
- Exposure to data cataloging tools (e.g., Glue Data Catalog, Amundsen).
- Interest or experience in cybersecurity or secure data design.
Salary Range
- Gross salary: 3400-7100 EUR/month.
What We Do
Nord Security is one of the world’s leading providers of digital security and privacy solutions for businesses and individuals. We are a home for advanced security solutions that share the Nord brand and values. Today, our products are used by millions of customers worldwide and praised by all the major cybersecurity experts and top media outlets.
Since 2012, we have been creating and building award-winning products:
- NordVPN - the fastest VPN on the planet, built to protect your online traffic and privacy with next-generation encryption.
- NordLayer - an adaptive network access security solution for modern businesses, helping organizations to fulfill scaling and integration challenges.
- NordPass - a password manager designed with the user in mind, from simplicity to security. Built using zero-knowledge encryption.
- NordLocker - a powerful end-to-end encryption tool for safely storing and sharing files. Comes with secure cloud storage.
Our community of cybersecurity experts, software developers, engineers, data analysts, and other tech professionals share one common goal – create a safe cyber future for everyone.
Explore our open positions here: https://nordsecurity.com/careers
Or refer a friend or colleague: https://nordsecurity.com/referrals
Learn about our Privacy notice for recruitment candidates here: https://bit.ly/3mJFoAy