Appsilon

Data Engineer

Hiring Remotely in Warszawa, Mazowieckie, POL

In-Office or Remote

23K-23K Annually

Mid level

Artificial Intelligence • Information Technology • Machine Learning • Pharmaceutical

The Role

Design, build, and maintain scalable data pipelines; integrate diverse data sources into warehouses or lakes; collaborate with data scientists and engineers to ensure quality and availability; optimize schemas, models, and performance; implement governance, security, and compliance; monitor reliability and document systems.

Why do we need you?

At Appsilon, we empower global organizations to make smarter decisions with data. Our solutions help Fortune 500 companies discover new drugs, save lives, optimize operations, and unlock millions in value. To do this, we rely on robust, scalable, beautifully engineered data systems.

We're looking for a Data Engineer who can elevate how our clients collect, process, and leverage massive datasets — someone who loves building modern data pipelines and wants their work to power meaningful, real-world impact.

Your responsibilities:

Design, build, and maintain scalable data pipelines across diverse environments.
Integrate data from multiple internal and external sources into data warehouses or data lakes.
Collaborate closely with Data Scientists, ML Engineers, and Developers to ensure data quality, structure, and availability.
Monitor and improve data integrity, performance, and reliability.
Build and optimize database schemas, data models, and documentation.
Implement data governance, security best practices, and compliance standards.

We’re looking for somebody with:

Backend Python Development

Strong experience building scalable backend systems in Python.
Comfortable with modern language features (type hints, decorators, generators).
Able to design clean, maintainable APIs using FastAPI, Django REST Framework, or Flask.
Good understanding of performance optimization and Python internals.
A collaborative mindset — you enjoy working closely with cross-functional teams.

Data Engineering

Hands-on experience designing and operating ETL/ELT pipelines.
Solid SQL skills and ability to model, optimize, and maintain database structures.
Experience integrating data from multiple sources (databases, APIs, streaming).
Familiarity with large-scale data processing tools or distributed systems.

Nice to have:

Experience with cloud platforms (AWS/Azure/GCP).
Knowledge of R.
Experience with Docker, Kubernetes, and CI/CD tools (GitHub Actions, GitLab CI).
Understanding of data governance, metadata management, and security.
Experience in life sciences, biotech, genomics, or enterprise data environments.
Prior remote work experience with international teams.

Life science skills:

Molecular Biology & Bioinformatics: Leverages molecular biology and bioinformatics to analyze data and communicate biological insights.
Clinical Trials - Data Tools & Flow: Builds and analyzes clinical trial data pipelines, ensuring auditability and delivering insights through collaboration and visualization tools.
CDISC & Clinical Data Standards: Applies and designs clinical data structures using CDISC standards, ensuring compliance and supporting best practices across teams.
Nextflow: Develops scalable, reproducible bioinformatics pipelines with Nextflow across local, HPC, and cloud environments.

What we offer:

Competitive B2B compensation with clear salary ranges (up to 23,000 PLN net B2B).
Modern equipment (MacBook / ThinkPad + Linux environment).
Work on high-impact, cutting-edge projects in biotech, pharma, research, and enterprise analytics.
Budget for professional development (certifications, courses, conferences).
Opportunity to collaborate with industry experts on innovative data products.
A supportive, ambitious, and friendly team that cares about excellence.
Fully remote work.

Important note: To complete the hiring process, you need to have a valid government-issued ID (for Polish citizens) or a valid passport (for non-Polish citizens).

What you can expect during the process:

Intro call with our People Team.
Technical task.
Technical interview with the Engineering Team.
Final interview with Head of Technology + offer.

Does this sound like a great opportunity for you? Use the Apply button below!

Appsilon is committed to being a diverse and inclusive workplace. We encourage applicants of different backgrounds, cultures, genders, experiences, abilities, and perspectives to apply. All qualified applicants will receive consideration for employment without regard to race, color, national origin, religion, sexual orientation, gender, gender identity, age, physical disability, or length of time spent unemployed.

Skills Required

Strong experience building scalable backend systems in Python
Familiarity with modern Python language features (type hints, decorators, generators)
Ability to design clean, maintainable APIs using FastAPI, Django REST Framework, or Flask
Understanding of Python performance optimization and internals
Hands-on experience designing and operating ETL/ELT pipelines
Solid SQL skills and ability to model, optimize, and maintain database structures
Experience integrating data from multiple sources (databases, APIs, streaming)
Familiarity with large-scale data processing tools or distributed systems
Collaborative mindset; experience working with cross-functional teams (Data Scientists, ML Engineers, Developers)
Ability to implement data governance, security best practices, and compliance standards
Experience with cloud platforms (AWS/Azure/GCP)
Knowledge of R
Experience with Docker, Kubernetes, and CI/CD tools (GitHub Actions, GitLab CI)
Understanding of data governance, metadata management, and security
Experience in life sciences, biotech, genomics, or enterprise data environments
Experience with Nextflow and clinical/biotech data pipeline standards (CDISC)
Prior remote work experience with international teams


The Company

89 Employees

Year Founded: 2013

What We Do

Appsilon is a technology company providing innovative data analytics, machine learning, and managed services solutions for Fortune 500 companies, NGOs, and non-profit organizations. It specializes in developing advanced R Shiny applications and open-source solutions in R and Python, particularly for the life sciences and pharmaceutical sectors, helping clients visualize data and make data-driven decisions through AI and specialized consulting.