At Appsilon, we empower global organizations to make smarter decisions with data. Our solutions help Fortune 500 companies discover new drugs, save lives, optimize operations, and unlock millions in value. To do this, we rely on robust, scalable, beautifully engineered data systems.
We're looking for a Data Engineer who can elevate how our clients collect, process, and leverage massive datasets — someone who loves building modern data pipelines and wants their work to power meaningful, real-world impact.
Your responsibilities:Design, build, and maintain scalable data pipelines across diverse environments.
Integrate data from multiple internal and external sources into data warehouses or data lakes.
Collaborate closely with Data Scientists, ML Engineers, and Developers to ensure data quality, structure, and availability.
Monitor and improve data integrity, performance, and reliability.
Build and optimize database schemas, data models, and documentation.
Implement data governance, security best practices, and compliance standards.
Strong experience building scalable backend systems in Python.
Comfortable with modern language features (type hints, decorators, generators).
Able to design clean, maintainable APIs using FastAPI, Django REST Framework, or Flask.
Good understanding of performance optimization and Python internals.
A collaborative mindset — you enjoy working closely with cross-functional teams.
Hands-on experience designing and operating ETL/ELT pipelines.
Solid SQL skills and ability to model, optimize, and maintain database structures.
Experience integrating data from multiple sources (databases, APIs, streaming).
Familiarity with large-scale data processing tools or distributed systems.
Experience with cloud platforms (AWS/Azure/GCP).
Knowledge of R.
Experience with Docker, Kubernetes, and CI/CD tools (GitHub Actions, GitLab CI).
Understanding of data governance, metadata management, and security.
Experience in life sciences, biotech, genomics, or enterprise data environments.
Prior remote work experience with international teams.
Molecular Biology & Bioinformatics: Leverages molecular biology and bioinformatics to analyze data and communicate biological insights.
Clinical Trials - Data Tools & Flow: Builds and analyzes clinical trial data pipelines, ensuring auditability and delivering insights through collaboration and visualization tools.
CDISC & Clinical Data Standards: Applies and designs clinical data structures using CDISC standards, ensuring compliance and supporting best practices across teams.
Nextflow: Develops scalable, reproducible bioinformatics pipelines with Nextflow across local, HPC, and cloud environments.
Competitive B2B compensation with clear salary ranges (up to 23.000 PLN net B2B).
Modern equipment (MacBook / ThinkPad + Linux environment).
Work on high-impact, cutting-edge projects in biotech, pharma, research, and enterprise analytics.
Budget for professional development (certifications, courses, conferences).
Opportunity to collaborate with industry experts on innovative data products.
A supportive, ambitious, and friendly team that cares about excellence.
Fully remote work.
Important note: To complete the hiring process, you need to have a valid government-issued ID (for Polish citizens) or a valid passport (for non-Polish citizens).
What can you expect during the process:Intro call with our People Team.
Technical task.
Technical interview with the Engineering Team.
Final interview with Head of Technology + offer.
Appsilon is committed to being a diverse and inclusive workplace. We encourage applicants of different backgrounds, cultures, genders, experiences, abilities, and perspectives to apply. All qualified applicants will receive consideration for employment without regard to race, color, national origin, religion, sexual orientation, gender, gender identity, age, physical disability, or length of time spent unemployed.
Skills Required
- Strong experience building scalable backend systems in Python
- Familiarity with modern Python language features (type hints, decorators, generators)
- Ability to design clean, maintainable APIs using FastAPI, Django REST Framework, or Flask
- Understanding of Python performance optimization and internals
- Hands-on experience designing and operating ETL/ELT pipelines
- Solid SQL skills and ability to model, optimize, and maintain database structures
- Experience integrating data from multiple sources (databases, APIs, streaming)
- Familiarity with large-scale data processing tools or distributed systems
- Collaborative mindset; experience working with cross-functional teams (Data Scientists, ML Engineers, Developers)
- Implement data governance, security best practices, and compliance standards
- Experience with cloud platforms (AWS/Azure/GCP)
- Knowledge of R
- Experience with Docker, Kubernetes, and CI/CD tools (GitHub Actions, GitLab CI)
- Understanding of data governance, metadata management, and security
- Experience in life sciences, biotech, genomics, or enterprise data environments
- Experience with Nextflow and clinical/biotech data pipeline standards (CDISC)
- Prior remote work experience with international teams
What We Do
Appsilon is a technology company providing innovative data analytics, machine learning, and managed services solutions for Fortune 500 companies, NGOs, and non-profit organizations. They specialize in developing advanced R Shiny applications and open-source solutions in R and Python, particularly for life sciences and pharmaceutical sectors, helping clients visualize data and make data-driven decisions through AI and specialized consulting.







