Data Engineer

Posted Yesterday
Be an Early Applicant
Hiring Remotely in Warszawa, Mazowieckie, POL
In-Office or Remote
23K-23K Annually
Mid level
Artificial Intelligence • Information Technology • Machine Learning • Pharmaceutical
The Role
Design, build, and maintain scalable data pipelines; integrate diverse data sources into warehouses or lakes; collaborate with data scientists and engineers to ensure quality and availability; optimize schemas, models, and performance; implement governance, security, and compliance; monitor reliability and document systems.
Summary Generated by Built In
Why we need you?

At Appsilon, we empower global organizations to make smarter decisions with data. Our solutions help Fortune 500 companies discover new drugs, save lives, optimize operations, and unlock millions in value. To do this, we rely on robust, scalable, beautifully engineered data systems.

We're looking for a Data Engineer who can elevate how our clients collect, process, and leverage massive datasets — someone who loves building modern data pipelines and wants their work to power meaningful, real-world impact.

Your responsibilities:
  • Design, build, and maintain scalable data pipelines across diverse environments.

  • Integrate data from multiple internal and external sources into data warehouses or data lakes.

  • Collaborate closely with Data Scientists, ML Engineers, and Developers to ensure data quality, structure, and availability.

  • Monitor and improve data integrity, performance, and reliability.

  • Build and optimize database schemas, data models, and documentation.

  • Implement data governance, security best practices, and compliance standards.

We’re looking for somebody with:Backend Python Development
  • Strong experience building scalable backend systems in Python.

  • Comfortable with modern language features (type hints, decorators, generators).

  • Able to design clean, maintainable APIs using FastAPI, Django REST Framework, or Flask.

  • Good understanding of performance optimization and Python internals.

  • A collaborative mindset — you enjoy working closely with cross-functional teams.

Data Engineering
  • Hands-on experience designing and operating ETL/ELT pipelines.

  • Solid SQL skills and ability to model, optimize, and maintain database structures.

  • Experience integrating data from multiple sources (databases, APIs, streaming).

  • Familiarity with large-scale data processing tools or distributed systems.

Nice to have:
  • Experience with cloud platforms (AWS/Azure/GCP).

  • Knowledge of R.

  • Experience with Docker, Kubernetes, and CI/CD tools (GitHub Actions, GitLab CI).

  • Understanding of data governance, metadata management, and security.

  • Experience in life sciences, biotech, genomics, or enterprise data environments.

  • Prior remote work experience with international teams.

Life science skills:
  • Molecular Biology & Bioinformatics: Leverages molecular biology and bioinformatics to analyze data and communicate biological insights.

  • Clinical Trials - Data Tools & Flow: Builds and analyzes clinical trial data pipelines, ensuring auditability and delivering insights through collaboration and visualization tools.

  • CDISC & Clinical Data Standards: Applies and designs clinical data structures using CDISC standards, ensuring compliance and supporting best practices across teams.

  • Nextflow: Develops scalable, reproducible bioinformatics pipelines with Nextflow across local, HPC, and cloud environments.

What we offer:
  • Competitive B2B compensation with clear salary ranges (up to 23.000 PLN net B2B).

  • Modern equipment (MacBook / ThinkPad + Linux environment).

  • Work on high-impact, cutting-edge projects in biotech, pharma, research, and enterprise analytics.

  • Budget for professional development (certifications, courses, conferences).

  • Opportunity to collaborate with industry experts on innovative data products.

  • A supportive, ambitious, and friendly team that cares about excellence.

  • Fully remote work.

Important note: To complete the hiring process, you need to have a valid government-issued ID (for Polish citizens) or a valid passport (for non-Polish citizens).

What can you expect during the process:
  • Intro call with our People Team.

  • Technical task.

  • Technical interview with the Engineering Team. 

  • Final interview with Head of Technology + offer.

Does this sound like a great opportunity for you?Use the Apply button below!

Appsilon is committed to being a diverse and inclusive workplace. We encourage applicants of different backgrounds, cultures, genders, experiences, abilities, and perspectives to apply. All qualified applicants will receive consideration for employment without regard to race, color, national origin, religion, sexual orientation, gender, gender identity, age, physical disability, or length of time spent unemployed.

Skills Required

  • Strong experience building scalable backend systems in Python
  • Familiarity with modern Python language features (type hints, decorators, generators)
  • Ability to design clean, maintainable APIs using FastAPI, Django REST Framework, or Flask
  • Understanding of Python performance optimization and internals
  • Hands-on experience designing and operating ETL/ELT pipelines
  • Solid SQL skills and ability to model, optimize, and maintain database structures
  • Experience integrating data from multiple sources (databases, APIs, streaming)
  • Familiarity with large-scale data processing tools or distributed systems
  • Collaborative mindset; experience working with cross-functional teams (Data Scientists, ML Engineers, Developers)
  • Implement data governance, security best practices, and compliance standards
  • Experience with cloud platforms (AWS/Azure/GCP)
  • Knowledge of R
  • Experience with Docker, Kubernetes, and CI/CD tools (GitHub Actions, GitLab CI)
  • Understanding of data governance, metadata management, and security
  • Experience in life sciences, biotech, genomics, or enterprise data environments
  • Experience with Nextflow and clinical/biotech data pipeline standards (CDISC)
  • Prior remote work experience with international teams
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
89 Employees
Year Founded: 2013

What We Do

Appsilon is a technology company providing innovative data analytics, machine learning, and managed services solutions for Fortune 500 companies, NGOs, and non-profit organizations. They specialize in developing advanced R Shiny applications and open-source solutions in R and Python, particularly for life sciences and pharmaceutical sectors, helping clients visualize data and make data-driven decisions through AI and specialized consulting.

Similar Jobs

Alguna Logo Alguna

Data Engineer

Artificial Intelligence • Fintech • Payments • Software
Remote
29 Locations
66 Employees
Remote
15 Locations
20 Employees

Neurons Lab Logo Neurons Lab

Data Engineer

Information Technology • Consulting
In-Office or Remote
18 Locations
54 Employees
Remote
27 Locations
30 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account