Dandelion Health Inc

Software Engineer

Reposted 16 Days Ago

Hiring Remotely in USA

Remote

135K-150K Annually

Mid level

Artificial Intelligence • Big Data • Healthtech • Analytics

The Role

Build and optimize high-throughput de-identification pipelines for large clinical datasets, execute QA, run and tune pipelines in health system cloud environments, reduce errors and costs, and collaborate with privacy and clinical informatics stakeholders.

Summary Generated by Built In

Our Team

Dandelion Health was founded in 2020 by experts in health tech, hospital systems, academia, and clinical AI. We are building the world’s largest AI training and clinical development platform. Today, we pride ourselves on our ability to make data access as easy as possible for AI developers, pharma, and medical devices, while raising the bar for patient safety and data quality. Tomorrow, we will be the place where any healthcare organization can go to build a responsible clinical AI product. Our culture is all about learning from data and improving, so we can help our clients improve health through AI. Meet the rest of our team here.

Our Data

We partner with health systems to safely and ethically make their de-identified patient data available to AI developers. Currently, the data is acquired from Sharp HealthCare, Sanford Health, and Texas Health Resources – with two additional U.S. health systems joining soon.

We have clinical data dating back to July 1, 2016. This data represents over 10 million patients and includes but is not limited to:

Structured data (e.g., 100% of the EMR, including some claims)
Unstructured text (e.g., clinical notes, radiology reports)
Images (e.g., DICOM, pathology)
Video
Waveforms
Continuous streaming monitoring data

Your Role

Dandelion is constantly expanding the breadth, depth, and completeness of health system datasets while improving the speed and quality of our de-identification pipeline. As an engineer working on our de-identification pipelines, you will:

Design and implement software systems that perform these de-identification rules at high scale and throughput (we de-identify billions of rows of data and millions of images each month) while constraining costs.
Generate and execute quality assurance plans to validate our de-identification processes.
Run de-identification pipelines in health system cloud environments, and optimize these pipelines to minimize error rates, improve processing efficiency, and reduce manual effort and cost.
Partner with our Director of Privacy and Clinical Informaticists to define de-identification rules.

Required technical skills

3+ years of development experience in Python or an equivalent language in a professional setting, across the full software development lifecycle (design, implementation, testing, deployment, maintenance);
Familiarity with one or more command languages (e.g. Bash) and SQL.

Required Non-technical skills

Demonstrated ability to design and improve workflows, including associated operating procedures, cost management, and quality assurance;
Strong analytical decision-making and organizational skills;
Perseverance and practical problem solving;
Humility and strong team collaboration;
Enthusiasm about protecting patients’ personal data.

We are an AWS and Python shop, and our datasets are stored in AWS Redshift, Snowflake, or Parquet files which are processed in Pandas DataFrames.

Preferred skills

Proficiency with data structures such as Pandas DataFrames;
Previous software deployment in a cloud computing environment (e.g., AWS, Azure);
Familiarity with virtualization and containerization (e.g., Docker, VMware);
Prior experience working with healthcare data;
Experience interacting with non-technical stakeholders to deploy software solutions.

Team Benefits

Remote work and flexible hours. Availability needed for meetings, which we try to keep to a healthy minimum
Complete wellness benefits including healthcare, dental, vision, PTO, sick days and more. Ask for details

Professional development days to build your skills
Collegial work environment
Academic bent towards inquiry and problem solving but start-up speed and flexibility
Great balance of focus time to work on projects but easy to access team members to discuss issues and work collaboratively
Dandelion is a mission-driven company that is focused on improving patient care

Skills Required

3+ years of development experience in Python or an equivalent language across the full software development lifecycle
Familiarity with one or more command languages (e.g. Bash)
Familiarity with SQL
Demonstrated ability to design and improve workflows, including operating procedures, cost management, and quality assurance
Strong analytical decision-making and organizational skills
Perseverance and practical problem solving
Humility and strong team collaboration
Enthusiasm about protecting patients' personal data
Proficiency with data structures such as Pandas DataFrames
Previous software deployment in a cloud computing environment (e.g., AWS, Azure)
Familiarity with virtualization and containerization (e.g., Docker, VMware)
Prior experience working with healthcare data
Experience interacting with non-technical stakeholders to deploy software solutions
Experience with AWS Redshift, Snowflake, or Parquet files

View all jobs at Dandelion Health Inc

View Dandelion Health Inc Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

49 Employees

Year Founded: 2020

What We Do

Dandelion Health is a health tech startup that develops a clinical data and artificial intelligence platform. By combining proprietary AI tools with a vast repository of high-quality, de-identified real-world patient data, the company enables researchers and life sciences firms to train, test, and validate algorithms, thereby accelerating clinical development and commercialization of precision medicine.