Big Data Engineer

Job Posted 3 Days Ago Posted 3 Days Ago
Hiring Remotely in US
Remote
100K-120K Annually
Mid level
Artificial Intelligence • Big Data • Healthtech • Database • Business Intelligence
We are creating a healthier future by connecting the world with the right doctors.
The Role
As a Data Engineer at H1, you will build scalable data pipelines and workflows to transform raw client data into actionable insights. Your role involves integrating diverse client data sources, ensuring data quality and scalability, and collaborating within a team to contribute to data strategies.
Summary Generated by Built In

At H1, we believe access to the best healthcare information is a basic human right. Our mission is to provide a platform that can optimally inform every doctor interaction globally. This promotes health equity and builds needed trust in healthcare systems. To accomplish this our teams harness the power of data and AI-technology to unlock groundbreaking medical insights and convert those insights into action that result in optimal patient outcomes and accelerates an equitable and inclusive drug development lifecycle. Visit h1.co to learn more about us.


Data Engineering is responsible for the development and delivery of our most important asset - our data. Looking across thousands of data sources from across the globe, the data engineering team is responsible for making sense out of that data to create the world’s most extensive and comprehensive knowledge base of healthcare stakeholders and the ecosystem they influence. It is our job to ensure that only accurate, normalized data flows through to our customers, and at a velocity that keeps up with the changes in the real world.



WHAT YOU'LL DO AT H1

As a Data Engineer on the H1DN Team, you will play a key role in building scalable data pipelines and enrichment workflows that transform raw client data into accurate, actionable insights. With minimal guidance, you will focus on data ingestion and enrichment, ensuring seamless integration of client data from diverse sources (CSV, Parquet, JSON, APIs) while tackling challenges related to scalability, data quality, and standardization.


You will:

- Develop and enhance processes to enrich raw or partially processed data using established business logic, ensuring it is accurate and ready for product use.

- Build and maintain scalable and reliable data pipelines that support the team’s enrichment workflows.

- Integrate enriched data from core platforms (e.g., CT platform) into broader data systems, applying necessary transformations and aligning with business requirements.

- Contribute to code reviews, holding a high bar for quality and aligning with organizational engineering guidelines.

- Follow development workflows, including coding, testing, deployment, and monitoring, to ensure quality and efficiency.

- Work collaboratively with team members and escalate issues appropriately when challenges arise.

- Contribute to the understanding and execution of tasks with a strong focus on accuracy, scalability, and performance.



ABOUT YOU

You have strong hands-on technical skills and experience in data engineering, with a track record of building and maintaining scalable data systems and pipelines. You excel at solving data engineering challenges and contributing to innovative solutions.

- Experience developing and optimizing data workflows, applying business logic for data enrichment, and addressing technical challenges with creative solutions

- Strong knowledge of building and scaling data infrastructure, including integration with core platforms

- Experience working with data quality challenges and implementing validation mechanisms

- Self-motivated with the ability to manage tasks and collaborate effectively within a team

- Ability to align work with broader organizational goals and contribute to strategic initiatives

- Proactively identifies potential risks and helps implement solutions early in the project lifecycle

- Eager to learn, grow, and contribute to a collaborative, high-performing engineering team



REQUIREMENTS

- 3+ years of experience in data engineering, specializing in building scalable data pipelines and enrichment processes, with a track record of working with large datasets, including ingestion, transformation, and optimization

- Proficiency in Spark, Python, and SQL for building scalable data processing pipelines

- Hands-on experience with Kubernetes for container orchestration and deployment

- Strong background in AWS, including services such as S3, Lambda, ECS, and RDS for data infrastructure

- Experience with EMR and Databricks to optimize large-scale data workflows

- Has an understanding of LLM usage in production



COMPENSATION

This role pays $100,000 to $120,000 per year, based on experience, in addition to stock options.


Anticipated role close date: 05/28/2025



H1 OFFERS

- Full suite of health insurance options, in addition to generous paid time off

- Pre-planned company-wide wellness holidays

- Retirement options

- Health & charitable donation stipends

- Impactful Business Resource Groups

- Flexible work hours & the opportunity to work from anywhere

- The opportunity to work with leading biotech and life sciences companies in an innovative industry with a mission to improve healthcare around the globe



H1 is proud to be an equal opportunity employer that celebrates diversity and is committed to creating an inclusive workplace with equal opportunity for all applicants and teammates. Our goal is to recruit the most talented people from a diverse candidate pool regardless of race, color, ancestry, national origin, religion, disability, sex (including pregnancy), age, gender, gender identity, sexual orientation, marital status, veteran status, or any other characteristic protected by law.

 

H1 is committed to working with and providing access and reasonable accommodation to applicants with mental and/or physical disabilities. If you require an accommodation, please reach out to your recruiter once you've begun the interview process. All requests for accommodations are treated discreetly and confidentially, as practical and permitted by law.



#H1-HF

Top Skills

AWS
Databricks
Ecs
Emr
Kubernetes
Lambda
Python
Rds
S3
Spark
SQL
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York , NY
500 Employees
On-site Workplace
Year Founded: 2017

What We Do

Access to medicine and healthcare is a basic human right. At H1, we believe access to the best healthcare information is also a basic human right, one that will be more important in the 21st century than ever before.

Our commitment to creating a healthier future for everyone drives us to build and maintain the most current, accurate, and comprehensive healthcare knowledge base available, as well as the tools and intelligence to extract unparalleled insights to carry global healthcare forward.

Why Work With Us

We’re a team of people building products that help solve difficult problems in healthcare. We work through complex challenges every day, navigating ambiguity, wrestling with uncertainty, and pushing the boundaries of what’s possible–all while caring deeply about one another and the people we seek to help.

Gallery

Gallery

Similar Jobs

Nagarro Logo Nagarro

Associate Principal Engineer, Big Data Engineer

Artificial Intelligence • Information Technology • Machine Learning • Software • Virtual Reality • Analytics
Remote
USA
19994 Employees

Nagarro Logo Nagarro

Big Data Engineer with Databricks

Artificial Intelligence • Information Technology • Machine Learning • Software • Virtual Reality • Analytics
Remote
USA
19994 Employees

Nagarro Logo Nagarro

Senior Staff Engineer, Big Data

Artificial Intelligence • Information Technology • Machine Learning • Software • Virtual Reality • Analytics
Remote
USA
19994 Employees

Rackspace Technology Logo Rackspace Technology

Sr Big Data Engineer Airflow and Oozie (GCP)

Cloud • Information Technology • Software
Remote
United States
7509 Employees
116K-198K Annually

Similar Companies Hiring

Stepful Thumbnail
Software • Healthtech • Edtech • Artificial Intelligence
New York, New York
60 Employees
HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees
True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account