At H1, we believe access to the best healthcare information is a basic human right. Our mission is to provide a platform that can optimally inform every doctor interaction globally. This promotes health equity and builds needed trust in healthcare systems. To accomplish this our teams harness the power of data and AI-technology to unlock groundbreaking medical insights and convert those insights into action that result in optimal patient outcomes and accelerates an equitable and inclusive drug development lifecycle. Visit h1.co to learn more about us.
Data Engineering is responsible for the development and delivery of our most important asset—our data. With thousands of data sources from around the world, the team ensures that data is accurate, normalized, and delivered at a velocity that keeps up with real-world changes. As we expand our markets and the scope of data we provide to our customers, our team must scale to meet that demand.
- Work on developing strategies and frameworks to capture web data at scale.
- Design, develop, and maintain scalable data extraction frameworks that ingest structured and unstructured data from diverse sources.
- Build and optimize robust ETL/ELT pipelines using big data technologies, especially Apache Spark on cloud platforms (preferably AWS EMR).
- Improve the efficiency, reliability, and performance of data processing systems through thoughtful design and continuous optimization.
- Transform, clean, and normalize complex datasets for downstream use, ensuring high standards of data quality and consistency.
- Partner with senior engineers to evolve H1’s data architecture and infrastructure in support of product and platform scalability.
- Lead data integration efforts across multiple systems, ensuring accuracy and seamless collaboration across teams.
- Monitor and troubleshoot data flows and pipelines, proactively identifying and resolving performance issues.
- Maintain clear documentation of systems, workflows, and processes to promote transparency and operational excellence.
- Participate in code reviews and promote a culture of engineering excellence, mentorship, and continuous improvement.
- Collaborate closely with cross-functional teams to align technical execution with business goals
- It’s a bonus if you’re familiar with model training and fine-tuning, particularly in NLP (Natural Language Processing) contexts.
- You possess a basic knowledge of network, security, and encryption protocols such as HTTP/HTTPS/TLS.
- You’re able to work collaboratively across teams and communicate effectively with both technical and non-technical stakeholders.
- You have strong analytical and problem-solving skills with a focus on data quality and performance optimization.
- You have a passion for writing clean, efficient code and following best practices.
- Strong proficiency in Python.
- Proficiency in web scraping strategies and technologies: curl, network analysis, proxies and selenium/playwright.
- Strong SQL skills and experience with PostgreSQL.
- Experience with big data tools like Apache Spark, particularly on cloud platforms, with a preference for AWS EMR.
- Experience with Docker or other containerization technologies.
Top Skills
What We Do
Access to medicine and healthcare is a basic human right. At H1, we believe access to the best healthcare information is also a basic human right, one that will be more important in the 21st century than ever before. Our commitment to creating a healthier future for everyone drives us to build and maintain the most current, accurate, and comprehensive healthcare knowledge base available, as well as the tools and intelligence to extract unparalleled insights to carry global healthcare forward.
Why Work With Us
We’re a team of people building products that help solve difficult problems in healthcare. We work through complex challenges every day, navigating ambiguity, wrestling with uncertainty, and pushing the boundaries of what’s possible–all while caring deeply about one another and the people we seek to help.
Gallery









