As a Principal Analyst, Data Integration, you will own the end-to-end process of evaluating, scoping, and onboarding new data sources into H1's platform. This is a senior IC role at the intersection of data, engineering, and product — the connective tissue between raw data acquisition and what ultimately ships to clients. You will work across Data & Research, Engineering, and Product to define what a new source is, how it maps to H1's schemas, what it can realistically deliver, and what it can't. You will also work directly with client-facing teams to gather requirements before integration decisions are made, translating commercial needs into data specs and data constraints back into product expectations.
You will:
- Lead structured evaluation of new data sources from scratch — assessing schema, coverage, freshness, legal constraints, and fit against H1's product needs before any engineering work begins
- Own field mapping from source to H1's bronze/silver/gold layers, producing data dictionaries, entity definitions, and structural guidance for downstream teams
- Partner with engineering and Data Lake to define ingestion requirements, entity resolution rules, and refresh cadences for new sources
- Gather requirements from client-facing teams and translate them into integration specifications; serve as the authoritative voice on what a new source can and cannot deliver before product commitments are made
- Shepherd each source end-to-end: scoping → QA → entity matching → product launch, including product QA and communicating source capabilities and limitations to product and enablement partners
- Work with the Insights team to develop new taxonomies and QA mechanisms for novel data types
- Define acceptance criteria and lead QA validation including field-level fill rates, count comparisons, and cycle-over-cycle anomaly detection
- Investigate and resolve data quality issues post-integration, coordinating with DART and engineering as needed
- Hand off to the maintaining team with complete mapping documentation; you own onboarding, not ongoing maintenance
- Produce and maintain documentation other people actually use — across scoping assessments, field mapping specs, and post-mortems
You have:
- Demonstrated end-to-end ownership of data integrations built from scratch — scoping, field mapping, QA, and handoff — with documentation to show for it
- Healthcare or life sciences domain context required; ability to ramp on new datasets and source types each quarter without needing deep subject matter expertise upfront
- Analytical fluency to assess data quality; hands-on experience with tools such as VBA, R, or SPSS; SQL a plus but not a primary requirement
- Familiarity with data lake architectures (bronze/silver/gold or equivalent) and how raw data moves through normalization and entity resolution to a product-ready state
- Experience gathering requirements from client-facing stakeholders and translating them into data or product specifications
- Experience at a B2B data company where you understood how external clients consumed your data and where client retention drove decisions
- AWS infrastructure familiarity (Athena, S3, Glue) at a query and inspection level preferred
- Comfort working in Jira or Monday in a ticket-based workflow
- Exceptional written communication — your documentation is legible, maintained, and actually used
Skills Required
- 8-12+ years in data-focused roles at healthcare data companies, pharma/biotech data vendors, or health IT firms
- Demonstrated end-to-end ownership of data integrations (scoping, field mapping, QA, handoff) with documentation
- Healthcare or life sciences domain experience
- Analytical fluency to assess data quality with hands-on experience using VBA, R, or SPSS
- SQL knowledge (a plus, not a primary requirement)
- Familiarity with data lake architectures (bronze/silver/gold), normalization, and entity resolution
- Experience gathering requirements from client-facing stakeholders and translating into data/product specifications
- Experience at a B2B data company with understanding of external client data consumption and retention-driven decisions
- AWS infrastructure familiarity (Athena, S3, Glue) at a query and inspection level
- Comfort working in Jira or Monday within a ticket-based workflow
- Exceptional written communication and maintainable documentation skills
What We Do
Access to medicine and healthcare is a basic human right. At H1, we believe access to the best healthcare information is also a basic human right, one that will be more important in the 21st century than ever before. Our commitment to creating a healthier future for everyone drives us to build and maintain the most current, accurate, and comprehensive healthcare knowledge base available, as well as the tools and intelligence to extract unparalleled insights to carry global healthcare forward.
Why Work With Us
We’re a team of people building products that help solve difficult problems in healthcare. We work through complex challenges every day, navigating ambiguity, wrestling with uncertainty, and pushing the boundaries of what’s possible, all while caring deeply about one another and the people we seek to help.