Semantic Data Modeler

Posted 4 Days Ago
Be an Early Applicant
4 Locations
Remote or Hybrid
110K-160K Annually
Mid level
Artificial Intelligence • Healthtech • Information Technology • Natural Language Processing • Software • Analytics • Generative AI
IMO Health is a clinical data intelligence company, improving how data is used across the healthcare landscape
The Role
The Semantic Data Modeler will develop semantic models, integrate NLP workflows, ensure data quality for knowledge graphs, and collaborate across teams.
Summary Generated by Built In
At IMO Health, Semantic Data Modelers are key members of our ontology-driven graph engineering team, helping to build and maintain a virtualized, intelligent, and scalable medical terminology platform. Your work will empower over 740,000 clinicians by enhancing how healthcare data is structured, delivered, and understood.

We are seeking an experienced Semantic Data Modeler to join our team, focusing on the development and application of Knowledge Graphs integrated with Data Science and Natural Language Processing (NLP) workflows. In this critical role, you will contribute to the design and implementation of semantic models, bridging the gap between raw data sources and actionable clinical insights. You will utilize your skills in NLP and data analysis to enrich our knowledge graph, ensuring high data quality and accessibility for semantic enrichment and clinical interoperability initiatives. This position requires strong technical proficiency and exceptional collaboration skills, working closely with staff semantic engineers, clinicians, and content teams. 

WHAT YOU’LL DO:

  • Semantic Model Design: Contribute to the design, development, and iterative refinement of complex semantic data models, with a focus on ontologies, knowledge graphs, and property graphs. 
  • Requirement Translation: Assist senior team members in translating intricate, cross-functional business needs into formal, scalable knowledge graph structures, ensuring tight alignment with the enterprise data strategy. 
  • Governance and Standards: Document semantic assets, including detailed entity definitions, relationship types, axioms, constraints, and data lineage, ensuring adherence to established best practices and fostering consistency across the organization. 
  • Graph Data Analysis & Feature Engineering: Utilize data science methodologies and Python scripting to conduct exploratory data analysis (EDA) on graph structure, identify data patterns, perform feature engineering, and support advanced analytics based on the knowledge graph. 
  • Data-Driven NLP Integration: Apply Natural Language Processing (NLP) techniques and tooling to unstructured clinical text and data streams to identify, extract, and map new entities, attributes, and relationships directly into the semantic models, effectively structuring raw data for machine learning consumption. 
  • Robust Data Quality: Implement and maintain data quality frameworks, validation rules (e.g., using SHACL), and transformation logic within the semantic layer to ensure the accuracy, reliability, and consistency of the knowledge graph. 
  • Cross-Functional Partnership: Partner closely with Staff semantic engineers, clinicians, content teams, and business leaders to understand domain knowledge and requirements, and ensure semantic solutions effectively meet organizational objectives. 
  • Research & Evaluation: Assist in researching, evaluating, and utilizing new technologies, methodologies, and best practices in semantic modeling, knowledge graph technologies, and NLP to drive continuous process improvement. 
  • Knowledge Sharing: Proactively share technical expertise and knowledge with peers and cross-functional teams. 

WHAT YOU’LL NEED:

  • BA/BS in a STEM field (e.g., Computer Science, Data Science, Bioinformatics) with 3-5 years of hands-on work experience in data modeling or data engineering. 
  • Proven hands-on experience in semantic modeling concepts (ontologies, knowledge graphs, property graphs). 
  • Expertise in Python for data manipulation, analysis, and pipeline development, including libraries like Pandas/NumPy. 
  • Strong understanding of statistical and machine learning concepts (e.g., classification, clustering, regression) and their application to graph-based data. 
  • Demonstrated experience with NLP technologies and libraries (e.g., NLTK, spaCy, Gensim, Hugging Face) for text extraction, named entity recognition, or relationship extraction. 
  • Strong working knowledge of graph database platforms (e.g., Amazon Neptune, Neo4j, etc.) and graph query languages (e.g., SPARQL or Gremlin). 
  • Familiarity with semantic web standards like OWL, RDFS, and SHACL. 
  • Experience with relational databases (SQL) and general data warehousing concepts. 
  • Ability to communicate complex technical concepts effectively to both technical and non-technical stakeholders. 
  • Experience in an Agile/Scrum environment, iteratively developing and deploying data solutions.

NICE TO HAVE:

  • Practical experience with AWS data services (e.g., Glue, Sagemaker) and ETL/ELT methodologies.
  • Understanding healthcare ontologies and standards like SNOMED-CT, LOINC, RxNorm, and ICD-10. 

Top Skills

Amazon Neptune
Gensim
Hugging Face
Neo4J
Nltk
Numpy
Owl
Pandas
Python
Rdfs
Shacl
Spacy
SQL
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Chicago, IL
320 Employees
Year Founded: 1994

What We Do

We are a team of dedicated clinical terminologists, data scientists, industry subject matter experts, and informaticists who helped facilitate the evolution from analogue to digital capture of clinical events, the precise code-mapping that simplifies complex workflows, and the translation of unstructured into structured data. We “wrote the digital dictionary” used in every major EHR, and we are leveraging clinical AI to generate insights that expand and deepen our impact across the healthcare ecosystem.  

At the end of the day, we don’t make decisions for our clients. We provide them with the digital tools to enable sound decision-making.  

Why Work With Us

We are building a clinical intelligence stack—medical ontology, human expertise, and AI—that makes data more useful and more powerful. By enhancing data’s structure, richness, and precision, we reduce noise and error, streamline complexity, and create clarity across the clinical information chain.

Gallery

Gallery

Similar Jobs

Cox Enterprises Logo Cox Enterprises

Sr Customer Care Specialist (Manheim)

Automotive • Cloud • Greentech • Information Technology • Other • Software • Cybersecurity
Remote or Hybrid
AZ, USA
21-32

Gusto Logo Gusto

Staff Software Engineer

Fintech • HR Tech
Easy Apply
Remote or Hybrid
10 Locations
191K-265K

Gusto Logo Gusto

Staff Product Designer

Fintech • HR Tech
Easy Apply
Remote or Hybrid
6 Locations
146K-222K
Easy Apply
Remote or Hybrid
7 Locations
84K-106K

Similar Companies Hiring

Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account