© 2025 iAMVamsi.

AI/Life Sciences

Medical Knowledge Graph Construction with MedCAT

Developed a comprehensive medical knowledge graph system using MedCAT for entity extraction and Neo4j for storage, integrating SNOMED CT medical terminology.

Completed: June 23, 2025 • AI/Life Sciences
Python, MedCAT, Neo4j, SNOMED CT, Medical NLP, Knowledge Graphs
Source Code Available on Request

Project Overview

I built this medical knowledge graph construction pipeline as part of my Master's studies in computational life sciences. The system uses MedCAT (Medical Concept Annotation Toolkit) to extract medical entities from clinical text automatically and maps them to SNOMED CT concepts so that terminology is standardized. The resulting graph is stored and queried in a Neo4j graph database, with an RDF framework managing the relationships between medical concepts.

The project gave me hands-on experience with medical NLP, healthcare ontologies, and graph database technologies. I learned about the challenges of processing medical text, linking entities, and building semantic relationships between medical concepts. The work shows how knowledge graphs can be applied in healthcare and clinical decision support systems.
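
The hand-off from entity extraction to graph storage described above could look roughly like the sketch below. It is an illustration under stated assumptions, not the project's actual code: it assumes annotations shaped like the dictionary MedCAT's `get_entities` returns (an `entities` mapping whose values carry `cui`, `pretty_name`, and `acc`), and the `Document` and `Concept` labels, the `MENTIONS` relationship, and the 0.5 confidence cutoff are hypothetical choices.

```python
from typing import Any

# Hypothetical Cypher: the Document/Concept labels and the MENTIONS
# relationship are illustrative names, not the project's actual schema.
MERGE_MENTIONS = """
UNWIND $rows AS row
MERGE (d:Document {id: row.doc_id})
MERGE (c:Concept {cui: row.cui})
  ON CREATE SET c.name = row.name
MERGE (d)-[m:MENTIONS]->(c)
SET m.confidence = row.confidence
"""


def annotations_to_rows(
    doc_id: str, annotations: dict[str, Any], min_acc: float = 0.5
) -> list[dict[str, Any]]:
    """Flatten MedCAT-style annotations into parameter rows for the query.

    Drops mentions below min_acc and keeps the highest-confidence
    mention for each concept identifier.
    """
    best: dict[str, dict[str, Any]] = {}
    for ent in annotations.get("entities", {}).values():
        acc = float(ent.get("acc", 0.0))
        if acc < min_acc:
            continue
        cui = ent["cui"]
        if cui not in best or acc > best[cui]["confidence"]:
            best[cui] = {
                "doc_id": doc_id,
                "cui": cui,
                "name": ent["pretty_name"],
                "confidence": acc,
            }
    # Sort by identifier so the output order is deterministic.
    return sorted(best.values(), key=lambda row: row["cui"])


# Hand-written sample in the assumed shape (not real model output).
sample = {
    "entities": {
        0: {"cui": "44054006", "pretty_name": "Diabetes mellitus type 2", "acc": 0.92},
        1: {"cui": "38341003", "pretty_name": "Hypertensive disorder", "acc": 0.81},
        2: {"cui": "22298006", "pretty_name": "Myocardial infarction", "acc": 0.31},
        3: {"cui": "44054006", "pretty_name": "Diabetes mellitus type 2", "acc": 0.95},
    }
}

rows = annotations_to_rows("note-1", sample)
for row in rows:
    print(row["cui"], row["name"], row["confidence"])
```

With the official Neo4j Python driver, the rows could then be sent in a single call such as `session.run(MERGE_MENTIONS, rows=rows)`. Batching through `UNWIND` keeps the number of round trips low, and `MERGE` makes the write safe to repeat for the same document.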

Key Features

  • ✓ MedCAT medical entity extraction
  • ✓ SNOMED CT terminology integration
  • ✓ Neo4j graph database implementation
  • ✓ RDF framework for semantic relationships
  • ✓ Clinical text processing pipeline
  • ✓ Medical concept annotation and mapping
  • ✓ Standardized healthcare terminology
  • ✓ Graph-based medical knowledge representation
  • ✓ Efficient entity relationship storage
  • ✓ Medical ontology integration
  • ✓ Clinical decision support capabilities
  • ✓ Scalable healthcare data architecture
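
To make the RDF side of the feature list concrete, here is a small sketch of how SNOMED CT concepts and their parent links might be written out as N-Triples. It is an assumption-laden illustration rather than the project's implementation: it uses the `http://snomed.info/id/` URI form for concept identifiers, maps the SNOMED CT "is a" relationship to `rdfs:subClassOf` (a modelling choice made here for illustration), and `concept_triples` is a hypothetical helper name.

```python
SNOMED_BASE = "http://snomed.info/id/"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
RDFS_SUBCLASS_OF = "http://www.w3.org/2000/01/rdf-schema#subClassOf"


def escape_literal(text: str) -> str:
    """Escape backslashes, quotes, and newlines for an N-Triples literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def concept_triples(sctid: str, label: str, parents: list[str]) -> list[str]:
    """Return N-Triples lines for one concept: a label plus one line per parent."""
    subject = f"<{SNOMED_BASE}{sctid}>"
    lines = [f'{subject} <{RDFS_LABEL}> "{escape_literal(label)}"@en .']
    for parent in parents:
        # Assumption: the "is a" relationship is modelled as rdfs:subClassOf.
        lines.append(f"{subject} <{RDFS_SUBCLASS_OF}> <{SNOMED_BASE}{parent}> .")
    return lines


# Example concept: type 2 diabetes as a child of diabetes mellitus.
triples = concept_triples("44054006", "Diabetes mellitus type 2", ["73211009"])
print("\n".join(triples))
```

A file of such lines is plain RDF, so it can be loaded by any RDF-aware tool. For Neo4j, one common route is an RDF import plugin such as neosemantics (n10s), though the source does not say which mechanism this project used.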

Technical Challenges

  • ⚡ Integrating complex medical terminology standards
  • ⚡ Optimizing Neo4j performance for medical data
  • ⚡ Ensuring accurate medical entity recognition
  • ⚡ Managing SNOMED CT concept hierarchies
  • ⚡ Handling clinical text variations and ambiguity
  • ⚡ Designing efficient graph schema for medical relationships
  • ⚡ Maintaining data privacy for healthcare information
  • ⚡ Scaling entity extraction for large clinical datasets
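
One of the challenges above, managing concept hierarchies, comes down to answering "is concept X a kind of concept Y?" when a concept can have several parents. The sketch below shows one simple way to do that with a breadth-first walk over parent links. The hierarchy is a toy with made-up names rather than real SNOMED CT content, and `ancestors` and `is_a` are hypothetical helper names.

```python
from collections import deque

# Toy polyhierarchy with made-up names (not real SNOMED CT content).
# Each concept maps to its direct parents; a concept may have several.
PARENTS_OF = {
    "t2dm": ["diabetes"],
    "diabetes": ["endocrine_disorder", "metabolic_disorder"],
    "endocrine_disorder": ["disease"],
    "metabolic_disorder": ["disease"],
}


def ancestors(concept: str, parents_of: dict[str, list[str]]) -> set[str]:
    """Collect every ancestor of a concept by walking parent links breadth-first."""
    seen: set[str] = set()
    queue = deque(parents_of.get(concept, ()))
    while queue:
        current = queue.popleft()
        if current in seen:
            # Already visited via another parent path, so skip it.
            continue
        seen.add(current)
        queue.extend(parents_of.get(current, ()))
    return seen


def is_a(child: str, parent: str, parents_of: dict[str, list[str]]) -> bool:
    """Subsumption check: True if parent is the concept itself or an ancestor."""
    return child == parent or parent in ancestors(child, parents_of)


print(sorted(ancestors("t2dm", PARENTS_OF)))
print(is_a("t2dm", "disease", PARENTS_OF))
```

The `seen` set keeps shared ancestors from being visited twice, which matters in a polyhierarchy where many paths lead to the same parent. In a graph database the same question can usually be expressed as a variable-length path query instead of application code.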

Technologies Used

Python, MedCAT, Neo4j, SNOMED CT, RDF, Medical Ontologies, spaCy, Healthcare NLP, Graph Databases, Clinical Informatics

Project Info

Category: AI/Life Sciences
Completed: June 23, 2025
Featured: No

Related Projects

NLP Pipeline for Medical Data Processing

Built an NLP pipeline to process Medline XML and ChEBI ontology data for clinical research and pharmaceutical applications.

December 1, 2024 • AI/Life Sciences

Comparative LLM Fine-tuning for Knowledge Extraction

Conducted systematic comparative experiments on Mistral-7B fine-tuning, using three distinct approaches on the NewsKG21 dataset to optimize knowledge extraction performance.

November 15, 2024 • AI/Life Sciences

Bio-Inspired Optimization for Personalized Diabetes Management

Developed a bio-inspired optimization system integrating genetic algorithms with physiological modeling for personalized Type 2 diabetes management.

April 20, 2025 • AI/Life Sciences
