Diagnose-Me: GraphRAG-Based Medical Diagnosis System

Overview

Diagnose-Me is a GraphRAG-based medical diagnosis system that integrates graph-based entity relationships with Retrieval-Augmented Generation (RAG) techniques. This project leverages a combination of Neo4j for knowledge graphs, FAISS for vector-based retrieval, and LLMs to generate accurate medical diagnoses based on user queries.

Features

GraphRAG Entity Relationship Mapping: Utilizes a knowledge graph to structure medical entities and their relationships.
RAG for Contextual Search: Implements a chunking mechanism with overlap to store medical data in a FAISS vector database, allowing for accurate similarity comparisons.
Hybrid Query Processing: User queries are converted to vector representations and compared against both the vector database and knowledge graph for relevant results.
LLM-Based Diagnosis: The combined results from the vector database and graph database are sent to an LLM to generate a proper medical diagnosis.
Doctor Approval System: Doctors can review and approve the generated diagnosis for better reliability.
Patient Medical History Contextualization: Users can enter their medical history, enhancing the relevance of diagnoses.
Chat Storage: Patient-doctor chats are stored securely in MongoDB, along with user credentials.

Technical Stack

Backend

Flask: Manages the API and backend logic.
Neo4j: Stores medical entities and relationships in a structured knowledge graph.
FAISS: Handles vector-based retrieval for efficient similarity searches.
MongoDB: Stores patient and doctor chats securely.

Frontend

Next.js: Provides an interactive and user-friendly interface for patients and doctors.
Tailwind CSS: Enhances UI aesthetics.

Implementation Details

GraphRAG

Implemented using llmgraphtransformer in turning natural language into entity relations which uploads data to Neo4j.
Graph relationships improve diagnosis accuracy by structuring medical knowledge.
Dataset used mimi_ex(https://huggingface.co/datasets/morson/mimic_ex)

RAG with FAISS

Medical texts are chunked with overlap and indexed in FAISS for similarity comparison.
Queries are vectorized and matched against stored chunks to retrieve relevant medical data.

Query Processing

The system retrieves relevant data from both FAISS and Neo4j.
The retrieved results, along with the user query, are sent to the LLM.
The LLM generates a diagnosis based on symptoms and contextual information.
The processed response can be approved and edited by doctor for further use

Basic Workflow

                   User Query
                       │
           ┌───────────┴───────────┐
           ▼                       ▼
     FAISS Vector DB           Neo4j Graph DB
   (Document Similarity)    (Entity Relationships)
           │                       │
           └───────────┬───────────┘
                       ▼
                Gemini LLM Processor
                       │
                       ▼
          Combined Diagnostic Report
                       │
                       ▼
          Doctor Review & Approval
                       │
                       ▼
          Patient Feedback & History
                       │
                       ▼
          MongoDB (Chat History Storage)

Folder Structure

backend/
  ├── graphRAG/ (Graph-based entity relationship scripts)
  ├── rag/ (Vector database and retrieval components)
  ├── server.py (Flask backend API)
  ├── combine_response.py (to compare both results and combine )
  ├── requirements.txt (Dependencies)

frontend/
  ├── src/
  │   ├── app/ (Pages and API routes)
  │   ├── components/ (UI components)
  │   ├── lib/ (MongoDB and authentication utilities)
  │   ├── models/
  ├── package.json (Frontend dependencies)

How to Run

Backend

Install dependencies:

cd backend
pip install -r requirements.txt

Run the Flask server:
```
python server.py
```

Frontend

Install dependencies:
```
cd frontend
npm install
```
Start the Next.js frontend:
```
npm run dev
```

Future Improvements

Enhance the graph-based reasoning with additional medical knowledge sources and improved cyphertext creation from natural language.
Improve user experience with real-time chat and multilingual support.
Optimize FAISS indexing for faster vector retrieval.
Add option to to improve knowledge from previous chats.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
backend		backend
frontend		frontend
.gitignore		.gitignore
Readme.md		Readme.md
graphrag.ipynb		graphrag.ipynb
sample_query.txt		sample_query.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Diagnose-Me: GraphRAG-Based Medical Diagnosis System

Overview

Features

Technical Stack

Backend

Frontend

Implementation Details

GraphRAG

RAG with FAISS

Query Processing

Basic Workflow

Folder Structure

How to Run

Backend

Frontend

Future Improvements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

nandhu-44/diagnose-me

Folders and files

Latest commit

History

Repository files navigation

Diagnose-Me: GraphRAG-Based Medical Diagnosis System

Overview

Features

Technical Stack

Backend

Frontend

Implementation Details

GraphRAG

RAG with FAISS

Query Processing

Basic Workflow

Folder Structure

How to Run

Backend

Frontend

Future Improvements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages