Medical Chatbot using finetuned LLM and RAG on Pubmed dataset

	GitHub Handle	E-Mail	Course of Study	Matriculation Number
Jonas Gann	@J-Gann	gann@stud.uni-heidelberg.de	Data and Computer Science	3367576
Christian Teutsch	@chTeut	christian.teutsch@outlook.de	Data and Computer Science	3729420
Saif Mandour	@saifmandour	saifmandour@gmail.com	Computer Science	4189231

Advisor: Robin Khanna

This repository contains a medical chatbot using a finetuned LLM and RAG system on a Pubmed dataset.

See the Documentation for more information.

Installation and Running

Requirements

install pip requirements using pip3 install -r requirements.txt
install nodejs (version 20.x) using e.g. https://github.com/nvm-sh/nvm
install ollama: https://github.com/ollama/ollama
install docker: https://docs.docker.com/engine/install/

Setup

pull the used llm using ollama pull cniongolo/biomistral
pull chat-ui dependencies using cd chat-ui-rag && npm i
pull the dataset

Starting

start pinecone vectorstore using python3 chat-ui-rag/src/lib/server/rag/pinecone/pineconeEndpoint.py
as alternative to pinecone: start OpenSearch using docker run -d -p 9200:9200 -p 9600:9600 -e "discovery.type=single-node" -e "OPENSEARCH_INITIAL_ADMIN_PASSWORD=qaOllama2" opensearchproject/opensearch:latest. Change "vectorStoreType" to "opensearch" at .env
start mongodb server using docker run -d -p 27017:27017 --name mongo-chatui mongo:latest
start chat-ui dev server using cd chat-ui-rag && npm run dev
to start production chat-ui server run cd chat-ui-rag && npm run build && pm2 start ecosystem.config.cjs

Access

access dev chat-ui with browser at http://localhost:5173
access production chat-ui at http://localhost:8080

Repository Overview

The main components of the repository are:

User Interface: ChatUI ^1

Here you can find the user interface for the chatbot based on the open-source project. We expanded the project to include a RAG system inserting the content of scientific papers retrieved from the Pubmed dataset.

RAG System: RAG

Here you can find the RAG system used for the chatbot. It provides an endpoint for the chat-ui to query the RAG system for papers relevant to questions posed by the user. The retrieved papers are inserted into the user prompt at buildPrompt.ts.

Data Preprocessing: Preprocessing

This notebook containes the code we used to retrieve and process the Pubmed dataset as well as upload embeddings of the papers to the Pinecone vectorstore.

System Evaluation: Evaluation

This folder contains the code and results of the evaluation of the chatbot system.

Meetings: Meetings

This folder contains the notes of the meetings we had during the project.

Notes: Notes

This folder contains the notes we took during the project.

Opensearch: Opensearch

The OpenSearch Vectorbase runs on a localhost. The pubmed_preprocessing.ipynb notebook can be used to preprocess the PubMed data, creating an index and bulk loading the data into the Vectorbase. It also provides an index mapping to create a k-NN search. The k-NN can be tested in the last code section. This code is also used in the opensearchEndpoint.py. To use OpenSearch instead of Pinceone Vector Database, the .env file must be modified. The Rag attribute in the MODELS variable must be changed to "vectorStoreType": "opensearch" and "url": "http://127.0.0.1:9300".

J-Gann / medical-rag-chatbot

readme

Medical Chatbot using finetuned LLM and RAG on Pubmed dataset

Installation and Running

Requirements

Setup

Starting

Access

Repository Overview

User Interface: ChatUI ^1

RAG System: RAG

Data Preprocessing: Preprocessing

System Evaluation: Evaluation

Meetings: Meetings

Notes: Notes

Opensearch: Opensearch

J-Gann / medical-rag-chatbot

readme

Medical Chatbot using finetuned LLM and RAG on Pubmed dataset

Installation and Running

Requirements

Setup

Starting

Access

Repository Overview

User Interface: ChatUI^1

RAG System: RAG

Data Preprocessing: Preprocessing

System Evaluation: Evaluation

Meetings: Meetings

Notes: Notes

Opensearch: Opensearch

User Interface: ChatUI ^1