Medical RAG using BioMistral 7B LLM Running Locally 🏥🩺

This project implements a Retrieval-Augmented Generation (RAG) system on a fully open-source stack. BioMistral 7B is the main model, PubMedBert produces the embeddings, Qdrant serves as the self-hosted vector database, and LangChain and Llama CPP handle orchestration.
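To make the retrieve-then-generate flow concrete, here is a minimal, dependency-free sketch. It is not the project's code: the bag-of-words `embed` function stands in for PubMedBert, the in-memory `retrieve` function stands in for a Qdrant search, and the final prompt is what would be handed to BioMistral. All function names and the sample chunks are hypothetical.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy bag-of-words 'embedding' (assumption: stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the query (stand-in for a vector DB search)."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(query: str, context: list[str]) -> str:
    """Assemble the prompt that would be passed to the LLM."""
    joined = "\n".join(f"- {c}" for c in context)
    return (
        "Use only the context below to answer.\n\n"
        f"Context:\n{joined}\n\nQuestion: {query}\nAnswer:"
    )


# Illustrative chunks only; a real run would load them from the data folder.
chunks = [
    "Metformin is a first-line medication for type 2 diabetes.",
    "Aspirin inhibits platelet aggregation.",
    "Insulin therapy is required in type 1 diabetes.",
]
question = "What is the first-line medication for type 2 diabetes?"
prompt = build_prompt(question, retrieve(question, chunks, k=1))
print(prompt)
```

In a real pipeline the three stand-ins would be replaced by the embedding model, the vector database client, and the LLM call, but the order of operations stays the same: embed the question, fetch the nearest chunks, then generate from a prompt that contains them.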

BioMistral

BioMistral is a collection of Large Language Models (LLMs) specialized for the medical domain. The models build on the Mistral LLM and are further trained on a large corpus of medical text and scientific publications, particularly from PubMed Central. This additional training is what lets BioMistral handle medical questions and tasks with greater accuracy and nuance than the base model.

Here are some key points about BioMistral:

- Open-source: All BioMistral models are available under an Apache License, meaning they are free to use and modify for research and development purposes.
- Multiple models: BioMistral offers a suite of models with different sizes and capabilities, catering to diverse needs and hardware limitations.
- High performance: BioMistral consistently ranks among the top open-source medical LLMs in various benchmarks, demonstrating its effectiveness in understanding and generating medical text.
- Applications: BioMistral can be used for various tasks in the medical field, including:
  - Generating medical summaries and reports
  - Answering medical questions in a comprehensive and informative way
  - Assisting with clinical decision-making
  - Analyzing medical literature
  - Developing chatbots for patient education and support

UI

Getting Started

To run this project, follow these steps:

Install Docker.
Pull the Qdrant Docker image:
```
docker pull qdrant/qdrant
```

Run the Qdrant container:
```
docker run -p 6333:6333 --rm qdrant/qdrant
```

Check if the vector DB is working by visiting http://localhost:6333/dashboard.
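The same check can be scripted rather than done in a browser. The sketch below uses only the Python standard library and assumes the dashboard URL given above; the function name `qdrant_is_up` is hypothetical and not part of the project.

```python
import urllib.error
import urllib.request


def qdrant_is_up(url: str = "http://localhost:6333/dashboard", timeout: float = 2.0) -> bool:
    """Return True if an HTTP GET to `url` answers with status 200, else False."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, or an HTTP error status.
        return False


if __name__ == "__main__":
    print("Qdrant reachable:", qdrant_is_up())
```

A `False` result usually means the container is not running or port 6333 is not published.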
Download BioMistral 7B using this link and place the model file in the main working directory of the project.
Install the required libraries using:
```
pip install -r requirements.txt
```
Create a vector database of the files in the data folder by running:
```
python create_vector_db.py
```
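The contents of `create_vector_db.py` are not shown here, so the following is only a sketch of one step such ingestion scripts commonly perform: splitting each document into overlapping chunks before embedding and storing them. The function name, chunk size, and overlap are assumptions, not values taken from the project.

```python
def split_into_chunks(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character windows that overlap by `overlap` characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # this window already reaches the end of the text
    return chunks


sample = "abcdefghij"  # 10 characters, small enough to verify by hand
print(split_into_chunks(sample, chunk_size=4, overlap=2))
# ['abcd', 'cdef', 'efgh', 'ghij']
```

The overlap keeps a sentence that straddles a boundary retrievable from at least one chunk, at the cost of storing some text twice.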
Finally, run the application:
```
uvicorn app:app --reload
```
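In `uvicorn app:app`, the part before the colon is the module (`app.py`) and the part after it is the ASGI application object inside that module. The project's actual `app.py` is not reproduced here and most likely uses a web framework; the sketch below is a bare ASGI callable with no dependencies, shown only to illustrate what uvicorn expects to find. The response body and the `call_once` helper are hypothetical.

```python
import asyncio
import json


async def app(scope, receive, send):
    """Minimal ASGI application: answers every HTTP request with a small JSON body."""
    if scope["type"] != "http":
        return  # this sketch does nothing for lifespan or websocket scopes
    body = json.dumps({"status": "ok", "path": scope["path"]}).encode("utf-8")
    await send({
        "type": "http.response.start",
        "status": 200,
        "headers": [(b"content-type", b"application/json")],
    })
    await send({"type": "http.response.body", "body": body})


async def call_once(path: str) -> list[dict]:
    """Drive the app once without a server and collect the messages it sends."""
    sent = []

    async def receive():
        return {"type": "http.request", "body": b"", "more_body": False}

    async def send(message):
        sent.append(message)

    await app({"type": "http", "method": "GET", "path": path}, receive, send)
    return sent


if __name__ == "__main__":
    messages = asyncio.run(call_once("/health"))
    print(messages[0]["status"], messages[1]["body"].decode())
```

The `--reload` flag restarts the server when source files change, which is convenient during development and normally omitted in production.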

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

AquibPy/Medical-RAG-LLM
