Implement a Retrieval Augmented Generation (RAG) system to query and retrieve information from your local documents efficiently.
Gain practical experience with embeddings, vector databases, and local Large Language Models (LLMs).
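At a high level, the system embeds your documents, stores the vectors in a vector database, and at query time retrieves the most relevant chunks and passes them to a local LLM as context. The sketch below illustrates that flow using the chromadb and ollama Python packages and the nomic-embed-text embedding model; these are assumptions for illustration, and the actual scripts in this repository may use different libraries and models.

```python
# Minimal sketch of the RAG flow (assumed libraries: chromadb, ollama).
# The repository's own scripts may be structured differently.
import chromadb
import ollama

# 1. Embed a few local document chunks and store them in an in-memory vector store.
documents = [
    "Alice maintains the billing service.",
    "Bob maintains the authentication module.",
]
client = chromadb.Client()
collection = client.create_collection(name="local_docs")
for i, doc in enumerate(documents):
    emb = ollama.embeddings(model="nomic-embed-text", prompt=doc)["embedding"]
    collection.add(ids=[str(i)], documents=[doc], embeddings=[emb])

# 2. At query time, embed the question, retrieve the closest chunk,
#    and hand it to the local LLM as context.
question = "Who maintains the authentication module?"
q_emb = ollama.embeddings(model="nomic-embed-text", prompt=question)["embedding"]
hits = collection.query(query_embeddings=[q_emb], n_results=1)
context = hits["documents"][0][0]
answer = ollama.generate(
    model="phi3",
    prompt=f"Answer using only this context:\n{context}\n\nQuestion: {question}",
)
print(answer["response"])
```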
Install it step by step (or check Auto Installation for a single command)
$ python3 -m venv myvenv
$ source myvenv/bin/activate
$ pip3 install -r requirements.txt
Install Ollama to run large language models locally.
$ curl -fsSL https://ollama.ai/install.sh | sh
Or follow the installation instructions for your operating system: Install Ollama
Choose and download an LLM model. For example:
$ ollama pull phi3
Alternatively, in a bash shell, run the following installation script:
$ bin/install.sh
(myvenv)$ python3 local-rag-gui.py
Then open the exposed link in your browser for the Graphical User Interface version.
Or run the following for the command-line version:
(myvenv)$ python3 local-rag-cli.py
If the LLM server is not running, start it in a separate terminal with:
$ ollama serve
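If you are unsure whether the server is already up, a quick check against Ollama's default address (localhost:11434) might look like this:

```python
# Check whether the Ollama server is reachable on its default port.
# 11434 is Ollama's default; adjust the URL if you changed OLLAMA_HOST.
import requests

try:
    r = requests.get("http://localhost:11434", timeout=2)
    print("Ollama server is up:", r.text.strip())
except requests.exceptions.ConnectionError:
    print("Ollama server is not running; start it with `ollama serve`.")
```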
- Top k: Ranks the output tokens in descending order of probability, selects the first k tokens to form a new distribution, and samples the output from it. Higher values produce more diverse answers; lower values produce more conservative answers. ([0, 10]. Default: 5)
- Top p: Works together with Top k, but instead of selecting a fixed number of tokens, it selects enough tokens to cover the given cumulative probability. A higher value produces more varied text; a lower value leads to more focused and conservative answers. ([0.1, 1]. Default: 0.9)
- Temp: Affects the “randomness” of the answers by scaling the probability distribution of the output tokens. Increasing the temperature makes the model answer more creatively. ([0.1, 1]. Default: 0.5)
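These settings correspond to Ollama's standard sampling options (top_k, top_p, temperature). As a rough sketch of how they could be passed through the ollama Python client (an assumption; the GUI and CLI scripts may wire them up differently):

```python
# Sketch: passing the sampling parameters to a local model via the ollama
# Python client. Option names follow Ollama's standard parameter names.
import ollama

response = ollama.generate(
    model="phi3",
    prompt="Summarize the retrieved context in one sentence.",
    options={
        "top_k": 5,          # keep only the 5 most probable tokens
        "top_p": 0.9,        # or enough tokens to cover 90% cumulative probability
        "temperature": 0.5,  # scale the distribution; higher = more random
    },
)
print(response["response"])
```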
Before committing, format the code with Black by running the following in the project folder:
$ black -t py311 -S -l 99 .
You can install Black with:
$ python3 -m pip install black