Advanced RAG system with dynamic query classification, supporting Llama-3.2 models through Ollama integration.
SPBRAG enhances RAG pipelines using a hybrid approach (see the sketch after the feature list below):
- BERT-based Query Classification - Determines whether a query requires retrieved context
- Llama-3.2 LLM - Generates context-aware responses
- Evaluation Framework - Measures precision/recall of retrieval components

Key features:

- Multi-Model Support 🤖 - `llama3.2:1B` (fast) and `llama3.2:3B` (high-quality) variants
- Automatic Model Handling ⚙️ - One-line model downloads via Ollama
- Secure Configuration 🔐 - Environment-based API key management
- Flexible Training 🏋️ - Custom BERT fine-tuning capabilities
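
The sketch below shows how these components are intended to fit together. It is a hypothetical illustration: the `retriever` and `llm` interfaces, the classifier label string, and the prompt format are assumptions, not the repo's actual API.

```python
# Hypothetical sketch of the SPBRAG control flow; interfaces and the
# label string are illustrative, not the repo's actual API.
from transformers import pipeline

# Fine-tuned BERT classifier decides whether a query needs context
classifier = pipeline(
    "text-classification",
    model="models/bert-text-classification-model",  # path used later in this README
)

def answer(query: str, retriever, llm) -> str:
    # Step 1: classify the query (label name is an assumption)
    needs_context = classifier(query)[0]["label"] == "requires_context"
    if needs_context:
        # Step 2: retrieve top-k chunks and prepend them to the prompt
        chunks = retriever.search(query, top_k=30)
        prompt = "Context:\n" + "\n".join(chunks) + f"\n\nQuestion: {query}"
    else:
        prompt = query
    # Step 3: generate with Llama-3.2 (via Ollama) or Mistral
    return llm.generate(prompt)
```

Skipping retrieval for queries that do not need it is the main latency win of the classification step.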
For detailed explanations of our evaluation metrics and interpretation guidelines, see: Metrics Documentation
Key tracked metrics include:
- F1 and Exact Match (EM) scores
- RAG retrieval evaluation metrics (precision/recall)
- BERT classification accuracy
- Latency benchmarks
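
For intuition, EM and token-level F1 can be computed SQuAD-style, as in the sketch below (illustrative; not necessarily the exact code behind the evaluator):

```python
# SQuAD-style Exact Match and token-level F1; illustrative, not
# necessarily the evaluator's exact implementation.
from collections import Counter

def exact_match(prediction: str, reference: str) -> int:
    return int(prediction.strip().lower() == reference.strip().lower())

def token_f1(prediction: str, reference: str) -> float:
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    common = Counter(pred_tokens) & Counter(ref_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```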
For a record of development progress and design notes, see: Workflow
"${SHELL}" <(curl -L micro.mamba.pm/install.sh)
curl -L micro.mamba.pm/install.sh | sh
Clone the repository and create the environment:

```bash
git clone https://github.com/Dnau15/SPBRAG.git
cd SPBRAG
micromamba create -n spbrag python=3.11 -y
micromamba activate spbrag
```
Make the setup script executable, then run it:

```bash
chmod +x ./setup.sh
./setup.sh
```
Install Ollama and pull a model:

```bash
curl -fsSL https://ollama.com/install.sh | sh
ollama pull llama3.2:1b   # Fast version (1B params)
ollama pull llama3.2:3b   # High-quality version (3B params)
```
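
Once a model is pulled, you can smoke-test it from Python with the `ollama` client package (`pip install ollama`); this is just a connectivity check, separate from the SPBRAG pipeline:

```python
# Quick smoke test of a pulled model via the ollama Python client.
import ollama

response = ollama.chat(
    model="llama3.2:1b",
    messages=[{"role": "user", "content": "In one sentence, what is RAG?"}],
)
print(response["message"]["content"])
```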
- Create a `.env` file:

```bash
touch .env
```
- Add your Hugging Face credentials to it:

```
HUGGINGFACE_API_KEY=your_hf_api_key_here
```
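
To confirm the key is visible to Python, a minimal check (assuming the project reads `.env` files with `python-dotenv`, which is a guess) looks like:

```python
# Minimal check that the key loads; assumes python-dotenv
# (pip install python-dotenv) is used to read .env files.
import os
from dotenv import load_dotenv

load_dotenv()  # reads .env from the current directory
print("HF key loaded:", bool(os.getenv("HUGGINGFACE_API_KEY")))
```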
#### Linux/macOS (bash/zsh)

```bash
export PYTHONPATH="$(pwd)/src:$PYTHONPATH"
```

#### Fish

```fish
set -x PYTHONPATH (pwd)/src $PYTHONPATH
```

#### Windows (PowerShell)

```powershell
$env:PYTHONPATH = "$(pwd)/src;$env:PYTHONPATH"
```
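
To verify the path is set correctly, the package under `src/` should now be importable (the package name `rag_system` comes from the script paths used below):

```python
# Verify that src/ is on PYTHONPATH by locating the rag_system package.
import importlib.util

spec = importlib.util.find_spec("rag_system")
print("rag_system found:", spec is not None)
```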
Create the output directory for the fine-tuned model and generate the training data:

```bash
mkdir -p models/bert-text-classification-model
python src/rag_system/data/data_creation.py
```
Fine-tune BERT (pass your own dataset path via `--file_path`, or omit it to use the default):

```bash
python src/rag_system/training/fine_tune_bert.py \
    --file_path=<your_path> \
    --num_samples_per_class=1500 \
    --num_epochs=5 \
    --learning_rate=2e-5
```
Expected data format:

| Column | Type | Description |
|---|---|---|
| `query` | text | User input |
| `requires_context` | bool | Context flag |
| `reference_text` | text | Ground truth |
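
As a concrete illustration of this schema, the snippet below builds a tiny dataset with hypothetical rows (the real data presumably comes from `data_creation.py`, and the output path is illustrative):

```python
# Hypothetical rows in the expected schema; the real dataset is
# generated by src/rag_system/data/data_creation.py.
import pandas as pd

df = pd.DataFrame([
    {"query": "What year was the treaty signed?",
     "requires_context": True,
     "reference_text": "The treaty was signed in 1648."},
    {"query": "Write a haiku about autumn.",
     "requires_context": False,
     "reference_text": ""},
])
df.to_csv("data/train.csv", index=False)  # illustrative path
```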
Run the evaluator (`--model_type` accepts `ollama` or `mistral`):

```bash
python src/rag_system/evaluation/evaluator.py \
    --collection_name=TestCollection5 \
    --model_type=ollama
```
Full evaluator configuration:

```bash
# Model Configuration
--bert_path="google-bert/bert-base-uncased"        # Path to BERT model
--tokenizer_path="google-bert/bert-base-uncased"   # Custom tokenizer
--embedding_model_path="sentence-transformers/all-mpnet-base-v2"  # Embedding model

# Vector Database Settings
--milvus_uri="./data/milvus_demo.db"   # Local Milvus instance path
--collection_name="rag_eval"           # Collection name for stored embeddings

# LLM Configuration
--llm_repo_id="mistralai/Mistral-7B-Instruct-v0.2"  # Alternative LLM
--llm_max_new_tokens=100               # Maximum response length
--model_type="mistral"                 # [ollama|mistral] LLM variant

# Retrieval Parameters
--top_k=30                             # Number of context chunks to retrieve
--context_len=1000                     # Context window size (in tokens)

# Evaluation Settings
--num_test_samples=5                   # Number of test cases to evaluate
--use_classifier=True                  # Enable/disable query classification
```