In this project, we use the LLaVA multimodal LLM together with retrieval-augmented generation (RAG) to build a Visual Question Answering (VQA) chatbot.
Results are available on Jira.
- Clone the project:
git clone ...
cd Receipts_VQA/
- Create and activate a conda environment:
conda create --name <ENV_NAME> python=3.11 -y
conda activate <ENV_NAME>
- Install torch (see pytorch.org for the install command that matches your platform).
- Install the dependencies listed in the requirements.txt file:
pip install -r requirements.txt
- Run the Streamlit server:
streamlit run app.py
- Access the application in your browser at http://localhost:8501.
- Start chatting with the assistant!
The app works as follows:
- The user uploads an image via the image upload field.
- The user enters a question about the uploaded image.
- The user's message and image are sent to the OCR engine and the LLaVA model for processing.
- The user's input, along with the chat history, is used to generate a response.
- The LLaVA model generates a response based on the patterns it learned during training.
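The flow above can be sketched in plain Python. Since the actual model wiring in app.py is not shown here, the OCR engine and LLaVA inference call are injected as plain functions (`ocr_fn`, `llava_fn`), and the helper names and prompt format below are illustrative assumptions, not the project's real API:

```python
def build_prompt(ocr_text, question, history):
    # Combine the OCR text extracted from the uploaded image,
    # the running chat history, and the new user question
    # into a single prompt for the model.
    turns = "\n".join(f"User: {q}\nAssistant: {a}" for q, a in history)
    return (
        f"Image text (OCR):\n{ocr_text}\n\n"
        f"Conversation so far:\n{turns}\n\n"
        f"User: {question}\nAssistant:"
    )

def answer(image_bytes, question, history, ocr_fn, llava_fn):
    # ocr_fn and llava_fn stand in for the real OCR engine and
    # LLaVA inference call; they are injected so the flow is testable.
    ocr_text = ocr_fn(image_bytes)
    prompt = build_prompt(ocr_text, question, history)
    reply = llava_fn(prompt, image_bytes)
    history.append((question, reply))  # keep history for the next turn
    return reply

if __name__ == "__main__":
    # Mocked components for illustration only.
    fake_ocr = lambda img: "TOTAL: $12.50"
    fake_llava = lambda prompt, img: "The total on the receipt is $12.50."
    hist = []
    print(answer(b"...", "What is the total?", hist, fake_ocr, fake_llava))
```

In the real app, `ocr_fn` would be the OCR engine's text-extraction call and `llava_fn` the LLaVA model's generate call; injecting them keeps the chat loop independent of any particular model backend.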