A simple approach that uses a large language model (LLM) with in-context learning to answer questions over text, tables, and images.
The system consists of four modules: a question classifier, an image retriever, a text retriever, and a question-answering module.
The question-answering module combines the output of the question classifier with the text and image content returned by the two retrievers, and prompts a large language model to answer the question via in-context learning.
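The snippet below is a minimal, self-contained sketch of how these modules fit together. The function names and the toy word-overlap retrieval are purely illustrative stand-ins for the fine-tuned DeBERTa classifier/retrievers and the LLM engine used in this repository.

# Illustrative sketch only: the real modules are fine-tuned DeBERTa models and an LLM API.
def classify_question(question: str) -> str:
    # Stand-in for the question classifier (predicts which modality the question needs).
    return "image" if any(w in question.lower() for w in ("photo", "picture", "poster")) else "text"

def retrieve(question: str, candidates: list[str], k: int = 2) -> list[str]:
    # Stand-in for the image/text retrievers: rank candidates by word overlap with the question.
    q_words = set(question.lower().split())
    return sorted(candidates, key=lambda c: len(q_words & set(c.lower().split())), reverse=True)[:k]

def answer(question: str, texts: list[str], image_captions: list[str], llm) -> str:
    # Classify, retrieve supporting content, then ask the LLM via an in-context prompt.
    modality = classify_question(question)
    evidence = retrieve(question, image_captions if modality == "image" else texts)
    prompt = "\n".join(evidence) + f"\nQuestion: {question}\nAnswer:"
    return llm(prompt)  # llm is any text-completion callable, e.g. a wrapper around an OpenAI engine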
Make sure to modify the dataset location in each module's dataset.py file.
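The location is typically a path constant near the top of the file; the variable name below is hypothetical, so check each dataset.py for the actual one.

# Hypothetical example; the actual variable name in each dataset.py may differ.
DATASET_DIR = "/path/to/MMQA"  # set this to where the dataset is stored locally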
Download the model weights of deberta-large from Hugging Face and put them in ptm/deberta-large.
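One way to fetch the checkpoint is with the huggingface_hub library; this assumes the microsoft/deberta-large checkpoint on the Hugging Face Hub is the intended one.

from huggingface_hub import snapshot_download

# Download the DeBERTa-large checkpoint into the directory the training scripts expect.
snapshot_download(repo_id="microsoft/deberta-large", local_dir="ptm/deberta-large")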
Train the question classifier:
PYTHONPATH=$PYTHONPATH:$(pwd) python ./classifier_module/train.py \
--batch-size 4 \
--lr 8e-7 \
--test \
--epoch 5
Train the image retriever:
CUDA_VISIBLE_DEVICES=0,1 PYTHONPATH=$PYTHONPATH:$(pwd) python ./retriever_module/train.py \
--n-gpu 2 \
--log-file train_image.log \
--image_or_text image \
--lr 5e-6 \
--test \
--epoch 20
Train the text retriever:
CUDA_VISIBLE_DEVICES=0,1 PYTHONPATH=$PYTHONPATH:$(pwd) python ./retriever_module/train.py \
--n-gpu 2 \
--test \
--lr 6e-6 \
--image_or_text text \
--epoch 5
Oracle setting (oracle classifier + oracle retriever):
PYTHONPATH=$PYTHONPATH:$(pwd) python ./run.py --dataset mmqa \
--dataset_split validation \
--prompt_file templates/prompt.json \
--n_parallel_prompts 1 \
--n_processes 1 \
--temperature 0.4 \
--engine "text-davinci-003" \
--max_api_total_tokens 4200 \
--oracle-classifier \
--oracle-retriever \
--retriever dpmlb
Non-oracle setting (using the trained classifier and retrievers):
PYTHONPATH=$PYTHONPATH:$(pwd) python ./run.py --dataset mmqa \
--dataset_split validation \
--prompt_file templates/prompt.json \
--n_parallel_prompts 1 \
--n_processes 1 \
--temperature 0.4 \
--engine "text-davinci-003" \
--max_api_total_tokens 4200 \
--retriever dpmlb