
PromptMTopic

Official implementation of the ACM Multimedia 2023 paper "PromptMTopic: Unsupervised Multimodal Topic Modeling of Memes using Large Language Models".

Setup

  1. Clone the repository and install the dependencies:
git clone https://github.com/Social-AI-Studio/PromptMTopic.git
cd PromptMTopic
pip install -r requirements.txt
  2. If running with OpenAI models, set up an OpenAI API key and initialize openai.api_key in api_utils.py (a sketch follows this list). If running with LLaMA, download the model weights from Hugging Face.
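
A minimal sketch of the OpenAI key setup, assuming the pre-1.0 openai Python package (which exposes the module-level openai.api_key attribute) and that the key is stored in an environment variable; adapt to however api_utils.py is actually structured:

# In api_utils.py (sketch): read the key from the environment instead of hard-coding it.
import os
import openai

openai.api_key = os.environ["OPENAI_API_KEY"]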

Usage

Prepare Input

Prepare the input folder. It contains one subfolder per dataset you run the model on. Each dataset subfolder contains the two JSON files below, each keyed by image name with the corresponding caption/text as the value.

  • captions.json
  • text.json

For each dataset, concatenate the caption and OCR text of every sample into a single line and save the result in a 'corpus' directory inside that dataset's input folder; this is required by the evaluation library. A sketch of this step follows.
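
A minimal sketch of this preprocessing step, assuming a layout of input/<dataset>/captions.json and input/<dataset>/text.json and a corpus file named corpus.txt (check the evaluation library for the exact file name it expects):

# build_corpus.py (sketch) -- folder layout and output file name are assumptions
import json
import os

dataset_dir = "input/TDefMemes"  # one subfolder per dataset

with open(os.path.join(dataset_dir, "captions.json")) as f:
    captions = json.load(f)      # {"image_name.png": "generated caption", ...}
with open(os.path.join(dataset_dir, "text.json")) as f:
    texts = json.load(f)         # {"image_name.png": "OCR text", ...}

os.makedirs(os.path.join(dataset_dir, "corpus"), exist_ok=True)
with open(os.path.join(dataset_dir, "corpus", "corpus.txt"), "w") as f:
    for name, caption in captions.items():
        # One line per meme: caption followed by its OCR text.
        line = " ".join([caption, texts.get(name, "")]).replace("\n", " ").strip()
        f.write(line + "\n")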

In the paper, we removed the overlaid text from the images and then used BLIP-2 for captioning.
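
A minimal captioning sketch using the Hugging Face transformers implementation of BLIP-2; the checkpoint name, file name, and text-removal step are assumptions, not necessarily the exact pipeline used in the paper:

# caption_image.py (sketch) -- assumes the Salesforce/blip2-opt-2.7b checkpoint
import torch
from PIL import Image
from transformers import Blip2Processor, Blip2ForConditionalGeneration

device = "cuda" if torch.cuda.is_available() else "cpu"
processor = Blip2Processor.from_pretrained("Salesforce/blip2-opt-2.7b")
model = Blip2ForConditionalGeneration.from_pretrained("Salesforce/blip2-opt-2.7b").to(device)

# The image is assumed to already have its overlaid text removed (e.g. by inpainting).
image = Image.open("meme_with_text_removed.png").convert("RGB")
inputs = processor(images=image, return_tensors="pt").to(device)
generated_ids = model.generate(**inputs, max_new_tokens=30)
caption = processor.batch_decode(generated_ids, skip_special_tokens=True)[0].strip()
print(caption)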

Change the input and output folder paths in config.yaml to your own paths.
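
For reference, a sketch of how such paths are typically consumed; the key names below are assumptions, so keep whatever keys config.yaml already defines:

# Sketch of reading config.yaml (key names are hypothetical).
import yaml

with open("config.yaml") as f:
    config = yaml.safe_load(f)

input_dir = config["input_path"]    # root folder holding the per-dataset subfolders
output_dir = config["output_path"]  # folder where generated topics and results are written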

Generating Topics and Evaluation

Replace the placeholders in < > with the corresponding arguments:

python3 promptMtopic.py -model <model> -dataset <dataset> -merging <wsm or pbm> -k_range <range of generated topics>

For example:

python3 promptMtopic.py -model chatGPT -dataset TDefMemes -merging pbm -k_range 10,20,30,40,50

Datasets

The topic models have been evaluated on the following datasets:

Dataset                         #Memes
Facebook Hateful Memes (FHM)    10,000
Total Defence Memes (TDefMeme)   2,513
Memotion                         6,992

Acknowledgements
