Code and experimental logs for Exploring Vision-Language Models for Imbalanced Learning.
The code is based on USB.
- Python 3.10
- PyTorch 2.0
- CUDA 11.8
To reproduce our results, you can recreate our exact conda environment with:
conda env create -f environment.yml
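To confirm that the created environment matches the versions listed above, a quick check such as the following can be run inside the activated environment (this snippet is not part of the repository; it only prints version information):

```python
# Quick environment sanity check (not part of the repository scripts).
import sys
import torch

print(f"Python : {sys.version.split()[0]}")     # expected: 3.10.x
print(f"PyTorch: {torch.__version__}")          # expected: 2.0.x
print(f"CUDA   : {torch.version.cuda}")         # expected: 11.8
print(f"GPU available: {torch.cuda.is_available()}")
```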
- Imagenet_LT: Download ILSVRC2012_img_train.tar & ILSVRC2012_img_val.tar from https://image-net.org/index and extract with https://github.com/pytorch/examples/blob/main/imagenet/extract_ILSVRC.sh.
- Places: Download places365standard_easyformat.tar from http://places2.csail.mit.edu/download.html and extract.
- iNaturalist: Download iNaturalist18_train_val2018.tar.gz from https://github.com/visipedia/inat_comp/tree/master/2018 and extract.
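After extraction, a rough smoke test is to count the class folders and image files under each split. The sketch below assumes a folder-per-class layout and uses placeholder paths (`/data/...`); the exact directory structure differs between the three datasets, so adapt it to your setup:

```python
# Rough smoke test for the extracted datasets (paths are placeholders;
# the real layout differs per dataset, so adapt as needed).
from pathlib import Path

roots = {
    "ImageNet-LT": "/data/imagenet/train",
    "Places": "/data/places365_standard/train",
    "iNaturalist18": "/data/iNaturalist18/train_val2018",
}

for name, root in roots.items():
    root = Path(root)
    if not root.is_dir():
        print(f"{name}: directory not found at {root}")
        continue
    class_dirs = [d for d in root.iterdir() if d.is_dir()]
    n_images = sum(1 for _ in root.rglob("*.jpg")) + sum(1 for _ in root.rglob("*.JPEG"))
    print(f"{name}: {len(class_dirs)} class folders, {n_images} images")
```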
Modify the dataset paths at L237 of scripts/config_generator_imb_clip.py, then generate the config files:
cd Imbalance-VLM && mkdir logs && mkdir config
python3 scripts/config_generator_imb_clip.py
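Before launching a run, you can inspect what the generator wrote, since the configs are plain YAML files. A minimal sketch (the exact keys follow the USB-style config format and may differ):

```python
# Inspect the generated YAML configs (sketch; exact keys follow the USB-style format).
import glob
import yaml

configs = sorted(glob.glob("./config/**/*.yaml", recursive=True))
print(f"Found {len(configs)} generated config files")

with open(configs[0]) as f:
    cfg = yaml.safe_load(f)

# Print the top-level entries to verify dataset paths and algorithm settings.
for key, value in sorted(cfg.items()):
    print(f"{key}: {value}")
```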
Then you can run experiments with commands like:
python3 train.py --c ./config/imb_clip_stage1_algs/supervised/imagenet_lt_softmax_None_None_0.yaml
You can also reproduce all results by running every command listed in all_commands.txt (generated by scripts/config_generator_imb_clip.py), for example with https://github.com/ExpectationMax/simple_gpu_scheduler.
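If you prefer not to install an extra tool, the sketch below dispatches the commands in all_commands.txt across a fixed set of GPUs, one process per GPU. It is only a rough stand-in for simple_gpu_scheduler, and the GPU ids are placeholders:

```python
# Minimal dispatcher for the commands in all_commands.txt (sketch only;
# simple_gpu_scheduler, linked above, is the more robust option).
import os
import queue
import subprocess
from concurrent.futures import ThreadPoolExecutor

GPU_IDS = [0, 1, 2, 3]  # adjust to the GPUs available on your machine

free_gpus = queue.Queue()
for gpu in GPU_IDS:
    free_gpus.put(gpu)

with open("all_commands.txt") as f:
    commands = [line.strip() for line in f if line.strip()]

def run_command(cmd):
    gpu = free_gpus.get()  # block until a GPU is free
    try:
        env = dict(os.environ, CUDA_VISIBLE_DEVICES=str(gpu))
        subprocess.run(cmd, shell=True, env=env, check=False)
    finally:
        free_gpus.put(gpu)  # release the GPU for the next command

with ThreadPoolExecutor(max_workers=len(GPU_IDS)) as pool:
    list(pool.map(run_command, commands))
```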
The training logs can be found on the Internet Archive. All our experiment data (including debug runs) were uploaded to wandb; please refer to our wandb projects: Imagenet_LT, iNaturalist, and Places.
@article{wang2023exploring,
title={Exploring Vision-Language Models for Imbalanced Learning},
author={Wang, Yidong and Yu, Zhuohao and Wang, Jindong and Heng, Qiang and Chen, Hao and Ye, Wei and Xie, Rui and Xie, Xing and Zhang, Shikun},
journal={arXiv preprint},
year={2023}
}