# BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
This is the PyTorch code of the BLIP paper [blog]. The code has been tested on PyTorch 1.10. To install the dependencies, run:

```
pip install -r requirements.txt
```
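After installing, a quick sanity check can confirm the environment roughly matches the tested setup. This snippet is illustrative only and not part of the repo:

```python
# Check that PyTorch is importable and the version matches the tested setup (1.10).
import torch

print(torch.__version__)          # ideally 1.10.x, the version the code was tested on
print(torch.cuda.is_available())  # True if a CUDA-enabled build and a GPU are present
```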
| Num. pre-train images | BLIP w/ ViT-B and CapFilt-L |
|---|---|
| 129M | Download |
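As an illustration of how a downloaded checkpoint might be used, here is a minimal captioning sketch. It assumes the repo's `models.blip.blip_decoder` factory and its `generate` interface; the checkpoint file `blip_base.pth` and the image `demo.jpg` are hypothetical placeholders for your own paths:

```python
import torch
from PIL import Image
from torchvision import transforms
from models.blip import blip_decoder  # model factory from this repo (assumed import path)

device = 'cuda' if torch.cuda.is_available() else 'cpu'
image_size = 384

# BLIP-style preprocessing: bicubic resize plus CLIP normalization statistics.
transform = transforms.Compose([
    transforms.Resize((image_size, image_size),
                      interpolation=transforms.InterpolationMode.BICUBIC),
    transforms.ToTensor(),
    transforms.Normalize((0.48145466, 0.4578275, 0.40821073),
                         (0.26862954, 0.26130258, 0.27577711)),
])
image = transform(Image.open('demo.jpg').convert('RGB')).unsqueeze(0).to(device)  # placeholder image

# 'blip_base.pth' stands in for a downloaded checkpoint such as the one linked above.
model = blip_decoder(pretrained='blip_base.pth', image_size=image_size, vit='base')
model.eval()
model = model.to(device)

# Beam-search decoding; parameter names follow the repo's generate() signature.
with torch.no_grad():
    caption = model.generate(image, sample=False, num_beams=3,
                             max_length=20, min_length=5)
print(caption[0])
```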
If you find this code useful for your research, please consider citing:
```
@inproceedings{li2022blip,
  title     = {BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation},
  author    = {Junnan Li and Dongxu Li and Caiming Xiong and Steven Hoi},
  booktitle = {ICML},
  year      = {2022},
}
```
The implementation of BLIP relies on resources from ALBEF, Huggingface Transformers, and timm. We thank the original authors for open-sourcing their work.