Stars
✨Multimodal
6 repositories
Code for ALBEF: a new vision-language pre-training method
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
An open source implementation of CLIP.
CLIP (Contrastive Language-Image Pretraining): predicts the most relevant text snippet for a given image
LAVIS - A One-stop Library for Language-Vision Intelligence
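
As a rough illustration of the CLIP entry above ("most relevant text snippet given an image"), the sketch below ranks candidate snippets by cosine similarity to an image embedding and turns the scores into probabilities with a softmax. It is a minimal sketch, not code from any of the listed repositories: the embeddings are hand-written toy vectors rather than model outputs, and the names `cosine`, `rank_snippets`, and the `temperature` value are assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_snippets(image_embedding, text_embeddings, temperature=0.01):
    """Return softmax probabilities over snippets (hypothetical helper).

    temperature is an assumed fixed value; a trained model would
    typically learn its own scaling.
    """
    logits = [cosine(image_embedding, t) / temperature for t in text_embeddings]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Toy embeddings (assumed values, not produced by a real encoder).
image_embedding = [0.9, 0.1, 0.0]
text_embeddings = {
    "a photo of a dog": [1.0, 0.0, 0.0],
    "a photo of a cat": [0.0, 1.0, 0.0],
    "a diagram": [0.0, 0.0, 1.0],
}

probs = rank_snippets(image_embedding, list(text_embeddings.values()))
best = max(zip(text_embeddings, probs), key=lambda pair: pair[1])[0]
print(best)
```

In a real setup the vectors would come from the image and text encoders of a pretrained model; only the similarity-and-softmax step is shown here.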