Code for paper: Tag2Text: Guiding Vision-Language Model via Image Tagging

jingzhengli/Tag2Text

Tag2Text: Guiding Vision-Language Model via Image Tagging

Official PyTorch implementation of the Tag2Text paper. We will fully open-source the code and data in the future.

Try out the Tag2Text web demo! It is hosted on Hugging Face Spaces 🤗 and built with Gradio.

Abstract

Tag2Text achieves superior image tag recognition by exploiting fine-grained text information. By leveraging tagging guidance, Tag2Text enhances the performance of vision-language models on both generation-based and alignment-based tasks.

Languages

  • Python 100.0%