Introduction

This is Regularizing Visual Semantic Embedding with Contrastive Learning for Image-Text Matching, source code of ConVSE. This paper accepted by IEEE SPL. It is built on the top of the VSE$\infty$ in PyTorch.

Requirements and Installation

We recommended the following dependencies.

Python3.6+
Pytorch 1.9.0+

Download data

Download the dataset files. We use the image feature created by SCAN, download here[https://github.com/kuanghuei/SCAN].

Training new models

Run train.py:

python train.py --data_path "$DATA_PATH" --data_name "$DATA_NAME" --vocab_paath "$VOCAB_PATH" --model_name "runs/convse/model/" --use_contrastive

Evaluate trained models

from vocab import Vocabulary
import evalution
evalution.evalrank("$PATH/model_best.pth.tar", data_path="$DATA_PATH", split="test")

Reference

If you found this code useful, please cite the following paper:

@article{liu2022regularizing,
  title={Regularizing Visual Semantic Embedding with Contrastive Learning for Image-Text Matching},
  author={Liu, Yang and Liu, Hong and Wang, Huaqiu and Liu, Mengyuan},
  journal={IEEE Signal Processing Letters},
  year={2022},
  publisher={IEEE}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
vocab		vocab
.gitignore		.gitignore
data.py		data.py
evaluation.py		evaluation.py
model.py		model.py
readme.md		readme.md
test.py		test.py
train.py		train.py
utils.py		utils.py
vocab.py		vocab.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

Requirements and Installation

Download data

Training new models

Evaluate trained models

Reference

About

Releases

Packages

Languages

liuyyy111/ConVSE

Folders and files

Latest commit

History

Repository files navigation

Introduction

Requirements and Installation

Download data

Training new models

Evaluate trained models

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages