The official code for the paper：Sep-NMS: Unlocking the Aptitude of Two-Stage Referring Expression Comprehension

We follow the settings of ：Official codebase for AAAI 2021 paper "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding". Please ensure that the parameter settings in the code adhere to the experimental details provided in the Sep-NMS paper.

Prerequisites

The following dependencies should be enough. See environment.yml for complete environment settings.

python 3.7.6
pytorch 1.1.0
torchvision 0.3.0
tensorboard 2.1.0
spacy 2.2.3

Data Preparation

Follow instructions in data/README.md to setup data directory.

Initial Filtering

This part is identical to the content in Ref-NMS. So we follow the code of Ref-NMS: ChopinSharp/ref-nms: Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding" Ref-NMS

CLIP† relatedness

1、 CLIP† denotes a variant of the CLIP model that has been fine-tuned on the MS COCO dataset. We utilze the ITM module in GR-GAN paper. The original code is following: BoO-18/GR-GAN: GRADUAL REFINEMENT TEXT-TO-IMAGE GENERATION (github.com) BoO-18/GR-GAN

2、 The following code corresponds to the CLIP† relatedness mentioned in our paper. CLIP† relatedness aims to filter referent and context proposals, the output is the simliarity score.

tools/ybclip_ann_sent.py

Ctx-relatedness && Ref-relatedness

The Ctx-Relatedness module is identical to the Ref-NMS model. The original code for this component can be found in the following directory of the Ref-NMS codebase.

/lib/predictor.py"

In our codebase, the architectures for both the Ctx-Relatedness and Ref-Relatedness models can be found in the following directory:

“lib/my_sep_qkad_predictor.py”

which incoporates the Ctx-relatedness and Ref-relatedness module

Train：

Train Ctx-relatedness && Ref-relatedness with binary XE loss:

tools/my_qkad_train2.py

other codings：

The tools and lib directories contain various experiments and attempts of our method under different settings and parameter configurations.

For the test part， you can follow the RefNMS： Ref-NMS

Pretrained Models

We provide pre-trained model weights as long as the corresponding MAttNet-style detection file (note the MattNet-style detection files can be directly used to evaluate downstream REG task performance). With these files, one can easily reproduce our reported results.

[Google Drive] [Baidu Disk] (extraction code: 5a9r)

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
CLIP_main		CLIP_main
cache		cache
data		data
lib		lib
output		output
scripts		scripts
tb		tb
tools		tools
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The official code for the paper：Sep-NMS: Unlocking the Aptitude of Two-Stage Referring Expression Comprehension

Prerequisites

Data Preparation

Initial Filtering

CLIP† relatedness

Ctx-relatedness && Ref-relatedness

Train：

other codings：

Pretrained Models

About

Releases 2

Packages

Languages

License

1240446371/Sep-NMS-REC

Folders and files

Latest commit

History

Repository files navigation

The official code for the paper：Sep-NMS: Unlocking the Aptitude of Two-Stage Referring Expression Comprehension

Prerequisites

Data Preparation

Initial Filtering

CLIP† relatedness

Ctx-relatedness && Ref-relatedness

Train：

other codings：

Pretrained Models

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages