
M²: Meshed-Memory Transformer Fork

Fork Contributions

  • Fixed the `/` vs `//` error in the original code, following this discussion
  • Fixed the `UserWarning: masked_fill_ received a mask with dtype torch.uint8 ... please use a mask with dtype torch.bool instead` warning in the attention mechanism
  • Improved the speed of the `Dataset` class
  • Added HPC scripts and setup.qsub
  • Added loss/eval training and validation plot functionality (runs automatically)
  • Added a potential fix to the `<eos>` bug in SCST
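As a minimal illustration of the first fix (the function and data here are hypothetical, not the repository's actual code): in Python 3, `/` always returns a float, even for integer operands, which breaks tensor and list indexing, while `//` performs floor division and keeps the index an integer. The mask warning is resolved analogously, by casting the `uint8` mask to `bool` (e.g. with `mask.bool()`) before calling `masked_fill_`.

```python
def middle_token(tokens):
    # Buggy version: tokens[len(tokens) / 2] raises TypeError in Python 3,
    # because / returns a float. Floor division // keeps the index an int.
    return tokens[len(tokens) // 2]

print(middle_token([0, 1, 2, 3, 4]))  # -> 2
```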

$\mathcal{M}^2$: Meshed-Memory Transformer

This repository contains a fork of the reference code for the paper Meshed-Memory Transformer for Image Captioning (CVPR 2020).

Please cite using the original work's BibTeX:

@inproceedings{cornia2020m2,
  title={{Meshed-Memory Transformer for Image Captioning}},
  author={Cornia, Marcella and Stefanini, Matteo and Baraldi, Lorenzo and Cucchiara, Rita},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2020}
}

Meshed-Memory Transformer

Setup

SPICE Evaluations

Run the following:

cd evaluations
bash get_stanford_models.sh

See this post for more information.

Environment Setup

See setup.qsub. On QMUL's Apocrita/Andrena HPC system, environment setup can be automated with the following steps:

  1. Check the directories are as expected
  2. Run qsub setup.qsub

Training procedure

See train.py for the complete list of arguments. An HPC job script is provided in hpc/train.qsub. Ensure the script is amended to account for your username and directory structure, i.e. don't use $USER in the header information. Submit the job with qsub train.qsub from within the hpc directory.
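The advice about `$USER` can be made concrete with a sketch of the kind of header a qsub script uses (directive values, paths, and module/environment names here are illustrative assumptions, not the repository's actual settings). Grid Engine's `#$` directives are comments parsed by `qsub` before any shell runs, so environment variables such as `$USER` are not expanded inside them and paths must be written out literally.

```shell
#!/bin/bash
#$ -cwd                        # run from the submission directory
#$ -pe smp 8                   # illustrative core count
#$ -l h_rt=24:0:0              # illustrative wall-clock limit
#$ -l gpu=1                    # request one GPU
#$ -o /data/home/abc123/m2/logs/train.log   # literal path: $USER is NOT
                                            # expanded in #$ directives

module load anaconda3          # illustrative module name
conda activate m2              # illustrative environment name
python train.py                # see train.py for the argument list
```

With a literal path in the `-o` directive, `qsub train.qsub` writes logs where you expect; with `$USER` in the header, the directive is taken verbatim and the job typically fails to write its output.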

Results

Sample Results

