This is a reimplementation of the basic image captioning architecture (CNN-RNN).
CNN: ResNet18; RNN: LSTM; dataset: MSCOCO; toolkit: PyTorch
Image captioning is a set of techniques that help computers understand a given picture and describe it in natural language.
- Extract features from the input images with a convolutional neural network (a pretrained ResNet18 in this work).
- Input: batch of images of shape (N, C, H, W)
- Output: batch of features of shape (N, D)

N: batch size, C: image channels (RGB), H: image height, W: image width, D: feature dimension (512)
- Encode each sentence into a vector with a dictionary, inserting <start>, <end>, and <pad> tokens.
- Input: batch of strings of shape (N, *)
- Output: batch of vectors of shape (N, L)

N: batch size, *: length of the sentence, L: fixed length of the vector
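A minimal sketch of this encoding step is shown below. The `build_vocab`/`encode` helpers and the tiny two-sentence vocabulary are hypothetical stand-ins for illustration; the real dictionary would be built from the MSCOCO captions.

```python
# Special tokens; <unk> is an extra assumption for out-of-vocabulary words.
PAD, START, END, UNK = "<pad>", "<start>", "<end>", "<unk>"

def build_vocab(sentences):
    """Map each word (plus the special tokens) to an integer id."""
    vocab = {PAD: 0, START: 1, END: 2, UNK: 3}
    for s in sentences:
        for w in s.lower().split():
            vocab.setdefault(w, len(vocab))
    return vocab

def encode(sentence, vocab, max_len):
    """Turn one variable-length string into a fixed-length id vector (length L)."""
    ids = [vocab[START]]
    ids += [vocab.get(w, vocab[UNK]) for w in sentence.lower().split()]
    ids.append(vocab[END])
    ids = ids[:max_len]
    ids += [vocab[PAD]] * (max_len - len(ids))  # pad up to L
    return ids

vocab = build_vocab(["a dog runs", "a cat sleeps"])
print(encode("a dog sleeps", vocab, max_len=6))  # [1, 4, 5, 8, 2, 0]
```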
- Use a long short-term memory (LSTM) model as the RNN to realize the generation part.
- Input: batch of encoded captions of shape (N, L, C)
- Initial hidden state: extracted features of shape (N, D)
- Output: (N, L, C)

C: dictionary size
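The decoder step above can be sketched as follows. The class name, embedding size, and hidden size are assumptions for illustration, and the sketch feeds integer word ids through an `nn.Embedding` layer rather than one-hot (N, L, C) vectors, which is the usual equivalent formulation; the image features initialize the LSTM state, and the output scores over the dictionary have shape (N, L, C).

```python
import torch
import torch.nn as nn

class DecoderRNN(nn.Module):
    """LSTM decoder: image features set the initial hidden state;
    encoded captions are unrolled into per-step word scores (N, L, C)."""
    def __init__(self, vocab_size, feature_dim=512, embed_dim=256, hidden_dim=512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.init_h = nn.Linear(feature_dim, hidden_dim)  # features -> h0
        self.init_c = nn.Linear(feature_dim, hidden_dim)  # features -> c0
        self.fc = nn.Linear(hidden_dim, vocab_size)       # hidden -> word scores

    def forward(self, features, captions):
        # features: (N, D); captions: (N, L) integer word ids
        h0 = self.init_h(features).unsqueeze(0)  # (1, N, hidden_dim)
        c0 = self.init_c(features).unsqueeze(0)  # (1, N, hidden_dim)
        x = self.embed(captions)                 # (N, L, embed_dim)
        out, _ = self.lstm(x, (h0, c0))          # (N, L, hidden_dim)
        return self.fc(out)                      # (N, L, C), C = vocab_size

decoder = DecoderRNN(vocab_size=1000)
scores = decoder(torch.randn(2, 512), torch.randint(0, 1000, (2, 15)))
print(scores.shape)  # torch.Size([2, 15, 1000])
```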
The experiment metrics are as follows: