ImageCaptioning

This is a project implemented for the course EEE 443 - Neural Networks at Bilkent University.

In this project, our aim is to develop a model that produces a caption for a given natural image. The problem is particularly interesting because it combines two deep learning architectures covered in the course: Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs). For the CNN part, we use ResNet-152 [1] to encode the input image into a feature vector. For the RNN part, LSTM cells decode those features into a caption. Pre-trained GloVe [2] word embeddings are used in the embedding layer of the decoder. Cross-entropy loss is used to evaluate the performance of the network. The goal is a generative model whose captions for unseen images are as close as possible to those a human would write.
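
To make the loss concrete, here is a minimal sketch of cross entropy for one caption, written in plain Python. It is not the code from this repository: the function names, the four-word vocabulary, and the logits are hypothetical and chosen only for illustration. It assumes the decoder emits one vector of vocabulary scores per time step and that the loss is averaged over the reference words.

```python
import math


def softmax(logits):
    """Turn raw scores into a probability distribution (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def caption_cross_entropy(step_logits, target_ids):
    """Mean negative log-likelihood of the reference word at each time step."""
    if len(step_logits) != len(target_ids):
        raise ValueError("need exactly one target word per decoding step")
    losses = []
    for logits, target in zip(step_logits, target_ids):
        probs = softmax(logits)
        losses.append(-math.log(probs[target]))
    return sum(losses) / len(losses)


# Hypothetical vocabulary: 0 = <start>, 1 = "a", 2 = "dog", 3 = <end>
step_logits = [
    [0.1, 2.0, 0.3, 0.0],  # decoder scores at step 1 (favours "a")
    [0.0, 0.2, 1.5, 0.1],  # decoder scores at step 2 (favours "dog")
    [0.0, 0.0, 0.0, 0.0],  # decoder scores at step 3 (undecided)
]
target_ids = [1, 2, 3]     # reference caption: "a dog <end>"

loss = caption_cross_entropy(step_logits, target_ids)
print(f"mean cross entropy: {loss:.4f}")
```

A lower value means the decoder assigns higher probability to the reference words. A step where the decoder is undecided among all four words contributes log(4) to the sum.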

Sample Results

Here are some of the captions generated for sample images:

REFERENCES

[1] He, Kaiming, et al. “Deep Residual Learning for Image Recognition.” 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, doi:10.1109/cvpr.2016.90.

[2] Pennington, Jeffrey, et al. “GloVe: Global Vectors for Word Representation.” Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014, doi:10.3115/v1/d14-1162.
