Image captioning is the task of automatically generating textual descriptions for images. It combines computer vision and natural language processing to understand the content of an image and produce a coherent, relevant description.
Image captioning requires two deep learning models combined into a single network for training.
A CNN extracts features from the image as a fixed-size vector, also known as the image embedding. The size of this embedding depends on the pretrained network used for feature extraction. A minimal encoder sketch is shown below.
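As a rough sketch (not necessarily the notebook's exact architecture), a pretrained ResNet-50 can serve as the encoder; the `embed_size` of 256 is an illustrative choice, and the projection layer maps ResNet's 2048-dim features down to it:

```python
import torch
import torch.nn as nn
import torchvision.models as models

class EncoderCNN(nn.Module):
    """Extracts a fixed-size feature vector (image embedding) from an image."""
    def __init__(self, embed_size=256):
        super().__init__()
        # Pretrained ResNet-50; its final fc layer expects 2048 input features.
        resnet = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
        # Drop the classification head, keep the convolutional backbone.
        self.backbone = nn.Sequential(*list(resnet.children())[:-1])
        for p in self.backbone.parameters():
            p.requires_grad = False  # freeze the pretrained weights
        # Project the 2048-dim ResNet features to the caption embedding size.
        self.fc = nn.Linear(resnet.fc.in_features, embed_size)

    def forward(self, images):          # images: (batch, 3, 224, 224)
        feats = self.backbone(images)   # (batch, 2048, 1, 1)
        feats = feats.flatten(1)        # (batch, 2048)
        return self.fc(feats)           # (batch, embed_size)
```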
An LSTM handles the text generation. The image embedding is concatenated with the caption's word embeddings and passed through the LSTM, which learns to predict the next word at each step, as in the sketch below.
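A minimal decoder sketch under the same assumptions (illustrative `hidden_size` and `vocab_size`; the image embedding is concatenated along the sequence dimension as the first input step):

```python
import torch
import torch.nn as nn

class DecoderLSTM(nn.Module):
    """Generates a caption word by word, conditioned on the image embedding."""
    def __init__(self, embed_size=256, hidden_size=512, vocab_size=5000):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_size)
        self.lstm = nn.LSTM(embed_size, hidden_size, batch_first=True)
        self.fc = nn.Linear(hidden_size, vocab_size)

    def forward(self, image_feats, captions):
        # captions: (batch, seq_len) token ids; drop the last token so the
        # model predicts each word from the ones before it.
        word_embeds = self.embed(captions[:, :-1])  # (batch, seq_len-1, embed)
        # Prepend the image embedding as the first "word" of the sequence.
        inputs = torch.cat([image_feats.unsqueeze(1), word_embeds], dim=1)
        hidden, _ = self.lstm(inputs)               # (batch, seq_len, hidden)
        return self.fc(hidden)                      # logits over the vocabulary
```

At inference time the decoder would instead run one step at a time, feeding each predicted word back in (e.g., greedy decoding or beam search) until an end-of-sequence token is produced.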
Flickr8k is a benchmark collection for sentence-based image description and search, consisting of 8,000 images, each paired with five different captions that provide clear descriptions of the entities and events.
The images don't contain any well-known people or locations, but were manually selected to depict a variety of scenes and situations.
Kaggle: Flickr 8k Dataset
Hugging Face: Flickr 8k Dataset
Link to Kaggle notebook: Image-Caption-Generator_CNN-LSTM (PyTorch)