Twitter Captioning using Show and Tell

This project uses a variant of the model presented in [1] (Vinyals et al. 2014, http://arxiv.org/abs/1411.4555) to generate tweets that accompany pictures from Twitter.

Model

As detailed in [1], the model extracts convolutional features from the image and uses them as the first input to an LSTM. Each word of the accompanying tweet is then fed in as the next input in the sequence. Both the convolutional features and the one-hot word vectors are projected to the size of the LSTM's input vector.
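The input sequence described above can be sketched in plain NumPy. This is an illustration only, not the code in models.py: the function name `build_lstm_inputs`, the weight matrices `W_img` and `W_word`, and all dimensions are assumptions chosen for the example.

```python
import numpy as np

def build_lstm_inputs(image_features, word_ids, W_img, W_word):
    """Return a (1 + T, d) sequence: the projected image, then T projected words."""
    # Project the convolutional feature vector to the LSTM input size d.
    img_step = image_features @ W_img          # shape (d,)
    # A one-hot vector times W_word selects one row, so index the rows directly.
    word_steps = W_word[word_ids]              # shape (T, d)
    return np.vstack([img_step[None, :], word_steps])

# Illustrative sizes only (hypothetical, not taken from the repository).
rng = np.random.default_rng(0)
feat_dim, vocab_size, d = 4096, 1000, 256
W_img = rng.normal(scale=0.01, size=(feat_dim, d))
W_word = rng.normal(scale=0.01, size=(vocab_size, d))

features = rng.normal(size=feat_dim)   # stand-in for extracted image features
tweet = np.array([5, 42, 7])           # stand-in for word indices of a tweet

inputs = build_lstm_inputs(features, tweet, W_img, W_word)
print(inputs.shape)  # (4, 256): one image step followed by three word steps
```

Indexing rows of `W_word` gives the same result as multiplying by explicit one-hot vectors while avoiding the large sparse matrix.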

This model uses a Lasagne implementation of VGG-net [2] (Simonyan et al. 2014) for feature extraction. The weights can be downloaded with:

wget https://dl.dropboxusercontent.com/u/63070080/vgg19_normalized.pkl
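Once downloaded, the weights file can be read with `pickle`. The following is a minimal sketch, assuming the file is a Python 2 pickle holding a dictionary whose parameter list sits under the key `"param values"`; that key and the helper name `load_param_values` are assumptions, so inspect the loaded object if the lookup fails.

```python
import pickle

def load_param_values(path, key="param values"):
    """Load a pickled weights file and return its list of parameter arrays."""
    # encoding="latin1" lets Python 3 read NumPy arrays pickled under Python 2.
    with open(path, "rb") as f:
        data = pickle.load(f, encoding="latin1")
    # Assumed layout: a dict with the parameters under `key`.
    # If the file holds the parameters directly, return them unchanged.
    if isinstance(data, dict):
        return data[key]
    return data

# Example (requires the downloaded file):
# values = load_param_values("vgg19_normalized.pkl")
```

The returned list could then be passed to `lasagne.layers.set_all_param_values` along with the output layer of a matching network definition.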

For data gathering, you also need a Twitter developer account with an API key and access token, which are saved in private_config.py. However, any set of images and text can be used.

Results

Some of the captions almost make sense:

from details st for january snow colors

this boy is is my epic favorite shirt show but

The current training set contains only around 20,000 images and captions. I hope to train on more data and improve the results soon!

Dependencies

lasagne
theano
fuel
hdf5
tqdm
