Skip to content

Latest commit

 

History

History
57 lines (42 loc) · 2.82 KB

README.md

File metadata and controls

57 lines (42 loc) · 2.82 KB

Variational Autoencoder (VAE) for Text Generation

This example builds a VAE for text generation, with an LSTM as encoder and an LSTM or Transformer as decoder. Training is performed on the official PTB data and Yahoo data, respectively.

The VAE with LSTM decoder is first decribed in (Bowman et al., 2015) Generating Sentences from a Continuous Space

The Yahoo dataset is from (Yang et al., 2017) Improved Variational Autoencoders for Text Modeling using Dilated Convolutions, which is created by sampling 100k documents from the original Yahoo Answer data. The average document length is 78 and the vocab size is 200k.

Data

The datasets can be downloaded by running:

python prepare_data.py --data ptb
python prepare_data.py --data yahoo

Training

Train with the following command:

python vae_train.py --config config_trans_ptb

Here:

Generation

Generating sentences with pre-trained model can be performed with the following command:

python vae_train.py --config config_file --mode predict --model /path/to/model.ckpt --out /path/to/output

Here --model specifies the saved model checkpoint, which is saved in ./models/dataset_name/ at training time. For example, the model path is ./models/ptb/ptb_lstmDecoder.ckpt when generating with a LSTM decoder trained on PTB dataset. Generated sentences will be written to standard output if --out is not specifcied.

Results

Language Modeling

Dataset Metrics VAE-LSTM VAE-Transformer
Yahoo Test PPL
Test NLL
68.11
337.13
59.95
326.93
PTB Test PPL
Test NLL
104.61
101.92
103.68
101.72

Generated Examples

We show the generated examples with transformer as decoder trained on PTB training data.

Examples
i 'm always looking at a level of $ N to $ N billion <EOS>
after four years ago president bush has federal regulators decided to file financing for the waiver<EOS>
the savings & loan association said total asset revenue was about $ N billion compared with $ N billion <EOS>
the trend would seem to be effective <EOS>
chicago city 's computer bank of britain posted a N N jump in third-quarter net income <EOS>