This example builds a Memory Network language model and trains it on PTB data. The model and training procedure are described in [End-To-End Memory Networks](https://arxiv.org/abs/1503.08895) (Sukhbaatar et al.). Model details are implemented in `texar.modules.memnet`.
Though this example is for language modeling, it is easy to adapt to other tasks such as question answering, as described in the above paper.
The standard Penn Treebank (PTB) dataset is used. If the data does not exist under `data_path`, the program will download it automatically.
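For reference, here is a minimal sketch of what this download step amounts to, assuming the standard `simple-examples` PTB distribution. The script performs the equivalent internally; the helper name below is hypothetical.

```python
import os
import tarfile
import urllib.request

# Standard source of the raw PTB data (Mikolov's "simple-examples" archive).
PTB_URL = "http://www.fit.vutbr.cz/~imikolov/rnnlm/simple-examples.tgz"

def maybe_download_ptb(data_path="./"):
    """Download and extract the PTB archive if it is not already present."""
    archive = os.path.join(data_path, "simple-examples.tgz")
    if not os.path.exists(archive):
        urllib.request.urlretrieve(PTB_URL, archive)
    with tarfile.open(archive) as tar:
        tar.extractall(data_path)
    # The raw files, e.g. ptb.train.txt, live under simple-examples/data/
    return os.path.join(data_path, "simple-examples", "data")
```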
The following command trains the model:

```
python3 lm_ptb_memnet.py --config config --data_path ./
```
Here:

- `--config` specifies the config file to use, e.g., the above command uses the configuration defined in `config.py`.
- `--data_path` specifies the directory containing the PTB raw data (e.g., `ptb.train.txt`). If the data files do not exist, the program will automatically download, extract, and pre-process the data.
- `--lr` specifies the initial learning rate. If not specified, the program uses the learning rate in the config file (see the example below).
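For example, to override the learning rate from the command line (the value `0.1` here is purely illustrative):

```
python3 lm_ptb_memnet.py --config config --data_path ./ --lr 0.1
```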
The model then begins training, evaluates on the validation data periodically, and evaluates on the test data after training is done. Checkpoints are saved every 5 epochs.
`config.py` is the largest and best-performing configuration, described in the last line of Table 2 in (Sukhbaatar et al.) End-To-End Memory Networks. It sets the number of hops to 7, the hidden dimension to 150, and the memory size to 200. This model has 4,582,500 parameters in total.
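A sketch of the hyperparameters this describes; the variable names below are illustrative, not the verbatim contents of the shipped `config.py`:

```python
# Hyperparameters of the largest model (last line of Table 2 in the paper).
# Names are illustrative; consult config.py for the actual structure.
n_hops = 7          # number of memory hops
dim = 150           # hidden / embedding dimension
memory_size = 200   # number of memory slots (context window size)
```

The parameter count can be reproduced assuming the standard PTB vocabulary of 10,000 words and the layer-wise (RNN-like) weight tying the paper uses for language modeling: the input embedding A, the output embedding C, and the softmax matrix W each hold 10,000 × 150 = 1,500,000 parameters; the two temporal encodings T_A and T_C hold 200 × 150 = 30,000 each; and the linear map H between hops holds 150 × 150 = 22,500. In total, 3 × 1,500,000 + 2 × 30,000 + 22,500 = 4,582,500.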
The perplexity of the config is:

| config | epochs | train | valid  | test   |
|--------|--------|-------|--------|--------|
| config | 51     | 50.70 | 120.97 | 113.06 |
This result of `config.py` is slightly inferior to the one presented in the paper, since the paper reports the best result among 10 runs.