Language Modeling with Gated Convolutional Networks

This is a TensorFlow implementation of the Facebook AI Research paper "Language Modeling with Gated Convolutional Networks". The paper applies a convolutional approach to language modelling, using a novel gated CNN (Gated-CNN) model.

Architecture
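
The gating mechanism at the core of the paper can be sketched in a few lines. The paper defines a layer's output as (X * W + b) multiplied elementwise by sigmoid(X * V + c), where * is a convolution over the sequence. The NumPy sketch below illustrates that formula only; it is not the code in model.py, and the function names, shapes, and left zero-padding used to keep the convolution causal are assumptions made for this example.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def causal_conv1d(x, kernel, bias):
    """x: (seq_len, in_dim); kernel: (width, in_dim, out_dim); bias: (out_dim,)."""
    width = kernel.shape[0]
    # Left-pad with width-1 zero rows so position t only sees inputs at or before t.
    padded = np.concatenate([np.zeros((width - 1, x.shape[1])), x], axis=0)
    out = np.empty((x.shape[0], kernel.shape[2]))
    for t in range(x.shape[0]):
        window = padded[t:t + width]  # (width, in_dim)
        out[t] = np.tensordot(window, kernel, axes=([0, 1], [0, 1])) + bias
    return out

def gated_conv_layer(x, w, b, v, c):
    """Gated linear unit: (x * w + b) elementwise-times sigmoid(x * v + c)."""
    linear = causal_conv1d(x, w, b)
    gate = sigmoid(causal_conv1d(x, v, c))
    return linear * gate

# Hypothetical toy sizes, chosen only for illustration.
rng = np.random.default_rng(0)
seq_len, in_dim, out_dim, width = 6, 4, 3, 2
x = rng.normal(size=(seq_len, in_dim))
w = rng.normal(size=(width, in_dim, out_dim))
v = rng.normal(size=(width, in_dim, out_dim))
b = np.zeros(out_dim)
c = np.zeros(out_dim)

h = gated_conv_layer(x, w, b, v, c)
print(h.shape)
```

The sigmoid branch acts as a per-unit gate on the linear branch, and the left padding ensures that the output at a position never depends on later tokens, which a language model needs.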

Requirements

Download and extract the Google 1 Billion Word dataset into the data folder.
TensorFlow 0.12.1

Usage

To train the model with the default hyperparameters, then monitor the run in TensorBoard:

$ python main.py
$ tensorboard --logdir=logs --host=0.0.0.0

Check main.py for tunable hyperparameter flags.

TODO

Replace NCE loss with Adaptive Softmax.
Remove the restriction to training on fixed-size sentences (20, for now) and extend the model to handle all sentence lengths.
Implement Weight Normalisation for faster convergence.
Train extensively on deeper models to match the results with the paper.
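
For the weight normalisation item above, the reparameterisation itself is small: a weight matrix is expressed as a direction v rescaled to a learned magnitude g, so w = g * v / ||v||. The NumPy sketch below is only an illustration of that formula, not code from this repository; the function name and the choice to normalise per output column are assumptions.

```python
import numpy as np

def weight_norm(v, g):
    """Return w = g * v / ||v||, with the norm taken per output column.

    v: (in_dim, out_dim) unnormalised direction parameters.
    g: (out_dim,) learned scale, one per output unit.
    """
    norms = np.linalg.norm(v, axis=0, keepdims=True)  # (1, out_dim)
    return g * v / norms

# Hypothetical toy values, chosen only for illustration.
rng = np.random.default_rng(0)
v = rng.normal(size=(5, 3))
g = np.array([1.0, 2.0, 0.5])

w = weight_norm(v, g)
# Each column of w has an L2 norm equal to the matching entry of g.
print(np.linalg.norm(w, axis=0))
```

In a TensorFlow model, v and g would be the trainable variables and w would be recomputed from them on each forward pass.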
