Code for "Deep Unsupervised Clustering with Gaussian Mixture Variational Autoencoders"

This branch is for version 1 of the paper. For updated version checkout branch version2 and svhn

See an extension of this work https://github.com/51616/split-vae

Code for "Deep Unsupervised Clustering with Gaussian Mixture Variational Autoencoders"

(https://arxiv.org/abs/1611.02648) (version 1) By Nat Dilokthanakul, Pedro A.M. Mediano, Marta Garnelo, Matthew C.H. Lee, Hugh Salimbeni, Kai Arulkumaran, Murray Shanahan

Abstract

We study a variant of the variational autoencoder model with a Gaussian mixture as a prior distribution, with the goal of performing unsupervised clustering through deep generative models. We observe that the standard variational approach in these models is unsuited for unsupervised clustering, and mitigate this problem by leveraging a principled information-theoretic regularisation term known as consistency violation. Adding this term to the standard variational optimisation objective yields networks with both meaningful internal representations and well-defined clusters. We demonstrate the performance of this scheme on synthetic data, MNIST and SVHN, showing that the obtained clusters are distinct, interpretable and result in achieving higher performance on unsupervised clustering classification than previous approaches.

Requirements

Luarocks packages:

mnist ( torch-rocks install https://raw.github.com/andresy/mnist/master/rocks/mnist-scm-1.rockspec )

Python packages:

torchfile (pip install torchfile)

Instructions

To run a spiral experiment,

./run.sh spiral

To visualise spiral experiment (can be used while training)

cd plot
python plot_latent.py
python plot_recon.py

To turn off CV regularisation term (set alpha)

set flag

./run.sh spiral -cvWeight 0.0

To turn off z-prior term (set eta)

./run.sh spiral -zPriorWeight 0.0

To run MNIST

./run.sh mnist

To use GPU

set flag

./run.sh mnist -gpu 1

To run quick MNIST on fully-connected network

./run.sh mnist_fc

Code reading guide

The neural network components are constructed with nngraph in the folder 'models'.

The network's components are put together in the main.lua file at line 126.

During optimisation, the GMVAE objectives (5 contributing terms) are calculated from line 202 until 250 in main.lua.

Acknowledgements

I would like to thanks the following, whose github's repos were used as inital templates for implementing this idea.

Rui Shu https://github.com/RuiShu/cvae
Joost van Amersfoort https://github.com/y0ast/VAE-Torch
Kai Arulkumaran https://github.com/Kaixhin/Autoencoders

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
criteria		criteria
datasets		datasets
layers		layers
models		models
plot		plot
.gitignore		.gitignore
ClusteringEvaluations.lua		ClusteringEvaluations.lua
README.md		README.md
UtilsNetwork.lua		UtilsNetwork.lua
main.lua		main.lua
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

This branch is for version 1 of the paper. For updated version checkout branch version2 and svhn

See an extension of this work https://github.com/51616/split-vae

Code for "Deep Unsupervised Clustering with Gaussian Mixture Variational Autoencoders"

Abstract

Requirements

Instructions

Code reading guide

Acknowledgements

About

Releases

Packages

Languages

Nat-D/GMVAE

Folders and files

Latest commit

History

Repository files navigation

This branch is for version 1 of the paper. For updated version checkout branch version2 and svhn

See an extension of this work https://github.com/51616/split-vae

Code for "Deep Unsupervised Clustering with Gaussian Mixture Variational Autoencoders"

Abstract

Requirements

Instructions

Code reading guide

Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages