GitHub - ritheshkumar95/pytorch-vqvae: Vector Quantized VAEs

Reproducing Neural Discrete Representation Learning

Course Project for IFT 6135 - Representation Learning

Instructions

To train the VQVAE with default arguments as discussed in the report, execute:

python vqvae.py --data-folder /tmp/miniimagenet --output-folder models/vqvae

To train the PixelCNN prior on the latents, execute:

python pixelcnn_prior.py --data-folder /tmp/miniimagenet --model models/vqvae --output-folder models/pixelcnn_prior

Datasets Tested

Image

MNIST
FashionMNIST
CIFAR10
Mini-ImageNet

Video

Atari 2600 - Boxing (OpenAI Gym) code

Reconstructions from VQ-VAE

Top 4 rows are Original Images. Bottom 4 rows are Reconstructions.

MNIST

Fashion MNIST

Class-conditional samples from VQVAE with PixelCNN prior on the latents

MNIST

Fashion MNIST

Comments

We noticed that implementing our own VectorQuantization PyTorch function speeded-up training of VQ-VAE by nearly 3x. The slower, but simpler code is in this commit.
We added some basic tests for the vector quantization functions (based on pytest). To run these tests

py.test . -vv

Authors

Rithesh Kumar
Tristan Deleu
Evan Racah

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
samples		samples
.gitignore		.gitignore
README.md		README.md
datasets.py		datasets.py
final_project.pdf		final_project.pdf
functions.py		functions.py
modules.py		modules.py
pixelcnn_baseline.py		pixelcnn_baseline.py
pixelcnn_prior.py		pixelcnn_prior.py
test_functions.py		test_functions.py
vae.py		vae.py
vqvae.py		vqvae.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Reproducing Neural Discrete Representation Learning

Course Project for IFT 6135 - Representation Learning

Instructions

Datasets Tested

Image

Video

Reconstructions from VQ-VAE

MNIST

Fashion MNIST

Class-conditional samples from VQVAE with PixelCNN prior on the latents

MNIST

Fashion MNIST

Comments

Authors

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

ritheshkumar95/pytorch-vqvae

Folders and files

Latest commit

History

Repository files navigation

Reproducing Neural Discrete Representation Learning

Course Project for IFT 6135 - Representation Learning

Instructions

Datasets Tested

Image

Video

Reconstructions from VQ-VAE

MNIST

Fashion MNIST

Class-conditional samples from VQVAE with PixelCNN prior on the latents

MNIST

Fashion MNIST

Comments

Authors

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages