Code for reproducing some key results of our NeurIPS 2019 submission, *A New Distribution on the Simplex with Auto-Encoding Applications*.
- Requirements
  - Python (version 3.6.7 or higher)
  - Python requirements are captured in `requirements.txt`
  - For GPU-accelerated TensorFlow:
    - Run `pip install -r requirements.txt`
  - For CPU-only TensorFlow:
    - Change `tensorflow-gpu` to `tensorflow` in `requirements.txt`
    - Run `pip install -r requirements.txt`
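If you prefer to script the GPU-to-CPU swap, a minimal sketch using GNU sed follows (the pinned version in the sample line is purely illustrative, not taken from `requirements.txt`):

```shell
# Illustrative only: the substitution applied to a sample requirements line.
# In practice, edit the file in place with:
#   sed -i 's/^tensorflow-gpu/tensorflow/' requirements.txt
# (on macOS/BSD sed, use `sed -i ''` instead of `sed -i`)
echo 'tensorflow-gpu==1.13.1' | sed 's/^tensorflow-gpu/tensorflow/'
# → tensorflow==1.13.1
```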
We utilize the TensorFlow Datasets API. Running any of our code that requires data will automatically download the requisite data.
- Experiment Scripts
  - `experiments_ss_run.py` runs our semi-supervised learning experiments.
  - `experiments_ss_analyze.py` analyzes the results generated by `experiments_ss_run.py`.
- Multivariate Kumaraswamy Code
  - `mv_kumaraswamy_sampler.py` contains a TensorFlow implementation of the Multivariate Kumaraswamy.
  - `mv_kumaraswamy_theory.py` contains a symbolic implementation of the stick-breaking process that supports Beta and Kumaraswamy distributions.
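For intuition, the stick-breaking construction underlying the Multivariate Kumaraswamy can be sketched in plain Python using the Kumaraswamy's closed-form inverse CDF. This is an illustrative sketch only, not the repository's TensorFlow implementation, and the function names below are ours:

```python
import random

def kumaraswamy_sample(a, b, u=None):
    """Draw from Kumaraswamy(a, b) via the closed-form inverse CDF:
    x = (1 - (1 - u)^(1/b))^(1/a) for u ~ Uniform(0, 1)."""
    if u is None:
        u = random.random()
    return (1.0 - (1.0 - u) ** (1.0 / b)) ** (1.0 / a)

def stick_breaking_simplex(params):
    """Map K-1 Kumaraswamy draws (one (a, b) pair each) to a point
    on the K-dimensional simplex via stick breaking."""
    remaining = 1.0  # length of the stick still unbroken
    pis = []
    for a, b in params:
        v = kumaraswamy_sample(a, b)  # fraction of the remaining stick
        pis.append(remaining * v)
        remaining *= (1.0 - v)
    pis.append(remaining)  # final segment absorbs what is left
    return pis
```

Because each break removes a fraction of the remaining stick and the final segment absorbs the remainder, the output is nonnegative and sums to one by construction.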
- Model Files
  - `models_vae.py` contains our VAE models:
    - `VariationalAutoEncoder`: base class that contains common parameters and functions
    - `AutoEncodingKumaraswamy`: our proposed model (MV-Kumaraswamy), which works for dim(z) >= 0
    - `AutoEncodingSoftmax`: our softmax baseline model, which works for dim(z) >= 0
    - `KingmaM2`: our implementation of Kingma's M2 model, which works for dim(z) > 0
  - `model_lib.py` contains functions used to construct the inference and recognition network operations.
  - `model_utils.py` contains data loading/splitting, training routines, and other support functions.
- Miscellaneous
  - `unit_test.py`
  - `python mv_kumaraswamy_sampler.py` will plot and show Figure 1 (among other non-utilized figures).
  - `python mv_kumaraswamy_theory.py` will plot and show Figures 2-4 (among other non-utilized figures).
  - `ars-reparameterization/dirichlet-multinomial.ipynb` (modified from https://github.com/blei-lab/ars-reparameterization) was used to generate Figure 5.
  - `python model_utils.py` will plot and show something similar to Figure 6.
To fully rerun our experiments, we recommend defining a new, unused directory prefix; for example, `your_results_` suffices.
- Table 1:
  - First, run the following and wait for completion:

    ```
    python experiments_ss_run.py --dir_prefix your_results_ --num_runs 10 --data_set mnist --num_labelled 600 --dim_z 0
    python experiments_ss_run.py --dir_prefix your_results_ --num_runs 10 --data_set mnist --num_labelled 600 --dim_z 2
    python experiments_ss_run.py --dir_prefix your_results_ --num_runs 10 --data_set mnist --num_labelled 600 --dim_z 50
    ```

  - Second, run:

    ```
    python experiments_ss_analyze.py --dir_prefix your_results_ --data_set mnist
    ```
- Table 2:
  - First, run the following and wait for completion:

    ```
    python experiments_ss_run.py --dir_prefix your_results_ --num_runs 4 --data_set svhn_cropped --num_labelled 1000 --dim_z 50
    ```

  - Second, run:

    ```
    python experiments_ss_analyze.py --dir_prefix your_results_ --data_set svhn_cropped
    ```
- `results_ss_mnist` and `results_ss_svhn_cropped` contain the results from our original submission. Per reviewer feedback, we augmented this data with some additional baselines; however, these new baselines were not guaranteed to experience the same data folds. To analyze this data, run:

  ```
  python experiments_ss_analyze.py --dir_prefix results_ss_ --data_set mnist
  python experiments_ss_analyze.py --dir_prefix results_ss_ --data_set svhn_cropped
  ```
- To prepare our camera-ready submission, we reran all experiments such that all baselines would experience the same data folds. This "new" data is contained in `new_results_ss_mnist` and `new_results_ss_svhn_cropped`. These new sets can be analyzed with:

  ```
  python experiments_ss_analyze.py --dir_prefix new_results_ss_ --data_set mnist
  python experiments_ss_analyze.py --dir_prefix new_results_ss_ --data_set svhn_cropped
  ```