Skip to content

marsupialtail/SVS

Repository files navigation

Accelerating CNN Training Through Gradient Approximation

Ziheng Wang | Sree Harsha Nelaturu

The repository for the workshop paper presented at EMC^2, ISCA 2019.

To reproduce results for CIFAR10:

  1. Ensure you have CUDA 8 and Tensorflow GPU Installed (Tested on TF 1.9)
  2. In the CUDA_Kernels/ folder, run create_ops.sh.
  3. Place the compiled approx_kernel2.so in the model_harness/models/
  4. Run python cifar10_epoch_wise.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •