This project is a deep convolutional generative adversarial network that can create high-quality images from a random seed, such as portraits, animals, drawings, and more.

The model is a Generative Adversarial Network (GAN) as described in the paper Generative Adversarial Nets by the University of Montreal (2014).

The generator and the discriminator are both deep convolutional neural networks, as in the paper Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks by Facebook AI Research (2015), but with a few improvements:
- I added Equalized Learning Rate Layers from the paper Progressive Growing of GANs for Improved Quality, Stability, and Variation by Nvidia (2017) (see the sketch after this list)
- I added Bilinear Upsampling / Downsampling from the paper Making Convolutional Networks Shift-Invariant Again by Adobe Research (2019)
- I implemented the Wavelet Transform from the paper SWAGAN: A Style-based Wavelet-driven Generative Model by Tel-Aviv University (2021)
- I used a Style-Based Architecture with a Constant Input, Learned Styles from a Mapping Network, and Noise Injection from the paper A Style-Based Generator Architecture for Generative Adversarial Networks by Nvidia (2018)
- I added Skip Connections from the paper MSG-GAN: Multi-Scale Gradients for Generative Adversarial Networks by TomTom and Adobe (2019)
- I added Residual Blocks from the paper Deep Residual Learning for Image Recognition by Microsoft Research (2015)
- I added Minibatch Standard Deviation at the end of the discriminator from the paper Improved Techniques for Training GANs by OpenAI (2016) (sketched after this list)
- I kept the original Non-Saturating Loss from the paper Generative Adversarial Nets by the University of Montreal (2014) (sketched after this list)
- I added Path Length Regularization on the generator from the paper Analyzing and Improving the Image Quality of StyleGAN by Nvidia (2019)
- I added Gradient Penalty Regularization on the discriminator from the paper Improved Training of Wasserstein GANs by Google Brain (2017) (sketched after this list)
- I added Adaptive Discriminator Augmentation (ADA) from the paper Training Generative Adversarial Networks with Limited Data by Nvidia (2020), but the augmentation probability is not trained and has to be set manually (and some augmentations are disabled because they lack a PyTorch implementation)
- I added the computation of the Fréchet Inception Distance (FID) during training from the paper GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium by Johannes Kepler University Linz (2017), using the pytorch-fid module
- I added a Projector like in the paper Analyzing and Improving the Image Quality of StyleGAN by Nvidia (2019)
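To illustrate the equalized learning rate trick mentioned above, here is a minimal sketch (not this repository's exact code) of a fully connected layer that stores its weights at unit variance and rescales them by the He constant at every forward pass; the same idea applies to convolution layers:

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

class EqualizedLinear(nn.Module):
    """Fully connected layer with the equalized learning rate trick:
    weights are stored at unit variance and rescaled by the He constant
    at every forward pass, so all weights share the same dynamic range."""

    def __init__(self, in_features, out_features):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_features, in_features))
        self.bias = nn.Parameter(torch.zeros(out_features))
        self.scale = math.sqrt(2.0 / in_features)  # He initialization constant

    def forward(self, x):
        return F.linear(x, self.weight * self.scale, self.bias)
```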
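The minibatch standard deviation layer can be sketched as below; this simplified version treats the whole batch as a single group, whereas the original paper splits the batch into groups:

```python
import torch

def minibatch_stddev(x, eps=1e-8):
    """Append one feature map holding the mean standard deviation of the
    batch, so the discriminator can detect a lack of variety in a batch.

    x: (N, C, H, W) activations near the end of the discriminator."""
    std = torch.sqrt(x.var(dim=0, unbiased=False) + eps)       # (C, H, W)
    mean_std = std.mean().expand(x.shape[0], 1, *x.shape[2:])  # (N, 1, H, W)
    return torch.cat([x, mean_std], dim=1)                     # (N, C + 1, H, W)
```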
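The non-saturating loss from the original GAN paper, written on raw discriminator logits, looks like this (a sketch, not necessarily this repository's exact implementation):

```python
import torch.nn.functional as F

def generator_loss(fake_scores):
    """Non-saturating generator loss: maximize log D(G(z)) instead of
    minimizing log(1 - D(G(z))). `fake_scores` are the raw logits of the
    discriminator on generated images."""
    return F.softplus(-fake_scores).mean()  # equals -log sigmoid(D(G(z)))

def discriminator_loss(real_scores, fake_scores):
    """Matching discriminator loss on raw logits."""
    return F.softplus(-real_scores).mean() + F.softplus(fake_scores).mean()
```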
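And a minimal sketch of the gradient penalty from Improved Training of Wasserstein GANs, which penalizes the discriminator's gradient norm at random interpolates between real and fake images (the regularization weight and the exact variant used in this project may differ):

```python
import torch

def gradient_penalty(discriminator, real, fake):
    """Penalize the gradient norm of the discriminator at random
    interpolates between real and fake images, pushing it towards 1."""
    alpha = torch.rand(real.shape[0], 1, 1, 1, device=real.device)
    interpolates = (alpha * real + (1.0 - alpha) * fake).requires_grad_(True)
    scores = discriminator(interpolates)
    grads, = torch.autograd.grad(scores.sum(), interpolates, create_graph=True)
    grad_norm = grads.flatten(start_dim=1).norm(2, dim=1)
    return ((grad_norm - 1.0) ** 2).mean()
```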
FID scores of the trained models:

| Dataset | Resolution | FID |
| --- | --- | --- |
| Human faces | 256×256 | 5.97 |
| Animal faces | 256×256 | 6.56 |
| Anime faces | 256×256 | 3.74 |
| Painting faces | 256×256 | 20.32 |
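For reference, the pytorch-fid module mentioned above can also be run standalone to compare a folder of generated images against a dataset (the paths below are placeholders):

```console
$ python -m pytorch_fid path/to/generated_images path/to/dataset_images
```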
The trained weights for multiple datasets are available on Google Drive: just download the `.pt` files and put them in the `models` folder.
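Once a `.pt` file is in the `models` folder, it can be inspected with plain PyTorch. This is only a sketch with an assumed file name; the actual loading is done by the notebooks described below:

```python
import torch

# The file name is an assumption: use whichever .pt file you downloaded.
checkpoint = torch.load('models/human_faces.pt', map_location='cpu')
print(type(checkpoint))  # the exact checkpoint layout depends on this project
```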
Run the following command to install the dependencies:

```console
$ pip install -r requirements.txt
```

(You may need to use a specific command for PyTorch if you want to use CUDA; see the example below.)
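For example, a CUDA 11.8 build of PyTorch can be installed with the command below; check pytorch.org for the exact command matching your CUDA version:

```console
$ pip install torch torchvision --index-url https://download.pytorch.org/whl/cu118
```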
- First, you need to find and download a dataset of images (fewer than 5,000 may be too few, and more than 150,000 is not necessary). You can find many datasets on Kaggle, as well as the ones I used on my Google Drive.
- Then, in the `training/settings.py` file, specify the path to the dataset.
- If you don't have an overpriced 24 GB GPU like mine, the default settings may not work for you. You can try to (see the example sketch after this list):
  - Lower the batch size (less stable training and a worse final result)
  - Increase the accumulation steps (fixes the previous problems but is slower)
  - Lower the min features (worse results)
  - Decrease the image size
- Run the `training.ipynb` file (you can stop the training at any time and resume it later thanks to the checkpoints).
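As a hypothetical example of the tweaks listed above (the real variable names are the ones defined in `training/settings.py` and may differ):

```python
# Hypothetical names: check training/settings.py for the real ones.
DATASET_PATH = 'path/to/your/dataset'  # folder containing the training images
BATCH_SIZE = 16          # lower this if you run out of GPU memory
ACCUMULATION_STEPS = 2   # raise this to compensate for a smaller batch
MIN_FEATURES = 32        # lower this to shrink the network (worse results)
IMAGE_SIZE = 128         # smaller images also reduce memory usage
```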
- Run the `testing.ipynb` file to generate random images.
- Run the `testing/interpolation.ipynb` file to generate the images of a smooth interpolation video (see the latent interpolation sketch after this list).
- Run the `testing/projector.ipynb` file to project real images into the latent space (see the projection sketch after this list).
- Run the `testing/style_mixing.ipynb` file to generate the images of a style mixing interpolation video.
- Run the `testing/timelapse.ipynb` file to generate the images of a training timelapse video.
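The smooth interpolation video boils down to interpolating between two latent seeds and generating one image per step. A minimal sketch, assuming a latent dimension of 512 (the generator call itself depends on this repository's API):

```python
import torch

def lerp_latents(z1, z2, steps):
    """Linearly interpolate between two latent seeds; feeding each row to
    the generator yields the frames of a smooth interpolation video."""
    ts = torch.linspace(0.0, 1.0, steps).view(-1, 1)
    return (1.0 - ts) * z1 + ts * z2  # (steps, latent_dim)

z1, z2 = torch.randn(1, 512), torch.randn(1, 512)  # latent_dim = 512 assumed
frames = lerp_latents(z1, z2, steps=60)
```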
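Likewise, projecting a real image into the latent space means optimizing a latent vector until the generated image matches the target. A bare-bones sketch with an assumed `generator` callable and a plain MSE loss (the actual projector in the StyleGAN2 paper also uses a perceptual loss and optimizes in W space):

```python
import torch
import torch.nn.functional as F

def project(generator, target, latent_dim=512, steps=1000, lr=0.01):
    """Optimize a latent vector until the generated image matches `target`."""
    z = torch.randn(1, latent_dim, requires_grad=True)
    optimizer = torch.optim.Adam([z], lr=lr)
    for _ in range(steps):
        optimizer.zero_grad()
        loss = F.mse_loss(generator(z), target)  # pixel-space reconstruction
        loss.backward()
        optimizer.step()
    return z.detach()
```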
- Angel Uriot: creator of the project.