GAN+BetaVAE models #1

AtarSander · 2024-11-14T21:18:04Z

Project consists of two separate models created for car images generation:

GAN model - trained on all parts of dataset, tested, with results in README.md.
betaVAE model - works, but development still in progress to achieve satisfying results.

…sample image from NuImages dataset. - new file car_finder with class CarFinder - function fetch_vehicles_bounding_boxes iterates through each object and checks if its a vehicle

…es bounding boxes for each image in dataset. - function iterates through images in dataset and uses fetch_vehicles_bboxes_from_img to get bounding boxes of all vehicles

…mages of cars from given bounding boxes. Minor names changes in car_finder. - function cut_out_bbox uses cv2 library for cutting an image fragment where the car is positioned - function cut_out_vehicles iterates through dataset and cuts out all of vehicles there - in car_finder changed names for clarity

…nder. - check_bbox_size checks if bounding box of a vegicle is not below given threshold - implemented check_bbox_size into fetch_vehicles_bboxes_from_img

…for possible data pruning/augmentation in the future. Added .gitignore. - functions brigthness, sharpness, focus, contrast, noise_frequency, color_balance, uniformity measure different image metrics - their results are added into dictionary in function analyze_image - set image enables changing an image without creating new object

… For now it resizes images to one, target width and size. Created requirements file. - DataPreprocessor consist of function is_too_small which is used in resize_image for checking what type of interpolation to use for the best effect - deleted old commented code from image_quality_analyzer

…eQualityAnalyzer and DataPreprocessor - created analyze_dataset function which iterates through filenames in given directory and calculates metrics for each image - created resize dataset which iterates through filenames in given directory and resizes each image

- created Generator model class for generating images from noise input - created Discriinator model class for veryfiying image accuracy

… model. - Generator and Discriminator created on init, needed parameters passed as init arguments - traning happens inside train function which is split into smaller parts - separated functions for initializing weights, optim setup, loss calculations, backprop and evaluation to keep the code clean

…lasses. - both models use xavier normal init for Conv2d or ConvTranspose2d layers and constant init for BatchNorm2d layers

- fixed nn.Module inheritance in, added forward methods - fixed optimizer arguments in GAN, added retain_graph to backprop - deleted comment in carfinder

- load_dataset uses torchvision.transforms for chaning input images into traning data for GAN model.

…odel. Updated requirements, and GAN class. - test.py uses all of the class components to first extract and preprocess training data, and then train the model. - torch and matplotlib added to requirements - plots now saved as a file in GAN.

- created src folder for cleaner structure - fixed import paths

…odel.py, implemented weights saving and added progress bar to car_cutter. - In GAN, Generator and Discriminator fixed cuda usage so it works for every step of the training. - Implemented models weights saving in GAN. New directory models created for keeping the weights. - Added progress bar to cut_out_vehicles_from_dataset function in CarCutter to keep note of the progress (it takes a lot of time).

GAN network performs training without crashing or returning errors. In this state it doesn't provide satisfying results after training (work needed with parameters tuning).

… measurement per epoch, implemented tensorboard visualization. - in model.py changed features dimension in convolutional blocks for both generator and discriminator - in gan.py implemented elapsed time measurement per epoch for training - in gan.py added temporary implementation of tensorboard visualization for training

- label loop iterates through given directory files - it uses label_file to pick if the car is placed front, back or sideways to camera (or bad picture and discard) - it is possible to pick sort_type - it is possible to print images in cv2 window while labeling

…dded labeler class and labeling script to enable labeling images, started work on data augmeneter. - in traning loop comented epoch end condition and implemented time constraints - created class labeler for labeling images to correct subdirectories - labeling script enables same thing with different technique - data augmenter implements function for creating similar images to the original

- implemented flipping, rotating, brightness and contrast adjusting and gaussian noise application functions - augment_dataset function iterates through a folder and creates 6 more variations of each image

…ories. - added .vscode to .gitignore - fixed path passing in data augmenter - added line clearing in terminal for progress bar and implemented it in classes using the progress bar - implemented giving path to logs in GAN manually

- logs directory consists of fake and real subdir for tensorboard log keeping

…into model

Version 1.0

…g readme images.

…, removed unused code in gan, fixed types in model eval. - implemented try-except handling within commands in ADShell, especially ones with parameters passed from terminal - fixed bugged model calls in ADShell - changed name of .json config file, added additional parameters for complete training flexilibty - corrected types in both model_eval and ADShell

- added link to dataset in kaggle - added instalation instruction - implemented usage guide with separate section for training, inference and evaluation - added images of generated cars - grammar corrections

- class inherits from nn.Module - it consists of encoder and decoder with helper functions for building them - they are joined in forward method together with reparametrize

…taVAETrainer class. - removed beta parameter from beta_vae class (it's needed for training, not creating the model) - added weights initialization for decoder and encoder - created rough draft of BetaVAETrainer class for training process with data loading, forward prop, loss calculation and backpropagation

…rs dataset.

…py, updated autodoppelganger shell for betavae. - fixed wrong arguments passing in beta_vae - added weights saving and loading into BetaVAETrainer - added printing training_stats - added generating samples - implemented commands in adshell for betavae usage

- Added normalization for GAN generation. - Implemented cpu usage in both models generation.

- generate disentangled samples in both ad shell and train - added more error catching

…nt setup.

- replaced old style data resizing with appropriate torchvision transformations - added many data augmentations transformation in place of unused data augmentation class

- removed unecessary blank line in train.py - removed test print statement in gan - corrected mean transformation in data_preprocessor

- implemented saving of weight every 20 epoch in train.py - updated autodoppelganger_shell accordingly

…training is finished. - updated json config file - changed weight saving logic in adshell (for bvae only)

AtarSander and others added 30 commits March 18, 2024 22:39

Initial commit

1444f4c

Created class CarFinder for finding vehicles bounding boxes in given …

d0e8bb6

…sample image from NuImages dataset. - new file car_finder with class CarFinder - function fetch_vehicles_bounding_boxes iterates through each object and checks if its a vehicle

Created function fetch_vehicles_bboxes_from_dataset which gets vehicl…

c5c4d06

…es bounding boxes for each image in dataset. - function iterates through images in dataset and uses fetch_vehicles_bboxes_from_img to get bounding boxes of all vehicles

Implemented function for checking minimum bounding box size in car fi…

c05e962

…nder. - check_bbox_size checks if bounding box of a vegicle is not below given threshold - implemented check_bbox_size into fetch_vehicles_bboxes_from_img

First version of generator and discriminator classes

aa6b431

- created Generator model class for generating images from noise input - created Discriinator model class for veryfiying image accuracy

Added weights initialization for Generator and Discriminator inside c…

aa730e6

…lasses. - both models use xavier normal init for Conv2d or ConvTranspose2d layers and constant init for BatchNorm2d layers

Minor fixes in classes car_finder, mode, data preprocessor

2b59215

- fixed nn.Module inheritance in, added forward methods - fixed optimizer arguments in GAN, added retain_graph to backprop - deleted comment in carfinder

Added load_dataset function for data_preprocessor.

0dbd0f3

- load_dataset uses torchvision.transforms for chaning input images into traning data for GAN model.

Moved source code into src folder.

41d1936

- created src folder for cleaner structure - fixed import paths

Created progress bar class for printing out loading bar on the terminal.

5e10dd3

Implemented weights saving to temporary test function.

fb75622

Merge first working version of the model into main

07adf41

GAN network performs training without crashing or returning errors. In this state it doesn't provide satisfying results after training (work needed with parameters tuning).

IMPLEMENTATION: Implemented script that labels cut photos

fab2588

FIX: Corrected paths in script and added auto-size

335c6a6

Resolved merge conflicts in labeling_script

c652a01

Created data augmenter class for multipling dataset image.

a92c0ae

- implemented flipping, rotating, brightness and contrast adjusting and gaussian noise application functions - augment_dataset function iterates through a folder and creates 6 more variations of each image

Added directory for log keeping, deleted unnecessary code in labeler.

03f7b6a

- logs directory consists of fake and real subdir for tensorboard log keeping

IMPLEMENTATION: Created methods for loading model weights from file

efe9469

Merge branch 'model' of https://github.com/AtarSander/AutoDoppelGANger …

a4b752a

…into model

mikorozek and others added 30 commits April 24, 2024 22:33

REFACTOR: Created main.py in the main directory for using the app

b295624

Merge branch 'model' of https://github.com/AtarSander/AutoDoppelGANger …

1e65204

…into model

REFACTOR: Corrected file names and also minor changes

28a14c9

IMPLEMENTATION: Created computind FID and Inception in shell

a30071a

Merge pull request #2 from AtarSander/model

de8139a

Version 1.0

Created first version of README, added images subdirectory for storin…

564012c

…g readme images.

Created first version of README, added images subdirectory for storin…

dbf94b3

…g readme images.

Merge remote-tracking branch 'refs/remotes/origin/main'

71142c7

Adding the project to the fork of KNSI Golem official repository.

ac4edeb

Updated README

4bf62e8

- added link to dataset in kaggle - added instalation instruction - implemented usage guide with separate section for training, inference and evaluation - added images of generated cars - grammar corrections

Created BetaVAE class which implements whole model's architecture.

e135cc1

- class inherits from nn.Module - it consists of encoder and decoder with helper functions for building them - they are joined in forward method together with reparametrize

Saved weights checkpoints for GAN models, divided by categories in ca…

c2901cc

…rs dataset.

Updated generating samples for cpu.

71dd3b6

- Added normalization for GAN generation. - Implemented cpu usage in both models generation.

New betavae trained weights

ff7d157

Saved b-vae model weights with parameter beta=20.

f630a1f

Saved weights for side model with beta=10

d7c055e

Implemented disentangled images generation. Fixed errors.

4d2458f

- generate disentangled samples in both ad shell and train - added more error catching

Updated requirement file, added environment.yml for virtual environme…

6187535

…nt setup.

Added option for customizable naming of weights checkpoints files.

e6d7ee5

Removed unused functions. Added more transformations.

0308354

- replaced old style data resizing with appropriate torchvision transformations - added many data augmentations transformation in place of unused data augmentation class

Replaced timebased training stop condition with epoch num based.

dfbeb1c

Corrected the path where tensorboard logs are saved

d48e17b

Minor formatting and development changes.

e6c08af

- removed unecessary blank line in train.py - removed test print statement in gan - corrected mean transformation in data_preprocessor

Updated values of model's training configuration for testing.

88bb4e8

Created checkpoint saving of weights for bvae model.

52b860b

- implemented saving of weight every 20 epoch in train.py - updated autodoppelganger_shell accordingly

Configure checkpoint weight saving so it saves the weights after the …

6093340

…training is finished. - updated json config file - changed weight saving logic in adshell (for bvae only)

Newest weights for bvae model.

1a68c94

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GAN+BetaVAE models #1

GAN+BetaVAE models #1

AtarSander commented Nov 14, 2024

GAN+BetaVAE models #1

Are you sure you want to change the base?

GAN+BetaVAE models #1

Conversation

AtarSander commented Nov 14, 2024