Topic recognition #445 (Open)

Wants to merge 37 commits into base: topic-recognition.

Commits (37):
- 4bdf220: first commit (ryanjohnson08, Oct 4, 2022)
- f9cc7b6: Defining YOLO Model and Loss Function (ryanjohnson08, Oct 18, 2022)
- 3adff99: changing strides to improved unet problem (ryanjohnson08, Oct 18, 2022)
- 826336a: working on unet model (ryanjohnson08, Oct 18, 2022)
- e3f5486: working on loading data (ryanjohnson08, Oct 20, 2022)
- 70b3879: making data loader a class (ryanjohnson08, Oct 20, 2022)
- 496a6d3: fixing errors in dataloader (ryanjohnson08, Oct 20, 2022)
- c2f084f: implementing model blocks per Improved UNET model (ryanjohnson08, Oct 20, 2022)
- 95df229: Implemented improved unet model (ryanjohnson08, Oct 20, 2022)
- 04d11c6: add get method (ryanjohnson08, Oct 20, 2022)
- 484ff42: realised all my dataset code was in the train file (ryanjohnson08, Oct 20, 2022)
- e264786: working on train file (ryanjohnson08, Oct 20, 2022)
- a5aafd9: updating dataset.py to load test and validate data (ryanjohnson08, Oct 20, 2022)
- 3c07eff: train.py progress (ryanjohnson08, Oct 20, 2022)
- 0612cec: added result plotting to train.py (ryanjohnson08, Oct 20, 2022)
- 618ec20: written prediction.py (ryanjohnson08, Oct 21, 2022)
- 0fbb5eb: all code done i think (ryanjohnson08, Oct 21, 2022)
- 3d03fe6: Adding comments (ryanjohnson08, Oct 21, 2022)
- 17a3f45: Adding result images (ryanjohnson08, Oct 21, 2022)
- 7472f81: adding comments (ryanjohnson08, Oct 21, 2022)
- 8680f4b: fixing errors (ryanjohnson08, Oct 21, 2022)
- b1b1e1a: adding in references to code (ryanjohnson08, Oct 21, 2022)
- 411a64f: adjusted number of epochs, achieved 0.8 dice score (ryanjohnson08, Oct 21, 2022)
- d6ed329: images (ryanjohnson08, Oct 21, 2022)
- 16cdf7f: working on readme (ryanjohnson08, Oct 21, 2022)
- 143dd3b: adding to readme (ryanjohnson08, Oct 21, 2022)
- e7cc23e: adding pics to readme (ryanjohnson08, Oct 21, 2022)
- 3bd2809: Update readme, added pictures (ryanjohnson08, Oct 21, 2022)
- fd5ac9a: adding images (ryanjohnson08, Oct 21, 2022)
- d5b1ca7: adding images to readme (ryanjohnson08, Oct 21, 2022)
- 2a59fda: Updating readme (ryanjohnson08, Oct 21, 2022)
- e13fbbf: adding to readme (ryanjohnson08, Oct 21, 2022)
- d62b866: updating readme (ryanjohnson08, Oct 21, 2022)
- ca49644: adding in math equations (ryanjohnson08, Oct 21, 2022)
- 9cd412a: readme updates (ryanjohnson08, Oct 21, 2022)
- 8551e5a: updating readme (ryanjohnson08, Oct 21, 2022)
- 9cef7cb: finished readme (ryanjohnson08, Oct 21, 2022)

123 changes: 123 additions & 0 deletions recognition/45307915/README.MD

# Improved UNET - Segmentation of ISIC Dataset
The ISIC challenge is an international effort to improve melanoma diagnosis. Melanoma is one of the deadliest forms of skin cancer, and early detection greatly increases a patient's chance of survival. ISIC has published datasets for the community to further computer vision research.

A neural network can be trained to segment the melanoma within a skin image. This project demonstrates the use of an Improved UNET to achieve this.

## ISIC Dataset
The ISIC dataset contains a set of skin lesion images and their respective segmentation masks. The 2017 dataset used was already split into training, testing and validation sets.

Because the images in the set vary in size, they were all resized. The skin lesion images were resized to (256, 256, 3), keeping their colour channels, while the segmentation masks were resized to (256, 256, 1), reducing them to a single channel. The masks were then thresholded: pixels with a value greater than 0.5 were set to 1, and all other pixels were set to 0 (see `preprocessMasks` in `dataset.py` below).

![Training Image](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/image.png)
![Training Mask](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/mask.png)

The dataset is available at https://challenge.isic-archive.com/data/#2017.

## Improved UNET Model Architecture

The following model was proposed in the Brain Tumor Segmentation and Radiomics Survival Prediction paper [1].

![Improved UNET Model Architecture](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/IUNET_Architecture.JPG)

Like the original UNET, the Improved UNET is made up of encoding and decoding layers. The components of the Improved UNET architecture are as follows:

- Leaky ReLU - "leaky ReLU nonlinearities with a negative slope of $10^{−2}$ for all feature map computing convolutions" [1]
- Instance Normalisation - "replace the traditional batch with instance normalization" [1]
- Context Module - "Each context module is in fact a pre-activation residual block with two 3x3x3 convolutional layers and a dropout layer ($p_{drop}$ = 0.3) in between." [1] (a 2D sketch of this block is shown after this list)
- Upsampling Module - "This is achieved by first upsampling the low resolution feature maps, which is done by means of a simple upscale that repeats the feature voxels twice in each spatial dimension, followed by a 3x3x3 convolution" [1]
- Localisation Module - "A localization module consists of a 3x3x3 convolution followed by a 1x1x1 convolution that halves the number of feature maps." [1]
- Skip Connection - Like the original UNET, the improved UNET makes use of skip connections
- Segmentation Layer - These are 3x3 convolutions with a single output filter. These segments are summed to form the final network output.
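
As a concrete illustration of these components, below is a minimal sketch of a context module, assuming 2D 3x3 convolutions in place of the paper's 3x3x3 (this project operates on 2D images). The function name and exact layer arrangement are illustrative only; the project's actual model-building code is not shown in this diff.

```
import tensorflow as tf
import tensorflow_addons as tfa

def context_module(x, filters):
    # Illustrative sketch, not the project's implementation.
    # Pre-activation residual block: instance normalisation and leaky ReLU
    # (negative slope 0.01) before each 3x3 convolution, with dropout
    # (p = 0.3) between the two convolutions [1].
    # Assumes x already has `filters` channels so the residual add is valid.
    skip = x
    x = tfa.layers.InstanceNormalization()(x)
    x = tf.keras.layers.LeakyReLU(alpha=0.01)(x)
    x = tf.keras.layers.Conv2D(filters, 3, padding="same")(x)
    x = tf.keras.layers.Dropout(0.3)(x)
    x = tfa.layers.InstanceNormalization()(x)
    x = tf.keras.layers.LeakyReLU(alpha=0.01)(x)
    x = tf.keras.layers.Conv2D(filters, 3, padding="same")(x)
    return tf.keras.layers.Add()([skip, x])
```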

## Training

### Optimiser and Loss Function
The optimiser used for training was "the adam optimizer with an initial learning rate $lr_{init} = 5 · 10^{−4}$" combined with "the following learning rate schedule: $lr_{init} · 0.985^{epoch}$" [1]. Under this schedule the learning rate decays gently; at epoch 20, for example, it is $5 · 10^{−4} · 0.985^{20} ≈ 3.7 · 10^{−4}$.

For the loss function, the Dice coefficient between the true and predicted masks was used. The Dice score is defined as follows:

$$Dice = \frac{2|A \cap B|}{|A| + |B|}$$

```
from tensorflow.keras import backend as K

def diceCoefficient(self, y_true, y_pred):
    """
    Dice Coefficient

    Credit Karan Jakhar
    https://karan-jakhar.medium.com/100-days-of-code-day-7-84e4918cb72c

    Parameters:
        y_true (tf.Tensor): true output
        y_pred (tf.Tensor): output predicted by the model

    Return:
        tf.Tensor: Dice coefficient based on the true output and prediction
    """
    y_true_f = K.flatten(y_true)
    y_pred_f = K.flatten(y_pred)
    intersection = K.sum(y_true_f * y_pred_f)
    # The +1 smoothing term avoids division by zero on empty masks
    dice = (2. * intersection + 1.) / (K.sum(y_true_f) + K.sum(y_pred_f) + 1.)
    return dice
```
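
The training snippets below compile the model with `self.diceLoss`, whose definition is not included in this diff. A minimal sketch, assuming the loss is simply one minus the Dice coefficient:

```
def diceLoss(self, y_true, y_pred):
    # Assumed definition (the actual diceLoss is not shown in this diff):
    # one minus the Dice coefficient, so minimising the loss maximises the
    # overlap between the prediction and the ground truth mask
    return 1. - self.diceCoefficient(y_true, y_pred)
```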

### Training Dice Coefficient and Loss
The following graphs show the training Dice coefficient loss and Dice coefficient over 20 epochs.
![Dice Coefficient Loss](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/diceCoefficientLoss.png)
![Dice Coefficient](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/diceCoefficient.png)

### Adjustments
Initially, the structure laid out in the paper was followed for model training. However, the model did not achieve a Dice similarity coefficient greater than 0.8 on the test set.

![Test data dice coefficient original](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/TestDataDiceCoefficientOriginal.JPG)

Through testing, the following adjustments were found to make a difference.
First, the learning rate scheduler prescribed by the paper was removed:
```
results = model.fit(train_batch, epochs=self.epochs, validation_data=validate_batch,
                    callbacks=[tf.keras.callbacks.LearningRateScheduler(
                        lambda epoch: self.learning_rate * 0.985 ** epoch
                    )])
```
It was replaced with a constant learning rate:
```
adamOptimizer = Adam(learning_rate=self.learning_rate)
model.compile(optimizer=adamOptimizer, loss=self.diceLoss, metrics=[self.diceCoefficient])
```
This resulted in a slight increase in the Dice similarity coefficient.
![Test data dice coefficient adjust learning rate](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/TestDataDiceCoefficientLearningRate.JPG)

Decreasing the batch size then allowed the model to achieve a Dice similarity coefficient greater than 0.8 on the test set.
![Test data dice coefficient adjust batch size](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/TestDataDiceCoefficientDecreaseBatchSize.JPG)

## Predictions
Using the trained model, the following segmentation predictions were made from test set inputs.
![prediction1](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/p1.png)
![prediction2](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/p2.png)
![prediction3](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/p3.png)
![prediction4](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/p4.png)
![prediction5](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/p5.png)
![prediction6](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/p6.png)
![prediction7](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/p7.png)
![prediction8](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/p8.png)
![prediction9](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/p9.png)
![prediction10](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/p10.png)
![prediction11](https://github.com/ryanjohnson08/45307915/blob/topic-recognition/recognition/45307915/images/p11.png)

## Usage
To use this project, first ensure all dependencies are installed.

Download the 2017 ISIC dataset and place the training, testing, and validation image and mask sets in the working directory.

Running `train.py` trains an Improved UNET model on the training data and validates it on the validation data. The batch size, learning rate, and number of epochs can be adjusted within `train.py`.

Running `predict.py` makes predictions on the test dataset using the trained model.

### Dependencies
- Python: 3.7.15
- Numpy: 1.21.6
- Matplotlib: 3.2.2
- Tensorflow: 2.9.2
- Tensorflow Addons: 0.18.0

## References
[1] F. Isensee, P. Kickingereder, W. Wick, M. Bendszus, and K. H. Maier-Hein, “Brain Tumor Segmentation and Radiomics Survival Prediction: Contribution to the BRATS 2017 Challenge,” Feb. 2018. [Online]. Available: https://arxiv.org/abs/1802.10508v1
131 changes: 131 additions & 0 deletions recognition/45307915/dataset.py
import tensorflow as tf

TRAIN_IMAGES_PATH = "./ISIC-2017_Training_Data/*.jpg"
TRAIN_MASKS_PATH = "./ISIC-2017_Training_Part1_GroundTruth/*.png"

TEST_IMAGES_PATH = "./ISIC-2017_Test_v2_Data/*.jpg"
TEST_MASKS_PATH = "./ISIC-2017_Test_v2_Part1_GroundTruth/*.png"

VALIDATE_IMAGES_PATH = "./ISIC-2017_Validation_Data/*.jpg"
VALIDATE_MASKS_PATH = "./ISIC-2017_Validation_Part1_GroundTruth/*.png"

IMAGE_HEIGHT = 256
IMAGE_WIDTH = 256

class DataLoader():

    def __init__(self, train_images_path=TRAIN_IMAGES_PATH, train_masks_path=TRAIN_MASKS_PATH,
                 test_images_path=TEST_IMAGES_PATH, test_masks_path=TEST_MASKS_PATH,
                 validate_images_path=VALIDATE_IMAGES_PATH, validate_masks_path=VALIDATE_MASKS_PATH,
                 img_width=IMAGE_WIDTH, img_height=IMAGE_HEIGHT):
        """
        Create a new DataLoader instance to load the training, testing and validation data

        Parameters:
            train_images_path (str): Path glob of the training images
            train_masks_path (str): Path glob of the training masks
            test_images_path (str): Path glob of the testing images
            test_masks_path (str): Path glob of the testing masks
            validate_images_path (str): Path glob of the validation images
            validate_masks_path (str): Path glob of the validation masks
            img_width (int): Image width
            img_height (int): Image height
        """
        self.train_images_path = train_images_path
        self.train_masks_path = train_masks_path

        self.test_images_path = test_images_path
        self.test_masks_path = test_masks_path

        self.validate_images_path = validate_images_path
        self.validate_masks_path = validate_masks_path

        self.img_width = img_width
        self.img_height = img_height

    def preprocessImages(self, filenames):
        """
        Load and preprocess a single image file.

        Image loading code from TensorFlow
        https://www.tensorflow.org/api_docs/python/tf/io/read_file

        Parameters:
            filenames (tf.string): path of the image file

        Return:
            tf.Tensor: the resized, normalised image
        """
        raw = tf.io.read_file(filenames)
        images = tf.io.decode_jpeg(raw, channels=3)

        # Resize to a fixed spatial size
        images = tf.image.resize(images, [self.img_height, self.img_width])

        # Normalise pixel values to [0, 1]
        images = images / 255.

        return images


    def preprocessMasks(self, filenames):
        """
        Load and preprocess a single mask file.

        Image loading code from TensorFlow
        https://www.tensorflow.org/api_docs/python/tf/io/read_file

        Parameters:
            filenames (tf.string): path of the mask file

        Return:
            tf.Tensor: the resized, thresholded binary mask
        """
        raw = tf.io.read_file(filenames)
        images = tf.io.decode_png(raw, channels=1)

        # Resize to a fixed spatial size
        images = tf.image.resize(images, [self.img_height, self.img_width])

        # Normalise pixel values to [0, 1]
        images = images / 255.

        # Threshold to a binary 0/1 mask
        images = tf.where(images > 0.5, 1.0, 0.0)

        return images

    def loadData(self):
        """
        Loads and preprocesses all the image and mask data for the training, testing and validation datasets

        Return:
            (tf.data.Dataset, tf.data.Dataset, tf.data.Dataset): the training, testing and validation datasets of (image, mask) pairs
        """

        # train: list_files with shuffle=False keeps images and masks aligned by filename order
        train_image_data = tf.data.Dataset.list_files(self.train_images_path, shuffle=False)
        train_processed_images = train_image_data.map(self.preprocessImages)

        train_mask_data = tf.data.Dataset.list_files(self.train_masks_path, shuffle=False)
        train_processed_masks = train_mask_data.map(self.preprocessMasks)

        train_dataset = tf.data.Dataset.zip((train_processed_images, train_processed_masks))

        # test
        test_image_data = tf.data.Dataset.list_files(self.test_images_path, shuffle=False)
        test_processed_images = test_image_data.map(self.preprocessImages)

        test_mask_data = tf.data.Dataset.list_files(self.test_masks_path, shuffle=False)
        test_processed_masks = test_mask_data.map(self.preprocessMasks)

        test_dataset = tf.data.Dataset.zip((test_processed_images, test_processed_masks))

        # validate
        validate_image_data = tf.data.Dataset.list_files(self.validate_images_path, shuffle=False)
        validate_processed_images = validate_image_data.map(self.preprocessImages)

        validate_mask_data = tf.data.Dataset.list_files(self.validate_masks_path, shuffle=False)
        validate_processed_masks = validate_mask_data.map(self.preprocessMasks)

        validate_dataset = tf.data.Dataset.zip((validate_processed_images, validate_processed_masks))

        return train_dataset, test_dataset, validate_dataset
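
A minimal usage sketch for this loader follows; `train.py` is not included in this diff, and the batch size shown is illustrative only.

```
# Hypothetical usage of DataLoader (train.py is not shown in this diff)
loader = DataLoader()
train_dataset, test_dataset, validate_dataset = loader.loadData()

# Batch the pipelines before passing them to model.fit; the batch size here
# is illustrative, and the README notes that decreasing it improved results
train_batch = train_dataset.batch(16)
validate_batch = validate_dataset.batch(16)
```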

Binary file added recognition/45307915/images/diceCoefficient.png
Binary file added recognition/45307915/images/image.png
Binary file added recognition/45307915/images/mask.png
Binary file added recognition/45307915/images/p1.png
Binary file added recognition/45307915/images/p10.png
Binary file added recognition/45307915/images/p11.png
Binary file added recognition/45307915/images/p2.png
Binary file added recognition/45307915/images/p3.png
Binary file added recognition/45307915/images/p4.png
Binary file added recognition/45307915/images/p5.png
Binary file added recognition/45307915/images/p6.png
Binary file added recognition/45307915/images/p7.png
Binary file added recognition/45307915/images/p8.png
Binary file added recognition/45307915/images/p9.png