This project aims to fine-tune a ResNet50 model to classify flowers that are commonly occurring in the United Kingdom. The research that introduced this dataset utilized an SVM and a combination of features to achieve 72.8% accuracy. This project shows that a 93.04% accuracy can be achieved with a more sophisticated deep learning model.
The dataset contains 8189 images and 102 classes. Each class consists of between 40 and 258 images.
As described in the dataset
Each images have large scale, pose and light variations. In addition, there are categories that have large variations within the category and several very similar categories.
- Flowers might change color throughout their lifespan.
- Flowers can deform in various ways.
- Many types of flowers share similar shapes and colors.
A ResNet50 pretrained on IMAGENET1K_V2 is used. To fit the data into the model, images are resized and normalized. The FC layer is replaced so that it can produce scores for each of the 102 classes. Layers other than the FC layer are frozen.
Predictions were evaluated using balanced accuracy. The model achieved a 93.04% accuracy after 10 epochs of training. The model was trained on Google Colab using a T4 GPU.
- Dealing with overfitting problem (i.e. data augmentaion, L2 regularization, learning rate scheduler)
- Try with a different model (i.e. Transformer)