In this project, a novel approach is proposed for classification of the Modified MNIST dataset. The task is to identify the numerically largest value among three handwritten digits within each image. The proposed algorithm is a modified version of the well known VGGNet. Since convolutional neural networks (CNN) automatically capture the relevant features, we observe that additional feature selection and preprocessing on dataset are unnecessary. We also observe that data augmentation, optimizer tuning and model ensembling contribute to our best performance. The proposed algorithm reached an accuracy of 99.3% on the test set and ranked the first on Kaggle’s Modified MNIST competition.
The repository contains 3 files:
- 1 Jupyter notebook file - Miniproject3_Submission.ipynb
- 1 ReadMe file - ReadMe.md
- 1 Project writeup - writeup.pdf
The Modified MNIST dataset is hosted on Kaggle at:
https://www.kaggle.com/c/modified-mnist/data
- Open Jupyter notebook in Google Colab using a Google account (https://colab.research.google.com)
- Upload the notebook Miniproject3_Submission.ipynb
- Switch Runtime type to GPU to ensure faster execution
- The second code block contains Google Drive connection. Ensure that connection is established and Change directory into directory where the dataset files are stored.
- Run all the cells.