Project: An End to end FoodVision101 using CNN with VGG16&EfficientnetB1 + DVC

✨ Project information:

An end-to-end CNN Image Classification Model was developed using Transfer Learning to identify food items in images. The popular EfficientnetB1 model, which had been pretrained on the large Food101 dataset, was employed and retrained for the project's purposes. Remarkably, the DeepFood Paper's model, which had an accuracy of 77.4% and was also trained on Food101, was outperformed by the Model developed in this project. The project uses DVC (data version control) for managing data. It is built on a microservices architecture and is an end-to-end project. The dataset can be downloaded from this link.

Dataset : Food101

Model : EfficientNetB1 & VGG16 The project's model will be built using all of the data from the Food101 dataset, comprising 75,750 training images and 25,250 testing images.

Two methods to significantly improve the speed of the model training: *Prefetching *Mixed precision training

Checking the GPU

For this Project we will working with Mixed Precision. And mixed precision works best with a with a GPU with compatibility capacity 7.0+.

At the time of writing, colab offers the following GPU's :

Nvidia K80
Nvidia T4
Nvidia P100

Colab allocates a random GPU everytime we factory reset runtime. So you can reset the runtime till you get a Tesla T4 GPU as T4 GPU has a rating 7.5.

In case using local hardware, use a GPU with rating 7.0+ for better results.

📚 Libraries used :

Tensorflow
tfds
Keras
pandas
numpy
seaborn
os
DVC

Preprocessing the Data

Since we've downloaded the data from TensorFlow Datasets, there are a couple of preprocessing steps we have to take before it's ready to model.

More specifically, our data is currently:

In uint8 data type
Comprised of all differnet sized tensors (different sized images)
Not scaled (the pixel values are between 0 & 255)

Whereas, models like data to be:

In float32 data type
Have all of the same size tensors (batches require all tensors have the same shape, e.g. (224, 224, 3))
Scaled (values between 0 & 1), also called normalized

To take care of these, we'll create a preprocess_img() function which:

Resizes an input image tensor to a specified size using tf.image.resize()
Converts an input image tensor's current datatype to tf.float32 using tf.cast()

Building the Model : EfficientNetB1

Implemented Mixed Precision training and Prefetching to decrease the time taken for the model to train.

Getting the Callbacks ready

As we are dealing with a complex Neural Network (EfficientNetB0) its a good practice to have few call backs set up. Few callbacks I will be using throughtout this Notebook are :

TensorBoard Callback : TensorBoard provides the visualization and tooling needed for machine learning experimentation
EarlyStoppingCallback : Used to stop training when a monitored metric has stopped improving.
ReduceLROnPlateau : Reduce learning rate when a metric has stopped improving.

Evaluating the results

Loss vs Epochs

Accuracy vs Epochs

🚀 Project structure (MLOps-DVC):

🐨 DagsHub Data Pipeline

Complete Project Data Pipeline is available at DagsHub Data Pipeline

🔥 Technologies Used:

1. Python 
2. shell scripting 
3. aws cloud Provider 
4. DVC

🔌 Infrastructure:

1. AWS S3
2. GitHub
3. DaghsHub

👷 Initial Setup:

conda create --prefix ./env python=3.9
conda activate ./env 
pip install -r requirements.txt
dvc init

Conclusion

This project is production ready to be used for the similar use cases and it will provide the automated and orchesrated production ready pipelines(Training & Serving)

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.dvc		.dvc
.github/workflows		.github/workflows
artifacts		artifacts
configs		configs
data		data
docs/templates		docs/templates
notebook		notebook
src		src
.dvcignore		.dvcignore
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dvc.lock		dvc.lock
dvc.yaml		dvc.yaml
init_setup.sh		init_setup.sh
params.yaml		params.yaml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project: An End to end FoodVision101 using CNN with VGG16&EfficientnetB1 + DVC

✨ Project information:

Checking the GPU

📚 Libraries used :

Preprocessing the Data

Building the Model : EfficientNetB1

Getting the Callbacks ready

Evaluating the results

Loss vs Epochs

Accuracy vs Epochs

🚀 Project structure (MLOps-DVC):

🐨 DagsHub Data Pipeline

🔥 Technologies Used:

🔌 Infrastructure:

👷 Initial Setup:

Conclusion

Thanks for taking a look at this project. If you find it valuable, kindly rate it by clicking the star icon. Your support is highly appreciated! 😊🙏⭐

📃 License

About

Releases

Packages

Languages

License

hamehrabi/Project-DVC-FoodVision

Folders and files

Latest commit

History

Repository files navigation

Project: An End to end FoodVision101 using CNN with VGG16&EfficientnetB1 + DVC

✨ Project information:

Checking the GPU

📚 Libraries used :

Preprocessing the Data

Building the Model : EfficientNetB1

Getting the Callbacks ready

Evaluating the results

Loss vs Epochs

Accuracy vs Epochs

🚀 Project structure (MLOps-DVC):

🐨 DagsHub Data Pipeline

🔥 Technologies Used:

🔌 Infrastructure:

👷 Initial Setup:

Conclusion

Thanks for taking a look at this project. If you find it valuable, kindly rate it by clicking the star icon. Your support is highly appreciated! 😊🙏⭐

📃 License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages