Plant Disease Classification using CNN and Transfer Learning

This project uses Convolutional Neural Networks (CNNs), trained both from scratch and with transfer learning, to classify diseases in apple plants. Six deep learning architectures (ResNet50, VGG16, InceptionV3, DenseNet121, Xception, and MobileNet) are trained to identify black rot, black spot (scab), Glomerella leaf spot, mosaic virus, and European canker from images of apple leaves, stems, and fruit. The project also compares models trained from scratch with models trained via transfer learning.

Project Overview

This project addresses the problem of plant disease classification, particularly for apple plants. Traditional methods require expert pathologists, which is time-consuming and expensive. By utilizing deep learning models, this project aims to automate and simplify the identification process, providing a faster and more accessible solution to farmers. The project uses the NZDLPlantDisease-v1 dataset, consisting of 15,706 images of diseased and healthy apple plants across 7 categories, such as black spot, leaf scab, and mosaic virus.

Dataset

The dataset contains both healthy and diseased apple plant images, collected under different lighting conditions and angles to simulate real-world horticultural environments. The images were taken from the GitHub repository https://github.com/hsaleem1/NZDLPlantDisease-v1 and were originally used in the work of Saleem et al. (2022). The original dataset covers five crops: apple, avocado, grapevine, kiwi, and pear.

Size of Dataset used in this Project:

15,706 augmented images of apple plant leaves, stems, and fruits.

Classes:

7 classes: black rot, black spot (scab), Glomerella leaf spot, mosaic virus, European canker, healthy leaf, and healthy fruit.

External Test Dataset:

726 images from the same 7 classes.

Data Partitioning

The dataset was split into three subsets:

Training
Validation
Test

The dataset is split in the ratio 80:20:10 for training, validation, and testing, respectively.
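A three-way split of this kind can be sketched in plain Python. The sketch below treats 80:20:10 as relative parts, which is an assumption, since the source does not say how the three figures combine. The function name `split_by_ratio` and the fixed seed are hypothetical, and integers stand in for image file paths.

```python
import random


def split_by_ratio(items, ratios=(80, 20, 10), seed=0):
    """Shuffle items and cut them into consecutive subsets sized by relative ratios."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # seeded so the split is reproducible

    total = sum(ratios)
    subsets, start, cumulative = [], 0, 0
    for ratio in ratios:
        cumulative += ratio
        # Cutting at cumulative boundaries guarantees every item lands in exactly one subset.
        end = round(len(shuffled) * cumulative / total)
        subsets.append(shuffled[start:end])
        start = end
    return subsets


# Stand-in for the 15,706 image paths (assumption: one entry per image).
train, val, test = split_by_ratio(range(15706))
print(len(train), len(val), len(test))
```

Shuffling before cutting keeps each subset from inheriting any ordering in the file listing. A stratified split per class would be a reasonable refinement if class sizes are uneven.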

Data Augmentation:

The dataset is augmented with rotations, brightness adjustments, and flips to increase variability and prevent overfitting.
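The three augmentation types named above can be illustrated on a tiny grayscale image stored as nested lists. This is a dependency-free sketch, not the project's actual pipeline: the function names are hypothetical, rotation is limited to 90 degrees for simplicity, and pixels are assumed to be 8-bit values in 0..255.

```python
def horizontal_flip(image):
    """Mirror each row left to right."""
    return [row[::-1] for row in image]


def rotate_90_clockwise(image):
    """Rotate the image 90 degrees clockwise (reverse the rows, then transpose)."""
    return [list(row) for row in zip(*image[::-1])]


def adjust_brightness(image, factor):
    """Scale every pixel by factor and clamp the result to the 0..255 range."""
    return [[min(255, max(0, round(p * factor))) for p in row] for row in image]


image = [[0, 50],
         [100, 200]]

flipped = horizontal_flip(image)
rotated = rotate_90_clockwise(image)
brighter = adjust_brightness(image, 1.5)
```

Each transform returns a new image and leaves the input untouched, so several augmented variants can be derived from one original.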

Model Architecture

The project compares multiple CNN architectures:

ResNet50
VGG16
InceptionV3
Xception
DenseNet121
MobileNet

Two training strategies are employed:

Training from scratch
Transfer Learning: the CNN architecture is initialized with the weights of a pre-trained model instead of random weights.
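The difference between the two strategies can be sketched as follows, using DenseNet121 as the example. Several things here are assumptions not confirmed by the source: the use of TensorFlow/Keras, ImageNet as the pre-training dataset, a 224x224 input size, freezing the pre-trained base, and the small classification head. The helper names `strategy_config` and `build_model` are hypothetical.

```python
def strategy_config(strategy):
    """Return the settings that distinguish the two training strategies."""
    if strategy == "scratch":
        # Random initialization; every layer is trained on the apple dataset.
        return {"weights": None, "freeze_base": False}
    if strategy == "transfer":
        # Assumption: ImageNet weights with a frozen convolutional base.
        return {"weights": "imagenet", "freeze_base": True}
    raise ValueError(f"unknown strategy: {strategy!r}")


def build_model(strategy, num_classes=7, input_shape=(224, 224, 3)):
    """Build a DenseNet121 classifier configured for the chosen strategy."""
    # Imported here so strategy_config stays usable without TensorFlow installed.
    import tensorflow as tf

    config = strategy_config(strategy)
    base = tf.keras.applications.DenseNet121(
        include_top=False,            # drop the original 1000-class ImageNet head
        weights=config["weights"],
        input_shape=input_shape,
    )
    base.trainable = not config["freeze_base"]

    model = tf.keras.Sequential([
        base,
        tf.keras.layers.GlobalAveragePooling2D(),
        tf.keras.layers.Dense(num_classes, activation="softmax"),
    ])
    model.compile(
        optimizer="adam",
        loss="categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model
```

Calling `build_model("transfer")` downloads the ImageNet weights on first use, while `build_model("scratch")` needs no download. Swapping `DenseNet121` for another class in `tf.keras.applications` would give the other architectures the same treatment.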

Each model is evaluated based on performance metrics like accuracy, precision, recall, F1-score, and ROC-AUC.

Training and Evaluation

The models are trained using Google Colab with GPU support for faster computations.

Normalization

All images are normalized to a range of [0,1] to ensure stable training.
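As a minimal illustration, scaling to [0,1] amounts to dividing each pixel by the maximum possible value. The sketch assumes 8-bit pixels (maximum 255) and uses a hypothetical `normalize` helper on a nested-list image.

```python
def normalize(image, max_value=255):
    """Scale pixel values from 0..max_value into the range [0, 1]."""
    return [[pixel / max_value for pixel in row] for row in image]


image = [[0, 255],
         [51, 102]]
scaled = normalize(image)
```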

Optimization Algorithm

Early stopping and learning rate reduction techniques are used to optimize training.
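The logic behind these two techniques can be sketched with a simple loop over validation losses. This is a simplified illustration, not the project's training code: the patience values, the reduction factor, the starting learning rate, and the loss sequence are all hypothetical.

```python
def run_schedule(val_losses, lr=1e-3, stop_patience=4, lr_patience=2, factor=0.5):
    """Replay validation losses, reducing the learning rate and stopping on plateaus."""
    best = float("inf")
    epochs_since_best = 0
    history = []  # (epoch, learning rate in effect after that epoch)

    for epoch, loss in enumerate(val_losses, start=1):
        if loss < best:
            best, epochs_since_best = loss, 0
        else:
            epochs_since_best += 1
            # Learning rate reduction: shrink the rate every lr_patience stagnant epochs.
            if epochs_since_best % lr_patience == 0:
                lr *= factor
        history.append((epoch, lr))

        # Early stopping: give up once the loss has stalled for stop_patience epochs.
        if epochs_since_best >= stop_patience:
            break

    return best, history


losses = [1.0, 0.8, 0.9, 0.85, 0.82, 0.81, 0.9, 0.7]
best_loss, history = run_schedule(losses)
```

In this run the loss never improves on 0.8 after the second epoch, so the learning rate is halved twice and training stops before the last two epochs are reached.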

Evaluation Metrics:

Accuracy
Precision
Recall
F1-score
Confusion Matrix (True Positives, False Positives, True Negatives, and False Negatives)
ROC-AUC
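Most of these metrics can be derived directly from a confusion matrix, as the sketch below shows. The 3-class matrix is made-up illustration data, not a project result, and the function names are hypothetical. ROC-AUC is left out because it needs predicted scores rather than counts.

```python
def accuracy(confusion):
    """Fraction of all samples on the diagonal (correctly classified)."""
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    total = sum(sum(row) for row in confusion)
    return correct / total


def per_class_metrics(confusion, class_index):
    """Precision, recall, and F1 for one class.

    confusion[i][j] is the number of samples of true class i predicted as class j.
    """
    tp = confusion[class_index][class_index]
    fn = sum(confusion[class_index]) - tp                   # missed samples of this class
    fp = sum(row[class_index] for row in confusion) - tp    # other classes predicted as this one

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Hypothetical 3-class confusion matrix (rows: true class, columns: predicted class).
confusion = [[8, 2, 0],
             [1, 9, 0],
             [1, 0, 9]]

overall_accuracy = accuracy(confusion)
precision, recall, f1 = per_class_metrics(confusion, 0)
```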

Result for all Models:

Best Model:

The DenseNet121 model, trained from scratch, achieved the highest accuracy on the test set at 98.46%.

However, when tested on an external dataset, accuracy dropped to 56%, highlighting a generalization problem: the models performed well on the test dataset but struggled on data from outside it.

Conclusion

This project demonstrates the power of deep learning for predicting diseases in apple plants. The best-performing model, DenseNet121, shows high accuracy on both the training and test datasets. It was also able to make predictions on an external dataset, but more iteration and the use of different optimization algorithms are needed for better results.

Future Work

Future work might include comparing transfer learning (feature extraction) with fine-tuning on classification problems using the NZDLPlantDisease-v1 dataset. Another study could train models on the dataset with optimization algorithms other than Adam, or compare several optimization algorithms.



Muhyd33n/Apple-Plant-Diseases-Classification
