Coffee Disease Visualization and Classification

Yebasse, Milkisa; Shimelis, Birhanu; Warku, Henok; Ko, Jaepil; Cheoi, Kyung Joo

doi:10.3390/plants10061257

Open AccessArticle

Coffee Disease Visualization and Classification

by

Milkisa Yebasse

¹

,

Birhanu Shimelis

²

,

Henok Warku

³,

Jaepil Ko

¹ and

Kyung Joo Cheoi

^4,*

¹

Department of Computer Engineering, Kumoh National Institute of Technology, Gumi 39177, Korea

²

Artificial Intelligence Center (AIC), Addis Ababa 2Q92+88, Ethiopia

³

Department of IT-Bio Convergence System, Electronics Engineering, Graduate School, Chosun University, Gwangju 61452, Korea

⁴

Department of Computer Science, Chungbuk National University, Cheongju 28644, Korea

^*

Author to whom correspondence should be addressed.

Plants 2021, 10(6), 1257; https://doi.org/10.3390/plants10061257

Submission received: 20 May 2021 / Revised: 17 June 2021 / Accepted: 18 June 2021 / Published: 21 June 2021

Download

Browse Figures

Versions Notes

Abstract

:

Deep learning architectures are widely used in state-of-the-art image classification tasks. Deep learning has enhanced the ability to automatically detect and classify plant diseases. However, in practice, disease classification problems are treated as black-box methods. Thus, it is difficult to trust the model that it truly identifies the region of the disease in the image; it may simply use unrelated surroundings for classification. Visualization techniques can help determine important areas for the model by highlighting the region responsible for the classification. In this study, we present a methodology for visualizing coffee diseases using different visualization approaches. Our goal is to visualize aspects of a coffee disease to obtain insight into what the model “sees” as it learns to classify healthy and non-healthy images. In addition, visualization helped us identify misclassifications and led us to propose a guided approach for coffee disease classification. The guided approach achieved a classification accuracy of 98% compared to the 77% of naïve approach on the Robusta coffee leaf image dataset. The visualization methods considered in this study were Grad-CAM, Grad-CAM++, and Score-CAM. We also provided a visual comparison of the visualization methods.

Keywords:

coffee disease classification; coffee disease visualization; deep learning; Grad-CAM; Score-CAM

1. Introduction

Ethiopia is the birthplace of coffee and the current source of the best coffee in the world [1]. Today, coffee is grown in more than 50 countries worldwide and is consumed on all continents. For Ethiopia, coffee is the largest export good, accounting for 20–25% of the total foreign exchange earnings. At least 15 million people rely on coffee for their livelihood [2]. While Ethiopia has a rich coffee heritage, its market share has not yet been fully developed [3]. More than 80 percent of the coffee growers are peasant farmers. Entire communities depend on coffee for their survival, with small farmers fighting to stay alive while facing growing environmental challenges [4]. The average coffee yield in the country is generally low. This is partly due to diseases and the limited use of advanced technologies. The method used for the supervision of diseases is observation through the naked eye, which is time-consuming and expensive and requires considerable expertise. Therefore, it is important to automatically identify the diseases without the need for experts. Object detection can act as a powerful tool in disease detection [5].

In the past few years, deep learning, object detection, and image classification have made tremendous advances. Krizhevsky proposed a deep convolutional neural network that achieved high accuracy in the classification of images into 1000 possible categories [6]; since then, it has revolutionized most aspects of computer vision. The VGG network from the visual geometry group is the most notable one. It achieved a remarkable result of 93.2% top-5 accuracy and 76.3% top-1 accuracy on the ImageNet test set [7].

However, it is sometimes difficult to obtain accurate ground truth labels for many tasks because of the cost of data labeling. Thus, in 2015, Maxime Oquab proposed a weakly supervised learning strategy using a convolutional neural network to reduce the annotation burden [8]. A weakly supervised learning approach uses partially labeled or annotated data when training a machine learning model. Thus, weak supervision is more applicable and desirable for deep learning techniques [9]. This observation has sparked many weakly supervised learning studies [10,11,12,13], while other studies have attempted semi-supervised anomaly detection using autoencoders [14].

Deep neural networks have made great achievements; however, a deeper understanding of the computations they perform at their intermediate layers remains limited. Some researchers have developed techniques to visualize and localize saliency maps: Zeiler and Fergus proposed a novel visualization technique that provides insight into the top operation of convolutional neural network (CNN) [15]; Zhou proposed learning deep features for discriminative localization [16]; Yoo designed multi-scale pyramid pooling for deep convolutional representation [17]; Jo and Yu improved localization by matching partial and full features [18].

Recent advances in computer vision and machine learning have made it possible to automate disease diagnosis. A wide range of disease detection systems that use convolutional neural networks have been published recently, Kumer presented the use of a pre-trained model for the detection and classification of healthy and defected coffee plants [19]. Giuliano L. Mansoa presented a method that identifies leaves from images by using a segmentation algorithm, separating them from the background, and finally applying an artificial neural network trained using an extreme learning machine [20]. In contrast, Shanwen Zhang developed a novel approach to identifying cucumber diseases by segmenting diseased leaf images by K-means clustering, examining the shape and color of the diseased leaf lesions, and classifying diseased leaf images using sparse representation [21]. Enhanced visual attention Guided Deep Neural Networks for Image Classification has been proposed by Yeh by learning feature maps that highlight salient regions and weaken meaningless connected layers [22].

Many state-of-the-art methods already exist for plant disease classification and detection [23,24,25,26,27] and defect detection in general [28,29,30]. However, to the best of our knowledge, there are only a few studies on coffee disease detection and they focus only on classifying healthy and non-healthy leaves using transfer learning [31] or using an annotated bounding box for detection [32]. Nevertheless, these deep models are seen as “black box” approaches, and researchers face a lot of trial and error when developing a satisfactory CNN model. Even accuracy can be inflated if the model overfits the data. Visualization can help open the black box and see what is happening in the model. Therefore, in this paper, we propose a method for visualizing coffee disease classification using well-established visualization techniques. The visualization techniques considered in this study were Grad-CAM, Grad-CAM++, and Score-CAM.

To summarize, our contributions are as follows:

We present a guided approach that achieved 98% accuracy in coffee disease classification. Further, in this study, we provide visualization of coffee disease, which exclusively highlights the region responsible for classification.
In this study, we implement three visualization approaches: Grad-CAM, Grad-CAM++, and Score-CAM. We also provided a visual comparison of those approaches.
In this study, we demonstrate the relevance of visualization in coffee disease classification. In support of our argument, we present two models and compare their accuracy and visualization results. By comparing the naïve approach and guided approach, this paper will provide new researchers with insights into the factors to consider when applying visualization and classification.

The remainder of this article is organized as follows. In Section 2, we briefly review the visualization methods and present two network architectures to demonstrate the importance of visualizing coffee disease classification. In Section 3, we present the experimental results and a visual comparative analysis of the visualization methods. The conclusion follows in Section 4.

2. Materials and Methods

2.1. Visualization Method

In this section, we briefly review the well-established visualization techniques. This paper implemented three visualization methods, Grad-CAM, Grad-CAM++, and Score-CAM.

2.1.1. Grad-CAM

Grad-CAM is a technique for visualizing important regions for available classes using guided propagation. It uses the gradient of any targeted class, passing into the final CNN layer to highlight important regions in the image for prediction [33]. Grad-CAM computes the gradient with respect to the feature map of a convolutional layer. To compute Grad-CAM, the gradient of the score for class c is computed first, with respect to a feature map activation A^k of a convolutional layer, that is ∂y^c/∂A^K. These gradients are global-average-pooled over the width and height dimensions (indexed by i and j, respectively) to obtain the neuron importance weights w (Equations (1) and (2)):

w_{k}^{c} = \frac{1}{z} \sum_{i} \sum_{j} \frac{\partial y^{C}}{\partial A_{i j}^{K}}

(1)

In addition, the heatmap of Grad-CAM is computed using a combination of feature maps, followed by relu:

L_{Grad - CAM} = relu (\sum_{k} w_{k}^{c} A^{K})

(2)

Grad-CAM calculates the effect of each area of the image on the final output based on the gradient of the parameter of the final convolutional layer and calculates the degree of influence, represented by a heatmap. We used Grad-CAM to exclusively highlight the defective region. Additionally, we provided an emphasized Grad-CAM by calculating the sigmoid heatmap before adding the input image. Figure 1 shows the architecture of Grad-CAM, where y is the score of the class, and w is the neuron importance.

2.1.2. Grad-CAM++

Grad-CAM++ is derived from Grad-CAM. Grad-CAM++ provides improved visual explanations of CNN model predictions, thereby providing better localization of objects and explanations of the occurrences of multiple objects in a single image [34]. In Grad-CAM, if there are multiple objects with slightly different views or orientations inside the box, the feature map weights are different. In contrast, Grad-CAM++ highlights all relevant input regions equally. Figure 2 shows the weight combination of the Grad-CAM and Grad-CAM++. The values in each pixel on the saliency map represent the levels of intensity at that point.

It may be that the feature map with lesser weight becomes insignificant and fades away in the final saliency map; to solve this issue, Grad-CAM++ takes the weighted average of the pixel-wise gradients;

α_{i j}^{k c}

is the weighting coefficient for the pixel-wise gradient Equation (3).

w_{k}^{c} = \sum_{i} \sum_{j} α_{i j}^{k c} . r e l u (\frac{\partial y^{C}}{\partial A_{i j}^{K}})

(3)

2.1.3. Score-CAM

In gradient-based methods, the gradient of a target class is back-propagated to the input layer to highlight the region of the sample that is highly relevant to the prediction. In contrast, with Score-CAM, instead of dependence on gradients for the weight of each activation map, each activation map is obtained by passing its forward-passing score on the target class [35]. The working process of Score-CAM involves two phases. Step 1 is the extraction of activation maps, where each activation serves as a mask on the original image and calculates its forward-passing score to the target class. Step 2 is the linear combination of score-based weights and activation maps.

2.2. Visualization of Coffee Disease

In this section, we present a visualization of coffee disease to illustrate how its classification distinguishes between healthy and unhealthy leaves. We built deep learning models to classify and localize the defective regions. Visualization can also be a useful tool to help find issues in the learning process and even provide guidance on how to fix them. Grad-CAM, Grad-CAM++, and Score-CAM were used as visualization tools in this study. To illustrate our point, we developed two models for classifying coffee diseases: the naïve approach and the guided approach.

2.2.1. Coffee Leaf Images Dataset

Our experiment was conducted using the Robusta coffee leaf image dataset (RoCoLe) [36]. The dataset consists of 1560 Robusta coffee leaf images with visible spots for non-healthy cases and healthy images. Images that are not healthy were infected by coffee rust. Coffee leaf rust, discovered at the end of the 19th century in Brazil, is one of the most destructive diseases of coffee plants in the world. Despite effective and integrated approaches using fungicides and resistant varieties developed to control rust, coffee leaf rust continues to have a debilitating effect on coffee growth [37]. Some of the symptoms of coffee rust include yellow spots on top of the leaf. Sometimes the spots may expand into larger round spots that turn bright orange to brown with a yellow border. Figure 3 shows a sample image of the coffee leaf dataset.

2.2.2. Naïve Approach

In the naïve approach, the network was trained using only original images from the dataset. We used ResNet as the backbone of our model [38]. The final pooling and prediction layers of Resnet are removed, while Global Average Pooling and dense output layers are added (Figure 4). The dense layers served as the classification layer. We used categorical cross-entropy as a loss function for our model. It compares the distribution of our predictions with the ground truth. The probability of the true class is kept as 1, and that of the false class is 0. We set the training hyperparameters as follows: the batch size was 32 and the RMSprop optimizer with an initial learning rate of 0.0001 was used. Grad-CAM was selected as the visualization tool. Figure 5 illustrates the naïve approach.

2.2.3. Guided Approach

A neural network has difficulty focusing only on the region of interest without any guidance. The naïve approach acquires too much functionality by learning commonalities that are not related to the disease, which can lead to overfitting. For example, by focusing on background soil structure. Therefore, we changed the background into something more meaningful and easier to analyze. We segmented the region of interest (leaf) from noise backgrounds like soil and small leaves, to guide the neural network. We used u²net to segment the images. The u²net is a powerful unsupervised neural network for salient object detection. The architecture of our u²net was a two-level nested U-structure. We used the pretrained u²net provided by the authors [39]. A pretrained u²net generates a corresponding masked image from the original input. Taking the masked image, we performed a bitwise operation between the original and segmented images (as shown in Figure 6).

We also noticed that a rust spot could be quite small. For the model to work properly, attention must be paid to the smallest details when working with small local features. Such type of small details can be dealt with by extracting small patches from the segmented images. Our base dataset was obtained by extracting 4000 image patches from the segmented dataset. The extracted small patches have dimensions of 128 × 128 pixels. Approximately 80% of our dataset were used to train the models, while the remaining 20% were used to test them. Figure 7 shows the extracted patches.

By segmenting the image, we can isolate only the region of interest, which will help guide the network. Additionally, by extracting patches, we can narrow the focus of the network to the rusty areas and increase our dataset as well. As a result, the proposed method (as shown in Figure 8) uses both segmented images and extracted patches as inputs for training. All images are resized to 224 × 224 and then, fed into the DNN model. We kept the DNN model similar to the naïve approach.

3. Results and Discussion

3.1. Naïve Approach

In this experiment, the network was trained using only the original images. Approximately 80% of the raw dataset was used for training, and the other 20% were used for testing. Figure 9 shows that the model achieved 99% accuracy in the training dataset. However, it performs poorly in terms of test accuracy. This model fails to generalize well from our training data to unseen data. This is called overfitting. Overfitting usually occurs when a model fits too closely to a limited dataset.

Visualizing our model can help resolve this problem. Visualization can serve as a guide for determining ways to improve the classifier. For example, Figure 10 depicts the visualization results of non-healthy leaves. Regions that are the most responsible for classifying objects are highlighted. Although the classifier correctly identified the rusty region, the background was considered more informative than the non-healthy region of the leaf. As a result, our model classifies the data by examining unrelated regions in the image. The visualization plot suggests that in its current state, our classifier is likely to associate non-healthy images with soil backgrounds. This can indicate the reasons the model has diverged. The model is suitable; the problem is the way it was fed data, which led to low test accuracy. This problem motivated us to propose a guided approach.

3.2. Guided Approach

The naïve approach encountered difficulty localizing only the defective region. The background soil texture was incorrectly used to distinguish between healthy and non-healthy images. Hence, a guidance system was needed to focus on the region of interest. The Guided approach was trained on segmented input images and extracted patches. Our model was guided to focus on the most important region of the image. As a result, the proposed approach achieved a validation accuracy of 98%, which is significantly higher than the 77% accuracy of the naïve approach (Table 1).

We introduced two improvements with the Guided approach: using masked images without a background noise (leaves, soil, etc.) and using small patches of images with diseased regions. We carried out further experiments to determine which factor most influenced the outcome. Figure 11 demonstrates that segmenting has a greater impact, however, Training with both the segmented and segmented images achieved the highest accuracy.

We can remark that the guided approach is superior to the naïve approach and generalizes well on unseen data. The visualization results helped us find what went wrong in the naïve approach. Figure 12 shows the visualization result of the guided approach: Grad-CAM effectively localized the defected regions. The proposed model seems to focus mostly on the defective regions; thus, we can trust it.

3.3. Comparison of Visualization Methods

It is apparent from the literature review that most of the studies have targeted the problem of plant disease classification [23,24,25,26,27,28,29,30]. Nevertheless, models were considered as a black box, and it is difficult to trust a model that we can’t explain how it operates. Moreover, in disease detection and classification systems, we cannot afford to be wrong. Visualization techniques provide a way to visualize the pixels in an image that contribute the most to its classification by the model. A heatmap was generated to localize the responsible region for classification. Visualization techniques highlighted exclusively the defective regions. A visual comparison of Grad-CAM, Grad-CAM++, and Score-CAM is presented in this section. We hope this will provide insight into how to build better models for plant disease applications.

Visualization techniques were applied to the extracted patches and full-size images for comparison. Interestingly, we found that all visualization techniques effectively localized the defective region on the patches of the image. A heatmap is shown in Figure 13, the heatmap shows that Grad-CAM, Grad-CAM++, and Score-CAM effectively identify the defective regions. The dark red line on the heatmap shows the location responsible for classifying the image as an abnormal image.

Furthermore, we tested our model on full-size images and added heatmaps outputs to input images to generate the visualization result. In most cases, all visualization techniques successfully identified the most distinguishable region of the target object, which indicates that any of these visualizations achieve competitive results for visualization (as shown in Figure 14). The emphasized versions were identified by calculating the sigmoid of the heatmap before incorporating it into the input image. The defective area was more heavily emphasized than normal grade-CAM. Moreover, none of these methods was exposed to bounding box annotation, but they all effectively localized the defective region. This indicates that visualization can also be used for coffee disease localization in a weakly supervised manner.

Grad-CAM and Score-CAM both show the ability to locate defected regions, but the saliency maps of Score-CAM are more focused than those of Grad-CAM++. For this reason, occasionally, Score-CAM localizes a healthy region that resembles a diseased region. In Score-CAM, each target object with a high confidence score predicted by the model can be highlighted independently [35]. Therefore, all pieces of evidence related to the target class can obtain responses and are assembled through linear combination. In Figure 15, on the right side of the image, there is a structure that looks like a disease; however, it is not one. As the Score-CAM can focus on a small amount of detail, it highlighted the structure as a defect.

4. Conclusions

In this paper, we analyzed and presented the use of visualization in disease detection methods. We demonstrated the possibility of using visualization to see the effectiveness of our model in detecting diseases instead of treating disease detection systems as black boxes. Further, a classification accuracy of 98% was achieved using the guided approach. Our study utilized three visualization methods: Grad-CAM, Grad-CAM++, and Score-CAM. A visual comparison of all visualization methods for coffee disease classification was also performed. This paper is intended to provide an understanding of how to build a better model for detecting plant diseases, and we consider visualization to be a vital component.

In future work, we plan to develop a coffee disease detection and localization system. Annotating images by object bounding boxes is expensive, especially in developing countries where expertise and technology are scarce. We have already shown a practical method of localizing coffee diseases without annotating images with object bounding boxes during training. Thus, in the future, we aim to investigate a weakly supervised coffee disease detection system.

Author Contributions

Conceptualization, M.Y. and J.K.; methodology, investigation, visualization, M.Y., B.S. and H.W.; writing—original draft preparation, M.Y., B.S., and H.W.; writing—review and editing, M.Y. and K.J.C.; supervision, project administration, M.Y., K.J.C. and J.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are openly available in Mendeley Data at doi:10.17632/c5yvn32dzg.2, reference number 36.

Acknowledgments

This research was supported by Kumoh National Institute of Technology (2019104153).

Conflicts of Interest

The authors declare no conflict of interest.

References

Getachew, S. Status of Forest Coffee (Coffea arabica L.) Diseases in the Afromontane Rainforest Areas of Ethiopia: A review. Greener J. Agric. Sci. 2017, 7, 19–31. [Google Scholar] [CrossRef]
Teferi, D. Status of Major Coffee Diseases of Coffea arabica L. in Afromontane Rainforests of Ethiopia. A Review. Food Sci. Qual. Manag. 2018, 76, 35–40. [Google Scholar]
Degaga, J. Review on Coffee Production and Marketing in Ethiopia. J. Mark. Consum. Res. 2020, 67, 7–15. [Google Scholar] [CrossRef]
Moat, J.; Williams, J.; Baena, S.; Wilkinson, T.; Gole, T.W.; Challa, Z.K.; Demissew, S.; Davis, A.P. Resilience potential of the Ethiopian coffee sector under climate change. Nat. Plants 2017, 3, 17081. [Google Scholar] [CrossRef] [PubMed]
Liu, J.; Wang, X. Plant diseases and pests detection based on deep learning: A review. Plant Methods 2021, 17, 1–18. [Google Scholar] [CrossRef] [PubMed]
Gonzalez, T.F. Handbook of Approximation Algorithms and Metaheuristics; Chapman and Hall/CRC: New York, NY, USA, 2007; 1432p. [Google Scholar] [CrossRef]
Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
Oquab, M.; Bottou, L.; Laptev, I.; Sivic, J. Is object localization for free?—Weakly-supervised learning with convolutional neural networks. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015. [Google Scholar]
Zhou, Z.-H. A brief introduction to weakly supervised learning. Natl. Sci. Rev. 2018, 5, 44–53. [Google Scholar] [CrossRef] [Green Version]
Zhou, M.; Bai, Y.; Zhang, W.; Zhao, T.; Mei, T. Look-Into-Object: Self-Supervised Structure Modeling for Object Recognition. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 11771–11780. [Google Scholar]
Yao, Q.; Gong, X. Saliency Guided Self-Attention Network for Weakly and Semi-Supervised Semantic Segmentation. IEEE Access 2020, 8, 14413–14423. [Google Scholar] [CrossRef]
Sun, G.; Wang, W.; Dai, J.; Van Gool, L. Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation. In Computer Vision—ECCV 2020; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2020; pp. 347–365. [Google Scholar] [CrossRef]
Fan, J.; Zhang, Z.; Song, C.; Tan, T. Learning Integral Objects With Intra-Class Discriminator for Weakly-Supervised Semantic Segmentation. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 4282–4291. [Google Scholar]
Minhas, M.S.; Zelek, J. Semi-supervised anomaly detection using autoencoders. arXiv 2020, arXiv:2001.03674. [Google Scholar]
Zeiler, M.D.; Fergus, R. Visualizing and understanding convolutional networks. In Computer Vision—ECCV 2014; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2014; pp. 818–833. [Google Scholar]
Zhou, B.; Khosla, A.; Lapedriza, A.; Oliva, A.; Torralba, A. Learning Deep Features for Discriminative Localization. In Proceedings of the CVPR 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 26 June–1 July 2016; IEEE: New York, NY, USA, 2016; pp. 2921–2929. [Google Scholar]
Yoo, D.; Park, S.; Lee, J.-Y.; Kweon, I.S. Multi-scale pyramid pooling for deep convolutional representation. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Boston, MA, USA, 7–12 June 2015; pp. 71–80. [Google Scholar] [CrossRef]
Jo, S.; Yu, I.-J. Puzzle-CAM: Improved localization via matching partial and full features. arXiv 2021, arXiv:2101.11253. [Google Scholar]
Kumar, M.; Gupta, P.; Madhav, P. Sachin Disease Detection in Coffee Plants Using Convolutional Neural Network. In Proceedings of the 2020 5th International Conference on Communication and Electronics Systems (ICCES), Coimbatore, India, 10–12 June 2020; pp. 755–760. [Google Scholar]
Manso, G.L.; Knidel, H.; Krohling, R.A.; Ventura, J.A. A smartphone application to detection and classification of coffee leaf miner and coffee leaf rust. arXiv 2019, arXiv:1904.00742. [Google Scholar]
Zhang, S.; Wu, X.; You, Z.; Zhang, L. Leaf image based cucumber disease recognition using sparse representation classification. Comput. Electron. Agric. 2017, 134, 135–141. [Google Scholar] [CrossRef]
Yeh, C.-H.; Lin, M.-H.; Chang, P.-C.; Kang, L.-W. Enhanced Visual Attention-Guided Deep Neural Networks for Image Classification. IEEE Access 2020, 8, 163447–163457. [Google Scholar] [CrossRef]
Hsiao, J.-K.; Kang, L.-W.; Chang, C.-L.; Lin, C.-Y. Comparative study of leaf image recognition with a novel learning-based approach. In Proceedings of the 2014 Science and Information Conference, London, UK, 27–29 August 2014; pp. 389–393. [Google Scholar] [CrossRef]
Xiong, J.; Yu, D.; Liu, S.; Shu, L.; Wang, X.; Liu, Z. A Review of Plant Phenotypic Image Recognition Technology Based on Deep Learning. Electronics 2021, 10, 81. [Google Scholar] [CrossRef]
Singh, V.; Misra, A. Detection of plant leaf diseases using image segmentation and soft computing techniques. Inf. Process. Agric. 2017, 4, 41–49. [Google Scholar] [CrossRef] [Green Version]
Kalvakolanu, A.T. Plant disease detection using deep learning. arXiv 2020, arXiv:2003.05379. [Google Scholar]
Saleem, M.H.; Khanchi, S.; Potgieter, J.; Arif, K.M. Image-Based Plant Disease Identification by Deep Learning Meta-Architectures. Plants 2020, 9, 1451. [Google Scholar] [CrossRef] [PubMed]
Tabernik, D.; Šela, S.; Skvarč, J.; Skočaj, D. Segmentation-based deep-learning approach for surface-defect detection. J. Intell. Manuf. 2020, 31, 759–776. [Google Scholar] [CrossRef] [Green Version]
Dong, X.; Taylor, C.J.; Cootes, T. Small Defect Detection Using Convolutional Neural Network Features and Random Forests. In Computer Vision—ECCV 2018 Workshops; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2018; pp. 398–412. [Google Scholar] [CrossRef]
Chen, Y.-F.; Yang, F.-S.; Su, E.; Ho, C.-C. Automatic Defect Detection System Based on Deep Convolutional Neural Networks. In Proceedings of the 2019 International Conference on Engineering, Science, and Industrial Applications (ICESI), Tokyo, Japan, 22–24 August 2019. [Google Scholar] [CrossRef]
Sorte, L.X.B.; Ferraz, C.T.; Fambrini, F.; Goulart, R.D.R.; Saito, J.H. Coffee Leaf Disease Recognition Based on Deep Learning and Texture Attributes. Procedia Comput. Sci. 2019, 159, 135–144. [Google Scholar] [CrossRef]
Hong, H.; Lin, J.; Huang, F. Tomato disease detection and classification by deep learning. In Proceedings of the 2020 International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE), Fuzhou, China, 12–14 June 2020; pp. 25–29. [Google Scholar]
Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. Int. J. Comput. Vis. 2020, 128, 336–359. [Google Scholar] [CrossRef] [Green Version]
Chattopadhay, A.; Sarkar, A.; Howlader, P.; Balasubramanian, V.N. Grad-CAM++: Generalized Gradient-Based Visual Explanations for Deep Convolutional Networks. In Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA, 12–15 March 2018; pp. 839–847. [Google Scholar]
Wang, H.; Wang, Z.; Du, M.; Yang, F.; Zhang, Z.; Ding, S.; Mardziel, P.; Hu, X. Score-CAM: Score-Weighted Visual Explanations for Convolutional Neural Networks. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA, 14–19 June 2020; pp. 111–119. [Google Scholar] [CrossRef]
Parraga-Alava, J.; Cusme, K.; Loor, A.; Santander, E. RoCoLe: A robusta coffee leaf images dataset. Mendeley Data V2 2019. [Google Scholar] [CrossRef]
Hindorf, H.; Omondi, C.O. A review of three major fungal diseases of Coffea arabica L. in the rainforests of Ethiopia and progress in breeding for resistance in Kenya. J. Adv. Res. 2011, 2, 109–120. [Google Scholar] [CrossRef] [Green Version]
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
Qin, X.; Zhang, Z.; Huang, C.; Dehghan, M.; Zaiane, O.R.; Jagersand, M. U2-Net: Going deeper with nested U-structure for salient object detection. Pattern Recognit. 2020, 106, 107404. [Google Scholar] [CrossRef]

Figure 1. Block diagram of Grad-Cam on coffee disease detection.

Figure 2. The intuition behind Grad-CAM.

Figure 3. Robusta coffee leaf images dataset. (a) non-healthy image; (b) healthy image.

Figure 4. Deep Neural Network model (DNN-model).

Figure 5. The pipeline of the naïve approach.

Figure 6. Segmented images using u²net.

Figure 7. This figure shows a sample image of extracted patches. (a) non-healthy patch image; (b) healthy patch.

Figure 8. The pipeline of the proposed approach.

Figure 9. Training and test accuracy of naïve approach.

Figure 10. Visualization Result on naïve approach (a) input image; (b) highlighted image.

Figure 11. Accuracy on patch image only, segmented only, and the proposed approach (Guided approach).

Figure 12. Visualization Result on Guided approach. (a) input image; (b) highlighted image.

Figure 13. Heatmaps on patch images.

Figure 14. Visualization Result of Grad-CAM, Grad-CAM++, and Score-CAM.

Figure 15. Visualization Result of Grad-CAM, Grad-CAM++, and Score-CAM.

Table 1. Comparisons of Naïve approach and guided approach.

Epoch	Naïve Approach	Proposed Approach (Guided Approach)
4	72%	71%
5	73%	72%
6	75%	86%
10	74%	98%
14	75%	98%

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yebasse, M.; Shimelis, B.; Warku, H.; Ko, J.; Cheoi, K.J. Coffee Disease Visualization and Classification. Plants 2021, 10, 1257. https://doi.org/10.3390/plants10061257

AMA Style

Yebasse M, Shimelis B, Warku H, Ko J, Cheoi KJ. Coffee Disease Visualization and Classification. Plants. 2021; 10(6):1257. https://doi.org/10.3390/plants10061257

Chicago/Turabian Style

Yebasse, Milkisa, Birhanu Shimelis, Henok Warku, Jaepil Ko, and Kyung Joo Cheoi. 2021. "Coffee Disease Visualization and Classification" Plants 10, no. 6: 1257. https://doi.org/10.3390/plants10061257

APA Style

Yebasse, M., Shimelis, B., Warku, H., Ko, J., & Cheoi, K. J. (2021). Coffee Disease Visualization and Classification. Plants, 10(6), 1257. https://doi.org/10.3390/plants10061257

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Coffee Disease Visualization and Classification

Abstract

1. Introduction

2. Materials and Methods

2.1. Visualization Method

2.1.1. Grad-CAM

2.1.2. Grad-CAM++

2.1.3. Score-CAM

2.2. Visualization of Coffee Disease

2.2.1. Coffee Leaf Images Dataset

2.2.2. Naïve Approach

2.2.3. Guided Approach

3. Results and Discussion

3.1. Naïve Approach

3.2. Guided Approach

3.3. Comparison of Visualization Methods

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI