Electronics
  • Article
  • Open Access

21 July 2021

Embedded System-Based Sticky Paper Trap with Deep Learning-Based Insect-Counting Algorithm

Department of IT Systems and Networks, University of Debrecen, 4032 Debrecen, Hungary
This article belongs to the Special Issue AI for Embedded Systems

Abstract

Flying insect detection, identification, and counting are the key components of agricultural pest management. Insect identification is also one of the most challenging tasks in agricultural image processing. With the aid of machine vision and machine learning, traditional (manual) identification and counting can be automated. To achieve this goal, a particular data acquisition device and an accurate insect recognition algorithm (model) are necessary. In this work, we propose a new embedded system-based insect trap with an OpenMV Cam H7 microcontroller board, which can be used anywhere in the field without any restrictions (AC power supply, Wi-Fi coverage, human interaction, etc.). In addition, we also propose a deep learning-based insect-counting method in which we offer solutions for problems such as the “lack of data” and “false insect detection”. By means of the proposed trap and insect-counting method, spraying can then be accurately scheduled to coincide with pest swarming.

1. Introduction

In order to achieve high crop yields, farmers use insecticides at scheduled times [1]. A better approach would be to use insecticides only when the likelihood of the presence of pests in the field is higher than a pre-defined threshold. Therefore, insect monitoring and the evaluation of insect density in the field are very important tasks in agriculture because they make the forecast of insect invasion possible. Such a forecast has significant environmental and economic effects because farmers can apply insecticides at the right time to defend their crops. Thus, they can save money and time (less spraying, less insecticide, etc.).
The most common information sources for insect density prediction are the traps (pheromone or other types) deployed in the field [2]. Traditional trap-based insect monitoring requires an expert who manually recognizes and counts pests. Manually counting pests on pheromone trap images is a very slow, labor-intensive, and expensive process. In addition, it requires a skilled person capable of distinguishing insect species. Therefore, an automated pest management system would be a great improvement, as it would also solve problems such as the lack of agricultural experts. In such a system, an automated insect recognition method is necessary in order to minimize human interaction.
Recently, IoT (Internet of Things) devices have been widely applied in agriculture and they allow remote farm monitoring with the use of camera(s). Among various vision systems, color cameras are the most popular due to their low price and ease of use [3]. In this work, we used the OpenMV device, which is dedicated to embedded computer vision tasks.
Beyond remote sensing devices, an accurate insect-counting method is also needed. Insect counting can be seen as a special version of object detection. Fortunately, the rapidly developing machine vision and learning technologies can be used to solve object detection problems with promising performance, not just in computer science but also in agriculture.
In this paper, we propose a new pest trap which contains an embedded system with a deep learning-based insect-counting algorithm. Our primary goal was to detect and count a particular moth: the Cydia pomonella. However, due to the very small number of available Cydia pomonella images, we developed a general moth recognizer and counting model that works for several moth species.

3. Materials and Methods

3.1. The Structure of the Prototype Trap

Owing to the rapid development of hardware technology, more and more versatile devices are available for computer vision applications nowadays. One of them is the low-cost, Python-powered OpenMV microcontroller board. As the main controller board, we used the OpenMV Cam H7, which is dedicated to machine vision applications. The OpenMV Cam H7 contains an STM32H743VI microcontroller unit (MCU) with an ARM Cortex-M7 32-bit RISC processor core. This core includes hardware acceleration for floating-point arithmetic, and its clock frequency is 480 MHz.
Beyond the MCU, the board includes an OV7725 image sensor. It is capable of taking 640 × 480 grayscale and 16-bit RGB565 images at 75 FPS. An illustration of the OpenMV Cam H7 board can be seen in Figure 1.
Figure 1. The OpenMV Cam H7 board.
The OpenMV board runs the MicroPython operating system, which makes development in Python possible. Today, Python is the primary programming language for machine learning; in this high-level language, it is easier to deal with the complex structure of machine learning algorithms.
The OpenMV board is mounted on a small carrier circuit. This circuit (motherboard) consists of 4 high-brightness LEDs, a battery connector, a GPS module, and a GSM module. The Quectel GSM module is responsible for transmitting the daily insect pest information. Beyond localization, a Quectel L80 GPS module is used for time synchronization. Originally, the task of the 4 LEDs was to improve the illumination conditions. However, the experimental results showed that the LED lighting caused reflections and degraded the image quality. Therefore, the 4 LEDs are no longer used.
The power supply is provided by a 3.65 V, 11,000 mAh LiPo battery without solar panel charger electronics. Due to the low power consumption of the circuit, it is enough to recharge the battery once a year. In fact, the OpenMV board is in standby (or deep sleep) mode most of the time. In deep sleep mode, its current consumption is less than 50 μA, while in active mode, it is 140 mA. The device is in active mode once a day for approximately 1 min. During this period, it performs image capture, time synchronization, and information transmission.
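As an illustration of this duty cycle, a minimal MicroPython sketch is given below. It only outlines the idea: the camera calls follow the public OpenMV API (sensor and machine modules), while the GSM/GPS handling, the file name, and the exact deep sleep wake-up configuration are placeholders, since the trap's firmware details are not reproduced here.

```python
# Minimal sketch of the daily duty cycle (illustration only; GSM/GPS handling
# and the wake-up configuration are placeholders, not the trap's firmware).
import sensor
import machine

def capture_image():
    # Standard OpenMV camera setup: 640 x 480 RGB565 frame from the OV7725 sensor.
    sensor.reset()
    sensor.set_pixformat(sensor.RGB565)
    sensor.set_framesize(sensor.VGA)
    sensor.skip_frames(time=2000)      # let exposure settle
    img = sensor.snapshot()
    img.save("/capture.jpg")           # store the daily trap image
    return img

def transmit_daily_report(img):
    # Placeholder: a Quectel GSM module sends the daily pest information and a
    # Quectel L80 GPS is used for time synchronization (details not shown).
    pass

img = capture_image()
transmit_daily_report(img)

# Return to deep sleep (tens of uA) until the next daily wake-up; sleep duration
# in ms. On boards without a timed wake-up, an RTC alarm must be configured instead.
machine.deepsleep(24 * 60 * 60 * 1000)
```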
The electronic components are placed inside a waterproof plastic box that has a transparent cover. A photo of its arrangement within the waterproof box can be seen in Figure 2. The box is connected to the top of the trap’s housing, while the sticky paper is placed at the bottom of the housing. The camera is positioned approximately 20 cm away from the sticky paper trap. The trap’s housing is made of a UV-resistant polycarbonate sheet. Its dimensions can be seen in Figure 3.
Figure 2. Arrangement of the electronic components inside the waterproof box (without cover). The GPS and GSM modules are on the backside of the circuit.
Figure 3. The trap’s housing.
Unlike earlier proposed insect pest monitoring systems, our embedded system-based trap does not require an indoor environment, an AC power supply, Wi-Fi coverage, or human interaction. In addition, using our trap does not require the installation of costly hardware infrastructure in the field.

3.2. Dataset Generation

In parallel with the hardware design, we started the development of a deep learning-based insect-counting model. The first step was the dataset construction. Unfortunately, we did not have our own dataset. As mentioned earlier, there are some freely available datasets for insect pest management, but none of them contain images of the Cydia pomonella insect pest. Therefore, at first, we tried to gather images of the Cydia pomonella from the Internet, but we were able to collect only 60 images.
After that, our idea was to gather images of moth species that are visually similar to the Cydia pomonella and to use them to train a deep classifier model. After a detailed review of the available datasets, we found the Kaggle Fieldguide Challenge: Moths & Butterflies dataset to be the most useful for our purpose. Out of the 5419 classes of the Kaggle Fieldguide Challenge dataset, the images of 25 classes were used as additional source images in this study. Those 25 insect species were selected according to 3 criteria: the insect’s shape, texture, and tentacle length. All of the images were annotated by hand (using the Labelme software), with bounding boxes indicating the moth locations on the image [23].
After the annotation of the source images, a binary dataset was generated with positive (“moth”) and negative (“no moth”) classes. For dataset creation, we used the selective search [24] region proposal algorithm. Selective search is a widely used algorithm in object detection research that examines an input image and identifies where potential objects may be located. All of the annotated images were passed through the selective search algorithm, and the proposed regions (bounding boxes) were evaluated with the intersection over union (IoU) metric. The IoU metric is also widely used in object detection research to measure the localization accuracy of an object detector. More specifically, the IoU is a ratio where the numerator is the area of overlap between the ground truth bounding box and the predicted bounding box, while the denominator is the area of their union.
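As a concrete form of this definition, the IoU of two axis-aligned boxes can be computed as in the short Python sketch below; the (x1, y1, x2, y2) box format is only an assumption for illustration.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    # Intersection rectangle.
    x1 = max(box_a[0], box_b[0])
    y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2])
    y2 = min(box_a[3], box_b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```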
Out of the proposals where the IoU > 0.8, all bounding boxes were used (if available) as positive samples, and out of the proposals where the IoU < 0.05, at most 10 bounding boxes were used (if available) as negative samples for each image. At the end of the dataset generation process, 17,011 samples were available for training the classifier model. An illustration of 3 positive and 3 negative samples can be seen in Figure 4.
Figure 4. Three positive (top row) and three negative (bottom row) samples.
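The sampling rule above (all proposals with IoU > 0.8 as positives, at most 10 proposals with IoU < 0.05 as negatives per image) could be implemented roughly as follows. The sketch reuses the iou() helper shown earlier and assumes that proposals and ground truth boxes are available as lists of (x1, y1, x2, y2) tuples; these data structures are assumptions, not the original implementation.

```python
def sample_patches(proposals, gt_boxes, max_negatives=10):
    """Split region proposals into positive ('moth') and negative ('no moth') samples."""
    positives, negatives = [], []
    for box in proposals:
        best_iou = max((iou(box, gt) for gt in gt_boxes), default=0.0)
        if best_iou > 0.8:
            positives.append(box)                      # proposal clearly covers a moth
        elif best_iou < 0.05 and len(negatives) < max_negatives:
            negatives.append(box)                      # background patch
    return positives, negatives
```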

3.3. Deep Classifier Model

A well-known fact about deep classifier models is their huge data requirement. Training deep classifier models requires a significantly larger set of sample images than training shallow classifiers. Moreover, acquiring insect images is a more tedious process than acquiring generic images. This issue has been the main limitation to the use of deep classifier models in insect pest management. We tried to overcome this issue with the data generation process detailed above and a pre-trained deep classifier model base (convolutional and pooling layers). Pre-trained models use knowledge gained from huge, generic databases such as the famous ImageNet and can be a good initialization point for recognizing insect pests. In the literature, there are already several studies that demonstrate how pre-trained models can be used with great success in object classification problems, even when the target object or objects are totally different from the object images used to train the model [25].
Many pre-trained deep models can be found in machine learning-specific development environments. Choosing the “right” model requires a compromise between accuracy and computation time. We used the MobileNetV2 deep convolutional architecture (trained on ImageNet data) as our base model, which was designed for mobile devices. It has a significantly smaller architecture and lower computational complexity than other widely used object detector models such as the R-CNN. Therefore, this model is suitable for devices with low computational power. More information about the MobileNetV2 model can be found in the original paper [26].
In our implementation, two fully connected layers were added to the MobileNetV2 base, with 128 and 2 neurons, respectively. The activation function on the output layer was softmax, while the cost function was binary cross-entropy. This extended MobileNetV2 architecture was fine-tuned on our dataset during the training process. More precisely, this means that only the fully connected layers were trained, while the weights of the MobileNetV2 base remained unchanged.
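This architecture can be reproduced in Keras roughly as in the sketch below. The frozen MobileNetV2 base (ImageNet weights), the 128- and 2-neuron fully connected layers, the softmax output, the binary cross-entropy loss, and the 0.0001 learning rate follow the text; the 224 × 224 input size, the global average pooling layer, the ReLU activation in the 128-neuron layer, and the Adam optimizer are assumptions.

```python
import tensorflow as tf

def build_moth_classifier(input_shape=(224, 224, 3)):
    # Pre-trained MobileNetV2 base without its ImageNet classification head.
    base = tf.keras.applications.MobileNetV2(
        input_shape=input_shape, include_top=False, weights="imagenet")
    base.trainable = False                              # only the new head is trained

    model = tf.keras.Sequential([
        base,
        tf.keras.layers.GlobalAveragePooling2D(),       # assumed pooling before the head
        tf.keras.layers.Dense(128, activation="relu"),  # assumed ReLU activation
        tf.keras.layers.Dense(2, activation="softmax"), # "no moth" / "moth"
    ])
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
                  loss="binary_crossentropy",           # as stated, with one-hot labels
                  metrics=["accuracy"])
    return model
```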

4. Results

Tests of the proposed prototype trap have only been running since March 2021. During this time, very few useful images have been taken. Therefore, the required data, such as the training and test sets, had to be obtained from external sources.
As described in the previous section, we generated a binary dataset to train the moth classifier model. As usual, the whole dataset was divided into training and validation sets. In total, 20% of all images were used for validation, while the remaining images formed the training set. After 30 epochs with a 0.0001 learning rate, the training process was terminated because the validation accuracy did not increase further. The training and validation accuracy curves can be seen in Figure 5.
Figure 5. Training and validation accuracies.
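A minimal training sketch matching this setup (80/20 training-validation split, 30 epochs, 0.0001 learning rate) is shown below. The file names, the batch size, and the use of scikit-learn's train_test_split are assumptions for illustration only.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# X: patch images resized to the network input size and scaled to [0, 1];
# y: one-hot "no moth"/"moth" labels. The .npy file names are hypothetical.
X = np.load("patches.npy")
y = np.load("labels.npy")

X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, random_state=42)

model = build_moth_classifier()     # sketch from Section 3.3
history = model.fit(X_train, y_train,
                    validation_data=(X_val, y_val),
                    epochs=30, batch_size=32)
```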
Unlike the sample images (cut out from general images of insects) used in the training process, trap images were used in the test phase. Our test set consists of 30 different and independent sticky trap images, which have also been annotated by hand [23].
In order to find image regions on the test (trap) images where insects could be located, we used the same selective search method as in the dataset generation step. Depending on the image, selective search may return more than a thousand proposals. To filter out proposals with an inappropriate size, we defined lower and upper size boundaries. The lower limit was 2% of the test image size, while the upper limit was 30%. This means that the height and width of a proposed bounding box had to be greater than 2% but less than 30% of the corresponding image dimensions (height and width).
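This proposal generation and size filtering step could be reproduced with the selective search implementation available in the opencv-contrib package, as sketched below; the concrete API calls and the (x, y, w, h) to (x1, y1, x2, y2) conversion are assumptions rather than the original implementation.

```python
import cv2  # requires opencv-contrib-python for the ximgproc module

def propose_regions(image, min_frac=0.02, max_frac=0.30):
    """Run selective search and keep boxes whose width and height lie between
    2% and 30% of the corresponding image dimension."""
    ss = cv2.ximgproc.segmentation.createSelectiveSearchSegmentation()
    ss.setBaseImage(image)
    ss.switchToSelectiveSearchFast()
    rects = ss.process()                               # (x, y, w, h) proposals

    img_h, img_w = image.shape[:2]
    kept = []
    for (x, y, w, h) in rects:
        if (min_frac * img_w <= w <= max_frac * img_w and
                min_frac * img_h <= h <= max_frac * img_h):
            kept.append((x, y, x + w, y + h))          # convert to corner format
    return kept
```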
To measure the accuracy of the selective search method (with the size restriction), we introduced the following metric:
$$\mathrm{ACC}_{SS} = \frac{1}{N} \sum_{i=1}^{N} \frac{1}{M_i} \sum_{j=1}^{M_i} \max_{k}\, \mathrm{IoU}\!\left(\mathrm{true\_box}_{i,j},\ \mathrm{proposed\_box}_{i,k}\right) \tag{1}$$
where N is the number of test images, Mi is the number of ground truth boxes on the ith test image, and the maximum is taken over the proposed boxes (indexed by k) of the ith image. This metric (in percentage) is 82% on our test set, which indicates that the selective search method can be used with high confidence. As an example, Figure 6 shows all proposals on a test image where the original selective search method generated 4217 proposals, while the size restriction reduced this number to 1994. To visually demonstrate the efficiency of the used region proposal method, Figure 7 shows only those proposals where the IoU value between the proposed and ground truth boxes was higher than 0.8. We can say that the region proposal method found one or more regions for all moths on the given test image; this was also true for the other test images.
Figure 6. All proposals of the selective search method. Proposals are visualized in random colors.
Figure 7. Proposals where the IoU ≥ 0.8. Proposals are visualized in random colors.
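Using the iou() helper from Section 3.2, the ACC_SS metric of Equation (1) can be computed as in the sketch below; the list-of-lists data layout is an assumption.

```python
def acc_ss(gt_boxes_per_image, proposals_per_image):
    """For every ground truth box, take the best IoU achieved by any proposal,
    average per image, then average over the test images (Equation (1))."""
    per_image_scores = []
    for gt_boxes, proposals in zip(gt_boxes_per_image, proposals_per_image):
        best_ious = [max((iou(gt, p) for p in proposals), default=0.0)
                     for gt in gt_boxes]
        per_image_scores.append(sum(best_ious) / len(best_ious))
    return sum(per_image_scores) / len(per_image_scores)
```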
All of the selected image regions were passed to the trained moth classifier model. Since the model’s output activation function was softmax, the model output can be seen as a probability value indicating whether the proposed region contains a moth or not. The final decision depends on these probability values. In this study, we used 0.5 as the probability threshold. This means that if the probability value of a region is higher than the threshold, the region is treated as a possible moth; otherwise, the region is ignored. A sample image of the outcome of the classification step can be seen in Figure 8.
Figure 8. Moth candidate regions where the classifier’s probability value > 0.5.
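The classification step can be sketched as follows: each proposed region is cropped, resized to the network input size, and kept as a moth candidate if its “moth” probability exceeds 0.5. The resize call, the 1/255 scaling, and the class index order are assumptions.

```python
import cv2
import numpy as np

def classify_proposals(image, proposals, model, threshold=0.5, input_size=(224, 224)):
    """Return (box, probability) pairs whose 'moth' probability exceeds the threshold."""
    candidates = []
    for (x1, y1, x2, y2) in proposals:
        patch = cv2.resize(image[y1:y2, x1:x2], input_size)
        probs = model.predict(patch[np.newaxis] / 255.0, verbose=0)[0]
        if probs[1] > threshold:                       # assumed: index 1 is "moth"
            candidates.append(((x1, y1, x2, y2), float(probs[1])))
    return candidates
```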
In Figure 8, several regions cover exactly the same moth. Therefore, in the last step of the classification process, we had to “suppress” overlapping regions. A commonly used solution for this problem is Non-Max Suppression (NMS). The goal of NMS is to eliminate the less likely bounding boxes and to keep only the best ones according to the probabilities and the IoU values of overlapping boxes. The effect of NMS can be seen in Figure 9. The number of regions retained by NMS can be seen as the number of detected moths (cp).
Figure 9. Detected moths with NMS.
To measure the performance of our insect-counting method, we introduced the following loss function:
$$l(c^p, c^r) = \frac{1}{N} \sum_{i=1}^{N} \frac{\left| c_i^p - c_i^r \right|}{c_i^r + 1} \tag{2}$$
where N is the number of sample images, c_i^p is the number of predicted boxes, and c_i^r is the number of real insects (ground truth boxes) on the ith image. The loss values of the insect-counting algorithm with NMS can be seen in Table 1. In the table, the IoU Thr. values indicate the overlap threshold levels used in the NMS algorithm.
Table 1. Loss values of our insect-counting algorithm with conventional NMS.
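Equation (2) translates directly into code; the sketch below assumes that the predicted and real counts are given as equal-length lists.

```python
def counting_loss(predicted_counts, real_counts):
    """Relative counting error of Equation (2), averaged over the sample images."""
    terms = [abs(cp - cr) / (cr + 1)
             for cp, cr in zip(predicted_counts, real_counts)]
    return sum(terms) / len(terms)
```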
In the NMS algorithm, neighboring detections are removed if their overlap is higher than a pre-defined threshold. As a consequence, other possible objects among the neighboring detections will be missed. In other words, every proposed box (bi) that is close to another object but has a lower score than bx (the neighboring box with the higher score) will be suppressed if the IoU threshold level is low. This means that true positive bounding boxes will be removed in those regions of the image where there are several nearby (or touching) insects. This phenomenon can be observed in Figure 9, where the conventional NMS cannot handle touching insects well. To overcome this issue, we tried the Soft-NMS method [27] instead of the conventional NMS.
The key idea of Soft-NMS is to keep neighboring detections but with a decreased probability value (score) as a function of their overlap. The authors of Soft-NMS proposed Equation (3) for rescoring the bounding boxes; in this equation, si is the score of box bi.
$$s_i = s_i \, e^{-\frac{\mathrm{IoU}(b_x,\, b_i)^2}{\sigma}} \tag{3}$$
An important parameter in the exponential penalty function above is σ, which controls the penalty strength (score decrease) for overlapping boxes.
After rescoring the overlapping boxes, we had to use another probability threshold to decide which boxes were true positives. To analyze the capabilities of Soft-NMS, we tried several probability threshold levels in combination with several σ values. The loss values of our insect-counting method with Soft-NMS can be found in Table 2.
Table 2. Loss values of our insect-counting algorithm with Soft-NMS.
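For reference, a minimal Soft-NMS sketch with the exponential (Gaussian) penalty of Equation (3) is given below. It follows the rescoring rule of Bodla et al. [27] and reuses the iou() helper from Section 3.2; the greedy loop structure and the default parameter values (σ = 0.01, probability threshold 0.1, the best-performing setting in Table 2) are illustrative assumptions rather than the original implementation.

```python
import math

def soft_nms(boxes, scores, sigma=0.01, score_threshold=0.1):
    """Gaussian Soft-NMS: instead of removing the neighbours of the current best
    box, decay their scores with the exponential penalty of Equation (3), then
    keep every box whose final score exceeds the probability threshold."""
    boxes, scores = list(boxes), list(scores)
    kept = []
    while boxes:
        best = max(range(len(scores)), key=lambda i: scores[i])
        best_box, best_score = boxes.pop(best), scores.pop(best)
        if best_score < score_threshold:
            break                                      # all remaining scores are lower
        kept.append((best_box, best_score))
        # Rescore the remaining boxes according to their overlap with the kept box.
        scores = [s * math.exp(-iou(best_box, b) ** 2 / sigma)
                  for b, s in zip(boxes, scores)]
    return kept
```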

5. Discussion

In the case of Soft-NMS, the bounding boxes around close insects are kept with a higher chance. As an example, Figure 10 shows the outcome of our moth-counting algorithm with Soft-NMS on the same test image.
Figure 10. Detected moths with Soft-NMS.
The smallest loss value was 0.164, with σ = 0.01 and a 0.1 probability threshold. Evidently, the disadvantage of Soft-NMS compared to NMS is the need for parameter adjustment. Although we tested only a very small subset of the possible σ and probability threshold combinations, the results in Table 2 show a clearly observable trend: as the exponential penalty function decays faster (smaller σ value), the probability threshold should also be reduced due to the stronger penalty. This trend can be exploited to adjust the parameters of Soft-NMS. We suggest using a quickly decaying exponential penalty function with a small probability threshold value.

6. Conclusions

In this work, the hardware and software design and the implementation of an automated insect pest counting system have been presented. We proposed a new embedded system-based insect trap with an OpenMV Cam H7 microcontroller board, which can be used anywhere in the field without the restrictions of previous traps.
In addition, we also proposed a deep learning-based insect-counting method instead of the previously used segmentation techniques. To overcome the lack-of-data problem reported in previous articles, we generated our own dataset, which is freely available to the scientific community, and used a pre-trained deep model for classification. The pre-trained model (MobileNetV2) was fine-tuned on our dataset.
For object proposal, the selective search algorithm was used. Using the IoU metric, we showed that the selective search algorithm can be applied reliably to trap images as an object proposal method. To suppress false proposals (bounding boxes), the NMS and Soft-NMS algorithms were investigated. Our test results show that Soft-NMS with the exponential penalty function achieves a lower loss (introduced in Equation (2)) than the conventional NMS method.
Even though the proposed insect-counting method is not yet perfect (due to the large difference between the training and test data), it can reliably be used in the field. In practice, spraying should be started when an abrupt change occurs in the number of captured insects. This event can be accurately indicated by the proposed trap and insect-counting method. Consequently, time and cost can be saved, and the environment will be better protected.

7. Future Work

The presented insect-counting algorithm can be further improved in different ways. One way is a more sophisticated parameter adjustment of Soft-NMS. Such a parameter adjustment requires an optimization technique such as Bayesian optimization [28,29]; however, this was beyond the scope of this work. Another proposal is the enlargement of the training set with additional trap images.

Funding

This work was supported by the EFOP-3.6.3-VEKOP-16-2017-00002 project. The project was supported by the European Union and co-financed by the European Social Fund.

Data Availability Statement

The data are available on a publicly accessible website that does not issue DOIs. Publicly available datasets were analyzed in this study. These data can be found here: https://irh.inf.unideb.hu/~sutoj/projects.php (accessed on 20 July 2021).

Acknowledgments

We would like to thank the Eközig Company for the technical background.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Zhong, Y.; Gao, J.; Lei, Q.; Zhou, Y. A vision-based counting and recognition system for flying insects in intelligent agriculture. Sensors 2018, 18, 1489.
  2. Sun, Y.; Liu, X.; Yuan, M.; Ren, L.; Wang, J.; Chen, Z. Automatic in-trap pest detection using deep learning for pheromone-based Dendroctonus valens monitoring. Biosyst. Eng. 2018, 176, 140–150.
  3. Rustia, D.J.A.; Lin, C.E.; Chung, J.Y.; Zhuang, Y.J.; Hsu, J.C.; Lin, T.T. Application of image and environmental sensor network for automated greenhouse insect pest monitoring. J. Asia Pac. Entomol. 2020, 23, 17–28.
  4. Wang, J.; Lin, C.; Ji, L.; Liang, A. A new automatic identification system of insect images at the order level. Knowl. Based Syst. 2012, 33, 102–110.
  5. Yalcin, H. Vision Based Automatic Inspection of Insects in Pheromone Traps. In Proceedings of the 2015 Fourth International Conference on Agro-Geoinformatics, Istanbul, Turkey, 20–24 July 2015; pp. 333–338.
  6. Bakkay, M.C.; Chambon, S.; Rashwan, H.A.; Lubat, C.; Barsotti, S. Automatic detection of individual and touching moths from trap images by combining contour-based and region-based segmentation. IET Comput. Vis. 2018, 12, 138–145.
  7. Wen, C.; Guyer, D. Image-based orchard insect automated identification and classification method. Comput. Electron. Agric. 2012, 89, 110–115.
  8. Liu, H.; Lee, S.H.; Chahl, J.S. A review of recent sensing technologies to detect invertebrates on crops. Precis. Agric. 2017, 18, 635–666.
  9. Martineau, M.; Conte, D.; Raveaux, R.; Arnault, I.; Munier, D.; Venturini, G. A survey on image-based insect classification. Pattern Recognit. 2017, 65, 273–284.
  10. Xie, C.; Zhang, J.; Rui, L.; Li, J.; Hong, P.; Xia, J.; Chen, P. Automatic classification for field crop insects via multiple-task sparse representation and multiple-kernel learning. Comput. Electron. Agric. 2015, 119, 123–132.
  11. Deng, L.; Wang, Y.; Han, Z.; Yu, R. Research on insect pest image detection and recognition based on bio-inspired method. Biosyst. Eng. 2018, 169, 139–148.
  12. Wu, X.; Zhang, C.; Lai, Y.K.; Cheng, M.M.; Yang, J. IP102: A Large Scale Benchmark Dataset for Insect Pest Recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 8787–8796.
  13. Kaggle, Fieldguide Challenge: Moths and Butterflies. Available online: https://www.kaggle.com/c/fieldguide-challenge-moths-and-butterflies/ (accessed on 4 May 2021).
  14. Qing, Y.; Jun, L.V.; Qing-jie, L.; Guang-qiang, D.; Bao-jun, Y.; Hong-ming, C.; Jian, T. An insect imaging system to automate rice light-trap pest identification. J. Integr. Agric. 2012, 11, 978–985.
  15. Cho, J.; Choi, J.; Qiao, M.; Ji, C.W.; Kim, H.Y.; Uhm, K.B.; Chon, T.S. Automatic identification of whiteflies, aphids, and thrips in greenhouse based on image analysis. Int. J. Math. Comput. Simul. 2007, 1, 46–53.
  16. Wang, Z.; Wang, K.; Liu, Z.; Wang, X.; Pan, S. A Cognitive Vision Method for Insect Pest Image Segmentation. In Proceedings of the 6th IFAC Conference on Bio-Robotics, Beijing, China, 13–15 July 2018; pp. 13–15.
  17. Wen, C.; Wu, D.; Hu, H.; Pan, W. Pose estimation-dependent identification method for moth images using deep learning architecture. Biosyst. Eng. 2015, 136, 117–128.
  18. Ding, W.; Taylor, G. Automatic moth detection from trap images for pest management. Comput. Electron. Agric. 2016, 123, 17–28.
  19. Zhu, Z.; Liang, D.; Zhang, S.; Huang, X.; Li, B.; Hu, S. Traffic Sign Detection and Classification in the Wild. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 2110–2118.
  20. Kasinathan, T.; Uyyala, S.R. Machine learning ensemble with image processing for pest identification and classification in field crops. Neural Comput. Appl. 2021.
  21. Xia, D.; Chen, P.; Wang, B.; Zhang, J.; Xie, C. Insect detection and classification based on an improved convolutional neural network. Sensors 2018, 18, 4169.
  22. Hong, S.J.; Kim, S.Y.; Kim, E.; Lee, C.H.; Lee, J.S.; Lee, D.S.; Bang, J.; Kim, G. Moth detection from pheromone trap images using deep learning object detectors. Agriculture 2020, 10, 170.
  23. Available online: https://irh.inf.unideb.hu/~sutoj/projects.php (accessed on 20 July 2021).
  24. Uijlings, J.R.R.; van de Sande, K.E.A.; Gevers, T.; Smeulders, A.W.M. Selective search for object recognition. Int. J. Comput. Vis. 2013, 104, 154–171.
  25. Suto, J. Plant leaf recognition with shallow and deep learning: A comprehensive study. Intell. Data Anal. 2020, 24, 1311–1328.
  26. Sandler, M.; Howard, A.; Zhu, M.; Zhmoginov, A.; Chen, L.C. MobileNetV2: Inverted Residuals and Linear Bottlenecks. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 4510–4520.
  27. Bodla, N.; Singh, B.; Chellappa, R.; Davis, L.S. Soft-NMS: Improving Object Detection with One Line of Code. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 5561–5569.
  28. Snoek, J.; Larochelle, H.; Adams, R.P. Practical Bayesian Optimization of Machine Learning Algorithms. In Proceedings of the Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA, 3–6 December 2012; pp. 2951–2959.
  29. Suto, J. The effect of hyperparameter search on artificial neural network in human activity recognition. Open Comput. Sci. 2021, 11, 411–422.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
