Next Article in Journal
Coastal Areas Division and Coverage with Multiple UAVs for Remote Sensing
Next Article in Special Issue
A Quadrilateral Geometry Classification Method and Device for Femtocell Positioning Networks
Previous Article in Journal
Estimation of Antenna Pose in the Earth Frame Using Camera and IMU Data from Mobile Phones
Previous Article in Special Issue
Liquid Temperature Measurements Using Two Different Tunable Hollow Prisms
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel Method of Identifying Paddy Seed Varieties

Department of Bio-Industrial Mechatronics Engineering, National Chung Hsing University, Tai-Chung 402, Taiwan
*
Author to whom correspondence should be addressed.
Sensors 2017, 17(4), 809; https://doi.org/10.3390/s17040809
Submission received: 11 February 2017 / Revised: 1 April 2017 / Accepted: 5 April 2017 / Published: 9 April 2017
(This article belongs to the Special Issue Innovative Sensing Control Scheme for Advanced Materials)

Abstract

:
This paper presents a novel method for identifying three varieties (Taikong 9, Tainan 11, and Taikong 14) of foundation paddy seeds. Taikong 9, Tainan 11, and Taikong 14 paddy seeds are indistinguishable by inspectors during seed purity inspections. The proposed method uses image segmentation and a key point identification algorithm that can segment paddy seed images and extract seed features. A back propagation neural network was used to establish a classifier based on seven features that could classify the three paddy seed varieties. The classification accuracies of the resultant classifier for varieties Taikong 9, Tainan 11, and Taikong 14 were 92.68%, 97.35% and 96.57%, respectively. The experimental results indicated that the three paddy seeds can be differentiated efficiently using the developed system.

1. Introduction

Paddy is one of the main crops in Taiwan and can be planted twice each year. Purity analysis is crucial for nurseries and farmers, and purity is determined by paddy variety inspection. Purity is defined by professional inspectors according to the paddy’s appearance, shape, and color. However, there are approximately 500 cases (every case including approximately 4000 paddy seeds) of incorrect purity analyses every year in Taiwan. The jobs of inspection burden the inspectors with loading. Because healthy seedlings from seedling propagation stations (nurseries) are used to cultivate fields of paddy, seed quality is a critical factor when growing seedlings.
Image processing is widely used to inspect grain. MousaviRad et al. [1] used a scanner to capture images of five Iranian rice kernel varieties and then extracted 41 shape features using image processing. The k-nearest neighbors algorithm, a support vector machine, and a backpropagation neural network (BPNN) were used for classification. The classifications of the support vector machine and BPNN were favorable, with accuracies of 97% and 96%, respectively. Mebatsion et al. [2] used a least-squares classifier to identify five varieties of grain through their shape and color features, which were extracted using image processing. The average accuracy of classification was 99.6%. Kuo et al. [3] used image processing and sparse-representation-based classification to distinguish between 30 varieties of rice grains. However, the appearances of these rice grains apparently differ.
Machine learning has been widely used in the establishment of classification mechanisms. Lee et al. [4] used a CCD (charge-coupled device) camera to capture seven varieties of grain kernel and extract 10 shape features and 4 color features using image processing. A BPNN was established in four forms with features identified using principal component analysis and linear discriminant analysis (LDA). The performance of the BPNN with one hidden layer and features in six dimensions (as identified using LDA) was favorable and its classification accuracy was 95%. Sun et al. [5] compared the advantages of a BPNN and a learning vector quantization network (LVQN) for the identification of thermal fuses. The results demonstrated that a BPNN with 20 hidden layer nodes, a learning rate of 0.01, and a tangent sigmoid transfer function exhibited good classification performance with a classification accuracy of 98.0%. However, the LVQN with 160 hidden layer nodes and a learning rate of 0.1 had 91% accuracy. Additionally, Huang [6] presented a BPNN classifier for sorting the quality of areca nuts according to geometric features. Zhang et al. [7] proposed an improved probabilistic neural network for classifying remote-sensing images. Subsequently, Zhang et al. [8] used a fitness-scaled chaotic artificial bee colony (FSCABC) algorithm and feedforward neural network (FNN) to classify fruit. The results showed that the accuracy of the FSCABC–FNN was higher than that of the genetic algorithm–FNN (84.8%), particle swarm optimization–FNN (87.9%), artificial bee colony algorithm (85.4%), and kernel support vector machine (88.2%).
The rice grains examined in previous studies [1,2,3,4] have different appearances and are more distinguishable than the Taikong 9 (TK9), Tainan 11 (TN11), and Taikong 14 (TK14) seeds, which were the focus of the present study. TK9, TN11, and TK14 are so similar on appearance that they are difficult to identify (Figure 1). Therefore, the purpose of this study was to establish an algorithm for recognizing these three paddy seeds. Specifically, the geometric features of the seeds were to be extracted and then used to differentiate between the three paddy seed varieties (TK9, TN11, and TK14).

2. Materials and Methods

2.1. Image Capture System and Experimental Samples

The image capture system developed in this study comprised a USB CCD color camera (DFK-21BU04, ImagingSource Inc., Taipei, Taiwan), a low-distortion lens (ML-MC25HR, MORITEX Inc., Saitama, Japan), a shadowless lamp (MSRL-CW33, MORITEX Inc.), and a computer (Intel Core i5-4460 CPU, 3.26 GB of RAM, Santa Clara, CA, USA). It captured RGB color images measuring 640 × 480 pixels in the bitmap format. The CCD camera was employed for image acquisition with 28,200 lx and the work distance was 11.0 cm. Image processing software was developed using Visual Basic 6.0 and the Matrox Imaging Library (MIL) 8.0. Paddy seeds (which were foundation seeds in 2014)—varieties TK9, TN11, and TK14 (Figure 1)—were provided by the Taiwan Seed Improvement and Propagation Station.

2.2. Image Segmentation

Segmenting of the paddy seed images is an essential procedure once the features of the paddy seeds have been extracted. The segmentation steps and results (Figure 2) are thus described:
Step 1
Red and hue band images are obtained from the original image.
Step 2
Red and hue band images are treated using a smoothing operator and converted into binary images with an optimum threshold value using Otsu’s method [9].
Step 3
Complete paddy seed binary images are obtained using the OR logic operator and filling operator on the hue and red binary images, respectively.
Step 4
The entire segmented image is obtained using the AND operator on the binary and original images.
In this study, paddy seeds were auto-segmented according to these steps:

2.3. Feature Extraction

Key lines (Figure 3) defined by the contour of a seed are related to the geometric features of the seed. AB ¯ , CD ¯ , CO ¯ , DO ¯ , P 1 P 2 ¯ , P 3 P 4 ¯ , and O were identified according to a contour-following algorithm [9]. The features of the seeds are defined as follows:
  • The lemma, palea, glume, and chaff tip of a seed are illustrated in Figure 4.
  • AB ¯ is the longest line segment in the seed contour.
  • O is the midpoint of AB ¯ .
  • CD ¯ is the perpendicular bisector of AB ¯ , and thus O is the intersection of AB ¯ and CD ¯ .
  • CO ¯ crosses the lemma.
  • DO ¯ crosses the palea.
  • P 1 P 2 ¯ is the perpendicular line crossing the 1/5 position of AB ¯ .
  • P 3 P 4 ¯ is the perpendicular line crossing the 4/5 position of AB ¯ .
In this study, several special geometric features were extracted to identify seed varieties TK9, TN11, and TK14. Two concaves (RK and RL) to the side of the chaff tip are indicated by red curves in Figure 5. Points L, Lu, Ld, K, Ku, and Kd (Figure 6) on the concaves were crucial for feature extraction. The hull points Ld, Lu, Ku, and Kd of the seed contour were obtained using the convex hull algorithm [10].
Geometric features analysis was employed extensively for classification. In this study, seven features were extracted using the developed algorithm. The feature definition and extraction method proceeded as follows:
(1)
AB ¯ is the longest line segment on the seed contour.
(2)
CD ¯ is the perpendicular bisector of AB ¯ .
(3)
The chaff-tip width ( LK ¯ ) is as illustrated in Figure 7.
(4)
The height (hc = max(hi)) of the chaff tip is the maximum height of the chaff tip from LK ¯ , where di is the distance between a point on the chaff-tip contour and LK ¯ , as illustrated in Figure 7.
(5)
The depth dK is the maximum distance between K u K d ¯ and the concave RK (dK K u K d ¯ ), as indicated in Figure 8. dK is obtained when dK = max(di) at point K.
(6)
The depth dL can be similarly computed, as shown in Figure 8.
(7)
The interior angle φ is described by KK d ¯ and LL d ¯ and illustrated in Figure 9.

2.4. Classifier

In this study, geometric features were employed to differentiate between three paddy seed varieties: TK9, TN11, and TK14. A total of seven geometric features ( AB ¯ , the perpendicular bisector CD ¯ , the chaff-tip width LK ¯ and height, the depths dK and dL, and the interior angle φ) were applied in a BPNN [11]. The BPNN classifier consisted of input, hidden, and output layers. The input features were normalized between 0 and 1. The output layer was composed of nodes related to the three categories: TK9, TN11, and TK14. The number of nodes in the hidden layer ( n h ) was calculated using the following formula [12]:
n h = n i + n o + k
where ni and no are the number of input and output nodes, respectively, k = −2, 0, 2. The structure of the BPNN classifier is illustrated in Figure 10, wherein Wij and bij are the weight and bias of the input layer in the hidden layer and Wjk and bjk are the weight and bias of the hidden layer in the output layer. Xi, Hj, and Ok denote the input layer, hidden layer, and output layer values, respectively.
After its structure was determined, the BPNN classifier was trained. The purpose of BPNN training is to identify relationships between patterns composed of features in each variety of paddy seed. During training, the BPNN classifier analyzed training samples at a given learning rate, and its weights and biases were adjusted until the mean squared error was less than the tolerance error, which indicated that the BPNN classifier had completed training and its weights and biases were stable. In this study, the BPNN classifier analyzed 500 training samples of each variety at a learning rate of 0.01 before training was complete, as defined by a tolerance error of 0.01.

3. Results and Discussion

The identifying software for the paddy seeds was developed using Visual Basic 6.0 and MIL 8.0. The functions of software include file operations (acquire, load, and save images), image analysis operations (i.e., binary operator, hole-filling, remove noises using closing, opening, and smoothing), the feature extraction, and BPNN. The variety of paddy can be identified by the software computation accurately and rapidly.
In this study, 1,156 paddy seeds of variety TK9, 1,180 paddy seeds of variety TN11, and 1170 paddy seeds of variety TK14 were used as experimental samples. Of these, 500 seeds of each variety were used as training samples to establish the BPNN classifier, and the remainder was used to test the BPNN classifier. Overfitting often occurs when the training set contains some incorrect samples in the BPNN. However, because the varieties of seed in the training samples were known prior to the training process, overfitting was unlikely to occur here.
The classification accuracies of the BPNN are presented in Table 1 and Table 2, obtained using the TK9, TN11, and TK14 test samples when the number of nodes in the hidden layer was 8, 10, or 12. The accuracy of the BPNN was highest when the number of nodes was 10; nevertheless, the results were also mostly accurate when there were 8, 10, or 12 hidden nodes. We also compared the BPNN with a Bayesian classifier [9] using the same samples and features, the results of which are presented in Table 2. The BPNN’s accuracy was slightly higher than that of the Bayesian classifier for samples TN11 and TK14, but the Bayesian classifier performed favorably for samples TK9. Thus, the BPNN and Bayesian classifiers have the same classification ability for the TK9, TN11, and TK14 varieties. As mentioned previously, the features employed in this study can classify the paddy varieties using different classifiers.
The accuracy of the BPNN when presented with variety TK9 was the lowest, probably because the correlation between the adopted features and the TK9 seed’s contour was not sufficiently strong in this study.

4. Conclusions

In this study, we developed a novel method for classifying three varieties (Taikong 9, Tainan 11, and Taikong 14) of foundation paddy seeds. The shape features of the seeds were obtained to establish a BPNN classifier. The test results show that three varieties of foundation paddy seeds can be classified efficiently with this method. In a future study, we intend to further refine the classification algorithm or use other classifiers to increase the seed classification accuracy.

Acknowledgments

The authors thank the Taiwan Seed Improvement and Propagation Station (Contract No. 103B051-B) and the Ministry of Science and Technology, Taiwan (Contract No. MOST 105-2313-B-005-023) for financially supporting this research.

Author Contributions

Huang K.Y. had the initial idea to develop a sorting system for Chinese cabbage seeds. Huang K.Y. and Chien M.C. developed the algorithms and classifier. Chien M.C. wrote the programs and performed the experiment. Huang, K.Y. contributed to the paper organization and added technical writing to the final manuscript.

References

  1. MousaviRad, S.J.; Rezaee, K.; Nasri, K. A new method for identification of Iranian rice kernel varieties using optimal morphological features and an ensemble classifier by image processing. Majlesi J. Multimedia Process. 2012, 1, 1–8. [Google Scholar]
  2. Mebatsion, H.K.; Paliwal, J.; Jayas, D.S. Automatic classification of non-touching cereal grains in digital images using limited morphological and color features. Comput. Electron. Agric. 2012, 90, 99–105. [Google Scholar] [CrossRef]
  3. Kuo, T.Y.; Chung, C.L.; Chen, S.Y.; Lin, H.A.; Kuo, Y.F. Identifying rice grains using image analysis and sparse-representation-based classification. Comput. Electron. Agric. 2016, 127, 716–725. [Google Scholar] [CrossRef]
  4. Lee, C.Y.; Yan, L.; Wang, T.F.; Lee, S.R.; Park, C.W. Intelligent classification methods of grain kernels using computer vision analysis. Meas. Sci. Technol. 2011, 22, 64006–64012. [Google Scholar] [CrossRef]
  5. Sun, T.H.; Tien, F.C.; Kuo, R.J. Automated thermal fuse inspection using machine vision and artificial neural networks. J. Intell. Manuf. 2016, 27, 639–651. [Google Scholar] [CrossRef]
  6. Huang, K.Y. Detection and classification of areca nuts with machine vision. Comput. Math. Appl. 2012, 64, 739–746. [Google Scholar] [CrossRef]
  7. Zhang, Y.; Wu, L.; Neggaz, N.; Wang, S.; Wei, G. Remote-sensing image classification based on an improved probabilistic neural network. Sensors 2009, 9, 7516–7539. [Google Scholar] [CrossRef] [PubMed]
  8. Zhang, Y.; Wang, S.; Ji, G.; Phillips, P. Fruit classification using computer vision and feedforward neural network. J. Food Eng. 2014, 143, 167–177. [Google Scholar] [CrossRef]
  9. Gonzalez, R.C.; Woods, R.E. Digital Image Processing, 3rd ed.; Prentice Hall: Upper Saddle River, NJ, USA, 2002. [Google Scholar]
  10. Cormen, T.H.; Leiserson, C.E.; Rivest, R.L.; Stein, C. Finding the convex hull. In Introduction to Algorithms, 3rd ed.; The MIT Press: London, UK, 2009. [Google Scholar]
  11. Hagan, M.T.; Demuth, H.B.; Beale, M.H.; Jesus, O.D. Neural Network Design, 2nd ed.; eBook; Oklahoma State University: Stillwater, OK, USA, 2014. [Google Scholar]
  12. Rocco, F.; Governi, L.; Volpe, Y. ANN-based method for olive Ripening Index automatic prediction. J. Food Eng. 2010, 101, 318–328. [Google Scholar]
Figure 1. Three paddy seed varieties.
Figure 1. Three paddy seed varieties.
Sensors 17 00809 g001
Figure 2. Segmentation procedure.
Figure 2. Segmentation procedure.
Sensors 17 00809 g002
Figure 3. Key lines in the seed contour.
Figure 3. Key lines in the seed contour.
Sensors 17 00809 g003
Figure 4. Lemma, palea, glume, and chaff tip of the seed.
Figure 4. Lemma, palea, glume, and chaff tip of the seed.
Sensors 17 00809 g004
Figure 5. Two concaves at the chaff tip.
Figure 5. Two concaves at the chaff tip.
Sensors 17 00809 g005
Figure 6. Crucial points on the concaves.
Figure 6. Crucial points on the concaves.
Sensors 17 00809 g006
Figure 7. Width and height of chaff tip.
Figure 7. Width and height of chaff tip.
Sensors 17 00809 g007
Figure 8. Depths of concaves.
Figure 8. Depths of concaves.
Sensors 17 00809 g008
Figure 9. Interior angle φ described by concaves.
Figure 9. Interior angle φ described by concaves.
Sensors 17 00809 g009
Figure 10. Structure of backpropagation neural network (BPNN) classifier.
Figure 10. Structure of backpropagation neural network (BPNN) classifier.
Sensors 17 00809 g010
Table 1. Results using BPNNs.
Table 1. Results using BPNNs.
nh = 8
VarietyTK9TN11TK14
TK9606415
TN112366515
TK142711640
Classification Accuracy (%)92.3897.7995.52
Average accuracy (%)95.26
nh = 10
VarietyTK9TN11TK14
TK9608513
TN112166210
TK142713647
Classification Accuracy (%)92.6897.3596.57
Average accuracy (%)95.56
nh = 12
VarietyTK9TN11TK14
TK9608512
TN112166215
TK142713643
Classification Accuracy (%)92.6897.3595.97
Average accuracy (%)95.36
Table 2. Results using Bayes classifier.
Table 2. Results using Bayes classifier.
VarietyTK9TN11TK14
TK9613810
TN112565110
TK141821646
Total656680670
Classification accuracy (%)93.4495.796.4
Average accuracy (%)95.21

Share and Cite

MDPI and ACS Style

Huang, K.-Y.; Chien, M.-C. A Novel Method of Identifying Paddy Seed Varieties. Sensors 2017, 17, 809. https://doi.org/10.3390/s17040809

AMA Style

Huang K-Y, Chien M-C. A Novel Method of Identifying Paddy Seed Varieties. Sensors. 2017; 17(4):809. https://doi.org/10.3390/s17040809

Chicago/Turabian Style

Huang, Kuo-Yi, and Mao-Chien Chien. 2017. "A Novel Method of Identifying Paddy Seed Varieties" Sensors 17, no. 4: 809. https://doi.org/10.3390/s17040809

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop