- freely available
Sensors 2013, 13(2), 1872-1883; doi:10.3390/s130201872
Abstract: Two sensitive wavelength (SW) selection methods combined with visible/near infrared (Vis/NIR) spectroscopy were investigated to determine the levels of some trace elements (Fe, Zn) in rice leaf. A total of 90 samples were prepared for the calibration (n = 70) and validation (n = 20) sets. Calibration models using SWs selected by LVA and ICA were developed and nonlinear regression of a least squares-support vector machine (LS-SVM) was built. In the nonlinear models, six SWs selected by ICA can provide the optimal ICA-LS-SVM model when compared with LV-LS-SVM. The coefficients of determination (R2), root mean square error of prediction (RMSEP) and bias by ICA-LS-SVM were 0.6189, 20.6510 ppm and −12.1549 ppm, respectively, for Fe, and 0.6731, 5.5919 ppm and 1.5232 ppm, respectively, for Zn. The overall results indicated that ICA was a powerful way for the selection of SWs, and Vis/NIR spectroscopy combined with ICA-LS-SVM was very efficient in terms of accurate determination of trace elements in rice leaf.
Recently, variable selection or uninformative variable elimination has attracted more and more attention for the development of multi-component calibrations using spectroscopic techniques. The recently developed methods for variable selection include generalized simulated annealing , genetic algorithm , correlation coefficients and B-matrix coefficients , latent variables analysis (LVA) , x-loading weights , uninformative variable elimination , regression coefficient analysis (RCA) [7,8], independent component analysis (ICA) [9,10] and so on. Among these methods, ICA has recently attracted much attention and has been successfully used in many fields, e.g., medical signal analysis, image processing, dimension reduction, fault detection and near-infrared spectral data analysis [11–15].
Various calibration methods have been used to relate near-infrared spectra (NIRS) with measured properties of materials. Principal components regression (PCR), partial least squares (PLS), multiple linear regression (MLR) and artificial neural networks (ANN) are the most used multivariate calibration techniques for NIRS [16–19]. PLS is usually considered for a large number of applications in fruit and juice analysis and is widely used in multivariate calibration because it takes advantage of the correlation relationships that already exist between the spectral data and the constituent concentrations. However PLS is based on linear models and unsatisfactory results may occur when non-linearity is present [20,21].
The least-squares support vector machine (LS-SVM) can handle the linear and nonlinear relationships between the spectra and response chemical constituents [22,23], therefore, a new combination of ICA with LS-SVM was proposed as a nonlinear calibration model for quantitative analysis using spectroscopic techniques. The performance of ICA-LS-SVM was evaluated by a case study to determine the trace elements in rice, with the purpose of developing a fast and accurate nonlinear model using fewer selected variables for the determination of the trace elements in rice.
The objective of this study were (1) to investigate the feasibility of using Vis/NIRS to predict trace elements such as Fe and Zn in rice leaf; (2) to compare the performance of ICA and the newly proposed ICA-LS-SVM model, variable selection methods (PCA, LVA and ICA) to predict the trace elements in rice.
2. Materials and Methods
2.1. Experimental Design
The experimental samples in this study were 15 basins of rice, which were planted in conditioned soil with three nitrogen levels: 0, 120, and 240 kg/ha. To avoid accidental damage to the basins or samples, a duplicate set of basins was prepared, so there were 30 basins in total. For each nitrogen level, there were 10 basins, including the additional basins. Each basin's inner diameter and height were 30 and 45 cm, respectively. Each basin contained 10 kg soil and four rice plants. The basins were placed in a slotted field using the surrounding soil for backfill, and they were placed along the line from north to south. The soil used in this experiment was from the 20 to 40 cm depth of the experimental field.
2.2. Data Acquisition and Preprocessing
Three leaf samples from each of 15 basins were selected for spectral measurement. Samples were also selected from another 15 replicate basins, so a total of 90 samples were obtained. The measurements were made at the booting stages. All 90 leaf samples reflectance measurements were made using a portable Spectroradiometer (FieldSpec Vis/NIR, Analytical Spectral Device, Boulder, CO, USA), with a sensitivity range from 325 to 1,075 nm. The instrument uses a sensitive 512-element, photo-diode array spectroradiometer, with a resolution of 3.5 nm. The scan number for each spectrum was set to 10 at the same position, and for each sample, three reflection spectra were taken, thus a total of 30 data points were properly stored for later analysis. To achieve the relative reflectance measurements, the white reference (a white panel purchased with the spectroradiometer used as white reference) was collected before scanning samples until a nice, clean, 100% reference line was obtained. All leaves were randomly divided into two sets, one was used as a calibration set (n = 70) and the remaining samples as a validation set (n = 20). In order to compare the performance of different calibration models, the samples in the calibration and validation sets were kept the same for all the models.
2.3. Trace Elements (Fe, Zn) Measurement
In the study, we used the national standard method to measure the trace elements Fe and Zn . First, HNO3, HClO4, and distilled water were diluted and adjusted to the required concentration solution. Rice leaf samples were finely ground and then passed through a 20 mesh sieve to obtain very fine particles. An air-dried, ground and sieved sample (2.0 g) was placed in an Erlenmeyer flask and the extracting solution (20 mL) was added. Then it was placed on a magnetic stirrer and the mixture was stirred for 20 minutes. The resulting solution was filtered through a filter paper into a 50 mL polypropylene vial and diluted to 50 mL with the extracting solution. After that, a Perkin-Elmer Analyst™800 atomic absorption spectrometer (PerkinElmer, Inc., Shelton, CT, USA) was used to measure the signal strength of the elements Fe and Zn in each Erlenmeyer flask, and the results were shown using the software package of the instrument. After calculation, the Fe content was from 39.951 ppm to 134.254 ppm, and Zn content was from 9.085 ppm to 49.927 ppm in all 90 samples. Table 1 shows the statistic values of Fe and Zn contents in calibration and validation sets.
2.4. Data Pretreatment
Due to the potential system imperfections, obvious scattering noises could be observed at the beginning and end of the spectral data. Thus, the first and last 75 wavelength data points were eliminated to improve the measurement accuracy, i.e., all visible and NIR spectroscopy analyses were based on a 400–1,000 nm scan. The above spectral data preprocessing was finished in ViewSpec Pro V4.02 (Analytical Spectral Device, Inc.). After that, the spectral data was preprocessed using Savitzky-Golay smoothing with a window width of 7 (3-1-3) points . The data preprocessing was implemented by the software Unscrambler V 9.6 (Camo Process AS, Oslo, Norway).
2.5. Principal Components Analysis (PCA)
Reducing the number of inputs to the LS-SVM can reduce training time. Furthermore, it can also reduce repetition and redundancy of the input spectra data. PCA is a method of data reduction that constructs new uncorrelated variables, known as principal components (PCs). They account for as much information as possible for the variability of the original variables, which are then used as the inputs of network. In addition, PCs can also eliminate noises and random errors in the original data. The equation of PCA could be described as follows:
2.6. Partial Least Squares Analysis
In the development of PLS model, calibration models were built between the spectra and the content of trace element (Fe and Zn), full cross-validation was used to evaluate the quality and to prevent over-fitting of calibration models. Latent variables (LVs) can be used to reduce the dimensionality of data, and the optimal number of LVs was determined by the lowest value of predicted residual error sum of squares (PRESS). The prediction performance was evaluated by the coefficients of determination (R2) and root mean square error of calibration (RMSEC) or prediction (RMSEP), and bias. The ideal model should have higher r value, lower RMSEC, RMSEP and bias. The RMSEP and bias could be calculated via:
2.7. Independent Component Analysis
Independent component analysis is a well-established statistical signal processing technique that aims to decompose a set of multivariate signals into a base of statistically independent components with the minimal loss of information content. The independent components are latent variables, meaning that they cannot be directly observed, and the independent component must have non-Gaussian distributions. A brief explanation of noise-free ICA model could be expressed by the following equation:
There are lots of algorithms for performing ICA . Among these algorithms, the fast fixed-point algorithm (FastICA), which was developed by Hyvarinen and Oja , is highly efficient for performing the estimation of ICA. FastICA was chosen for ICA and carried out in Matlab 7.0 (The Math Works, Natick, MA, USA).
2.8. Least Squares-Support Vector Machine
Least squares-support vector machine can work with linear or non-linear regression or multivariate function estimation in a relatively fast way . It uses a linear set of equations instead of a quadratic programming (QP) problem to obtain the support vectors (SVs). The details of LS-SVM algorithm could be found in the literature [29,30]. The LS-SVM model can be expressed as:
In the model development using LS-SVM and radial basis function (RBF) kernel, the optimal combination of gam(γ) and sig2(σ2) parameters was selected when resulting in smaller root mean square error of cross validation (RMSECV). In this study, gam(γ) were optimized in the range of 2−1–210 and 2–215 for sig2(σ2) with adequate increments. These ranges were chosen from previous studies where the magnitude of parameters was optimized. The grid search had two steps the first step was for a crude search with a large step size, and the second step was for the specified search with a small step size. The free LS-SVM toolbox (LS-SVM v 1.5, Suykens, Leuven, Belgium) was applied with MATLAB 7.0 to develop the calibration models.
3. Results and Discussion
3.1. Overview of Spectra and Statistic Values of Trace Elements
The lack of trace elements such as Fe, S, Mg, Mn may reduce the chlorophyll content of plant leaf, and will affect the solar radiation absorption by the leaf, so the changes of plant nutritional elements such as nitrogen, water content, and trace elements may directly result in the spectral reflectance changes . Figure 1(a) shows the Vis/NIRS spectral curves of 90 leaf samples. The trend of spectral curves in Vis/NIR region is similar, a small peak appeared at the green band from 560 to 580 nm, and reflectance increased rapidly at about 690–740 nm (red edge) from 10% to 30%–70%. Wavelengths at 580 nm were close to the green pigments, and wavelengths near 680 nm or 710–730 nm was at the red edge position .Treated them with 2nd derivative, some peaks and valleys were shown in Figure 1(b). There exists peaks at the wavebands near 690–700 nm, and at the wavebands 720–740 nm and 550–570 nm are troughs.
3.2. PLS Models
Calibration models were built between the spectra and content of trace elements (Fe and Zn). Different LVs were applied to build the calibration models, and no outliers were detected in the calibration set during the development of PLS models. The models were used to predict the left 20 samples, and the best performance was achieved with six LVs for Fe and five LVs for Zn. The R2, RMSEP and bias were 0.3820, 26.1431 ppm and −9.3674 ppm for Fe, 0.5800, 6.9637 ppm and 2.2320 ppm for Zn, respectively.
3.3. LS-SVM Models with Different SWs Selection Methods
3.3.1. PCA-LS-SVM Models
PCs obtained from PCA were applied as inputs of LS-SVM models to improve the training speed and reduce the training error of Vis/NIR model because the training time increased with the square of the number of training samples and linearly with the number of variables. From the aforementioned analysis of the performance of PCA models, the PCs from the Vis/NIR region were used as new eigenvectors to enhance the features of spectra and reduce the dimensionality of the spectra data matrix. Several PCs were extracted from the spectra of 90 samples.
Before the LS-SVM calibration model was built, three steps are crucial for the optimal input feature subset, proper kernel function and the optimal kernel parameters. Firstly, the six PCs obtained from PCA analysis were used as the input data set, and the accumulated contribution of it was reached 95.2%. Secondly, radial basis function could handle the nonlinear relationships between the spectra and target attributes. Finally, two important parameters gam (γ) and sig2 (σ2) should be optimal for RBF kernel function as aforementioned in multivariate analysis.
The performance of the Vis/NIR models was evaluated by 20 samples in validation set. The R2, RMSEP and bias for validation sets were 0.4012, 23.9920 ppm and −7.8789 ppm for Fe, 0.6109, 6.5308 ppm and 2.0571 ppm for Zn, respectively. Figure 2(a,b) compare the predicted values and measured values for Fe and Zn, respectively, by the PCA-LS-SVM model. The diagonal line (y = x) shows the ideal results that mean the predicted values are equal to the measured values. The closer the sample plots are to this line, the better is the model. From these figures, the sample plots in the validation sets were distributed near the ideal line for Zn, but the prediction performance is not good for Fe.
3.3.2. LV-LS-SVM Models
Latent variables obtained from PLS were applied as inputs of LS-SVM models to improve the training speed and reduce the training error of Vis/NIR model. From the aforementioned analysis of the performance of PLS models, the LVs from the Vis/NIR region were used as new eigenvectors to enhance the features of spectra and reduce the dimensionality of the spectra data matrix. Several LVs were extracted from the spectra of 90 samples.
The performance of the Vis/NIR models was evaluated by 20 samples in validation set. With a comparison of the results for calibration and validation sets, the best performance was achieved with six LVs for Fe and five LVs for Zn. The R2, RMSEP and bias for validation sets were 0.4070, 23.3845 ppm and −7.4975 ppm for Fe, 0.6067, 6.4869 ppm and 2.2336 ppm for Zn, respectively. Figure 3(a,b) show the predicted versus reference charts. Compared with the PCA-LS-SVM models, the prediction performance for Fe was improved a little, but still not good. The PCA-LS-SVM calibration model has better performance than the LV-LS-SVM model for Zn.
3.3.3. ICA-LS-SVM Models
Independent component analysis was applied for the selection of sensitive wavelengths (SWs), which could reflect the main features of the raw absorbance spectra. FastICA was used to the preprocessed spectra data, and the main absorbance peaks and valleys were indicated by the spectra of ICs. The SWs were selected by the weights of the first four ICs, which wavelengths with the highest weights of each IC were selected as the SWs. Figure 4(a,b) show the four ICs for Fe and Zn. Six SWs were selected corresponding to four ICs, and they were wavelengths near 680, 580, 960, 730, 760 and 830 nm for Fe, 680, 710, 640, 720, 580, and 800 nm for Zn. In order to evaluate the performance of SWs, they were applied as the input data matrix to develop the ICA-LS-SVM models. The validation results showed the R2, RMSEP and bias were 0.6189, 20.6510 ppm and −12.1549 ppm for Fe, 0.6731, 5.5919 ppm, 1.5232 ppm for Zn, respectively. Figure 5(a,b) show the predicted versus reference graphs. The ICA-LS-SVM models achieved a better performance compared to the best LV-LS-SVM models both in calibration and validation sets. Wavelengths at 580 nm were close to the chlorophyll content of leaf, and wavelengths at 680 nm, 710 nm or 720 nm were near the red edge position. The wavelength 960 nm was close to the water absorbance bands, and it means Fe may affect by the intimidating of water . Therefore, the selection of SWs was suitable for such situation in the present study and the effectiveness of SWs was also validated. The SWs represented most of the features of the original spectra, and could replace the whole wavelength region to predict the trace elements in rice.
Ma et al. reported that the element Co had high correlation near the wavelength 569.22 nm, with an R2 value of 0.623 . They claimed this might caused by the variation of chlorophyll content. Al Abbas et al. studied the spectra of “normal” and six types of nutrient-deficient maize leaves, and it showed that the chlorophyll concentration of leaves in all nutrient-deficiency treatments was lower than of leaves in the control . This was accordant with the results concerning Co in the paper of Ma et al. In our study, Fe belongs to the family of iron elements, and Zn is kindred with sulfur elements. Fe and Co belong to the same element family, so it is normal that the spectral response of Fe is similar to that of Co, and both of them have high correlation near the wavelength of 580 nm. For Zn, the sensitive wavelengths near 680, 710 and 720 nm were near the red edge.
3.4. Analysis of the Results
Compared with the above PLS, PCA-LS-SVM, LV-LS-SVM and ICA-LS-SVM model, the nonlinear PCA-LS-SVM, LV-LS-SVM, ICA-LS-SVM models turned out to be better than linear model of PLS. The best model was obtained by using the ICA-LS-SVM model for prediction of trace elements in rice. Table 2 shows all the parameters of RMSEP and R2 in the four models.
The ICA-LS-SVM models had a better performance, and the reason might be that the LS-SVM models took the nonlinear information of the spectral data into consideration and the nonlinear information had improved the prediction precision. The ICs from ICA were obtained by a high-order statistic that is much stronger condition than orthogonality, so the SWs selected from ICs were more effective, and it could be very helpful for the development of portable instrument or real-time monitoring of the rice trace elements.
Vis/NIR spectroscopy was successfully utilized for the determination of some trace elements (Fe, Zn) in rice. A new combination of ICA-LS-SVM was proposed with comparison of nonlinear LV-LS-SVM models, PCA-LS-SVM modes and linear PLS models. ICA-LS-SVM model turned out to be the best for prediction of trace elements in rice, and was better than the nonlinear LV-LS-SVM model. The R2, RMSEP and bias by ICA-LS-SVM were 0.6189, 20.6510 ppm and −12.1549 ppm for Fe, and 0.6731, 5.5919 ppm and 1.5232 ppm for Zn, respectively. The overall results demonstrated ICA was a powerful tool for variable selection, and the newly proposed ICA-LS-SVM method could be applied as an alternative fast and accurate method for the determination of trace elements in rice.
This work was supported by the 863 National High Technology Research and Development Program of China (2011AA100705), Zhejiang Provincial Natural Science Foundation of China (Z3090295), and China Postdoctoral Science Foundation (2011M501009).
- Kalivas, J.H.; Roberts, N.; Sutter, J.M. Global optimization by simulated annealing with wavelength selection for ultraviolet-visible spectrophotometry. Anal. Chem. 1989, 61, 2024–2030. [Google Scholar]
- Jouan-Rimbaud, D.; Massart, D.L.; Leardi, R.; de Noord, O.E. Genetic algorithms as a tool for wavelength selection in multivariate calibration. Anal. Chem. 1995, 67, 4295–4301. [Google Scholar]
- Min, M.; Lee, W.S. Determination of significant wavelengths and prediction of nitrogen content for citrus. Trans. ASABE 2005, 48, 455–461. [Google Scholar]
- Christensen, W.F.; Amemiya, Y. Latent variable analysis of multivariate spatial data. J. Am. Stat. Assoc. 2002, 97, 302–317. [Google Scholar]
- Esbensen, K.H. Multivariate Data Analysis in Practice, 5th ed; CAMO Process As: Oslo, Norway, 2002. [Google Scholar]
- Centner, V.; Massart, D.L.; de Noord, O.E.; de Jong, S.; Vandeginste, B.M.; Sterna, C. Elimination of uninformative variables for multivariate calibration. Anal. Chem. 1996, 68, 3851–3858. [Google Scholar]
- Liu, F.; He, Y.; Wang, L. Determination of effective wavelengths for discrimination of fruit vinegars using near infrared spectroscopy and multivariate analysis. Anal. Chim. Acta 2008, 615, 10–17. [Google Scholar]
- Chong, I.G.; Jun, C.H. Performance of some variable selection methods when multicollinearity is present. Chemom. Intell. Lab. Syst. 2005, 78, 103–112. [Google Scholar]
- Hyvarinen, A.; Karhunen, J.; Oja, E. Independent Component Analysis; John Wiley & Sons: New York, NY, USA, 2001. [Google Scholar]
- Krier, C.; Rossi, F.; François, D.; Verleysen, M. A data-driven functional projection approach for the selection of feature ranges in spectra with ICA or cluster analysis. Chemom. Intell. Lab. Syst. 2008, 91, 43–53. [Google Scholar]
- Hyvarinen, A. Sparse code shrinkage: Denoising of nongaussian data by maximum likelihood estimation. Neural Comput. 1999, 11, 1739–1768. [Google Scholar]
- Hoyer, P.O.; Hyvarinen, A. Independent component analysis applied to feature extraction from colour and stereo images. Netw. Comput. Neural Syst. 2000, 11, 191–210. [Google Scholar]
- Hyvarinen, A.; Hoyer, P.O. Emergence of phase and shift invariant features by decomposition of natural images into independent feature subspaces. Neural Comput. 2000, 12, 1705–1720. [Google Scholar]
- Chen, J.; Wang, X.Z. A new approach to near-infrared spectral data analysis using independent component analysis. J. Chem. Inform. Comput. Sci. 2001, 41, 992–1001. [Google Scholar]
- Bi, X.; Li, T.H.; Wu, L. Application of independent component analysis to the IR spectra analysis. Chem. J. Chin. Univ. 2004, 25, 1023–1027. [Google Scholar]
- Mobley, P.R.; Kowalski, B.R.; Bro, R. Review of chemometrics applied to spectroscopy: 1985–98. Part 1. Appl. Spectrosc. Rev. 1996, 31, 73–124. [Google Scholar]
- Balabin, R.M.; Safieva, R.Z.; Lomakina, E.I. Wavelet neural network (WNN) approach for calibration model building based on gasoline near infrared (NIR) spectra. Chemom. Intell. Lab. Syst. 2008, 93, 58–62. [Google Scholar]
- Balabin, R.M.; Safieva, R.Z. Gasoline classification by source and type based on near infrared (NIR) spectroscopy data. Fuel 2008, 87, 1096–1101. [Google Scholar]
- Yang, H.; Griffiths, P.R.; Tate, J.D. Comparison of partial least squares regression and multi-layer neural networks for quantification of non-linear systems and application to gas phase fourier transfrom infrared spectra. Anal. Chim. Acta 2003, 489, 125–136. [Google Scholar]
- Cozzolino, D.; Cynkar, W.U.; Shah, N.; Dambergs, R.G.; Smith, P.A. A brief introduction to multivariate methods in grape and wine analysis. Int. J. Wine Res. 2009, 1, 123–130. [Google Scholar]
- Berrueta, L.A.; Alonso-Salces, R.M.; Heberger, K. Supervised pattern recognition in food analysis. J. Chromatogr. A 2007, 1158, 196–214. [Google Scholar]
- Suykens, J.A.K.; Vanderwalle, J. Least squares support vector machine classifiers. Neural Process. Lett. 1999, 9, 293–300. [Google Scholar]
- Suykens, J.A.K.; van Gestel, T.; de Brabanter, J.; de Moor, B.; Vandewalle, J. Least Squares Support Vector Machines; World Scientific: Singapore, Singapore, 2002. [Google Scholar]
- Lindsay, W.L.; Norvell, W.A. Development of a DTPA soil test for zinc, iron, manganese, and copper. Soil Sci. Soc. Am. J. 1978, 42, 421–428. [Google Scholar]
- Savitzky, A.; Golay, M.J.E. Smoothing and differentiation of data by simplified least squares procedures. Anal. Chem. 1964, 36, 1627–1639. [Google Scholar]
- Lee, T.W. Independent Component Analysis: Theory and Application; Kluwer: Boston, MA, USA, 1998. [Google Scholar]
- Hyvarinen, A.; Oja, E. Independent component analysis: Algorithms and applications. Neural Netw. 2000, 13, 411–430. [Google Scholar]
- Borin, A.; Ferrao, M.F.; Mello, C.; Maretto, D.A.; Poppi, R.J. Least-squares support vector machines and near infrared spectroscopy for quantification of common adulterants in powdered milk. Anal. Chim. Acta 2006, 579, 25–32. [Google Scholar]
- Guo, H.; Liu, H.P.; Wang, L. Method for selecting parameters of least squares support vector machines and application. J. Syst. Simul. 2006, 18. [Google Scholar]
- Chen, Q.S.; Zhao, J.W.; Fang, C.H.; Wang, D.M. Feasibility study on identification of green, black and Oolong teas using near-infrared reflectance spectroscopy based on support vector machine (SVM). Spectrochim. Acta A 2007, 66, 568–574. [Google Scholar]
- Ma, C.F.; Ma, J.W.; Han, X.Z. Mechanism analysis of leaf spectrum response resulted from trace elements. J. Remote Sens. 2001, 5, 334–339. [Google Scholar]
- Ding, P.H.; Fuchigami, L.H. Simple linear regression and reflectance sensitivity analysis used to determine the optimum wavelengths for the nondestructive assessment of chlorophyll in fresh leaves using spectral reflectance. J. Am. Soc. Hort. Sci. 2009, 134, 48–57. [Google Scholar]
- Thenkabail, P.S.; Smith, R.B.; de Pauw, E. Evaluation of narrowband and broadband vegetation indices for determining optimal hyperspectral wavebands for agricultural crop characterization. Photogramm. Eng. Remote Sens. 2002, 68, 607–621. [Google Scholar]
- Al Abbas, A.H.; Barr, R.; Hall, J.D.; Crane, F.L.; Baumgardner, M.F. Spectra of normal and nutrient-deficient maize leaves. Agron. J. 1974, 66, 16–20. [Google Scholar]
|Element||Data set||Samples||Range (ppm)||Mean (ppm)||Standard deviation (ppm)|
© 2013 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).