Exploring the Best Hyperspectral Features for LAI Estimation Using Partial Least Squares Regression

The use of spectral features to estimate leaf area index (LAI) is generally considered a challenging task for hyperspectral data. In this study, the hyperspectral reflectance of winter wheat was selected to optimize the selection of spectral features and to evaluate their performance in modeling LAI at various growth stages during 2008 and 2009. We extracted hyperspectral features using different techniques, including reflectance spectra and first derivative spectra, absorption and reflectance position and vegetation indices. In order to find the best subset of features with the best predictive accuracy, partial least squares regression (PLSR) and variable importance in projection (VIP) were applied to estimated LAI values. The results indicated that the red edge–NIR spectral region (680 nm–1300 nm) was the most sensitive to LAI. Most features in this region exhibited a high correlation with LAI and had higher VIP values, especially the first derivative waveband at 750 nm (r = 0.900, VIP = 1.144). Adding a large number of features would not OPEN ACCESS Remote Sens. 2014, 6 6222 significantly improve the accuracy of the PLSR model. The PLSR model based on the fourteen features with the highest VIP values predicted LAI with a mean bootstrapped R value of 0.880 and a mean RMSE of 0.943 on the validation dataset and produced an estimated LAI result better than that, including the entire 54-feature dataset with a mean R of 0.875 and a mean RMSE of 0.965. The results of this study thus suggest that the use of only a few of the best features by VIP values is sufficient for LAI estimation.


Introduction
Defined as one-half of the total green leaf area (all-sided) per unit ground surface area [1], the leaf area index (LAI) is an important parameter and is commonly applied in environmental studies examining growth monitoring, yield estimation, evapotranspiration, radiation extinction, carbon cycling and climate [2][3][4][5].Direct measurement of the LAI via the use of destructive field measurements is extremely labor intensive, tedious and limited to experimental plots.Remote sensing techniques have been recognized as a reliable method to provide a fast, non-destructive and relatively cheap way to measure LAI on different scales [6,7].
Hyperspectral remote sensing can produce hundreds or even thousands of narrow, contiguous spectral bands, which may provide crucial additional information, potentially representing a significant improvement over broad bands in quantifying biophysical and biochemical variables, such as LAI [8,9].However, hyperspectral data are much more complex than multispectral data.Although they provide a vast amount of information, most adjacent wavebands are redundant and often highly correlated [10].It is therefore essential to determine the best spectral features derived from hyperspectral data in order to accurately quantify LAI.
Various methods have been proposed, applied and improved in recent decades for the extraction of the spectral features of hyperspectral information.These selection techniques can be classified into three broad groups.(i) Waveband features: Compared with broad bands, narrow bands in specific portions of the spectrum are known to improve discrimination capabilities for various vegetation.Thenkabail et al. [10] determined 22 optimal hyperspectral wavebands with which to best characterize vegetation and agricultural crops over the spectral range of 400-2500 nm.Becker et al. [11] used second-derivative analysis to identify eight optimal spectral bands in the visible-NIR wavelength region that appeared to contain the majority of the coastal wetland information content of the full spectral resolution.Wang et al. [12] employed three methods, including correlation coefficient-based, vegetation index-based and the stepwise regression method to select 15 suitable wavebands for paddy rice LAI estimation.(ii) Spectral position features: Reflectance and absorption features that characterize hyperspectral data are also related to specific physical and chemical crop characteristics [13].Pu et al. [14] found strong correlations between forest LAI and various red-edge parameters, including the red-edge position (REP) and the red-well position (RWP).Spectral features in the shortwave infrared (SWIR) regions (as well as those in the near-infrared) are also important in predicting LAI [9,15].(iii) Vegetation indices: Spectral vegetation indices are mathematical combinations of different spectral bands, mostly in the visible and near-infrared regions of the electromagnetic spectrum.Although the normalized difference vegetation index (NDVI) is by far the most well-known and widely used method of estimating LAI [16], it is sensitive to soil and saturates at a relatively low LAI level.In contrast, VIs, such as the soil adjusted vegetation index (SAVI; [17]), second modified SAVI (MSAVI2; [18]), renormalized difference vegetation index (RDVI; [19]) and triangular vegetation index (TVI; [20]), have now been devised to improve LAI estimation.It also has been demonstrated that VIs with red-edge bands are good predictors of widely variable green LAI, such as CI Red-edge [21,22].
The above-mentioned studies have made important progresses in detecting canopy information via the use of hyperspectral remote sensing.However, the research that systematically summarizes and analyzes the different spectral features of hyperspectral remote sensing data in terms of their performance in estimating LAI is rare, with the analysis of a single feature typically not sufficient to explore such rich information.Several studies have focused on statistical techniques, such as stepwise multiple linear regression (SMLR), which makes use of the information provided by several spectral features to estimate biochemical and biophysical vegetation properties [23,24].In either case, multi-collinearity is a common problem inherent to hyperspectral datasets [25].
Partial least squares regression (PLSR) is a data compression technique that reduces a large number of collinear variables to a few non-correlated latent variables or factors [26][27][28].A number of studies have shown that PLSR is a powerful tool able to extract significant signals and to create reliable models [8,[29][30][31][32], and it has the potential to accurately predict LAI.Although these previous studies employed all available spectral wavelengths simultaneously for PLSR, others have revealed that the use of only a few features is sufficient to extract and discriminate essential information and characteristics [10].As the use of full spectral subsets or the greatest available amount of spectral information would likely not improve retrieval performance, but simply increase computation time [24], it may therefore be more effective to obtain the most accurate biophysical vegetation data possible to build the model.
The objectives of this study were to: (1) systematically summarize the spectral features of hyperspectral canopy reflectance in terms of three aspects: feature wavebands, feature positions and vegetation indices; (2) evaluate every feature's potential for LAI estimation; and (3) identify optimal features (and their numbers) for LAI estimation via PLSR.

In situ Data Collection
The experiment was carried out during different years and growth stages of winter wheat, including:  .For each experimental point, winter wheat canopy spectral measurements were taken from 0.5 m × 0.5 m plots randomly selected in the central 30 m × 30 m field.
An ASD FieldSpec Pro spectrometer (Analytical Spectral Devices, Boulder, CO, USA) was used to measure canopy reflected radiances in the field.This instrument records reflectance in the range between 350 and 1050 nm, with a sampling interval of 1.40 nm, and between 1050 and 2500 nm, with a sampling interval of 2 nm.Spectral data were interpolated to a spectral band width of 1 nm using the ASD software.The instrument also has a 25° fiber optic field of view.Spectral measurements operated at a nadir of 1.3 m above the winter wheat canopy.All spectral measurements were made during cloud-free periods ±2 h from solar noon, between 10:00 a.m. and 2:00 p.m. local time.Prior to each reflectance measurement, the radiance of a 40 cm × 40 cm BaSO 4 standard panel was recorded under the same illumination conditions to convert the spectral radiance measurements into reflectance.Vegetation reflectance measurements were obtained by averaging 20 scans at optimized integration times.Due to severe noise associated with water absorption, the spectral regions of 1350-1480 nm, 1781-1990 nm and 2400-2500 nm were also excluded from the analysis.A moving Savitzky-Golay filter [33] with a frame-size of 17 data points and a second polynomial was employed to smooth the spectra.
All of the plants within a 0.25 m 2 area of winter wheat in each experimental point were harvested immediately after the spectral measurement, with the samples placed in black plastic bags and transported to the laboratory for subsequent analysis.In the lab, plants were dissected into green leaves, dead leaves, stems and roots.The green leaves were used to measure the leaf area by a leaf area meter (Li-Cor 3100, LICOR, Inc., Lincoln, NE, USA).Table 1  Both reflectance spectra and first derivative spectra were studied in order to select waveband features.The first derivative of reflectance was calculated from each reflectance spectrum using the following equation: where FDS is the first derivative of reflectance at wavelength midpoint i between wavebands j and j + 1, R λ(j) is the reflectance at waveband j, R λ(j+1) is the reflectance at waveband j + 1 and Δλ the difference in wavelength between j and j + 1.

Spectral Position Features
Figure 1 shows three absorption (560-760 nm, 920-1080 nm and 1120-1280 nm) and six reflectance positions (500-670 nm, 780-970 nm, 980-1200 nm, 1200-1350 nm, 1480-1720 nm and 2000-2300 nm) for winter wheat, used for the extraction of absorption and reflectance feature parameters.Continuum removal was applied throughout the full spectrum in order to enhance the differences in absorption and reflectance [34].The three parameters proposed were employed in the present study at every absorption and reflectance position: (1) depth; (2) area; and (3) normalized depth [35,36].For the three absorption positions, each absorption depth feature (A_Depth i ) was defined as: where continuum-removed reflectance R ʹ λ is obtained by dividing the minimum reflectance value R λ in the absorption position by the continuum line R ci (λ min ) at the corresponding wavelength and i is the number of absorption positions (i = 1,2,3).
The absorption area feature (A_Area i ) was calculated as the area bounded by the reflectance spectrum and continuum line in each absorption region as followsj: where R ci (λ) and R i (λ) are, respectively, the reflectance of the continuum line and reflectance at the corresponding wavelength λ in the absorption region and λ Si and λ Ei are the start and end wavelengths, respectively, in each absorption region.The normalized absorption depth (A_ND i ) was calculated by dividing the absorption depth feature by the absorption area feature, as follows: For the six reflectance positions, each reflectance depth feature (R_Depth i ) was defined as: where the continuum-removed reflectance R i (λ max ) is obtained by dividing the maximum reflectance value R i (λ max ) in the reflectance position by the inner continuum line R ci (λ max ) at the corresponding wavelength and i is the number of reflectance positions (i = 1,2,3,4,5,6).
The reflectance area feature (R_Area i ) was calculated as the area bounded by the reflectance spectrum and inner continuum line in the reflectance region as follows: where R ci (λ) and R i (λ) are the reflectance of the inner continuum line and reflectance, respectively, at the corresponding wavelength λ in the reflectance region and λ Si and λ Ei are, respectively, the start and end wavelengths in each reflectance region.
The normalized reflectance depth (R_ND i ) was calculated by dividing the reflectance depth feature by the reflectance area feature, as follows: After a literature review, other spectral position-based variables obtained from first derivative spectra [37] were also used in this study and are listed in Table 2.
Notes: ρ , ρ , ρ , ρ and ρ are the reflectance in spectral bands of the near-infrared, red, green and blue light bands, respectively; ρ denotes reflectance at the band i wavelength (nanometers).

Partial Least Squares Regression
The partial least squares regression technique generalizes and combines features of principal component regression (PCR) and multiple linear regression.The method is recognized as a powerful modeling tool with which to model relations when the number of predictor variables is large and the collinearity among the variables is strong [57].The aim of PLSR is to build a linear model as follows: where Y is a mean-centered vector of a dependent variable, X is a mean-centered matrix of independent variables, β is a matrix of regression coefficients, and ε is a matrix of residuals [28,58].
The optimal number of factors in PLSR analysis is determined by minimizing the prediction residual error sum of squares (PRESS) statistic.Here, the PRESS statistic is calculated via cross-validation (CV) prediction for each model [4,59].The root mean squared error of cross validation (RMSCV) is also used to assess the predictive abilities of the PLS models.
In order to evaluate the relative importance of variables in the PLSR model, their variable importance in projection (VIP) scores were computed.VIP calculates the contribution of independent variables to Vegetation Index Formulas Ref.
Three gradient difference vegetation index (TGDVI Triangular vegetation index (TVI)  [56] the contribution of the dependent variable, with the most influential predictors in the model selected according to the magnitude of their values [60].VIP scores serve as an apt measure for identifying individual waveband importance and for providing valuable insight into the most effective spectral regions [60,61].All calculations were carried out using MATLAB software (The Math Works, Inc., Natick, MA, USA).

Calibration and Validation
The 76 samples collected in 2009 were used as a calibration data set and the 71 samples collected in 2008 as an independent validation data set.Regression analyses were performed on the former, with the latter then employed to conduct an empirical validation of the regression models.The performance of the different PLSR models was compared using the coefficient of determination (R 2 ), root mean square error (RMSE) and normalized mean bias (NMB).Definitions of each metric are given below: where P is the estimated values, Q represents the measured values and n is the total number of samples.
The bootstrap process was performed on the independent validation data to assess the robustness of the regression models.The validation data set was bootstrapped with replacement for n = 1000 times.The mean and 95% confidence levels of R 2 values, as well as RMSE values for the validation data were calculated and recorded [62,63].

Optimal Spectral Waveband Feature Dataset
Spectral reflectance data and their first derivative spectra were correlated with measured LAI values using both 2008 and 2009 data sets (Figure 2). Figure 2a reveals negative correlation coefficients in the blue (503 nm), red (661 nm) and shortwave infrared regions (1990-2400 nm) and a positive correlation coefficient in the near infrared region (740-1300 nm).The strongest correlations occur at 439 nm, 554 nm, 750 nm, 948 nm, 1030 nm, 1147 nm, 1236 nm, 1295 nm, 1602 nm and 1745 nm for the first derivative spectra (Figure 2b).The ten best wavebands in both reflectance spectra and first derivative spectra were then selected for establishing an optimal spectral waveband dataset (Table 4).

Optimal Spectral Position Feature Dataset
A total of 40 spectral position features were calculated using absorption position features, reflectance position features and other spectral position-based variables.Correlation analysis was carried out between these 40 spectral position features and LAI values (Table 5), with the results revealing A_Area 2 to exhibit the strongest correlation with LAI (r = 0.854).The top 10 spectral position features were then selected in order to establish an optimal spectral position feature dataset: R_Area 2 , R_Area 3 , R_Area 4 , A_Area 1 , A_ND 1 , A_Area 2 , A_Area 3 , A_ND 3 , Dr and SDr.

Vegetation Index Dataset
We studied the correlations between 24 vegetation indices (VIs) and LAI (Table 6), with the results indicating significant relationships existing between LAI and all spectral parameters.A vegetation index dataset was then established based on these vegetation indices.

Best Features and Models
The 20 spectral waveband features, 10 spectral position features and 24 vegetation indices were then combined into a single dataset in order to assess its collective strength in predicting LAI.Every VIP value of the 54 spectral features was calculated and ranked in descending order (i.e., from highest to lowest; Figure 3), ranging from 1.144 to 0.746.A high VIP value indicated that the spectral feature was of major importance in estimating LAI and also had the greater coefficient of determination in two years' data.The best spectral feature was found to be FD3 (the first derivative at a wavelength of 750 nm).Most intermediate variables have similar R 2 to FD3, but their VIP values were smaller.Waveband features in the visible and SWIR regions had the lowest VIP values.
In order to identify the predictive accuracies when different numbers of features were analyzed via PLSR, we tested a forward variable selection procedure on the VIP-selected features.This process involved ranking the VIP features based on their VIP scores and then iteratively adding the best ranked indices in a new PLS model.Firstly, the three best VIP spectral features were employed to predict LAI via PLSR.Then, the four best VIP spectral features were analyzed, and so on, until all of the features had been used to predict LAI.For each stage of the forward selection procedure, the coefficients of determination of the first ten PLS factors and the optimal components were recorded (Figure 4).PLSR searched the sensitive information from a different number of spectral features based on VIP values in descending order.The addition of more spectral features resulted in a decrease in the coefficient of determination of the first PLS factor and an increase in that of the tenth PLS factor.The optimum number of factors for use in the PLSR models was estimated via cross-validation, with the optimum number of PLS factors for the different datasets being four or five.The final R 2 values ranged from 0.805 to 0.849, with the top 30 features exhibiting the highest R 2 values (R 2 = 0.849).However, the increase in R 2 after the addition of more features was unremarkable; only a minor decrease in the number of VIP features was required in order to improve LAI estimation and to influence the final results.As analysis of Figure 4 reveals, the dataset comprising 14 spectral features was sufficient to reach an R 2 value (R 2 = 0.842) in the plateau region of the graph, beyond which only very small increases in R 2 could be observed.The RMSECV varied between 0.731 and 0.812, with the smallest value in the top 14 features.Therefore, these 14 spectral variables were finally used to construct a PLSR model that had both fewer spectral features and more accurate estimates.

Calibration and Validation
PLSR analyses were conducted using the optimal waveband feature dataset, the optimal position feature dataset, the vegetation index dataset, the all-feature dataset and the 14 best features dataset.Table 7 shows the results produced by these models for both calibration and validation samples.Among the three different optimal spectral feature datasets, the PLSR model based on the 24-variable vegetation index dataset produced the highest estimation accuracy.The PLSR model based on the 10-variable optimal position feature dataset exhibited only intermediate accuracy, with that based on the 20-variable optimal waveband feature dataset being associated with the lowest level of accuracy.However, these three models with different features were able to extract spectral information and produce similar LAI estimation results.The PLSR model based on the combination of all features produced estimation accuracies for both the calibration sample (R 2 = 0.845, RMSE = 0.733) and the validation sample (R 2 = 0.878, RMSE = 0.940).The best feature dataset constructed via VIP forward selection, which reduced the number of spectral features from 54 to 14, produced a similar estimation accuracy with that of the all-feature dataset in the calibration dataset (R 2 = 0.842, RMSE = 0.731).The 14 best feature dataset using the developed regression equation from the calibration data set yielded higher R 2 to the all-feature dataset in the independent data set (R 2 = 0.881, RMSE = 0.937).The prediction results of estimation models tended to be underestimated (NMBs less 0).The NMBs of the best feature model tended to be lower than that of the all dataset model for both calibration and validation analyses.To assess the capability of the results, the bootstrap process was adopted.Figure 5 shows the normal distribution of the R 2 values calculated from the predicted and measured LAI in the independent data set by the bootstrapping methodology.Table 8 details the mean bootstrapped regression results in the form of the mean R 2 values, as well as the mean RMSE using PLS models.The best feature model performed the best with a mean R 2 of 0.880 and a mean RMSE of 0.943.

Discussion
To obtain adequate information from hyperspectral data, many features were identified based on spectral wavebands, spectral positions and vegetation indices.Additionally, the correlations between these features and wheat LAI values were studied.The analysis of the spectral waveband features revealed important features correlating with LAI across a broad range.However, compared to the performance of data produced via first derivative analysis, as well as absorption and reflectance position features and vegetation indices, the spectral features exhibited lower correlation coefficients, due to the influence of external factors, such as underlying soil brightness, leaf angle distribution and leaf optical properties [15,24].
The red-edge region is characterized by a sharp rise in the reflectance of green vegetation between the local minimum reflectance band in the red spectral region and the maximum reflectance band in the NIR spectral region.This region is considered contain more information regarding biomass quantity and LAI than other parts of the electromagnetic spectrum [64,65].In this study, many spectral features identified in the red-edge region had high precision and were more accurate (i.e., FD3, CI Red-edge and A_Area 1 ).FD3 was the first derivative at a wavelength of 750 nm and had both the strongest correlation with LAI (R 2 = 0.800) and the highest VIP value (VIP = 1.144) of all features.This confirmed previous findings by Wang et al. [12] with 723 nm and Thenkabail et al. [10] with 735 nm.CI Red-edge led to an R 2 of 0.766 and a VIP of 1.121, which was more sensitive to LAI variability than the NDVI.Viña [21] also showed that the CI Red-edge exhibited low sensitivity to soil background effects, and it constitutes a simple, yet robust tool for the remote and synoptic estimation of green LAI.
In the NIR region (800-1300 nm), reflectance spectra and first derivative spectra (SP4, SP5, FD4 and FD5), absorption and reflectance features (R_Area 3 , A_Area 3 , A_Area 2 , R_Area 2 and R_Area 4 ) and most vegetation indices (RVI, EVI, DVI and MTVI2) performed more effectively.The reflectance in this spectral region is mainly influenced by the arrangement of cells within the mesophyll layer of leaves, as well as by canopy structure, especially the number of vertical leaf layers.NIR water absorption regions are also sensitive to leaf moisture content [12].
The advantage of vegetation indices is that they can be used to obtain relevant information rapidly and easily, and the underlying mechanisms are well-understood [66].For the 24 VIs examined in this study, NDVI had a lower accuracy than most of the studied vegetation indices.The obtained results are in agreement with those found elsewhere in the literature [5,16,42].Most modified vegetation indices were better than their respective originals, including: MNLI vs. NLI, MSAVI2 vs. SAVI and RDVI vs. DVI, which is consistent with previous studies [18,19,42].Some vegetation indices based on three discrete bands also produced strong correlation, including the following: EVI, TVI, TGDVI, MTVI2 and sLAIDI*, which take advantage of sensitive spectral regions to reduce external factor and are highly sensitive to LAI [16,56].
The PLSR approach is considered to be the most useful explorative tool with which to unravel the relationship between canopy spectral reflectance and grass characteristics at the canopy scale.It is able to effectively address strong collinearity and noise in dependent variables [24].Although the parameter number of the optimal spectral waveband dataset was double that of the optimal spectral position dataset, the prediction accuracy of the 10-variable optimal position feature model performed better than that of the 20-variable optimal waveband feature model.Indeed, the all-feature (54 variables) dataset yielded a lower level of accuracy than the top 30 variables dataset.The results therefore indicate that the continuous addition of variables may not always improve LAI estimation accuracy.Indeed, the inclusion of an increasing number of less important spectral features in PLSR models can negatively influence prediction accuracy [22,26].
The calculated VIP scores provided an insight into the usefulness of each variable in the PLS The spectral dataset containing the top 14 variables was able to achieve a high level of estimation accuracy with the use of fewer spectral features; the subsequent inclusion of additional features resulted in only a minor improvement in model accuracy.These 14 features included the red-edge region (FD3), the NIR region (FD5, FD6, FD4, FD8, A_Area 2 , A_Area 3 , R_Area 2 and R_Area 3 ) and the best vegetation indices (RVI, CI Red-dege , MSR, MNLI and MTVI2); these were also the best features in the three datasets employed for LAI estimation discussed above.The presented results demonstrate the potential of PLSR and VIP techniques in identifying important variables for the estimation of LAI.It is important to select appropriate features and to determine the optimal variable number(s).Selecting only the very best features selected by VIP values may therefore be sufficient in terms of exploring the rich information available for LAI estimation, with the use of whole feature and/or full spectrum datasets being unnecessary.This finding is in agreement with that of [25], who identified the most significant indices (chosen via VIP) producing the best PLS model prediction of T. peregrinus damage.

Conclusions
In order to select suitable spectral features for LAI estimation, different features based on spectral wavebands, spectral positions and vegetation indices were evaluated, with all exhibiting the same changing tendency in two years of hyperspectral data.The best features in three different spectral feature groups exhibited a similar correlation with LAI.Derivative analysis, a combination of vegetation index, as well as absorption and reflectance position features generally proved to be better predictors of LAI variability.Spectral features in the red-edge and NIR regions were the most sensitive for predicting LAI.The first derivative at a wavelength of 750 nm exhibited the highest correlation with LAI for all features.
PLSR and VIP analyses were conducted on spectral feature data to estimate LAI and to identify the subset of features with the best predictive accuracy.Our findings suggest that LAI estimation accuracy could be improved by employing the most sensitive spectral features in conjunction with PLSR models.The application of these methods made it possible to extract sufficient signals covering the full spectral range of information, reducing the dimensionality of the hyperspectral data and improving the steady estimation accuracy of winter wheat LAI.The 14 features with the highest VIP values provided a higher level of accuracy in predicting LAI than the entire 54-feature dataset.The validation of the new model indicated that the best feature model performed the best with the mean R 2 of 0.880 and the mean RMSE of 0.943.
Compared to other multivariate statistical models, such as principal component regression (PCR) and stepwise multiple linear regression (SMLR), PLSR outperformed other techniques in estimating canopy chlorophyll content, vegetation water content, nitrogen content, LAI, and so on [24,67,68].However, some methods, such as support vector machines (SVM) and artificial neural networks (ANNs), are also useful for nonlinear models and vegetation canopy property estimations.To evaluate applications of these features and models proposed in this study, other vegetation types and the radiative transfer model approach will be conducted.

Figure 1 .
Figure 1.Three absorption (A) and six reflectance (R) characteristics of the wheat canopy spectrum.

Figure 2 .
Figure 2. Correlation analysis of reflectance spectra (a) and first derivative spectra (b) against LAI for each wavelength.

Figure 3 .
Figure 3. Variable importance in projection (VIP) and R 2 for each spectral feature.

Figure 4 .
Figure 4. Variation in the coefficient of determination for different numbers of spectral features analyzed via partial least squares regression (PLSR).Note: The black horizontal dashes represent the coefficients of determination of the first ten PLS factors for different numbers of spectral features.Each optimal PLS factor is marked as a red circle, and the corresponding RMSECV is marked as green triangles.

Figure 5 .
Figure 5. Histograms showing the frequency of R 2 values between the measured and predicted LAI for the independent dataset.(a) Optimal waveband feature model, (b) Optimal position feature model, (c) Vegetation Index model, (d) All feature model, (e) Best feature model.

Table 1 .
Descriptive statistics of LAI in two years' data.

Table 2 .
Spectral features derived from first derivative spectra.

Table 3 .
Vegetation indices compiled from the literature.

Table 4 .
Correlations between optimal spectral waveband features and LAI in different years.
r: the correlation coefficients between features and LAI for the calibration data set (in 2009) and the validation data set (in 2008), respectively; ** correlation significant at the 0.01 level.

Table 5 .
Correlations between spectral position features and LAI in different years.

Table 6 .
Correlation between vegetation indices and LAI in different years.

Table 7 .
Performance of the PLSR models in predicting LAI based on the use of different datasets.NMB, normalized mean bias.

Table 8 .
Bootstrap methods for PLS models in the independent data set.