Introduction of Variable Correlation for the Improved Retrieval of Crop Traits Using Canopy Reﬂectance Model Inversion

: Look-up table (LUT)-based canopy reﬂectance models are considered robust methods to estimate vegetation attributes from remotely sensed data. However, the LUT inversion approach is sensitive to measurements and model uncertainties, which raise the ill-posed inverse problem. Therefore, regularization options are needed to mitigate this problem and reduce the uncertainties of estimates. In this study, we introduce a new method to regularize the LUT inversion approach to improve the accuracy of biophysical parameters (leaf area index (LAI) and fractional vegetation cover (fCover)). This was achieved by incorporating known variable correlations that existed at the test site into the LUT approach to correlate the model variables of the Soil–Leaf–Canopy (SLC) model using the Cholesky decomposition algorithm. The retrievals of 27 potato plots obtained from the regularized LUT (LUTreg) were compared with the standard LUT (LUTstd), which did not consider variable correlations. Different solutions from both types of LUTs (LUTreg and LUTstd) were utilized to improve the quality of the model outputs. Results indicate that the present method improved the accuracy of LAI estimation, with the coefﬁcient of determination R 2 = 0.74 and normalized root-mean-square error NRMSE = 24.45% in LUTreg, compared with R 2 = 0.71 and NRMSE = 25.57% in LUTstd. In addition, the variability of LAI decreased in LUTreg (5.10) compared with that in LUTstd (12.10). Hence, our results give new insight into the impact of adding the correlation between variables to the LUT inversion approach to improve the accuracy of estimations. In this study, only two correlated variables (LAI and fCover) were examined; in subsequent studies, the full correlation matrix based on the Cholesky algorithm should be explored.


Introduction
The quantitative estimation of vegetation characteristics from hyperspectral remote sensing is required for various applications in agriculture, ecology, plant physiology and meteorology [1,2]. In the agricultural context, among many vegetation characteristics, leaf area index (LAI), chlorophyll content and fractional with the nonparametric regression approach can be employed to derive biophysical and biochemical parameters [52,53]. Nevertheless, this algorithm has not yet been applied directly to physical models (RTMs) to retrieve the corresponding parameters [34].
The present study intends to address the following research question: To what extent does integrating the correlation structure of selected variables into the LUT inversion approach using the SLC model improve the retrieval of LAI and fCover? We hypothesized that adding a known relation between variables to the LUT (denoted as regularized LUT (LUTreg)) would increase the accuracy of LAI and fCover retrievals compared with the standard LUT (LUTstd). Besides addressing the relation between variables, an existing regularization technique (multiple solutions) was examined as an additional means of stabilizing the LUT retrieval approach. Retrievals from both types of LUTs (LUTreg and LUTstd) were validated by in situ measurements and systematically compared to highlight the added value of using variable relation in the generation of LUT.

Study Area and Experimental Design
The study area is located in the southwest of Luxembourg 49 • 36 47.1 N, 5 • 55 6.7 E ( Figure 1A). It is characterized by a temperate climate with an annual average precipitation of about 700 mm and different loam soil textures, which are mainly sandy loams. The experiment was established in 2016 during the spring season. In this experiment, an agricultural parcel was planted with a potato crop (Victoria cultivar). Plots (5 × 3 m) with three nitrogen fertilization levels were established (80, 180, and 280 kg/ha nitrogen), representing under-, standard, and over-fertilization, respectively ( Figure 1B). For each fertilization level, nine replicates were realized, resulting in a total sample of 27 plots. Measurements of canopy reflectance and vegetation parameters were taken on 20 July 2016, under cloud-free conditions with a blue sky. The plants' growth stage was tuber bulking, flowering or both. Measurements were taken in the middle area of each plot, along with the positions shown in Figure 1C, to avoid border effects. The three nitrogen levels created different growth conditions, which resulted in a variation in biophysical and biochemical variables.

Spectral Measurements and Preprocessing
Canopy reflectance was measured by an ASD FieldSpec-3 spectroradiometer (Analytical Spectral Devices, Boulder, CO, USA) over 27 plots in a spectral range of 350-2500 nm. Spectral measurements were taken from a distance of about 80 cm above the canopy. Given the sensor field of view of 25 • , the effective area of the footprint was 0.88 cm 2 . An average of eight measurements at distributed positions ( Figure 1C) was used to represent the reflectance of a plot. The measurements were obtained at the nadir position under direct solar illumination between 11:30 a.m. and 2:00 p.m. in order to minimize solar angle effects. The spectral bands in the water absorption region (1350-1460 nm, 1790-1960 nm, and above 2400 nm) and in the ozone region (bands below 400 nm) (21 bands in total) were excluded from further analysis. This is because these are noisy bands where the atmosphere strongly absorbs energy, leading to poor signal strength and errors in the analysis. In addition, a shift correction was applied to correct for effects caused by the three detectors [54]. For further analysis, the spectra were resampled to a 10 nm resolution, resulting in 180 wavebands.

Biophysical and Biochemical Measurements
LAI was measured nondestructively with a Licor LAI-2000 device. Measurements were taken in the morning under clear sky conditions and low sun angles by an operator who stood with his back toward the sun, and the measured plot was shielded from direct sun illumination using a large umbrella. Figure 1C shows the exact locations of below-canopy readings (red crosses). Readings above the canopy were taken before and after the below-canopy readings. The set of above-and below-canopy readings were averaged for each plot. fCover was visually estimated by a trained person in steps of 5% by looking from above in vertical direction on the canopy at the plot level [13]. LAI and fCover measurements were used for validation to evaluate the accuracy of estimated LAI and fCover, as well as for estimating the values of the leaf inclination distribution function (Section 2.5). The SPAD-502 Konica Minolta device was used to estimate leaf chlorophyll content. It is a nondestructive method of measuring the leaf transmittance using two wavelengths in the red spectral range (650 nm) and the near-infrared spectral range (940 nm). Six leaves per plot were selected randomly, and the average value of five readings per leaf from different positions of top leaflet was determined. The relative values of the SPAD readings were converted into leaf chlorophyll content per leaf surface area, expressed in µg cm −2 , using an empirical model for potato [55].
In Equation (1), LCC is the absolute amount of chlorophyll in µmol m −2 , and SPAD is a dimensionless value. With the molecular mass of chlorophyll a and chlorophyll b, units were converted from µmol m −2 to µg cm −2 .

Radiative Transfer Model
In this study, a modified version of the Soil-Leaf-Canopy radiative transfer model (SLC) was employed [24]. SLC is an integrated model that combines the SAIL and geometric models to simulate reflectance from discontinuous canopies. Three submodels-the modified Hapke's soil module [56], the modernized canopy module 4SAIL2 [22], and the PROSPECT-4 leaf module [57]-are combined in SLC. It simulates canopy bidirectional reflectance over a spectral range of 400-2500 nm at a 10 nm spectral resolution for arbitrary sun-target-sensor geometries. An amended version of the SAIL model is 4SAIL2, which includes the crown clumping (in the horizontal direction) and leaf color gradients (in the vertical direction) of a canopy. It simulates the bidirectional reflectance factor of a turbid medium plant by solving the scattering and absorption of four upward/downward radiative fluxes [22]. The input variables include LAI, the leaf inclination distribution function (LIDF), hotspot (hot), the layer dissociation factor (D), fraction brown leaf (fB), the tree shape factor (zeta), and vertical crown coverage (Cv). The last two parameters (zeta and Cv) describe the crown clumping effect, while D and fB characterize the distribution of brown and green leaves between the upper and lower layers of the canopy. The fraction of ground data covered by crowns in the nadir direction is calculated by directly determining the effective fractional vegetation cover or crown fraction (fCover) by two input parameters, LAI and Cv, as follows [24]: where Cv is the vertical crown cover, and e −k * LAI is the gap fraction following the Lambert-Beer law, and k is the extinction coefficient in the vertical direction and depends on the leaf inclination distribution function (LIDF) and the viewing angle (Θ). As input parameters, 4SAIL2 requires the directional-hemispherical reflectance and transmittance of a single leaf for both green and brown pigments to discriminate between photosynthetic and non-photosynthetic light absorption, which are supplied by the PROSPECT-4 model. This model is an updated version of PROSPECT, which does not have biochemical data pertaining to brown material [58]. The input variables of the PROSPECT-4 model are the leaf structure parameter (N) and leaf biochemical constituents, including leaf chlorophyll content (LCC), leaf dry matter content (Cm), leaf water content (Cw), and leaf senescent matter content (Cs).

Model Parameterization
The look-up table (LUT) approach was used in the forward mode to generate a set of simulated canopy reflectances from the table of input variables and to adapt the model for the potato crop. Some input variables (hotspot, dry matter, water content, senescent material, fraction brown leaf, the dissociation factor, and the zeta parameter) were effectively fixed using the values reported in Table 1, while the other variables were kept free because of the high sensitivity of canopy spectra. From the prior information from the field, previous studies, and user experience, the bounds and ranges of the variation in each variable in the LUT were defined. Since no particular distribution function was specified for sampling the input variable space, a uniform distribution was used for the model inputs. One further requirement to run the SLC model is the leaf inclination distribution function parameter, which needs to be defined to describe the orientations of potato leaves. Since no observations of the LIDF parameter were available in this study, we estimated it from previous studies, which assumed that the oriented leaves of potatoes could be planophile, spherical or erectophile [8,59,60]. For this reason, we compared the measured canopy reflectance of four canopies with simulated canopy reflectance using four different LIDF types in the forward direction. The best-simulated reflectance was selected to represent the orientation of potato leaves. The four types of LIDF were planophile (type 1: a = 1, b = 0), spherical (type 2: a = −0.35, b = −0.15), transitional-1 (type 3: mean of type 1 and type 2; a = 0.33, b = −0.08), and transitional-2 (type 4: mean of type 1 and type 3; a = 0.66, b = −0.04). The four simulated canopies had low levels of LAI and LCC (case 1); high levels of LAI and LCC (case 2); a medium level of LCC and a high level of LAI (case 3); and a medium level of LAI and a high level of LCC (case 4). The value of the Cv parameter was calculated by solving Equation (3) for four cases using the measured values of LAI and fCover and the fixed value of the extinction coefficient (K = 0.55). However, the rest of the input parameters of the SLC model were fixed at default values. For the Hapke soil model, the input parameters were set to default values for plowed soil [22]. A total of 17,280 simulations were carried out by varying the LAI, LCC, Cv, and N variables.  Table Inversion Two types of LUTs were designed. The first type was a standard LUT (LUTstd), in which all input model variables were independent following the sampling form of a uniform distribution [61]. The second scenario was the regularized LUT (LUTreg), which was integrated by using Cholesky decomposition in order to restrict the sampling of the LUT input space. The Cholesky decomposition method is employed to transform a set of independent (or uncorrelated) random variables to correlated ones [62].
The correlation matrix between LAI and fCover, which naturally existed in the study area, was used to generate linear combinations of the LAI and Cv variables. These variables, in turn, affected the derived quantity of fCover (Equation 3). Figure 2 illustrates that the linear relationship between the LAI and Cv variables improved significantly after using the Cholesky algorithm in the forward simulation. It should be highlighted that we considered only the correlation between measured LAI and fCover to correlate the input model variables (LAI and Cv) because of the lack of ground data in this study. In addition, we selected a strong correlation between measured variables for improving the model predictions. The advantage of using Cholesky method in LUT is that the generation of unrealistic variable combinations that may never occur at the test site can be prevented. Consequently, the accuracy of the retrievals can be enhanced in the model inversion.
The Cholesky algorithm is a square root matrix that results in the original correlation matrix when multiplied by itself. It decomposes a Hermitian positive-definite matrix into the product of a lower triangular matrix (L) and its conjugate transpose (L T ) [63]. By using Matlab 2007, the variance-covariance or Cholesky matrix (M) can be decomposed as follows: where the variance-covariance matrix is identical for the same set of the correlation matrix M n × n , R is the correlation coefficient, σ i is the standard deviation of the variable x i , a and b are the positive values of off-diagonal values and are calculated by a = ρ 1,2 σ 2 and b = σ 2 2 − a 2 , and ρ 1,2 is the covariance between x i and j .
Once the correlation matrix of LAI and fCover variables is calculated from measured data of 27 plots for potato crop, the lower triangular matrix (L) was defined as follows: Respecting the size of the lower triangular matrix (L) and number of desired simulations, the random variables of Z i , j were generated which are uncorrelated random numbers.
To get correlated random variates (x i and x j ), a transpose of lower triangular form was multiplied by the number of desired simulations of uncorrelated random variables (Z i and Z j ) respecting to the size of the Cholesky matrix (M) as Then, the products of Equation 7 (x i for LAI and x j for Cv) were randomly drawn from the uniform distribution with the limited boundaries (min and max values) as defined in Table (1 Relationships between LAI and Cv variables using forward simulations (a) without variable correlation and (b) with variable correlation using the Cholesky method.

Model Inversion and Validation Data
For reducing the ill-posed inverse problem and improving the LUT-based inversion approach, the multiple solutions method was utilized as a regularization strategy for both types of LUTs. Several studies have recommended the use of multiple solutions instead of using a single solution, which does not lead to the best accuracy of retrievals [12,14,20,64,65]. To identify the optimal solution or estimations in each of the 27 plots, we used the root-mean-square error (RMSE) as a cost function that quantifies the agreement between the observed and modeled spectra generated from the SLC model. The mean and median of the multiple solutions that yielded the lowest errors were examined in this study as statistical indicators to obtain the final and optimal solution.
For the validation of the model inversion, the retrieved predictions were validated with in situ data using statistical measures, including the correlation of determination (R 2 ), root-mean-square error (RMSE), and normalized root-mean-square error (NRMSE) (Equation (10)). R 2 and NRMSE were used mainly to evaluate the performance of the two LUT types. The parameters were retrieved with high R 2 and low NRMSE values, which led to the success and accuracy of model inversion [65]. Besides the statistical measures (R 2 , RMSE, NRMSE), the variance of retrievals was calculated for each plot to measure the variability or the spread of data (estiamted variable) about the mean of estimations for both LUTs. The variance is defined as the sum of the square of the deviations from the mean (the basis for measure of spread), when the mean is used as a measure of central tendency. Table 2 depicts the variation of biochemical and biophysical charateristics (LAI, fCover, LCC, and CCC) among 27 plots of potato crop.The table reveals that the variation coefficients (C.V (%)) for LAI, CCC, and fCover are similar, while it is considerably lower for LCC. The reason is that a limited range of LCC values was estimated from SPAD readings. Table 3 illustrates that a strong positive correlation exists between LAI and fCover, with R = 0.86. However, the correlation between other variables (LAI and LCC) was relatively low. The measured correlation between LAI and fCover was chosen to regularize the LUT approach rather than the other one (LAI and LCC).

LIDF Estimation from the Forward Simulation
The simulated spectra were obtained by varying the four types of LIDFs for the four cases using the SLC model in the forward mode ( Figure 3). The result shows that the canopy reflectance in the NIR domain was controlled by canopy architectural parameters, such as LAI, Cv and LIDF variables, which varied under different levels of N treatments. In fact, the strongest effect of nitrogen (N) fertilization treatments occurred in the near-infrared (NIR) domain of the canopy spectrum when increasing the level of N treatment. The three cases (2, 3 and 4) that were treated with the high levels of nitrogen (N280) had higher reflectance in the NIR than the first case (N80). To characterize the distance between measured and simulated spectra, the root-mean-square error (RMSE) and sum-square error (SSE) as a quantitative analysis were used. From Table 4 along with the Figure 3, we concluded that the simulated results of type 4 (transitional-2) achieved the best match between the simulated and measured spectra compared with the other LIDF types for the potato crop. Therefore, the values of LIDF type 4 were fixed in both LUTs (LUTstd and LUTreg).  . Comparison between measured (black) and simulated (red) spectra for four types of leaf inclination distribution functions (LIDFs) and four selected potato plots (cases) to identify the LIDF type that best represents potato.

Inversion Results for Standard LUT and Regularized LUT
The top 5, 10, 100, 250, 300 and 500 best solutions were aggregated using the mean and the median and needed to get the final estimate. We used this range of multiple solutions to investigate how different solutions affect the inversion results compared with using the single solution (ill-posed). The results from both types of LUTs (LUTreg and LUTstd) revealed that the mean and median from the first 300 best solution provided the best results for LAI, fCover and CCC retrievals in terms of R 2 and NRMSE (Tables 5 and 6). Additionally, we used Fisher Z-Transformation to test the significant difference between the mean and median for multiple solutions. The results show that, generally, there were no significant differences between the mean and median (z = 0.474, p > 0.05). Despite these results, the median approach of multiple solutions was utilized as a final indicator to reduce the statistical impacts of outliers in the data.
In the inversion process, the correlation between estimated LAI and Cv in LUTreg was improved by using the Cholesky method compared to LUTstd, as shown in Figure 4. Subsequently, the accuracy of LAI from LUTreg was higher than that from LUTstd when validated with ground data (R 2 = 0.74, NRMSE = 24.45% for LUTreg; R 2 = 0.71, NRMSE = 25.57% for LUTstd). To compare results between LUTstd and LUTreg, the relative difference (RRMSE) and absolute difference (NRMSE) were calculated by using LUTstd as the bench-line method. LUTreg reduced the error by about 6.5% in RRMSE and 1.12% in NRMSE, while the CCC retrieval decreased about 7.3% in RRMSE and 0.26% in NRMSE, respectively. Moreover, by using a paired t-test the results of LAI and CCC showed that there were statistically significant differences between LUTreg and LUTstd.
On the other hand, LUTreg failed to improve the accuracy of fCover (R 2 = 0.69, NRMSE = 18.60% for the LUTreg; R 2 = 0.70, NRMSE = 17.85% for LUTstd) and the scatterplot of LUTreg for fCover ( Figure 6) was identical to that obtained by LUTstd. Lower fCover values (<0.4) were overestimated, while high values (>0.4) were well estimated. Lastly, the inversion scheme did not produce accurate estimates of the LCC obtained from LUTreg or LUTstd (Appendix A, Table A1).    Note: The highlighted numbers indicate the best retrieval; LAI is the leaf area index; CCC is the canopy chlorophyll content; fCover is the fractional vegetation cover; NRMSE is the normalized root-mean-square-error; R 2 is the correlation of determination; RMSE is the root-mean-square-error. Note: The highlighted numbers indicate the best retrieval; LAI is the leaf area index; CCC is the canopy chlorophyll content; fCover is the fractional vegetation cover; NRMSE is the normalized root-mean-square-error; R 2 is the correlation of determination; RMSE is the root-mean-square-error.
Each estimated parameter in the multiple solutions was not a single value but with a wide range of values of different frequency. When the mean value of estimation and the maximum frequency value are close to the measured parameter, then the estimated parameter will be the best and more accurate. Though the LUTreg might have similar mean (or median) values of estimations to LUTstd in some plots ( Figure 5), the variance of both LUTs was different. This refers to the various distributions of estimates in LUTreg and LUTstd. Figure 7 shows that LUTreg has lower variance than LUTstd. It means that the distribution of LAI estimates in LUTreg was close to the mean value and more consistent, whereas those estimates obtained by LUTstd were dissimilar and spread out. In other words, the variability of model predictions in LUTreg was less and more stable than others. Additionally, the variance of the estimated Cv and fCover obtained from LUTreg was considerably lower than those from LUTstd, except for a few plots (Figures 8 and 9). Hence, the difference in the total variance of Cv and fCover between LUTreg and LUTstd was less than that in the variance of LAI.

Discussion
This study extends previous studies that suggested several regularization schemes to mitigate the ill-posedness problem in radiative transfer models. We proposed a new way to regularize the LUT inversion approach by introducing the correlation between variables to improve the variable retrieval. The results obtained from LUTreg confirm that the accuracy of LAI retrieval was improved significantly compared with the retrieval using LUTstd. However, fCover retrieval did not show any improvement in LUTreg. Quan et al. [47] used a correlation matrix of five input model variables in the Bayesian network-based inversion approach (without using the Cholesky algorithm) in order to improve the accuracy of LAI and canopy water content (CWC) retrievals. In their study, the estimated LAI values were improved (R 2 = 0.77 and RMSE = 0.51) compared with those obtained by using independent input variables (R 2 = 0.69 and RMSE = 0.55). These results were comparable to our result. However, we used fewer variables (only two variables, LAI and Cv) to improve the accuracy of the parameters of interest (LAI and fCover) using the Cholesky decomposition.
As another regularization measure, we used a priori knowledge about the crop species to specify the leaf inclination distribution function parameters (LIDFa and LIDFb). We compared the simulated spectra of four types of LIDFs (planophile, spherical, transitional-1 and transitional-2) with the measured reflectance data and selected the best type of LIDF. LIDF type 4 ("transitional-2"), which represents a transition between planophile and spherical, was determined to be the best match. This result of Table 4 is consistent with the results of References [26,59]. They found that a planophile canopy structure matches a sugar beet type of canopy more than it does a potato canopy. By contrast, other studies have stated that potato leaves mainly tend to be of the planophile type in fertilized plots [8,60].
In addition, for multiple solutions the RMSE and NRMSE for more than 100 solutions from both LUTs improved the accuracy of estimations which is comparable with the result of Reference [66]. In this study, we found that 300 best entries was optimal solution for size of 17,280 LUT and more than 300 solutions the accuracy started to become stable and slightly decrease. In contrary to other studies, they found that the optimal solution for a size of 100,000 and 280,000 LUT was between 10 and 250 entries [12,20,66,67]. The accuracy of LAI estimations increased when using LUTreg (R 2 = 0.74, NRMSE = 24.45%) compared with LUTstd (R 2 = 0.71, NRMSE = 25.57%). Especially for low and medium LAI values, LUTreg decreased the underestimation of LAI values compared with LUTstd. Several studies have also observed the underestimation of LAI values in their predictions and attributed these underestimations to the effects of soil background, the heterogeneity of vegetation cover and shadow [37,39,59]. However, in the studied experimental plots, the reflectance contribution from the background soil was less than the contribution from green plants. Furthermore, the influence of row-induced shadow effects was minimal because the reflectance measurements were acquired from the nadir position. As shown in Figure 5, LUTreg decreased the underestimation phenomenon compared with LUTstd. On the other hand, for the plots that relatively covered the soil, LUTreg retained the high values of LAI overestimation (above 3), which are poorly estimated by a 1D turbid medium RTM (i.e., SLC) as a result of the saturation effect [20,59]. This result is consistent with other studies that have used the PROSAIL model and overestimated high LAI values (above 3.5) [37,39,68].
Both types of LUTs failed to accurately estimate the LCC variable using the SLC model (see Appendix A, Table A1). Other studies that have used the SLC model [33] or the PROSAIL model [12,13,24,36,39,59] have reported similar failures to accurately derive LCC. This issue might be caused by the limited range of LCC values in LUT and the uncertainty of using SPAD measurements [39]. For fCover, LUTreg could not improve the accuracy compared with LUTstd (R 2 = 0.69, NRMSE = 18.60% for LUTreg; R 2 = 0.70, NRMSE = 17.85% for LUTstd). However, the results of both scenarios revealed that the accuracy increased at high values of retrieved fCover (>0.4) and scattered points were distributed around the 1:1 regression line, comparable to Reference [69]. This result means that high values of fCover are indicative that the potato crop is homogeneous (fully covered by vegetation) in the horizontal direction. This observation is aligned with the assumption of the turbid medium of canopy RTMs. On the contrary, lower values of fCover (<0.4) were overestimated. This overestimation might be explained by the fact that non-leaf elements (i.e., stem, shoot and branch) that greatly affect the canopy reflectance are not reflected in the SLC model. In a study in Reference [70], when the vegetation cover was at a low level, the PROSAIL model overestimated the fCover for a corn crop, whose planting pattern is similar to that of the potato crop (discontinuous crops).
Calculating the variance also confirmed the improvement of the LAI estimate derived from LUTreg. This implies that the proposed method (Cholesky algorithm) reduced the inverse problem by decreasing the uncertainties of modelled spectra generated from LUT for each plot. However, compared with LUTstd, the percentage of variance for the Cv and fCover retrievals from LUTreg (50% for Cv and fCover, respectively) (Figures 8 and 9) was not obvious, as in the case of LAI (60%) (Figure 7). The errors mainly came from the five plots of retrieved Cv from LUTreg. This is because these plots were not fully covered by the canopy (heterogeneous). In addition, the Cv parameter caused a nearly linear mixing of canopy and soil spectra, while LAI mainly modified the near-infrared spectrum, with deeper dips at high LAI. This likely induced problems related to the compensation effect between variables (LAI and Cv) since the fCover variable is a derived quantity from basic inputs (LAI, Cv, and the LIDF variables). The different combinations of Cv and LAI can produce the same value of fCover [33]. For instance, plots with high LAI and low Cv or vice versa increase the uncertainty in retrievals of fCover. Additionally, the plots that had low LAI and Cv were characterized by a high uncertainty in their corresponding estimations because of the contributions of the soil signature. Nevertheless, in plots with high values of LAI and Cv, the model represented more accurate estimations of fCover (corresponding to high values), indicating well-developed canopies.
Although LAI retrieval was improved by considering the variable correlations between LAI and fCover, there is still a need to examine the full correlation matrix among model variables to enhance fCover and CCC retrieval, as well. It is supposed that appraising the other correlations between free model variables (such as LAI-Cw/Cm/SM/LIDF and Cm-N/Cv) in LUT using the measured correlation variables will likely increase the reduction of the development of the unrealistic simulated reflectance produced by an unrealistic combination of input model variables. It is recommended that future research apply the presented method to different RTMs (i.e., PROSAIL, INFORM, or SCOPE) and to different areas at different scales in order to investigate the retrievals of various types of crops from different remote sensing products. To apply the proposed method for different observation dates or areas in a simple way an average correlation coefficient (between this range 0.65-0.90) can be used to improve the results instead of using correlation for a single date or area. However, for representing the actual situation for each observation data or area the Cholesky method can not be easily applicable in term of time calculations. Moreover, it is suggested to examine the given strategy with other inversion methods, such as OPT, Bayesian, and ANN.

Conclusions
This study presented the feasibility of using a Cholesky algorithm in the LUT-based inversion approach to improving predictions of the SLC model. To quantify vegetation attribute for 27 potato plots the LUT inversion approach was used as a robust and simple method using hyperspectral remote sensing data. Besides regularization techniques to optimize LUT inversion method, the incorporating known variable correlations were considered as an additional source of a priori information to avoid the generation of meaningless canopy spectra. The proposed method (Cholesky algorithm) utilizes the correlation between LAI and fCover that naturally exists in the study field to correlate the independent model variables LAI and Cv. Retrievals from the regularized LUT (LUTreg), which was modified by using known variable correlation were compared with the standard LUT (LUTstd).
The results revealed that LUTreg was appeared to be successful for improving the accuracy of retrieved LAI and CCC rather than LUTstd in term of R 2 and NRMSE%, because of reducing the probabilities of unrealistic combinations of model parameters. However, the estimated fCover was not improved by LUTreg that is due to the error of the estimated Cv. By calculating the variability of model predictions the estimated LAI (5.10) was remarkably decreased over 27 plots in LUTreg compared to LUTstd (12.10), while the estimated Cv and fCover from LUTreg were slightly decreased (0.10 and 0.02) as compared to LUTstd (0.20 and 0.04, respectively).
For further studies, two main issues could be addressed and taken into account. First, measuring the LIDF parameter and soil spectra in the field should be considered to increase the accuracy of fCover retrieval. Second, based on ground data sufficiency a full variance-covariance or correlation matrix of all involved variables of RTMs should be considered to implement the Cholesky algorithm efficiently for decreasing the uncertainties of retrievals especially for Cv, which is related to improve the accuracy of fCover variables.
Author Contributions: A.A. wrote the manuscript and analyzed the data. All authors designed the conceptual framework of this research. M.S. secured the funding of the field campaign, made the sample design and took the LAI measurements. T.U. proposed the idea of introducing variable correlation to the model inversion. T.U. and M.S. contributed significantly to the structure and phrasing of the manuscript. In addition, W.V. reviewed the model parameterization and results.
Funding: The publication was funded by the Open Access Fund of Universität Trier and the German Research Foundation (DFG) within the Open Access Publishing funding programme.

Conflicts of Interest:
The authors declare no conflict of interest. Note: The highlighted numbers indicate the best retrieval; LCC is the leaf chlorophyll content; NRMSE is the normalized root-mean-square-error; R 2 is the correlation of determination; RMSE is the root-mean-square-error. Figure A1. The estimated CCC obtained from LUTstd (a) and LUTreg (b).