1. Introduction
With the global increase in food demand, obtaining timely information regarding crop growth and retrieving detailed data of crop health have become essential aspects for developing strategic food policies and ensuring sustainable agroecosystem management. Reliable monitoring of crop yields at the regional scale can support policy makers in quantifying food supply (GEOGLAM initiative,
https://www.earthobservations.org/geoglam.php, accessed on 20 April 2021), while mapping crop conditions at a field scale can assist farmers in agroecosystem management [
1]. The leaf area index (LAI), defined as the total one-sided area of leaf tissue per unit ground, is a biophysical indicator used to represent the dimension of the crop canopy and its variation over time [
2]. Indeed, LAI measurements have been widely adopted for crop monitoring, as well as for modelling applications [
3,
4,
5], being a key state variable associated with processes including light interception and soil–crop water balance [
6,
7,
8]. Moreover, at the landscape level, LAI maps can provide information on cropping system status, according to crop rotation, soil coverage, crop phenological development, and their response to management and anomalies caused by extreme events in time (across seasons) and space (among and within fields) [
4,
9].
Remote sensing (RS) provides an effective way to retrieve LAI values at different spatial and temporal scales, as has been successfully demonstrated in different contexts [
10,
11,
12]. According to Verrelst et al. [
13], the approaches for biophysical parameter retrieval (e.g., LAI) can be classified in four main categories: (i) parametric regression methods, which assume an explicit relationship between spectral data (e.g., Vegetation Indices) and biophysical data (e.g., LAI); (ii) non-parametric regression methods, which do not require an explicit relationship and data distribution; (iii) physical-based methods, using radiative transfer models (RTMs) to simulate the interaction between spectral radiation and vegetation biophysical and biochemical parameters; and (iv) hybrid retrieval methods, combining non-parametric and physical-based approaches.
The first category relies on regression analyses (ground-LAI vs. VI) and their easy implementation for operational vegetation cover monitoring applications [
13]. However, the drawbacks of VI-based methods are related to the implicit assumption that the reflectance variability depends only (or mainly) on the LAI [
14]. Instead, canopy reflectance is strongly affected by several factors, such as the aboveground biomass, chlorophyll content, canopy architecture, and soil background, which vary in space and time, according to the crop phenology and seasonal conditions [
15,
16]. Therefore, parametric approaches are often crop- and site-specific, due to their dependence on the regression data set, thus making them inadequate for the general purpose of retrieving LAI values from a diversified landscape mosaic [
17]. In this context, several VI formulations, according to different band compositions, have been developed, in order to cope with these limitations and to be better suited to mixed-crop scenarios [
18,
19,
20,
21]. For example, previous research has shown the potential of VIs based on red-edge (RE) and short-wave infrared regions (SWIR), in terms of being less sensitive to specific crop types than traditional Red/NIR-based indices (e.g., NDVI) [
19,
22,
23,
24]. Nevertheless, further investigations into the accuracy of parametric methods based on RE and SWIR vegetation indices are needed, in order to assess their accuracy for satellite remote sensing.
The second category refers to machine learning regression algorithms (MLRAs). MLRAs have gained widespread popularity, as they address the limitations of VI-based methods [
24]. For this reason, different studies interested in LAI estimation have compared the performance of different MLRAs, and showed that the Gaussian processes regression (GPR), bagging trees (BAGTREE), and boosting trees (BOOST) are robust algorithms for LAI retrieval [
13,
25,
26]. However, a drawback of MLRA methods is their instability when applied to data sets deviating from the training data set [
25]. Therefore, additional investigation of such algorithms trained over a data set for crop-specific and mixed-crop estimation is required.
The third category includes the RTMs for the simulation of canopy light interception processes. Several authors have suggested the use of such complex models to exploit the full spectrum acquired with the RS sensors. Nevertheless, RTM calibration requires several input parameters, where the potential lack of these inputs could induce several uncertainties compromising the estimation accuracy [
13].
Numerous LAI products have been developed, according to these different methodological approaches [
14]. Gonsamo and Chen [
27] used the MODIS/MISR sensor, with a parametric approach (LAI–VIs relationship), to obtained a global LAI at 250 m spatial resolution and 10 days temporal resolution. Yan et al. [
28] used SNPP/VIIRS data retrieved a global LAI product at 8 days of temporal resolution and 500 m of spatial resolution by RTM. Moreover, García-Haro et al. [
29] used an AVHRR sensor to obtain an operational LAI product at 1.1 km of spatial resolution and 10 days of temporal resolution using GPR. Baret et al. [
30] used a SPOT/VEGETATION sensor and, based on a hybrid non-parametric approach (Neural Network and GPR), retrieved a global LAI with 10 days of temporal resolution and 1.5 km of spatial resolution. Despite the large number of retrieved LAI products and the efforts to develop new and updated algorithms for LAI estimation, the available products are not yet capable of capturing the site-specific variability required in many agricultural applications, Therefore, specific LAI data sets with higher spatial and temporal consistency are required. To cope with the spatial and temporal variability of heterogeneous agricultural systems, high spatial resolution satellite systems (10–30 m), such as Sentinel-2, have been gathering growing interest in the field of agronomic research [
24,
26]. Thus, studies devoted to improving the accuracy and spatial resolution of LAI estimation are needed, in order to respond to the increasing demand for tools to support the site-specific management of crops and landscape [
31]. Therefore, this study was focused on the robustness of non-parametric approaches in relation to different sources of variability such as crop species-, growth stage-, farm-, and year-specific conditions, in order to capture site-specific variability.
The objectives of this study were: (i) to evaluate the potential of non-parametric algorithms at pixel level (within field) and at field level for multi-crop and multi-temporal LAI retrieval; and (ii) to test the temporal consistency of the retrieved LAI at field level for crop monitoring over the entire growing season. Thus, to achieve these objectives, we performed an intensive field campaign, contemporary to S2 data acquisition, to collect ground-LAI measurements collected in Tuscany (Central Italy) over two growing seasons (2018 and 2019), including three crops (i.e., winter wheat, maize, and alfalfa), characterized by different growing periods and canopy structures, and considering different agronomic conditions (i.e., three farms in three different sites). Indeed, the final database (ground-LAI and S2 spectral data) is a potential contribution for other studies, thanks to the spatial–spectral–temporal characteristics of LAI data.
4. Discussion
Considering spectral bands (e.g., near-infrared), indicators (e.g., NDVI), and final useful products (e.g., land-cover), there is a need to obtain valuable information for the development of LAI, as elaborated for the Copernicus indictors and products. Thus, assessment of the statistical relationship between ground-LAI and satellite remote sensing data for LAI prediction requires assessing the accuracy of LAI retrieval methods, according to ground-LAI estimation errors [
42,
57]. In this study, we measured ground-LAI for three crops, and showed a wide range of values, compared with those reported in the literature. Specifically, Revill et al. [
58], using SunScan, exhibited a lower range of ground-LAI values (min, 0.5; max, 3.5) for winter wheat. However, our field data covered the entire cycle, from emergence to maturity, and agreed with Upreti et al. [
26] (min, 1; max, 6.5).
Regarding maize, Facchi et al. [
59], using LAI-2000, a ceptometer, and a Hemispherical camera in an experiment to compare (with destructive sampling measurements) the range of maize ground-LAI, the results were comparable to those we sampled in the field (min, 1; max, 5). Finally, for alfalfa, Verger et al. [
60] measured a range of ground-LAI (min, 0.8; max, 6.5) that was lower with respect to our collected maxima (LAI > 8) using the LAI-2000 instrument. From this literature comparison, the only anomalous apparent difference of ground-LAI values was observed for alfalfa. However, the average value was coherent for different ESU in the same field for the 2018 sampling at flowering date (just before mowing) and, so, it was not in disagreement with the analysis. This difference, with respect to the literature, could arise from the uncertainty due to the optical instrument, the phenological stage, and the crop reflectance response [
61,
62].
Uncertainty of LAI estimations could arise from many factors, including crop type, farm management, and temporal variability of ground-LAI measurements [
42,
63].
In this work, we observed that, when using parametric approaches, SeLI proved to be the most suitable VI for LAI prediction under a mixed crop scenario. This result is in close agreement with previous findings highlighting that the combination of NIR and RE may provide more accurate LAI estimates for different crop types [
18,
34]. However, in the assessment of parametric methods, we also demonstrated that, at pixel level, the accuracy of LAI estimation was strongly affected not only by the VI selection but also (and not in a negligible manner) by the regression function as well as the parameterization data set [
20,
63]. The results of VIs also demonstrated the better LAI prediction ability of SeLI for wheat and maize, compared to alfalfa. The different LAI prediction ability of Vis, according to crop types, is in agreement with Herrmann et al. [
64]. In this regard, Dong et al. [
17], using RapidEye reflectance data and ground-LAI data of different crop types, demonstrated that VIs based on visible and RE regions are generally affected by chlorophyll, water content, and the structural properties of leaves and, therefore, the LAI predictability of a VI may vary markedly among crops and growing stages, according to canopy characteristics. Furthermore, when the crop-specific and mixed-crop parameterization were compared under both regression methods based on linear and logistic models (LM and LogIF
d), we demonstrated that the parameterization data set strongly influenced the LAI prediction and its accuracy. This result built on the study of Nguy-Robertson et al. [
20] who, using hyperspectral data, found that RE-based VIs are little affected by crop type and, thus, may facilitate the prediction of LAI for different crops characterized by different canopy structure. This finding is very important for the future availability of operational hyperspectral missions, such as the foreseen Copernicus CHIME and NASA SBG [
31,
65]. Thus, to obtain an accurate LAI prediction, the selection of suitable VIs is critical, as ideal VIs should be sensitive to the ground-LAI but insensitive to interference factors (e.g., the soil background, canopy structure, and chlorophyll content) [
66]. Thus, despite the fact that SeLI yielded the most accurate results, it was also demonstrated that, due to the above-mentioned uncertainties, it might be not as accurate at pixel level.
Moreover, we showed how the three MLRAs (and, in particular, GPR) were able to exploit all information available from a multispectral data set, thus providing more accuracy in LAI prediction compared to VIs. Indeed, considering the relative weight of bands in the GPR algorithm, it was observed that, although the NIR and RE bands were the most relevant for LAI estimation, the other bands also contributed—varying according to the crop type and development stage—to LAI prediction. This finding is in agreement with previous outcomes. Delloy et al. [
67] showed how the RE S2 bands (B5, B6, and B7) can improve winter wheat LAI estimation by using a hybrid approach (RTM + neural network). Verrelst et al. [
13] compared different MLRAs, using simulated Sentinel-2 reflectance data over different crops types, and concluded that GPR was the most effective algorithm for LAI retrieval. However, despite the promising results of MLRAs, it was also observed that the performance was influenced by the training data set (i.e., MC vs. CS). This is in agreement with Mao et al. [
25], who tested the influence of sample size on MLRA performances and highlighted how its influences the accuracy of the algorithms.
Regarding the temporal consistency of LAI estimation, our results showed that LAI retrieved by the parametric approach, at pixel level, was less suitable, with respect to the non-parametric approach. Indeed, the parametric method based on VIs showed a low accuracy, in terms of representing variability within the field, due to a saturation effect occurring especially at high vegetation density. In contrast, GPR allowed us to point out such variation, regardless of the crop development stage [
58]. The low ability of parametric methods to account for the within variability of LAI was evidenced by the weak metrics of cross-validation carried out at pixel level. In particular, LM made it evident that parametric approaches may lead to the overestimation of LAI at early stages of crop development and underestimation at full canopy development. In this latter condition, VIs showed their limit in detecting LAI variation, due to the well-known issue of saturation [
45]—a limit of VIs that was particularly exacerbated in the case of alfalfa, which reaches canopy closure, after resprouting, in less time compared to a winter cereal (wheat) and a row crop (maize). Conversely, GPR showed a high ability to detect LAI variation at the pixel level, regardless of the development stage, vegetation density, and crop type.
It is well-known that the canopy reflectance is affected by several biophysical and biochemical variables and, thus, the regions of the reflectance spectrum can be associated with different vegetation properties [
68,
69]. Therefore, exploitation of the full spectrum with non-parametric methods can improve the quality of LAI retrieval [
13]. Indeed, our results demonstrated that the GPR outperformed the parametric methods; in addition, it was the most accurate MLRA for LAI prediction at both field and pixel level. The results of this study showed that the VI-based parametric method had a lower accuracy for LAI retrieval than MLRAs. These results suggest that GPR based on Sentinel-2 multispectral images is promising for crop monitoring, from a multi-crop mosaic scenario perspective. Further work will involve applying the MLRAs trained in this work to verify the model stability when applied to an independent data set; this analysis will allow for a full assessment of the robustness and exportability of the model developed.
The capacity of MLRAs to deal with full spectral information is a promising aspect that makes these approaches candidates for the investigation of new-generation hyperspectral data available from ASI-PRISMA, as well as those expected from the foreseen Copernicus CHIME and NASA SBG missions.
5. Conclusions
In the present study, the capability of different parametric and non-parametric methods for the retrieval of LAI for different arable crops using Sentinel-2 data was assessed. The accuracy and robustness of LAI estimates were compared, based on repeated in situ ground-LAI measurements throughout the crop growing season. Regarding the VI-based parametric regressions, the normalized index SeLI was evaluated as more suitable (i.e., than NDVI and NBR) for LAI retrieval at field level, providing good evaluation metrics by the cross-validation analysis for winter wheat and maize. However, VI-based parametric methods were shown: (i) to be unsuitable for LAI retrieval of alfalfa and mixed crop scenario; (ii) to have a very low accuracy for LAI retrieval at pixel level; and (iii) to have an accuracy of prediction the largely depends on VI selection, the fitting function, and the parameterization data set.
Among the non-parametric regression methods evaluated, the best-performing MLRA belonged to the kernel machine learning regression algorithms. Indeed, GPR was evaluated as the best-performing algorithm for LAI prediction, for the three arable crops evaluated. Using GPR, Sentinel-2 imagery can be used to map the spatial variability of the LAI of different arable crops, having prediction accuracy which is very high at the pixel level regardless of the crop type, growth stage, and the training data set. Moreover, GPR analysis of spectral bands in different phenological stages provided information on the relevance of relative bands in contributing to LAI prediction. However, further studies are required to fully assess the potential of GPR across different crops in different areas, as well as under contrasting agronomic conditions.