3.2. Soil Management Classification
In satellite imagery, different soil management can be interpreted as a variation of vigor for a specific field or a more widespread area. For vineyards, the signal returned by satellite images is the sum of spectral signatures from different elements (i.e., the main crop, bare or grass-covered soil, and other materials such as pruning residues or stubble that can be found in the inter-rows). For this study, the TS of every VI was analyzed to see whether the spectral response obtained was able to identify variations in soil management. The initial assumption was that higher reflectance values for the TS of every considered VI corresponded to grass-covered vineyards, suggesting a higher biomass content at the field level, while lower reflectance values were assumed to belong to vineyards with less vigor due to the adoption of traditional management practices that tend to leave a bare soil in the inter-rows during the growing season.
Areas were mapped using the unsupervised classification by the K-means clustering algorithm, considering two classes. For each class, the corresponding average trend (
Figure 5) was also calculated to correctly interpret the results. The clustering algorithm generated a vigor map (VM) with two classes (or clusters) for both calibration and validation areas (
Figure 6 and
Figure 7). All VMs presented one cluster with higher and another one with lower reflectance values (hereinafter referred to as clusters 1 and 2, respectively). Differences between the two clusters were less evident during the first part of the agrarian year, probably due to the period of dormancy during winter and the reduced amount of green vegetation within the vineyards until the beginning of spring. Starting in May, differences between clusters usually became more evident when the initial mechanical operations were carried out in the fields (typically, tillage or mowing). From that time on, vine canopies started to expand together with the increase in soil coverage.
The unsupervised classification results (
Table 5) indicate that traditional indices such as NDVI provided a more balanced distinction between the two managements, slightly in favor of GC (51.9% and 50.4% in Carpeneto and Gavi, respectively) as opposed to CT (>48% in both areas). On the other hand, classification using NDRE was more in favor of GC (>57%), and in detriment of CT (<43%) for both areas. Results from NDWI appeared closer to reference values obtained by photointerpretation, tending to overestimate GC (44.7% and 37.7% in Carpeneto and Gavi, respectively) compared to the reference values obtained using photointerpretation. Regarding the newly proposed index, NDVI
W performed poorly in the municipality of Carpeneto, by estimating 80.1% of the study area as vineyards with CT management and underestimating GC. In contrast, NDVI
W clustering results appeared to be more like the reference values obtained for the validation area of Gavi.
To test clustering results, the obtained VMs were compared to the reference map, where vineyards were identified via photointerpretation (
Figure 8 and
Figure 9). Accuracy for every considered VI was then evaluated through the metrics that can be derived by the confusion matrix (
Table 6). As a matter of fact, the assumption ‘higher reflectance equal higher vigor’ appeared to be true for all the considered VIs to some extent. More specifically, NDVI and NDWI performed better than the others, with more appreciable results for the OA (>0.7) in the calibration area and discrete ones (OA > 0.6) in the validation area. The unsupervised classification method was able to identify with a certain degree of accuracy the differences in soil management, which can be expressed in terms of major or minor vigor observed. NDRE results were appreciable for the calibration area (OA = 0.64), while those were slightly inferior in the validation area (OA = 0.58). At last, among the other VIs that have been considered, NDVI
W performed poorly, with OA ≤ 0.6 for both tested areas, showing lesser capabilities in detecting differences between the two clusters.
Among the other accuracy metrics, producer and user accuracy (PA and UA, respectively) presented a higher variability for every analyzed index. As well as for the OA, NDVI reported the best results in both areas compared to the other VIs. Above all, PA scored the best values (0.84 both in Carpeneto and Gavi) for the identification of pixels belonging to Class 2 (lower vigor), while PA for Class 1 (higher vigor) was mediocre (≤0.65). According to these results, the classifier was more able to detect vineyards (or parts of them) with traditional inter-row soil management. Similar behavior was observed for the other indices, with appreciable results predicting Class 2 using NDWI and NDRE (PA > 0.7), while NDVIW provided the lowest results for both areas (PA < 0.7). In a similar way, PA was quite low (<0.7) predicting Class 1 pixels. UA, on the other hand, reported higher values in Class 1 (UA > 0.8 in Carpeneto and >0.7 in Gavi) for NDVI and NDRE, while discrete results for UA in Class 2 (≥0.7) were obtained by NDWI and NDVIW.
Regarding the surface variations highlighted by the confusion matrix (
Table 7), the cluster analysis appeared to be less accurate in identifying vineyards (or portions of them) belonging to Cluster 1 (i.e., higher vigor), although NDVI and NDWI had more than 64% of the area that was correctly identified, and to a lesser extent, NDRE performed moderately as well (>50%), while lower results were those provided by NDVI
W (<40%) for the municipality of Carpeneto. Results in Gavi were slightly inferior at identifying Cluster 1, but all VIs performed similarly, matching the reference map and the corresponding cluster map (GC_Cl1) ranging from 42 to 52%. Even in Gavi, NDVI gave the best results (52.5%). For the identification of pixels belonging to Cluster 2 (i.e., lower vigor), matching results (CT_Cl2) indicated a better capability of all considered VIs in identifying the pixels correctly. In fact, results in both areas for the traditional VIs were almost 85% in NDVI, and higher than 70% for both NDWI and NDRE. Even in this case, only NDVI
W exhibited lower results (<70%). This difference between the two test areas may be related to factors affecting the capability of VIs to identify the different soil management, especially regarding the newly proposed NDVI
W. Further studies may be moving towards a better definition of this modified version of the traditional NDVI formula.
Results suggested that the bands in the red and infrared regions combined in the tested VIs may be able to detect variations in terms of soil coverage, especially in the red and NIR and, to some extent, in the red-edge and SWIR as well. The NIR has some advantages for the detection and assessment of vegetation canopies since a higher amount of light is reflected in this region (50–60%) compared to that reflected in the blue, green, or red (2%, 5%, and 3% respectively) for vegetation canopy [
107]. Since NIR is a combined function of leaf optical properties, canopy geometry, and soil reflectance [
108], it is also able to reflect the soil, which is characterized by lower reflectance values compared to vegetation signals.
In this research, VIs including the NIR band in their formula displayed the best results and respected the assumption that the higher vigor class observed could have been attributed to grass-covered vineyards, while lower values were related to a minor vegetation density, typical of vineyards managed in a traditional way, leaving bare soil or scarce spontaneous vegetation growing in the inter-rows. The SWIR spectral region is used instead for different applications, such as distinguishing cloud types and estimating water content in soils and plants. Although classification proved to be able to identify variations in vineyards, accuracy values were usually lower than those obtained by NDVI.
A consideration must be made: the classification performed by the operator generated a map in which the single field was classified in its whole, referring to a precise time of the year when the orthophotos were taken. The automatic classification, on the other hand, was pixel-based, distinguishing different portions within the field and identifying the areas with greater vegetation and those where the vegetation (whether vine plants or the vegetation in the inter-row) growth was stunted. This may be one of the reasons explaining the fact that producer and user accuracy presented differences for the same class since the reference map is a simplification of the reality observed on the ground, considering the vineyard as a whole without regard to the spatial variations that occur during the GS. Despite the reduced geometric resolution of the S2 data, the use of cluster analysis was able to highlight the differences in vigor within every field. This spatial variability was probably due to both the different vigor of vine growth and the presence of spontaneous herbaceous plants growing into the fields as well. Thus, vineyards that in the reference map were originally classified with tilled inter-rows presented spots with higher vegetation that had been classified as grass-vegetated areas by the unsupervised classification. Oppositely, vineyards identified as having grass-covered inter-rows could have presented sparse vegetation in some portions of the field, especially in the middle or along the boundaries.
3.4. Relationship between Ground Cover and Remotely Sensed Data
The capability of VIs to provide information regarding vegetation cover, vigor, and growth dynamics [
76,
109,
110] is known, although other studies showed that such information could be obtained using native bands provided by satellites in different spectral regions as well [
111,
112,
113]. With this in mind, we aimed to assess the capability of both VIs and S2 native bands to describe the inter-row degree of coverage taking place within the vineyards with different soil management. Once the ground cover was estimated, a further analysis was carried out to evaluate if the S2 native bands and the considered VIs were able to describe the degree of vegetation cover within the inter-rows based on the different types of soil management during the reference period (agrarian year 2017–2018). To do so, the regression between the TS for every piece of satellite data (either S2 native bands or the derived VIs) and inter-row’s ground cover estimation was calculated considering different time intervals. In the first case, the agrarian year was analyzed as a whole. The TS was then split into two parts to test the regression results and evaluate the presence of a seasonality effect.
The first part considered the period ranging from November 2017 until the end of April 2018, when the first tillage or mowing operations were carried out in CT and GC plots, respectively. During this time interval, it is assumed that the vines are in the vegetative rest phase, with no (or very low) green canopy, and therefore have a reduced influence or almost nothing on the reflectance signal recorded by the Sentinel-2 sensors. Having made this assumption, values in the different spectral regions and VIs originated almost exclusively from the spontaneous vegetation growing in the inter-rows of the vineyards.
The second time interval instead ranged from the beginning of May, after tillage or mowing (based on the different soil management per experimental plot), and lasted until the end of the agrarian reference period in October 2018. During this second period, vine canopies kept growing over time, supposedly affecting the capability of remotely sensed data to describe inter-rows coverage development. From the end of April to early May, however, we assumed that the signal relating to the degree of coverage of the inter-rows was also influenced by that of the developing vine plants until the senescence phase took place in autumn. For these time intervals, first-order polynomial linear regression was calibrated to assess the correspondence between the two datasets and to verify the presence of a “seasonality effect”. Error assessment was also measured considering MAE and RMSE (
Table 8). The aim was to identify the most suitable band or index to describe inter-rows vegetative growth in relation to the different periods of the year. Appreciable results were those whose coefficient of determination reported
values higher than 0.7.
For both plots, first-order polynomial linear regression showed lower results when the entire reference period was considered (
Figure A1). More specifically, all S2 native bands performed poorly (
< 0.2), and similarly, VIs had low
values but were characterized by higher variability within the results. Only NDVI scored
= 0.61 and 0.43 for CT and GC, respectively. To some extent, the NDVI was the only index that showed, in the long term and on vineyards with tilled inter-rows, a good predictive capability of the coverage degree. Gaps between MAE and RMSE were considerable, suggesting a high variability within the datasets, especially for the GC plot. However, taking into consideration the two intervals separately, the results changed.
During the first part of the reference period (
Figure A2), S2 native bands, and more in detail, those recorded in the visible, displayed a very high coefficient of determination (
≥ 0.9 in CT and >0.8 in GC). The bands ranging from the red-edge to the SWIR regions highlighted lower values, but the results were satisfactory nevertheless. CT coefficients were higher (
≥ 0.75) compared to those recorded for GC (
≥ 0.6). The best records for both CT and GC in the red-edge band were obtained by B5 (
= 0.87 and 0.78 for CT and GC, respectively), with good results given by the NIR band for CT (
= 0.83), while values for GC were the lowest among the native bands (
= 0.6). In a similar way, SWIR bands gave good results for both plots, suggesting that further analysis on these bands could be carried out. Regarding the VIs, results obtained for CT indicated that all tested indices were able to predict ground cover in tilled vineyards in a suitable way, while results for GC displayed a higher variability, highlighting the importance of choosing the most fitted index to describe inter-row vegetation development based on soil management adopted within the vineyard. In the same way, gaps in MAE and RMSE values were smaller in CT compared to those in GC, and as a reflection of
results, those gaps were higher for GC vegetation indices. More specifically, NDRE performed better than the other indices both in CT and GC (
= 0.77 and 0.78, respectively), while NDVI obtained the worst result in GC (
= 0.10). The newly proposed index NDVI
W, on the other hand, performed very well in CT (
= 0.77) and reported a decent value in GC (
= 0.52). Oppositely, NDVI and NDWI reported the worst coefficient values (
< 0.3) in this first timescale.
After mechanical operations in spring, when soil tillage in CT and the first mowing in GC in the inter-rows were carried out, the influence of the growing vine canopies on the signal detected by satellite sensors increased over the season. That determined a reduction in the accuracy of inter-row cover assessment in the second part of the GS (
Figure A3). Since the satellite sensors’ reflectance signal also started being influenced and describing the growth dynamics of the vine plants,
values were affected in a variable manner according to the different spectral band that is taken into analysis. Like in the previous timescales,
values were higher in CT other than in GC. Among the spectral bands, the best responses for both plots had been observed in the green (B3,
= 0.76 and 0.69 for CT and GC, respectively) and red-edge bands. Regarding the red-edge bands, B5 and B6 performed better in CT (
≥ 0.7), while growth dynamics in GC were best described by B6 and B7 (
≥ 0.7). Values in the NIR band as well were appreciable for the GC plot (
= 0.74), while the coefficient was reduced in CT (
= 0.64). Apart from that, results for both SWIR bands returned appreciable results for neither considered management, even though the NDWI performed better among all VIs for the CT plot (
= 0.73), while values decreased significantly in GC (
= 0.22). Similar behavior was observed for NDRE (
= 0.68 and 0.51 for CT and GC, respectively). While NDVI reported the lowest values in both cases (
< 0.3), NDVI
W results were quite low but still noticeable (
≥ 0.4). In this phase, VIs, in general, proved less suitable than the previous time interval to describe the evolution of ground cover.