1. Introduction
The world’s cultivated area depends on crop yields, as increased yields would reduce the area of land used for agriculture. This leads to considerable efforts to produce according to efficiency criteria, more with less [
1]. Rice is the staple food in several countries and is essential for food security [
2].
Breeding programs are vital to increase yields, but their techniques are usually time-consuming [
3]. In this sense, efficient crop management through Precision Agriculture (PA) tools can be an alternative to improve average yields [
4]. Precision Agriculture is based on the recognition of spatial and temporal variability in crop production. It uses information technology (IT) in agricultural management with the aim of increasing productivity. Crop management is carried out considering a yield goal per cultivar; field decisions during the season when an adverse condition occurs, e.g., weather constraints, failures in management, or pest incidence, are based on a large amount of data from different sources. Therefore, several authors studied rice yield variability to build a pre-harvest forecast model, joining the different crop inputs [
5,
6,
7,
8,
9]. The ORYZA2000 model developed by the International Rice Research Institute (IRRI) is a suitable example [
10], but its applicability depends on the availability of an enormous database [
11].
The use of PA requires different approaches to quickly evaluate field situations [
12]. As a solution, in recent years, remote sensing (RS) in satellite platforms has emerged as a helpful technology in improving yield forecasts [
13]. Images acquired by the Sentinel-2 satellite have introduced a new era in RS applications due to free data access, high temporal (5 days) and spatial resolution (10 m), and accuracy in measured reflectance [
14]. Other existing technologies can be of higher spatial resolution, i.e., drone systems and equipped cameras on the farm machinery [
15]; furthermore, private satellite platforms, such as the Planet Dove constellation, exhibit a better temporal resolution [
16]. However, the open access to Sentinel-2 derived from the European Union (EU) public service demands novelty studies to deeply understand the data profitability for agronomic decisions at the farm scale [
17]. Both industry and public institutions can benefit from analyzing the relations between crop productivity and data from public investments in new technologies, a key challenge in the EU’s future [
18].
Currently, in the application of Sentinel-2 data, the analysis at the within-field level based on commercial fields across different years and areas to obtain an interpretable framework is a critical necessity [
19,
20]. The analytic methods to estimate yield variability are changeable. According to de la Torre et al. [
21], empirical, process-based crop models, and semi-empirical models are the three main categories. On the one hand, empirical models are simple and site-specific; the equations are dependent on the database used in the study. On the other hand, process-based and semi-empirical models can be mechanistic and have high accuracy; however, several parameters about crop management and growing are required [
22]. These mechanistic methods can lead to limitations similar to those of the classical models described above due to the needed complementary sensor systems and the problematic implementation in a changing agricultural system at the commercial level [
23]. New efforts in the empirical approaches using Machine Learning (ML) techniques could obtain a transferable solution at the within-field level [
24,
25,
26] and provide a fast yield forecast. However, interpretability is still challenging when the results are applied to a different database [
27]. In addition, linear regression analysis can be performed to interpret ML outputs. The literature has widely studied linear regression, which is usually considered a pre-analysis for developing RS-based models [
28]. Especially, the linear correlations between surface reflectance and important biophysical parameters in assessing crop status, such as the Leaf Area Index (LAI), Leaf Chlorophyll Content (LCC), or biomass, are used to explain the implementation of predictive models for such parameters [
29].
Regarding this, leaf reflectance is directly correlated with the mentioned biophysical parameters [
30], but regarding yield assessments, correlations can turn indirect, and the expected implications that reflectance and derived products show on yield can be highly variable [
31,
32]. Farmers aim to obtain fast control of all yield input parameters; for that reason, crop systems are often designed based on linear correlations to modify crop management rapidly [
33,
34]. The difficulty relies on identifying the limit of linear correlations; for instance, nitrogen rates are increased linearly up to a saturation point, and a weed incidence assessment is performed considering a partial linear correlation with yields in pest and disease management or weather effects [
35,
36,
37,
38].
Despite having the Sentinel-2 constellation fully completed at the operational level since 2017, few studies have simulated the real applicability of rice, comparing various years and regions at the within-field level [
32]. The implementation of models to estimate yield variability in the rice industry when satellite imagery is used is commonly based on the spectral band’s combination in Vegetation Indices (VIs), aiming for a normalized value to interpret the measurement efficiently [
39]. Several indices were designed based on linear correlations among bands, canopy properties, and the negative collinearity in comparing visible (blue, green, red) and near-infrared (NIR) [
40]. In different satellite platforms of lower spatial (MODIS) and temporal resolution (Landsat missions), the use of VIs has been the solution to ensure easy data interpretability in cereal crops [
41], but the results were highly variable to confirm the actual yield variability accurately [
42]. The yield physiology may be uncorrelated with LAI, LCC, and biomass, owing to the carbohydrate partitioning in the grain-filling process [
43]. Specifically, the correlation between reflectance and yield may vary with rice growth stages, summarized as intensive vegetative growth to produce fertile tillers, the process of panicle differentiation and flowering, and the accumulation of biomass in the grains, consuming the previous vegetative structures [
44]. Therefore, the expected knowledge of RS can be altered to interpret each growth stage, modifying some usually assumed correlations; for example, an excessive crop greenness can result from low grain filling, resulting in higher NDVI but lower yields [
45].
Recent studies have used the satellite bands from Sentinel-2, without combination in an index, to obtain linear regression models in wheat, corn, soybean, and rice [
46,
47,
48]. The results were promising for better transferability because when only VIs are used, one spectral band can be overwhelmed by the other, losing information in the actual reflectance at each spectral region. Thenkabail et al. [
49] reported that the red band reaches a saturation value level before the NIR band. The increasing resolution and accuracy for future spatial missions add special significance to analyzing individual reflectance values. The Sentinel-2 mission provides new bands at a 20 m resolution in the Red edge region (wavelengths between visible and NIR) and the Short Wave Infrared (SWIR) region. Spectral signatures of crop canopies can improve the applicability of the VIs, enhancing their interpretability. Similarly, ML algorithms require the analysis of basic correlations between each spectral band and crop yields at the within-field level. Thus, a fusion among spectral properties, VIs, and yield at commercial farms to study interpretable correlations between all variables lacks extensive database works in the rice industry.
The study of Franch et al. [
48] in rice showed an interesting correlation between Sentinel-2 data and yields; such work was performed on a small dataset (<70 ha) and a single year and location. The results supported the industry’s utilization of RS commercial platforms (mainly Sentinel-2); however, more studies are needed to validate correlations in different years and locations. Skakun et al. [
47] reported, in corn and soybean, a critical range of correlations, i.e., 0.67, between the yield and spectral bands in a two-year study in commercial farms. Similarly, Maestrini and Basso [
50] found low correlations in corn, soybean, wheat, and cotton, showing an important threshold value of 0.50 in the Spearman correlation coefficient. Thus, as also reported by Kayad et al. [
51], when several years are considered, stable correlations could be hard to find and interpret across the time-series.
It is supposed that surface reflectance and VI exhibit correlations with crop status and yield; thus, the research gap is placed in simulating operability for agronomic decisions. Are the correlations between Sentinel-2 data and yield significantly the same across years, locations, and growth stages? Do Sentinel-2 data always present the expected behavior for each spectral band and VI? Detailed analysis of the collinearities between spectral bands across growth stages, combined with the classical VI performance, may be crucial to provide a framework for data interpretability.
As a consequence, the central hypothesis of the present work is the existence of significant correlations between all Sentinel-2 spectral bands or VIs and yield variability for each growth stage within the field and for a broader data set representative of commercial fields spanning different years and locations. Four years of yield maps acquired by a combine harvester in 404 commercial fields (a total area of 706 ha) distributed in two Spanish locations (Valencia and Seville) were studied. We aim to study such correlations and the collinearity between bands and VIs, seeking the satellite data applicability to improve crop management. The results of the hypothesis would provide a source of information to ensure a more efficient implementation of Sentinel-2 in Spanish rice cultivation. Therefore, this work transfers the interpretability of the data rather than an empirical yield forecast.
4. Discussion
In
Section 1, the actual within-field yield anomaly provided the potential yield lost or gained compared to the average field. Few studies on rice cultivation analyze the actual within-field variability. Sarasso et al. [
67] reported a maximum standard deviation of 1013 kg·ha
−1 in Italy. Also, Bellis et al. [
68] found a similar deviation in Arkansas (U.S.). Despite the different metrics, this value would have a higher absolute variability than our study. The differences could be explained in terms of the experimental design; the first-mentioned study carried out different PA experimental practices, whereas, in our study, all fields followed the standard grower practices with no PA practices. Older studies, such as the one by Roel and Plant [
69], showed differences between the maximum and minimum pixel of around 5000 kg·ha
−1 in California (U.S.); in that case, fields of larger areas (38 and 52 ha) were studied. In Russia, in a 3.60 ha field, Dobermann [
70] measured differences of about 8000 kg·ha
−1 between the maximum and minimum yield, attributed to extreme weed growth. The variability found in our work was low.
The yield variability when the data were considered as a single matrix (mixing the 10 m pixels of all fields by location and year) made studying the correlation with Sentinel-2 reflectance and VIs possible. For instance, mean-field yields at location 1 differed by an average of 4609 kg·ha−1, comparing the best with the worst field for each year; at location 2, the difference was 1940 kg·ha−1. These differences between fields would be notable since the cultivar was the same, and there were no water supply restrictions. Regarding the productivity increase, the differences would show an interesting yield gap in both locations, indicating the existence of crop management with excellent yields in some fields. The applicability of Sentinel-2 data should aim to find this variability during the growing season.
Locations and years were analyzed as separate matrices, with a similar study conducted by Maestrini and Basso [
50] on corn, wheat, soybean, and cotton, who reported a significant influence of weather variables when years and locations are mixed. Several studies focused on modeling weather effects on rice yield, finding complex interactions between them and crop yields [
71,
72,
73]; in our research, we considered year-by-year and location-by-location to avoid these interactions. As a result, we can believe that the variability would be due to other yield input parameters. Therefore, the fields in each location were nearby, featuring highly similar annual weather trends. Thus, the main differences appeared between years and locations. Future work should be addressed to examine the inconsistencies in the
R values. We could not find a clear pattern for integrating weather variables into our analysis. This assumption was derived from the database structure, i.e., crop management differed in each field, resulting in multiple strategies to avoid weather constraints. For example, the delayed sowing in 2022 at location 1 provoked essential changes in the timing of drying downs for herbicide spraying and top-dressing fertilization; some farmers carried out shorter drying downs to mitigate the detrimental effect of high temperatures. Consequently, the high temperatures may have impacted the fields differently. The correlation value in a year and location comparison could be due to weather adversities, as different authors have recently developed models integrating satellite data with meteorology [
74,
75]. Models incorporating weather variables are designed to provide the potential yield loss, but crop management can be crucial to modify the expected behavior [
76,
77]. Few studies have analyzed the physiological response to different crop-management strategies in extensive data at the within-field level [
78,
79]; regarding this, we found difficulties in explaining the differences between locations and years.
The correlations between Sentinel-2 and yields were also variable depending on the spectral region and growth stage. Firstly, the
R values were similar to those obtained by the literature in similar studies with drone imagery and ML techniques; in this regard, Bellis et al. [
68] also reported low yield correlations. Correlations were of high values when an external source of variability was studied, e.g., different nitrogen rates [
80,
81]. Secondly, in Sentinel-2 studies, high correlations can also be found when external variability is included in the experimental design [
82]. From this consideration, few studies on rice at the within-field level examine the general estimation of yield variability with Sentinel-2 in a large dataset. Consequently, a comparison with similar studies in the experimental design is addressed to field-level studies; de la Torre et al. [
21] reviewed the state-of-the-art reporting with satellite imagery, low coefficients of determination, and considerable errors. In addition, nonlinear and ML approaches showed similar final transferability to the linear analysis.
In other cereal and soybean crops, we can find studies at a within-field level in commercial fields with a similar experimental design. Maestrini and Basso [
50] compared airborne and satellite imagery using yield maps from previous years; the correlations with yields using the images were weak and similar to those reported in our study. The historical yield map was a better source of information for anticipating variability when repeated over the years. However, they found a representative number of fields with errors in the estimation when such variability was not repeated, making the images a suitable alternative. The results of those authors could be related to the present work. Thus, the weak correlation in a large dataset of commercial fields should be an expected result. As noted in the study, the correlation between yield and the red reflectance was weak but consistently negative over locations and years. Similarly, Skakun et al. [
47] reported higher importance of NIR reflectance in estimating the yield in various studies [
46,
83]. However, the correlation values were also insufficient; other authors have shown the year dependence [
51] and random patterns as significant limitations in providing an applicable framework [
84]. Consequently, the industry’s applicability is focused on identifying similar management zones based on VIs rather than absolute yield values [
85]. We can find recent studies in the EU with the use of Sentinel-2 [
86]. Thus, despite the difficulties in finding high-accuracy models, PA practices consider such limitations and base its applicability on the positive or negative sense of correlations with yields, as well as the theoretical basis that reflectance shows with some biophysical parameters (LAI, LCC, or biomass) [
87,
88,
89]. Visible bands strongly correlate with the chlorophyll content, decreasing its reflectance values when the pigment content is increased [
39]. The correlation between a high chlorophyll content and a high yield can be found in several studies about rice genetics [
90,
91,
92], showing an indirect correlation between the reflectance and yield. Knowing this interaction, we can discuss the different results across the growth stages.
According to the literature, rice plants experiment with a high vegetative growth rate at MT, and a low visible reflectance would indicate a high rate (i.e., chlorophyll content is increasing) [
93], but the correlations with yield were weak. Interestingly, we could assume that since tillering to harvest, more variables were affecting the yield besides the chlorophyll content. The importance of visible bands in the other growth stages was less noticeable, suggesting that the crop greenness had a low implication on the yield at PI and GF. The different senses of correlations for the spectral bands B06, B07, B08, and B8A can also be discussed based on the leaf optical properties. The NIR reflectance has no direct correlation with the pigment content; all the radiation is reflected or transmitted by the cell structures of the leaves, increasing with the amount of canopy [
94]. The highest amount of canopy indicates high values of vegetative biomass, and this biomass can quickly provide more fertile tillers, the number of grains, and weight if the carbohydrate partitioning is correctly maintained [
95].
The relation between NIR reflectance and the canopy amount reported by the literature would explain a positive correlation with the yield variability [
45]. The highest correlation was obtained at PI when the vegetative growth ended, and the amount of canopy reached saturation. This correlation was lower at GF; this inconsistency could be associated with higher variability in the sub-growth stage sub-growth panicle presence. That means, at PI, the plants are very uniform in the vegetative canopy; however, at GF, the panicles are visible, and the canopy loses its uniformity, coexisting vegetative with reproductive structures. Moreover, from flowering to harvest, the subgrowth stages within the field can be more variable than in the vegetative period. At this point, the NIR radiation reflected by a canopy with panicles would be different from that without panicles, with differences in the panicle ripening. The panicles cover the leaves, and there is no clear evidence in the literature of how this can modify the reflectance.
The last significant region in the results was the SWIR; those spectral bands are correlated with the water content, decreasing the reflectance with a water content increase [
96]. A high-water content would indicate a non-stressed plant; thus, when those bands or the LSWI are used, the goal is to find a negative correlation with the yield in the spectral bands and a positive correlation in the index [
97]. However, the correlation in the spectral bands was positive at MT, with remarkable collinearity with the NIR region and closer to zero in the other growth stages. LSWI exhibited the worst correlation among all the indices. Given those results, SWIR would not be a suitable region to study rice yield variability. The water sheet maintained on the fields could alter all the optical properties; thus, a low number of plants at MT would allow the satellite to measure the field water better, showing low reflectance values. For example, the values would be higher when the canopy is superior and covers the soil surface. The inundated conditions could explain the low correlation between NDVI and LSWI [
98]. B08 and B11 were positively correlated, losing the opposite sense of correlation that NDVI exhibits between B08 and B04.
To summarize the collinearity results, visible bands and B05 were highly correlated at the three stages; similarly, the NIR B08 band and B06, B07, and B8A were also highly correlated. Those results showed the dependence between spectral regions (visible, NIR, and SWIR), with the Red edge highly correlated with visible bands or NIR. The improvement given by the Red edge would be negligible, according to our study. This result is controversial with the new studies focused on exploiting the Red edge region, but, as far as our concern, those studies are more focused on other chemical and biological properties rather than the yield [
99,
100]. When the VIs were used, we observed an essential collinearity between them, and the correlation with the yield was similar or even lower than the bands. The VIs should be interpreted together with their band components, seeking the collinearities with the bands when this collinearity is the basis for interpreting the index [
101]. In using the NDVI, the lack of correlation between NIR and the red band would represent a distinct interpretability to the most common use since the fundament is the correlation between a high NIR and low red reflectance value [
102]. At PI and GF, the individual use of VIs would hamper the knowing of a low correlation between visible bands and the yield. The NIR region could better inform us about the yield variability, according to the variability of NIR around the Valencia and Seville land rice crop area (
Figure S2). The recommendation should be to perform different interpretations for B04 (red reflectance) and B08 (NIR reflectance) variability. The strong influence of chlorophyll in the visible bands, reported in the literature [
39], may have caused the low R. The NIR region, influenced by the cell structure, rather than the chlorophyll [
30], also showed a different behavior in our results. Therefore, future work is presented to study why the cell structure might better predict yield than chlorophyll content [
103].
The new VIs observed a slight improvement in comparison with the NDVI that it would obtain the exact estimation of yield variability using any of the other indices, except LSWI. The vast number of publications about the NDVI and its increasing use by the rice industry prompted us to consider this index as the best-normalized alternative for analyzing yield variability at the commercial level. Based on the correlations for each year and location, the best moment to use a normalized VI would be MT; however, for the use of a band, the best moment would be PI; in any case, at MT, the use of indices did not significantly improve the use of bands. The use of VIs should be adequate if the biophysical basis is previously confirmed. Thus, at MT, the increase in crop greenness and cell structures in a similar way would also result in similar correlations with the yield for spectral bands and VIs [
104,
105]. However, at PI and GF, the interpretability of VIs would be confusing, and it would not be the best option for studying yield variability.
This study has been focused on detailed research of direct correlations between the yield and spectral bands or VIs. Pearson’s and Spearman’s tests are simple methods that limit the current scope for yield estimations; however, the novel database of commercial yield maps requires first exploring simple statistical outputs. The present work can be useful in continuing the research to develop advanced yield forecasting models. Moreover, PA practices can be directly modified, especially when using VI or spectral bands at different growth stages. The work is considered a first step to ensure data interpretability, providing key information for various research gaps, such as yield estimations, PA practices, and rice physiology.