Next Article in Journal
Lightweight Improved YOLOv5s-CGhostnet for Detection of Strawberry Maturity Levels and Counting
Next Article in Special Issue
Remote Monitoring of Coffee Leaf Miner Infestation Using Machine Learning
Previous Article in Journal
Challenges of Digital Solutions in Sugarcane Crop Production: A Review
Previous Article in Special Issue
Some Geospatial Insights on Orange Grove Site Selection in a Portion of the Northern Citrus Belt of Mexico
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimating Cotton Yield in the Brazilian Cerrado Using Linear Regression Models from MODIS Vegetation Index Time Series

by
Daniel A. B. de Siqueira
1,
Carlos M. P. Vaz
2,*,
Flávio S. da Silva
1,
Ednaldo J. Ferreira
2,
Eduardo A. Speranza
3,
Júlio C. Franchini
4,
Rafael Galbieri
5,
Jean L. Belot
5,
Márcio de Souza
5,
Fabiano J. Perina
6 and
Sérgio das Chagas
7
1
Central Paulista University Center, São Carlos 13563-470, SP, Brazil
2
Brazilian Agricultural Research Corporation, Embrapa Instrumentation, São Carlos 13561-354, SP, Brazil
3
Brazilian Agricultural Research Corporation, Embrapa Digital Agriculture, Campinas 13083-886, SP, Brazil
4
Brazilian Agricultural Research Corporation, Embrapa Soybean, Londrina 86001-970, PR, Brazil
5
Mato Grosso Cotton Institute, Cuiabá 78049-015, MT, Brazil
6
Brazilian Agricultural Research Corporation, Embrapa Cotton, Campina Grande 58428-095, PB, Brazil
7
Amaggi Group, Sapezal 78365-000, MT, Brazil
*
Author to whom correspondence should be addressed.
AgriEngineering 2024, 6(2), 947-961; https://doi.org/10.3390/agriengineering6020054
Submission received: 21 February 2024 / Revised: 23 March 2024 / Accepted: 1 April 2024 / Published: 9 April 2024

Abstract

:
Satellite remote sensing data expedite crop yield estimation, offering valuable insights for farmers’ decision making. Recent forecasting methods, particularly those utilizing machine learning algorithms like Random Forest and Artificial Neural Networks, show promise. However, challenges such as validation performances, large volume of data, and the inherent complexity and inexplicability of these models hinder their widespread adoption. This paper presents a simpler approach, employing linear regression models fitted from vegetation indices (VIs) extracted from MODIS sensor data on the Terra and Aqua satellites. The aim is to forecast cotton yields in key areas of the Brazilian Cerrado. Using data from 281 commercial production plots, models were trained (167 plots) and tested (114 plots), relating seed cotton yield to nine commonly used VIs averaged over 15-day intervals. Among the evaluated VIs, Enhanced Vegetation Index (EVI) and Triangular Vegetation Index (TVI) exhibited the lowest root mean square errors (RMSE) and the highest determination coefficients (R2). Optimal periods for in-season yield prediction fell between 90 and 105 to 135 and 150 days after sowing (DAS), corresponding to key phenological phases such as boll development, open boll, and fiber maturation, with the lowest RMSE of about 750 kg ha−1 and R2 of 0.70. The best forecasts for early crop stages were provided by models at the peaks (maximum value of the VI time series) for EVI and TVI, which occurred around 80–90 DAS. The proposed approach makes the yield predictability more inferable along the crop time series just by providing sowing dates, contour maps, and their respective VIs.

1. Introduction

Cotton holds significant economic importance in Brazil, ranking as the world’s fourth largest producer and the second largest exporter. In the 2022–2023 season, Brazil cultivated cotton across 1.66 million hectares, resulting in a total lint production of 3.03 million tons [1].
The majority of the cotton cultivation occurs within the Brazilian Cerrado, characterized by flat and arable lands with favorable rainfall ranging from 750 to 2000 mm/year across the biome [2]. This region employs a highly mechanized, rainfed production system, with approximately 92% occurring without irrigation [3]. Typically, cotton is cultivated from January to July, following the soybean harvest [4]. The primary cotton producing states are Mato Grosso and Bahia, jointly contributing to around 90% of Brazil’s total output [5].
Estimating in-season cotton yield over large areas at a regional or national level provides critical information for farmers, policymakers, governments, crop insurers, and commodity traders. Yield estimations involve a combination of field surveys, remote sensing, statistical regression, and crop simulation models [6,7]. Mid-season crop forecasting is particularly vital for farmers, as it informs management decisions regarding harvesting, storage, transport logistics, and planning for subsequent crops [8,9]. Early forecasting cotton yield, before 90–100 days after sowing, provides valuable management information for implementing corrective interventions, such as adjusting input applications (e.g., topdressing fertilizer and growth regulators), and comparing predicted yields to historical or expected yields for specific fields.
While crop yield models have demonstrated adequate performances at the field scale, their applications across large areas face challenges due to the substantial data volume and computational processing costs [10]. To address them, statistical regression models such as multiple linear regression and machine learning techniques have been employed to integrate spectral indices from satellite data with climate variables at regional or national levels [10,11,12]. Table 1 presents various studies on cotton yield estimation utilizing statistical regression models or crop models combined with remote sensing across the regional and field scale under different platforms (satellite, airplane, unmanned aerial vehicle (UAV), and ground sensors) in diverse countries. Three of these studies developed regression models, including Random Forest, to forecast cotton yield at a regional scale in China [10], India [11] and Australia [12], using satellite-derived vegetation indices (VIs) and climate variables as covariates. The models incorporated precipitation, temperature, evapotranspiration, vapor pressure, soil moisture, and other climate variables. They utilized hundreds of data instances from thousands of hectares across different farm fields over multiple crop years for training and testing, resulting in root mean square errors (RMSE) for seed cotton yield of 157 kg ha−1 [10], 375 kg ha−1 [11], and 976 kg ha−1 [12] in China, India, and Australia, respectively.
Simple linear regression models have also been developed to predict cotton yield using solely satellite-derived vegetation indices as independent variables [13,14,15,16], unmanned aerial vehicle (UAV) [17,18,19,20], airplane [21], and ground sensor [22]. While vegetation indices alone have limitations in capturing the complex relationships among production, plant physiology, climate, soil nutrition, soil water parameters, pest and disease infestation, and crop management characteristics [10], several studies have demonstrated their ability to predict crop yield with reasonable accuracy [7,23,24]. On average, the RMSEs for seed cotton yield for the linear regressions were comparable to those assessed by using machine learning techniques and crop model approaches (Table 1), indicating the potential advantages of linear regression models. These models operate with a smaller set of well-defined independent variables, require less data manipulation, and utilize smaller data instances compared to other approaches. Moreover, the greater mathematical explainability inherent in linear equations for specific satellite products, vegetation indices, and crop phenological stages allows for direct transferability and easy adaptation to other locations, facilitating clearer understandings and comparisons among different models.
Table 1. A review of previous studies for estimation of seed cotton yield using remote sensing.
Table 1. A review of previous studies for estimation of seed cotton yield using remote sensing.
ReferenceCountryPlotAreaModelRS SourceRegressionRMSER2
haAproachModelkg ha−1
[10]China355-CV/RSModis/SentinelLSTM, SVM, RF3750.65
[25]EUA12150CM/RSSpectroradiometer 468-
[13]USA3188RSModis/LandsatLR6730.52
[14]USA--RSModisLR-0.16
[11]India--CV, RSModisRF1570.69
[12]Australia253-CV, RSLandsatRF976-
[17]USA15RSUAVMLR2610.87
[26]USA8050.65RSUAVANN, RF-0.72
[27]USA157RSModis/Landsat 4630.84
[28]USA--EM/RSSentinel --
[18]USA25506RSUAVLR5500.92
[29]Brazil190RSOptical sensordecision trees-0.81
[30]USA-73RSLandsatANN375–4700.71
[31]USA2120RSLandsatexponential4810.81
[19]Australia907RSUAVLR and quadradic-0.75
[15]USA949-RSModisLR-0.48
[16]USA240.2RSNASA dataLR-0.85
[21]USA481.5RSAirborneLR-0.47
[22]USA445.3RSSpectroradiometerLR-0.89
CV: Climate Variables, RS: Remote Sensing, CM: Crop Model, EM: Ecosystem Model, RS: Remote Sensing, UAV: Unmanned Aerial Vehicle, LSTM: Long Short-Term Memory, SVM: Support Vector Machine, RF: Random Forest, LR: Linear Regression, MLR: Multiple Linear Regression, ANN: Artificial Neural, Network, RMSE: Root Mean Square Error of seed cotton yield, R2: Coefficient of determination.
The current study assesses the feasibility of employing simple linear models to correlate cotton yield with VIs derived from satellite data obtained from the Moderate-Resolution Imaging Spectroradiometer (MODIS) sensors aboard the Terra and Aqua satellites. The aim is to forecast in-season cotton yield within the Brazilian Cerrado region. Reflectance time series data from multiple MODIS spectral bands were extracted to assess 15-day interval averages spanning from sowing to harvest of widely recognized VIs. These VI averages, computed over various 15-day intervals, were accessed as predictors (independent variables) within linear regression models to estimate cotton yield.

2. Materials and Methods

2.1. Experimental Areas and Dataset

The regression models utilized data from 281 commercial cotton farm fields (hereinafter referred to as plots) across the states of Mato Grosso (125 plots), Goiás (83 plots), and Bahia (73 plots) during the growing seasons from 2016 to 2022. Data from 2016–2018 seasons were gathered from surveys conducted by the Mato Grosso Cotton Institute [32,33] and the Brazilian Agricultural Research Corporation [34], while data from the 2019 to 2022 seasons were obtained directly from cotton farmers. Figure 1 illustrates the spatial distribution of the 281 plots. Most plots in Mato Grosso, Goiás, and Bahia were situated at the western, southern, and western parts of the states, respectively.
The dataset contains information on seed cotton yield (kg ha−1), plot area (ha), cultivar, plant line spacing (m), seed population (seed ha−1), sowing date, and water management (irrigated vs. rainfed) for 281 plots. The average seed cotton yield was 4209 kg ha−1, ranging from 694 kg ha−1 to 7361 kg ha−1. Low-yield plots occurred in rainfed plots located in Mato Grosso and Goiás during the 2016 and 2022 seasons, likely due to drought or irregular rainfall [35]. Conversely, most of the high-yielding plots were found in Bahia under central pivot irrigation and in Mato Grosso and Goiás during seasons with favorable rainfall distribution (2017 and 2018). Only 12% (34) of the plots were irrigated, all located in Bahia (21) and Goiás (13). The remaining 88% (247 plots) were rainfed, with plots in Mato Grosso (125), Bahia (52), and Goiás (70) relying solely on rainfall. Plant line spacing was predominantly 0.9 m, with an average seed linear density of 10 seed m−1, resulting in an average population of 111,111 seed ha−1. The average plot area was 188 ha and the average cotton cycle lasted 196 days. Cultivars FM975WS, FM944GL, FM913GLT, and FM940GLT (FiberMax, Basf, São Paulo, SP, Brazil), TMG42WS, TMG47B2RF, and TMG81WS (Tropical, Melhoramento e Genética, Londrina, PR, Brazil), DP1243B2RF and DP1536B2RF (Deltapine, Bayer, São Paulo, SP, Brazil), and IMA8405 (Instituto Matogrossense do Algodão, Cuiabá, MT, Brazil) were the dominant choices for cotton growers in this study, representing approximately 80% of all planted varieties.
Daily accumulated precipitations and extreme air temperatures (maximums and minimums) data were obtained for each plot by mean of the Brazilian agrometeorological monitoring system (web application) AGRITEMPO [36], which collects data from conventional and automatic weather stations, by taking the closest station to each plot and downloading the daily climate data from 0 to 180 days after sowing (DAS). The data were then averaged for month 1 (0–30 DAS) to 6 (150–180 DAS) and then correlated to the observed yield only to evaluate the effects of the climate variables on yield and to help explain possible bias between observed and predicted yields.

2.2. Satellite Data Acquisition and Preprocessing

With geographic co-ordinates of all plots, contour maps (kml format) were generated using the Google Earth Pro (Google, Mountain View, CA, USA) and then converted to shapefile format using the QGIS software [37]. Spectral reflectance data and VIs were obtained as time series from the EOS-MODIS MCD43A4 V6.1 product [38]. This product provides daily surface reflectance data (NBAR) adjusted for Nadir bidirectional reflectance distribution function (BRDF) effects. It offers smoothed data with a 500 m spatial resolution. The product uses 16-day compositions with the best pixel selection (maximum value composite) to create daily time series. The product includes spectral bands for blue (0.459–0.479 μm), green (0.545–0.565 μm), red (0.62–0.67 μm), near infrared (NIR) (0.841–0.876 μm), short-wave infrared (SWIR1) (1.23–1.25 μm), SWIR2 (1.628–1.656 μm), and SWIR3 (2.105–2.155 μm). Google Earth Engine (GEE) platform [39], and its JavaScript code editor, was used to extract these time series. The GEE code selects pixels within each plot contour area, excluding those within 250 m of the boundary using a buffer command. This step ensured focus on the core area of each plot. Finally, the code calculated and exported average reflectance values for each band across the entire growing season (from sowing to harvest). The reflectance image and the time series graph were visually inspected before exporting the representative time series for each plot in comma-separated values (csv).
Spectral bands were then combined in an Excel®, (Tokyo, Japan) worksheet and the VIs were calculated, averaged in 15-day intervals from 0 to 240 DAS, and correlated to cotton yield for the each mutually exclusive interval. Reflectance data are plotted in a logarithmic scale for better comparisons among the spectral bands. In Table 2 is shown a list of the VIs and their definitions. A subset of plots (167) was randomly selected (balanced by yield) and used for generating the regression models (training set). The remaining dataset (114 plots) was used for the independent model validation (test set), employing a well-known cross-validation method set as 60–40% (train–test). Linear regression were fitted for each combination of VI (Table 2) and time interval. Models’ accuracies were measured by the Root Mean Square Error (RMSE) and determination coefficient (R2) estimated from the test set. Linear regressions were performed, and the RMSE, R2, and statistical significance (Analysis of Variance—ANOVA, New Providence, NJ, USA) of the fittings were calculated using a spreadsheet program (Microsoft® Excel® 2019, version 2403).

3. Results and Discussion

Examples of MODIS images (NIR band) and time series of all spectral reflectance bands are shown in Figure 2 for a plot with very low cotton yield (1665 kg ha−1 in season 2016; Figure 2a, left) and high yield (4965 kg ha−1 in season 2017; Figure 2a, right). The original smoothed daily time series are shown in Figure 2b and 15-day averaged time series in Figure 2c. Reflectance data are plotted in a logarithmic scale for better comparisons among the spectral bands. In general, lower reflectance values (greater absorption of electromagnetic radiation) are observed for the blue, red, and green bands due to the absorption of these light wavelengths by leaves through photosynthesis. Higher reflectance in this range can be associated to plant stresses caused by biotic and/or abiotic factors. The spectral range between 0.4 and 0.7 μm (visible region) is the photosynthetically active radiation (PAR) that is the most efficient portion of the electromagnetic spectrum used by the plant for photosynthesis. Healthy plants present high light absorption in the visible range (0.4–0.7 μm) mainly by the leaf pigments like chlorophyll and carotenoids, relatively high reflectance in the NIR range (0.7–1.1 μm) due to leaf and cell structure scattering effects, and relatively low reflectance in the SWIR range (1.1–2.5 μm) due to water and chemicals into the leaf structure [48,49]. For an actual instance, the low cotton yield (left graph in Figure 2) was mainly caused by low precipitation (water stress) in the 2016 season [35], which is noted by slight increases in red and blue reflectance (lower absorbance by cotton plants), especially between 70 and 170 DAS, and decreasing NIR reflectance, as compared to the high-yield plot (right).
Figure 3a shows the average time series of the seven MODIS spectral bands for five classes of cotton yield (lower than 3, 3–3.75, 3.75–4.5, 4.5–5.25, and greater than 5.25 ton ha−1). As expected, the reflectance of NIR and SWIR1 bands increased with higher cotton yield. Conversely, reflectance in the red, blue, SWIR2, and SWIR3 decreases as yield increases, particularly from around 60–80 DAS until harvest. This trend is confirmed by the correlation coefficients (r) between cotton yield and reflectance displayed in Figure 3b. Notably, the green band reflectance, while higher than red and blue as expected, exhibits no clear visual relationship with yield in Figure 3a. Figure 3b highlights the strongest correlations between spectral reflectance bands and cotton yield. Positive correlations are observed for NIR and SWIR1 bands, while negative correlations are found for SWIR3, red, SWIR2, and blue bands. These findings, obtained using broadband satellite sensors, are consistent with studies employing narrow-band ground measurements with spectroradiometers. For example, Thenkabail et al. [50] reported a negative correlation between cotton biophysical parameters (leaf area index, wet biomass, plant height, and crop yield) for reflectance in the visible light range (350 nm to 700 nm) and positive correlations in the near-infrared to short-wave infrared range (730 nm to 1050 nm).
Cotton yield prediction for each 15-day interval during the growing season was attempted using simple linear regression models. The nine vegetation indices listed in Table 2 were used as independent variables and the training dataset (167 plots) was employed for model fitting. Figure 4b displays the determination coefficients (R2) achieved for these VIs across DAS intervals. The highest R2 values were obtained for the TVI (0.69), EVI (0.68), SAVI (0.67), and NDVI (0.61) during intervals between 120 and 165 DAS. This period corresponds to the cotton late-season stage, encompassing boll opening and defoliation. Figure 4a presents the average time series of these VIs for different cotton yield classes. A general trend of saturation was observed for all VIs at the peak in the time series for the highest yield classes. This effect was most pronounced for NDVI, which is consistent with previous studies reporting NDVI saturation at high levels of green biomass or leaf area index [51,52].
The results suggest that TVI, EVI, and SAVI hold promise for in-season cotton yield estimation within the training dataset, as they yielded the highest determination coefficients for yield prediction. Table 3 presents the linear regression models for the four VIs with the highest R2 and statistically significant overall fit (analysis of variance, ANOVA, p-value < 0.05) of the adjusted models for different 15-day intervals, ranging from 75–90 to 180–195 DAS. The table also includes models for the average VI from 75 DAS to harvest (75–195 DAS) and the peak of the observed VI values throughout the season. Figure 5 illustrates the relationships between cotton yield and TVI for different DAS intervals. While the highest R2 (0.73) was achieved for the 75–195 DAS interval for TVI (Figure 5) and the other VIs (Table 3), these equations would only be practical at the very end of the season since they require data accumulation until 195 DAS for averaging. For in-season forecasting, models derived from VI data between 105–120 DAS and 165–180 DAS yielded the most promising results. However, intervals like 150–165 and 165–180 DAS are too close to harvest and may not provide sufficient lead time for farmers to implement management interventions effectively. Peak VI values have been explored in various studies for in-season yield estimation [14,53]. In this study, the peak models exhibited lower R2 values compared to those obtained between 105–120 and 165–180 DAS (Table 3) but remained higher than those for 90–105 DAS interval. This suggests potential for earlier in-season forecasts considering that peak VI values for TVI, EVI, SAVI, and NDVI typically occur around 80 to 100 DAS, as seen in Figure 4.
Model estimates for cotton yields using TVI equations (Table 3) are compared to observed yields alongside their respective RMSE in Figure 6. Figure 7 illustrates variations in RMSE (Figure 7a) and R2 (Figure 7b) across different DAS intervals for the four best VIs. The lowest RMSE and highest R2 were achieved by EVI and TVI between 90–105 and 135–150 DAS (four DAS intervals), enabling in-season yield forecasting with RMSE of approximately 750 kg ha−1 (Figure 7a). Averaged VIs for the period 75–195 DAS and peak models displayed comparable RMSE values to the best models (EVI, TVI, and SAVI) for 15-day intervals (90–105 to 135–150 DAS) (Figure 7c). However, for EVI, the peak model exhibited the lowest RMSE (726 kg ha−1). Figure 7d presents the average yield predictions for the 114 testing plots at 105–120 DAS, 75–195 DAS, and peaks. The average observed yield for the 114 plots stood at 4372 kg ha−1 (represented by the dotted horizontal line in Figure 7d), with most linear models tending to overestimate yield by approximately 160 kg ha−1 on average. Overall, there were tendencies of overestimation for lower-yield plots and underestimation for higher-yield plots, as evidenced in Figure 6 for TVI. However, the 75–195 DAS model appears to reduce such bias. Although NDVI provided a closer average yield forecast for the 114 plots compared to the measured average yield (Figure 7d), its RMSE was significantly higher than the other VIs (EVI, TVI, and SAVI) for all intervals between 75 and 165 DAS. Therefore, the best performances were achieved with EVI and TVI for the 105–120, 120–35, and 135–150 DAS intervals. If an earlier prediction is required, the 90–105 DAS or peak models are preferred, as the RMSE for 75–90 DAS and earlier DAS intervals were excessively high, and the R2 values were insignificant.
The observed trend of overestimation for low-yield plots and underestimation for high yields (Figure 6) can be further examined by correlating the residual error (yieldestimated − yieldobserved; estimated by the peak model with TVI) for the observed yield of each plot (Figure 8a). In this analysis, negative residual errors (indicating underestimation) are evident for yields higher than approximately 4000 kg ha−1, while positive residual errors (indicating overestimation) are observed for lower yield values. Several factors may contribute and explain the variation in low and high yield plots, such as climate conditions, soil type, soil fertility, diseases, cultivars, and others. However, in rainfed cotton production, one of the primary driven factors affecting yield has consistently been climatic condition, particularly variations in precipitation and temperature throughout the season. Consequently, the choice of sowing date and cycle duration, determined based on the recommended cultivar and local climatic patterns, is influenced by water demands and temperature fluctuations during each stage of cotton development, ultimately impacting the expected cotton yield. For the testing dataset, the earliest sowing date was 19 November. Therefore, observed yield was correlated with sowing dates after 1 November as a proxy for sowing data, independent of the year (Figure 8b), revealing a negative influence on yield. Additionally, the cotton cycle duration, primarily dictated by the cultivar used, exhibited a positive correlation with yield (Figure 8c). These two parameters are intrinsically linked to climatic conditions and both significantly influence observed yields, with a discernible trend of increasing yield associated with earlier sowing dates and longer cotton cycle durations.
The most significant correlations between monthly accumulated precipitation and yield were observed in the third and fourth months after sowing (r = 0.59 for 60–90 DAS and 0.39 for 90–120 DAS; Figure 8d), corresponding to the mid-season of cotton growth, from canopy closure until flowering and boll development stages. In the first month, rainfall demonstrated an adverse effect on yield (r = −0.29), as high-intensity rainfall during the early season can potentially damage the germination process. Monthly averaged maximum and minimum temperatures, along with their differences (Tmax − Tmin) also exerted an influence on yield, particularly after the third month (r = −0.49).
Several studies, such as those listed in Table 1, have investigated the optimal timing for acquiring satellite and UAV images or ground measurements for cotton yield predictions. Some studies have relied on single-date images, while others have assessed images at specific phenological stages or analyzed complete time series data. Table 4 provides a summary of the optimal periods, expressed in DAS, as inferred from the literature cited in Table 1. The most favorable correlations between cotton yield and the VIs have been reported across various stages, ranging from the flowering and boll development period [12,18,22,26,27,28,29,30,31] to boll opening, maturation, and defoliant application [10,15,16,17,19,21,27,30]. Notably, these findings align with the results obtained in the present study. Specifically, the study conducted by Lang et al. [10] in Xinjiang Province, China, spanning from 2012 to 2019 and encompassing 355 plots, yielded results strikingly similar to ours, with the lowest RMSE and highest R2 observed during the fourth and fifth months after sowing (90–120 to 120–150 DAS).
The most effective forecasting models for the mid to late-season period (90 to 150 DAS) on a regional scale can serve as valuable advance information for various stakeholders, including commodity traders, policymakers, governments, and, in some instances, farmers. This information can aid farmers in logistical planning and preparing for the next crop season, such as determining fertilizer needs based on the nutrient exportation from the previous crop. However, for certain in-season interventions like topdressing fertilizer applications, some forecasting models may not be practical. This is because most topdressing fertilizers (both macro and micronutrients) are typically applied before 100 DAS. Nonetheless, earlier prediction using peak models still offers a window of opportunity for topdressing fertilizer intervention. For instance, by around 80 DAS, cotton plants have absorbed 45%, 50%, and 80% of sulfur, potassium, and nitrogen, respectively [54].

4. Conclusions

The forecasting methodology, employing simple regression models and time series intervals for cotton yield prediction using the MODIS product MCD43A4 V6.1 allowed in-season prediction with an RMSE of approximately 750 kg ha−1 at a regional scale across three cotton-producing states in the Brazilian Cerrado. This straightforward approach offers ease of application and can be readily utilized for predicting seed cotton yield at farm, region, and national levels. One limitation of the approach is the coarse spatial resolution of the MODIS product, which restricts its applicability primarily to large commercial plots, characteristic of crop production systems in the Brazilian Cerrado.
Among the nine VIs evaluated, EVI and TVI emerged as the most effective individual predictors. However, it is important to note that accuracies, as assessed by mean of RMSEs, were notably low up to 75 DAS, presenting a limitation to the application of this approach for estimating cotton yield during the initial stages of cotton development. The most reliable in-season predictions were achieved through 15-day intervals ranging from 90–105 to 135–150 DAS, corresponding to the mid to late stages of cotton development (including boll development, open boll, and fiber maturation). For earlier stages (below 90–105 DAS), the most accurate forecasts were obtained from the model fitted for peaks using EVI and TVI, typically occurring around 80–90 DAS.
Future validation experiments should focus on assessing the accuracies and practical utility of various models in upcoming cotton seasons across different sub-regions. This evaluation should include their ability to predict cotton net production at various scales, ranging from individual farms to states, regions, and even at the national level. When considering the applicability of models at the farm level for specific management interventions, such as within-season topdressing fertilizer application or planning for the next crop based on exported nutrients, caution is warranted. Additional validations or complimentary approaches may be necessary to ensure the reliability and effectiveness of these models in practical agricultural decision making.
The proposed approach can be extended to corn, which is also commonly grown as a second harvest in the Brazilian Cerrado. Additionally, it could be adapted for soybean, the primary first crop cultivated in Brazil, despite facing challenges such as increased cloud interference in satellite images due to its cultivation during the rainy season.

Author Contributions

Conceptualization, C.M.P.V. and F.S.d.S.; Data curation, R.G., J.L.B., M.d.S., F.J.P. and S.d.C.; Formal analysis, D.A.B.d.S., C.M.P.V. and F.S.d.S.; Investigation, D.A.B.d.S., C.M.P.V. and F.S.d.S.; Methodology, D.A.B.d.S., C.M.P.V., F.S.d.S., E.J.F. and J.C.F.; Resources, R.G., J.L.B., M.d.S., F.J.P. and S.d.C.; Software, F.S.d.S. and E.A.S.; Supervision, C.M.P.V.; Validation, D.A.B.d.S. and C.M.P.V.; Writing—original draft, D.A.B.d.S. and C.M.P.V.; Writing—review and editing, C.M.P.V., E.J.F., E.A.S., J.C.F., R.G., J.L.B., M.d.S. and F.J.P. All authors have read and agreed to the published version of the manuscript.

Funding

The authors acknowledge the financial support from the Brazilian Agricultural Research Corporation (EMBRAPA) and the Mato Grosso Cotton Institute (IMAmt) through a joint co-operation project (30.21.90.03) and thank the farmers in Mato Grosso, Bahia, and Goiás who voluntarily provided access to their commercial plots data.

Data Availability Statement

The data will be made available upon request to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. ABRAPA. Safra Brasil. 2024. Available online: https://abrapa.com.br/dados/ (accessed on 16 February 2024).
  2. Eiten, G. Cerrado vegetation: Vegetation and climate in Brazil. In Cerrado: Characterization, Occupation and Perspectives, 2nd ed.; Pinto, M.N., Ed.; UNB: Brasília, Brazil, 1993; pp. 17–73. (In Portuguese) [Google Scholar]
  3. COTTON BRAZIL. Market Report, 19 January 2024. 2024. Available online: https://cottonbrazil.com/download/abrapa-cotton-brazil-report-january-2024/ (accessed on 16 February 2024).
  4. Echer, F.R.; Mello, P.R.; Rosolem, C.A. Management of plant growth regulators. In Best Management Practices for Cotton in Mato Grosso State, 4th ed.; Belot, J.L., Vilela, P.M.C.A., Eds.; IMAmt-AMPA: Cuiabá, Brazil, 2020; pp. 312–319. (In Portuguese) [Google Scholar]
  5. CONAB. Companhia Nacional de Abastecimento. 2024. Available online: https://www.conab.gov.br/info-agro/safras/serie-historica-das-safras/itemlist/category/898-algodao (accessed on 16 February 2024).
  6. Basso, B.; Liu, L. Seasonal crop yield forecast: Methods, applications, and accuracies. Adv. Agron. 2019, 154, 201–255. [Google Scholar]
  7. Li, F.; Miao, Y.; Chen, X.; Sun, Z.; Stueve, K.; Yuan, F. In-season prediction of corn grain yield through PlanetScope and Sentinel-2 images. Agronomy 2022, 12, 3176. [Google Scholar] [CrossRef]
  8. Khanal, S.; Kushal, K.C.; Fulton, J.P.; Shearer, S.; Ozkan, E. Remote sensing in agriculture—Accomplishments, limitations, and opportunities. Remote Sens. 2020, 12, 3783. [Google Scholar] [CrossRef]
  9. Taskiner, T.; Bilgen, B. Optimization models for harvest and production planning in agri-food supply chain: A systematic review. Logistics 2021, 5, 52. [Google Scholar] [CrossRef]
  10. Lang, P.; Zhang, L.; Huang, C.; Chen, J.; Kang, X.; Zhang, Z.; Tong, Q. Integrating environmental and satellite data to estimate county-level cotton yield in Xinjiang Province. Front. Plant Sci. 2023, 13, 1048479. [Google Scholar] [CrossRef] [PubMed]
  11. Prasad, N.R.; Patel, N.R.; Danodia, A. Crop yield prediction in cotton for regional level using Random Forest approach. Spat. Inf. Res. 2020, 29, 195–206. [Google Scholar] [CrossRef]
  12. Filippi, P.; Whelan, M.B.; Vervoort, R.W.; Bishop, T.F.A. Mid-season empirical cotton yield forecasts at fine resolutions using large yield mapping datasets and diverse spatial covariates. Agric. Syst. 2020, 184, 102894. [Google Scholar] [CrossRef]
  13. Meng, L.; Liu, H.; Ustin, S.L.; Zhang, X. Assessment of FSDAF accuracy on cotton yield estimation using different Modis products and Landsat based on the mixed degree index with different surroundings. Sensors 2021, 21, 5184. [Google Scholar] [CrossRef] [PubMed]
  14. Johnson, D.M.; Rosales, A.; Mueller, R.; Reynolds, C.; Frantz, R.; Anyamba, A.; Pak, E.; Tucker, C. USA crop yield estimation with MODIS NDVI: Are remotely sensed models better than simple trend analyses? Remote Sens. 2021, 13, 4227. [Google Scholar] [CrossRef]
  15. Johnson, D.M. A comprehensive assessment of the correlations between field crop yields and commonly used MODIS products. Int. J. Appl. Earth Obs. Geoinf. 2016, 52, 65–81. [Google Scholar] [CrossRef]
  16. Iqbal, J.; Read, J.; Whisler, D. Using remote sensing and soil physical properties for predicting the spatial distribution of cotton lint yield. Turk. J. Field Crops 2013, 18, 158–165. [Google Scholar]
  17. Feng, A.; Zhou, J.; Vories, E.D.; Sudduth, K.A.; Zhang, M. Yield estimation in cotton using UAV-based multi-sensor imagery. Biosyst. Eng. 2020, 193, 101–114. [Google Scholar] [CrossRef]
  18. Feng, A.; Zhang, M.; Sudduth, K.A.; Vories, E.D.; Zhou, J. Cotton yield estimation from UAV-based plant height. Trans. ASABE 2019, 62, 393–403. [Google Scholar] [CrossRef]
  19. Ballester, C.; Hornbuckle, J.; Brinkhoff, J.; Smith, J.; Quayle, W. Assessment of in-season cotton nitrogen status and lint yield prediction from Unmanned Aerial System imagery. Remote Sens. 2017, 9, 1149. [Google Scholar] [CrossRef]
  20. Huang, Y.; Brand, H.J.; Sui, R.; Thomson, S.J.; Furukawa, T.; Ebelhar, M.W. Cotton yield estimation using very high-resolution digital images acquired with a low-cost Small Unmanned Aerial Vehicle. Trans. ASABE 2016, 59, 1563–1574. [Google Scholar]
  21. Huang, Y.; Sui, R.; Thomson, J.S.; Fisher, D.K. Estimation of cotton yield with varied irrigation and nitrogen treatments using aerial multispectral imagery. Int. J. Agric. Biol. Eng. 2013, 6, 37–41. [Google Scholar]
  22. Zhao, D.; Reddy, K.R.; Kakani, V.G.; Read, J.J.; Koti, S. Canopy reflectance in cotton for growth assessment and lint yield prediction. Eur. J. Agron. 2007, 26, 335–344. [Google Scholar] [CrossRef]
  23. He, Y.; Qiu, B.; Cheng, F.; Chen, C.; Sun, Y.; Zhang, D.; Lin, L.; Xu, A. National scale maize yield Estimation by integrating multiple spectral indexes and temporal aggregation. Remote Sens. 2023, 15, 414. [Google Scholar] [CrossRef]
  24. Fu, Y.; Huang, J.; Shen, Y.; Liu, S.; Huang, Y.; Dong, J.; Han, W.; Ye, T.; Zhao, W.; Yuan, W. A satellite-based method for national winter wheat yield estimating in China. Remote Sens. 2021, 13, 4680. [Google Scholar] [CrossRef]
  25. Jeong, S.; Shin, T.; Ban, J.; Ko, K.J. Simulation of spatiotemporal variations in cotton lint yield in the Texas high plains. Remote Sens. 2022, 14, 1421. [Google Scholar] [CrossRef]
  26. Ashapure, A.; Jung, J.; Chang, A.; Oh, S.; Yeom, J.; Maeda, M.; Maeda, A.; Dube, N.; Landivar, J.; Hague, S.; et al. Developing a machine learning based cotton yield estimation framework using multi-temporal UAS data. ISPRS J. Photogramm. Remote Sens. 2020, 169, 180–194. [Google Scholar] [CrossRef]
  27. Meng, L.; Liu, H.; Zhang, X.; Ren, C.; Ustin, S.; Qiu, Z.; Xu, M.; Guo, D. Assessment of the effectiveness of spatiotemporal fusion of multi-source satellite images for cotton yield estimation. Comput. Electron. Agric. 2019, 162, 44–52. [Google Scholar] [CrossRef]
  28. He, L.; Mostovoy, G. Cotton yield estimate using Sentinel-2 data and an ecosystem model over the southern US. Remote Sens. 2019, 11, 2000. [Google Scholar] [CrossRef]
  29. Baio, F.H.R.; da Silva, E.E.; Martins, P.H.A.; da Silva-Júnior, C.A.; Teodoro, P.E. In situ remote sensing as a strategy to predict cotton seed yield. J. Biosci. 2019, 35, 1847–1854. [Google Scholar] [CrossRef]
  30. Haghverdi, A.; Washington-Allen, R.A.; Leib, B.G. Prediction of cotton lint yield from phenology of crop indices using artificial neural networks. Comput. Electron. Agric. 2018, 152, 186–197. [Google Scholar] [CrossRef]
  31. Meng, L.; Zhang, X.; Huanjun, L.; Guo, D.; Yan, Y.; Qin, L.; Pan, Y. Estimation of cotton yield using the reconstructed time-series vegetation index of Landsat data. Can. J. Remote Sens. 2017, 43, 244–255. [Google Scholar] [CrossRef]
  32. Galbieri, R.; Vaz, C.M.P.; Pessatto-Filho, D.; Crestana, S.; Chitarra, L.G.; Lobo-Junior, M.; Lanças, F.M.; Silva, J.F.V.; Faleiro, V.O.; Sarques, B. Cotton Production in Goiás: White Mold, Fusarium Wilt, Cultivation System and Soil Physical Attributes; Circular Técnica; AGOPA: Goiânia, Brazil, 2018; 20p. (In Portuguese) [Google Scholar]
  33. Galbieri, R.; Vaz, C.M.P.; Silva, J.F.V.; Amus, G.L.A.; Crestana, S.; Matos, E.S.; Magalhaes, C.A.S. Influence of soil parameters on the occurrence of phytonematodes. In Phytoparasitic Nematodes of Cotton in Brazilian Cerrados: Biology and Control Measures, 1st ed.; Galbieri, R., Belot, J.L., Eds.; IMAmt: Cuiabá, Brazil, 2016; pp. 37–90. (In Portuguese) [Google Scholar]
  34. Perina, F.J.; Bogiani, J.C.; Ribeiro, G.C.; Breda, C.E.; Fabris, A.; dos Santos, I.A.; Seibel, D.P. Survey and Management of Phytonematodes in Cotton in Western Bahia, Results for Season 2016/17; Circular Técnica; Fundação: Bahia, Brazil, 2017; 8p. [Google Scholar]
  35. INMET—Instituto Nacional de Meteorologia. Boletim de Monitoramento Agrícola—Culturas de Verão 21/22, 11(5):1-19. 2022. Available online: https://portal.inmet.gov.br/ (accessed on 16 February 2024).
  36. AGRITEMPO. Agrometeorological Monitoring System. 2024. Available online: http://www.agritempo.gov.br/agritempo/sobre.jsp?lang=en (accessed on 16 February 2024).
  37. QGIS Development Team. QGIS Geographic Information System. Open Source Geospatial Foundation Project. 2022. Available online: http://qgis.osgeo.org (accessed on 16 February 2024).
  38. MODIS. Moderate Resolution Imaging Spectroradiometer. 2024. Available online: https://modis.gsfc.nasa.gov/ (accessed on 21 March 2024).
  39. GEE—Google Earth Engine. 2024. Available online: https://earthengine.google.com/platform/ (accessed on 21 March 2024).
  40. Gitelson, A.A.; Gritz, Y.; Merzlyak, M.N. Relationships between leaf chlorophyll content and spectral reflectance and algorithms for non-destructive chlorophyll assessment in higher plant leaves. J. Plant Physiol. 2003, 160, 271–282. [Google Scholar] [CrossRef] [PubMed]
  41. Jordan, C.F. Derivation of leaf-area index from quality of light on the forest floor. Ecology 1969, 50, 663–666. [Google Scholar] [CrossRef]
  42. Vincini, M.; Frazzi, E.; D’Alessio, P. A broad-band leaf chlorophyll vegetation index at the canopy scale. Precis. Agric. 2008, 9, 303–319. [Google Scholar] [CrossRef]
  43. Huete, A.R. A soil-adjusted vegetation index (SAVI). Remote Sens. Environ. 1988, 25, 295–309. [Google Scholar] [CrossRef]
  44. Broge, N.H.; Leblanc, E. Comparing prediction power and stability of broadband and hyperspectral vegetation indices for estimation of green leaf area index and canopy chlorophyll density. Remote Sens. Environ. 2001, 76, 156–172. [Google Scholar] [CrossRef]
  45. Kaufman, Y.J.; Merzlyak, M.N. Use of a green channel in remote sensing of global vegetation from EOS-MODIS. Remote Sens. Environ. 1996, 58, 289–298. [Google Scholar]
  46. Huete, A.; Didan, K.; Miura, T.; Rodriguez, E.P.; Gao, X.; Ferreira, L.G. Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sens. Environ. 2002, 83, 195–213. [Google Scholar] [CrossRef]
  47. Rouse, J.W. Monitoring vegetation systems in the great plains with ERTS. In Proceedings of the Third Earth Resources Technology Satellite-1 Symposium—Volume I: Technical Presentations; Goddard Space Flight Center NASA SP-351; NASA: Washington, DC, USA, 1974; Paper A-20; pp. 309–317. [Google Scholar]
  48. Chattopadhyay, N.; Shukla, K.K.; Birah, A.; Khokhar, M.K.; Kanojia, A.K.; Nigam, R.; Roy, A.; Bhattacharya, B.K. Identification of spectral bands to discriminate wheat Spot Blotch using in situ hyperspectral data. J. Indian Soc. Remote Sens. 2023, 51, 917–934. [Google Scholar] [CrossRef]
  49. Knipling, E.B. Physical and physiological basis for the reflectance of visible and near-infrared radiation from vegetation. Remote Sens. Environ. 1970, 1, 155–159. [Google Scholar] [CrossRef]
  50. Thenkabail, P.S.; Smith, R.B.; De Pauw, E. Hyperspectral vegetation indices and their relationships with agricultural crop characteristics. Remote Sens. Environ. 2000, 71, 158–182. [Google Scholar] [CrossRef]
  51. Tesfaye, A.A.; Awoke, B.G. Evaluation of the saturation property of vegetation indices derived from sentinel-2 in mixed crop-forest ecosystem. Spat. Inf. Res. 2020, 1, 109–121. [Google Scholar]
  52. Xing, N.; Huang, W.; Xie, Q.; Shi, Y.; Ye, H.; Dong, Y.; Wu, M.; Sun, G.; Jiao, Q. A transformed triangular vegetation index for estimating winter wheat leaf area index. Remote Sens. 2020, 12, 16. [Google Scholar] [CrossRef]
  53. Shammi, S.A.; Meng, Q. Use time series NDVI and EVI to develop dynamic crop growth metrics for yield modeling. Ecol. Indic. 2021, 121, 107124. [Google Scholar] [CrossRef]
  54. Carvalho, M.C.S.; Ferreira, G.B. Cotton Liming and Fertilization in the Cerrado; Circular Técnica 92; Embrapa: Campina Grande, Brazil, 2006; 16p. (In Portuguese) [Google Scholar]
Figure 1. Spatial location of the 281 plots cultivated with cotton in the states of Mato Grosso (MT), Goiás (GO), and Bahia (BA) (typical Brazilian Cerrado biome) used to train (yellow dots) and test the models (blue dots) and three cotton plots zoomed (Plots 1, 2, and 3).
Figure 1. Spatial location of the 281 plots cultivated with cotton in the states of Mato Grosso (MT), Goiás (GO), and Bahia (BA) (typical Brazilian Cerrado biome) used to train (yellow dots) and test the models (blue dots) and three cotton plots zoomed (Plots 1, 2, and 3).
Agriengineering 06 00054 g001
Figure 2. MODIS product MCD43A4 V6.1: (a) NIR images obtained for two commercial cotton production plots; (b) original extracted time series (daily data) for red, NIR, blue, green, and SWIR spectral bands; (c) time series in 15-day interval averages. Plot on left presented low cotton yield on right high cotton yield.
Figure 2. MODIS product MCD43A4 V6.1: (a) NIR images obtained for two commercial cotton production plots; (b) original extracted time series (daily data) for red, NIR, blue, green, and SWIR spectral bands; (c) time series in 15-day interval averages. Plot on left presented low cotton yield on right high cotton yield.
Agriengineering 06 00054 g002
Figure 3. Average time series of reflectance for seven MODIS spectral bands for five cotton yield classes (a) and the linear correlation coefficients (r) between cotton yield and reflectance for the different 15-day days after sowing (DAS) intervals in the training dataset (167 plots) (b).
Figure 3. Average time series of reflectance for seven MODIS spectral bands for five cotton yield classes (a) and the linear correlation coefficients (r) between cotton yield and reflectance for the different 15-day days after sowing (DAS) intervals in the training dataset (167 plots) (b).
Agriengineering 06 00054 g003
Figure 4. Time series of NDVI, EVI, SAVI, and TVI from MODIS sensor for five cotton yield classes (a) and determination coefficients (R2) of linear correlations between cotton yield and 9 vegetation indices (Table 2) (b) for the different 15-day days after sowing (DAS) classes for the training dataset (167 plots).
Figure 4. Time series of NDVI, EVI, SAVI, and TVI from MODIS sensor for five cotton yield classes (a) and determination coefficients (R2) of linear correlations between cotton yield and 9 vegetation indices (Table 2) (b) for the different 15-day days after sowing (DAS) classes for the training dataset (167 plots).
Agriengineering 06 00054 g004
Figure 5. Linear correlations between TVI and seed cotton for different 15-day DAS for the training dataset, from 75–90 to 180–195 DAS, 75–195 DAS (averaged period) and the time series peak value. Linear equations shown in Table 3.
Figure 5. Linear correlations between TVI and seed cotton for different 15-day DAS for the training dataset, from 75–90 to 180–195 DAS, 75–195 DAS (averaged period) and the time series peak value. Linear equations shown in Table 3.
Agriengineering 06 00054 g005
Figure 6. Comparisons between observed and predicted seed cotton yield for the testing dataset (114 plots) for different 15-day DAS (75–90 to 180–195 DAS), 75–195 DAS (averaged period), the time series peak value, and their respective root mean square errors (RMSE), obtained using TVI linear regression models (Table 3).
Figure 6. Comparisons between observed and predicted seed cotton yield for the testing dataset (114 plots) for different 15-day DAS (75–90 to 180–195 DAS), 75–195 DAS (averaged period), the time series peak value, and their respective root mean square errors (RMSE), obtained using TVI linear regression models (Table 3).
Agriengineering 06 00054 g006
Figure 7. Root mean square error (RMSE) (a) and linear determination coefficients (R2) (b) between predicted and observed yield for the 114 plots of the testing dataset for different 15-day DAS; comparison of RMSE (c) and predicted yield (d) for 105–120 DAS, 75–195 DAS, and peak equations for TVI, EVI, SAVI, and NDVI. Dotted horizontal line in (d) represents the average observed yield for the 114 plots.
Figure 7. Root mean square error (RMSE) (a) and linear determination coefficients (R2) (b) between predicted and observed yield for the 114 plots of the testing dataset for different 15-day DAS; comparison of RMSE (c) and predicted yield (d) for 105–120 DAS, 75–195 DAS, and peak equations for TVI, EVI, SAVI, and NDVI. Dotted horizontal line in (d) represents the average observed yield for the 114 plots.
Agriengineering 06 00054 g007
Figure 8. Relationships between the residual error (Yieldestimated − Yieldobserved; estimated by TVI peak model) and observed cotton yield (a); observed yield and sawing day from 1 November (b); observed yield and cotton cycle (c); and correlation coefficients (r) between observed yield and monthly accumulated precipitation, monthly averaged minimum (Tmin) and maximum (Tmax) temperatures, and Tmax − Tmin (d) for month 1 to 6 after sowing.
Figure 8. Relationships between the residual error (Yieldestimated − Yieldobserved; estimated by TVI peak model) and observed cotton yield (a); observed yield and sawing day from 1 November (b); observed yield and cotton cycle (c); and correlation coefficients (r) between observed yield and monthly accumulated precipitation, monthly averaged minimum (Tmin) and maximum (Tmax) temperatures, and Tmax − Tmin (d) for month 1 to 6 after sowing.
Agriengineering 06 00054 g008
Table 2. Vegetation indices used in this study.
Table 2. Vegetation indices used in this study.
Vegetation IndicesFormulationReference
Green Index (GI)GI = G/R[40]
Ratio Vegetation Index (RVI)RVI = NIR/R[41]
Chlorophyll Vegetation Index (CVI)CVI = (NIR × R)/G2[42]
Soil-Adjusted Vegetation Index (SAVI)SAVI = 1.5 × (NIR − R)/(0.5 × NIR + R)[43]
Chlorophyll index—green (CIG)CIG = (NIR/G) − 1[40]
Triangular Chlorophyll Absorption Ratio Index (TVI)TVI = 60 × NIR − G − 100 × (R – G)[44]
Green NDVI (GNDVI)GNDVI = (NIR − G)/(NIR + G)[45]
Enhanced Vegetation Index (EVI)EVI = 2.5 × (NIR − R)/(1 + NIR + 6 × R − 7.5 × B)[46]
Normalized Differential Vegetation Index (NDVI)NDVI = (NIR − R)/(NIR + R)[47]
G: green, R: red, NIR: near infrared, B: blue.
Table 3. Linear regression models for predicting cotton yield from averaged 15-day TVI, EVI, SAVI, and NDVI vegetation indices, averaged 75–195 DAS and peak (maximum value in the time series), using the training dataset.
Table 3. Linear regression models for predicting cotton yield from averaged 15-day TVI, EVI, SAVI, and NDVI vegetation indices, averaged 75–195 DAS and peak (maximum value in the time series), using the training dataset.
DASLinear Model (TVI)R2RMSEp-ValueDASLinear Model (EVI)R2RMSEp-Value
kg ha−1(α = 0.05)kg ha−1(α = 0.05)
75–90Y = 111.96 TVI + 794.220.3110883.3 × 10–1575–90Y = 5133.1 EVI + 315.90.2811194.1 × 10–13
90–105Y = 139.78 TVI − 90.9180.479539.5 × 10−2590–105Y = 6863.1 EVI − 1066.80.469669.0 × 10−24
105–120Y = 152.11 TVI − 269.520.588561.7 × 10−32105–120Y = 7316.1 EVI − 1249.50.558791.4 × 10−30
120–135Y = 152.87TVI + 166.220.677527.6 × 10−42120–135Y = 7013.0 EVI − 608.50.657782.3 × 10−39
135–150Y = 148.0 TVI + 878.890.697351.8 × 10−43135–150Y = 6484.9 EVI + 292.10.687782.7 × 10−42
150–165Y = 147.88 TVI + 1553.70.647836.5 × 10−39150–165Y = 6170.2 EVI + 1127.90.657761.5 × 10−39
165–180Y = 162.65 TVI + 1957.80.598431.4 × 10−33165–180Y = 6456.5 EVI + 1636.50.598421.1 × 10−33
180–195Y = 195.06 TVI + 2130.50.499411.1 × 10−25180–195Y = 7602.2 EVI + 1801.60.489473.2 × 10−25
75–195Y = 196.28 TVI − 189.770.726905.9 × 10−4875–195Y = 8891.9 EVI − 1026.20.736883.9 × 10−48
PeakY = 150.52 TVI − 634.870.539007.4 × 10−29PeakY = 7813.9 EVI − 1996.10.539018.4 × 10−29
DASLinear Model (SAVI)R2RMSEp-valueDASLinear Model (NDVI)R2RMSEp-value
kg ha−1(α = 0.05)kg ha−1(α = 0.05)
75–90Y = 6804.1 SAVI − 543.30.2511408.2 × 10−1275–90Y = 6111.6 NDVI − 1057.40.1212324.5 × 10−6
90–105Y = 9495.4 SAVI − 2479.40.459775.8 × 10−2390–105Y = 11,059 NDVI − 5334.60.3110894.3 × 10−15
105–120Y = 10,008 SAVI − 26960.5511326.1 × 10−30105–120Y = 11,093 NDVI − 5287.10.3510583.3 × 10−17
120–135Y = 9164.3 SAVI − 1712.50.647848.2 × 10−39120–135Y = 9500.8 NDVI − 3674.20.469668.7 × 10−24
135–150Y = 7937.6 SAVI − 415.880.677539.7 × 10−42135–150Y = 7473.6 NDVI − 16180.558843.6 × 10−30
150–165Y = 7729.7 SAVI + 624.530.668095.6 × 10−40150–165Y = 6243.6 NDVI − 80.360.618166.0 × 10−36
165–180Y = 7244.4 SAVI + 1251.70.598407.9 × 10−34165–180Y = 5550.8 NDVI + 10020.578614.8 × 10−32
180–195Y = 8316.9 SAVI + 1422.30.479527.5 × 10−25180–195Y = 5922.8 NDVI + 1386.40.469701.6 × 10−23
75–195Y = 11,152 SAVI − 2066.50.736881.9 × 10−4875–195Y = 11,230 NDVI + 4003.90.677528.1 × 10−42
PeakY = 11,044.7 SAVI − 37700.519192.3 × 10−27Peak Y = 15,470 NDVI − 9131.60.3910293.2 × 10−19
Y: seed cotton yield (kg ha−1); Peak: maximum value of the VI in the time series; p-value determined by ANOVA.
Table 4. Optimal periods for remote sensing (RS) image or data acquisition, expressed in days after sowing (DAS), considering best correlations between yield and VIs from different studies.
Table 4. Optimal periods for remote sensing (RS) image or data acquisition, expressed in days after sowing (DAS), considering best correlations between yield and VIs from different studies.
ReferenceRSDASReferenceRSDAS
[19]UAV150–170[27]Satellite100
[30]Satellite90–160[26]UAV90–100
[10]Satellite90–150[12]Satellite90 *
[15]Satellite120[31]Satellite80–90
[21]Airborne115 *[18]UAV80 *
[17]UAV90–120[29]Active Sensor60–100
[16]Satellite105[22]Spectroradiometer75
* single date applied.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

de Siqueira, D.A.B.; Vaz, C.M.P.; da Silva, F.S.; Ferreira, E.J.; Speranza, E.A.; Franchini, J.C.; Galbieri, R.; Belot, J.L.; de Souza, M.; Perina, F.J.; et al. Estimating Cotton Yield in the Brazilian Cerrado Using Linear Regression Models from MODIS Vegetation Index Time Series. AgriEngineering 2024, 6, 947-961. https://doi.org/10.3390/agriengineering6020054

AMA Style

de Siqueira DAB, Vaz CMP, da Silva FS, Ferreira EJ, Speranza EA, Franchini JC, Galbieri R, Belot JL, de Souza M, Perina FJ, et al. Estimating Cotton Yield in the Brazilian Cerrado Using Linear Regression Models from MODIS Vegetation Index Time Series. AgriEngineering. 2024; 6(2):947-961. https://doi.org/10.3390/agriengineering6020054

Chicago/Turabian Style

de Siqueira, Daniel A. B., Carlos M. P. Vaz, Flávio S. da Silva, Ednaldo J. Ferreira, Eduardo A. Speranza, Júlio C. Franchini, Rafael Galbieri, Jean L. Belot, Márcio de Souza, Fabiano J. Perina, and et al. 2024. "Estimating Cotton Yield in the Brazilian Cerrado Using Linear Regression Models from MODIS Vegetation Index Time Series" AgriEngineering 6, no. 2: 947-961. https://doi.org/10.3390/agriengineering6020054

APA Style

de Siqueira, D. A. B., Vaz, C. M. P., da Silva, F. S., Ferreira, E. J., Speranza, E. A., Franchini, J. C., Galbieri, R., Belot, J. L., de Souza, M., Perina, F. J., & das Chagas, S. (2024). Estimating Cotton Yield in the Brazilian Cerrado Using Linear Regression Models from MODIS Vegetation Index Time Series. AgriEngineering, 6(2), 947-961. https://doi.org/10.3390/agriengineering6020054

Article Metrics

Back to TopTop