Next Article in Journal
Differential Response of the Proteins Involved in Amino Acid Metabolism in Two Saccharomyces cerevisiae Strains during the Second Fermentation in a Sealed Bottle
Next Article in Special Issue
Remote and Proximal Sensing Techniques for Site-Specific Irrigation Management in the Olive Orchard
Previous Article in Journal
From Feather to Adsorbent: Keratin Extraction, Chemical Modification, and Fe(III) Removal from Aqueous Solution
Previous Article in Special Issue
Real-Time Remote Sensing of the Lobesia botrana Moth Using a Wireless Acoustic Detection Sensor
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improvement of Wheat Grain Yield Prediction Model Performance Based on Stacking Technique

1
School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454099, China
2
Beijing Research Center for Information Technology in Agriculture, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
*
Author to whom correspondence should be addressed.
Changchun Li and Yilin Wang contributed equally to this work and co-first authors.
Appl. Sci. 2021, 11(24), 12164; https://doi.org/10.3390/app112412164
Submission received: 18 November 2021 / Revised: 2 December 2021 / Accepted: 17 December 2021 / Published: 20 December 2021

Abstract

:
Crop growth and development is a dynamic and complex process, and the essence of yield formation is the continuous accumulation of photosynthetic products from multiple fertility stages. In this study, a new stacking method for integrating multiple growth stages information was proposed to improve the performance of the winter wheat grain yield (GY) prediction model. For this purpose, crop canopy hyperspectral reflectance and leaf area index (LAI) data were obtained at the jointing, flagging, anthesis and grain filling stages. In this case, 15 vegetation indices and LAI were used as input features of the elastic network to construct GY prediction models for single growth stage. Based on Stacking technique, the GY prediction results of four single growth stages were integrated to construct the ensemble learning framework. The results showed that vegetation indices coupled LAI could effectively overcome the spectral saturation phenomenon, the validated R2 of each growth stage was improved by 10%, 22.5%, 3.6% and 10%, respectively. The stacking method provided more stable information with higher prediction accuracy than the individual fertility results (R2 = 0.74), and the R2 of the model validation phase improved by 236%, 51%, 27.6%, and 12.1%, respectively. The study can provide a reference for GY prediction of other crops.

1. Introduction

Wheat is one of the most widely cultivated food crops in the world, with China ranking first in terms of yield and sales [1]. Therefore, a timely and accurate prediction of grain yield (GY) can not only help guide agricultural production management and assist relevant government departments to formulate scientific food policies, but also has significance for mitigating climate risks, ensuring national food security and economic stability [2,3,4,5,6,7]. The essence of the formation of the yield is the accumulation of dry matter produced by photosynthesis in different growth stages. The level of GY is influenced by complex factors such as light, soil, moisture, and atmosphere, which poses a great challenge to the accurate prediction of GY [8]. The conventional GY measurement method is not only expensive and less efficient, but also harmful to crops, making it unsuitable for large-areas crop yield estimation [9,10]. In recent decades, remote sensing technology has been successfully applied to crop growth monitoring through satellite platforms, manned airborne platforms, and ground spectral equipment [11]. However, several limitations such as deficient temporal-spatial resolution and cloud cover contamination restrain the application of satellite-based platforms [12]. In comparison, near-ground hyperspectral remote sensing data is not only cost-effective, but also has narrow band, strong continuity, and large spectral information contents [13]. In recent years, scholars at home and abroad have carried out research and made progress in yield estimation of various crops using remote sensing technology [14,15,16]. To the best of our knowledge, most current studies construct GY prediction models by using a single index such as spectral reflectance, spectral index, or vegetation index. However, as the plants grow, spectral saturation occurs when the crop coverage is high, making it difficult to continue improving GY prediction accuracy.
The essence of crop yield formation is the continuous accumulation of dry matter produced by photosynthesis in the plant body [17,18]. It was found that LAI determines many biophysical processes, such as photosynthesis, respiration, and transpiration in crops [19,20]. LAI is used as a common monitoring variable during crop growth and development and has a close relationship with final crop yield [21]. For example, Li et al. [22] used UAV-based hyperspectral remote sensing data to evaluate correlations between soybean yield and physiological and ecological parameters such as plant height, LAI, fresh biomass, and dry biomass, and identified LAI and fresh biomass as key indicators of soybean yield. Dente et al. [23] proved that LAI was closely related to GY by studying the influence of LAI at different growth stages of wheat on GY estimation and concluded that LAI can be used as an important index for GY estimation. Delécolle et al. [24] used remote sensing inversion data on the LAI as the input parameter of an output estimation model to improve the accuracy of the output estimation. Based on this, an important objective of this study is to use the winter wheat canopy structure parameter LAI coupled with vegetation indices as input features for yield prediction, in order to effectively overcome the canopy spectral saturation phenomenon.
The level of GY mainly depends on the number of ears, grains, and thousand seed weight of the crop [25]. During the growth stage of wheat from the jointing stage to the flag-raising stage, the LAI increases quickly, which is more conducive to accelerating the photosynthesis of plants. This period plays a decisive role in the number of ears of wheat. The number of grains in wheat mainly depends on the intensity of the photosynthesis from the flagging stage to the anthesis stage. In this growth period, there is a transition from vegetative growth to reproductive growth, and the nutrient transfers from the leaves to the ear, so it is the key period of grain formation and accumulation. After the anthesis stage, the number of grains per panicle is determined. The thousand seed weight of wheat is mainly determined by the intensity of photosynthesis and the rate of energy transfer from the anthesis to the grain filling stage [26,27]. Therefore, the physiology, shape, and structure of the crop vary throughout the growth cycle from germination, flowering, and fruiting to senescence and death, and the formation of GY occurs sequentially and overlappingly at different growth stages [28]. A study has shown that each key growth period of crop is closely related to the formation of the final GY [4]. Most existing studies considered the grain filling stage as the key stage of GY formation; therefore, it was considered to have the best prediction accuracy in many studies [29]. Hernandez et al. [30] reached a similar conclusion in that a penalized ridge regression model constructed using spectral reflectance data at the anthesis or grain filling stages could effectively predict GY at different water levels. However, the GY prediction model constructed using only a single critical growth stage will lose part of GY correlation information, and the prediction results may vary with the changes in the vegetation types, growth stages, and environmental conditions. Moreover, the robustness of the model is not strong, and the generalization ability is weak [31,32]. Based on this, many researchers have proposed multitemporal vegetation index-based GY prediction models to improve the accuracy and universality of model prediction. For example, Patel et al. [33] used multitemporal Terra/MODIS satellite data to simulate GY in the west of Uttar Pradesh, India. Li et al. [34] used the support vector regression method to construct a GY prediction model based on the normalized vegetation index obtained from multitemporal Landsat TM images, which effectively improved the prediction accuracy. Li et al. [35] used a large regional remote sensing GY estimation model based on multifertility MODIS-NDVI data and obtained a higher estimation accuracy than the single-fertility GY prediction model. Ghizlane et al. [36] obtained multispectral data of the four key growth periods of wheat based on the UAV remote sensing platform and estimated GY using the method of random forest combined with the vegetation index. To the best of our knowledge, there are no studies related to the ensemble of ground spectral data and LAI data from multiple growth stages of crops for yield prediction. Therefore, it is important to find a method that can combine the advantages of multiple growth stages to achieve an improved prediction accuracy and robustness for grain yield prediction.
As an important part of ensemble learning, the stacking method trains a secondary model through “learning method”, which is used to combine other primary models. Most existing studies applied this method to heterogeneous models to integrate and improve the final prediction accuracy of the model using the differences between the primary models [37]. Currently, this method has been applied to the prediction of plant photosynthesis capacity, map synthesis, and forest coverage monitoring [38,39,40]. Considering the heterogeneity of the canopy spectral information in the different growth stages of crops, multiple key growth stages related to GY formation were selected, and the integration of hyperspectral data and phenotypic data of the multiple growth stages based on the stacking method was studied. This was helpful to improve the estimation accuracy and generalization ability of the GY prediction model.
In recent years, machine-learning techniques using spectral reflectance data as input to build predictive models for plant traits have been employed and have shown improved prediction accuracy and robustness [41]. The elastic net algorithm is a type of neural network first proposed by Durbin and Willshaw [42]. Recently, the elastic network algorithm has been proven to achieve excellent results in machine learning, which can effectively avoid over fitting of the model prediction results due to small sample size and high dimension. For example, based on the elastic network regression model, Wang et al. [43] constructed the transmission and distribution cost allocation model of a power grid project to realize a reasonable allocation of operation and maintenance cost. Chen et al. [44] proposed a dam deformation prediction model based on the extreme learning machine and elastic network. The elastic network algorithm effectively alleviated the over fitting phenomenon, which remains challenging for the extreme learning machine, and the prediction results showed better stability. These results indicate that elastic networks can be used to solve regression problems.
To our knowledge, no studies have applied the strategy of combining the ENET algorithm and stacking techniques to stack LAI and hyperspectral data from multiple growth cycles to construct an ensemble learning framework for GY prediction. The main objectives of this study are as follows: (1) To assess the ability of winter wheat canopy hyperspectral vegetation indices and LAI to be used for grain yield prediction based on the ENET algorithm; (2) to study the effect of Stacking technology in improving the accuracy of grain yield prediction based on ensemble learning; (3) to analyze the influence of each individual growth stage on the development of an accurate integrated learning model. To achieve these goals, the correlations between 15 vegetation indices and LAI from four growth stages and yield were analyzed. Then, yield prediction models for individual growth stage of winter wheat were constructed based on the ENET algorithm in two cases of vegetation indices without or with coupled LAI, respectively. Finally, stacking techniques were used to integrate the yield prediction capabilities in the four growth stages, develop an ensemble learning framework to improve the robustness of the model, and analyze the weights attributed to each growth stage based on the prediction results.

2. Materials and Methods

2.1. Experiment Location and Research Material

Field trials were conducted in Yongxingtun village, Shanyang District, Jiaozuo City, Henan Province, China, during two cropping seasons in 2016–2017 and 2017–2018. Figure 1 showed the specific location of the study area. The experimental area was 110 m in length from east to west and 38 m in length from north to south, with each plot measuring 6 m × 8 m. There were 48 plots, 16 treatments, and three replications. The orthogonal experiment was conducted with different amounts of nitrogen fertilizer, different moisture contents, and different crop varieties (Table A1). Four levels of N fertilizer were set: 0, 0.0195, 0.0390, and 0.0585 kg/m2, respectively; three levels of water irrigation were set: rainfed (W1, no irrigation), normal water (W2, irrigation water 192 mm), and twice normal water (W3, irrigation water 384 mm); there are two crop varieties: Jing 9843 (J9843) and Zhong Mai 175 (ZM175). The experimental design scheme was described in Figure A1.
Figure 2 showed the general framework of this paper, which consists of four main parts. (1) Data acquisition: simultaneous acquisition of winter wheat canopy hyperspectral data and LAI data at the jointing, flagging, anthesis, and grain filling stages, and acquiring the GY data during the winter wheat harvest period; (2) Data processing: pre-processing and statistical analysis of measured data using ViewSpecPro, SPSS, Origin and Excel, and construction of various vegetation indices; (3) Data analysis: to explore the relationship between GY and LAI and vegetation indices, respectively, through correlation analysis; (4) Model framework construction: first, 15 vegetation indices and LAI coupled vegetation indices were used as input features for the ENET model to construct individual growth stages GY prediction models. Subsequently, based on the Stacking technology, the GY prediction results of the different growth stages were stacked to construct the GY prediction model that integrates multiple growth stages. Finally, the accuracy of each model was evaluated to screen the optimal model for winter wheat GY prediction.

2.2. Data Collection and Processing

2.2.1. ASD Canopy Hyperspectral Data

The canopy spectral reflectance of winter wheat measurements was performed using an ASD FieldSpec spectroradiometer. The spectral range of the ground object spectrometer was 350–2500 nm, the spectral sampling interval was 1.4 nm in the range of 350–1000 nm, the spectral sampling interval was 2 nm in the range of 1000–2500 nm, the resampling interval was 1 nm, and the field of view of the spectrometer was 25° [45]. To obtain a higher-precision spectral reflectance, the data were collected between 10:00 and 14:00 Beijing time under clear and windless weather conditions. Before obtaining the spectral reflectance, a BaSO4 calibration panel was used for the spectral calibration to eliminate the influence of sky light change on the spectral measurement [46]. The whiteboard was calibrated once in about 15 min in sunny and windless weather. When collecting the spectral reflectance, the probe should be kept vertically downward, and the height should be stable at approximately 0.6–1.0 m above the canopy. Each plot randomly selected four sample points that represent the overall growth of the wheat for spectral data collection. The parameter setting was that each sample point automatically collected 10 spectral curves, so that 40 spectral curves could be obtained from each plot. After collecting the spectral data, the ViewSpec Pro software was used to process the spectral reflectance data. The average value of 10 spectral curves was taken as the reflectance of a sample point. After processing, the reflectance was derived, and the average reflectance values of the four sample points in each cell were obtained. The average reflectance values of the four sample points in each cell were then averaged again; finally, the canopy spectral reflectance data of each experimental plot were obtained. In this study, the average spectral reflectance of the winter wheat canopy was obtained at the jointing stage, flagging stage, anthesis stage, and grain filling stage.

2.2.2. Data Collection of Grain Yield

GY data were obtained after winter wheat ripening. To ensure the accuracy of data collection, a plot combine harvester was used for harvesting, and each plot was then individually bagged and dried. When the grain moisture content was approximately 12.5%, GY was measured by weighing, GY unit was kg/hm2, and GY data of 48 plots were obtained. In addition, the yield characters of different treatments were significantly different (Table A2).

2.2.3. Data Collection of Leaf Area Index

Leaf area index (LAI) was measured by using the LAI 2200 plant canopy analyzer (LI-COR Inc., Lincoln, NE, USA) at the jointing, flagging, anthesis, and grain filling stages of winter wheat. Firstly, the LAI 2200 were performed backlit measurement in an open area to ensure that all LAI data obtained were accurate and valid. Then, the LAI 2200 was placed at the sample point parallel to the ridge and perpendicular to the ridge, respectively, with the probe placed close to the wheat plant. For wheat LAI measurement, three sample points were randomly selected for each plot. LAI values were measured for 4 times, and the arithmetic mean value was taken as the LAI of the sample point, and finally the average of the LAI values at the three sample points was calculated as the LAI value of the plot. There were significant differences in the statistical characteristics of LAI between different treatments at different growth stages (Table A3, Table A4, Table A5 and Table A6).

2.3. Method

2.3.1. Construction of Vegetation Index

The acquired hyperspectral data contain hundreds of spectral bands, and many adjacent bands are highly correlated. To reduce the data dependency, instead of using all the original bands, we extracted the narrow-band indices as spectral features and used them for modeling the winter wheat yield [36]. As listed in Table 1, 15 vegetation indices were calculated as secondary traits to predict the grain yield in this study. These indices showed resistance to environmental factors such as background (e.g., soil, atmosphere, etc.) and showed potential in assessing yield and physiological and biochemical parameters of plants.

2.3.2. Correlation Analysis

A correlation analysis is a consistency analysis of the change trend in two or more sets of data, and the magnitude of the correlation coefficient is often used to assess whether the relationship between each set of data is close and its degree. The correlation coefficient is obtained by dividing the covariance of two random variables by the standard deviation. The correlation coefficient lies between 1 and 1. The greater its absolute value, the greater the degree of correlation. When its value is close to 0, it indicates that there is no correlation. The calculation method is as follows:
ρ X , Y = cov ( X , Y ) σ X σ Y = E ( X Y ) E ( X ) E ( Y ) E ( X 2 ) E 2 ( X ) E ( Y 2 ) E 2 ( Y ) ,
where ρ X , Y represents the correlation coefficient, and cov ( X , Y ) and σ represent the covariance and standard deviation, respectively.

2.3.3. Elastic Network Algorithm

The elastic network (ENET) algorithm uses an adjustable convex combination of L1 regular and L2 regular, by introducing an additional parameter α to control the ratio of L1 regular and L2 regular term [62]. Thus, the objective function of defining ENET is as follows:
L = min | | H β T | | + λ ( α | | β | | L 1 + ( 1 α ) | | β | | L 2 2 ) ,
The ENET algorithm realizes the linear combination of ridge regression and lasso regression by introducing L1 and L2 penalty terms into the objective function [63]. The L1 regular term in the Lasso regression can avoid the loss of important variables by screening the variables and eliminating irrelevant variables, which helps improve the prediction accuracy of the model. The L2 regularized shrinkage regression coefficient in the ridge regression can achieve the stability of model prediction by reducing the error. The elastic network regression algorithm can automatically solve a specific objective function. The solution method is generally to determine the minimum value of the objective function through continuous iteration. When the objective function value reaches the local minimum or the global minimum, the network stops running. The algorithm does not require manual training and can ensure the global convergence of the network [64]. It can balance the complexity of the model while maintaining the prediction accuracy. It has the advantages of strong robustness and stable operation, which is fully reflected in this study.

2.3.4. Stacking Technology

Ensemble learning can achieve better generalization performance than using a single learner by constructing and combining multiple learners. Based on the different strategies of the learner combination, it can be divided into boosting, bagging, and stacking learning methods [37]. Among them, stacking is a typical representative of the learning method, which learns how to best combine the predictions from multiple machine learning models with good performance and is typically used to generate a set of heterogeneous prediction variables. The basic idea of stacking regression is as follows: first, the entire training set is divided into T blocks by bootstrapped sampling, and the subset of T-1 blocks is randomly selected for model training of the primary learners. The model is tested on the remaining subset. Subsequently, through the training of the primary learner, the output of various learners is used as input of the next level to train the secondary learner to obtain a final output. Figure 3 showed the working principal diagram of this study based on the use of the stacking regression technology to predict the yield of winter wheat.
The stacking method of different growth periods is similar to the stacking method of the regression model, and the entire process is divided into two levels. In this study, the ENET was used as the primary model and the secondary model for testing, to effectively avoid collinearity between the yield prediction results of each growth period of the winter wheat because of the overlap of information. First, the dataset was randomly and evenly divided into 10 parts, 9 of which were randomly selected as the original training dataset to train the first-level learner, and the remaining one was used as the model test set. Subsequently, the primary model was trained with ten-fold cross-validation to obtain the prediction of each fold, and the results were collected in the prediction matrix outside the sample. At the same time, the 10 models trained in the test set were used to predict the yield of winter wheat. The results produced 10 groups of predicted values in the test set, and the average predicted value was calculated. In this study, the regression models of the four growth stages were tested, and the outputs of each primary model were used as input to the secondary model to train again, thus integrating the prediction ability of the multiple growth stages. Similarly, ten-fold cross-validation was also used in the secondary model to obtain the final predicted value to minimize model uncertainty. To compare the prediction accuracies of the different models more fairly, the dataset was divided into training set and test set with a sampling strategy of 9:1, which was repeated 100 times, and the same ten-fold cross-validation was performed on the primary model and the secondary model in each division process. Finally, 1000 models were generated, and the average values of the coefficient of determination (R2) and root-mean-square error (RMSE) on these 1000 test sets were used as the final evaluation index of the model. The stacking regression framework was built in R v4.0.2.

2.3.5. Cross-Validation

Cross-validation [65,66], also known as rotation estimation, this theory was proposed by Seymour [67]. It is a statistically practical method of cutting data samples into smaller subsets. Typically, it is used as an accuracy test and hyperparameter screening method when the amount of sample data is small. This study used 10-fold cross-validation to determine the best parameters of the model to obtain a model with higher accuracy. By dividing the dataset into 10 parts, 9 of them were used as training data in turn, and the remaining one was used as test data. Each test yielded a corresponding accuracy. The average error rate of 10 results was used to estimate the accuracy of the algorithm. The advantage of this method is that the randomly generated sub samples could be repeatedly used for training and verification, and the results could be verified each time.

2.3.6. Model Performance Evaluation

The coefficient of determination (R2) and the root-mean-square error (RMSE) were selected as the evaluation indicators of the modeling accuracy and verification accuracy. The closer the R2 value to 1 and the lower the RMSE value, the better the model fitting effect. The calculation method is as follows:
R 2 = ( i = 1 n y i y ¯ ) 2 ( i = 1 n x i y ¯ ) 2 ,
RMSE = i = 1 , j = 1 n ( x i y i ) 2 n ,
where x i is the measured value of the yield; y i is the predicted value of yield; y ¯ is the average yield; n is the number of samples.

2.3.7. t-test

The t-test [68], also known as the student’s test, uses the t-distribution theory to infer the probability of the differences. Typically, in the case of small sample data, the t-test is performed to determine whether the p value is greater than 0.05 to compare whether the difference between the two sets of data is significant. If p < 0.05, there is a significant difference between the two groups. If p > 0.05, there is little possibility of a significant difference between the two groups. The calculation formula is as follows:
t = X ¯ 1 X ¯ 2 ( n 1 1 ) S 1 2   +   ( n 2 1 ) S 2 2 n 1   +   n 2 2 ( 1 n 1   +   1 n 2 ) ,
where X ¯ 1 and X ¯ 2 are the mean values of two samples; S 1 2 and S 2 2 are their variances; n 1 and n 2 are the two sample sizes, respectively.

3. Results

3.1. Phenotypic Analysis

The measured GY of each plot was obtained in the winter wheat harvest period, and the main statistical parameters of GY from different growing seasons were shown in Table 2. The results show that the coefficient of variation (CV) values of GY from both growing seasons separately were similar, while the CV of overall GY from both growing seasons had elevated compared to the results from single growing season only.
LAI measured values of each plot were obtained the jointing, flagging, anthesis, and grain filling stages of winter wheat, and the main statistical parameters and variation characteristics of LAI were shown in Table 3. Overall, the mean values of LAI from different growing seasons all showed a trend of increasing and then decreasing with the advancement of the fertility period, with the greatest increase rate of LAI from the jointing stage to the flagging stage and the peak of LAI at the anthesis stage, and the smallest mean value of LAI at the grain filling stage. In addition, the CV of LAI from single growing seasons were similar, while the CV of LAI from both growing seasons were significantly higher.

3.2. Correlations between Hyperspectral Vegetation Indices and Grain Yield

A correlation analysis between GY and 15 hyperspectral vegetation indices and LAI at the jointing, flagging, anthesis, and grain filling stages of wheat was conducted, as shown in Figure 4. The results showed that at the jointing stage, the absolute value of the correlation coefficient (|r|) between GY and 15 vegetation indices and LAI ranged from 0.39 to 0.49, and the absolute value of the correlation coefficient between NDWI and GY was the highest, with the best correlation (|r| = 0.49). At the flagging stage, the |r| value between GY and indices ranged from 0.47 to 0.67, among which the absolute value of the NDWI and GY correlation coefficient was the highest, that was, the strongest correlation (|r| = 0.67). At the anthesis stage, the |r| value between GY and the indices ranged from 0.48 to 0.81, and the absolute value of the correlation coefficient between NDWI and GY was the highest (|r| = 0.81). In the grain filling period, the |r| value between GY and the indices was between 0.7 and 0.82, and the correlation between the VREI and GY was the strongest (|r| = 0.82). The results showed that the correlation coefficients of LAI and GY were 0.47, 0.67, 0.68, and 0.78 at the jointing, flagging, anthesis, and grain filling stages, respectively. Overall, the correlation between GY and the hyperspectral vegetation indices and LAI showed the gradually increasing trend as the growth period progressed. All indices of each growth period were significantly correlated with GY (p < 0.001), which could be used to predict the yield of winter wheat. The correlations between different indices were shown in the attached Figure A2.

3.3. Elastic Network for Grain Yield Prediction

Based on the ENET, the vegetation indices and the vegetation indices coupled LAI were used as the input factors for the model to construct GY prediction models for four growth stages, respectively. Based on the prediction results, the box-plot of GY prediction accuracy for each growth stage was drawn with or without LAI, as shown in Figure 5.
Only 15 vegetation indices were used as the input features of the model to construct the GY prediction model for each growth period. The results showed that GY prediction accuracy at the jointing stage was R2 = 0.20, RMSE = 1.33 t ha−1; the GY prediction accuracy at the flagging stage was R2 = 0.40, RMSE = 1.18 t ha−1; the GY prediction accuracy at the anthesis stage was R2 = 0.56, RMSE = 1.06 t ha−1; the GY prediction accuracy during the grain filling period was R2 = 0.60, RMSE = 0.94 t ha−1. The above results show that the prediction accuracy R2 of the GY prediction model based on the vegetation indices increased with the growth period and that the RMSE decreased with the growth of winter wheat, indicating that GY prediction ability for winter wheat increased with the growth period and reached the best prediction effect at the grain filling stage.
To study the influence of LAI on the accuracy of winter wheat GY prediction model, 15 vegetation indices of the four growth periods coupled with LAI of the same growth period were used as input factors for the model to construct a single-growth-period GY prediction model. The results showed that R2 = 0.22, RMSE = 1.31 t ha−1 at the jointing stage, R2 = 0.49, RMSE = 1.1 t ha−1 at the flagging stage, R2 = 0.58, RMSE = 1.02 t ha−1 at the anthesis stage, and R2 = 0.66, RMSE = 0.88 t ha−1 at the grain filling stage. Overall, the prediction accuracy of GY of winter wheat gradually increased with the growth period, and the prediction accuracy of the model at the grain filling stage was the highest.
The results showed that the 15 vegetation indices could be coupled with LAI to construct a GY prediction model. During the jointing, flagging, anthesis, and grain filling stages, the average R2 of GY prediction model increased by 10%, 22.5%, 3.6% and 10%, the average RMSE were reduced by 1.5%, 6.7%, 3.8%, and 6.4%, respectively. A comparative analysis of the above results showed that GY prediction accuracy of each growth period had been improved after coupling the vegetation indices with LAI. Among them, the GY prediction accuracy of the flagging stage had improved the most.
To further verify the performance of GY prediction by coupling LAI with the vegetation indices, a paired t-test was used to evaluate whether there was a significant difference in the impact of vegetation indices on GY prediction accuracy when coupling with LAI, as listed in Table 4.
The results showed that the p-values for all four growth stages were less than 0.05, indicating that the vegetation indices coupled with LAI were used to construct GY prediction models with statistically significant improvement in prediction accuracy. The T value of the flagging stage was the highest (t = 32.264), and the accuracy was improved most evidently.
As shown in Figure A3, the Pearson’s correlations between the GY predictions for the four individual growth stages were high (r = 0.6–0.95). By contrast, the density curves of the grain yield predictions were slightly different from each other. Moreover, the base learners differed significantly in the distribution interval of the prediction accuracy.

3.4. Comparative Analysis of Grain Yield Prediction Accuracy from Multiple Growth Stages

To evaluate the performance of the stacking method, the ENET algorithm was used as a secondary model to construct GY prediction model by integrating the prediction results of the four growth stages with and without coupled LAI in two cases, respectively. The accuracy of GY prediction results obtained by combining four of the base learners were presented in Figure 6.
The GY prediction model based on multiple growth stages was constructed using only the vegetation indices as the input factors for the model. The average prediction accuracy of the integrated model was R2 = 0.67, RMSE = 0.85 t ha−1. Overall, the GY prediction accuracy of the integrated multiple growth stages had been improved to varying degrees compared with the four single growth stages. Compared with the prediction accuracy of GY at the jointing, flagging, anthesis, and grain filling stages, the average R2 of the integrated model increased by 235%, 67.5%, 19.6% and 11.7%, and the average RMSE decreased by 36.1%, 28%, 19.8%, and 9.6%, respectively.
The vegetation indices coupled with LAI were used as the input factors of the model to construct a GY prediction model by integrating multiple growth stages. The results showed that the prediction accuracy of the integrated model was R2 = 0.74, RMSE = 0.81 t ha−1. Overall, after integrating the multiple growth stages, the prediction accuracy of GY had been improved to varying degrees. Compared with the jointing, flagging, anthesis and grain filling stages, the average R2 values of the integrated model increased by 236%, 51%, 27.6% and 12.1%, and the average RMSE decreased by 38.2%, 26.4%, 20.6%, and 8%, respectively. Based on the analysis of the above research results, the vegetation indices coupled with LAI were used as the model input factors, and based on the stacking technology, a GY prediction model integrating the four growth stages was constructed, yielding highest verification accuracy (R2 = 0.74, RMSE = 0.81 t ha−1).
To further verify whether the improvement in GY prediction accuracy by integrating information from multiple growth stages is significant, paired samples t-tests were used to assess whether there is a statistical difference in the validation R2 between the integrated model and the single-growth stage model under two cases of vegetation indices with and without coupled LAI, respectively. The results were listed in Table 5.
The results showed that the integrated model outperformed the other base models with or without LAI, and the p values were less than 0.05 indicating that the accuracy improvement obtained by the integrated model was statistically significant. This further demonstrated the effectiveness of the stacking technique proposed in this paper for GY prediction. Among them, the accuracy of the jointing stage was improved most evidently, and the t value was the highest (t = 51.153 and t = 60.18, respectively), consistent with the above results. Moreover, when the vegetation indices coupled with LAI were used as the input factors of the model, the prediction accuracy of the integrated model improved more evidently, and GY prediction effect was better.

3.5. Weight Results for the Secondary Models

To determine the weight of each growth stage in the model stacking process, the regression coefficients of the prediction results of each growth period in the secondary model were statistically analyzed with and without coupling LAI, as shown in Figure 7.
The results showed that when only the vegetation indices were used as the input factors for GY prediction model, the average regression coefficients of the jointing, flagging, anthesis, and grain filling stages in the integrated model were 0.04, 0.05, 0.49, and 0.66, respectively. In the case of vegetation indices coupled with LAI, a GY prediction model integrating multiple growth stages was constructed, and the average regression coefficients of the jointing, flagging, anthesis, and grain filling stages were 0.06, 0.23, 0.48, and 0.72, respectively. Overall, with the development of the growth period, the proportion of each growth stage in the integrated model showed a gradual upward trend, in which the average regression coefficient of the grain filling stage and the weight were the highest. Compared with only using vegetation indices for GY prediction, after coupling the vegetation indices with LAI, the weight of the jointing, flagging, and grain filling stages in the integrated model all increased, while the weight of the anthesis stage decreased slightly, and the weight of the flagging stage increased most evidently.

4. Discussion

This study aims to improve the accuracy and robustness of crop yield prediction models. In this study, the data of two planting seasons from 2016 to 2018 were used to evaluate the improvement effect of GY prediction model. The information from multiple cropping seasons increased the variability of the data (Table 2 and Table 3), which was due to the differences in environmental and phenological conditions between different years and enhanced the prediction accuracy and transferability of the model. In recent years, hyperspectral remote sensing has great potential for application in rapid and accurate estimation of crop parameters and yields in agricultural fields because of its advantages of narrow waveband and strong continuity [69]. Considering the information overlap between adjacent bands, 15 vegetation indices were calculated in this study for constructing the GY prediction model instead of all bands in order to avoid generating data redundancy. These indices are located in the visible red and near-infrared wavelengths, which are important for photosynthesis in plants and are often used for timely monitoring of vegetation cover and growth vigor, etc. [70]. The results of the correlation analysis (Figure 4) indicated that all vegetation indices performed a highly significant correlation with GY (p < 0.001). However, there are differences in the potential of different vegetation indices for GY prediction due to the fact that the vegetation information obtained from different spectral channels has various correlations with different elements of vegetation or a certain character state. The NDWI index showed a high correlation with GY due to its ability to effectively extract the water content of the vegetation canopy and respond to moisture changes in a timely manner [71]. The VREI index provides a timely response to changes in chlorophyll concentration, canopy structure and water content in plants, and is often used in vegetation phenology change monitoring, precision agriculture and vegetation productivity modeling [59]. The vegetation cover will change as the fertility period advances, and the effect of soil background can be effectively reduced by introducing MCARI and SAVI. The ARVI can effectively reduce the dependence of spectral reflectance on atmospheric properties [48], which is important for the transfer of models to airborne remote sensing platforms. NDVI eliminates most of the variation in irradiance associated with instrument calibration, sun angle, topography, cloud shadows, and atmospheric conditions, enhancing the response to vegetation [52]. PRI is sensitive to changes in carotenoid content in plants and gradually increases in correlation with GY during late crop growth. SIPI and PSRI indices are often used for canopy physiological stress monitoring and crop yield analysis [56,58]. In addition, the correlation between the same vegetation index and GY varies at different growth stages. This is due to the fact that as the crop fertility progresses, the growth and development of the plant and the influence of the external environment led to changes in its internal components and structure, which cause the spectral characteristics of the plant canopy or leaves to change subsequently [72]. Thus, the combination of multiple vegetation indices used to construct the GY prediction model can achieve complementary information to overcome the limitations of a single band or a single index, which effectively improves the prediction accuracy and robustness of the model (Table A7). This has important significance for the transfer of the model to practical applications.
The essence of yield formation is the complex process of production, transportation, accumulation, and distribution of photosynthetic products [73]. As the reproductive period advances, spectral saturation of the vegetation index occurs when the vegetation cover is greater than 80%. In this study, the coupling of structural parameters (LAI) and spectral parameters (vegetation indices) of the crop effectively improved the accuracy of GY prediction at each growth stage, and the results of the t-test (Table 4) also fully demonstrated that the improvement of model effects at each growth stage was significant (p < 0.05). This is because LAI is an important canopy structure information of crops and one of the important variables to determine the intensity of photosynthesis [4,74]. The size of vegetation leaf area directly affects light interception, photosynthesis and transpiration efficiency of vegetation canopy, and is considered to be the most important factor affecting plant productivity [75]. The results are consistent with those reported by Delécolle et al. [24] and Wang et al. [76]. Among them, the most significant improvement in model accuracy was observed in the flagging stage (the relative value of R2 increased by 22.5%) which is consistent with our statistical t-test (t = 32.264). This is because the flagging stage is a critical period for the formation of wheat spike number, and this growth stage is mainly reflected in the rapid growth of leaf area and the consequent increase in photosynthetic efficiency, which contributes more to the accumulation of dry matter [77]. Our study demonstrated that canopy spectral saturation can be effectively overcome by coupling LAI and vegetation indices, and this study can provide a new method for accurate and rapid estimation of crop yield.
The results of GY estimation at single growth stage indicated that there were significant differences in yield estimation at different fertility stages, which showed an increasing accuracy of prediction with the advancement of growth stage. This is because crop growth and development is a dynamic and continuous process, and changes in the growth state and environmental characteristics of the crop at each growth stage can lead to different morphological characteristics of the spectral curve, and thus the accuracy of the yield estimation models at different growth stages varies. Since each key stage of crop growth and development can have a significant impact on yield formation, it is important to find a method to combine information from multiple growth stages to improve the accuracy and robustness of the model. The objective of employing a stacking technique was to improve the prediction ability by combining data from multiple growth stages instead of using single growth stage data. When implementing a stacking approach, it is necessary to include self-sufficient, independent, and diverse base learners for analysis (Figure A3) [47]. The results showed that there was a strong correlation between the predicted results of different growth stages (correlation coefficients of R = 0.6–0.95), which was due to the overlap of information between adjacent fertility stages. ENET is used as a secondary model to perform the multicollinearity data analysis in the model when stacking multiple base learners due to its advantages of both ridge regression and lasso regression [63]. In this study, vegetation indices and LAI from four key growth stages of the crop were successfully combined to achieve higher accuracy in predicting grain yield than any single growth stage (Figure 5 and Figure 6). In addition, the accuracy parameters (R2 and RMSE) of the base learner fluctuated more and had a wider range than those of the ensemble approach, which also demonstrated the stability of the GY model constructed based on the Stacking technique. However, stacking techniques are sensitive to the timing of data collection at different growth stages. Due to the differences in developmental rates of different varieties, this may lead to the fact that different varieties may be at different growth stages at the same time. In addition, due to the different gradients of water and fertilizer stress treatments, which may have an impact on the growth and development rate of the crop, this poses a significant challenge to our data collection process.
The analysis of the regression coefficients combining the four base learners showed that the highest weight was given to the grain filling stage with or without LAI, which is consistent with the highest accuracy of the previous prediction of the grain filling stage. Therefore, the use of the grain filling stage as a key learner in the grain yield prediction has a very positive effect on the improvement of the prediction effectiveness of the ensemble model. This is rational since grain filling is the stage where wheat transfers organic matter such as starch and protein produced by photosynthesis from the vegetative organs to the grains, and the vegetation indexes in this period are closely related to the final quality of thousand-grain weight [29]. However, the regression coefficients for the jointing period showed negative values in some cases, which is obviously detrimental to improving the model accuracy. The reason for this phenomenon is that the crop cover is small and the ground surface is exposed at the jointing stage, so it is easy to be disturbed by the soil background and other factors during the spectral data collection [78]. Moreover, the LAI at this stage is relatively small and photosynthesis is weak, resulting in the accumulation of little dry matter yet, and its contribution in the integrated model is poor compared to other growth stages. The t-test results (Table 5) show that the GY prediction model with stacked four growth stages performed with good accuracy and robustness than any single growth stage, which has important implications for the practical application of the prediction model.

5. Conclusions

Winter wheat is one of the most widely grown grain crops in the world, and pre-harvest insights into yields can help optimize management practices. In this study, we developed an ensemble learning framework for stacking information from multiple growth stages of the crop. The results showed that the ensemble model outperformed other individual growth stage models. In addition, coupling vegetation indices with LAI can effectively overcome the spectral saturation phenomenon and improve the prediction accuracy of the model. Our study demonstrated the effectiveness of Stacking technique for improving GY prediction accuracy (Figure A4), which can provide new ideas for yield estimation of other crops.

Author Contributions

Conceptualization, C.L.; Data curation, C.M.; methodology, Y.W.; software, Y.W.; formal analysis, Y.L.; writing original draft preparation, Y.W.; writing review and editing, C.L.; validation, Y.W.; investigation, W.C. and Z.X.; supervision, F.D.; visualization, J.L.; funding acquisition, C.L. and C.M. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Natural Science Foundation of China (41871333), the Important Project of Science and Technology of the Henan Province (212102110238) and Key scientific research project of Henan college and university (20B420002).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Since the data sets were acquired through field collection, all data cannot be shared due to legal, ethical, and privacy restrictions.

Acknowledgments

We are grateful to the anonymous reviewers for their comments and recommendations.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Combination of different treatments.
Table A1. Combination of different treatments.
N2—W1—P2N1—W1—P1N4—W1—P2N3—W1—P1
N4—W2—P2N3—W2—P1N2—W2—P2N1—W2—P1
N3—W2—P2N4—W2—P1N1—W2—P2N2—W2—P1
N1—W3—P2N2—W3—P1N3—W3—P2N4—W3—P1
Variety treatment—P1: Jing 9843; P2: Zhong Mai 175. Nitrogen treatment—N1: 0 kg/hm2; N2: 195 kg/hm2; N3: 390 kg/hm2; N4: 585 kg/hm2. Moisture treatment—W1: no irrigation; W2: irrigation water 192 mm; W3: irrigation water 384 mm.
Table A2. Main statistical parameters of grain yield under different treatments.
Table A2. Main statistical parameters of grain yield under different treatments.
Measurement Dates2016–20172017–20182016–2018
TreatmentsMEAN
(t ha−1)
SD
(t ha−1)
CV
(%)
MEAN
(t ha−1)
SD
(t ha−1)
CV
(%)
MEAN
(t ha−1)
SD
(t ha−1)
CV
(%)
N2—W1—P26.820.7911.587.360.7610.337.090.8812.41
N4—W2—P27.890.668.318.470.657.678.180.698.44
N3—W2—P26.181.3622.076.651.3219.856.411.4622.76
N1—W3—P24.231.3130.914.781.2826.784.501.5434.20
N1—W1—P14.050.327.804.610.286.074.330.4610.62
N3—W2—P15.630.173.056.100.213.445.860.335.63
N4—W2—P16.800.446.517.30.415.627.050.496.95
N2—W3—P15.520.9717.526.070.8914.665.801.0918.80
N4—W1—P27.660.385.028.130.354.317.890.577.22
N2—W2—P26.820.9013.167.370.9512.897.091.1816.63
N1—W2—P23.870.8922.944.340.8820.284.101.0625.84
N3—W3—P24.860.091.895.360.122.245.110.142.74
N3—W1—P15.300.8215.405.80.7613.105.550.9016.22
N1—W2—P15.700.447.796.210.58.055.950.6811.42
N2—W2—P14.441.2227.484.931.0421.104.691.3328.38
N4—W3—P17.650.8811.538.130.9111.197.891.0513.31
SD means standard deviation; CV means coefficient of variation.
Table A3. Main statistical parameters of LAI at the jointing stage under different treatments.
Table A3. Main statistical parameters of LAI at the jointing stage under different treatments.
Measurement Dates2016–20172017–20182016–2018
TreatmentsMEANSDCV
(%)
MEANSDCV
(%)
MEANSDCV
(%)
N2—W1—P22.330.521.432.830.5218.372.580.5722.08
N4—W2—P23.930.184.594.410.214.764.170.266.24
N3—W2—P23.250.5516.923.750.6617.603.500.6919.71
N1—W3—P21.620.1710.522.170.198.761.890.2312.15
N1—W1—P12.040.314.672.610.3212.262.330.3515.04
N3—W2—P13.420.257.323.930.246.113.670.297.90
N4—W2—P13.310.39.053.870.287.243.590.339.19
N2—W3—P11.620.2112.952.170.29.221.900.2714.24
N4—W1—P23.440.4613.383.960.5112.893.700.5915.96
N2—W2—P22.560.259.763.040.3511.512.800.3813.57
N1—W2—P21.670.2917.412.160.3817.591.910.4020.91
N3—W3—P22.190.29.152.710.259.242.450.2711.03
N3—W1—P13.110.6420.563.620.6818.783.370.7221.39
N1—W2—P13.010.154.993.490.216.023.250.247.39
N2—W2—P12.010.3919.402.530.4317.002.270.4620.26
N4—W3—P13.930.205.094.460.214.714.190.327.63
SD means standard deviation; CV means coefficient of variation.
Table A4. Main statistical parameters of LAI at the flagging stage under different treatments.
Table A4. Main statistical parameters of LAI at the flagging stage under different treatments.
Measurement Dates2016–20172017–20182016–2018
TreatmentsMEANSDCV
(%)
MEANSDCV
(%)
MEANSDCV
(%)
N2—W1—P23.900.7820.004.610.8117.574.260.8920.92
N4—W2—P25.290.234.355.90.335.595.600.346.08
N3—W2—P24.270.7116.634.930.6613.394.600.8117.61
N1—W3—P22.150.6329.302.840.6422.542.500.7429.66
N1—W1—P12.150.4621.362.860.5418.882.510.5622.34
N3—W2—P13.520.329.084.180.358.373.850.379.61
N4—W2—P14.040.276.684.710.316.584.380.398.91
N2—W3—P12.820.3913.813.510.4211.973.170.4815.16
N4—W1—P23.580.226.154.230.255.913.900.276.92
N2—W2—P22.900.3211.023.60.3910.833.250.4012.30
N1—W2—P21.670.3420.422.410.4217.432.040.4321.10
N3—W3—P22.610.103.833.320.175.122.960.186.07
N3—W1—P12.820.4716.663.380.4613.613.100.5216.77
N1—W2—P13.910.8120.724.530.8719.214.220.8820.85
N2—W2—P12.550.5421.163.250.6921.232.900.6723.10
N4—W3—P14.050.8521.014.620.9420.354.330.9622.16
SD means standard deviation; CV means coefficient of variation.
Table A5. Main statistical parameters of LAI at the anthesis stage under different treatments.
Table A5. Main statistical parameters of LAI at the anthesis stage under different treatments.
Measurement Dates2016–20172017–20182016–2018
TreatmentsMEANSDCV
(%)
MEANSDCV
(%)
MEANSDCV
(%)
N2—W1—P23.420.7822.794.310.8118.793.870.8923.02
N4—W2—P25.891.1619.696.681.0916.326.291.3120.84
N3—W2—P25.091.4227.915.921.3823.315.501.5528.16
N1—W3—P22.090.5425.832.950.5819.662.520.6626.19
N1—W1—P11.540.2818.192.430.3213.171.980.3718.64
N3—W2—P14.260.358.235.120.387.424.690.418.75
N4—W2—P14.840.428.685.670.437.585.250.499.33
N2—W3—P12.680.4014.953.560.4612.923.120.4715.08
N4—W1—P23.920.4010.204.730.449.304.330.4911.33
N2—W2—P23.270.4915.004.140.5012.083.700.5615.12
N1—W2—P21.250.3427.202.150.5224.191.680.4727.97
N3—W3—P22.750.4817.443.650.513.703.200.5617.49
N3—W1—P13.180.4514.143.720.4712.633.450.5114.78
N1—W2—P14.320.7517.375.210.713.444.760.8317.42
N2—W2—P12.550.8332.543.50.8925.433.030.9832.39
N4—W3—P14.220.358.295.060.316.134.640.439.26
SD means standard deviation; CV means coefficient of variation.
Table A6. Main statistical parameters of LAI at the grain filling stage under different treatments.
Table A6. Main statistical parameters of LAI at the grain filling stage under different treatments.
Measurement Dates2016–20172017–20182016–2018
TreatmentsMEANSDCV
(%)
MEANSDCV
(%)
MEANSDCV
(%)
N2—W1—P22.270.5323.332.430.5522.632.350.5623.82
N4—W2—P23.210.7222.413.270.6921.103.240.7322.52
N3—W2—P22.180.8036.682.090.7435.412.140.8137.93
N1—W3—P20.690.0912.970.820.1113.410.780.1215.38
N1—W1—P10.880.1820.341.400.2316.431.140.2421.01
N3—W2—P12.000.3517.282.130.3817.842.070.3717.90
N4—W2—P12.640.7729.022.740.7627.742.690.8029.72
N2—W3—P12.420.187.442.570.218.172.490.239.22
N4—W1—P22.610.176.522.690.145.202.650.186.80
N2—W2—P22.440.5120.932.580.4517.442.510.5321.13
N1—W2—P20.970.099.311.180.1311.021.070.1211.18
N3—W3—P21.080.1917.661.250.2116.801.160.2218.92
N3—W1—P11.100.2018.241.210.2319.011.150.2622.55
N1—W2—P11.260.2116.601.270.1914.961.270.2318.15
N2—W2—P11.240.4737.811.250.4536.001.250.4838.51
N4—W3—P12.530.3513.672.640.3312.502.590.3915.08
SD means standard deviation; CV means coefficient of variation.
Table A7. Statistical analysis of GY prediction accuracy for a single optimal variable and multiple variables.
Table A7. Statistical analysis of GY prediction accuracy for a single optimal variable and multiple variables.
Growth StageInput FeaturesR2RMSE (t ha−1)
JointingNDWI0.161.38
VIs0.21.33
FlaggingNDWI0.181.36
VIs0.41.18
AnthesisNDWI0.421.2
VIs0.561.06
Grain fillingVREI0.541.08
VIs0.60.94
VIs means 15 vegetation indices.
Figure A1. Experimental design diagram.
Figure A1. Experimental design diagram.
Applsci 11 12164 g0a1
Figure A2. The correlation matrices between each index and grain yield. (a) Jointing stage; (b) Flagging stage; (c) Anthesis stage; (d) Grain filling stage. Significant (**) at p < 0.01.
Figure A2. The correlation matrices between each index and grain yield. (a) Jointing stage; (b) Flagging stage; (c) Anthesis stage; (d) Grain filling stage. Significant (**) at p < 0.01.
Applsci 11 12164 g0a2
Figure A3. Regression plots, density curves and correlation coefficients for grain yield predictions generated by the four growth stages. In the four key growth stages. (a) Without LAI; (b) With LAI. Significant (***) at p < 0.001.
Figure A3. Regression plots, density curves and correlation coefficients for grain yield predictions generated by the four growth stages. In the four key growth stages. (a) Without LAI; (b) With LAI. Significant (***) at p < 0.001.
Applsci 11 12164 g0a3
Figure A4. Spatial distribution of grain yield (t ha−1) at the plot scale using an ensemble learning model.
Figure A4. Spatial distribution of grain yield (t ha−1) at the plot scale using an ensemble learning model.
Applsci 11 12164 g0a4

References

  1. Yang, M.; Adeel, H.M.; Xu, K.; Zheng, C.; Rasheed, A.; Zhang, Y.; Jin, X.; Xia, X.; Xiao, Y.; He, Z. Assessment of Water and Nitrogen Use Efficiencies Through UAV-Based Multispectral Phenotyping in Winter Wheat. Front. Plant Sci. 2020, 11, 927. [Google Scholar] [CrossRef]
  2. Hutchinson, C.F. Uses of satellite data for famine early warning in sub-Saharan Africa. Int. J. Remote Sens. 1991, 12, 1405–1421. [Google Scholar] [CrossRef]
  3. Franch, B.; Vermote, E.F.; Becker-Reshef, I.; Claverie, M.; Huang, J.; Zhang, J.; Justice, C.; Sobrino, J. Improving the timeliness of winter wheat production forecast in the United States of America, Ukraine and China using MODIS data and NCAR Growing Degree Day Information. Remote Sens. Environ. 2015, 161, 131–148. [Google Scholar] [CrossRef]
  4. Chen, Y.; Zhang, Z.; Tao, F. Improving regional winter wheat yield estimation through assimilation of phenology and leaf area index from remote sensing data. Eur. J. Agron. 2018, 101, 163–173. [Google Scholar] [CrossRef]
  5. Vanli, Ö.; Ahmad, I.; Ustundag, B.B. Area estimation and yield forecasting of wheat in southeastern turkey using a machine learning approach. J. Indian Soc. Remote Sens. 2020, 48, 1757–1766. [Google Scholar] [CrossRef]
  6. Wu, S.; Yang, P.; Ren, J.; Chen, Z.; Li, H. Regional winter wheat yield estimation based on the wofost model and a novel vw-4densrf assimilation algorithm. Remote Sens. Environ. 2021, 255, 112276. [Google Scholar] [CrossRef]
  7. Hatfield, J.L.; Prueger, J.H. Value of using different vegetative indices to quantify agricultural crop characteristics at different growth stages under varying management practices. Remote Sens. 2010, 2, 562–578. [Google Scholar] [CrossRef] [Green Version]
  8. Tao, H.; Feng, H.; Xu, L.; Miao, M.; Yang, G.; Yang, X.; Fan, L. Estimation of the yield and plant height of winter wheat using UAV-based hyperspectral images. Sensors 2020, 20, 1231. [Google Scholar] [CrossRef] [Green Version]
  9. Cao, J.; Zhang, Z.; Luo, Y.; Zhang, L.; Zhang, J.; Li, Z. Wheat yield predictions at a county and field scale with deep learning, machine learning, and google earth engine. Eur. J. Agron. 2021, 123, 126204. [Google Scholar] [CrossRef]
  10. Liu, C.; Chen, Z.; Shao, Y.; Chen, J.; Tuya, H.; Pan, H. Research advances of sar remote sensing for agriculture applications: A review. J. Integr. Agric. 2019, 18, 506–525. [Google Scholar] [CrossRef] [Green Version]
  11. Wang, P.; Li, Z.; Li, J.; Schmit, T.J. Added-value of geo-hyperspectral infrared radiances for local severe storm forecasts using the hybrid osse method. Ad. Atmos. Sci. 2021, 38, 1315–1333. [Google Scholar] [CrossRef]
  12. Fletcher, R.S. Using vegetation indices as input into random forest for soybean and weed classification. Am. J. Plant. Sci. 2016, 7, 2186–2198. [Google Scholar] [CrossRef] [Green Version]
  13. Wang, K.; Wang, X. Research on winter wheat yield estimation with the multiply remote sensing vegetation index combination. J. Arid Land Resour. Environ. 2017, 31, 44–49. [Google Scholar]
  14. Tao, H.; Xu, L. Winter wheat yield estimation based on UAV hyperspectral remote sensing data. Trans. Chin. Soc. Agric. Mach. 2020, 51, 146–155. [Google Scholar]
  15. Montesinos-López, O.A.; Montesinos-López, A.; Crossa, J.; de los Campos, G.; Alvarado, G.; Suchismita, M.; Jessica Rutkoski, J.; González-Pérez, L.; Burgueño, J. Predicting grain yield using canopy hyperspectral reflectance in wheat breeding data. Plant Methods 2017, 13, 4. [Google Scholar] [CrossRef] [Green Version]
  16. Parker, J.E.; Crowder, D.W.; Eigenbrode, S.D. Trap crop diversity enhances crop yield. Agric. Ecosys. Environ. 2016, 232, 254–262. [Google Scholar] [CrossRef] [Green Version]
  17. Sadeghi-Tehran, P.; Virlet, N.; Ampe, E.M.; Reyns, P.; Hawkesford, M.J. Deepcount: In-field automatic quantification of wheat spikes using simple linear iterative clustering and deep convolutional neural networks. Front. Plant Sci. 2019, 10, 1176. [Google Scholar] [CrossRef] [PubMed]
  18. Yao, X.; Wang, N.; Liu, Y.; Cheng, T.; Tian, Y.; Chen, Q.; Zhu, Y. Estimation of Wheat LAI at Middle to High Levels Using Unmanned Aerial Vehicle Narrowband Multispectral Imagery. Remote Sens. 2017, 9, 1304. [Google Scholar] [CrossRef] [Green Version]
  19. Shi, X.; Yu, Z.; Zhao, J.; Shi, Y.; Wang, X. Effect of Nitrogen Application Rateon Photosynthetic Characteristics, Dry Matter Accumulation and Distribution and Yield of High-Yielding Winter Wheat. J. Triticeae Crops 2021, 41, 713–721. [Google Scholar]
  20. Novelli, F.; Spiegel, H.; Sandén, T.; Vuolo, F. Assimilation of sentinel-2 leaf area index data into a physically-based crop growth model for yield estimation. Agronomy 2019, 9, 255. [Google Scholar] [CrossRef] [Green Version]
  21. Song, Y.; Wang, J.; Shang, J.; Liao, C. Using UAV-based SOPC derived LAI and SAFY model for biomass and yield estimation of winter wheat. Remote Sens. 2020, 12, 2378. [Google Scholar] [CrossRef]
  22. Li, C.; Ma, C.; Cui, Y.; Lu, G.; Wei, F. Uav hyperspectral remote sensing estimation of soybean yield based on physiological and ecological parameter and meteorological factor in china. J. Indian Soc. Remote 2020, 49, 873–886. [Google Scholar] [CrossRef]
  23. Dente, L.; Satalino, G.; Mattia, F.; Rinaldi, M. Assimilation of leaf area index derived from asar and meris data into ceres-wheat model to map wheat yield. Remote Sens. Environ. 2007, 112, 1395–1407. [Google Scholar] [CrossRef]
  24. Delécolle, R.; Maas, S.J.; Guérif, M.; Baret, F. Remote sensing and crop production models: Present trends. ISPRS J. Photogramm. 1992, 47, 145–161. [Google Scholar] [CrossRef]
  25. Hasan, M.M.; Chopin, J.P.; Laga, H.; Miklavcic, S.J. Detection and analysis of wheat spikes using convolutional neural networks. Plant Methods 2018, 14, 100. [Google Scholar] [CrossRef] [Green Version]
  26. Wu, X.; Tang, Y.; Li, C.; Wu, C.; Huang, G. Effect of waterlogging at different growth stages on flag leaf chlorophyll fluorescence and grain-filling properties of winter wheat. Chin. J. Eco-Agric. 2015, 23, 309–318. [Google Scholar]
  27. Jin, P.; Ru, Z.; Zhou, M.; Hou, Y.; He, H.; Luo, Y. Effects of low temperature and overcast rainy during spring on the yield characters of wheat. Crops Res. 2010, 24, 103–108. [Google Scholar]
  28. Chen, J.; Zhang, F.; Zhou, H.; Zhao, Y.; Wei, X. Effect of irrigation at different growth stages and nitrogen fertilizer on maize growth, yield and water use efficiency. J. Northwest A F Univ.-Nat. Sci. Ed. 2011, 39, 322–331. [Google Scholar]
  29. Guan, K.; Wu, J.; Kimball, J.S.; Anderson, M.C.; Frolking, S.; Li, B.; Hain, C.R.; Lobell, D.B. The shared and unique values of optical, fluorescence, thermal and microwave satellite data for estimating large-scale crop yields. Remote Sens. Environ. 2017, 199, 333–349. [Google Scholar] [CrossRef] [Green Version]
  30. Hernandez, J.; Lobos, G.; Matus, I.; del Pozo, A.; Silva, P.; Galleguillos, M. Using ridge regression models to estimate grain yield from field spectral data in bread wheat (Triticum aestivum L.) grown under three water regimes. Remote Sens. 2015, 7, 2109–2126. [Google Scholar] [CrossRef] [Green Version]
  31. Tan, H.; Li, S.; Wang, K.; Xie, R.; Gao, S.; Ming, B.; Qing, Y.; Lai, J.; Liu, G.; Tang, Q. Monitoring canopy chlorophyll density in seedlings of winter wheat using imaging spectrometer. Acta Agron. Sin. 2008, 34, 1812–1817. [Google Scholar] [CrossRef]
  32. Sun, B.Y.; Chang, Q.R.; Liu, M.Y. Inversion chlorophyll mass fraction in winter wheat canopy by hyperspectral reflectance. Acta Agric. Boreali. Sin. 2017, 26, 552–559. [Google Scholar]
  33. Patel, N.R.; Mohammed, A.J.; Rakhesh, D. Modeling of wheat yields using multi-temporal terra/modis satellite data. Geocarto Int. 2006, 21, 43–50. [Google Scholar] [CrossRef]
  34. Li, R.; Li, C.; Xu, X.; Wang, J.; Yang, X.; Huang, W.; Pan, Y. Winter wheat yield estimation based on support vector machine regression and multi-temporal remote sensing data. Trans. Chin. Soc. Agric. Eng. 2009, 25, 114–117. [Google Scholar]
  35. Li, Z.; Song, M.; Feng, H. Dynamic characteristics of leaf area index and plant height of winter wheat influenced by irrigation and nitrogen coupling and their relationships with yield. Trans. Chin. Soc. Agric. Eng. 2017, 33, 195–202. [Google Scholar]
  36. Ghizlane, A.; Eddine, D.J.; Imane, S.; Samir, B.; Ettarid, M. Mapping wheat dry matter and nitrogen content dynamics and estimation of wheat yield using UAV multispectral imagery machine learning and a variety-based approach: Case study of morocco. AgriEngineering 2021, 3, 29–49. [Google Scholar]
  37. Feng, L.; Zhang, Z.; Ma, Y.; Du, Q.; Williams, P.; Drewry, J.; Luck, B. Alfalfa yield prediction using UAV-based hyperspectral imagery and ensemble learning. Remote Sens. 2020, 12, 2028. [Google Scholar] [CrossRef]
  38. Clinton, N.; Yu, L.; Gong, P. Geographic stacking: Decision fusion to increase global land cover map accuracy. ISPRS J. Photogramm. Remote Sens. 2015, 103, 57–65. [Google Scholar] [CrossRef]
  39. Healey, S.P.; Cohen, W.B.; Yang, Z.; Brewer, C.K.; Brooks, E.B.; Gorelick, N.; Hernandez, A.; Huang, C.; Hughes, M.J.; Kennedy, R. Mapping forest change using stacked generalization: An ensemble approach. Remote Sens. Environ. 2018, 204, 717–728. [Google Scholar] [CrossRef]
  40. Fu, P.; Meacham-Hensold, K.; Guan, K.; Bernacchi, C.J. Hyperspectral leaf reflectance as proxy for photosynthetic capacities: An ensemble approach based on multiple machine learning algorithms. Front. Plant Sci. 2019, 10, 730. [Google Scholar] [CrossRef] [PubMed]
  41. Zhou, Z.; Yufang, J.; Bin, C.; Patrick, B. California almond yield prediction at the orchard level with a machine learning approach. Front. Plant Sci. 2019, 10, 809. [Google Scholar]
  42. Durbin, R.; Willshaw, D. An analogue approach to the travelling salesman problem using an elastic net method. Nature 1987, 326, 689–691. [Google Scholar] [CrossRef] [PubMed]
  43. Wang, Y.; Wang, S.; Zheng, Y.; Song, F.; Ma, Y.; Zhang, H. Calculation and allocation of operation and maintenance cost of power grid project based on elastic net. Autom. Electr. Power Syst. 2020, 44, 165–172. [Google Scholar]
  44. Chen, Y.; Xiao, G.; Hu, M.; Huang, J. Dam deformation prediction based on extreme learning machine and elastic network. Sci. Surv. Mapp. 2020, 45, 20–27+40. [Google Scholar]
  45. Osco, L.P.; Ramos, A.P.; Faita Pinheiro, M.M.; Moriya, É.A.S.; Imai, N.N.; Estrabis, N. A machine learning framework to predict nutrient content in valencia-orange leaf hyperspectral measurements. Remote Sens. 2020, 12, 906. [Google Scholar] [CrossRef] [Green Version]
  46. Fei, S.; Hassan, M.A.; He, Z.; Chen, Z.; Shu, M.; Wang, J.; Li, C.; Xiao, Y. Assessment of ensemble learning to predict wheat grain yield based on UAV-multispectral reflectance. Remote Sens. 2021, 13, 2338. [Google Scholar] [CrossRef]
  47. Gao, B.C. NDWI—A normalized difference water index for remote sensing of vegetation liquid water from space. Remote Sens. Environ. 1996, 58, 257–266. [Google Scholar] [CrossRef]
  48. Miura, T.; Huete, A.R.; Yoshioka, H.; Holben, B.N. An error and sensitivity analysis of atmospheric resistant vegetation indices derived from dark target-based atmospheric correction. Remote Sens. Environ. 2001, 78, 284–298. [Google Scholar] [CrossRef]
  49. Gitelson, A.A.; Merzlyak, M.N. Remote sensing of chlorophyll concentration in higher plant leaves. Adv. Space Res. 1998, 22, 689–692. [Google Scholar] [CrossRef]
  50. Daughtry, C.S.T.; Walthall, C.L.; Kim, M.S.; De Colstoun, E.B.; McMurtrey, J.E., III. Estimating corn leaf chlorophyll concentration from leaf and canopy reflectance. Remote Sens. Environ. 2000, 74, 229–239. [Google Scholar] [CrossRef]
  51. George, J.P.; Yang, W.; Kobayashi, H.; Biermann, T.; Carrara, A.; Cremonese, E.; Pisek, J. Method comparison of indirect assessments of understory leaf area index (LAlu): A case study across the extended network of ICOS forest ecosystem sites in Europe. Ecol. Indic. 2021, 128, 107841. [Google Scholar] [CrossRef]
  52. Tucker, C.J. Red and photographic infrared linear combinations for monitoring vegetation. Remote Sens. Environ. 1979, 8, 127–150. [Google Scholar] [CrossRef] [Green Version]
  53. Aparicio, N.; Villegas, D.; Royo, C.; Casadesus, J.; Araus, J.L. Effect of sensor view angle on the assessment of agronomic traits by ground level hyper-spectral reflectance measurements in durum wheat under contrasting mediterranean conditions. Int. J. Remote Sens. 2004, 25, 1131–1152. [Google Scholar] [CrossRef]
  54. Gitelson, A.A.; Viña, A.; Ciganda, V.; Rundquist, D.C.; Arkebauer, T.J. Remote estimation of canopy chlorophyll content in crops. Geophys. Res. Lett. 2005, 32, L08403. [Google Scholar] [CrossRef] [Green Version]
  55. Gamon, J.A.; Serrano, L.; Surfus, J.S. The photochemical reflectance index: An optical indicator of photosynthetic radiation use efficiency across species, functional types, and nutrient levels. Oecologia 1997, 112, 492–501. [Google Scholar] [CrossRef]
  56. Ren, S.; Chen, X.; An, S. Assessing plant senescence reflectance index-retrieved vegetation phenology and its spatiotemporal response to climate change in the Inner Mongolian Grassland. Int. J. Biometeorol. 2017, 61, 601–612. [Google Scholar] [CrossRef] [PubMed]
  57. Huete, A.; Didan, K.; Miura, T.; Rodriguez, E.P.; Gao, X.; Ferreira, L.G. Overview of the radiometric and biophysical performance of the modis vegetation indices. Remote Sens. Environ. 2002, 83, 195–213. [Google Scholar] [CrossRef]
  58. Peñuelas, J.; Filella, I. Visible and near-infrared reflectance techniques for diagnosing plant physiological status. Trends Plant Sci. 1998, 3, 151–156. [Google Scholar] [CrossRef]
  59. Manley, P.V.; Sagan, V.; Fritschi, F.B.; Burken, J.G. Remote sensing of explosives-induced stress in plants: Hyperspectral imaging analysis for remote detection of unexploded threats. Remote Sens. 2019, 11, 1827. [Google Scholar] [CrossRef] [Green Version]
  60. Gitelson, A.A.; Kaufman, Y.J.; Stark, R.; Rundquist, D. Novel algorithms for remote estimation of vegetation fraction. Remote Sens. Environ. 2002, 80, 76–87. [Google Scholar] [CrossRef] [Green Version]
  61. Monsef, H.A.E.; Smith, S.E. A new approach for estimating mangrove canopy cover using Landsat 8 imagery. Comput. Electron. Agric. 2017, 135, 183–194. [Google Scholar] [CrossRef]
  62. Cao, Y.; Wu, Y.; Li, M.; Liang, W.; Zhang, P. PolSAR image classification using a superpixel-based composite kernel and elastic net. Remote Sens. 2021, 13, 380. [Google Scholar] [CrossRef]
  63. Zou, H.; Hastie, T. Regularization and variable selection via the elastic net. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 2005, 67, 301–320. [Google Scholar] [CrossRef] [Green Version]
  64. Zheng, N.; Luan, X.; Liu, F. Near infrared spectroscopy modeling based on adaptive elastic net method. Spectrosc. Spect. Anal. 2018, 39, 319–324. [Google Scholar]
  65. Dayananda, S.; Astor, T.; Wijesingha, J.; Chickadibburahalli Thimappa, S.; Dimba Chowdappa, H.; Mudalagiriyappa. Multi-temporal monsoon crop biomass estimation using hyperspectral imaging. Remote Sens. 2019, 11, 1771. [Google Scholar] [CrossRef] [Green Version]
  66. Yang, Q.; Shi, L.; Han, J.; Zha, Y.; Zhu, P. Deep convolutional neural networks for rice grain yield estimation at the ripening stage using UAV-based remotely sensed images. Field Crops Res. 2019, 235, 142–153. [Google Scholar] [CrossRef]
  67. Fisher, S. Boundary effects of persistent inputs and messages. J. Abnorm. Psychol. 1971, 77, 290–295. [Google Scholar] [CrossRef] [PubMed]
  68. Napier-Munn, T.J.; Meyer, D.H. A modified paired t-test for the analysis of plant trials with data autocorrelated in time. Miner. Eng. 1999, 12, 1093–1100. [Google Scholar] [CrossRef]
  69. Ma, Y.; Fang, S.; Peng, Y.; Gong, Y.; Wang, D. Remote estimation of biomass in winter oilseed rape (Brassica napus L.) using canopy hyperspectral data at different growth stages. Appl. Sci. 2019, 9, 545. [Google Scholar] [CrossRef] [Green Version]
  70. Kong, L.A.; Wang, F.; Feng, B.; Li, S.; Si, J.; Zhang, B. A root-zone soil regime of wheat: Physiological and growth responses to furrow irrigation in raised bed planting in northern China. Agron. J. 2010, 102, 154–162. [Google Scholar] [CrossRef]
  71. Bayat, B.; van der Tol, C.; Verhoef, W. Remote sensing of grass response to drought stress using spectroscopic techniques and canopy reflectance model inversion. Remote Sens. 2016, 8, 557. [Google Scholar] [CrossRef] [Green Version]
  72. Liu, C.; Sun, P.; Liu, S. A review of plant spectral reflectance response to water physiological changes. Chin. J. Plant Ecol. 2016, 40, 80–91. [Google Scholar]
  73. Zhang, J.; Zhang, H.; Li, S.; Li, J.; Yan, L.; Xia, L. Increasing yield potential through manipulating of an are1 ortholog related to nitrogen use efficiency in wheat by crispr/cas9. J. Integr. Plant Biol. 2021, 63, 1649–1663. [Google Scholar] [CrossRef]
  74. Slamet, W.; Purbajanti, E.D.; Darmawati, A.; Fuskhah, E. Leaf area index, chlorophyll, photosynthesis rate of lettuce (Lactuca sativa L) under N-organic fertilizer. Indian J. Agric. Res. 2017, 51, 365–369. [Google Scholar] [CrossRef] [Green Version]
  75. Ke, L.; Zhou, Q.; Wu, W.; Xia, T.; Tang, H. Estimating the crop leaf area index using hyperspectral remote sensing. J. Integr. Agric. 2016, 15, 475–491. [Google Scholar]
  76. Wang, P.; Chen, C.; Zhang, S.; Zhang, Y.; Li, H. Winter wheat yield estimation based on copula function and remotely sensed LAI and VTCI. Trans. Chin. Soc. Agric. Mach. 2021, 52, 255–263. [Google Scholar]
  77. Ren, J.; Chen, Z.; Zhou, Q.; Tang, H. Regional yield estimation for winter wheat with MODIS-NDVI data in Shandong, China. Int. J. Appl. Earth Obs. 2008, 10, 403–413. [Google Scholar] [CrossRef]
  78. Zhang, Z.; Zhou, Y.; Yang, S.; Tan, C.; Lao, C.; Xu, C. Inversion Method for Soil Water Content in Winter Wheat Root Zone with Eliminating Effect of Soil Background. Trans. Chin. Soc. Agric. Mach. 2021, 52, 197–207. [Google Scholar]
Figure 1. Geographical location of study area.
Figure 1. Geographical location of study area.
Applsci 11 12164 g001
Figure 2. Technology roadmap.
Figure 2. Technology roadmap.
Applsci 11 12164 g002
Figure 3. The workflow of the stacking regression model for predicting grain yield. In the figure: CV, cross-validation; P, prediction values of grain yield; J, jointing; F, flagging; A, anthesis; G, grain filling.
Figure 3. The workflow of the stacking regression model for predicting grain yield. In the figure: CV, cross-validation; P, prediction values of grain yield; J, jointing; F, flagging; A, anthesis; G, grain filling.
Applsci 11 12164 g003
Figure 4. Correlation between grain yield and indices of each growth stage.
Figure 4. Correlation between grain yield and indices of each growth stage.
Applsci 11 12164 g004
Figure 5. The statistical distributions of the prediction accuracy of each growth stage for grain yield prediction in the test phases; (a) distributions of R2 at each individual stage; (b) distributions of RMSE at the each individual stage.
Figure 5. The statistical distributions of the prediction accuracy of each growth stage for grain yield prediction in the test phases; (a) distributions of R2 at each individual stage; (b) distributions of RMSE at the each individual stage.
Applsci 11 12164 g005
Figure 6. The statistical distributions of prediction accuracy of the stacking regression when including four growth stages for predicting grain yield in the test phases; (a) distributions of R2 without or with LAI coupling; (b) distributions of RMSE without or with LAI coupling.
Figure 6. The statistical distributions of prediction accuracy of the stacking regression when including four growth stages for predicting grain yield in the test phases; (a) distributions of R2 without or with LAI coupling; (b) distributions of RMSE without or with LAI coupling.
Applsci 11 12164 g006
Figure 7. The distribution of regression coefficients for each growth stage within the level-2 model (ENET). (a) Without LAI; (b) With LAI.
Figure 7. The distribution of regression coefficients for each growth stage within the level-2 model (ENET). (a) Without LAI; (b) With LAI.
Applsci 11 12164 g007
Table 1. Introduction of the Vegetation Indices.
Table 1. Introduction of the Vegetation Indices.
Vegetation IndexNameFormulaReference
NDWINormalized Difference Water Index(R872 − R1245)/(R872 + R1245)[47]
ARVIAtmospherically Resistant Vegetation Index(R872 − (R661 − (R488 − R661)))/(R872 + (R661 − (R488 − R661)))[48]
GNDVIGreen Normalized Difference Vegetation Index(R780 − R550)/(R780 + R550)[49]
MCARIModified Chlorophyll Absorption Ratio Index((R702 − R671) − 0.2 × (R702 − R549)) × (R702/R671)[50]
SRSimple RatioR872/R661[51]
NDVINormalized Difference Vegetation Index(R800 − R670)/(R800 + R670)[52]
NPCINormalized Pigment Chlorophyll Index(R680 − R430)/(R680 + R430)[53]
NDRSRNormalized Difference Red-edge Simple Ratio(R872 − R712)/(R872 + R712)[54]
PRIPhotochemical Reflectance Index(R529 − R671)/(R529 + R671)[55]
PSRIPlant Senescence Reflectance Index(R680 − R500)/R750[56]
SAVISoil Adjusted Vegetation Index1.5 × (R872 − R661)/(R872 + R661 + 0.5)[57]
SIPIStructure Insensitive Pigment Index(R803 − R447)/(R803 − R681)[58]
VREIVogelmann Red Edge IndexR742/R722[59]
VARIVisible Atmospherically Resistant Index(R559 − R661)/(R559 + R661 − R488)[60]
GARIGreen Atmospherically Resistant Index(R872 − (R559 − (R488 − R661)))/(R872 + (R559 − (R488 − R661)))[61]
Rx represents reflectance, and x represents the band.
Table 2. Main statistical parameters of grain yield from different growing seasons.
Table 2. Main statistical parameters of grain yield from different growing seasons.
Measurement DatesMEAN (t ha−1)SD (t ha−1)CV (%)
2016–20175.840.9516.27
2017–20186.351.1217.64
2016–20186.101.2420.33
SD means standard deviation; CV means coefficient of variation.
Table 3. Main statistical parameters of LAI from each growth stage in different growing seasons.
Table 3. Main statistical parameters of LAI from each growth stage in different growing seasons.
Measurement Dates2016–20172017–20182016–2018
MEANSDCV (%)MEANSDCV (%)MEANSDCV (%)
Jointing2.710.5219.193.230.5918.272.970.6622.20
Flagging3.260.6118.713.930.6115.523.600.6919.17
Anthesis3.450.6819.714.30.8118.843.880.8722.42
Grain filling1.850.7138.381.970.7437.561.910.7639.79
SD means standard deviation; CV means coefficient of variation.
Table 4. Results from the paired sample t-test.
Table 4. Results from the paired sample t-test.
Growth Stagetp Value
Jointing12.1350.000
Flagging32.2640.000
Anthesis9.1060.000
Grain filling26.5250.000
Table 5. Results from the paired sample t-test.
Table 5. Results from the paired sample t-test.
Modeltp Value
Without LAIEnsemble vs. Jointing51.1530.000
Ensemble vs. Flagging39.9020.000
Ensemble vs. Anthesis17.0290.000
Ensemble vs. Grain filling15.7110.000
With LAIEnsemble vs. Jointing60.180.000
Ensemble vs. Flagging39.0920.000
Ensemble vs. Anthesis23.1880.000
Ensemble vs. Grain filling16.6460.000
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Li, C.; Wang, Y.; Ma, C.; Chen, W.; Li, Y.; Li, J.; Ding, F.; Xiao, Z. Improvement of Wheat Grain Yield Prediction Model Performance Based on Stacking Technique. Appl. Sci. 2021, 11, 12164. https://doi.org/10.3390/app112412164

AMA Style

Li C, Wang Y, Ma C, Chen W, Li Y, Li J, Ding F, Xiao Z. Improvement of Wheat Grain Yield Prediction Model Performance Based on Stacking Technique. Applied Sciences. 2021; 11(24):12164. https://doi.org/10.3390/app112412164

Chicago/Turabian Style

Li, Changchun, Yilin Wang, Chunyan Ma, Weinan Chen, Yacong Li, Jingbo Li, Fan Ding, and Zhen Xiao. 2021. "Improvement of Wheat Grain Yield Prediction Model Performance Based on Stacking Technique" Applied Sciences 11, no. 24: 12164. https://doi.org/10.3390/app112412164

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop