A Framework for Predicting Winter Wheat Yield in Northern China with Triple Cross-Attention and Multi-Source Data Fusion
Abstract
1. Introduction
1.1. Related Work
- (1)
- The method based on crop simulation models focuses on the growth process of crops combined with biological principles, using environmental data such as climate and soil, and crop data like photosynthesis and transpiration, to simulate crop growth and predict the final yield. Common models for crop modeling include the Agricultural Production Systems sIMulator (APSIM), the Decision Support System for Agrotechnology Transfer (DSSAT), and the Crop Simulation Model for Agricultural Management Decision Support (CropSyst). After collecting data related to crop growth, researchers need to select appropriate biophysical models and parameterize the models based on local conditions, then use these parameters to simulate the crop growth process through the selected model. They should then compare the predicted production data obtained from the simulation process with the actual production data and further adjust the model to improve prediction precision. Zhao et al. (2024) [1] suggested a model on the basis of APSIM for simultaneously predicting wheat and corn yields, which can analyze the relationship between wheat and corn yields and environmental factors. Zhao et al. (2022) [2] simulated various indicators such as cumulative biomass using APSIM and used them as inputs for statistical regression models to ultimately predict wheat yield. Uvirkaa Akumaga et al. (2023) [3] combined high-resolution remote sensing satellite data, observation data from the ground, and DSSAT to simulate the number of days during the partial growth stages of soybeans and corn, and ultimately the estimated yield. Yang et al. (2023) [4] proposed a simulation method for a pre-season crop yield prediction for corn, combined with DSSAT. Yang et al. (2023) [5] used DSSAT to evaluate the trend of maize yield changes with future climate change, and analyzed the changes in future yield and the reasons for these changes through experiments. Harsimran Kaur et al. (2022) [6] combined CropSyst with historical and recent climate data for yield prediction, and identified spring peas as the optimal elastic crop, which can help improve the sustainability of crop rotation systems. Simone Bregaglio et al. (2023) [7] proposed a method for predicting yield by combining time series remote sensing data and agricultural models. The above methods based on crop simulation models require high-quality input data and are costly to construct and calibrate.
- (2)
- The method based on traditional machine learning analyzes the data of different features by combining domain knowledge to further excavate the key factors that have a greater impact on crop yield and build a model for prediction. The commonly used traditional machine learning models include Linear Regression (LR), Support Vector Regression (SVR), Random Forest (RF), Gradient Boosting Decision Tree (GBDT), and eXtreme Gradient Boosting (XGBoost). Fei et al. (2024) [8] evaluated the performance of three methods, including RF, in predicting wheat yield using hyperspectral reflectance data from early- and mid-grain filling during wheat growth. Li et al. (2023) [9] established a soybean yield prediction model by integrating K-nearest neighbors, RF, and SVR through ensemble learning. Sun et al. (2024) [10] applied partial least squares regression, RF, and SVR to assess the connection between multi- vegetation indices and the yield of three types of rice at different growth stages. Li et al. (2023) [11] proposed a soybean yield forecast framework that combines XGBoost and multidimensional feature engineering. Yu et al. (2023) [12] proposed a meta learning ensemble regression framework that combines optical data, synthetic aperture radar data, and meteorological data to accurately predict rice yield. Diego Arruda Huggins de Sá Leitão et al. (2023) [13] compared different prediction methods and found that algorithms such as RF and SVR performed better than traditional linear regression methods. Moreover, traditional regression methods had overestimation and underestimation biases when predicting low-yield and high-yield areas. Zhang et al. (2023) [14] suggested a predictive model using Bayesian optimized Categorical Boosting (CatBoost) based on Landsat-8 and Sentinel-2 vegetation index time series data to estimate a winter wheat yield. Wang et al. (2023) [15] applied RF and other methods to study the relationship between three different satellite data and maize yield under different conditions of normal and drought years. Juan Skobalaski et al. (2024) [16] used methods such as RF and Gradient Boosting Regression (GBR) to predict soybean yield and proposed a novel transfer learning approach to the selection of genotypes and high-yield variety screening. Cheng et al. (2022) [17] assessed the effectiveness of RF, GBDT, SVR, and deep learning methods for predicting yield using multispectral, hyperspectral, and gridded yield data. The model construction and training process of the methods based on traditional machine learning mentioned above is relatively simple, but it usually requires manual feature selection before training the model, and the quality of the input data is critical to model performance.
- (3)
- The method based on deep learning is adept at capturing nonlinear relationships, complex patterns, and long-term dependencies between features through neural network models. By automatically extracting features, different weights are assigned to different features, ultimately achieving yield prediction. The commonly used models include Convolutional Neural Network (CNN), Long Short Term Memory (LSTM), Gated Recurrent Unit (GRU), Recurrent Neural Network (RNN), and Transformer. Wang et al. (2023) [18] proposed a model named CNN-GRU using three different remote sensing variables to estimate a winter wheat yield. Guo et al. (2023) [19] collected multispectral and hyperspectral images from ground measurements as model inputs, and used CNN and other models to predict a corn yield separately. Feng et al. (2024) [20] used CNN to extract soil features, meteorological features, and image features captured by drones, and then used GRU for yield prediction. Mahdiyeh Fathi et al. (2023) [21] proposed a model called 3D-ResNet-BiLSTM for predicting soybean yields. Tanabe et al. (2023) [22] used hyperspectral technology and CNN to assess the impact of four growing stages of winter wheat on predicting yield. Bi et al. (2023) [23] introduced a yield prediction model utilizing Transformer to comprehensively incorporate image features and seed features. Gregor Perich et al. (2023) [24] proposed a method for precise agricultural modeling using Sentinel-2 satellite data. The study selected and analyzed three mainstream methods: data analysis methods based on spectral indices, raw satellite reflectance, and RNN. The results indicated that the performance of RNN may not necessarily be superior to other methods, but it is more effective due to its end-to-end training approach. Cheng et al. (2024) [25] proposed a county-level winter wheat yield prediction method called GT-LSTM to address the difficulty of learning geographic spatial information in using RNN to process crop time-series data. Guo et al. (2024) [26] proposed a model called SSA-LSTM-transformer using multiple remote sensing variables to predict wheat yield. The model combines the automatic optimization capability of sparrow search algorithm with the long-term memory ability of LSTM. Kiran Kumar et al. (2023) [27] optimized hyperparameter configuration and fine-tuned LSTM and bidirectional LSTM to achieve the yield prediction of crops such as wheat. The above methods based on deep learning can automatically extract important features, reduce manual intervention, and are suitable for processing complex high-dimensional data, but the methods usually require a large amount of computation.
1.2. Existing Problems and Advantages
1.3. Contributions
- (1)
- We suggest a winter wheat yield prediction framework with triple cross-attention and multi-source data fusion. This framework consists of three modules, namely a multi-source data processing module, a multi-source feature fusion module, and a yield prediction module.
- (2)
- The multi-source data processing module obtains raw data through different platforms and combines relevant software to extract multi-source data, ultimately generating a sequence set corresponding to the multi-source data.
- (3)
- Different approaches are used to capture the internal information of data from different sources. For dynamic features that change over time during the growing period, a Temporal Feature Enhancement Module (TFE) is proposed to capture the temporal information. For static soil features that are almost unchanged during the growing period, a Convolutional Residual Block (CRB) is used to extract the deep features.
- (4)
- In order to fuse the extracted multi-source features, this paper proposes a novel fusion method called Triple Cross-Attention Fusion Mechanism (TCAFM), which captures the relationship between multi-source features while realizing multi-source feature fusion.
- (5)
- In the yield prediction module, the encoder uses the multi-head self-attention mechanism (MHSA) and the graph attention mechanism to construct a double branch, which allows the model to capture global dependencies in the features and enhances the transfer of local information. The decoder employs Fourier Analysis Networks (FAN) to capture the complex nonlinear interactions among the processed features and the predicted yield.
1.4. The Structure of This Paper
2. Materials and Methods
2.1. Materials
2.1.1. Research Area
2.1.2. Data
2.2. A Winter Wheat Yield Prediction Framework with Triple Cross-Attention and Multi-Source Data Fusion
2.2.1. Multi-Source Data Processing Module
2.2.2. Multi-Source Feature Fusion Module
2.2.3. Yield Prediction Module
2.3. The Framework Prediction Process in This Paper
3. Results
3.1. Performance Comparison
3.2. The Importance of Various Growth Phases of Winter Wheat in Yield Forecasting
3.3. The Effect of Time Window Length on Yield Prediction
3.4. Ablation Experiment
3.4.1. The Ablation Experiment of TFE and Graph Attention Mechanism
3.4.2. The Ablation Experiment of TCAFM
3.5. Efficiency of the Model
4. Discussion
4.1. Analysis of Comparative Test Results
4.2. Analysis of the Importance of Different Winter Wheat Growth Phases in Predicting Yield
4.3. Analysis of the Effect of Time Window Length on Yield Prediction
4.4. Analysis of Ablation Experimental Results of TFE and Graph Attention Mechanism
4.5. Analysis of Ablation Experiment Results of TCAFM
4.6. Analysis of Model Efficiency Experiment Results
4.7. Discussion Summary
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Zhao, Y.; Xiao, D.; Bai, H. The simultaneous prediction of yield and maturity date for wheat–maize by combining satellite images with crop model. J. Sci. Food Agric. 2024, 104, 8791–8800. [Google Scholar] [CrossRef] [PubMed]
- Zhao, Y.; Xiao, D.; Bai, H.; Tang, J.; Liu, D.L.; Qi, Y.; Shen, Y. The prediction of wheat yield in the North China Plain by coupling crop model with machine learning algorithms. Agriculture 2022, 13, 99. [Google Scholar] [CrossRef]
- Akumaga, U.; Gao, F.; Anderson, M.; Dulaney, W.P.; Houborg, R.; Russ, A.; Hively, W.D. Integration of remote sensing and field observations in evaluating DSSAT model for estimating maize and soybean growth and yield in Maryland, USA. Agronomy 2023, 13, 1540. [Google Scholar] [CrossRef]
- Yang, M.; Wang, G.; Wu, S.; Block, P.; Lazin, R.; Alexander, S.; Lala, J.; Haider, M.R.; Dokou, Z.; Atsbeha, E.A.; et al. Seasonal prediction of crop yields in Ethiopia using an analog approach. Agric. For. Meteorol. 2023, 331, 109347. [Google Scholar] [CrossRef]
- Yang, M.; Wang, G. Heat stress to jeopardize crop production in the US Corn Belt based on downscaled CMIP5 projections. Agric. Syst. 2023, 211, 103746. [Google Scholar] [CrossRef]
- Kaur, H.; Huggins, D.R.; Carlson, B.; Stockle, C.; Nelson, R. Dryland fallow vs flex-cropping decisions in inland Pacific Northwest of USA. Agric. Syst. 2022, 201, 103432. [Google Scholar] [CrossRef]
- Bregaglio, S.; Ginaldi, F.; Raparelli, E.; Fila, G.; Bajocco, S. Improving crop yield prediction accuracy by embedding phenological heterogeneity into model parameter sets. Agric. Syst. 2023, 209, 103666. [Google Scholar] [CrossRef]
- Fei, S.; Xiao, S.; Zhu, J.; Xiao, Y.; Ma, Y. Dual sampling linear regression ensemble to predict wheat yield across growing seasons with hyperspectral sensing. Comput. Electron. Agric. 2024, 216, 108514. [Google Scholar] [CrossRef]
- Li, Q.C.; Xu, S.W.; Zhuang, J.Y.; Liu, J.J.; Zhuo, Y.; Zhang, Z.X. Ensemble learning prediction of soybean yields in China based on meteorological data. J. Integr. Agric. 2023, 22, 1909–1927. [Google Scholar] [CrossRef]
- Sun, X.; Zhang, P.; Wang, Z. Potential of multi-seasonal vegetation indices to predict rice yield from UAV multispectral observations. Precis. Agric. 2024, 25, 1–27. [Google Scholar] [CrossRef]
- Li, Y.; Zeng, H.; Zhang, M.; Wu, B.; Zhao, Y.; Yao, X.; Cheng, T.; Qin, X.; Wu, F. A county-level soybean yield prediction framework coupled with XGBoost and multidimensional feature engineering. Int. J. Appl. Earth Obs. 2023, 118, 103269. [Google Scholar] [CrossRef]
- Yu, W.; Yang, G.; Li, D.; Zheng, H.; Yao, X.; Zhu, Y.; Cao, W.; Qiu, L.; Cheng, T. Improved prediction of rice yield at field and county levels by synergistic use of SAR, optical and meteorological data. Agr. For. Meteorol. 2023, 342, 109729. [Google Scholar] [CrossRef]
- de Sá Leitão, D.A.H.; Sharma, A.K.; Singh, A.; Sharma, L.K. Yield and plant height predictions of irrigated maize through unmanned aerial vehicle in North Florida. Comput. Electron. Agric. 2023, 215, 108374. [Google Scholar] [CrossRef]
- Zhang, H.; Zhang, Y.; Liu, K.; Lan, S.; Gao, T.; Li, M. Winter wheat yield prediction using integrated Landsat 8 and Sentinel-2 vegetation index time-series data and machine learning algorithms. Comput. Electron. Agric. 2023, 213, 108250. [Google Scholar] [CrossRef]
- Wang, Y.Q.; Leng, P.; Shang, G.F.; Zhang, X.; Li, Z.L. Sun-induced chlorophyll fluorescence is superior to satellite vegetation indices for predicting summer maize yield under drought conditions. Comput. Electron. Agric. 2023, 205, 107615. [Google Scholar] [CrossRef]
- Skobalski, J.; Sagan, V.; Alifu, H.; Al Akkad, O.; Lopes, F.A.; Grignola, F. Bridging the gap between crop breeding and GeoAI: Soybean yield prediction from multispectral UAV images with transfer learning. ISPRS J. Photogramm. Remote Sens. 2024, 210, 260–281. [Google Scholar] [CrossRef]
- Cheng, E.H.; Zhang, B.; Peng, D.L.; Zhong, L.H.; Yu, L.; Liu, Y.; Xiao, C.C.; Li, C.J.; Li, X.Y.; Chen, Y.; et al. Wheat yield estimation using remote sensing data based on machine learning approaches. Front. Plant Sci. 2022, 13, 1090970. [Google Scholar] [CrossRef] [PubMed]
- Wang, J.; Wang, P.; Tian, H.; Tansey, K.; Liu, J.; Quan, W. A deep learning framework combining CNN and GRU for improving wheat yield estimates using time series remotely sensed multi-variables. Comput. Electron. Agric. 2023, 206, 107705. [Google Scholar] [CrossRef]
- Guo, Y.; Xiao, Y.; Hao, F.; Zhang, X.; Chen, J.; Beurs, K.; He, Y.; Fu, Y.H. Comparison of different machine learning algorithms for predicting maize grain yield using UAV-based hyperspectral images. Int. J. Appl. Earth Obs. 2023, 124, 103528. [Google Scholar] [CrossRef]
- Feng, A.; Zhou, J.; Vories, E.; Sudduth, K.A. Prediction of cotton yield based on soil texture, weather conditions and UAV imagery using deep learning. Precis. Agric. 2024, 25, 303–326. [Google Scholar] [CrossRef]
- Fathi, M.; Shah-Hosseini, R.; Moghimi, A. 3D-ResNet-BiLSTM Model: A Deep Learning Model for County-Level Soybean Yield Prediction with Time-Series Sentinel-1, Sentinel-2 Imagery, and Daymet Data. Remote Sens. 2023, 15, 5551. [Google Scholar] [CrossRef]
- Tanabe, R.; Matsui, T.; Tanaka, T.S.T. Winter wheat yield prediction using convolutional neural networks and UAV-based multispectral imagery. Field Crop. Res. 2023, 291, 108786. [Google Scholar] [CrossRef]
- Bi, L.; Wally, O.; Hu, G.; Tenuta, A.U.; Kandel, Y.R.; Mueller, D.S. A transformer-based approach for early prediction of soybean yield using time-series images. Front. Plant Sci. 2023, 14, 1173036. [Google Scholar] [CrossRef] [PubMed]
- Perich, G.; Turkoglu, M.O.; Graf, L.V.; Wegner, J.D.; Aasen, H.; Walter, A.; Liebisch, F. Pixel-based yield mapping and prediction from Sentinel-2 using spectral indices and neural networks. Field Crop. Res. 2023, 292, 108824. [Google Scholar] [CrossRef]
- Cheng, E.; Wang, F.; Peng, D.; Zhang, B.; Zhao, B.; Zhang, W.; Hu, J.; Lou, Z.; Yang, S.; Zhang, H.; et al. A GT-LSTM Spatio-Temporal Approach for Winter Wheat Yield Prediction: From the Field Scale to County Scale. IEEE Trans. Geosci. Remote Sens. 2024, 62, 1–18. [Google Scholar] [CrossRef]
- Guo, F.; Wang, P.; Tansey, K.; Zhang, Y.; Li, M.; Liu, J.; Zhang, S. A novel transformer-based neural network under model interpretability for improving wheat yield estimation using remotely sensed multi-variables. Comput. Electron. Agric. 2024, 223, 109111. [Google Scholar] [CrossRef]
- Kiran Kumar, V.; Ramesh, K.V.; Rakesh, V. Optimizing LSTM and Bi-LSTM models for crop yield prediction and comparison of their performance with traditional machine learning techniques. Appl. Intell. 2023, 53, 28291–28309. [Google Scholar] [CrossRef]
- National Bureau of Statistics of China. China Statistical Yearbook. 2024. Available online: https://www.stats.gov.cn/sj/ndsj/ (accessed on 13 March 2024).
- NASA Land Processes Distributed Active Archive Center. MODIS/Terra Vegetation Indices 16-Day L3 Global 250 m SIN Grid V061. 2021. Available online: https://www.earthdata.nasa.gov/data/catalog/lpcloud-mod13q1-061 (accessed on 20 March 2024).
- McNally, A.; Arsenault, K.; Kumar, S.; Shukla, S.; Peterson, P.; Wang, S.G.; Funk, C.; Peters-Lidard, D.C.; Verdin, P.J. A land data assimilation system for sub-Saharan Africa food and water security applications. Sci. Data 2017, 4, 1–19. [Google Scholar] [CrossRef] [PubMed]
- Fischer, G.; Nachtergaele, F.; Prieler, S.; Teixeira, E.; Tóth, G.; Velthuizen, H.; Verelst, L.; Wiberg, D. Global Agro-Ecological Zones Assessment for Agriculture (GAEZ 2008); IIASA: Laxenburg, Austria; FAO: Rome, Italy, 2008; Volume 10. [Google Scholar]
- Dong, J.; Fu, Y.; Wang, J.; Tian, H.; Fu, S.; Niu, Z.; Han, W.; Zheng, Y.; Huang, J.; Yuan, W. Early season mapping of winter wheat in China based on Landsat and Sentinel images. Earth. Syst. Sci. Data 2020, 12, 3081–3095. [Google Scholar] [CrossRef]
- Shaker, A.; Maaz, M.; Rasheed, H.; Khan, S.; Yang, M.; Khan, F.S. Swiftformer: Efficient additive attention for transformer-based real-time mobile vision applications. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 27 March 2023; IEEE: New York City, NY, USA, 2023; pp. 17379–17390. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30. [Google Scholar] [CrossRef]
- Dong, Y.; Li, G.; Tao, Y.; Jiang, X.; Zhang, K.; Li, J.; Deng, J.; Su, J.; Zhang, J.; Xu, J. FAN: Fourier Analysis Networks. arXiv 2024. [Google Scholar] [CrossRef]
- Lemhadri, I.; Ruan, F.; Abraham, L.; Tibshirani, R. Lassonet: A neural network with feature sparsity. J. Mach. Learn. Res. 2021, 22, 1–29. [Google Scholar] [CrossRef]
- von Bloh, M.; Júnior, R.S.N.; Wangerpohl, X.; Saltık, A.O.; Haller, V.; Kaiser, L.; Asseng, S. Machine learning for soybean yield forecasting in Brazil. Agric. For. Meteorol. 2023, 341, 109670. [Google Scholar] [CrossRef]
- Raj, S.; Patle, S.; Rajendran, S. Predicting Crop Yield Using Decision Tree Regressor. In Proceedings of the International Conference on Knowledge Engineering and Communication Systems (ICKES), Chickballapur, India, 28–29 December 2022; IEEE: New York City, NY, USA, 2022; pp. 1–5. [Google Scholar] [CrossRef]
- Zhang, Y.; Li, Q. A regressive convolution neural network and support vector regression model for electricity consumption forecasting. In Advances in Information and Communication, Proceedings of the 2019 Future of Information and Communication Conference (FICC), San Francisco, CA, USA, 14–15 March 2019; Springer International Publishing: Cham, Switzerland, 2020; Volume 2, pp. 33–45. [Google Scholar] [CrossRef]
- Celik, M.F.; Isik, M.S.; Taskin, G.; Erten, E.; Camps-Valls, G. Explainable artificial intelligence for cotton yield prediction with multisource data. IEEE Geosci. Remote Sens. Lett. 2023, 20, 1. [Google Scholar] [CrossRef]
- Wang, X.; Yang, Y.; Zhao, X.; Huang, M.; Zhu, Q. Integrating field images and microclimate data to realize multi-day ahead forecasting of maize crop coverage using CNN-LSTM. Int. J. Agric. Biol. Eng. 2023, 16, 199–206. [Google Scholar] [CrossRef]
- Khaki, S.; Wang, L. Crop yield prediction using deep neural networks. Front. Plant Sci. 2019, 10, 621. [Google Scholar] [CrossRef] [PubMed]
- Pujitha, B.; Radha, K.; Samitha, K.; Prathyusha, N.; Vasanthi, P. Crop Prediction from Soil Parameters using Light Ensemble Learning Model. In Proceedings of the 2024 11th International Conference on Signal Processing and Integrated Networks (SPIN), Noida, India, 21–22 March 2024; IEEE: New York City, NY, USA, 2024; pp. 308–313. [Google Scholar] [CrossRef]
- Li, L.C.; Wang, B.; Feng, P.Y.; Liu, D.L.; He, Q.S.; Zhang, Y.J.; Wang, Y.K.; Li, S.Y.; Lu, X.L.; Yue, C. Developing machine learning models with multi-source environmental data to predict wheat yield in China. Comput. Electron. Agric. 2022, 194, 106790. [Google Scholar] [CrossRef]
Data | Detailed Variables | Temporal Resolution | Data Sources |
---|---|---|---|
Ground statistical data [28] | Yield per unit area, Sowing area | year | Statistical yearbook |
Satellite remote sensing data [29] | NDVI, EVI | month | MOD13Q1 |
Climate data [30] | Evapotranspiration, Net long-wave radiation flux, Net short-wave radiation flux, Surface pressure, Total precipitation rate, Near-surface air temperature, Near-surface wind speed | month | FLDAS |
Soil data [30,31] | Moisture and temperature of soil at different depths | month | FLDAS |
Reference bulk density, organic carbon, pH value, sand fraction, clay fraction, silt fraction, cation exchange capacity of the entire soil surface, and cation exchange capacity of the clay portion for both surface and deep soil | constant | National Cryosphere Desert Data Center | |
Spatio-Temporal data [28] | Time (year) | year | 2001–2022 |
Space (region code) | constant | 196 regions in total |
A Winter Wheat Yield Prediction Framework with Triple Cross-Attention and Multi-Source Data Fusion | |
---|---|
Input: Multi-source features during the winter wheat growing season , Time (T), Region Code (R), Planting Area (PA) | |
Output: Predicting yield and evaluation indicators | |
Step 1 | Construct dataset from multi-source data |
Step 2 | |
Step 3 | For each epoch : |
(a) | |
(b) | |
(c) | |
(d) | |
(e) | |
(f) | |
(g) | |
(h) | |
(i) | If does not decrease for 40 epoches: |
(j) | Save model and Early stop |
(k) | Test the saved model |
(l) | Compute and output metrics (, , , , and ) |
Methods | MAE (kg/hm2) | RMSE (kg/hm2) | MAPE (%) | R2 |
---|---|---|---|---|
LassoNet [36] | 1034.43 | 1292.36 | 6.8 | 0.39 |
LightGBM [43] | 563.47 | 701.54 | 4.48 | 0.82 |
DecisionTree [38] | 639.64 | 880.22 | 3.42 | 0.72 |
XGBoost [37] | 434.17 | 570.51 | 4.32 | 0.88 |
RCNN-SVR [39] | 950.75 | 1184.26 | 7.25 | 0.49 |
Random Forest [37] | 424.4 | 573.52 | 4.63 | 0.88 |
EBM [40] | 948.48 | 1179.33 | 3.9 | 0.5 |
LSTM [37] | 648.4 | 856.72 | 4.92 | 0.73 |
ANN [37] | 1107.33 | 1428.65 | 7.54 | 0.26 |
CNN-LSTM [41] | 763.03 | 980.36 | 3.84 | 0.65 |
DFNN [42] | 992.8 | 1246.97 | 7.12 | 0.44 |
Ours | 385.99 | 501.94 | 3.78 | 0.91 |
Stage Abbreviation | T1 | T2 | T3 | T4 |
---|---|---|---|---|
Growth period | emergence-tillering | winter dormancy stage | jointing-heading | heading-maturity |
Corresponding month | October to November | December to February of the following year | March to April of the following year | May to June of the following year |
Indicators | MAE (kg/hm2) | RMSE (kg/hm2) | MAPE (%) | R2 | |
---|---|---|---|---|---|
Stage | |||||
T1 | 581.61 | 813.91 | 3.4 | 0.76 | |
T2 | 617.07 | 913.52 | 3.69 | 0.69 | |
T3 | 405.59 | 541.45 | 2.99 | 0.89 | |
T4 | 459.19 | 650.75 | 3.22 | 0.85 |
Number | Time Window | MAE (kg/hm2) | RMSE (kg/hm2) | MAPE (%) | R2 | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Oct. | Nov. | Dec. | Jan. | Feb. | Mar. | Apr. | May | Jun. | |||||
1 | √ | √ | √ | 621.69 | 906.88 | 2.94 | 0.7 | ||||||
2 | √ | √ | √ | √ | 650.42 | 929.13 | 3.12 | 0.69 | |||||
3 | √ | √ | √ | √ | √ | 541.01 | 843.74 | 3.25 | 0.74 | ||||
4 | √ | √ | √ | √ | √ | √ | 555.19 | 741.11 | 3.42 | 0.8 | |||
5 | √ | √ | √ | √ | √ | √ | √ | 517.21 | 678.84 | 3.84 | 0.83 | ||
6 | √ | √ | √ | √ | √ | √ | √ | √ | 409.19 | 519.66 | 3.46 | 0.9 | |
7 | √ | √ | √ | √ | √ | √ | √ | √ | √ | 385.99 | 501.94 | 3.78 | 0.91 |
Number | TFE | Graph Attention Mechanism | MAE (kg/hm2) | RMSE (kg/hm2) | MAPE (%) × 100 | MSPE (%2) × 100 | R2 |
---|---|---|---|---|---|---|---|
1 | - | - | 695.39 | 904.36 | 14.82 | 3.31 | 0.62 |
2 | √ | - | 524.12 | 660.92 | 11.48 | 1.96 | 0.79 |
3 | - | √ | 578.24 | 736.13 | 12.86 | 2.49 | 0.75 |
4 | Ours | 431.47 | 548.98 | 9.53 | 1.37 | 0.86 |
Number | Remote Sensing | Soil | Climate | Fusion Method | MAE (kg/hm2) | RMSE (kg/hm2) | MAPE (%) × 100 | MSPE (%2) × 100 | R2 |
---|---|---|---|---|---|---|---|---|---|
1 | √ | - | - | - | 899.23 | 1058.05 | 18.25 | 4.06 | 0.47 |
2 | - | √ | - | - | 576.81 | 728.28 | 13.08 | 2.59 | 0.75 |
3 | - | - | √ | - | 704.33 | 857.3 | 14.54 | 2.79 | 0.65 |
4 | √ | √ | √ | concat | 728.66 | 855.94 | 15.39 | 2.93 | 0.66 |
5 | √ | √ | √ | add | 744.87 | 937.06 | 14.51 | 2.84 | 0.59 |
6 | √ | √ | √ | avg | 645.88 | 782.22 | 13.29 | 2.29 | 0.71 |
7 | √ | √ | √ | max | 803.01 | 908.57 | 17.11 | 3.45 | 0.61 |
8 | Ours | 431.47 | 548.98 | 9.53 | 1.37 | 0.86 |
Inference Time (s) | Model Storage Size (MB) | Parameters (M) | |
---|---|---|---|
LassoNet [36] | 0.043058 | 8.4622 | 0.039 |
LightGBM [43] | 0.076627 | 1.5070 | 0.012 |
DecisionTree [38] | 0.001994 | 0.0084 | 0.001 |
XGBoost [37] | 0.025452 | 1.9506 | 0.053 |
RCNN-SVR [39] | 0.095112 | 26.0975 | 0.003 |
Random Forest [37] | 0.033336 | 31.5588 | 0.459 |
EBM [40] | 0.051753 | 23.0054 | 0.124 |
LSTM [37] | 0.051126 | 1.36 | 0.116 |
ANN [37] | 0.165146 | 0.94 | 0.079 |
CNN-LSTM [41] | 0.070710 | 0.3 | 0.023 |
DFNN [42] | 0.001002 | 0.1302 | 0.031 |
Ours | 0.244363 | 1.4733 | 0.356 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Pan, S.; Liu, L. A Framework for Predicting Winter Wheat Yield in Northern China with Triple Cross-Attention and Multi-Source Data Fusion. Plants 2025, 14, 2206. https://doi.org/10.3390/plants14142206
Pan S, Liu L. A Framework for Predicting Winter Wheat Yield in Northern China with Triple Cross-Attention and Multi-Source Data Fusion. Plants. 2025; 14(14):2206. https://doi.org/10.3390/plants14142206
Chicago/Turabian StylePan, Shuyan, and Liqun Liu. 2025. "A Framework for Predicting Winter Wheat Yield in Northern China with Triple Cross-Attention and Multi-Source Data Fusion" Plants 14, no. 14: 2206. https://doi.org/10.3390/plants14142206
APA StylePan, S., & Liu, L. (2025). A Framework for Predicting Winter Wheat Yield in Northern China with Triple Cross-Attention and Multi-Source Data Fusion. Plants, 14(14), 2206. https://doi.org/10.3390/plants14142206