A Hybrid Deep Learning Model for Crop Yield Prediction Taking Weather Data Associated with Production Management Phases as Input
Abstract
1. Introduction
2. Contribution
3. Data and Models
3.1. The Data for the Crop Yield Prediction Model
3.2. The Proposed Crop Yield Prediction Model CNN-LSTM-AM
3.2.1. CNN
The Convolution Layer
The Pooling Layer
3.2.2. LSTM
3.2.3. AM
4. Results
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A. Phase-Segmentation Pseudocode
| Algorithm A1: Agricultural_Phase_Alignment_and_Preprocessing |
| Input: |
| - Weather_Data: Daily records of (Temp, Humidity, Pressure, Wind Speed, Rainfall, Solar Radiation) |
| - Production_Log: List of cycles with dates (Sowing, Fertilization I, Thinning, Fertilization II, Harvest) |
| Output: |
| - Final_Feature_Matrix: Normalized 24-feature vectors |
| 1. FOR each Production_Cycle in Production_Log: |
| 2. # Define Mutually Exclusive Intervals |
| 3. Phase_Intervals = [ |
| (Sowing, Fertilization I-1), |
| (Fertilization I, Thinning-1), |
| (Thinning, Fertilization II-1), |
| (Fertilization II, Harvest-1)] |
| 4. # Quality Filtering: Missing Data Check |
| 5. IF Any_Phase has Missing_Days >= 1: |
| 6. IF Missing_Days == 1: |
| 7. Apply Linear_Interpolation(Weather_Data) |
| 8. ELSE: |
| 9. Exclude(Production_Cycle) |
| 10. CONTINUE |
| 11. # Feature Aggregation per Phase |
| 12. FOR each Phase(i) in Phase_Intervals: |
| 13. # Mean-type: Temp, Humidity, Pressure, Wind Speed |
| 14. M[i][Mean_Type] = Average(Weather_Data within Phase(i)) |
| 15. # Cumulative-type: Rainfall, Solar Radiation |
| 16. M[i][Sum_Type] = Total_Sum(Weather_Data within Phase(i)) |
| 17. APPEND M to Dataset |
| 18. END FOR |
| 19. # Outlier Management (IQR Winsorization) |
| 20. FOR each Feature in Dataset: |
| 21. IQR = Q3 − Q1 |
| 22. Lower_Bound = Q1 − 1.5 × IQR |
| 23. Upper_Bound = Q3 + 1.5 × IQR |
| 24. CLIP(Feature) to [Lower_Bound, Upper_Bound] |
| 25. END FOR |
| 26. # Standardization |
| 27. APPLY Z-score Normalization (X_std = (X − μ)/σ) to all features |
| 28. RETURN Dataset |
Appendix B. Preprocessing Ablation Study
References
- Jabed, M.; Murad, M. Crop yield prediction in agriculture: A comprehensive review of machine learning and deep learning approaches, with insights for future research and sustainability. Heliyon 2024, 10, e40836. [Google Scholar] [CrossRef] [PubMed]
- Klompenburg, T.V.; Kassahun, A.; Catal, C. Crop yield prediction using machine learning: A systematic literature review. Comput. Electron. Agric. 2020, 177, 105709. [Google Scholar] [CrossRef]
- Khaki, S.; Wang, L.; Archontoulis, S. A CNN-RNN framework for crop yield prediction. Front. Plant Sci. 2019, 10, 1750. [Google Scholar] [CrossRef] [PubMed]
- Oikonomidis, A.; Catal, C.; Kassahun, A. Hybrid deep learning-based models for crop yield prediction. Appl. Artif. Intell. 2022, 36, e2031823. [Google Scholar] [CrossRef]
- Liu, F.; Jiang, X.; Wu, Z. Attention mechanism-combined LSTM for grain yield prediction in China using multi-source satellite imagery. Sustainability 2023, 15, 9210. [Google Scholar] [CrossRef]
- Li, C.; Zhang, L.; Wu, X.; Chai, H.; Xiang, H.; Jiao, Y. Winter wheat yield estimation by fusing CNN-MALSTM deep learning with remote sensing indices. Agriculture 2024, 14, 1961. [Google Scholar] [CrossRef]
- Lin, F.; Crawford, S.; Guillot, K.; Zhang, Y.; Chen, Y.; Yuan, X.; Chen, L.; Williams, S.; Minvielle, R.; Xiao, X.; et al. MMST-ViT: Climate change-aware crop yield prediction via multi-modal spatial-temporal vision transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2–6 October 2023; pp. 5774–5784. [Google Scholar]
- Bi, L.; Wally, O.; Hu, G.; Tenuta, A.U.; Kandel, Y.R.; Mueller, D.S. A transformer-based approach for early prediction of soybean yield using time-series images. Front. Plant Sci. 2023, 14, 1173036. [Google Scholar] [CrossRef]
- Junankar, T.; Sondhi, J.K.; Nair, A.M. Wheat yield prediction using temporal fusion transformers. In Proceedings of the 2nd International Conference for Innovation in Technology (INOCON), Bangalore, India, 3–5 March 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–6. [Google Scholar]
- Mohan, A.; Venkatesan, M.; Prabhavathy, P.; Jayakrishnan, A. Temporal convolutional network based rice crop yield prediction using multispectral satellite data. Infrared Phys. Technol. 2023, 135, 104960. [Google Scholar] [CrossRef]
- Gong, L.; Yu, M.; Jiang, S.; Cutsuridis, V.; Pearson, S. Deep learning based prediction on greenhouse crop yield combined TCN and RNN. Sensors 2021, 21, 4537. [Google Scholar] [CrossRef]
- Osibo, B.K.; Ma, T.; Bediako-Kyeremeh, B.; Mamelona, L.; Darbinian, K. TCNT: A temporal convolutional network-transformer framework for advanced crop yield prediction. J. Appl. Remote Sens. 2024, 18, 044513. [Google Scholar] [CrossRef]
- Lu, J.; Li, J.; Fu, H.; Tang, X.; Liu, Z.; Chen, H.; Sun, Y.; Ning, X. Deep learning for multi-source data-driven crop yield prediction in northeast China. Agriculture 2024, 14, 794. [Google Scholar] [CrossRef]
- Dong, C.; Peng, X.; Yang, X.; Wang, C.; Yuan, L.; Chen, G.; Tang, X.; Wang, W.; Wu, J.; Zhu, S.; et al. Physiological and transcriptomic responses of Bok Choy to heat stress. Plants 2024, 13, 1093. [Google Scholar] [CrossRef]
- Kano, K.; Kitazawa, H.; Suzuki, K.; Widiastuti, A.; Odani, H.; Zhou, S.; Chinta, Y.; Eguchi, Y.; Shinohara, M.; Sato, T. Effects of organic fertilizer on Bok Choy growth and quality in hydroponic cultures. Agronomy 2021, 11, 491. [Google Scholar] [CrossRef]
- Pan, J.; Peng, K.; Ruan, R.; Liu, Y.; Cui, X. Impact of anaerobic fermentation liquid on Bok Choy and mechanism of combined vitamin C from Bok Choy and Allicin in treatment of DSS colitis. Foods 2025, 14, 785. [Google Scholar] [CrossRef] [PubMed]
- Sun, J.; Di, L.; Sun, Z.; Shen, Y.; Lai, Z. County-level soybean yield prediction using deep CNN-LSTM model. Sensors 2019, 19, 4363. [Google Scholar] [CrossRef] [PubMed]
- Chen, T.Q.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016. [Google Scholar]
- Huber, F.; Yushchenko, A.; Stratmann, B.; Steinhage, V. Extreme gradient boosting for yield estimation compared with deep learning approaches. Comput. Electron. Agric. 2022, 202, 107346. [Google Scholar] [CrossRef]
- Togliatti, K.; Archontoulis, S.V.; Dietzel, R.; Puntel, L.; VanLoocke, A. How does inclusion of weather forecasting impact in-season crop model predictions? Field Crops Res. 2017, 214, 261–272. [Google Scholar] [CrossRef]
- Peng, D.; Cheng, E.; Feng, X.; Hu, J.; Lou, Z.; Zhang, H.; Zhao, B.; Lv, Y.; Peng, H.; Zhang, B. A deep-learning network for wheat yield prediction combining weather forecasts and remote sensing Data. Remote Sens. 2024, 16, 3613. [Google Scholar] [CrossRef]
- Nevavuori, P.; Narra, N.; Lipping, T. Crop yield prediction with deep convolutional neural networks. Comput. Electron. Agric. 2019, 163, 104859. [Google Scholar] [CrossRef]
- Zhao, Y.; Potgieter, A.; Zhang, M.; Wu, B.; Hammer, G. Predicting wheat yield at the field scale by combining high-resolution sentinel-2 satellite imagery and crop modelling. Remote Sens. 2020, 12, 1024. [Google Scholar] [CrossRef]
- Hara, P.; Piekutowska, M.; Niedbała, G. Selection of independent variables for crop yield prediction using artificial neural network models with remote sensing data. Land 2021, 10, 609. [Google Scholar] [CrossRef]
- Son, N.; Chen, C.; Cheng, Y.; Toscano, P.; Chen, C.; Chen, S.; Tseng, K.; Syu, C.; Guo, H.; Zhang, Y. Field-scale rice yield prediction from Sentinel-2 monthly image composites using machine learning algorithms. Ecol. Inform. 2022, 69, 101618. [Google Scholar] [CrossRef]
- Sun, J.; Tian, P.; Li, Z.; Wang, X.; Zhang, H.; Chen, J.; Qian, Y. Construction and optimization of integrated yield prediction model based on phenotypic characteristics of rice grown in small–scale plantations. Agriculture 2025, 15, 181. [Google Scholar] [CrossRef]
- Joshua, V.; Priyadharson, S.; Kannadasan, R. Exploration of machine learning approaches for paddy yield prediction in eastern part of Tamilnadu. Agronomy 2021, 11, 2068. [Google Scholar] [CrossRef]
- Behzadi, G.; O’Sullivan, M.J.; Olsen, T.L.; Zhang, A. Agribusiness supply chain risk management: A review of quantitative decision models. Omega 2018, 79, 21–42. [Google Scholar] [CrossRef]


| Date | Temperature | Rainfall | Humidity | Accumulated Total Sky Radiation | Wind Speed | Atmospheric Pressure | Production Record | Yield in This Production Cycle (kg/ha) |
|---|---|---|---|---|---|---|---|---|
| 11 June 2019 | 27.4 | 2 | 76.0 | 4.4 | 210.8 | 1001.2 | Sowing | 16,494.9 |
| 12 June 2019 | 25.8 | 0 | 84.8 | 2.5 | 126.7 | 1002.2 | ||
| 13 June 2019 | 28.0 | 8.5 | 79.0 | 12.1 | 175.4 | 997.7 | ||
| 14 June 2019 | 26.7 | 0 | 81.9 | 7.2 | 146.7 | 997.7 | ||
| 15 June 2019 | 27.9 | 0 | 70.0 | 22.9 | 122.1 | 1000.7 | ||
| 16 June 2019 | 27.5 | 0 | 74.6 | 17.1 | 133.3 | 1003.2 | ||
| 17 June 2019 | 28.0 | 0 | 78.8 | 15.8 | 162.1 | 1005.6 | Fertilization I |
| Input variables xt |
| (1) Average temperature in the first phase |
| (2) Average temperature in the second phase |
| (3) Average temperature in the third phase |
| (4) Average temperature in the fourth phase |
| Accumulated rainfall in the first phase |
| Accumulated rainfall in the second phase |
| Accumulated rainfall in the third phase |
| Accumulated rainfall in the fourth phase |
| (5) Average humidity in the first phase |
| (6) Average humidity in the second phase |
| (7) Average humidity in the third phase |
| (8) Average humidity in the fourth phase |
| Accumulated total sky radiation in the first phase |
| Accumulated total sky radiation in the second phase |
| Accumulated total sky radiation in the third phase |
| Accumulated total sky radiation in the fourth phase |
| (9) Average wind speed in the first phase |
| (10) Average wind speed in the second phase |
| (11) Average wind speed in the third phase |
| (12) Average wind speed in the fourth phase |
| (13) Average atmospheric pressure in the first phase |
| (14) Average atmospheric pressure in the second phase |
| (15) Average atmospheric pressure in the third phase |
| (16) Average atmospheric pressure in the fourth phase |
| Output variable yt |
| (1) Crop yield at production cycle t (kg/ha) |
| Weather Variables | Phase 1 | Phase 2 | Phase 3 | Phase 4 |
|---|---|---|---|---|
| Temperature | 0.03% | 0.02% | 0.01% | 0.02% |
| Humidity | 0.12% | 0.08% | 0.15% | 0.10% |
| Wind speed | 0.25% | 0.30% | 0.22% | 0.28% |
| Atmospheric pressure | 0.04% | 0.03% | 0.05% | 0.02% |
| Rainfall | 0.05% | 0.02% | 0.04% | 0.03% |
| Accumulated total sky radiation | 0.85% | 0.72% | 0.90% | 0.88% |
| Category | Hyperparameter | Search Space |
|---|---|---|
| Optimization | Learning Rate | 10−2, 10−3, 10−4 |
| Batch Size | 16, 32, 64 | |
| Optimizer | Adam, RMSprop, SGD | |
| CNN | Filters/Kernel/Activation function | 32, 64, 128/3, 5/ReLU, Tanh |
| LSTM/Attention | Units/Activation function | 32, 64, 128/ReLU, Sigmoid, Tanh |
| Regularization | Dropout Rate | 0.1–0.5 |
| Model | Hyperparameter |
|---|---|
| CNN | filters = 64/kernel sizes = 3/pool_size = 2/activation function = ReLU |
| LSTM | units = 64/activation function = tanh |
| CNN-LSTM | CNN: filters = 64/kernel sizes = 3/pool_size = 2/activation function = ReLU |
| LSTM: units = 64/activation function = tanh | |
| LSTM-AM | LSTM: units = 64/activation function = tanh |
| AM: 64, use attention = True |
| Model | RMSE | MAPE (%) | R2 |
|---|---|---|---|
| The proposed model | 1448.24 | 3.60 | 0.98 |
| LSTM-AM | 2284.64 | 6.05 | 0.95 |
| CNN-LSTM | 1516.44 | 4.21 | 0.97 |
| LSTM | 2529.74 | 6.17 | 0.94 |
| CNN | 2919.18 | 8.41 | 0.92 |
| XGBoost | 3452.47 | 10.60 | 0.87 |
| Model | RMSE | MAPE (%) | R2 |
|---|---|---|---|
| The model with production management phases | 1448.24 | 3.60 | 0.98 |
| The model without production management phases | 1734.49 | 5.38 | 0.97 |
| Model | RMSE | MAPE (%) | R2 |
|---|---|---|---|
| The model with 1–4 phase weather data | 1448.24 | 3.60 | 0.98 |
| The model with the first three phase weather data (1–3 phase) | 2219.99 | 5.97 | 0.96 |
| The model with the first two phase weather data (1–2 phase) | 6110.22 | 18.76 | 0.69 |
| The model with the first phase weather data (1–1 phase) | 14,975.91 | 45.98 | 0.25 |
| Weather Variables | Phase 1 | Phase 2 | Phase 3 | Phase 4 | Variable Total |
|---|---|---|---|---|---|
| Daily average temperature | 0.04 | 0.05 | 0.08 | 0.18 | 0.35 |
| Daily average humidity | 0.03 | 0.03 | 0.04 | 0.05 | 0.15 |
| Daily average wind speed | 0.01 | 0.01 | 0.01 | 0.01 | 0.04 |
| Daily average atmospheric pressure | 0.01 | 0.01 | 0.01 | 0.02 | 0.05 |
| Daily accumulated rainfall | 0.02 | 0.02 | 0.03 | 0.05 | 0.12 |
| Daily accumulated total sky radiation | 0.02 | 0.04 | 0.08 | 0.15 | 0.29 |
| Phase total | 0.13 | 0.16 | 0.25 | 0.46 | 1 |
| Source | Crop | Prediction Method | Input | Metric for Methods |
|---|---|---|---|---|
| This paper | Bok choy | CNN-LSTM-AM | Weather data associated with production management phases | RMSE = 1448.24 kg/ha, MAPE = 3.60%, R2 = 0.98 |
| Nevavuori et al. [22] | Wheat | CNN | UAV imagery (NDVI, RGB) | MAPE = 8.8% (early), 12.6% (late) |
| Zhao et al. [23] | Wheat | Sentinel-2 indices + crop model | Satellite indices (OSAVI, CI), crop water stress | R2 = 0.91, RMSE = 0.54 t/ha, MAPE = 10–59% |
| Hara et al. [24] | Rapeseed | ANN (MLP) | Weather, soil, management | MAPE = 9.43% |
| Joshua et al. [27] | Paddy | GRNN, SVR, RBFNN, BPNN | Weather, soil, fertilizer, nutrients | GRNN: RMSE = 0.2295, MAPE = 1.34%, R2 = 0.9863 |
| Oikonomidis et al. [4] | Soybean | CNN-DNN (Hybrid Deep Learning) | Weather, soil (395 features) | RMSE = 0.266, MAE = 0.199, R2 = 0.87 |
| Son et al. [25] | Rice | SVM, RF, ANN | Sentinel-2 satellite imagery | SVM: MAPE = 3.5–9.4%, RMSPE = 4.7–11.2% |
| Sun et al. [26] | Rice | Stacking Ensemble (RF, SVM, MLP, etc.) | Phenotypic traits (panicle angle, length, etc.) | RMSE = 0.2483, MAPE = 6.90%, R2 = 0.9250 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Liu, S.-C.; Lin, Y.-J.; Chung, C.-H.; Wen, H.-Y. A Hybrid Deep Learning Model for Crop Yield Prediction Taking Weather Data Associated with Production Management Phases as Input. Sustainability 2026, 18, 3806. https://doi.org/10.3390/su18083806
Liu S-C, Lin Y-J, Chung C-H, Wen H-Y. A Hybrid Deep Learning Model for Crop Yield Prediction Taking Weather Data Associated with Production Management Phases as Input. Sustainability. 2026; 18(8):3806. https://doi.org/10.3390/su18083806
Chicago/Turabian StyleLiu, Shu-Chu, Yan-Jing Lin, Chih-Hung Chung, and Hsien-Yin Wen. 2026. "A Hybrid Deep Learning Model for Crop Yield Prediction Taking Weather Data Associated with Production Management Phases as Input" Sustainability 18, no. 8: 3806. https://doi.org/10.3390/su18083806
APA StyleLiu, S.-C., Lin, Y.-J., Chung, C.-H., & Wen, H.-Y. (2026). A Hybrid Deep Learning Model for Crop Yield Prediction Taking Weather Data Associated with Production Management Phases as Input. Sustainability, 18(8), 3806. https://doi.org/10.3390/su18083806

