Precipitation Prediction and Factor Interpretation at Maqu Station in the Eastern Qinghai-Tibet Plateau Based on XGBoost-SHAP
Abstract
1. Introduction
2. Data and Methods
2.1. Data
2.2. Methods
2.2.1. Feature Selection
2.2.2. XGBoost
2.2.3. SHAP Model Interpretation Method
2.2.4. Model Evaluation and Interpretability Analysis
2.2.5. Physical Parameters Derivation
2.2.6. Model Training and Optimization Strategy
2.2.7. Baseline Model: Random Forest (RF)
3. Results and Discussion
3.1. Time Series of Observations
3.2. Vertical Structures of Observations
3.3. XGBoost Prediction Results
3.3.1. Training
3.3.2. Model Predictive Performance
3.4. Seasonal Interpretability and Consistency with Physical Processes
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Lin, Q.; Chen, J.; Chen, D.; Wang, X.; Li, W.; Scherer, D. Impacts of Bias-Corrected ERA5 Initial Snow Depth on Dynamical Downscaling Simulations for the Tibetan Plateau. J. Geophys. Res. Atmos. 2021, 126, e2021JD035625. [Google Scholar] [CrossRef]
- Zhu, Q.; Liu, Y.; Shao, T.; Luo, R.; Tan, Z. Role of the Tibetan Plateau in Northern Drought Induced by Changes in the Subtropical Westerly Jet. J. Clim. 2021, 34, 4955–4969. [Google Scholar] [CrossRef]
- Huang, J.; Zhou, X.; Wu, G.; Xu, X.; Zhao, Q.; Liu, Y.; Duan, A.; Xie, Y.; Ma, Y.; Zhao, P.; et al. Global Climate Impacts of Land-Surface and Atmospheric Processes Over the Tibetan Plateau. Rev. Geophys. 2023, 61, e2022RG000771. [Google Scholar] [CrossRef]
- Ma, M.; Tang, J.; Ou, T.; Chen, D. Subdaily Extreme Precipitation and Its Linkage to Global Warming Over the Tibetan Plateau. J. Geophys. Res. Atmos. 2023, 128, e2023JD039062. [Google Scholar] [CrossRef]
- Wu, F.; You, Q.; Pepin, N.; Kang, S.; Zhai, P. Surface Warming in Summer over the Tibetan Plateau: Local and Atmospheric Circulation Processes. Glob. Planet. Change 2025, 252, 104904. [Google Scholar] [CrossRef]
- Zhang, X.; Li, X.; Che, T.; Yang, C.; Duan, H.; Wu, J.; Liu, Y. Changes in Precipitation Phases Based on the Multi-Discrimination Method in the Tibetan Plateau. Atmos. Res. 2024, 310, 107597. [Google Scholar] [CrossRef]
- Ding, Z.; Ha, Y.; Hu, Y.; Zhu, Y.; Dai, H.; Zhong, Z. Spatiotemporal Characteristics of Summer Extreme Precipitation over the Inner Tibetan Plateau in Recent Decades. npj Clim. Atmos. Sci. 2025, 8, 193–199. [Google Scholar] [CrossRef]
- Lu, M.; Yang, S.; Fan, H.; Wang, J. Interdecadal Instability of the Interannual Connection between Southern Tibetan Plateau Precipitation and Southeast Asian Summer Monsoon. Atmos. Res. 2023, 291, 106825. [Google Scholar] [CrossRef]
- Zheng, Y.; Liu, H.; Du, Q.; Liu, Y.; Sun, J.; Cun, H.; Järvi, L. Effects of Precipitation Seasonal Distribution on Net Ecosystem CO2 Exchange over an Alpine Meadow in the Southeastern Tibetan Plateau. Int. J. Biometeorol. 2022, 66, 1561–1573. [Google Scholar] [CrossRef]
- Liu, J.; Chen, C.; Qing, N.; Li, X.; Lai, J.; Guo, B. Response of Water Resources to Climate Change in Zoige, Tibetan Plateau. J. Glaciol. Geocryol. 2016, 38, 498–508. [Google Scholar] [CrossRef]
- Sun, Z.; Wang, C. Impact of Changing Climate on Agriculture in China. Ke Ji Dao Bao 2010, 28, 110–117. [Google Scholar]
- Zhao, D.; Zhu, Y.; Wu, S.; Zheng, D. Projection of Vegetation Distribution to 1.5 Degrees C and 2 Degrees C of Global Warming on the Tibetan Plateau. Glob. Planet. Change 2021, 202, 103525. [Google Scholar] [CrossRef]
- Zou, F.; Li, H.; Hu, Q. Responses of Vegetation Greening and Land Surface Temperature Variations to Global Warming on the Qinghai-Tibetan Plateau, 2001–2016. Ecol. Indic. 2020, 119, 106867. [Google Scholar] [CrossRef]
- Chen, Q.; Liu, H.; Hu, M.; Ge, F.; Li, Y. A Review of the Studies on Extreme Precipitation in the Complex Terrain Region on the Eastern Side of the Tibetan Plateau. Torrential Rain Disasters 2024, 43, 255–265. [Google Scholar] [CrossRef]
- Wang, Z.Q.; Duan, A.M.; Li, M.S.; He, B. Influences of Thermal Forcing over the Slope/Platform of the Tibetan Plateau on Asian Summer Monsoon: Numerical Studies with the WRF Model. Acta Geophys. Sin. 2016, 59, 3175–3187. [Google Scholar] [CrossRef]
- Guo, Y.; Zheng, H.; Yang, Y.; Sang, Y.; Wen, C. A Hydrogeomorphic Dataset for Characterizing Catchment Hydrological Behavior across the Tibetan Plateau. Earth Syst. Sci. Data 2024, 16, 1651–1665. [Google Scholar] [CrossRef]
- Yang, K.; Zhou, X.; Ma, X.; Chen, D.; Chen, F.; Dai, Y.; Jiang, Y.; Huang, A.; Lin, Y.; Liu, J.; et al. A Physically-Refined Regional Climate Model for the Tibetan Plateau. Sci. Bull. 2025, 70, 4070–4079. [Google Scholar] [CrossRef]
- Dong, P.; Jiang, X.; Zhao, X.; Dong, Y.; Zheng, J.; Hu, C.; Gao, G.; Liu, L.; Li, S.; Bu, L. Vertical Profiles of Raindrop Size Distribution Parameters of Summer Rainfall in the Eastern Tibetan Plateau: Retrieval Method and Characteristics. Atmos. Meas. Tech. 2026, 19, 1407–1419. [Google Scholar] [CrossRef]
- Lyu, Y.; Yong, B. A Novel Double Machine Learning Strategy for Producing High-Precision Multi-Source Merging Precipitation Estimates over the Tibetan Plateau. Water Resour. Res. 2024, 60, e2023WR035643. [Google Scholar] [CrossRef]
- Ning, C.; Wang, Y.; Nan, Z.; Chen, H.; Liu, C. Study on Correction of Daily Precipitation Data of the Qinghai-Tibetan Plateau with Machine Learning Models. In Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2016—Proceedings; IEEE: New York, NY, USA, 2016; Volume 2016, pp. 517–520. [Google Scholar]
- Dong, J.; Zeng, W.; Wu, L.; Huang, J.; Gaiser, T.; Srivastava, A.K. Enhancing Short-Term Forecasting of Daily Precipitation Using Numerical Weather Prediction Bias Correcting with XGBoost in Different Regions of China. Eng. Appl. Artif. Intell. 2023, 117, 105579. [Google Scholar] [CrossRef]
- Grecco Sanches, R.; Sanches Miani, R.; César dos Santos, B.; Martins Moreira, R.; Zen de Figueiredo Neves, G.; Bourscheidt, V.; Augusto Toledo Rios, P. Using Xgboost Models Dor Daily Rainfall Prediction. An. De Geogr. De La Univ. Complut. 2025, 45, 75–92. [Google Scholar] [CrossRef]
- Kumar, V.; Kedam, N.; Sharma, K.V.; Khedher, K.M.; Alluqmani, A.E. A Comparison of Machine Learning Models for Predicting Rainfall in Urban Metropolitan Cities. Sustainability 2023, 15, 13724. [Google Scholar] [CrossRef]
- Mai, X.; Zhong, H.; Li, L. Research on Rain or Shine Weather Forecast in Precipitation Nowcasting Based on XGBoost. In Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery; Meng, H., Wang, L., Xiong, N., Lei, T., Li, M., Li, K., Eds.; Springer International Publishing AG: Cham, Switzerland, 2021; Volume 88, pp. 1313–1319. [Google Scholar]
- Han, Y.; Jin, W.; Liu, H.; Wang, W.; Ma, J.; Zhao, W. Optimization of Ecological Restoration Efficiency in Qinghai-Tibet Plateau Using the Cubist Regression Tree Model: A Study of Environmental Adaptability Models. PLoS ONE 2025, 20, e0335056. [Google Scholar] [CrossRef]
- Dong, N.; Hao, H.; Yang, M.; Wei, J.; Xu, S.; Kunstmann, H. Deep-Learning-Based Sub-Seasonal Precipitation and Streamflow Ensemble Forecasting over the Source Region of the Yangtze River. Hydrol. Earth Syst. Sci. 2025, 29, 2023–2042. [Google Scholar] [CrossRef]
- Lundberg, S.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions. arXiv 2017, arXiv:1705.07874. [Google Scholar]
- Wang, F.; Wang, X.; Li, S. Explainable Machine Learning for Predictive Modeling of Blowing Snow Detection and Meteorological Feature Assessment Using XGBoost-SHAP. PLoS ONE 2025, 20, e0318835. [Google Scholar] [CrossRef] [PubMed]
- Qian, Q.; Jia, X. Seasonal Forecast of Winter Precipitation over China Using Machine Learning Models. Atmos. Res. 2023, 294, 106961. [Google Scholar] [CrossRef]
- Li, Y.; Che, T.; Dai, L.; Wu, A.; Wang, J. Investigating the spatiotemporal behavior of VIC model parameters over the Tibetan plateau via global sensitivity analysis and machine learning. Int. J. Digit. Earth 2026, 19, 2625537. [Google Scholar] [CrossRef]
- Poletaev, A.; Liu, B.; Li, L.; Avagyan, V.; Voskanyan, V. Improve the Prediction in the Digital Era: Causal Feature Selection with Minimum Redundancy. J. Digit. Econ. 2024, 3, 14–36. [Google Scholar] [CrossRef]
- Noardo, F. Multisource Spatial Data Integration for Use Cases Applications. Trans. GIS 2022, 26, 2874–2913. [Google Scholar] [CrossRef]
- Evans, J.D. Straightforward Statistics for the Behavioral Sciences; Brooks/Cole Pub. Co.: Pacific Grove, CA, USA, 1996. [Google Scholar]
- Cohen, J. Statistical Power Analysis for the Behavioral Sciences, 2nd ed.; Lawrence Erlbaum Associates: Hillsdale, NJ, USA, 1988. [Google Scholar]
- Saputra, A.; Ramli, K.; Nugroho, A.S. Performance with Feature Selection Based Machine Learning: Combination of Low Variance Filter Pearson Correlation for Bots and Brute Force. In Proceedings of the 2025 International Conference on Computer Sciences, Engineering, and Technology Innovation (ICoCSETI), Jakarta, Indonesia, 21 January 2025; IEEE: New York, NY, USA, 2025; pp. 933–938. [Google Scholar]
- Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; ACM: New York, NY, USA, 2016; Volume 13–17, pp. 785–794. [Google Scholar]
- Qian, Q.; Jia, X.; Lin, H.; Zhang, R. Seasonal Forecast of Nonmonsoonal Winter Precipitation over the Eurasian Continent Using Machine-Learning Models. J. Clim. 2021, 34, 7113–7129. [Google Scholar] [CrossRef]
- Ali, S.; Khorrami, B.; Jehanzaib, M.; Tariq, A.; Ajmal, M.; Arshad, A.; Shafeeque, M.; Dilawar, A.; Basit, I.; Zhang, L.; et al. Spatial Downscaling of GRACE Data Based on XGBoost Model for Improved Understanding of Hydrological Droughts in the Indus Basin Irrigation System (IBIS). Remote Sens. 2023, 15, 873. [Google Scholar] [CrossRef]
- Yue, F.; Wang, X.; Ai, R.; Wu, Y.; Li, Q.; Feng, G. Predicting Summer Precipitation in China: A Hybrid Downscaling Model Using the XGBoost Method. Int. J. Climatol. 2025, 45, e70064. [Google Scholar] [CrossRef]
- Lei, H.; Li, H.; Zhao, H. Refining Daily Precipitation Estimates Using Machine Learning and Multi-Source Data in Alpine Regions with Unevenly Distributed Gauges. J. Hydrol. Reg. Stud. 2025, 58, 102272. [Google Scholar] [CrossRef]
- Schütz, M.; Schütz, A.; Bendix, J.; Thies, J.M.u.B. Evaluating Station, Satellite, & Combined Data for XGBoost-Based Visibility Forecast. Atmos. Res. 2026, 328, 108395. [Google Scholar] [CrossRef]
- Wang, M.; Li, Y.; Yuan, H.; Zhou, S.; Wang, Y.; Adnan Ikram, R.M.; Li, J. An XGBoost-SHAP Approach to Quantifying Morphological Impact on Urban Flooding Susceptibility. Ecol. Indic. 2023, 156, 111137. [Google Scholar] [CrossRef]
- Zhao, C.; Lin, Z.; Yang, L.; Jiang, M.; Qiu, Z.; Wang, S.; Gu, Y.; Ye, W.; Pan, Y.; Zhang, Y.; et al. A Study on the Impact of Meteorological and Emission Factors on PM2.5 Concentrations Based on Machine Learning. J. Environ. Manag. 2025, 376, 124347. [Google Scholar] [CrossRef]
- Lei, K.; Zhang, L.; Gao, L. Evaluation of IMERG Precipitation Product Downscaling Using Nine Machine Learning Algorithms in the Qinghai Lake Basin. Water 2025, 17, 1776. [Google Scholar] [CrossRef]
- Liu, S.; Wang, J.; Shi, F.; Zhuo, P.; Ao, T. Research on Multi-Source Precipitation Fusion Based on Classification and Regression Machine Learning Methods—A Case Study of the Min River Basin in the Eastern Source of the Qinghai-Tibet Plateau. Remote Sens. 2025, 17, 3982. [Google Scholar] [CrossRef]








| Data Type | Level & Temporal Resolution | Variables |
|---|---|---|
| Daily post-processed single-level statistics | Hourly (0.25° × 0.25°) | 10 m u-component of wind (u10), 10 m v-component of wind (v10), 2 m dewpoint temperature (d2m), 2 m temperature (t2m), Surface pressure (sp), Total precipitation (tp), Skin temperature (skt), 100 m u-component of wind (u100), 100 m v-component of wind (v100), Surface latent heat flux (slhf), Surface net solar radiation (ssr), Surface net thermal radiation (str), Surface sensible heat flux (sshf), Downward surface solar radiation (ssrd), Downward surface thermal radiation (strd), Cloud base height (cbh), total cloud cover (tcc), Evaporation (e), Potential evaporation (pev), Runoff (ro), Leaf area index (high vegetation) (lai_hv), Leaf area index (low vegetation) (lai_lv) |
| Single layer hour-by-hour | 500 hPa & 700 hPa Hourly (0.25° × 0.25°) | Divergence (d500; d700), Fraction of cloud cover (cc500; cc700), Geopotential (z500; z700), Ozone mass mixing ratio (o3500; o3700), Potential vorticity (pv500; pv700), Relative humidity (r500; r700), Specific cloud ice water content (ciwc500; ciwc700), Specific cloud liquid water content (clwc500; clwc700), Specific humidity (q500; q700), Specific rain water content (crwc500; crwc700), Specific snow water content (cswc500; cswc700), Temperature (t500; t700), u-component of wind (u500; u700), v-component of wind (v500; v700), Vertical velocity (w500; w700), Vorticity (relative) (vo500; vo700) |
| Microwave Radiometer (MWR, MP3000A) | Hourly | relative humidity (RH0; RH1…) water vapor density (wvd0; wvd1…), liquid water density (lwd0; lwd1…), Integral Water Vapor Content (IWV), Liquid Water Path (LWP) |
| Surface Meteorological Observations | Hourly | Temperature (T), surface temperature (T_surf), relative humidity (RH), surface atmospheric pressure (P), wind speed (Wspd), wind direction (Wdir), dew point temperature (DPT), precipitation (PRE_24h) |
| Category | Hyperparameter | Optimal Value |
|---|---|---|
| Boosting Control | learning_rate | 0.1 |
| n_estimators | 1000 | |
| Tree Structure | max_depth | 9 |
| min_child_weight | 5 | |
| Regularization | reg_alpha | 0.5 |
| reg_lambda | 1.0 | |
| Stochastic Sampling | subsample | 0.8 |
| colsample_bytree | 0.8 |
| Category | Metric | Unweighted Model (XGB-Shap) | Weighted Model (XGB-Shap) (This Study) | Baseline Model (RF) |
|---|---|---|---|---|
| Overall | R2 | 0.863 | 0.872 | 0.787 |
| MAE | 0.692 | 0.691 | 1.051 | |
| RMSE (mm) | 1.660 | 1.609 | 2.073 | |
| FAR | 0.0874 | 0.0890 | 0.0906 | |
| Heavy (>5 mm) | R2 | 0.686 | 0.727 | 0.529 |
| MAE | 3.093 | 2.942 | 4.436 | |
| RMSE (mm) | 4.783 | 4.470 | 5.861 | |
| Bias (mm) | −2.8092 | −2.7939 | −4.383 |
| Metric | Spr (XGB) | Spr (RF) | Sum (XGB) | Sum (RF) | Atu (XGB) | Atu (RF) | Win (XGB) | Win (RF) |
|---|---|---|---|---|---|---|---|---|
| R2 | 0.85 | 0.83 | 0.80 | 0.79 | 0.88 | 0.72 | 0.85 | 0.77 |
| RMSE | 0.75 | 1.73 | 3.26 | 2.51 | 1.17 | 1.99 | 0.32 | 1.95 |
| MAE | 0.44 | 0.91 | 1.67 | 1.18 | 0.60 | 1.04 | 0.11 | 1.06 |
| Bias | −0.04 | 0.03 | −0.07 | −0.18 | −0.01 | −0.14 | −0.02 | −0.06 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zhao, D.; Zhang, S.; Liu, G.; Pan, X.; Wang, T.; Ding, H.; Sang, W.; Ma, Y. Precipitation Prediction and Factor Interpretation at Maqu Station in the Eastern Qinghai-Tibet Plateau Based on XGBoost-SHAP. Water 2026, 18, 1355. https://doi.org/10.3390/w18111355
Zhao D, Zhang S, Liu G, Pan X, Wang T, Ding H, Sang W, Ma Y. Precipitation Prediction and Factor Interpretation at Maqu Station in the Eastern Qinghai-Tibet Plateau Based on XGBoost-SHAP. Water. 2026; 18(11):1355. https://doi.org/10.3390/w18111355
Chicago/Turabian StyleZhao, Dandan, Shaoqing Zhang, Guangjing Liu, Xiaole Pan, Tianyi Wang, Huiyu Ding, Wenjun Sang, and Yongjing Ma. 2026. "Precipitation Prediction and Factor Interpretation at Maqu Station in the Eastern Qinghai-Tibet Plateau Based on XGBoost-SHAP" Water 18, no. 11: 1355. https://doi.org/10.3390/w18111355
APA StyleZhao, D., Zhang, S., Liu, G., Pan, X., Wang, T., Ding, H., Sang, W., & Ma, Y. (2026). Precipitation Prediction and Factor Interpretation at Maqu Station in the Eastern Qinghai-Tibet Plateau Based on XGBoost-SHAP. Water, 18(11), 1355. https://doi.org/10.3390/w18111355

