Exploring the Main Driving Factors for Terrestrial Water Storage in China Using Explainable Machine Learning

Ma, Xinjing; Huang, Haijun; Chen, Jinwen; Yu, Qiang; Cai, Xitian

doi:10.3390/rs17122078

by Xinjing Ma ¹,†, Haijun Huang ²,†, Jinwen Chen ², Qiang Yu ³ and Xitian Cai ²,*

¹ School of Software Engineering, Sun Yat-sen University, Zhuhai 519082, China

² School of Civil Engineering, Sun Yat-sen University, Zhuhai 519082, China

³ School of Atmospheric Sciences, Sun Yat-sen University, Zhuhai 519082, China

* Author to whom correspondence should be addressed.

† These authors contributed equally to this work.

Remote Sens. 2025, 17(12), 2078; https://doi.org/10.3390/rs17122078

Submission received: 11 May 2025 / Revised: 11 June 2025 / Accepted: 14 June 2025 / Published: 17 June 2025

(This article belongs to the Section AI Remote Sensing)

Abstract

Terrestrial water storage (TWS) is a critical component of the hydrological cycle and plays a key role in regional water resource management. The launch of the Gravity Recovery and Climate Experiment (GRACE) satellite mission in 2002 has provided precise measurements of TWS, enabling systematic investigations into its spatial pattern and driving mechanisms. However, a comprehensive evaluation of the spatial drivers of TWS variations across China is still lacking. In this study, we employed a robust machine learning model to capture the spatial patterns of TWS in China and further applied the Shapley Additive Explanations (SHAP) method to disentangle the individualized effects of hydroclimatic variables. Our findings reveal that precipitation is the dominant driver in northern and southern China, while soil moisture and snow water equivalent are key contributors on the Tibetan Plateau. In northwestern China, air pressure and groundwater runoff are the main influencing factors, whereas temperature shows a pronounced negative effect. Importantly, most variables demonstrate non-monotonic influences: in particular, we found that the importance of precipitation diminishes beyond a certain threshold, and surface pressure shifts sharply toward a negative impact. The explainable machine learning framework demonstrated strong adaptability in identifying complex drivers of TWS, offering a powerful methodological advancement for exploring TWS dynamics and providing valuable insights for water resource management in China.

Keywords: terrestrial water storage; machine learning; China; SHAP

1. Introduction

In response to the escalating challenges of water resource scarcity, the development of effective water resource management strategies has become a pressing research imperative. Within this context, terrestrial water storage (TWS)—an integrated measure encompassing surface water, soil moisture, groundwater, and snow/ice—has emerged as a critical research focus due to its fundamental role in water cycle dynamics [1]. The spatial heterogeneity of TWS dictates the distribution of water resource availability [2] and fundamentally impacts regional water security. Typically, although China is one of the countries with the richest water resources in the world in terms of total volume, the spatial distribution of water resources is highly uneven due to climatic and geographical factors. Many arid and semi-arid regions continue to suffer from water scarcity [3]. Therefore, an in-depth investigation into the spatial distribution of TWS in China and its climatic and hydrological driving mechanisms is of significant scientific and policy relevance for optimizing differentiated regional water allocation strategies and alleviating water scarcity issues in arid and semi-arid regions.

Since the launch of the Gravity Recovery and Climate Experiment (GRACE) satellites in 2002, highly accurate measurements of TWS have become available. This technological breakthrough has enabled extensive monitoring of TWS variations, often through integration with hydrological models, land surface models, and other analytical frameworks [4,5,6,7,8,9,10]. In particular, the feasibility of using GRACE satellite data to track long-term TWS changes in China has been validated in previous studies [11,12,13,14]. Additionally, to overcome the limited spatial resolution of GRACE data, subsequent research efforts have incorporated complementary approaches, such as land surface models and remote sensing techniques [15]. Among these, Global Land Data Assimilation System (GLDAS) 2.2 stands out by assimilating GRACE satellite observations to enhance both the spatial resolution and the accuracy of TWS estimates [16], thereby facilitating a more refined analysis of the driving factors and underlying mechanisms governing TWS variability. Currently, TWS has been observed to have declined globally at a rate of 1 cm per year over the past two decades [17], which is closely associated with the impacts of extreme weather events [18,19,20,21], with China experiencing a decline of about 2 mm annually. However, the primary factors driving TWS variations are highly region-specific. For example, Girotto et al. [6] identified groundwater extraction for irrigation as the primary cause of TWS depletion in India, while Kim et al. [22] discovered the influence of river storage variations on seasonal TWS changes in wet basins.

Despite these advances, substantial knowledge gaps persist, particularly in regions with complex topoclimatic conditions and pronounced spatial heterogeneity, such as those found across China [23,24]. Recent studies have begun to shed light on these complexities. For example, Li et al. [25] demonstrated that the dominant drivers of TWS in southern China are precipitation and runoff [26], while Xie et al. [27] reported that the TWS declines in northern China are largely attributable to anthropogenic activities, whereas increases in southern China are mainly driven by precipitation. Yang et al. [28] emphasized the significant roles of evapotranspiration (ET) and runoff in regulating TWS in southern regions. These findings point to two major limitations in current research. Firstly, most studies focus on a narrow set of climatic variables (typically no more than four, such as precipitation and evaporation), which may not comprehensively represent the full complexity of the hydrological cycle. Secondly, the predominant reliance on simple linear regression models limits the capacity to capture the spatial heterogeneity of TWS dynamics and fails to adequately address multicollinearity among input features, potentially leading to biased interpretations.

With the rapid advancement of machine learning (ML) technologies, there has been a growing trend toward leveraging ML’s superior capabilities in feature extraction and pattern recognition for hydrological studies, including applications to runoff, soil moisture, and groundwater prediction [29,30,31,32,33]. Artificial neural networks (ANNs) are well-suited for modeling nonlinear relationships and are resistant to noise, and their flexible structure makes them popular for tasks like flood forecasting and water quality modeling [34]. Support vector machines (SVMs) offer strong generalization and global optimal solutions, providing accurate and stable predictions even with small datasets, and they also train faster than ANNs and are often used for forecasting tasks, such as reservoir inflow prediction [35]. Ensemble models like random forests (RF) and boosting methods have shown strong performance and robustness in handling complex, high-dimensional data, while maintaining good interpretability. Deep learning (DL) excels at extracting features from complex and high-dimensional data and has been successfully applied to complex tasks like multisatellite data fusion, gap-filling, and prediction in data-scarce regions [36]. DL models can leverage advanced structures like convolutional layers and attention mechanisms to detect relevant or complementary information across multiple data sources, improve cross-validation, remove redundancy, and enhance prediction accuracy and reliability.

However, both traditional ML and recently developed DL approaches tend to function as “black boxes”, making it difficult to reveal the physical mechanisms underlying their predictions. This limitation restricts their application value in hydrological process diagnosis, causal inference, and scientific decision-making. To address this limitation, explainable ML (XML) techniques have emerged, aiming to enhance the transparency and interpretability of model outputs while preserving the strong predictive capabilities of ML models [37,38]. XML refers to a suite of methods designed to make the internal logic and output of complex models more understandable, enabling users to grasp the rationale behind predictions. This not only allows for more trustworthy and interpretable results, but also lays the groundwork for integrating data-driven modeling with hydrological domain knowledge. For this study, we selected an ML model, which has been shown to capture TWS patterns effectively without the added complexity and uncertainty of DL [39,40,41]. By pairing it with an XML framework, we achieved a synergistic balance between predictive accuracy and mechanistic insight, ultimately increasing the model’s value for both scientific research and practical water resource management.

Based on these motivations, the main contributions of this study are threefold: (1) the development of a machine learning model that effectively captures the spatial patterns of TWS using hydroclimatic variables; (2) the identification of dominant spatial drivers of TWS in China through XML methods; (3) the interpretation of the underlying mechanisms revealed by model explanations. Through this research, we aim to advance understanding of the hydrological system and provide scientific support for more effective and targeted water resource management strategies.

2. Materials and Methodology

2.1. Study Area

China, located in eastern Asia between 3.9°–53.5°N and 73.7°–135°E, spans nearly 9.6 million square kilometers and exhibits remarkable climatic and geographic diversity. Its extensive latitudinal range and complex topography give rise to distinct climate zones and a wide range of landscapes, leading to a pronounced variability in climate and TWS. For example, the difference between the subtropical monsoon climate in southern China and the temperate continental arid climate in the northwest establishes a marked southeast-to-northwest gradient in precipitation. Correspondingly, the landscape transitions from humid southeastern coastal plains, forests, and hills to increasingly arid mountains and deserts in the northwestern interior. In addition to natural climatic influences, intensive human activities—driven by a population exceeding 1.4 billion—have substantially reshaped the hydrological environment. Large-scale interventions, such as water conservancy projects and groundwater extraction, have profoundly altered the natural water cycle. Together, these natural and anthropogenic processes drive strong spatial heterogeneity in TWS, making China an ideal study area for investigating the drivers of TWS variations (Figure 1).

2.2. Data Source

The hydroclimatic data used in this study were derived from the GLDAS, developed by NASA’s Goddard Space Flight Center. GLDAS integrates satellite- and ground-based observations with advanced land surface models to generate high-resolution terrestrial hydroclimatic datasets with near-real-time capability and long-term consistency [14,42]. The GLDAS version 2.2 product, based on the Catchment Land Surface Model (CLSM), assimilates GRACE observations. This improves the consistency and accuracy of CLSM-derived TWS estimates, making them well-suited for long-term TWS analysis [43,44,45]. The CLSM TWS data are provided at a spatial resolution of 0.25° and a daily temporal resolution from 1 January 2003 to the present (1 September 2022, used in this study). Additionally, a multi-year mean was calculated from the TWS data using the Climate Data Operators (CDO) software, version 2.2.0, to remove the temporal dimension and represent average spatial patterns. The original TWS data can be publicly accessed at https://daac.gsfc.nasa.gov/datasets/GLDAS_CLSM025_DA1_D_2.2/summary (accessed on 10 January 2023).

To capture the spatial distribution patterns of TWS, several key hydrometeorological variables were selected, including precipitation (P), surface air pressure (SP), air temperature (T), soil moisture (SM), groundwater runoff (Rg), snow melt amount (Rm), surface runoff (Rs), and snow depth water equivalent (SWE). These variables were taken from the GLDAS Noah land surface model (Noah) product, version 2.1, which has been widely validated and applied in hydrological research for its robustness and high temporal resolution [46,47]. This version offers data at a spatial resolution of 0.25° and a 3-hourly temporal resolution, spanning from 1 January 2000 to the present (1 September 2022, used in this study). The study period for all variables used in the modeling was January 2003 to September 2022 to match the TWS data availability. The Noah dataset can be accessed at https://daac.gsfc.nasa.gov/datasets/GLDAS_NOAH025_3H_2.1/summary (accessed on 10 January 2023). An overview of the data used in this study is given in Table 1.

To prepare the data for analysis, hydrological variables, including P, ET, Rg, Rs, and Rm, were first aggregated to a daily scale before multi-year averaging. For soil moisture, as Noah provides the SM data in four distinct layers, a weighted averaging method was applied to compute the total soil moisture. Regarding SWE, given its significant regional and seasonal variability, as well as its generally low magnitude, the months with highest coverage and values (January, February, November, and December) were selected for multi-year averaging to minimize potential biases.
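The preprocessing steps described above can be sketched as follows. This is a minimal illustration rather than the study's actual pipeline: the function names are hypothetical, and the thickness-based weights are an assumption, chosen because the Noah product reports soil moisture for the 0–10, 10–40, 40–100, and 100–200 cm layers; the text does not state which weights were used.

```python
# Assumption: weights proportional to Noah layer thickness (cm).
LAYER_THICKNESS_CM = (10.0, 30.0, 60.0, 100.0)
# Months selected in the text for SWE averaging.
WINTER_MONTHS = {1, 2, 11, 12}


def daily_aggregate(values, steps_per_day=8, how="mean"):
    """Collapse a 3-hourly series (8 steps per day) into daily values."""
    days = [values[i:i + steps_per_day]
            for i in range(0, len(values), steps_per_day)]
    if how == "sum":
        return [sum(day) for day in days]
    return [sum(day) / len(day) for day in days]


def weighted_soil_moisture(layers, weights=LAYER_THICKNESS_CM):
    """Weighted average of the four per-layer soil moisture values."""
    if len(layers) != len(weights):
        raise ValueError("one weight per soil layer is required")
    return sum(v * w for v, w in zip(layers, weights)) / sum(weights)


def winter_mean_swe(monthly_swe):
    """Average SWE over the selected winter months only.

    monthly_swe is an iterable of (month_number, swe_value) pairs.
    """
    selected = [v for month, v in monthly_swe if month in WINTER_MONTHS]
    return sum(selected) / len(selected)
```

Whether a flux is averaged or summed when moving to the daily scale depends on its units, so the `how` argument is left to the caller.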

2.3. Ensemble Machine Learning Framework

Extreme Gradient Boosting (XGBoost) has emerged as a powerful ensemble machine learning approach in hydrological studies for its superior performance in handling complex, non-linear relationships in Earth system data [48]. XGBoost is an ensemble learning method that sequentially trains weak learners (typically decision trees) to correct the errors of previous models, thereby improving overall prediction accuracy. It also incorporates regularization techniques to prevent overfitting [48,49,50,51]. Additionally, its computational efficiency is enhanced by parallel processing, out-of-core computation, GPU acceleration, and distributed training, making it particularly suitable for large-scale hydrological datasets. Moreover, regional-scale studies on TWS reconstruction have demonstrated that XGBoost achieves comparable or superior predictive performance compared to artificial neural networks and random forests while maintaining model interpretability [52,53]. Based on these advantages, XGBoost was selected for TWS simulation in this study.

The model used in this study was implemented using the native interface of the Python package “xgboost” (version 1.6.1) with Python 3.9.13. The dataset, comprising 94,703 samples, was randomly partitioned into training and testing sets at a 6:4 ratio, ensuring a similar distribution of data between the two sets. The best model, defined as the one with the minimum average root-mean-square error (RMSE), was selected based on 10-fold cross-validation conducted on the training set. For each fold, model hyperparameters were optimized by the stochastic hill-climbing algorithm, a local search method that sequentially adjusts each parameter within a predefined range. Starting from initial values and step sizes, the algorithm iteratively updates each hyperparameter to minimize RMSE, fixing the optimal value before proceeding to the next parameter. This process continues until a locally optimal hyperparameter combination is obtained. The hyperparameters considered, along with their optimal values, are summarized in Table 2. The complete framework is illustrated in Figure 2.
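The parameter-by-parameter search described above can be illustrated with a small sketch. It is a deterministic simplification under stated assumptions: the function `hill_climb`, the toy objective, and the ranges and step sizes are all hypothetical, and the toy objective merely stands in for the cross-validated RMSE that the study minimized. A stochastic variant would sample neighboring values at random instead of testing both neighbors.

```python
def hill_climb(objective, init, steps, bounds, max_iter=100):
    """Coordinate-wise hill climbing that minimizes objective(params).

    Each parameter is tuned in turn: it moves by its step size while the
    score improves, is then fixed, and the search proceeds to the next one.
    """
    params = dict(init)
    best = objective(params)
    for name in list(init):
        low, high = bounds[name]
        for _ in range(max_iter):
            candidates = []
            for delta in (-steps[name], steps[name]):
                value = params[name] + delta
                if low <= value <= high:
                    trial = dict(params)
                    trial[name] = value
                    candidates.append((objective(trial), value))
            if not candidates:
                break
            score, value = min(candidates)
            if score >= best:
                break  # no neighbor improves: local optimum for this parameter
            best, params[name] = score, value
    return params, best


def toy_objective(params):
    """Stand-in for cross-validated RMSE, smallest at max_depth=6, eta=0.1."""
    return (params["max_depth"] - 6) ** 2 + 100 * (params["eta"] - 0.1) ** 2


best_params, best_score = hill_climb(
    toy_objective,
    init={"max_depth": 3, "eta": 0.3},
    steps={"max_depth": 1, "eta": 0.05},
    bounds={"max_depth": (1, 10), "eta": (0.05, 0.5)},
)
```

Because each parameter is fixed before the next is tuned, the result is only a local optimum and can depend on the order and starting values of the parameters.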

2.4. Evaluation Metrics

The evaluation of the model was carried out using the combination of three metrics: coefficient of determination (R²), RMSE, and mean absolute percentage error (MAPE). R² is a common indicator that quantifies the goodness of fit, with higher R² values signifying better performance. RMSE measures the magnitude of prediction errors, with a smaller RMSE associated with a better performance. MAPE evaluates model performance from the perspective of relative error, which is particularly advantageous when comparing datasets with different units (e.g., simultaneously assessing TWS and discharge predictions). A lower MAPE indicates higher prediction accuracy, with MAPE < 15% generally indicating excellent model performance. R², RMSE, and MAPE are estimated as follows:

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}

(1)

\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}

(2)

\mathrm{MAPE} = \frac{100\%}{n} \sum_{i = 1}^{n} \left| \frac{y_{i} - {\hat{y}}_{i}}{y_{i}} \right|

(3)

where n denotes the sample size, y_{i} represents the observed value of the i-th sample point, {\hat{y}}_{i} denotes the predicted value of the i-th sample point, and \bar{y} is the average of the observed values in the samples.
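As a quick check of the three metrics, the following sketch computes them in plain Python, with observations and predictions passed explicitly as `obs` and `pred`. The function names and the three sample values are illustrative only and are not taken from the study.

```python
import math


def r2_score(obs, pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(obs) / len(obs)
    ss_res = sum((y - y_hat) ** 2 for y, y_hat in zip(obs, pred))
    ss_tot = sum((y - mean_obs) ** 2 for y in obs)
    return 1.0 - ss_res / ss_tot


def rmse(obs, pred):
    """Root-mean-square error, in the units of the observations."""
    return math.sqrt(
        sum((y - y_hat) ** 2 for y, y_hat in zip(obs, pred)) / len(obs)
    )


def mape(obs, pred):
    """Mean absolute percentage error (%); observations must be non-zero."""
    return 100.0 / len(obs) * sum(
        abs((y - y_hat) / y) for y, y_hat in zip(obs, pred)
    )


obs = [100.0, 200.0, 300.0]   # illustrative values only
pred = [110.0, 190.0, 300.0]
print(r2_score(obs, pred), rmse(obs, pred), mape(obs, pred))
```

MAPE divides by the observed value, so it becomes unstable where observations approach zero; RMSE is reported alongside it for that reason.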

2.5. SHAP

In this study, Shapley additive explanations (SHAP) [54,55], a method grounded in the Shapley value concept from game theory, was employed to explain the predictions of the machine learning model. SHAP assigns each feature an importance value representing its marginal contribution to the prediction, ensuring consistency, local accuracy, and fair attribution across features. In hydrological modeling, where physical processes are often complex, nonlinear, and interdependent, SHAP offers a robust framework for quantifying the relative influence of meteorological, climatic, and catchment-specific variables on model outputs. Moreover, compared to the interpretation method used in XGBoost (gain-based interpretation, GBI), SHAP is more effective in handling multicollinearity and offers a more comprehensive understanding of the results, thus facilitating subsequent decision-making processes [56]. The calculation of the SHAP value can be represented as follows:

g (Z^{'}) = φ_{0} + \sum_{i = 1}^{M} φ_{i} Z_{i}^{'}

(4)

φ_{i} = \sum_{S \subseteq N \setminus \{i\}} \frac{| S |! (M - | S | - 1)!}{M!} (f_{x} (S \cup \{i\}) - f_{x} (S))

(5)

where Z′∈{0, 1}^M is a coalition vector representing the presence or absence of each feature, M is the total number of features, φ_0 is the base value (the model output when no feature is known), φ_i is the SHAP value of feature i, f_x(S) is the output of the non-linear XGBoost regression mapping when only the features in subset S are known, N is the set of all M feature variables, and S is a subset of N that excludes feature i and contains |S| features. The term f_x(S ∪ {i}) − f_x(S) is therefore the difference in prediction values caused by the inclusion of the additional feature variable i. For each data sample, every driving factor receives one SHAP value, which quantifies that factor’s contribution to the sample’s prediction [57].
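To make the Shapley weighting concrete, the sketch below evaluates the sum over feature subsets by brute force for a three-feature toy value function. Both `shapley_values` and `toy_value` are hypothetical names, and the toy function is an assumption made purely for illustration. SHAP libraries do not enumerate subsets this way for tree models; they use faster tree-specific algorithms.

```python
from itertools import combinations
from math import factorial


def shapley_values(value_fn, features):
    """Exact Shapley values by enumerating every subset S that excludes i."""
    m = len(features)
    phi = {}
    for i in features:
        others = [f for f in features if f != i]
        total = 0.0
        for size in range(m):
            # Weight |S|! (M - |S| - 1)! / M! for subsets of this size.
            weight = factorial(size) * factorial(m - size - 1) / factorial(m)
            for subset in combinations(others, size):
                s = frozenset(subset)
                total += weight * (value_fn(s | {i}) - value_fn(s))
        phi[i] = total
    return phi


def toy_value(subset):
    """Toy model output when only the features in `subset` are known."""
    value = 0.0
    if "P" in subset:
        value += 2.0
    if "SM" in subset:
        value += 1.0
    if "P" in subset and "T" in subset:
        value += 1.0  # interaction term shared between P and T
    return value


features = ["P", "SM", "T"]
phi = shapley_values(toy_value, features)
print(phi)
```

The interaction term is split equally between P and T, and the values sum to the difference between the full-model output and the empty-set output, which is the local accuracy property mentioned above.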

3. Results

3.1. Model Performance

Figure 3 presents the model performance on the testing set from different perspectives. Generally, the XGBoost model achieved an R² of 0.98, a mean MAPE of 2.36%, and a mean RMSE of 22.88 mm, demonstrating strong predictive capability and overall model performance. Figure 3a,b display the multi-year average TWS of observations and predictions, respectively, where minimal discrepancies are observed. This suggests that the model can effectively capture the spatial patterns of TWS. Figure 3c,d compare observed and predicted TWS values in detail. A high degree of fit is evident, with most points closely aligning along the 1:1 line (y = x), even for extreme values. Moreover, the model achieved a median residual of −0.30 mm, a mean MAPE of 1.28%, and a mean RMSE of 12.08 mm, further demonstrating its strong predictive accuracy.

A more in-depth analysis of the spatial distribution of the MAPE was conducted to better understand model performance across different regions. Figure 4 illustrates the spatial distribution of the MAPE. The results demonstrate exceptional model accuracy, with approximately 70% of monitoring points achieving MAPE values below 2.5%, while over 99% of points fall within the 20% threshold. Furthermore, larger MAPE values are predominantly concentrated in the Yellow River Basin and arid northwestern regions, characterized by intensive anthropogenic activities such as irrigation withdrawals and reservoir operations. This spatial correspondence suggests that human-induced hydrological modifications may introduce additional uncertainty, leading to increased prediction errors in these sensitive zones. Overall, model predictions demonstrate strong agreement with observed TWS, as supported by the evaluation metrics, thus providing a robust foundation for subsequent SHAP analysis.
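The threshold statistics quoted above amount to computing an absolute percentage error per grid point and counting the share below each cutoff. The sketch below shows that calculation under stated assumptions: the helper names are hypothetical and the four sample points are invented for illustration, not taken from the study.

```python
def absolute_percentage_error(obs, pred):
    """Absolute percentage error (%) for a single grid point."""
    return 100.0 * abs((obs - pred) / obs)


def share_below(errors, threshold):
    """Percentage of grid points whose error is below the threshold."""
    return 100.0 * sum(e < threshold for e in errors) / len(errors)


# Illustrative multi-year mean TWS values (mm) at four grid points.
observed = [1000.0, 800.0, 500.0, 200.0]
predicted = [990.0, 810.0, 520.0, 150.0]

errors = [absolute_percentage_error(o, p) for o, p in zip(observed, predicted)]
print(share_below(errors, 2.5), share_below(errors, 20.0))
```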

3.2. Main Drivers for TWS

Figure 5 illustrates the feature importance derived from the XGBoost model using GBI and SHAP methods. Both approaches consistently identify P and SM as the most important features, with SP also showing a relatively high influence. This strong agreement between the two methods enhances the credibility of the results. However, a divergence is observed in the evaluation of T, Rg, and Rs: the GBI method attributes relatively low importance to these variables, whereas the SHAP method suggests a greater contribution. This discrepancy may be attributed to the GBI method’s limited capacity to accurately account for multicollinearity among input features, which can obscure the true importance of correlated variables. These findings further underscore the necessity of employing the SHAP method for more robust and reliable interpretation of feature contributions.

To further explore the influence of factors on TWS, the dominant driving factors at the pixel level were identified and mapped in Figure 6. Results reveal a significant regional heterogeneity from southeast to northwest China. In the monsoon climate zones, P emerges as the primary driver for TWS. Precipitation-dominated and non-precipitation-dominated regions are roughly bounded by the 400 mm annual precipitation threshold, emphasizing the important role precipitation plays in the distribution of TWS in China. In the Qinghai–Tibetan Plateau, SM plays a dominant role, which can be attributed to the water retention capacity under complex terrain conditions. In the northwestern arid and semi-arid climate zones, TWS is mainly regulated by SP, Rg, and Rm, where precipitation is scarce and TWS recharge comes mainly from Rm and Rg.
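One plausible way to build such a map is to assign each pixel the feature with the largest absolute SHAP value. The text does not spell out the selection rule, so this criterion is an assumption, as are the function names and the toy SHAP values below.

```python
def dominant_driver(shap_row):
    """Feature with the largest absolute SHAP value at one pixel."""
    return max(shap_row, key=lambda name: abs(shap_row[name]))


def mean_abs_importance(shap_rows):
    """Global importance: mean absolute SHAP value per feature."""
    names = shap_rows[0].keys()
    return {
        name: sum(abs(row[name]) for row in shap_rows) / len(shap_rows)
        for name in names
    }


# Toy SHAP values (mm) for three pixels; illustrative only.
rows = [
    {"P": 30.0, "SM": -12.0, "SP": 5.0},
    {"P": -4.0, "SM": 25.0, "SP": -8.0},
    {"P": 2.0, "SM": 6.0, "SP": -40.0},
]

drivers = [dominant_driver(row) for row in rows]
importance = mean_abs_importance(rows)
print(drivers, importance)
```

Using the absolute value means a strongly negative contribution, such as SP in the third pixel, can still be the dominant driver.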

3.3. Individual Impact of Driving Factors

Figure 7 illustrates the individual impact of each driving factor on TWS, with the interactions between factors removed (showing SHAP main effects). Values above the red line indicate a positive driving effect, while values below the red line indicate a negative driving effect. Generally, almost all factors show non-monotonic impacts on TWS. Among all factors, precipitation and soil moisture exhibit the strongest positive effects: as P and SM increase, their positive contributions to TWS strengthen, but once P and SM reach a certain threshold, their impacts on TWS weaken. This may indicate that at this point, the soil has reached its water-holding capacity, limiting further positive contribution from P. Similarly, the influence of Rs and Rg on TWS is non-monotonic and relatively strong at low values. However, as the values increase, the effect gradually weakens and stabilizes. Interestingly, when SP exceeds a certain threshold, its SHAP value turns sharply negative. SWE and Rm affect TWS at low values, but as their values increase, their influence becomes almost negligible. This could be attributed to China’s extensive latitudinal range, where snow accumulation is limited to specific regions. Significant snowfall only occurs in certain high-latitude or high-altitude areas, where SWE exerts a pronounced positive effect on TWS.

4. Discussion

4.1. Spatial Distribution of Dominant Drivers of TWS in China

TWS is a key component of the global hydrological cycle, and its distribution reflects the availability of freshwater resources. Identifying the driving factors of the spatial TWS distribution helps to improve our understanding of the water cycle processes, while also aiding in the assessment and allocation of water resources [58]. Previous studies have demonstrated that climate factors (e.g., precipitation, temperature, and snowmelt) dominate TWS variability through both direct (e.g., recharge via rainfall) and indirect pathways (e.g., temperature-controlled evapotranspiration) [59,60].

According to the feature importance rankings in Figure 5, precipitation is the primary factor influencing the distribution of TWS. On average, China receives about 0.64 mm of daily precipitation, but there are substantial regional differences. Mean annual precipitation ranges from over 2000 mm in the southeastern coastal regions to less than 50 mm in the arid northwest. Influenced by the monsoon, the southern and eastern coastal regions experience higher daily precipitation, while the northern and western regions, particularly the northwest, have lower daily precipitation. This significant difference in daily precipitation is the fundamental reason why precipitation is the primary factor determining the distribution of TWS in China. Moreover, given the large monsoon-influenced regions, precipitation is the primary source fueling TWS across much of the country. This also explains why precipitation dominates in the model outputs and exhibits a significant positive driving contribution in the SHAP value analysis.

However, Figure 7a shows that as precipitation increases, its positive contribution to TWS distribution decreases. Results reveal that a multi-year mean daily precipitation of 4 mm (approximately 1400 mm annually) corresponds to the peak contribution of precipitation to TWS. When precipitation exceeds this threshold, its contribution to TWS begins to decline sharply. This variation may result from the coupled effects of soil hydrological properties and vegetation ecological processes [61]. Figure 8b shows that this level of precipitation primarily occurs in the southeastern coastal regions and southern China. From a soil perspective, the saturated hydraulic conductivity (K_sat) of the main soil types in eastern China reaches the inflection point of infiltration efficiency when the daily precipitation is between 3.5 and 4.5 mm [62]. When precipitation exceeds this threshold, the decline in soil matric potential slows down the infiltration rate of water into the subsurface and groundwater. This leads to increased saturation-excess runoff, thereby reducing TWS. Concurrently, enhanced lateral flow diverts more water toward rivers or water bodies rather than being stored in the soil [42,53]. This precipitation level (approximately 1400 mm a⁻¹) closely matches the water demand of the maximum net primary productivity (NPP) zone (1200–1500 gC m⁻² a⁻¹) [63] in the eastern forests of China. At this point, canopy interception (about 15%), transpiration (about 35%), and soil water retention (about 40%) reach an optimal balance for water use [64]. However, further increases in precipitation disrupt this balance: canopy interception rises nonlinearly (increasing by 1.8% for every additional 100 mm), and increased respiratory losses reduce the water use efficiency of NPP, forming a negative feedback loop among precipitation, vegetation, and TWS [57,63,64,65].

Both importance metrics (GBI and SHAP) indicate that soil moisture is the second most important driver of TWS. Figure 6 illustrates that the sample points where SM dominates are primarily located in the Tibetan Plateau region. This region receives relatively low annual precipitation (especially in the western and northern parts), making the soil’s water-holding capacity a key factor in water retention. The Qinghai-Tibet Plateau features highly variable terrain, with lithology dominated by clastic rocks and permafrost layers [66]. These conditions hinder the development of well-formed aquifers, making it difficult to establish stable groundwater systems. Under such circumstances, shallow soil moisture becomes the most stable and responsive component of TWS. Although the plateau region does experience seasonal snow and glacial cover, rising temperatures cause snowmelt and runoff to flow out of the region, failing to effectively contribute to regional TWS gains [67]. Instead, this outflow may intensify the declining trend of TWS. As a result, although SWE may have significant short-term contributions in certain areas, its overall spatial contribution remains limited.

In the arid northwest region, SP and Rg emerge as dominant contributors in the XGBoost model. With scarce precipitation (e.g., <100 mm a⁻¹ in the Tarim Basin) and intense evaporation, elevated SP suppresses surface evaporation, reducing moisture loss and indirectly preserving limited soil water and groundwater, thereby enhancing their contribution to TWS. Additionally, the region is influenced by the Siberian High, where SP variations directly modulate external moisture transport by regulating atmospheric circulation, making SP a key predictor for TWS [68]. Meanwhile, minimal surface runoff and sustained groundwater recharge from mountain snowmelt (e.g., Tianshan and Kunlun Mountains) maintain stable TWS [58,69,70]. The high SHAP value of Rg underscores its critical role in sustaining water resources in this arid zone.

In this study, the effect of temperature on TWS is primarily negative via indirect pathways (Figure 7e). This aligns with the fundamental physical mechanism, whereby elevated temperatures increase atmospheric water demand. Firstly, enhanced evapotranspiration under warmer conditions depletes soil moisture and reduces groundwater recharge [71]. Secondly, the spatial gradients representing persistent elevated temperature states in our climatological analysis demonstrate significant coupling with hydrologically stressed regimes that systematically impair water storage recovery processes [60].

4.2. Sources of Uncertainty

This study used GLDAS data to train the XGBoost model and to derive the SHAP-based interpretation, a process that may introduce uncertainties at both the data and model structure levels. Firstly, the structure of the GLDAS land surface model has certain limitations. For example, the soil moisture variable output by the model only reflects the changes in near-surface soil moisture and does not account for the changes in deep groundwater [72]. Additionally, the impact of human factors on TWS has not been included in this study. According to statistics, irrigation accounts for about 70% of global freshwater withdrawals. As an important component of the water cycle, TWS is strongly influenced by irrigation [39,73]. In China, the total irrigated area is approximately 67 million hectares, distributed across various provinces and municipalities. Irrigation is a significant factor influencing changes in TWS, and since our model relies solely on climate data, the predictions may lack precision in heavily irrigated areas. Secondly, many reservoirs have been built in China for irrigation, flood control, and other purposes. The large amount of water stored in these reservoirs may lead to long-term changes in TWS, making the GLDAS data insufficiently accurate for simulating TWS in the Chinese region [74]. Moreover, numerous water resource management initiatives, such as the South-to-North Water Diversion Project, the Dianchi Lake Restoration Project, and the renovation of the Grand Canal, also impact the spatial distribution of TWS.

XGBoost can effectively reduce uncertainty through ensemble learning and its relatively transparent model structure; however, like many powerful ML models, its internal complexity poses challenges for interpretation. Explainable machine learning (XML) techniques enhance model interpretability and enable informed model adjustments, but they inevitably introduce additional uncertainties, because they rely on data-driven frameworks that are inherently limited by the underlying algorithms. Moreover, because SHAP attribution is additive, it assigns a contribution to each feature individually [54], potentially overlooking the complex interdependencies and couplings that often characterize real-world physical processes. Such limitations can affect the completeness of the mechanistic interpretation derived from SHAP values. To mitigate this, we carefully analyzed feature correlations and made appropriate trade-offs during feature selection and preprocessing.
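The additivity limitation can be illustrated with a minimal sketch. The code below computes exact Shapley values for a hypothetical two-feature toy function (it is not the model used in this study), using a single baseline point as a simplification of the background expectation that SHAP normally uses. The names `shapley_values` and `toy_model` and all coefficients are assumptions made for illustration only.

```python
from itertools import combinations
from math import factorial


def shapley_values(f, x, baseline):
    """Exact Shapley values of f at x, relative to one baseline point."""
    n = len(x)

    def value(subset):
        # Features in `subset` take their actual value; the rest stay at baseline.
        z = [x[i] if i in subset else baseline[i] for i in range(n)]
        return f(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for k in range(n):
            for s in combinations(others, k):
                # Shapley weight for a coalition of size k that excludes feature i.
                w = factorial(k) * factorial(n - k - 1) / factorial(n)
                phi[i] += w * (value(set(s) | {i}) - value(set(s)))
    return phi


def toy_model(z):
    # Hypothetical response: two direct effects plus one interaction term.
    p, t = z
    return 3.0 * p - 2.0 * t + 4.0 * p * t


x = [1.0, 1.0]
baseline = [0.0, 0.0]
phi = shapley_values(toy_model, x, baseline)
print(phi)  # [5.0, 0.0]
```

The two attributions sum to the difference between the prediction and the baseline output, as additivity requires. In this toy case the interaction term is split equally between the two features, so the second feature receives an attribution of zero even though it has a direct negative effect. This is the kind of coupling that per-feature SHAP values can hide.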

5. Conclusions

Since the launch of the GRACE satellites in 2002, TWS in China has been observed to decline at a rate of approximately 2 mm a⁻¹. To uncover the main drivers of these TWS losses and their mechanisms, we employed XGBoost and SHAP to identify the spatial pattern of TWS in China and to quantify the individual relative contributions of various hydroclimatic factors. Our results reveal that the XGBoost model can efficiently reproduce TWS patterns across China, achieving an R² of 0.98 and an RMSE of 22.88 mm. Furthermore, the SHAP method showed that, in general, precipitation and soil moisture contribute most significantly to TWS spatial variability, with predominantly positive impacts. Surface pressure, surface runoff, and temperature also affect TWS, but their effects are largely negative.

Spatially, the dominant driving factors exhibit pronounced heterogeneity. Precipitation emerges as the primary driver in southern China and the North China Plain, while soil moisture plays a leading role across the Tibetan Plateau. In northwestern China, surface pressure, snow melt runoff, and groundwater runoff are identified as key determinants of TWS distribution. Furthermore, our results reveal that the effects of most factors are nonlinear: the importance of precipitation decreases beyond a certain threshold, while the effect of surface pressure shifts from positive to negative once its value exceeds a certain level. These findings enhance our understanding of the spatial patterns of TWS in China and offer valuable insights for more effective water resource management and allocation. Moreover, the interpretable machine learning model based on XGBoost and SHAP demonstrates strong adaptability in simulating TWS patterns, underscoring the applicability of explainable machine learning in hydrological research.

Author Contributions

Conceptualization, X.M., H.H. and X.C.; data curation, Q.Y.; methodology, X.M., H.H. and J.C.; project administration, X.C.; software, X.M. and J.C.; validation, Q.Y.; visualization, H.H.; writing—original draft, X.M. and H.H.; writing—review and editing, X.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Natural Science Foundation of China (42375165), the National Key Research and Development Program of China (2023YFF0805501), and the Innovation and Entrepreneurship Training Program for College Students of Sun Yat-sen University.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Gleeson, T.; Wada, Y.; Bierkens, M.F.; van Beek, L.P. Water Balance of Global Aquifers Revealed by Groundwater Footprint. Nature 2012, 488, 197–200.
2. Güntner, A.; Stuck, J.; Werth, S.; Döll, P.; Verzano, K.; Merz, B. A Global Analysis of Temporal and Spatial Variations in Continental Water Storage. Water Resour. Res. 2007, 43, W05416.
3. Cai, B.; Zhang, W.; Hubacek, K.; Feng, K.; Li, Z.; Liu, Y.; Liu, Y. Drivers of Virtual Water Flows on Regional Water Scarcity in China. J. Clean. Prod. 2019, 207, 1112–1122.
4. Chen, J.; Tapley, B.; Wilson, C.; Cazenave, A.; Seo, K.W.; Kim, J.S. Global Ocean Mass Change from GRACE and GRACE Follow-on and Altimeter and Argo Measurements. Geophys. Res. Lett. 2020, 47, e2020GL090656.
5. Eicker, A.; Forootan, E.; Springer, A.; Longuevergne, L.; Kusche, J. Does GRACE See the Terrestrial Water Cycle “Intensifying”? J. Geophys. Res.-Atmos. 2016, 121, 733–745.
6. Girotto, M.; De Lannoy, G.J.M.; Reichle, R.H.; Rodell, M.; Draper, C.; Bhanja, S.N.; Mukherjee, A. Benefits and Pitfalls of GRACE Data Assimilation: A Case Study of Terrestrial Water Storage Depletion in India. Geophys. Res. Lett. 2017, 44, 4107–4115.
7. Kim, J.S.; Seo, K.W.; Kim, B.H.; Ryu, D.; Chen, J.L.; Wilson, C. High-Resolution Terrestrial Water Storage Estimates from GRACE and Land Surface Models. Water Resour. Res. 2024, 60, e2023WR035483.
8. Loomis, B.D.; Felikson, D.; Sabaka, T.J.; Medley, B. High-Spatial-Resolution Mass Rates from GRACE and GRACE-FO: Global and Ice Sheet Analyses. J. Geophys. Res.-Solid Earth 2021, 126, e2021JB023024.
9. Munekane, H. Ocean Mass Variations from GRACE and Tsunami Gauges. J. Geophys. Res.-Solid Earth 2007, 112, B07403.
10. Seo, K.W.; Wilson, C.R.; Famiglietti, J.S.; Chen, J.L.; Rodell, M. Terrestrial Water Mass Load Changes from Gravity Recovery and Climate Experiment (GRACE). Water Resour. Res. 2006, 42, W05417.
11. Mo, X.; Wu, J.; Wang, Q.; Zhou, H. Variations in Water Storage in China over Recent Decades from GRACE Observations and GLDAS. Nat. Hazards Earth Syst. Sci. 2016, 16, 469–482.
12. Liu, R.; Zhong, B.; Li, X.; Zheng, K.; Liang, H.; Cao, J.; Yan, X.; Lyu, H. Analysis of Groundwater Changes (2003–2020) in the North China Plain Using Geodetic Measurements. J. Hydrol. Reg. Stud. 2022, 41, 101085.
13. Yang, T.; Wang, C.; Yu, Z.B.; Xu, F. Characterization of Spatio-Temporal Patterns for Various GRACE- and GLDAS-Born Estimates for Changes of Global Terrestrial Water Storage. Glob. Planet. Chang. 2013, 109, 30–37.
14. Qi, W.; Liu, J.G.; Yang, H.; Zhu, X.P.; Tian, Y.; Jiang, X.; Huang, X.; Feng, L. Large Uncertainties in Runoff Estimations of GLDAS Versions 2.0 and 2.1 in China. Earth Space Sci. 2020, 7, e2019EA000829.
15. Save, H.; Bettadpur, S.; Tapley, B.D. High-Resolution CSR GRACE RL05 Mascons. J. Geophys. Res.-Solid Earth 2016, 121, 7547–7569.
16. Syed, T.H.; Famiglietti, J.S.; Rodell, M.; Chen, J.; Wilson, C.R. Analysis of Terrestrial Water Storage Changes from GRACE and GLDAS. Water Resour. Res. 2008, 44, W02433.
17. The National Climate Center (NCC) of the China Meteorological Administration (CMA). Blue Book on Climate Change in China (2022); Science Press: Beijing, China, 2022.
18. Houborg, R.; Rodell, M.; Li, B.L.; Reichle, R.; Zaitchik, B.F. Drought Indicators Based on Model-Assimilated Gravity Recovery and Climate Experiment (GRACE) Terrestrial Water Storage Observations. Water Resour. Res. 2012, 48, W07525.
19. Kraaijenbrink, P.D.A.; Bierkens, M.F.P.; Lutz, A.F.; Immerzeel, W.W. Impact of a Global Temperature Rise of 1.5 Degrees Celsius on Asia’s Glaciers. Nature 2017, 549, 257–260.
20. Pokhrel, Y.N.; Hanasaki, N.; Yeh, P.J.F.; Yamada, T.J.; Kanae, S.; Oki, T. Model Estimates of Sea-Level Change Due to Anthropogenic Impacts on Terrestrial Water Storage. Nat. Geosci. 2012, 5, 389–392.
21. Reager, J.T.; Thomas, B.F.; Famiglietti, J.S. River Basin Flood Potential Inferred Using GRACE Gravity Observations at Several Months Lead Time. Nat. Geosci. 2014, 7, 588–592.
22. Kim, H.; Yeh, P.J.F.; Oki, T.; Kanae, S. Role of Rivers in the Seasonal Variations of Terrestrial Water Storage over Global Basins. Geophys. Res. Lett. 2009, 36, L17402.
23. Scanlon, B.R.; Zhang, Z.; Save, H.; Sun, A.Y.; Müller Schmied, H.; van Beek, L.P.H.; Wiese, D.N.; Wada, Y.; Long, D.; Reedy, R.C.; et al. Global Models Underestimate Large Decadal Declining and Rising Water Storage Trends Relative to GRACE Satellite Data. Proc. Natl. Acad. Sci. USA 2018, 115, E1080–E1089.
24. Al-Tameemi, M.A.; Chukin, V.V. Global Water Cycle and Solar Activity Variations. J. Atmos. Sol.-Terr. Phys. 2016, 142, 55–59.
25. Li, C.; Yu, Q.; Zhang, Y.; Ma, N.; Tian, J.; Zhang, X. Dominant Drivers for Terrestrial Water Storage Changes Are Different in Northern and Southern China. J. Geophys. Res. Atmos. 2023, 128, e2022JD038074.
26. Awange, J.L.; Forootan, E.; Fleming, K.; Odhiambo, G. Dominant Patterns of Water Storage Changes in the Nile Basin during 2003–2013. In Remote Sensing of the Terrestrial Water Cycle; John Wiley & Sons: Hoboken, NJ, USA, 2014; pp. 367–381.
27. Xie, X.; He, B.; Guo, L.; Miao, C.; Zhang, Y. Detecting Hotspots of Interactions between Vegetation Greenness and Terrestrial Water Storage Using Satellite Observations. Remote Sens. Environ. 2019, 231, 111259.
28. Yang, B.; Li, Y.; Tao, C.; Cui, C.; Hu, F.; Cui, Q.; Meng, L.; Zhang, W. Variations and Drivers of Terrestrial Water Storage in Ten Basins of China. J. Hydrol. Reg. Stud. 2023, 45, 101286.
29. Reichstein, M.; Camps-Valls, G.; Stevens, B.; Jung, M.; Denzler, J.; Carvalhais, N.; Prabhat, F. Deep Learning and Process Understanding for Data-Driven Earth System Science. Nature 2019, 566, 195–204.
30. Kratzert, F.; Klotz, D.; Brenner, C.; Schulz, K.; Herrnegger, M. Rainfall–Runoff Modelling Using Long Short-Term Memory (LSTM) Networks. Hydrol. Earth Syst. Sci. 2018, 22, 6005–6022.
31. Fang, K.; Shen, C.; Kifer, D.; Yang, X. Prolongation of SMAP to Spatiotemporally Seamless Coverage of Continental US Using a Deep Learning Neural Network. Geophys. Res. Lett. 2017, 44, 11030–11039.
32. Fang, K.; Pan, M.; Shen, C. The Value of SMAP for Long-Term Soil Moisture Estimation with the Help of Deep Learning. IEEE Trans. Geosci. Remote Sens. 2018, 57, 2221–2233.
33. Bruss, C.B.; Nateghi, R.; Zaitchik, B.F. Explaining National Trends in Terrestrial Water Storage. Front. Environ. Sci. 2019, 7, 85.
34. Wang, G.; Yang, J.; Hu, Y.; Li, J.; Yin, Z. Application of a Novel Artificial Neural Network Model in Flood Forecasting. Environ. Monit. Assess. 2022, 194, 125.
35. Babaei, M.; Moeini, R.; Ehsanzadeh, E. Artificial Neural Network and Support Vector Machine Models for Inflow Prediction of Dam Reservoir (Case Study: Zayandehroud Dam Reservoir). Water Resour. Manag. 2019, 33, 2203–2218.
36. Gao, G.; Yao, L.; Li, W.; Zhang, L.; Zhang, M. Onboard Information Fusion for Multisatellite Collaborative Observation: Summary, Challenges, and Perspectives. IEEE Geosci. Remote Sens. Mag. 2023, 11, 40–59.
37. Gilpin, L.H.; Bau, D.; Yuan, B.Z.; Bajwa, A.; Specter, M.; Kagal, L. Explaining Explanations: An Overview of Interpretability of Machine Learning. arXiv 2018, arXiv:1806.00069.
38. Molnar, C. Interpretable Machine Learning; Lulu.com: Morrisville, NC, USA, 2020.
39. Jing, W.; Yao, L.; Zhao, X.; Zhang, P.; Liu, Y.; Xia, X.; Song, J.; Yang, J.; Li, Y.; Zhou, C. Understanding Terrestrial Water Storage Declining Trends in the Yellow River Basin. J. Geophys. Res. Atmos. 2019, 124, 12963–12984.
40. Jing, W.L.; Zhang, P.Y.; Zhao, X.D. A Comparison of Different GRACE Solutions in Terrestrial Water Storage Trend Estimation over Tibetan Plateau. Sci. Rep. 2019, 9, 1765.
41. Yin, J.; Slater, L.J.; Khouakhi, A.; Yu, L.; Liu, P.; Li, F.; Pokhrel, Y.; Gentine, P. GTWS-MLrec: Global Terrestrial Water Storage Reconstruction by Machine Learning from 1940 to Present. Earth Syst. Sci. Data 2023, 15, 5597–5615.
42. Rodell, M.; Houser, P.R.; Jambor, U.; Gottschalck, J.; Mitchell, K.; Meng, C.J.; Arsenault, K.; Cosgrove, B.; Radakovich, J.; Bosilovich, M.; et al. The Global Land Data Assimilation System. Bull. Am. Meteorol. Soc. 2004, 85, 381–394.
43. Li, B.; Rodell, M.; Zaitchik, B.F.; Reichle, R.H.; Koster, R.D.; van Dam, T.M. Assimilation of GRACE Terrestrial Water Storage into a Land Surface Model: Evaluation and Potential Value for Drought Monitoring in Western and Central Europe. J. Hydrol. 2012, 446–447, 103–115.
44. Chen, J.L.; Wilson, C.R.; Ries, J.C. Broadband Assessment of Degree-2 Gravitational Changes from GRACE and Other Estimates, 2002–2015. J. Geophys. Res.-Solid Earth 2016, 121, 2112–2128.
45. Cui, W.; Wang, W.; Wang, X.; Chen, X. Evaluation of GLDAS-1 and GLDAS-2 Forcing Data and Noah Model Simulations over China at the Monthly Scale. J. Hydrometeorol. 2016, 17, 2815–2833.
46. Ji, L.; Senay, G.; Verdin, J. Evaluation of the Global Land Data Assimilation System (GLDAS) Air Temperature Data Products. J. Hydrometeorol. 2015, 16, 2463–2480.
47. Pang, Y.; Wu, B.; Cao, Y.; Jia, X. Spatiotemporal Changes in Terrestrial Water Storage in the Beijing-Tianjin Sandstorm Source Region from GRACE Satellites. Int. Soil Water Conserv. Res. 2020, 8, 295–307.
48. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. arXiv 2016, arXiv:1603.02754.
49. Bi, Y.; Xiang, D.X.; Ge, Z.Y.; Li, F.Y.; Jia, C.Z.; Song, J.N. An Interpretable Prediction Model for Identifying N7-Methylguanosine Sites Based on XGBoost and SHAP. Mol. Ther.-Nucleic Acids 2020, 22, 362–372.
50. Can, R.; Kocaman, S.; Gokceoglu, C. A Comprehensive Assessment of XGBoost Algorithm for Landslide Susceptibility Mapping in the Upper Basin of Ataturk Dam, Turkey. Appl. Sci. 2021, 11, 4993.
51. Feng, D.-C.; Wang, W.-J.; Mangalathu, S.; Taciroglu, E. Interpretable XGBoost-SHAP Machine-Learning Model for Shear Strength Prediction of Squat RC Walls. J. Struct. Eng. 2021, 147, 04021173.
52. Jing, W.; Zhao, X.; Yao, L.; Di, L.; Yang, J.; Li, Y.; Guo, L.; Zhou, C. Can Terrestrial Water Storage Dynamics Be Estimated from Climate Anomalies? Earth Space Sci. 2020, 7, e2019EA000959.
53. Ali, S.; Khorrami, B.; Jehanzaib, M.; Tariq, A.; Ajmal, M.; Arshad, A.; Shafeeque, M.; Dilawar, A.; Basit, I.; Zhang, L.L.; et al. Spatial Downscaling of GRACE Data Based on XGBoost Model for Improved Understanding of Hydrological Droughts in the Indus Basin Irrigation System (IBIS). Remote Sens. 2023, 15, 873.
54. Lundberg, S. A Unified Approach to Interpreting Model Predictions. arXiv 2017, arXiv:1705.07874.
55. Chelgani, S.C.; Nasiri, H.; Alidokht, M. Interpretable Modeling of Metallurgical Responses for an Industrial Coal Column Flotation Circuit by XGBoost and SHAP-a “Conscious-Lab” Development. Int. J. Min. Sci. Technol. 2021, 31, 1135–1144.
56. Meng, Y.; Yang, N.; Qian, Z.; Zhang, G. What Makes an Online Review More Helpful: An Interpretation Framework Using XGBoost and SHAP Values. J. Theor. Appl. Electron. Commer. Res. 2020, 16, 466–490.
57. Li, Z. Extracting Spatial Effects from Machine Learning Model Using Local Interpretation Method: An Example of SHAP and XGBoost. Comput. Environ. Urban Syst. 2022, 96, 101845.
58. Zhou, Q.; Huang, J.; Hu, Z.; Yin, G. Spatial-Temporal Changes to GRACE-Derived Terrestrial Water Storage in Response to Climate Change in Arid Northwest China. Hydrol. Sci. J. 2022, 67, 535–549.
59. Zhao, M.; Geruo, A.; Zhang, J.; Velicogna, I.; Liang, C.; Li, Z. Ecological Restoration Impact on Total Terrestrial Water Storage. Nat. Sustain. 2021, 4, 56–62.
60. Ji, R.; Wang, C.; Cui, A.; Jia, M.; Liao, S.; Wang, W.; Chen, N. Assessing Terrestrial Water Storage Dynamics and Multiple Factors Driving Forces in China from 2005 to 2020. J. Environ. Manag. 2024, 370, 122464.
61. Jasechko, S.; Sharp, Z.D.; Gibson, J.J.; Birks, S.J.; Yi, Y.; Fawcett, P.J. Terrestrial Water Fluxes Dominated by Transpiration. Nature 2013, 496, 347–350.
62. Zhao, Y.; Xiang, W.; Zhang, X.; Xie, S.; Yan, S.; Wu, C.; Liu, Y. Mechanistic Study on Laccase-Mediated Formation of Fe-OM Associations in Peatlands. Geoderma 2020, 375, 114502.
63. Yu, D.; Zhu, W.; Pan, Y. The Role of Atmospheric Circulation System Playing in Coupling Relationship between Spring NPP and Precipitation in East Asia Area. Environ. Monit. Assess. 2008, 145, 135–143.
64. Tian, S.; Renzullo, L.J.; Pipunic, R.C.; Lerat, J.; Sharples, W.; Donnelly, C. Satellite Soil Moisture Data Assimilation for Improved Operational Continental Water Balance Prediction. Hydrol. Earth Syst. Sci. 2021, 25, 4567–4584.
65. Zhang, Y.; He, B.; Guo, L.; Liu, J.; Xie, X. The Relative Contributions of Precipitation, Evapotranspiration, and Runoff to Terrestrial Water Storage Changes across 168 River Basins. J. Hydrol. 2019, 579, 124194.
66. Zhang, Z.; Li, H.; Zheng, W.; Whattam, S.A.; Zhu, Z.; Jiang, W.; Zhao, D. Response of Paleogene Fine-Grained Clastic Rock Deposits in the South Qiangtang Basin to Environments and Thermal Events on the Qinghai-Tibet Plateau. ACS Omega 2023, 8, 26458–26478.
67. Deng, H.; Chen, Y.; Chen, X. Driving Factors and Changes in Components of Terrestrial Water Storage in the Endorheic Tibetan Plateau. J. Hydrol. 2022, 612, 128225.
68. Zhang, Q.; Xu, C.-Y.; Tao, H.; Jiang, T.; Chen, Y.D. Climate Changes and Their Impacts on Water Resources in the Arid Regions: A Case Study of the Tarim River Basin, China. Stoch. Environ. Res. Risk Assess. 2010, 24, 349–358.
69. Abuduwaili, J.; Issanova, G.; Saparov, G. Water Resources and Impact of Climate Change on Water Resources in Central Asia. In Hydrology and Limnology of Central Asia; Springer: Singapore, 2019; pp. 1–9.
70. Scanlon, B.R.; Keese, K.E.; Flint, A.L.; Flint, L.E.; Gaye, C.B.; Edmunds, W.M.; Simmers, I. Global Synthesis of Groundwater Recharge in Semiarid and Arid Regions. Hydrol. Process. 2006, 20, 3335–3370.
71. Meng, F.; Su, F.; Li, Y.; Tong, K. Changes in Terrestrial Water Storage during 2003–2014 and Possible Causes in Tibetan Plateau. J. Geophys. Res.-Atmos. 2019, 124, 2909–2931.
72. Qian, A.; Yi, S.; Chang, L.; Sun, G.; Liu, X. Using GRACE Data to Study the Impact of Snow and Rainfall on Terrestrial Water Storage in Northeast China. Remote Sens. 2020, 12, 4166.
73. Siebert, S.; Burke, J.; Faures, J.-M.; Frenken, K.; Hoogeveen, J.; Döll, P.; Portmann, F.T. Groundwater Use for Irrigation—A Global Inventory. Hydrol. Earth Syst. Sci. 2010, 14, 1863–1880.
74. Dong, N.; Wei, J.; Yang, M.; Yan, D.; Yang, C.; Gao, H.; Arnault, J.; Laux, P.; Zhang, X.; Liu, Y. Model Estimates of China’s Terrestrial Water Storage Variation Due to Reservoir Operation. Water Resour. Res. 2022, 58, e2021WR031787.

Figure 1. Illustration of the geographic location and elevations of China.

Figure 2. The flowchart and schematic diagram illustrating XML framework developed in this study for TWSA reconstruction and interpretability analysis.

Figure 3. Model evaluation results on the testing set: spatial distribution of TWS observations (a) and XGBoost predictions (b), which are almost identical; scatter plot of comparison between observed TWS and predicted TWS (c); box plot for evaluation metrics of residuals, MAPE, RMSE (d).

Figure 4. Spatial distribution of MAPE. The height of the bars in the bar chart (top left) represents the proportion of cells within different MAPE ranges.

Figure 5. Bar plots of feature importance based on the XGBoost model using GBI (a) and SHAP (b) methods.

Figure 6. Spatial distribution of dominant driving factors for TWS across China, based on SHAP analysis. The heights of the bars (bottom left) indicate the percentage of the grid cells dominated by the corresponding drivers.

Figure 7. SHAP dependence plots depicting the SHAP main values along the gradient of (a) P, (b) SM, (c) Rg, (d) SP, (e) T, (f) Rs, (g) SWE, (h) Rm. The horizontal axis indicates the gradient of variable values. The vertical axis indicates the magnitude of the SHAP main values. Positive SHAP values indicate a positive influence on the TWS output, while negative SHAP values indicate a negative influence.

Figure 8. Spatial distribution maps for multi-year averages of TWS and hydroclimatic factors: TWS (a), P (b), SM (c), Rg (d), SP (e), T (f), Rs (g), SWE (h), and Rm (i).

Table 1. Summary of the dataset used in this study.

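Data Set	Variable Name	Acronym	Unit	Date	Spatial Resolution	Temporal Resolution
GLDAS-CLSM (Version 2.2)	Terrestrial Water Storage	TWS	mm	Jan 2003–Sep 2022	0.25°	Daily
GLDAS-Noah (Version 2.1)	Baseflow-groundwater runoff	Rg	mm	Jan 2000–Sep 2022	0.25°	3 h
GLDAS-Noah (Version 2.1)	Surface snow melt amount	Rm	mm	Jan 2000–Sep 2022	0.25°	3 h
GLDAS-Noah (Version 2.1)	Air temperature	T	K	Jan 2000–Sep 2022	0.25°	3 h
GLDAS-Noah (Version 2.1)	Precipitation	P	mm	Jan 2000–Sep 2022	0.25°	3 h
GLDAS-Noah (Version 2.1)	Surface runoff	Rs	mm	Jan 2000–Sep 2022	0.25°	3 h
GLDAS-Noah (Version 2.1)	Soil moisture	SM	mm	Jan 2000–Sep 2022	0.25°	3 h
GLDAS-Noah (Version 2.1)	Surface air pressure	SP	Pa	Jan 2000–Sep 2022	0.25°	3 h
GLDAS-Noah (Version 2.1)	Snow depth water equivalent	SWE	mm	Jan 2000–Sep 2022	0.25°	3 h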

Table 2. Hyperparameters, search ranges, and final selected values used for XGBoost model optimization.

Hyperparameter	Search Range	Final Value
n_estimators	(800, 1200)	1200
learning_rate	(0.05, 0.2)	0.07
max_depth	(8, 12)	10
subsample	(0.6, 0.8)	0.6
reg_alpha	(1, 3)	2
reg_lambda	(1, 5)	3
gamma	(0.3, 0.8)	0.5
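
As a quick consistency check on Table 2, the following minimal sketch encodes the search ranges and final values and reports which selected values sit on the edge of their range. The helper `on_boundary` and the dictionary layout are hypothetical and are not taken from the study's code. The parameter names appear to match those of the scikit-learn style XGBoost interface, so `final_params` could presumably be passed to such a regressor as keyword arguments, although the source does not state how the model was instantiated.

```python
# Search ranges and final values copied from Table 2.
table2 = {
    "n_estimators": ((800, 1200), 1200),
    "learning_rate": ((0.05, 0.2), 0.07),
    "max_depth": ((8, 12), 10),
    "subsample": ((0.6, 0.8), 0.6),
    "reg_alpha": ((1, 3), 2),
    "reg_lambda": ((1, 5), 3),
    "gamma": ((0.3, 0.8), 0.5),
}


def on_boundary(table):
    """Return the names whose final value equals an end of its search range."""
    return [name for name, ((lo, hi), final) in table.items() if final in (lo, hi)]


# Final values only, keyed by hyperparameter name.
final_params = {name: final for name, (_, final) in table2.items()}

# Every selected value should lie inside its search range.
all_in_range = all(lo <= final <= hi for (lo, hi), final in table2.values())

at_edge = on_boundary(table2)
print(all_in_range)  # True
print(at_edge)       # ['n_estimators', 'subsample']
```

Two of the selected values, `n_estimators` and `subsample`, coincide with a limit of their search range. When an optimum lands on a boundary, a wider range is sometimes worth exploring, although whether that would change the results here cannot be determined from the table alone.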

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

