Next Article in Journal
Effectiveness of Wetlands for Improving Different Water Quality Parameters in Various Climatic Conditions
Previous Article in Journal
Precision Identification of Irrigated Areas in Semi-Arid Regions Using Optical-Radar Time-Series Features and Ensemble Machine Learning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Effects of Climate Variables and Human Activities on Groundwater Level Fluctuations in Unconsolidated Sedimentary Aquifers: A Data-Driven Approach

1
College of Geoscience and Surveying Engineering, China University of Mining and Technology (Beijing), Beijing 100083, China
2
Qingdao Key Laboratory of Groundwater Resources Protection and Rehabilitation, Qingdao 266101, China
3
Key Laboratory of Geological Safety of Coastal Urban Underground Space, Ministry of Natural Resources, Qingdao Geo-Engineering Surveying Institute, Qingdao 266101, China
*
Author to whom correspondence should be addressed.
Hydrology 2025, 12(8), 215; https://doi.org/10.3390/hydrology12080215
Submission received: 4 July 2025 / Revised: 13 August 2025 / Accepted: 14 August 2025 / Published: 15 August 2025

Abstract

Groundwater level (GWL) in unconfined aquifers is highly susceptible to climate variables and human activities, exhibiting nonlinear fluctuations; these can further contribute to or exacerbate environmental hazards, such as land subsidence. Understanding the relationship between GWL changes and external conditions is essential for effective groundwater resource management and ecological protection. However, this relationship remains unclear and variable. This study systematically analyzes the correlations between climate and human factors and GWLs, using data from monitoring stations in the unconsolidated sedimentary aquifers of Beijing, China. It evaluates the importance of influencing factors on GWL simulation accuracy and tests how different inputs affect simulation performance. The results indicate that human factors are more strongly correlated with GWLs, yet climate factors hold higher importance scores. In GWL simulations, different input variables yield varying accuracy, with the inclusion of precipitation notably decreasing simulation precision because of its lagged or indirect effects on groundwater levels. The variation in accuracy across monitoring stations further suggests that the primary differences may stem from the GWL data itself. These findings underscore the need for high-resolution, localized data and tailored input selection to improve GWL projections and inform adaptive water-resource strategies under changing climatic and anthropogenic pressures.

1. Introduction

Groundwater plays a pivotal role in the global hydrological cycle, serving as a principal source of freshwater for a range of human activities, including drinking, irrigation, and industrial uses [1,2]. A reduction in groundwater level (GWL) can result in soil salinization, particularly in arid or semi-arid regions, which may have an impact on drinking or irrigation water [3,4]. Additionally, coastal areas are susceptible to seawater intrusion due to the lowering of GWLs, which represents a threat to freshwater resources [5,6,7]. Furthermore, the excessive exploitation of groundwater may also lead to ground subsidence, which in turn affects the stability of buildings and infrastructure [8,9]. It is therefore of great practical significance to be able to accurately model and predict changes in GWLs in order to manage regional water resources effectively and to prevent geological disasters.
Groundwater aquifers and hydrogeological conditions determine the characteristics of changes in GWLs [10,11]. Groundwater aquifers dominate the pathways of groundwater recharge and discharge [12,13]. An unconfined aquifer in groundwater exists above the first stable aquifer below the surface and is mainly recharged by precipitation and surface water infiltration and is susceptible to rapid responses to changes in moisture due to precipitation, evapotranspiration, and human activities, which can affect its water level fluctuations and water quality [14,15]. Hydrogeological settings—whether porous, fractured, or karstic—govern the hydraulic conductivity, storativity, and degree of hydraulic connectivity between the saturated zone and the vadose zone, thereby modulating the intensity of groundwater–surface exchange (recharge and discharge) [16]. Unconsolidated sedimentary aquifers, often characterized by high porosity and permeability, can facilitate rapid recharge but also exhibit a high vulnerability to contamination and surface impacts [17,18,19]. Therefore, the study of unconfined aquifers in areas of unconsolidated sedimentary aquifers is crucial.
The water tables of unconfined aquifers exhibit heightened sensitivity to climatic perturbations and anthropogenic pressures [20,21]. From a climate perspective, factors such as precipitation variability, temperature fluctuations, and prolonged droughts can directly influence groundwater recharge and evaporation rates, leading to fluctuations in the water table. Increased temperatures may also intensify evapotranspiration, further reducing GWLs [22]. From the perspective of anthropogenic activities, the extensive extraction of groundwater to meet the demands of agricultural irrigation, industrial operations, and urban expansion has been identified as a significant factor contributing to the rapid depletion of groundwater reserves and the subsequent decline of the water table [23]. Additionally, deforestation and urbanization alter the natural recharge patterns, further exacerbating the pressure on unconfined aquifers [24,25]. These combined effects underscore the need for effective groundwater management to mitigate potential impacts on water availability and ecosystem health.
Because traditional methods struggle to capture the nonlinear variations in groundwater levels, researchers have increasingly turned to machine learning techniques for improved modelling accuracy [26,27,28]. Physical-based models for groundwater simulation have been widely used to understand and predict groundwater dynamics [29,30,31,32]. While these models provide valuable insights, they often require extensive parameters and can be computationally intensive [33]. In recent years, data-driven models have emerged as a promising alternative for GWL simulation [34,35,36]. These models leverage large datasets and machine learning techniques to predict GWLs, offering a potentially more efficient and scalable approach. Among these models, the long short-term memory (LSTM) model stands out due to its ability to effectively handle time series data and capture the complex, nonlinear relationships between groundwater levels and various driving factors. The vanishing gradient problem occurs during the training of neural networks, particularly deep networks, where gradients become exceedingly small, making it difficult for the network to update the weights and learn effectively. This issue is addressed in LSTM models [37]. The LSTM model’s unique “gate” structure allows it to address the vanishing gradient problem that often plagues traditional recurrent neural networks, making it particularly well-suited for simulating the dynamic and nonlinear behavior of groundwater systems.
This study aims to investigate the effects of climate and human activities on GWLs through data-driven modeling approaches. Specifically, we analyze the correlation between key driving factors (climate factors and anthropogenic factors) and GWL changes, assess the relative importance of these factors in influencing groundwater dynamics, and investigate the effects of combining different factors on GWL prediction using LSTM models. We evaluated the effectiveness of these models in simulating GWLs and identified which combinations of input factors lead to the best model fit. By providing insights into the primary drivers of groundwater variability, this research contributes to improved prediction and management strategies, offering a scientific basis for sustainable groundwater resource management in the face of climate change and intensified human activities.

2. Materials and Methods

2.1. Datasets

Located in northern China, Beijing (39.4°~41.6° N, 115.7°~117.4° E) has experienced a rapid population growth that has led to the over-exploitation of groundwater due to limited water resources (Figure 1), results in various geological disasters in the plain region [38,39]. The annual precipitation is about 644 mm, while the precipitation exhibits pronounced orographic gradients: it is markedly higher along the windward piedmont slopes of the northeastern and southwestern ranges, significantly lower in the deep mountainous areas of the north and northwest, and intermediate across the plains. The city governs 16 districts with a total area of 16,410 km2, urban and construction land amounts to approximately 3576 km2, while agricultural land totals about 11,476 km2, yielding a relative coverage ratio of urban to agricultural land of roughly 1:3.2. By the end of 2018, Beijing’s comprehensive forest coverage had attained 43.5%, of which 28.5% was concentrated in the plains. The corresponding forest stock volume was 17.98 million m3. Urban green coverage reached 48.44%, yielding a per capita provision of public green space of 16.3 m2. The Beijing Plain comprises quaternary alluvial fans and plains with thicknesses ranging from tens of meters to over five-hundred meters. The aquifer system is complex with variable sediment thickness and lithology. From the alluvial fans to the plains, the sediment thickness increases and grain size decreases, aquifer systems change from a single gravel aquifer to multiple aquifer systems of sand layers separated by silt and clay layers. The water bearing layers within approximately 50 m of the surface constitute the shallow aquifer; those situated below this depth are designated as deep. The large part of the shallow aquifer is unconfined. It receives all the natural groundwater recharge. The majority of agricultural wells are located in the shallow aquifer. Some water supply well fields are installed in the shallow aquifer on the top of alluvial fans. The deep aquifers are confined, recharged chiefly by leakage from the shallow system, and contain the majority of industrial water supply wells along with some drinking water well fields [40]. In this study, five different degrees of aquifer enrichment (the discharge of a single well when the drawdown is 5 m, expressed in m3/d, is classified as follows: less than 500 m3/d is extremely low water richness (EL); 500 ~ 1500 m3/d is low water richness (L); 1500 ~ 3000 m3/d is moderate water richness (M); 3000 ~ 5000 m3/d is high water richness (H); greater than 5000 m3/d is extremely high water richness (EH)) within the unconsolidated sedimentary aquifers formations were randomly selected [41]. The data, which cover the period from 2018 to 2020 with a daily frequency, were sourced from the Groundwater Center of the China Geological Survey Institute.
For the climate variables that were closely linked to GWLs, we selected five driving factors: air pressure, temperature, precipitation, evaporation, and sunlight. Air pressure represents the atmospheric pressure at the surface; temperature refers to the surface temperature; precipitation measures the accumulation of liquid and frozen water that falls on the earth’s surface, including rain and snowmelt; evaporation measures the amount of water that evaporates from the surface and transpires from vegetation, with negative values indicating evaporation and positive values indicating condensation; sunlight measures the total amount of solar irradiance accumulated over a period and is converted into a standard test (irradiance 1000 w/m2). The climate variables data used were sourced from the National Oceanic and Atmospheric Administration (NOAA). The data for these five climatic variables are at a daily frequency. Additionally, five factors related to human activities were selected as input variables: supply water, domestic water consumption, environmental water consumption, industrial water consumption, and agricultural water consumption. Among these, “supply water” denotes the total volume of groundwater that is extracted, treated at water plants, and delivered to the urban distribution network, while “environmental water consumption” refers to groundwater recharged to rivers, lakes, wetlands, or other ecological targets for environmental purposes. The data for these five human activities factors are at a yearly frequency. The human activities data were obtained from the Beijing Water Resources Bulletin.

2.2. Methods

Pearson correlation analysis was used to measure the linear relationship between GWLs and 10 driving factors (including climate factors and anthropogenic factors) among 5 sites. The Pearson correlation coefficient is a correlation coefficient that measures linear correlation between two sets of data [42]. It is the ratio between the covariance of two variables and the product of their standard deviations (Equation (1)). It is essentially a normalized measurement of the covariance, such that the result always has a value between −1 and 1.
ρ X , Y = c o v   ( X , Y ) σ X σ Y
To evaluate the impact of climate and human activity on GWL simulation, we utilized Recursive Feature Elimination (RFE) to analyze the importance of the driving factors. RFE works by reducing the size of feature sets based on the weight of each factor [43,44]. For this particular study, we chose to use random forests as our base model due to their robust anti-interference and anti-overfitting capabilities. Based on the GWL datasets from 5 monitoring stations, the important scores of 10 driving factors were calculated 5 times to counteract the impact of the random initial weights of the fitted model on model accuracy.
To accurately assess the impact of various driving factors on GWL simulation, a long short-term memory (LSTM) model was utilized to combine 10 factors (5 climate and 5 anthropogenic variables) with data from 5 groundwater monitoring stations with different levels of aquifer richness. This resulted in 50 distinct datasets for simulation and prediction purposes. The LSTM model is a recurrent neural network that utilizes a “gate” structure to effectively address the challenge of exploding/vanishing gradient problems that can arise during training [45]. Our study employed an LSTM model with a 3 × 100 structure (number of layers × number of neurons), trained over 50 epochs. To mitigate the effects of randomness in neural network simulations, each of the 50 datasets was trained 5 times and averaged for more reliable results, while the dataset was temporally split with 85% used for training and the remaining 15% for validation; a dropout rate of 0.2 was applied after each LSTM and dense layer, and the Adam optimizer (learning rate 0.001, β1 0.9, β2 0.999) was adopted because its combination of momentum and adaptive learning rates ensures rapid convergence and robust performance in groundwater-level time series forecasting.
In order to better show the effect of model simulation, we used Taylor diagrams [46] with standard deviation (STD) (Equation (2)), correlation coefficient (r) (Equation (3)), and root mean square error (RMSE) (Equation (4)) to represent the performance of the LSTM combined with different driving factors. Standard deviation is an indicator that describes the degree of data dispersion, which represents the magnitude of change in GWL in our study. The correlation coefficient was used to measure the strength of the linear relationship between simulated and actual groundwater level. RMSE was used to measure the difference between simulated and actual groundwater level. These three indexes can measure the effect of LSTM simulation from three aspects (magnitude of change, correlation, and deviation), and shown in a single graph, each simulation result was normalized by dividing the standard difference of the observation data.
S T D = i = 1 n ( y i y ¯ ) 2 n ,
r = i = 1 n ( x i x ¯ ) ( y i y ¯ ) i = 1 n ( x i x ¯ ) 2 i = 1 n ( y i y ¯ ) 2
R M S E = i = 1 n ( x i y i ) 2 n

3. Results

3.1. Correlation Analysis of Driving Factors

The absolute correlation between anthropogenic factors and GWLs was found to be higher than that observed between climate factors and GWLs (Figure 2, The specific values of the Pearson correlation analysis are presented in Table A1 in Appendix A). The former ranged from −0.745 to 0.688, while the latter ranged from −0.541 to 0.425. Furthermore, no significant correlation was identified between the climatic factors (air pressure, temperature, precipitation, and sunlight) and GWLs in Group EL. In Group H, only two factors have been identified as exhibiting a statistically significant correlation with the GWL: temperature and evaporation. The findings suggest that, within the same hydrogeological type, the driving factors and GWLs are not necessarily correlated.
Among the climatic factors, there is a weak positive correlation between air pressure and GWL (in Group L, M and EH), with the value of 0.254 ± 0.102 (mean ± standard deviation). The correlation between temperature and GWL is predominantly negative, with the exception of Group EL, and the value is −0.294 ± 0.178. However, there is no significant correlation between precipitation and GWL which correlation coefficient is −0.136 ± 0.061. The correlation between evaporation and sunlight and the water table is also inconclusive, with both positive and negative correlations observed.
In the correlation calculation of anthropogenic factors and GWL, the results of Group L are diametrically opposed to those of the other groups. In Group L, only the environmental water factor exhibits a significant positive correlation with GWL, while the others demonstrate a significant negative correlation with GWL. In Groups EL, M, and EH, there were significant positive correlations between GWL and the following four water types: supplying water, domestic water, industrial water, and agricultural water. Conversely, GWL is significantly negatively correlated with environmental water.

3.2. Importance of Driving Factors

In general, the driving factors related to climate were considered to be of greater importance than those related to human activity (Figure 3). This discrepancy may be attributed to the quality of the input data, as there was comparatively little change in the variables related to human activity in comparison to those related to climate. With regard to the climate factors, precipitation was assigned the lowest importance score (0.657 ± 0.240), while the remaining variables were ranked within the top four. The importance rating for sunlight and temperature in the GWL simulations was 2.341 ± 0.614 (mean ± standard deviation) and 2.000 ± 0.390, respectively. This is likely due to the fact that they are directly related to evaporation and plant transpiration, which play a key role in the groundwater recharge and depletion process. Furthermore, evaporation and air pressure were also identified as significant factors, with ratings of 1.673 ± 0.292 and 1.259 ± 0.492, respectively. These findings reinforce the crucial role of climatic variables in influencing GWLs.
Despite the relatively low and similar ratings of anthropogenic factors, they do exert some influence across different groups. The ratings of anthropogenic factors in group H are relatively high in comparison to the other groups. This may be attributed to the fact that the monitoring stations were selected in areas with high anthropogenic impacts, where human dependence on groundwater is higher. In group EH, the importance ratings of anthropogenic factors other than environmental water were low, which may indicate that groundwater extraction is limited in this area.

3.3. GWL Simulation Under Different Driving Factors

The evaluation of groundwater simulation results for various input variables was presented using the Taylor diagram (Figure 4). Our models were named after the driving factors, and we grouped GWL simulation assessment based on different levels of water richness. The simulation results within each group were found to be relatively concentrated. Notably, the results from EL and EH groups showed a higher correlation with the observed data, with correlation values close to 1 and low RMSE values. However, the results from L and M groups were comparatively poorer.
The findings indicate high correlations between simulations and observation in GWL, with r values ranging from 0.56 to 0.99. Air pressure was the most fitting model in two groups (EL, EH), with high r values of 0.98 and 0.99. Among the remaining three groups, sunlight, industrial water, and evaporation had the highest fitting coefficients, with 0.82 of group L, 0.80 of group M, and 0.93 of group H, respectively. For all the groups, at least three of the top five relevant models were anthropogenic factors. This suggests that human activities play a beneficial role in the simulation of GWL. However, regardless of the group, the precipitation simulation results had the least correlation, with r values ranging from 0.56 to 0.96.
The RMS measures the disparity between simulated and observed GWL and ranges from 0.07 m to 0.20 m. Among the EL group, domestic water has the lowest RMSE value, while three anthropogenic factors also have lower RMSE values within this group. For the L group, sunlight has the lowest RMSE value. Meanwhile, industrial water, evaporation, and air pressure have the lowest RMSE values in the M, H, and EH groups, respectively. In these three groups, anthropogenic factors have lower RMSE values, ranking as the top six. Precipitation has the highest RMSE value in each group, ranging from 0.11 m to 0.20 m. The exceptional results of the anthropogenic driving factor in both correlation coefficient and RMSE highlight the importance of adding anthropogenic variables in GWL simulation.
The STD reflects the range of variation within GWL data. A higher STD indicates large fluctuations, while a lower STD suggests relatively stable fluctuations. In our comparison, we utilized the STD of observed GWL within each group as the comparison object. If the STD of the simulated value exceeds that of the observed value in the group, the simulation does not accurately reflect the actual GWL mutation. Conversely, if the STD is lower, the simulation may introduce noise, leading to inaccurate results. Our findings reveal that group L and group M have the highest STD of observed values, suggesting the model poorly simulated sudden changes. In group EH, the STD of observed values is the lowest in the group, and the highest value belongs to precipitation (with STD of 0.48 m). In the EL group, precipitation also has the highest STD (0.40 m), while temperature has the lowest (0.36 m).

4. Discussion

4.1. Driving Factors Affecting Groundwater Level

The importance score places considerable weight on climatic factors, reflecting the long-term and far-reaching impact of climate change on groundwater recharge and depletion patterns. To illustrate, air pressure directly affects the pressure and flow state of groundwater system, and air pressure fluctuations can have a significant effect on GWL [47]. Precipitation represents a significant source of groundwater recharge, with the potential to exert an influence on GWLs in the short, medium, and long term [48]. However, the findings of this study indicate that the incorporation of precipitation has an adverse impact on the modelling of GWLs. This is likely due to the presence of impermeable surfaces in the vicinity of the monitoring sites and the correlation between precipitation and GWLs, which subsequently influences the observed response. Evaporation can be defined as the transfer of water from soil, water bodies, and vegetation surfaces to the atmosphere. This process results in the absorption of water from the soil, which in turn recharges soil water. A reduction in groundwater recharge will lead to a corresponding reduction in the water table, a phenomenon that is particularly evident in arid regions [49]. The influence of sunlight and surface temperatures on soil temperatures and evaporation rates is a significant factor in the regulation of GWLs. Therefore, under the combined influence of global warming and intrinsic climate cycles, long-term GWL projections must explicitly incorporate both the periodic and secular components of climatic forcings to accurately resolve the underlying multi-scale variability [50].
Unlike the effects of climate, human activities impart a more spatially heterogeneous influence on GWLs. To illustrate, factors such as industrial water use, agricultural irrigation, and residential water use are subject to rapid change and are susceptible to significant fluctuations in GWLs in the short term [51]. Therefore, although the impact of anthropogenic factors is low in the overall score, they can significantly disturb the dynamic balance of groundwater due to their short-term and controllable nature [52]. In practical applications, the model should be adapted to accommodate the short-term fluctuations in water levels induced by anthropogenic activities, which can be regarded as disturbances, in order to optimize the responsiveness of the prediction model.
The superimposed effects between climate and human activities are manifested in complex, nonlinear relationships in GWL changes. In arid or semi-arid regions, where agricultural water demand is on the rise and groundwater is extensively utilized for agricultural irrigation, this heightened demand serves as an amplifier in changes in GWLs, potentially leading to a scenario where agricultural water usage may outweigh climatic factors [53]. Conversely, in regions characterized by higher precipitation levels and sufficient groundwater recharge, agricultural water demand is diminished, leading to a reduced reliance on groundwater for human activities [54]. Consequently, the impact of climatic factors can be more significant. The complex superposition effect necessitates the incorporation of the interaction between climate and human activities into GWL change models in order to achieve more accurate predictions in practical applications. This interaction analysis can elucidate the sensitivity of human activities to the groundwater system under different climatic conditions, thereby providing a scientific basis for effective groundwater management.
Furthermore, data quality, spatial resolution and temporal resolution all have an impact on the importance score. Climatic factors (temperature, precipitation, evapotranspiration, etc.) can typically be obtained with high temporal resolution through long-term monitoring and satellite remote sensing. Consequently, the model is able to capture seasonal and interannual variations of climatic factors during the simulation process. In contrast, anthropogenic variables (e.g., agricultural water use, industrial water use, and domestic water use) typically lack data records with high temporal resolution and exhibit less variability in the data, which may be considered on an annual or longer time scale. Consequently, models may be unable to accurately represent the immediate effects of human activities on GWLs, resulting in relatively low significance scores. Enhancements in the precision and temporal resolution of anthropogenic data could facilitate a more comprehensive representation of their impact on GWLs.

4.2. Groundwater Level Simulation Based on Data Modelling

Firstly, the performance of single-input models generally outperformed multi-input models, suggesting that data constraints may play a significant role. When numerous variables are introduced, each requires a substantial dataset to provide stable, high-quality input. Insufficient data coverage or quality in certain features may reduce the model’s ability to capture complex relationships, leading to diminished performance when multi-input approaches are employed. Thus, single-input models may yield more accurate predictions because they rely on more robust, focused data.
Secondly, the simulation accuracy varied across monitoring stations despite using a consistent training process. This inconsistency may be attributed to localized hydrological conditions that influence groundwater dynamics, such as unique geological formations or micro-climatic patterns not fully captured by a general model. Additionally, differences in data quality across stations may introduce variability in the accuracy of predictions, indicating a need for location-specific model optimization. An additional potential explanation is the discrepancy between the spatial distribution of the impact factor data and that of the water table data.
Interestingly, the inclusion of precipitation as an input factor sometimes led to poorer simulation outcomes, due to its lagged or indirect effects on GWLs (Figure A1). The process of precipitation input to groundwater is complex and can be influenced by local factors such as soil permeability and vegetation cover, which may not be fully captured by the current dataset. Future work should consider incorporating additional variables and conducting careful calibration to better account for these complex interactions.

5. Conclusions

The present study delved into the impacts of climate variables and anthropogenic activities on groundwater levels and their implications for groundwater modeling using a data-driven approach. The correlation analysis revealed a pronounced connection between human activities and groundwater levels, with Pearson correlation coefficients ranging from −0.541 to 0.425 for anthropogenic factors, compared to −0.745 to 0.688 for climate factors. However, the importance scores indicated that climate variables exerted a more significant influence on groundwater level simulation. The ranking of factor weights is as follows: sunlight > temperature > evaporation > air pressure > environmental water > supplying water > agricultural water > domestic water > precipitation > industrial water.
The simulation of groundwater levels was found to be influenced by a multitude of input variables, each with a distinct effect. Single-input models generally outperformed multi-input models, suggesting that data constraints play a significant role. The simulation accuracy varied across monitoring stations, which may be attributed to localized hydrological conditions and differences in data quality. The inclusion of precipitation as an input factor sometimes led to poorer simulation outcomes, due to its lagged or indirect effects on groundwater levels.
This study underscores the need for tailored data inputs and localized approaches to enhance groundwater level predictions. Future research could further explore the synergistic effects of climate and human activities on groundwater levels. Utilizing higher-resolution meteorological and anthropogenic data may elevate the precision of simulations. Additionally, integrating different machine learning approaches, such as ensemble learning or hybrid models, holds promise for optimizing predictions of groundwater levels. Such an approach would better account for the uncertainties stemming from climate and anthropogenic activities, thereby providing a more robust scientific foundation for groundwater management and improving water protection strategies amidst climate change.

Author Contributions

Conceptualization, L.Y. and J.C.; Methodology, M.G., J.C., Z.L., C.L. and J.Y.; Investigation, M.G., J.C., C.H. and X.Y.; Visualization, M.G. and J.C.; Funding acquisition, W.S. and J.D.; Project administration, J.D.; Supervision, J.D.; Writing—original draft, L.Y. and M.G.; Writing—review and editing: all authors. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Shandong Provincial Natural Science Foundation (grant number ZR2022MD099),the Open Foundation of the Qingdao Key Laboratory of Groundwater Resources Protection and Rehabilitation(grant number DXSKF2023Y01).

Data Availability Statement

The data, which cover the period from 2018 to 2020 with a daily frequency, were sourced from the Groundwater Center of the China Geological Survey Institute (http://www.iheg.cgs.gov.cn. Accessed on 1 March 2025); the climate variables data used were sourced from the National Oceanic and Atmospheric Administration (https://www.noaa.gov. Accessed on 10 March 2025); the human activities data were obtained from the Beijing Water Resources Bulletin (https://swj.beijing.gov.cn/zwgk/szygb/. Accessed on 5 March 2025). Data will be made available on request.

Acknowledgments

We acknowledge the China Institute for Geo-Environmental Monitoring for the data support.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

AbbreviationMeaning
GWLgroundwater level
LSTMlong short-term memory
STDstandard deviation
RMSEroot mean square error
RMSroot mean square
RFERecursive Feature Elimination

Appendix A

Table A1. This is the results of the Pearson correlation analysis.
Table A1. This is the results of the Pearson correlation analysis.
Driving FactorsExtremely HighHighModerateLowExtremely Low
Air pressure0.1020.0210.3460.3150.006
Temperature−0.254−0.115−0.541−0.264−0.026
Precipitation−0.093−0.029−0.179−0.0440.025
Evaporation0.2430.1110.4250.187−0.118
Sunlight0.0680.001−0.115−0.322−0.008
Supplying water0.652−0.0150.401−0.7380.486
Domestic water0.482−0.0570.324−0.7210.605
Environmental water−0.691−0.026−0.4010.626−0.297
Industrial water0.557−0.0430.361−0.7450.576
Agricultural water0.6880.0090.409−0.6870.386
Figure A1. These are the results of the Pearson correlation analysis.
Figure A1. These are the results of the Pearson correlation analysis.
Hydrology 12 00215 g0a1

References

  1. Jha, M.K.; Shekhar, A.; Jenifer, M.A. Assessing groundwater quality for drinking water supply using hybrid fuzzy-GIS-based water quality index. Water Res. 2020, 179, 115867. [Google Scholar] [CrossRef]
  2. Kuang, X.; Liu, J.; Scanlon, B.R.; Jiao, J.J.; Jasechko, S.; Lancia, M.; Biskaborn, B.K.; Wada, Y.; Li, H.; Zeng, Z.; et al. The changing nature of groundwater in the global water cycle. Science 2024, 383, eadf0630. [Google Scholar] [CrossRef] [PubMed]
  3. Tarolli, P.; Luo, J.; Park, E.; Barcaccia, G.; Masin, R. Soil salinization in agriculture: Mitigation and adaptation strategies combining nature-based solutions and bioengineering. iScience 2024, 27, 108830. [Google Scholar] [CrossRef] [PubMed]
  4. Hagage, M.; Abdulaziz, A.M.; Elbeih, S.F.; Hewaidy, A.G.A. Monitoring soil salinization and waterlogging in the northeastern Nile Delta linked to shallow saline groundwater and irrigation water quality. Sci. Rep. 2024, 14, 27838. [Google Scholar] [CrossRef]
  5. Klassen, J.; Allen, D.M. Assessing the risk of saltwater intrusion in coastal aquifers. J. Hydrol. 2017, 551, 730–745. [Google Scholar] [CrossRef]
  6. Michael, H.A.; Post, V.E.A.; Wilson, A.M.; Werner, A.D. Science, society, and the coastal groundwater squeeze. Water Resour. Res. 2017, 53, 2610–2617. [Google Scholar] [CrossRef]
  7. Werner, A.D.; Bakker, M.; Post, V.E.A.; Vandenbohede, A.; Lu, C.; Ataie-Ashtiani, B.; Simmons, C.T.; Barry, D.A. Seawater intrusion processes, investigation and management: Recent advances and future challenges. Adv. Water Resour. 2013, 51, 3–26. [Google Scholar] [CrossRef]
  8. Galloway, D.L.; Burbey, T.J. Review: Regional land subsidence accompanying groundwater extraction. Hydrogeol. J. 2011, 19, 1459–1486. [Google Scholar] [CrossRef]
  9. Cigna, F.; Tapete, D. Satellite InSAR survey of structurally-controlled land subsidence due to groundwater exploitation in the Aguascalientes Valley, Mexico. Remote Sens. Environ. 2021, 254, 112254. [Google Scholar] [CrossRef]
  10. Dvory, N.Z.; Livshitz, Y.; Kuznetsov, M.; Adar, E.; Yakirevich, A. The effect of hydrogeological conditions on variability and dynamic of groundwater recharge in a carbonate aquifer at local scale. J. Hydrol. 2016, 535, 480–494. [Google Scholar] [CrossRef]
  11. Jasechko, S.; Seybold, H.; Perrone, D.; Fan, Y.; Shamsudduha, M.; Taylor, R.G.; Fallatah, O.; Kirchner, J.W. Rapid groundwater decline and some cases of recovery in aquifers globally. Nature 2024, 625, 715–721. [Google Scholar] [CrossRef]
  12. Qian, H.; Chen, J.; Howard, K.W.F. Assessing groundwater pollution and potential remediation processes in a multi-layer aquifer system. Environ. Pollut. 2020, 263, 114669. [Google Scholar] [CrossRef]
  13. Walraevens, K.; Vandecasteele, I.; Martens, K.; Nyssen, J.; Moeyersons, J.; Gebreyohannes, T.; de Smedt, F.; Poesen, J.; Deckers, J.; van Camp, M. Groundwater Recharge and Flow in a Small Mountain Catchment in Northern Ethiopia. Hydrol. Sci. J. 2009, 54, 739–753. [Google Scholar] [CrossRef]
  14. Kavvas, M.L.; Tu, T.; Ercan, A.; Polsinelli, J. Fractional governing equations of transient groundwater flow in unconfined aquifers with multi-fractional dimensions in fractional time. Earth Syst. Dyn. 2020, 11, 1–12. [Google Scholar] [CrossRef]
  15. Scanlon, B.R.; Healy, R.W.; Cook, P.G. Choosing appropriate techniques for quantifying groundwater recharge. Hydrogeol. J. 2002, 10, 18–39. [Google Scholar] [CrossRef]
  16. Wang, J.; Hao, X.; Liu, X.; Ouyang, W.; Li, T.; Cui, X.; Pei, J.; Zhang, S.; Zhu, W.; Jin, R. Groundwater–surface water exchange affects nitrate fate in a seasonal freeze–thaw watershed: Sources, migration and removal. J. Hydrol. 2025, 654, 132803. [Google Scholar] [CrossRef]
  17. Yu, Z.; Zhu, S.; Wu, Y.; Yu, H. Study on the structural characteristics of the overburden under thick loose layer and thin-bed rock for safety of mining coal seam. Environ. Earth Sci. 2020, 79, 9. [Google Scholar] [CrossRef]
  18. Kaufmann, G. Modelling karst aquifer evolution in fractured, porous rocks. J. Hydrol. 2016, 543, 796–807. [Google Scholar] [CrossRef]
  19. Li, C.; Fang, J.; Feng, F.; Yao, T.; Shan, Y.; Su, W. Differential Evolution in Hydrochemical Characteristics Amongst Porous, Fissured and Karst Aquifers in China. Hydrology 2025, 12, 175. [Google Scholar] [CrossRef]
  20. Custodio, E.; Jodar, J.; Herrera, C.; Custodio-Ayala, J.; Medina, A. Changes in groundwater reserves and radiocarbon and chloride content due to a wet period intercalated in an arid climate sequence in a large unconfined aquifer. J. Hydrol. 2018, 556, 427–437. [Google Scholar] [CrossRef]
  21. Qin, R.; Song, Q.; Hao, Y.; Wu, G. Groundwater Level Declines in Tianjin, North China: Climatic Variations and Human Activities. Environ. Dev. Sustain. 2022, 25, 1899–1913. [Google Scholar] [CrossRef]
  22. Amanambu, A.C.; Obarein, O.A.; Mossa, J.; Li, L.; Ayeni, S.S.; Balogun, O.; Oyebamiji, A.; Ochege, F.U. Groundwater system and climate change: Present status and future considerations. J. Hydrol. 2020, 589, 125163. [Google Scholar] [CrossRef]
  23. Zhang, X.; Wu, X.; Zhao, R.; Mu, W.; Wu, C. Identifying the Facts and Driving Factors of Deceleration of Groundwater Table Decline in Beijing During 1999–2018. J. Hydrol. 2022, 607, 127475. [Google Scholar] [CrossRef]
  24. Zhou, W.J.; Hao, L. Impact of Rapid Urbanization on Groundwater Storage Variation Amid Climate Change in the Yangtze River Basin. J. Hydrol. Reg. Stud. 2025, 59, 102360. [Google Scholar] [CrossRef]
  25. Ali, J.; Islam, F.; Bibi, T.; Islam, I.; Mughal, M.R.; Sabir, M.; Awwad, F.; Ismail, E. Quantifying the impact of climate change and urbanization on groundwater resources using geospatial modeling. Front. Earth Sci. 2024, 12, 1377367. [Google Scholar] [CrossRef]
  26. Boo, K.B.W.; El-Shafie, A.; Othman, F.; Khan, M.M.H.; Birima, A.H.; Ahmed, A.N. Groundwater level forecasting with machine learning models: A review. Water Res. 2024, 252, 121249. [Google Scholar] [CrossRef] [PubMed]
  27. Kadiyala, L.A.; Mermer, O.; Samuel, D.J.; Sermet, Y.; Demir, I. The Implementation of Multimodal Large Language Models for Hydrological Applications: A Comparative Study of GPT-4 Vision, Gemini, LLaVa, and Multimodal-GPT. Hydrology 2024, 11, 148. [Google Scholar] [CrossRef]
  28. Gómez-Escalonilla, V.; Martínez-Santos, P. A Machine Learning Approach to Map the Vulnerability of Groundwater Resources to Agricultural Contamination. Hydrology 2024, 11, 153. [Google Scholar] [CrossRef]
  29. de Graaf, I.E.M.; Stahl, K. A model comparison assessing the importance of lateral groundwater flows at the global scale. Environ. Res. Lett. 2022, 17, 044020. [Google Scholar] [CrossRef]
  30. Ozdogan-Sarikoc, G.; Dadaser-Celik, F. Physically based vs. data-driven models for streamflow and reservoir volume prediction at a data-scarce semi-arid basin. Environ. Sci. Pollut. Res. Int. 2024, 31, 39098–39119. [Google Scholar] [CrossRef] [PubMed]
  31. Di Salvo, C. Improving Results of Existing Groundwater Numerical Models Using Machine Learning Techniques: A Review. Water 2022, 14, 2307. [Google Scholar] [CrossRef]
  32. Kim, N.W.; Chung, I.M.; Won, Y.S.; Arnold, J.G. Development and application of the integrated SWAT-MODFLOW model. J. Hydrol. 2008, 356, 1–16. [Google Scholar] [CrossRef]
  33. Hayley, K.; White, E. Quantifying urban groundwater complexity: A high parameterization modelling approach. J. Hydrol. 2024, 638, 131416. [Google Scholar] [CrossRef]
  34. Amaranto, A.; Pianosi, F.; Solomatine, D.; Corzo, G.; Muñoz-Arriola, F. Sensitivity analysis of data-driven groundwater forecasts to hydroclimatic controls in irrigated croplands. J. Hydrol. 2020, 587, 124957. [Google Scholar] [CrossRef]
  35. Tao, H.; Hameed, M.M.; Marhoon, H.A.; Zounemat-Kermani, M.; Heddam, S.; Kim, S.; Sulaiman, S.O.; Tan, M.L.; Sa’adi, Z.; Mehrm, A.D.; et al. Groundwater level prediction using machine learning models: A comprehensive review. Neurocomputing 2022, 489, 271–308. [Google Scholar] [CrossRef]
  36. Sun, Y.; Wendi, D.; Kim, D.E.; Liong, S.-Y. Technical note: Application of artificial neural networks in groundwater table forecasting—A case study in a Singapore swamp forest. Hydrol. Earth Syst. Sci. 2016, 20, 1405–1412. [Google Scholar] [CrossRef]
  37. Qin, C.; Chen, L.; Cai, Z.; Liu, M.; Jin, L. Long short-term memory with activation on gradient. Neural Netw. 2023, 164, 135–145. [Google Scholar] [CrossRef]
  38. Zhou, Y.; Wang, L.; Liu, J.; Li, W.; Zheng, Y. Options of Sustainable Groundwater Development in Beijing Plain, China. Phys. Chem. Earth 2012, 47–48, 99–113. [Google Scholar] [CrossRef]
  39. Du, J.; Laghari, Y.; Wei, Y.-C.; Wu, L.; He, A.-L.; Liu, G.-Y.; Yang, H.-H.; Guo, Z.-Y.; Leghari, S.J. Groundwater Depletion and Degradation in the North China Plain: Challenges and Mitigation Options. Water 2024, 16, 354. [Google Scholar] [CrossRef]
  40. Zhou, Y.; Dong, D.; Liu, J.; Li, W. Upgrading a Regional Groundwater Level Monitoring Network for Beijing Plain, China. Geosci. Front. 2013, 4, 127–138. [Google Scholar] [CrossRef]
  41. Beijing Geological Prospecting and Developing Bureau, Beijing Institute of Hydrogeology and Engineering Geology. Beijing Groundwater; China Land Press: Beijing, China, 2008; pp. 47–50. [Google Scholar]
  42. Edelmann, D.; Móri, T.F.; Székely, G.J. On Relationships Between the Pearson and the Distance Correlation Coefficients. Stat. Probab. Lett. 2021, 169, 108960. [Google Scholar] [CrossRef]
  43. Chen, Q.; Meng, Z.; Liu, X.; Jin, Q.; Su, R. Decision variants for the automatic determination of optimal feature subset in RF-RFE. Genes 2018, 9, 301. [Google Scholar] [CrossRef]
  44. Barzani, A.R.; Pahlavani, P.; Ghorbanzadeh, O.; Gholamnia, K.; Ghamisi, P. Evaluating the impact of recursive feature elimination on machine learning models for predicting forest fire-prone zones. Fire 2024, 7, 440. [Google Scholar] [CrossRef]
  45. Van Houdt, G.; Mosquera, C.; Napoles, G. A review on the long short-term memory model. Artif. Intell. Rev. 2020, 53, 5929–5955. [Google Scholar] [CrossRef]
  46. Taylor, K.E. Summarizing multiple aspects of model performance in a single diagram. J. Geophys. Res. Atmos. 2001, 106, 7183–7192. [Google Scholar] [CrossRef]
  47. Spane, F.A. Considering barometric pressure in groundwater flow investigations. Water Resour. Res. 2002, 38, 14-1–14-18. [Google Scholar] [CrossRef]
  48. Lorenzo-Lacruz, J.; Garcia, C.; Morán-Tejeda, E. Groundwater Level Responses to Precipitation Variability in Mediterranean Insular Aquifers. J. Hydrol. 2017, 552, 516–531. [Google Scholar] [CrossRef]
  49. Balugani, E.; Lubczynski, M.W.; Reyes-Acosta, L.; van der Tol, C.; Francés, A.P.; Metselaar, K. Groundwater and unsaturated zone evaporation and transpiration in a semi-arid open woodland. J. Hydrol. 2017, 547, 54–66. [Google Scholar] [CrossRef]
  50. Bova, S.; Rosenthal, Y.; Liu, Z.; Godad, S.P.; Yan, M. Seasonal Origin of the Thermal Maxima at the Holocene and the Last Interglacial. Nature 2021, 589, 548–553. [Google Scholar] [CrossRef]
  51. Guo, C.; Liu, T.; Niu, Y.; Liu, Z.; Pan, X.; De Maeyer, P. Quantitative analysis of the driving factors for groundwater resource changes in arid irrigated areas. Hydrol. Process. 2021, 35, e13967. [Google Scholar] [CrossRef]
  52. Scanlon, B.R.; Fakhreddine, S.; Rateb, A.; de Graaf, I.; Famiglietti, J.; Gleeson, T.; Grafton, R.Q.; Jobbagy, E.; Kebede, S.; Kolusu, S.R.; et al. Global water resources and the role of groundwater in a resilient water future. Nat. Rev. Earth Environ. 2023, 4, 87–101. [Google Scholar] [CrossRef]
  53. Tian, Y.; Zheng, Y.; Wu, B.; Wu, X.; Liu, J.; Zheng, C. Modeling surface water-groundwater interaction in arid and semi-arid regions with intensive agriculture. Environ. Model. Softw. 2015, 63, 170–184. [Google Scholar] [CrossRef]
  54. Foster, S.; Chilton, J.; Nijsten, G.-J.; Richts, A. Groundwater—A global focus on the ‘local resource’. Curr. Opin. Environ. Sustain. 2013, 5, 685–695. [Google Scholar] [CrossRef]
Figure 1. Geographic location and elevation of groundwater monitoring points in Beijing.
Figure 1. Geographic location and elevation of groundwater monitoring points in Beijing.
Hydrology 12 00215 g001
Figure 2. Heatmap illustrating the correlations between driving factors and groundwater levels across monitoring stations. Note: ‘*’ denotes statistical significance; ‘*’ represents p < 0.05; ‘**’ represents p < 0.01; non-significant correlations are unmarked.
Figure 2. Heatmap illustrating the correlations between driving factors and groundwater levels across monitoring stations. Note: ‘*’ denotes statistical significance; ‘*’ represents p < 0.05; ‘**’ represents p < 0.01; non-significant correlations are unmarked.
Hydrology 12 00215 g002
Figure 3. Relative contributions of climatic and anthropogenic factors to groundwater level modeling accuracy. Error bars represent variance of evaluation metrics across multiple simulation trials.
Figure 3. Relative contributions of climatic and anthropogenic factors to groundwater level modeling accuracy. Error bars represent variance of evaluation metrics across multiple simulation trials.
Hydrology 12 00215 g003
Figure 4. Evaluation of groundwater level prediction effect. Taylor diagrams of different degrees of aquifer enrichment ((a) extremely low water richness, (b) low water richness, (c) moderate water richness, (d) high water richness, and (e) extremely high water richness), where the correlation between simulated result and observation is shown as gray slash and RMSE is shown as yellow line.
Figure 4. Evaluation of groundwater level prediction effect. Taylor diagrams of different degrees of aquifer enrichment ((a) extremely low water richness, (b) low water richness, (c) moderate water richness, (d) high water richness, and (e) extremely high water richness), where the correlation between simulated result and observation is shown as gray slash and RMSE is shown as yellow line.
Hydrology 12 00215 g004
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yang, L.; Gao, M.; Chen, J.; Shi, W.; Hou, C.; Liu, Z.; Luo, C.; Yu, J.; Yang, X.; Dong, J. Effects of Climate Variables and Human Activities on Groundwater Level Fluctuations in Unconsolidated Sedimentary Aquifers: A Data-Driven Approach. Hydrology 2025, 12, 215. https://doi.org/10.3390/hydrology12080215

AMA Style

Yang L, Gao M, Chen J, Shi W, Hou C, Liu Z, Luo C, Yu J, Yang X, Dong J. Effects of Climate Variables and Human Activities on Groundwater Level Fluctuations in Unconsolidated Sedimentary Aquifers: A Data-Driven Approach. Hydrology. 2025; 12(8):215. https://doi.org/10.3390/hydrology12080215

Chicago/Turabian Style

Yang, Liu, Ming Gao, Jiameng Chen, Wenqing Shi, Changhong Hou, Zichun Liu, Cheng Luo, Jiahui Yu, Xiangyu Yang, and Jie Dong. 2025. "Effects of Climate Variables and Human Activities on Groundwater Level Fluctuations in Unconsolidated Sedimentary Aquifers: A Data-Driven Approach" Hydrology 12, no. 8: 215. https://doi.org/10.3390/hydrology12080215

APA Style

Yang, L., Gao, M., Chen, J., Shi, W., Hou, C., Liu, Z., Luo, C., Yu, J., Yang, X., & Dong, J. (2025). Effects of Climate Variables and Human Activities on Groundwater Level Fluctuations in Unconsolidated Sedimentary Aquifers: A Data-Driven Approach. Hydrology, 12(8), 215. https://doi.org/10.3390/hydrology12080215

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop