Identification of the Non-Stationarity of Meteorological Drought in the Yellow River Basin and Assessment of the Applicability of the GAMLSS Model

Liang, Li’e; Hu, Liulong; Wang, Xiaohan; Zhu, Yonghua; Chao, Yan; Wang, Yong; Liu, Ziyi

doi:10.3390/su18136383

Open AccessArticle

Identification of the Non-Stationarity of Meteorological Drought in the Yellow River Basin and Assessment of the Applicability of the GAMLSS Model

by

Li’e Liang

^*,

Liulong Hu

,

Xiaohan Wang

,

Yonghua Zhu

,

Yan Chao

,

Yong Wang

and

Ziyi Liu

College of Civil Engineering and Architecture, Yan’an University, Yan’an 716000, China

^*

Author to whom correspondence should be addressed.

Sustainability 2026, 18(13), 6383; https://doi.org/10.3390/su18136383 (registering DOI)

Submission received: 16 April 2026 / Revised: 19 June 2026 / Accepted: 21 June 2026 / Published: 23 June 2026

(This article belongs to the Topic Climate Change Impacts and Adaptation: Interdisciplinary Perspectives, 2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

Taking the Yellow River Basin (YRB) as an example, this study explores the non-stationary drought evolution features in large river basins under climate change. This study utilized precipitation and multiple climate factor data to establish the non-stationary standardized precipitation index (NSPI) through the GAMLSS model. Combined with the run theory, Copula function and a cascaded RF-LSTM machine learning model, the drought characteristics and retrospective predictive patterns were systematically assessed. The results show that: (1) The Arctic Oscillation, the Pacific Decadal Oscillation, the Southern Oscillation and the North Pacific Index are the primary climate drivers of non-stationary precipitation variation in the YRB, with the former three being selected most frequently and NPI additionally influencing April–June and September, and their effects are both different and lagging. Compared with the traditional SPI, the NSPI assigned higher drought grades and greater severity to typical drought years (e.g., the 1974 event was rated D3 with a severity of 17.935 by NSPI versus D2 with 11.733 by SPI), and thus better captured non-stationary drought evolution. (2) The duration of droughts exhibited a decreasing trend that was not statistically significant (p > 0.05), whereas drought intensity and severity decreased significantly (p < 0.05); the peak severity showed a significant upward trend (p = 0.0078). Spatially, the northwest of the Loess Plateau was a compound core area with high severity, high frequency and long duration of droughts, while the upper reaches were mainly characterized by low severity, short duration and sudden droughts. (3) The drought risk in the YRB shows a higher frequency in the lower reaches and a lower frequency in the upper reaches. The middle and lower reaches were high-risk areas, with shorter AND-type joint exceedance return periods for moderate drought (2.46–5.83 years) and severe drought (3.77–9.15 years). The upper reaches were low-risk areas, with longer return periods reaching up to 5.83 years for moderate drought and 9.15 years for severe drought. The study shows that the NSPI, considering the driving of multiple climate factors, can more effectively identify and assess non-stationary drought risks, providing a scientific basis for drought prevention and control in river basins.

Keywords:

non-stationarity; GAMLSS model; Copula function; machine learning

1. Introduction

Global warming makes extreme events more likely to occur [1,2,3], especially extreme precipitation events [4,5,6]. The increasingly frequent extreme precipitation has led to more drought events [7,8]. Among all kinds of natural disasters around the world, drought is extremely destructive, and its main manifestation is water shortage [9]. The impacts of drought have both cumulative and persistent characteristics, and their effects remain in the ecosystem for a long time even after the drought ends. This effect is more pronounced in arid and semi-arid regions [10,11]. Therefore, a systematic study of long-term precipitation changes and their associations with droughts and extreme precipitation events is conducive to revealing the characteristics and evolution mechanisms of climate abrupt changes and providing a scientific basis for disaster risk assessment and the enhancement of disaster resilience.

At global and regional scales, the dynamic system formed by the ocean and atmosphere creates a complex and variable climate background. This system significantly alters water vapor transport and water availability, thereby becoming a key driving factor that directly controls the magnitude, occurrence frequency, and persistence duration of extreme precipitation events [12,13,14]. Large-scale climate modes such as the Atlantic Multidecadal Oscillation and the Pacific Decadal Oscillation have been identified as the dominant factors influencing the spatiotemporal distribution of extreme precipitation and drought conditions in China’s monsoon region and the Yangtze River Basin, and this role is particularly prominent in the process [15,16,17].These climatic phenomena generate a typical regional spatial pattern of droughts and floods by modulating atmospheric circulation and air-sea interactions [18], and they can trigger extensive precipitation anomalies and severe drought conditions during El Niño events [19,20]. For the variation patterns of such hydrological extremes in non-stationary environments, the research of scholars at home and abroad mainly focuses on three aspects: the processing of non-stationarity in hydrological series [21], the identification of abrupt change points [22], and the analysis of periodic characteristics [23]. In the exploration of the spatiotemporal variation characteristics of extreme precipitation events, the Mann–Kendall test, wavelet analysis, and Copula function are several commonly used technical means in the academic field [24,25]. In the field of non-stationary frequency analysis, the Generalized Additive Models for Location, Scale and Shape (GAMLSS) have gradually developed into a core technical tool due to their ability to flexibly represent the complex relationships between distribution parameters and various explanatory covariates (such as climate indices and reservoir regulation indices) [26,27]. This methodology has been effectively applied and widely adopted in investigating non-stationary drought patterns through the integration of climate indices [28].

The Yellow River Basin (YRB), serving as a critical agricultural ecosystem in China, has long faced severe drought challenges, which are closely associated with regional topographic variability, complex climatic conditions, and uneven water resource distribution [29,30]. Wang et al. [31] applied Copula functions to analyze drought duration and severity using the SPEI in the Yellow River Basin, while Guo et al. [32] assessed hydrological drought risk and its spatial transmission through a three-dimensional Copula framework. Zhang et al. [33] analyzed the evolution characteristics of meteorological drought under future climate change in the middle reaches of the YRB based on the Copula function. Yang et al. [34] selected the SPEI (Standardized Precipitation Evapotranspiration Index) as the drought measurement indicator and, based on this, analyzed the changing trends and abrupt points of temperature and precipitation, while also exploring the meteorological drought characteristics of the YRB and the changes in the drought recovery process on an annual scale. Facing severe challenges such as extreme drought and the intensification of water resource supply and demand contradictions, Feng et al. [35] adopted a dual-index framework composed of SPI and SPEI to quantitatively analyze historical drought and flood conditions and predict future trends under various scenarios. However, these related studies all relied on the assumption of stationarity. Cui et al. [36] constructed a non-stationary standardized runoff index (NSRI) under the GAMLSS framework using four local driving factors (precipitation, temperature, water withdrawal, and reservoir index), providing a drought assessment scheme for the YRB. Li et al. [37] applied the GAMLSS model to quantify the effects of precipitation and agricultural planting changes on seasonal runoff across five hydrological stations, demonstrating the model’s regional capability in hydrological analysis. Yu et al. [38] further developed a covariate-based standardized runoff index (SRI_cov) across the seven sub-basins of the YRB, revealing that drought events identified by traditional SRI often deviated from the actual distribution and that abnormal water supply conditions were poorly captured. Despite these advances, existing non-stationary assessments in the YRB have concentrated on runoff-based hydrological drought, leaving the precipitation-driven meteorological drought under large-scale climate oscillations largely unexplored. The delayed teleconnection effects of climate indices over a 0–12 month window have not been systematically screened, and Copula-based joint risk analysis coupled with machine learning prediction has yet to be incorporated into the non-stationary framework. Most of the YRB is located in an area with relatively low precipitation and high evaporation, and the non-stationary characteristics of the precipitation sequence have often been overlooked. In this context, the SPI more effectively reflects drought intensity and duration, enabling consistent drought assessment across multiple temporal scales and geographical regions, which accounts for its widespread adoption [39], but its stationary assumption limits its applicability in a changing climate. This study addressed these gaps by constructing a non-stationary standardized precipitation index (NSPI) driven by optimally lagged climate factors, coupling it with Copula joint distribution analysis for bivariate return period estimation, and employing an RF-LSTM model for dynamic drought projection across the YRB.

This study follows the logical sequence of “mechanism identification—model construction—evaluation and verification—prediction application”, screening out key climate factors such as AOI and PDO and analyzing the differences in their roles and lags in the spatiotemporal distribution of precipitation. To fit the monthly precipitation series, this study established a non-stationary model, in which multiple factors were incorporated as covariates. By comparing it with traditional stationary models, its superiority and applicability in a changing environment are verified. In the drought assessment stage, drought characteristic variables are extracted, and the differences in the identification of historical drought events between traditional SPI and NSPI are compared. By using the Copula function, the two variables of drought duration and intensity are constructed into a two-dimensional joint distribution to test whether the non-stationary framework can effectively describe the extreme drought process. To expand the predictive perspective, a hybrid machine learning model combining random forest and LSTM is further developed. It extracted the temporal autocorrelation patterns from the NSPI series to validate short-term drought characteristics, providing a basis for precise drought risk prevention and control in the YRB.

2. Study Area and Data

2.1. The Study Area Overview

The Yellow River originates from the Bayan Har Mountains on the Qinghai-Xizang Plateau, flows eastward through three climate zones (arid, semi-arid, and semi-humid), and finally empties into the Bohai Sea in Shandong Province. It is the second-longest river in China and provides crucial water resources for the northern regions. The basin is located between 96° E and 119° E and 32° N and 42° N (see Figure 1), covering an area of approximately 795,000 square kilometers. Geographically, it traverses the Qinghai-Xizang Plateau, the Inner Mongolia Plateau, the Loess Plateau and the Yellow-Huai-Hai Plain from west to east, with a general trend of higher terrain in the west and lower in the east. The annual precipitation in most parts of the basin ranges from 200 to 650 mm, decreasing from southeast to northwest, and shows significant intra-annual and inter-annual fluctuations. Due to its wide east–west extension, the YRB features complex topography and significant altitude differences, leading to considerable variations in climate conditions across different areas. Additionally, the basin lies in the mid-latitude zone, where atmospheric circulation and monsoon processes drive significant climatic differentiation, resulting in marked temperature differences and uneven spatial distribution of precipitation [40].

2.2. Data Sources

The precipitation data used in this article was sourced from the National Qinghai-Xizang Plateau Scientific Data Center (https://data.tpdc.ac.cn), with an original resolution of 1 km and resampled uniformly to 10 km [41]. The climate index data is from the Climate Prediction Center of the National Oceanic and Atmospheric Administration of the United States (http://www.cpc.ncep.noaa.gov/data/, accessed on 15 Augest 2025). Missing values are processed using linear interpolation.

3. Methods

3.1. Standardized Precipitation Index (SPI)

The calculation procedure for the conventional SPI can be summarized in three steps. For a time scale of

ω

months, the cumulative precipitation sequence is fitted using a Gamma distribution, and the specific expression is shown in the corresponding formula (1). The derivation expression of the cumulative distribution function (CDF) of the precipitation sequence based on the Gamma distribution (GA) is shown in Formula (2). Finally, the CDF is standardized to conform to a normal distribution with

μ

= 0 and

σ

= 1, and the SPI value is calculated accordingly [42]. The calculation formula is as follows:

P_{m}^{w} = \sum_{i = 0}^{w - 1} P_{m - i}

(1)

In the formula,

ω

represents the time scale,

m

represents the month,

P_{m}^{w}

is the cumulative precipitation sequence of the

ω

-month scale in the mth month, and the precipitation in the (

m - i

)th month is denoted as

P_{m - i}

.

In the formula, f(·) corresponds to the probability density function of the Gamma distribution. For SPI, the GAMLSS GA family parametrization was adopted, where

μ

and

σ

respectively represent the mean and the dispersion, with both estimated from the historical precipitation series and treated as fixed constants.

f (P_{m}^{w} | μ, σ) = \frac{1}{{(σ^{2} μ)}^{\frac{1}{σ^{2}}}} \cdot \frac{(P_{m}^{w})^{\frac{1}{σ^{2}} - 1} \exp [- \frac{P_{m}^{w}}{σ^{2} μ}]}{Γ (\frac{1}{σ^{2}})}

(2)

The cumulative probability over a certain time scale is:

F (P_{m}^{w}) = \int_{0}^{P_{m}^{w}} f (t) d t

(3)

The SPI is obtained by normalizing the cumulative probability:

S P I = Φ^{- 1} [F (P_{m}^{w})]

(4)

In the formula, the CDF corresponding to the standard normal distribution

Φ

has its inverse function denoted as

Φ^{- 1}

.

3.2. Non-Stationary Standardized Precipitation Index (NSPI)

The methodology for computing the NSPI in this study is conducted in two sequential steps: initially, an appropriate set of climate factors is preliminarily selected to constitute a covariate combination; subsequently, a GAMLSS model is constructed using the R programming language and its performance is rigorously evaluated.

3.2.1. Screening Climate Factors

Research indicates that large-scale climate indices such as the North Atlantic Oscillation (NAO), Pacific Decadal Oscillation (PDO), and Atlantic Multidecadal Oscillation (AMO) are associated with droughts in various regions around the world [43]. When evaluating the teleconnection relationship between hydrological variables and climate models, common methods include Kendall and Spearman correlation analyses [44]. For the six selected climate factors, within the lag range of 0 to 12 months, the Kendall correlation test at a significance level of 0.05 was used to screen out the optimal lag duration corresponding to the cumulative precipitation dataset and the most suitable large-scale climate oscillation. The specific steps are as follows:

For the cumulative precipitation

P_{m}^{w}

given in month

m

, a series of

ω

-month scale monthly average climate indices, denoted as

C_{m}^{ω} - L

, were determined by considering different lead times (L months). For each month and each climate index, 13 sets of data are constructed respectively: one set has no lead time (L = 0), and the other 12 sets correspond to different lead times (L = 1 to 12), to test their correlation with the corresponding cumulative precipitation sequence

P_{m}^{w}

in month

m

. Finally, for each month, the sequence of climate index lead times with the maximum correlation was selected as the main covariates for establishing the NSPI. A two-stage screening strategy was adopted to handle the large candidate pool. In the first stage, Kendall rank correlation was computed between the 12-month cumulative precipitation and each of the 78 lagged climate variables (6 indices × 13 lags). The top 10 variables with the strongest absolute correlations were retained as the candidate set for each month. This pre-screening reduced the dimensionality of the predictor space and ensured numerical stability. In the second stage, the GAMLSS stepGAIC function performed backward elimination based on the AIC criterion, starting from an initial model containing the top 5 candidates and testing all remaining candidates from the first stage. Only variables that provided sufficient improvement in model fit (ΔAIC > 2) were retained in the final model. This approach effectively shifted the selection criterion from individual correlation significance to overall model improvement, while the stepwise procedure inherently guarded against overfitting by excluding redundant predictors.

3.2.2. GAMLSS

Generalized Additive Models for Location, Scale and Shape (GAMLSS) represent a semi-parametric regression framework designed to assess non-stationarity in hydrometeorological time series. This approach models the distribution parameters of the response variable using linear or nonlinear functions of explanatory variables. For further details on GAMLSS, refer to the cited literature [26]. For the GAMLSS model, the observed value

y_{t}

is fitted to the probability density function

f (y_{t} ∣ θ_{t})

, where

θ_{t}

=

(θ_{t 1}, θ_{t 2}, \dots, θ_{t p})

. The vector composed of

P

parameters of the probability density function

f

is denoted as

θ_{t}

. The “k”th parameter of

θ

is characterized as a location, scale, or shape parameter, and is linked to the covariates (explanatory variables) through a monotonic link function

g_{k} (\begin{matrix} \cdot \end{matrix})

.

g_{k} (θ_{k}) = X_{k} β_{k} + \sum_{j = 1}^{J_{k}} Z_{j k} γ_{j k}

(5)

The GAMLSS method is used to solve the location and scale parameters of the non-stationary gamma distribution. Each of the two parameters forms a linear relationship with the large-scale climate index in the covariates. By means of the multiple stepwise regression method provided by the GAMLSS software package (R version 4.5.1), multiple meteorological factor covariates were screened to obtain the optimal combination. The non-stationary parameters

μ (t)

and

σ (t)

are as follows:

The location (

μ

) and scale (

σ

) parameters of the two-parameter Gamma distribution were fitted:

X_{t} \sim G a m m a (μ_{t}, σ_{t})

(6)

l o g (μ_{t}) = a_{0} + a_{1} c_{1} (t) + a_{2} c_{t} (t) + \dots + a_{n} c_{n} (t)

(7)

{l o g (σ}_{t}) = b_{0} + b_{1} c_{1} (t) + b_{2} c_{2} (t) + \dots + b_{n} c_{n} (t)

(8)

In the formula,

c_{i}

represents the

i

-th climate change factor, and

a_{i}

and

b_{i}

are the coefficients of the regression equation.

After estimating the non-stationary parameters

μ_{t}

and

σ_{t}

using the GAMLSS model, the cumulative precipitation

P_{m}^{w}

was fitted to a Gamma (

μ_{t}

,

σ_{t}

) distribution. The Gamma family was used with the default log-link functions for both

μ

and

σ

, ensuring positivity of the estimated values. The stepwise selection followed a two-stage procedure: first, covariates for μ were selected by backward elimination using the AIC criterion, with a scope ranging from the intercept-only model to the full set of candidate climate indices at their optimal lags; second, covariates for σ were selected, starting from the intercept-only model with the same upper scope. The entry and retention threshold was set at ΔAIC > 2. Across all 12 months, no covariates met the inclusion threshold for σ, indicating that the climate indices provided sufficient explanatory power for the location of the precipitation distribution but not for its scale. The cumulative probabilities were then transformed into a standard normal distribution via a specified equation to derive the NSPI. To mitigate model overfitting, the Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and Global Deviance (GD) were employed for model selection. Worm plots and Q-Q plots were utilized for visual residual diagnostics and comparative analysis of model efficacy. Through analytical screening, a time scale of 12 months was selected. Each candidate model was fitted independently to the cumulative precipitation series at every 10 km grid cell across the basin. The drought classification criteria are provided in Table 1 [45].

3.3. Run-Length Theory

The run-length theory was applied to the SPI12 and NSPI12 series at each 10 km grid cell to identify drought events and their extract duration, severity, and peak values. This procedure was performed independently across the basin to capture spatial variations in drought characteristics. Basin-averaged results were subsequently derived from these grid-cell outputs to represent the overall drought regime of the entire YRB. Previous research has confirmed the feasibility of this method [46]. A drought event is defined as a continuous period during which drought index values persistently fall below a predetermined threshold. For a drought event, its duration (D) is the length of time from the beginning to the end. For a drought event, the cumulative value of the drought index is taken as the events severity (S); the lowest value reached by the index during the event is defined as the peak severity (I) [47].

Following Yevjevich (1967) [48], this study set three threshold levels, denoted as X0, X1 and X2, corresponding to 0, −0.5 and −1.0, respectively. The threshold of −0.5 (X1) served as the criterion for drought event onset, −1.0 (X2) provided an additional confirmation threshold for events lasting exactly one month, and 0 (X0) served as the merge criterion for events separated by a single non-drought month. A drought event is considered to occur when the drought index is lower than −0.5. If a drought lasts exactly one month and the index is lower than −1, it is confirmed as a drought event; both conditions must be met, otherwise it is not recognized. For two drought events, if they are separated by only one month and the index value during the interval is less than 0, they are combined into one event. The duration of the combined drought is equal to the sum of the durations of the two events plus one month, and the intensity is the sum of the intensities of the two events.

3.4. Trend and Turning Point Test

Climate change, natural geographical conditions and human activities can affect hydrological sequences within a specific period, leading to significant changes before and after a certain point in time [38]. In this study, after obtaining the NSPI based on the GAMLSS model, basin-averaged SPI12 and NSPI12 series were constructed, and trend analysis and change point detection were carried out on these sequences.

For trend testing, the Trend-Free Pre-Whitening Mann–Kendall (TFPW-MK) method was adopted. The standard M-K test requires observations to be independent, but hydrological time series usually contain serial autocorrelation, which raises the chance of detecting false trends. The TFPW-MK method addresses this problem through three steps [49]. First, a trend was estimated and removed from the original series to obtain a detrended residual series. Second, the lag-1 autoregressive component was removed from these detrended residuals through pre-whitening to eliminate serial correlation. Third, the previously removed trend was added back to the pre-whitened residuals, and the standard M-K test was then performed on this reconstructed series. The Z value and the slope were computed to assess the significance and magnitude of the trend.

The TFPW-MK procedure produces three standard outputs. The Z value is the standardized test statistic derived from the Kendall score; under the null hypothesis of no trend, it asymptotically follows the standard normal distribution [49]. The p-value represents the probability of observing a test statistic at least as extreme as the computed Z value if the null hypothesis were true. The Sen slope is a nonparametric estimator of trend magnitude, computed as the median of all pairwise slopes between observations. By design, the TFPW-MK method reports the Sen slope as a point estimate together with the Z value and the p-value, but does not produce an analytical confidence interval for the slope [50]. This is because the Sen slope is based on the median of pairwise differences, and its exact sampling distribution under the pre-whitening transformation does not admit a simple closed-form variance estimator. Consequently, inference is conventionally drawn from the joint consideration of the highly significant p-value, the direction of the Sen slope, and the large sample size [51].

For change point detection, the Pettitt test was applied. This test identifies a single change point by locating the point that maximizes the difference between the ranks of observations before and after it [52]. The mean values before and after the identified change point were calculated to quantify the shift magnitude.

3.5. Copula Function

Through the Copula function, two or more probability distributions can be integrated into a multivariate distribution. Compared with traditional multivariate distributions, a core difference of the Copula lies in that it does not require the probability distribution types of each variable to be similar to each other [53]. For continuous random variables X and Y, let F(x,y) denote their joint distribution function, and let the marginal distributions be

F_{x}

and

F_{y}

respectively. According to Sklar’s theorem, there exists a unique copula function C(

μ, v)

that satisfies the following equation:

\begin{matrix} F (x, y) & = & C & [F_{x} (x), F_{y} (y)], & \forall x, y \end{matrix}

(9)

This study first identified drought events based on the SPI12 and NSPI12 sequences using the run-length theory, and extracted the drought duration D and drought severity S for each event. Subsequently, it fit the Gamma, Weibull, Normal, and Lognormal marginal distributions to D and S and constructed the D-S joint distribution using five Copula functions: Gaussian, t, Clayton, Frank, and Gumbel. The selection of marginal distributions and Copula functions was based on the minimum AIC value, with RMSE as an auxiliary goodness-of-fit indicator.

The joint recurrence period is defined as the AND-type joint exceedance recurrence period when both the drought duration and drought intensity reach or exceed a specified threshold, rather than being directly calculated from the joint cumulative distribution probability. For a given scenario (d0, s0), the joint exceedance probability and joint recurrence period are [54]:

P (D \geq d_{0}, S \geq s_{0}) = 1 - F_{D} (d_{0}) - F_{S} (s_{0}) + C_{D, S} [F_{D} (d_{0}), F_{S} (s_{0})]

(10)

T_{A N D} = \frac{E (L)}{1 - F_{D} (d) - F_{S} (s) + C_{D . S} (F_{D} (d), F_{S} (s))}

(11)

In the formula,

F_{D}

and

F_{S}

are respectively the cumulative edge distribution functions of the duration of drought D and the intensity of drought S;

C_{D . S}

is the joint distribution function; and E(L) is the average interval of drought events. The joint recurrence period was calculated under conditions of moderate drought (D = 3, S = 3) and severe drought (D = 6, S = 9), and the inverse distance weighting method was used for spatial interpolation of the grid-scale return periods.

3.6. Machine Learning—Random Forest (RF) + LSTM

In the field of long-term prediction within earth sciences, the capacity of machine learning algorithms to process nonlinear processes has garnered empirical support [55]. Random Forest (RF), as an ensemble learning model, can effectively deal with high-dimensional features, evaluate variable importance, and reduce overfitting risk through its ensemble mechanism, providing robust initial feature selection and nonlinear fitting capabilities for the model [56]. As a variant of the recurrent neural network, the LSTM model has the ability to autonomously learn the long-term dependency features in time series and can precisely depict the dynamic change process of hydro-meteorological data. This model was first proposed by Hochreiter and Schmidhuber (1997) [57]. This paper adopted a cascaded RF-LSTM architecture for NSPI12 prediction. The RF component was employed to correct the residuals of an LSTM baseline that was trained on 12-month lagged NSPI values and seasonal encodings. Specifically, the LSTM first established a baseline prediction, and the RF was subsequently fitted to the residuals between the observed NSPI and the LSTM output, using the same lagged NSPI features plus the LSTM prediction as an additional covariate. The final prediction was obtained by adding the RF residual estimate to the LSTM baseline. The model was trained on 1961–1988, validated on 1989–2016, and assessed on 2017–2024. This combined model was applied to the NSPI12 series at each 10 km grid cell across the basin.

4. Results

4.1. The Construction of Non-Stationary Models

4.1.1. Screening of Climatic Factors

To explore the influence of climate factors in each month, six climate indices—namely the Arctic Oscillation Index (AOI), the North Atlantic Oscillation Index (NAO), the Pacific Decadal Oscillation Index (PDO), the Atlantic Multidecadal Oscillation Index (AMO), the Southern Oscillation Index (SOI), and the North Pacific Index (NPI)—were selected. The Kendall correlation test was applied to the basin-averaged cumulative precipitation series to screen each climate index at a significance level of 0.05. The results of the significance tests at 0.01 and 0.05 levels (Table 2) show that the optimal lag order of different climate factors varies in the same month. Even for the same factor, the selected results in the same month can also be different. For example, in January, the p-value of AOI₁₀ is 0.007 and the p-value of AOI₆ is 0.018. The p-value of AOI₁₀ is the smallest among all factors, based on which it is determined as the optimal driving factor for climate change in January, and the corresponding optimal lag order is also identified. Most months have significant correlations with AOI, PDO, and SOI. NPI also has a relatively good correlation with precipitation changes, mainly affecting the months from April to June and September. NAO is significantly correlated with the cumulative precipitation in May, July, and August. AMO did not pass the significance tests at 0.01 and 0.05 levels in the cumulative precipitation sequence of the 12 months. However, AMO was retained in the initial candidate set based on documented teleconnections with decadal precipitation variability in the YRB, and the stepwise GAMLSS procedure subsequently excluded it from all final models (Table 3), confirming that it did not contribute additional explanatory power beyond the other retained covariates.

4.1.2. GAMLSS

This study sets up three types of models: Model₀ is the stationary type, where the distribution parameters have fixed values and do not change with external factors. Model₁ introduces time as a covariate, and the distribution function has a linear relationship with time t. Model₂ uses the optimal climate factor as a covariate, and the distribution parameters form a linear expression with it. GD, AIC criterion, and SBC criterion were adopted to prevent overfitting of the models. The results are shown in Figure 2. For the Yellow River Basin (YRB) at a 12-month scale, among all the models, Model₂, with climate factors as covariates, had the smallest median values of GD, AIC, and SBC. The ΔAIC values between Model₂ and Model0 ranged from −15.2 to −5.4 across the 12 months, and between Model₂ and Model₁ from −16.8 to −7.3. All differences substantially exceeded the threshold of 2, providing strong evidence that Model₂ was the superior specification. Under the minimization criterion, after introducing climate factors as covariates into the non-stationary model, the goodness of fit of this model for the 12-month cumulative precipitation data in the YRB is higher than that of the stationary model and the time-varying model. In terms of the cumulative precipitation sequence of the study area, the selected non-stationary model has higher applicability.

Table 3 presents the covariate composition of the distribution parameters

μ

and

σ

of the optimal non-stationary models for the cumulative precipitation series of each month. It can be seen that the best covariate combinations after optimization are different, and the lag times of the same climatic factors in the same month are also different. For parameter

μ

, SOI is selected most frequently, and different lags of it are optimal covariates for the distribution parameters in the same month. The simultaneous inclusion of multiple SOI lags reflects the persistent influence of the Southern Oscillation on precipitation at different time scales. Each retained lag provided additional explanatory power, as measured by the AIC improvement criterion, and the stepwise procedure automatically excluded redundant lags. Among the 12 months, AOI is selected 7 times, NPI is selected in May and June, and PDO and NAO are each selected once. Although the parameter

σ

failed to match the optimal covariates, the estimation results of parameter

μ

indicate that the mean level shows stronger non-stationary characteristics of the precipitation sequence, with the dominant factors being SOI and AOI. This pattern reflects that large-scale climate indices primarily influence the location of the precipitation distribution rather than its spread.

To evaluate the reliability degree of the optimal non-stationary model, this paper analyzed its residual sequence and goodness of fit. The statistical indicators calculated for the residual sequence of the optimal non-stationary model in March are as follows: the mean is 0, the variance is 1.02, the skewness coefficient is 0.05, the kurtosis coefficient is 3.22, and the Filliben coefficient is 0.987; in June, the corresponding indicators of Model2 were 0, 1.02, −0.57, 2.62 and 0.980; in December, the corresponding indicators of Model2 were 0, 1.02, 0.22, 3.59 and 0.985. The indicators for the remaining months also meet the following evaluation requirements: the mean approaches 0, the variance approaches 1, the skewness coefficient approaches 0, the kurtosis coefficient approaches 3, and the Filliben coefficient is not less than 0.978. Overall, on a 12-month scale, Model2 showed good rationality in each month in the YRB. In the results presented in Figure 3, the data points in the normal Q-Q plot lie approximately near the 45°reference line; the majority of residual points in the worm plot fall within the bounds of the 95% confidence interval. These features reflect a satisfactory fitting effect of the optimal non-stationary model.

4.2. Non-Stationary Meteorological Drought Assessment

4.2.1. Analysis of the Applicability of NSPI

Based on the numerical values output by the optimal non-stationary model, cumulative probability calculation and standardization were conducted to obtain the NSPI. A comparison and analysis with the traditional SPI were carried out, as shown in Figure 4. On a 12-month time scale, although the trends of SPI and NSPI over time are generally consistent, there are also certain differences, which may be attributed to the different results of precipitation estimation by the stationary model and the non-stationary model. The TFPW-MK test results (Figure 5, left column) showed that both series had a statistically significant upward trend. The SPI12 series had a Z value of 6.47 (p < 0.0001) and a Sen slope of 0.0044/a. The NSPI12 series had a Z value of 7.53 (p < 0.0001) and a Sen slope of 0.0044/a. Both Z values exceeded the critical value of 1.96 (α = 0.05). The NSPI12 Z value was higher than that of SPI12 (7.53 versus 6.47), which indicated that the non-stationary framework detected a more pronounced trend after climate covariates were included. Both results showed that dry and wet conditions in the YRB improved gradually over the study period. These Z values were higher than those from the standard M-K test, confirming that the upward trend remained significant after removing the effect of positive serial autocorrelation. The Pettitt test results (Figure 5, right column) identified a change point around 2004 for the SPI12 series and around 2012 for the NSPI12 series. In both cases, the mean values after the change point were higher than those before it, indicating a shift toward wetter conditions. The difference in the timing of the change points between the two indices reflected the different responses of the stationary and non-stationary frameworks to climate regime shifts. Both series showed clear interannual and interdecadal fluctuations. The average value of SPI12 was 0.90, ranging from −1.5 to over 2.5. This positive offset reflected the regional parameterization strategy: the Gamma distribution for SPI was fitted to the basin-averaged precipitation series and applied uniformly to all grid cells, rather than fitting pixel by pixel. Under this approach, spatial heterogeneity in precipitation regimes across the YRB prevented a perfect basin-mean standardization of zero. The relatively moist period occurred between the early 1960s and the late 1970s, while the significant drought peaks were in 1986–1987, 1992, and 1999–2001. The NSPI12 values fluctuated around −0.14: the values were generally high in the 1960s and 1970s, decreased significantly in the 1980s and 1990s, and showed an upward trend after the change point around 2012.

Based on the running theory, the drought characteristic variables of SPI and NSPI were extracted. The specific results are shown in Figure 6. From 1965 to 1966, SPI identified an extreme drought lasting for 13 months, while NSPI identified a moderate drought lasting for 16 months. Between 1979 and 1982, SPI identified a mild drought from February to April 1980 and a moderate drought from July 1980 to July 1981, while NSPI identified a mild drought lasting for 26 months from October 1979 to November 1981. The identification results show that both SPI and NSPI indicators can reflect the evolution trajectory of drought in the YRB from 1960 to 2024, but there are certain differences in the specific manifestations of drought (including the start and end times, duration and drought grades).

To better analyze and compare some aspects of drought evolution between SPI and NSPI, the drought characteristic variables and drought grades at the same drought onset time were extracted, as shown in Table 4. In 1960, 1969, 1995, and 2008, both SPI and NSPI classified the droughts as mild droughts, but the severity of droughts assessed by NSPI was often higher. For instance, in 1960, both were classified as mild droughts, but the drought intensity of NSPI was 5.800 and the drought severity was −0.967, both higher than those of SPI, which were 4.359 and −0.726 respectively. This indicates that NSPI assessment of droughts is usually more severe. In other years, the drought grades assessed by NSPI were higher than those by SPI. For the drought that started in October 1962 and lasted for 7 months, the precipitation in the YRB during this period was 106.6 mm. For the same event, NSPI classified the severity as moderate drought, while SPI determined it as mild drought. In 1974 and 1986, SPI classified the droughts as moderate, but NSPI classified them as severe. Under the standards of drought intensity and drought severity, NSPI was also significantly higher. Additionally, for 1969, SPI indicated that the drought lasted for 8 months, while NSPI only identified 2 months, but the drought intensity of NSPI was −0.713, which was lower than that of SPI, which was −0.651. This reflects the different sensitivities of the two indicators in identifying drought characteristic variables and drought grades. Overall, NSPI usually assigned higher drought grades and greater severity but lower intensity than SPI in most typical drought years. These differences arose because the two indices standardized precipitation against different reference distributions: NSPI conditioned on time-varying climate covariates through the GAMLSS model, whereas SPI used a stationary Gamma distribution fitted to the full historical record. Consequently, they quantified drought relative to different baselines and answered distinct scientific questions. Neither index was intrinsically more accurate; the choice between them depended on whether the analysis goal was to assess drought against a climate-conditioned baseline (NSPI) or an unconditional historical baseline (SPI).

4.2.2. Comparative Analysis of Drought Grades in Different Decades

The threshold values of SPI and NSPI indices can be used to identify meteorological drought events in the YRB. To better compare the two drought indices, the index series were divided into seven periods according to decades: P1 (1960–1969), P2 (1970–1979), P3 (1980–1989), P4 (1990–1999), P5 (2000–2009), P6 (2010–2019), and P7 (2020–2024). Because P7 covered only 5 years, the drought frequencies in Figure 7 were expressed as percentages (drought months divided by total months in each period), which normalized the differing period lengths in the denominator. Considering the occurrence frequencies of four drought grades according to the drought grade classification table, the results are shown in Figure 7.

As shown, from 1960 to 2024, SPI and NSPI have exhibited distinct patterns in the frequency of droughts of different severity levels. In view of the non-stationary characteristics of the climate system (such as long-term change trends and abrupt turning points), NSPI takes them into account during the modeling process. This approach may lead to systematic differences between NSPI and SPI in reflecting the frequency of droughts. As shown in Figure 7, for mild drought events, SPI-based frequencies exceeded those of NSPI in most periods except P2, P3 and P7. For moderate droughts, NSPI identified higher frequencies of moderate drought events than SPI in all periods except P3. In the case of severe droughts, NSPI showed higher frequencies than SPI during P1 and P2; neither index detected severe droughts in P6 or P7. Although NSPI produced lower frequencies than SPI in the remaining periods, the differences were minimal. For extreme drought events, neither SPI nor NSPI identified any occurrences during P2, P6 and P7; in all other periods, SPI yielded higher frequencies than NSPI. Overall, from mild to extreme drought levels, the frequencies generally decreased, reflecting the relative rarity of extreme drought events. Under the non-stationary framework, mild and moderate droughts occurred more frequently, indicating that the stationary assumption underlying SPI led to systematic deviations in drought frequency estimates relative to the climate-conditioned NSPI framework.

4.3. The Drought Characteristics of NSPI

Figure 8 presents the interannual variations of the YRB in terms of drought duration, intensity grades, severity, and peak values. The changing trends of the above four indicators (duration, intensity, severity and peak) can all be described by linear equations, reflecting long-term, slow and systematic evolution characteristics rather than short-term random fluctuations. From 1960 to 2024, at the 12-month scale across the YRB, the Mann–Kendall trend test after pre-whitening (TFPW-MK) was applied to the four drought characteristics. Drought duration exhibited a decreasing trend that was not statistically significant (Z = −1.35, p = 0.177), with a Sen slope of 0.0000 per year. In contrast, drought intensity declined significantly (Z = −2.20, p = 0.028), with a Sen slope of −0.0096 per year, and drought severity also decreased significantly (Z = −2.57, p = 0.010), with a Sen slope of −0.0035 per year. Meanwhile, the drought peak showed a significant upward trend (Z = 2.66, p = 0.0078), with a Sen slope of 0.0046 per year. The pronounced decline in drought severity indicated a substantial alleviation of overall drought stress, whereas the significant rise in peak severity suggested that extreme water deficits during individual drought events may have intensified. Overall, drought events in the basin exhibited a mitigating trend characterized by shortened duration, reduced intensity and diminished severity, albeit with an increasing peak severity.

For the drought events in the YRB at the monthly scale during 1960–2024, Figure 9 presents the spatial distribution of multiple characteristics, including average intensity, average duration, peak intensity, and occurrence frequency. In Figure 9a, the Loess Plateau region, especially its northwest part, is a concentrated area of high drought intensity values, while the upper plateau region of the YRB (such as Qinghai and Gansu) is a concentrated area of low drought intensity values. The distribution of average drought duration (Figure 9b) shows high spatial consistency with drought intensity. In the middle and lower reaches of the YRB, high-intensity areas are often accompanied by longer average drought durations. Drought events in this region frequently exhibit a compound characteristic of “high intensity and long duration”, which increases the possibility of causing disasters. The Hetao Irrigation District and some parts of the Loess Plateau also show longer drought durations, which may be related to the weak drought mitigation capacity of the ecosystem in these areas and the unstable seasonal precipitation supply. Drought events in the upper reaches are usually sudden and do not last long; thus, the duration of their droughts is relatively short. The peak intensity (Figure 9c) characterizes the possible extreme drought intensity during the observation period. Its high-value areas are concentrated in the grain-producing and densely populated regions within the middle and lower reaches of the YRB, such as eastern Henan, western Shandong, and northern Anhui. This indicates that these core areas suffer from extremely severe instantaneous water stress in extreme drought years, with particularly fatal impacts on agricultural production and water supply security. The peak intensity in the upper reaches is generally low, and some areas even show negative values. In Figure 9d, the spatial distribution of drought frequency is in good agreement with the spatial characteristics of drought intensity and duration, which further demonstrates the spatial differences in drought risk within the basin. The areas with frequent droughts are mainly concentrated in the northwest, especially in the Loess Plateau region. The frequency of droughts does not strictly follow a linear correspondence with their intensity. However, generally speaking, the high-frequency areas overlap spatially with the high-intensity and long-duration areas, jointly forming the “high-risk core area of drought” in the YRB.

Overall, the spatial pattern of drought risk in the YRB is clear and severe: the northwestern part of the basin, especially the Loess Plateau region, is under the quadruple threat of “high intensity, long duration, frequent occurrence, and strong peak”, making it the top priority for drought prevention and mitigation, as well as adaptive water resource management. Although the upstream region experiences relatively lower intensity and frequency of drought, its ecosystem exhibits greater vulnerability and insufficient resilience to post-drought recovery.

4.4. The Recurrence Period of Drought in NSPI

A mutual relationship exists between drought duration and severity. Using only the univariate return period may produce biased assessments, so the bivariate distribution method was introduced to describe drought characteristics more completely.

Five common Copula functions (Gaussian, t, Clayton, Frank, and Gumbel) were fitted to the D-S pairs extracted from both SPI12 and NSPI12 sequences. The goodness-of-fit results are presented in Table 5. All five Copulas converged successfully for both indices. For SPI, the Clayton Copula yielded the lowest AIC (−58.028), followed closely by the Gaussian Copula (−54.857). For NSPI, the Frank Copula achieved the lowest AIC (−84.071), followed by the Gaussian Copula (−83.715).

It should be noted that in preliminary exploratory fitting under the previous cumulative-probability-based return period formulation, the Frank Copula had failed to converge for NSPI. This occurred because the extreme dependence structure in the non-stationary drought characteristics caused the Frank parameter to approach its numerical limit (as Kendall’s tau neared unity), which triggered overflow in the optimization algorithm. After reformulating the return period as the AND-type joint exceedance probability and re-optimizing the marginal distributions, numerical stability improved substantially and the Frank Copula converged normally. Consequently, the Clayton Copula was retained for SPI and the Frank Copula for NSPI in all subsequent joint distribution analyses.

Figure 10 shows the joint probability contours (A, C) and the AND-type joint exceedance return period contours (B, D) for SPI-Clayton (A, B) and NSPI-Frank (C, D). Several differences were observed between the two indices. For joint probability (panels A and C), the NSPI-Frank probabilities were generally higher than the SPI-Clayton probabilities at comparable drought durations and severities. For example, when drought duration was approximately 10–15 months and severity ranged between 10–20, the SPI-Clayton joint probability fell between 0.2 and 0.4, whereas the NSPI-Frank probability exceeded 0.6. As duration and severity increased further, the NSPI-Frank probability contours shifted toward higher intervals, reflecting stronger lower-tail dependence captured by the Frank Copula under the non-stationary framework.

For the joint return period (panels B and D), the SPI-Clayton return period contours were relatively evenly distributed, while the NSPI-Frank contours were more compressed in the region of prolonged duration and high severity. This indicated that under the non-stationary framework, the co-occurrence of long duration and high severity was assigned a shorter return period than under the stationary framework, implying greater drought risk when climate covariates were considered. These differences demonstrated that the NSPI captured non-stationary variations in the dependence structure between drought duration and severity.

Table 6 further compares the sensitivity of the joint return period results to different Copula selection criteria. Under the AIC criterion, the optimal Copulas were Clayton for SPI12 and Frank for NSPI12; under the RMSE criterion, the optimal Copula for SPI12 was Gaussian and for NSPI12 was Clayton. For SPI12, the moderate drought return period varied only slightly between the two criteria (2.87 years under AIC versus 2.88 years under RMSE), and the severe drought return period ranged from 5.74 to 5.92 years. For NSPI12, the moderate drought return period ranged from 3.41 to 3.81 years, and the severe drought return period ranged from 8.40 to 9.54 years. These results indicated that the joint return period estimates were relatively robust to the choice of Copula selection criterion, particularly for SPI12. The differences were larger for NSPI12, reflecting the stronger influence of the non-stationary dependence structure on the Copula selection. Based on the primary AIC criterion, the Clayton Copula was retained for SPI12 and the Frank Copula for NSPI12 in the spatial mapping of drought return periods.

Figure 11 shows the spatial distribution of the AND-type joint exceedance return period for moderate drought (D = 3, S = 3, panel a) and severe drought (D = 6, S = 9, panel b) across the YRB, derived from the Clayton Copula for SPI12 and the Frank Copula for NSPI12. For moderate drought, the return period ranged from 2.46 years to 5.83 years. Shorter return periods (2.46–3.5 years), indicating higher drought risk, were concentrated in the middle and lower reaches, particularly the North China Plain. Longer return periods (up to 5.83 years) were found in the upper reaches. For severe drought, the return period ranged from 3.77 years to 9.15 years. The spatial gradient was similar: the middle and lower reaches had shorter return periods (3.77–5 years), while the upper reaches had longer return periods (up to 9.15 years). Both maps revealed a distinct spatial pattern of drought risk in the YRB. The middle and lower reaches, where population and agriculture are concentrated, were the core high-risk areas. The transition from the lower to upper reaches showed a clear increase in return period, reflecting reduced drought risk upstream.

4.5. Random Forest (RF) + LSTM Machine Learning Prediction Evaluation

To assess the short-term autoregressive predictability of NSPI, the cascaded RF-LSTM model was trained on data from 1961–1988 and applied to the prediction period of 2017–2024. Figure 12 presents the observed NSPI during 1989–2016, together with the model output for 2017–2024: the LSTM baseline is shown in blue, and the RF-corrected fused prediction in orange. During the drought interval of 2020–2021, when the observed NSPI fell below the drought threshold (−0.5), the predicted values captured the negative anomaly, confirming that the model detected the drought signal from the temporal structure of the series. The overestimation in 2023 suggests that the autoregressive framework has limited capacity to anticipate abrupt positive anomalies that deviate from the recent historical pattern.

The agreement between the observed and fused predicted NSPI series during the validation period (1989–2016) is shown in Figure 13. Overall, the predicted sequence and the actual sequence had basically consistent fluctuation trends. Specifically, during the drought period from 2020 to 2021, when the actual NSPI was below the drought threshold (−0.5), the predicted values also decreased simultaneously, indicating that the model has a certain ability to capture typical drought events. However, in 2023, the predicted value was slightly overestimated, which might be due to a systematic bias in the model’s response to positive anomaly signals.

The consistency and error distribution between the observed and predicted values were further shown by a scatter plot (Figure 14). The error metrics were R² = 0.429, RMSE = 0.370, and MAE = 0.279, indicating moderate explanatory power. The scatter points clustered tightly along the diagonal line, which suggested that the model could stably reproduce the changes in NSPI across the whole dynamic range. The model responded well to extreme dry and wet events: for intervals representing extreme low (NSPI < −0.5) and extreme high (NSPI > 1.0) values, the predicted values still changed in the same direction as the observed values, capturing the generation and dissipation of these key climate anomaly signals. Although the predicted and observed values did not fully match in these complex, nonlinear extreme cases, the overall trend consistency and the ability to capture signals showed that this fusion model had a solid predictive basis and potential for further optimization when dealing with extreme hydrological and climatic events in the YRB.

The prediction errors of the NSPI index sequence and the duration, intensity and peak of the drought events were calculated. The specific values are shown in Figure 15. Overall, the model had zero error in duration, indicating that the combined prediction model of random forest and LSTM was completely consistent with the actual duration of drought. The errors in the “peak” and “intensity” indicators were 0.159 and 0.230 respectively, which were relatively good. The error in the “NSPI index sequence” indicator was the highest (0.370). In summary, the random forest and LSTM fusion model demonstrated certain applicability in NSPI prediction and could well reflect the intensity and evolution trend of drought.

5. Discussion

5.1. The Influence of Spatio-Temporal Patterns on Drought in the Yellow River Basin

Regional differences and seasonal changes are key factors influencing drought. From the perspective of the basin scale, the YRB has a large span, ranging from plateau to plain, with significant topographic variations, complex climate, and uneven distribution of water resources [29,30]. These multiple factors result in different drought propagation times in various regions of the YRB. Yu et al. [38] found that droughts occurred more frequently in spring and that the middle reaches suffered from more severe droughts, consistent with the present identification of this region as a high-risk zone with shorter joint return periods. This agreement likely reflected the shared basin-scale focus, although their hydrological drought index differed from the present NSPI, which may explain their stronger seasonal sensitivity. Wan et al. [58] reported that drought propagation times were shortest in spring (3 months) and winter (6 months), consistent with the rapid drought response identified here. Their variable-scale analysis in the Henan sub-region differed from the present fixed 12-month basin-scale assessment, however, which precluded the resolution of intra-annual propagation dynamics in this study.

The spatial heterogeneity of drought risk identified in this study aligned with the regional climate gradient of the YRB. The northwestern Loess Plateau, where compound droughts of high severity, long duration, and high frequency converged, lies in the transitional zone between the East Asian monsoon and the continental arid climate systems. This location made the region particularly sensitive to the modulation of large-scale circulation patterns [12]. The dominance of AOI and SOI in the non-stationary model suggested that anomalies in the Arctic Oscillation and the Southern Oscillation altered the meridional moisture transport, reducing precipitation in this semi-arid zone. In contrast, the upper reaches of the YRB, characterized by low-severity and short-duration sudden droughts, were situated at higher altitudes where precipitation was primarily driven by local convective activity and plateau thermal dynamics rather than large-scale ocean-atmosphere teleconnections [14]. This explained why the non-stationary framework, which relied on large-scale climate indices as covariates, detected a more pronounced trend signal in the middle and lower reaches than in the upper plateau region. The spatial gradient of drought risk thus reflected both the physical geography of the basin and the varying relevance of the selected climate indices across sub-regions.

Compared with non-stationary drought assessments in other large river basins, the findings in the YRB shared commonalities while also exhibiting distinct regional characteristics. In the Yangtze River Basin, Gao et al. [15] found that the Pacific Decadal Oscillation and the Atlantic Multidecadal Oscillation were the dominant drivers of extreme precipitation non-stationarity, with drought risk concentrated in the middle and lower reaches. In the Huaihe River Basin, Wang et al. [27] reported that both climatic and anthropogenic indices improved the GAMLSS model fit for flood and low-flow frequency analysis. The YRB differed from these basins in two respects: first, the Southern Oscillation emerged as a key covariate alongside AOI and PDO, reflecting the stronger influence of the Indian Ocean-Pacific coupled system on the monsoon margin of northwestern China; second, the compound drought characteristics in the Loess Plateau exhibited a severity–duration–frequency intensity rarely observed in the humid and semi-humid basins of southern China. These comparisons highlighted that non-stationary drought patterns were basin-specific and that the selection of optimal climate covariates needed to be tailored to the regional teleconnection structures rather than applied uniformly across catchments.

This study constructed a non-stationary model, taking into account climatic factors. The northwestern Loess Plateau was identified as a key drought-prone area, consistent with Han et al. [59], although they used SPEI which incorporates evapotranspiration, whereas the present NSPI relied solely on precipitation. This index difference may explain a slight underestimation of warm-season drought severity here. Local differences may affect the applicability of the precipitation index. Future research could explore the driving forces of the drought process and seasonal impacts in local areas and deepen the understanding of the interaction among sub-basins in the YRB in different seasons and the influence of the lag effect of drought, thereby more comprehensively revealing the mechanism of drought spread.

5.2. The Sole Limitation of NSPI

This study constructed the NSPI by taking climate factors as covariates through the GAMLSS model, and used the TFPW-MK test for trend analysis and the Pettitt test for change point detection on the SPI and NSPI sequences. The Pettitt test identified significant change points around 2004 for SPI12 and around 2012 for NSPI12, both indicating a shift toward wetter conditions. The drought characteristics of NSPI were analyzed by combining the run theory and Copula function, and the future drought characteristics were detected using the RF-LSTM model. It should be noted that Cui et al. [36] previously developed a non-stationary standardized runoff index (NSRI) for the YRB using a GAMLSS model driven by four covariates (precipitation, temperature, water withdrawal, and reservoir index), focusing on hydrological drought at the Huayuankou station. The present study differed in both drought type and covariate selection: it concentrated on meteorological drought across the entire basin through the NSPI, which incorporated large-scale climate indices as covariates. Several limitations of this study should be acknowledged. First, the NSPI relied solely on precipitation without potential evapotranspiration (PET). In this water-limited basin, omitting PET may have underestimated warm-season drought severity, and NSPEI may offer a more comprehensive assessment. Second, this study was confined to a single basin and a single 12-month scale. Furthermore, the ML validation covered only eight years (2017–2024), and the zero-duration prediction error should be interpreted cautiously given the moderate skill (R² = 0.429, RMSE = 0.370). Finally, no external validation against observed drought impacts was performed.

Two methodological limitations should be acknowledged. First, the NSPI relied on the availability and quality of large-scale climate indices as covariates. Six indices (AOI, NAO, PDO, AMO, SOI, and NPI) were selected based on their established teleconnections with precipitation variability in the YRB [38]. However, this selection introduced several constraints. The temporal coverage and homogeneity of the index records determined the length and reliability of the resulting NSPI series; gaps or inhomogeneities in the raw data carried over into the non-stationary model parameters. The lagged correlations were optimized within a 0–12 month window, yet the stationarity of these lag structures across multi-decadal scales remained an implicit assumption. Changes in the underlying climate regime could alter the optimal lag or the relevance of certain indices, which would necessitate periodic re-evaluation [60]. Furthermore, the framework did not account for the uncertainty inherent in the indices themselves, such as differences between ENSO definitions or the sensitivity of PDO calculations to the reference period [61]. These factors introduced structural uncertainty that was difficult to quantify.

Second, the NSPI was validated through internal statistical checks (residual diagnostics, AIC comparison, and Copula goodness-of-fit) and relative comparisons with the traditional SPI, but an external validation against observed drought impacts was not performed. This was an important limitation because the utility of a drought index ultimately depends on its capacity to represent real-world water stress [9]. Previous work showed that the correlation between meteorological drought indices and sectoral impacts varied substantially across regions, with such indices often explaining only a modest fraction of the variance in agricultural or hydrological outcomes [60]. The lag between meteorological drought onset and crop damage, for example, ranged from weeks to months depending on crop type, soil properties, and irrigation practices. Without validation against crop yields, reservoir storage anomalies, or soil moisture deficits, the NSPI remained theoretically sound but operationally unverified. Future work should seek to relate NSPI series to local observations of drought consequences in the YRB, particularly in the middle and lower reaches where agricultural exposure was highest.

Apart from climatic factors, human activities are also significant contributors to non-stationary drought in the YRB [62]. To enhance the generalization ability of human influence indicators, Ren [62] further expanded on this basis, using the LSTM machine learning model to reconstruct the natural runoff sequence, and then derived the human activity influence index, while also calculating the reservoir influence index. Under the GAMLSS modeling framework, the two types of human influence indices were evaluated and optimized, improving the model fitting effect at most stations and confirming the effectiveness of coupling climatic and human driving factors. Moreover, in the field of hydrology and meteorology, the GAMLSS model has been widely applied in the analysis of precipitation, evapotranspiration, soil moisture, and runoff series [63]. Based on the GAMLSS model, various non-stationary coupled drought indices have been developed, such as NSPI, NSPEI, NSRI, and NSSMI. Among these, NSPEI deserves particular attention for the YRB because it integrates both precipitation and evapotranspiration, thereby encompassing a broader range of meteorological factors than NSPI. In water-limited basins such as the YRB, where rising temperatures intensify evaporative demand, NSPEI may provide a more comprehensive characterization of drought stress. Additionally, precipitation and evapotranspiration are generally regarded as the two most critical climatic factors influencing hydrological drought [64]. Specifically, insufficient precipitation directly leads to a reduction in surface and underground runoff; meanwhile, high temperatures intensify evapotranspiration, accelerating the loss of soil moisture [65]. All these indicate that the formation of drought is influenced by multiple factors. In the future, it is necessary to develop non-stationary coupled drought indices integrating multiple variables (such as precipitation-runoff, precipitation-soil moisture, evapotranspiration-soil moisture, etc.) and incorporate human influence into the drought research system. Comparing the performance of various indices in the analysis of drought driving mechanisms can help improve the overall accuracy and credibility of drought monitoring and assessment work.

6. Conclusions

(1): The research has revealed the key large-scale climate factors influencing the non-stationary changes in precipitation in the YRB. Among them, the influences of AOI, PDO and SOI are the most significant. When each factor acts on precipitation, the degree of influence and response time are different, and they simultaneously constitute the key driving factors for the precipitation changes in this basin. The NSPI provided a climate-conditioned baseline for identifying drought events of different grades relative to the traditional SPI, and the two indices quantified drought against different references that answered distinct questions.
(2): The NSPI12 series showed a significant upward trend (Z = 7.53, p < 0.0001), reflecting wetter conditions over the study period. Regarding the four drought characteristics, drought duration exhibited a decreasing trend that was not statistically significant (Z = −1.35, p = 0.177); drought intensity decreased significantly (Z = −2.20, p = 0.028); drought severity also decreased significantly (Z = −2.57, p = 0.010); and the drought peak increased significantly (Z = 2.66, p = 0.0078). These results indicated an overall mitigation of drought stress alongside an intensification of extreme peak deficits.
(3): The drought risk in the YRB showed a significant spatial gradient of ‘high frequency in the lower reaches and low frequency in the upper reaches’. The AND-type joint exceedance return periods of moderate and severe droughts ranged from 2.46 to 5.83 years and 3.77 to 9.15 years respectively. The densely populated and industrial-agricultural concentrated middle and lower reaches were the high-risk core areas, with shorter return periods, while the upper reaches had longer return periods.
(4): The cascaded RF-LSTM model demonstrated moderate predictive skill (R² = 0.429, RMSE = 0.370). The zero-duration prediction error reflected the limited eight-year test period rather than robust performance, as this short window could not encompass multi-year drought cycles.

Author Contributions

Writing—original draft: L.H.; Writing—review and editing: L.L. and X.W.; Conceptualization: Y.C. and Y.Z.; Supervision: Y.W.; Visualization: Z.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Yan’an University Doctoral Research Initiation Project (YDBK2017-19), the Key Industrial Chain Project of the Science and Technology Bureau of Yan’an City (2024-CYL-066), and the 2024 Scientific Research Plan Project of the Department of Education of Shaanxi Province (24JK0716).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data sources used in this study are included in the article. For any further inquiries, please contact the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Vicente-Serrano, S.M.; Quiring, S.M.; Peña-Gallardo, M.; Yuan, S.; Domínguez-Castro, F. A Review of Environmental Droughts: Increased Risk under Global Warming? Earth-Sci. Rev. 2020, 201, 102953. [Google Scholar] [CrossRef]
Lorenzo, M.N.; Pereira, H.; Alvarez, I.; Dias, J.M. Standardized Precipitation Index (SPI) Evolution over the Iberian Peninsula during the 21st Century. Atmos. Res. 2024, 297, 107132. [Google Scholar] [CrossRef]
Wu, S.; Wu, Y.; Wen, J. Future Changes in Precipitation Characteristics in China. Int. J. Climatol. 2019, 39, 3558–3573. [Google Scholar] [CrossRef]
Sun, Q.; Zhang, X.; Zwiers, F.; Westra, S.; Alexander, L.V. A Global, Continental, and Regional Analysis of Changes in Extreme Precipitation. J. Clim. 2021, 34, 243–258. [Google Scholar] [CrossRef]
Cheng, Q.; Zhong, F.; Wang, P. Potential Linkages of Extreme Climate Events with Vegetation and Large-Scale Circulation Indices in an Endorheic River Basin in Northwest China. Atmos. Res. 2021, 247, 105256. [Google Scholar] [CrossRef]
O’Gorman, P.A. Precipitation Extremes under Climate Change. Curr. Clim. Change Rep. 2015, 1, 49–59. [Google Scholar] [CrossRef] [PubMed]
Thackeray, C.W.; Hall, A.; Norris, J.; Chen, D. Constraining the Increased Frequency of Global Precipitation Extremes under Warming. Nat. Clim. Change 2022, 12, 441–448. [Google Scholar] [CrossRef]
Zhang, J.; Zhang, M.; Yu, J.; Yu, Y.; Yu, R. Comparison of Flash Drought and Traditional Drought on Characteristics and Driving Forces in Xinjiang. Remote Sens. 2023, 15, 4758. [Google Scholar] [CrossRef]
Mishra, A.K.; Singh, V.P. A Review of Drought Concepts. J. Hydrol. 2010, 391, 202–216. [Google Scholar] [CrossRef]
Wilhite, D.A.; Svoboda, M.D.; Hayes, M.J. Understanding the Complex Impacts of Drought: A Key to Enhancing Drought Mitigation and Preparedness. Water Resour. Manag. 2007, 21, 763–774. [Google Scholar] [CrossRef]
Bolan, S.; Padhye, L.P.; Jasemizad, T.; Govarthanan, M.; Karmegam, N.; Wijesekara, H.; Amarasiri, D.; Hou, D.; Zhou, P.; Biswal, B.K.; et al. Impacts of Climate Change on the Fate of Contaminants through Extreme Weather Events. Sci. Total Environ. 2024, 909, 168388. [Google Scholar] [CrossRef] [PubMed]
Baker, L.H.; Shaffrey, L.C.; Scaife, A.A. Improved Seasonal Prediction of UK Regional Precipitation Using Atmospheric Circulation. Int. J. Climatol. 2018, 38, e437–e453. [Google Scholar] [CrossRef]
Gizaw, M.S.; Gan, T.Y. Possible Impact of Climate Change on Future Extreme Precipitation of the Oldman, Bow and Red Deer River Basins of Alberta: CLIMATE CHANGE IMPACT ON EXTREME PRECIPITATION OF SOUTHERN ALBERTA. Int. J. Climatol. 2016, 36, 208–224. [Google Scholar] [CrossRef]
Guo, L.; Klingaman, N.P.; Demory, M.-E.; Vidale, P.L.; Turner, A.G.; Stephan, C.C. The Contributions of Local and Remote Atmospheric Moisture Fluxes to East Asian Precipitation and Its Variability. Clim. Dyn. 2018, 51, 4139–4156. [Google Scholar] [CrossRef]
Gao, T.; Wang, H.J.; Zhou, T. Changes of Extreme Precipitation and Nonlinear Influence of Climate Variables over Monsoon Region in China. Atmos. Res. 2017, 197, 379–389. [Google Scholar] [CrossRef]
Li, X.; Zhang, K.; Gu, P.; Feng, H.; Yin, Y.; Chen, W.; Cheng, B. Changes in precipitation extremes in the Yangtze River Basin during 1960–2019 and the association with global warming, ENSO, and local effects. Sci. Total Environ. 2021, 760, 144244. [Google Scholar] [CrossRef] [PubMed]
Song, X.; Zou, X.; Mo, Y.; Zhang, J.; Zhang, C.; Tian, Y. Nonstationary Bayesian Modeling of Precipitation Extremes in the Beijing-Tianjin-Hebei Region, China. Atmos. Res. 2020, 242, 105006. [Google Scholar] [CrossRef]
Yang, Q.; Ma, Z.; Fan, X.; Yang, Z.-L.; Xu, Z.; Wu, P. Decadal Modulation of Precipitation Patterns over Eastern China by Sea Surface Temperature Anomalies. J. Clim. 2017, 30, 7017–7033. [Google Scholar] [CrossRef]
Singh, J.; Ashfaq, M.; Skinner, C.B.; Anderson, W.B.; Mishra, V.; Singh, D. Enhanced Risk of Concurrent Regional Droughts with Increased ENSO Variability and Warming. Nat. Clim. Change 2022, 12, 163–170. [Google Scholar] [CrossRef]
Zhang, Q.; Wang, Y.; Singh, V.P.; Gu, X.; Kong, D.; Xiao, M. Impacts of ENSO and ENSO Modoki+a Regimes on Seasonal Precipitation Variations and Possible Underlying Causes in the Huai River Basin, China. J. Hydrol. 2016, 533, 308–319. [Google Scholar] [CrossRef]
Akwei, E.; Lu, B.H.; Zhang, H.W. Precipitation Trend Analysis by Mann-Kendall Test: A Case of Tianchang County Anhui Province, China. Adv. Mater. Res. 2013, 864–867, 2218–2223. [Google Scholar] [CrossRef]
Zhao, P.; He, Z. Temperature Change Characteristics in Gansu Province of China. Atmosphere 2022, 13, 728. [Google Scholar] [CrossRef]
Shen, M.; Chen, J.; Zhuan, M.; Chen, H.; Xu, C.-Y.; Xiong, L. Estimating Uncertainty and Its Temporal Variation Related to Global Climate Models in Quantifying Climate Change Impacts on Hydrology. J. Hydrol. 2018, 556, 10–24. [Google Scholar] [CrossRef]
Shi, W.; Huang, S.; Liu, D.; Huang, Q.; Leng, G.; Wang, H.; Fang, W.; Han, Z. Dry and Wet Combination Dynamics and Their Possible Driving Forces in a Changing Environment. J. Hydrol. 2020, 589, 125211. [Google Scholar] [CrossRef]
Wang, Y.; Duan, L.; Liu, T.; Li, J.; Feng, P. A Non-Stationary Standardized Streamflow Index for Hydrological Drought Using Climate and Human-Induced Indices as Covariates. Sci. Total Environ. 2020, 699, 134278. [Google Scholar] [CrossRef] [PubMed]
Rigby, R.A.; Stasinopoulos, D.M. Generalized Additive Models for Location, Scale and Shape. J. R. Stat. Soc. Ser. C Appl. Stat. 2005, 54, 507–554. [Google Scholar] [CrossRef]
Wang, M.; Jiang, S.; Ren, L.; Xu, C.-Y.; Shi, P.; Yuan, S.; Liu, Y.; Fang, X. Nonstationary Flood and Low Flow Frequency Analysis in the Upper Reaches of Huaihe River Basin, China, Using Climatic Variables and Reservoir Index as Covariates. J. Hydrol. 2022, 612, 128266. [Google Scholar] [CrossRef]
Ouyang, R.; Liu, W.; Fu, G.; Liu, C.; Hu, L.; Wang, H. Linkages between ENSO/PDO Signals and Precipitation, Streamflow in China during the Last 100 Years. Hydrol. Earth Syst. Sci. 2014, 18, 3651–3661. [Google Scholar] [CrossRef]
Liu, Y.; Wen, Y.; Zhao, Y.; Hu, H. Analysis of Drought and Flood Variations on a 200-Year Scale Based on Historical Environmental Information in Western China. Int. J. Environ. Res. Public Health 2022, 19, 2771. [Google Scholar] [CrossRef] [PubMed]
Ding, Y.; Zhang, L.; He, Y.; Cao, S.; Wei, X.; Guo, Y.; Ran, L.; Filonchyk, M. Spatiotemporal Evolution of Agricultural Drought and Its Attribution under Different Climate Zones and Vegetation Types in the Yellow River Basin of China. Sci. Total Environ. 2024, 914, 169687. [Google Scholar] [CrossRef] [PubMed]
Wang, T.; Fu, Z.; Zhang, S.; Li, Z. Water Erosion Risk Assessment and Predictive Modelling for Cultural Heritage under Climate Change: A Case Study of the Great Wall in the Yellow River Basin, China. J. Clean. Prod. 2025, 510, 145645. [Google Scholar] [CrossRef]
Guo, Y.; Huang, C.C.; Pang, J.; Zhou, Y.; Zha, X.; Mao, P. Reconstruction Palaeoflood Hydrology Using Slackwater Flow Depth Method in the Yanhe River Valley, Middle Yellow River Basin, China. J. Hydrol. 2017, 544, 156–171. [Google Scholar] [CrossRef]
Zhang, T.; Su, X.; Feng, K. The Development of a Novel Nonstationary Meteorological and Hydrological Drought Index Using the Climatic and Anthropogenic Indices as Covariates. Sci. Total Environ. 2021, 786, 147385. [Google Scholar] [CrossRef]
Yang, X.; Zhu, M.; Luo, D.; Ye, Z.; Jiang, J. Dynamic Analysis of Meteorological Drought and Its Recovery in the Yellow River Basin. Water Resour. Prot. 2025, 41, 127–133. [Google Scholar] [CrossRef]
Feng, J.; Qin, T.; Lv, X.; Liu, S.; Wen, J.; Chen, J. Frequent Drought and Flood Events in the Yellow River Basin, Increasing Future Drought Trends in the Middle and Upper Reaches. Int. J. Appl. Earth Obs. Geoinf. 2025, 139, 104511. [Google Scholar] [CrossRef]
Cui, H.; Jiang, S.; Ren, L.; Zhu, Y.; Du, S.; Yan, D. A Non-Stationary Hydrological Drought Assessment Method Based on Four-Element Driving. Water Resour. Prot. 2024, 40, 71–78. [Google Scholar] [CrossRef]
Li, H.; Zhang, Q.; Gu, X.; Shi, P. Impacts of Precipitation and Agricultural Planting Changes on Runoff in the Yellow River Basin. J. Nat. Resour. 2018, 33, 1402–1415. [Google Scholar]
Yu, J.; Xiao, R.; Liang, M.; Wang, Y.; Wang, S. Hydrological Drought Assessment of the Yellow River Basin Based on Non-Stationary Model. J. Hydrol. Reg. Stud. 2024, 56, 101974. [Google Scholar] [CrossRef]
Wang, M.; Chen, Y.; Li, J.; Zhao, Y. Spatiotemporal Evolution and Driving Force Analysis of Drought Characteristics in the Yellow River Basin. Ecol. Indic. 2025, 170, 113007. [Google Scholar] [CrossRef]
Huang, C. Study on Drought Drivers, Assessment and Prediction in the Yellow River Basin. Ph.D. Thesis, Xi’an University of Technology, Xi’an, China, 2021. [Google Scholar]
Wei, W.; Song, X.; Li, L.; Xia, L. Integrating Non-Stationarity into Extreme Rainfall Risk Assessment: A GAMLSS-Based Framework for Large-Scale Region. J. Hydrol. 2026, 667, 134934. [Google Scholar] [CrossRef]
McKee, T.B.; Doesken, N.J.; Kleist, J. The Relationship of Drought Frequency and Duration to Time Scales. In Proceedings of the 8th Conference on Applied Climatology, Anaheim, CA, USA, 17–22 January 1993; American Meteorological Society: Boston, MA, USA, 1993; pp. 179–183. [Google Scholar]
Gao, L.; Huang, J.; Chen, X.; Chen, Y.; Liu, M. Contributions of Natural Climate Changes and Human Activities to the Trend of Extreme Precipitation. Atmos. Res. 2018, 205, 60–69. [Google Scholar] [CrossRef]
Gan, R.; Gu, S.; Gao, Y.; Guo, L.; Hou, X. Non-Stationary Meteorological Drought Assessment in the Weihe River Basin Based on the GAMLSS Model. Res. Soil Water Conserv. 2024, 31, 149–160. [Google Scholar]
Lu, E. Determining the start, duration, and strength of flood and drought with daily precipitation: Rationale. Geophys. Res. Lett. 2009, 36, L12707. [Google Scholar] [CrossRef]
Jiang, W.; Wang, L.; Zhang, M.; Yao, R.; Chen, X.; Gui, X.; Sun, J.; Cao, Q. Analysis of Drought Events and Their Impacts on Vegetation Productivity Based on the Integrated Surface Drought Index in the Hanjiang River Basin, China. Atmos. Res. 2021, 254, 105536. [Google Scholar] [CrossRef]
Lee, S.-H.; Yoo, S.-H.; Choi, J.-Y.; Bae, S. Assessment of the Impact of Climate Change on Drought Characteristics in the Hwanghae Plain, North Korea Using Time Series SPI and SPEI: 1981–2100. Water 2017, 9, 579. [Google Scholar] [CrossRef]
Yevjevich, V.M. An Objective Approach to Definitions and Investigations of Continental Hydrologic Droughts. J. Hydrol. 1969, 7, 353. [Google Scholar] [CrossRef]
Yue, S.; Pilon, P.; Phinney, B.; Cavadias, G. The Influence of Autocorrelation on the Ability to Detect Trend in Hydrological Series. Hydrol. Process. 2002, 16, 1807–1829. [Google Scholar] [CrossRef]
Hirsch, R.M.; Slack, J.R.; Smith, R.A. Techniques of Trend Analysis for Monthly Water Quality Data. Water Resour. Res. 1982, 18, 107–121. [Google Scholar] [CrossRef]
Kundzewicz, Z.W.; Robson, A.J. Change Detection in Hydrological Records—A Review of the Methodology / Revue Méthodologique de La Détection de Changements Dans Les Chroniques Hydrologiques. Hydrol. Sci. J. 2004, 49, 7–19. [Google Scholar] [CrossRef]
Pettitt, A.N. A Non-Parametric Approach to the Change-Point Problem. Appl. Stat. 1979, 28, 126. [Google Scholar] [CrossRef] [PubMed]
Nabaei, S.; Sharafati, A.; Yaseen, Z.M.; Shahid, S. Copula Based Assessment of Meteorological Drought Characteristics: Regional Investigation of Iran. Agric. For. Meteorol. 2019, 276–277, 107611. [Google Scholar] [CrossRef]
Liu, R.; Sun, P.; Zhang, Q.; Bian, Y.; Ma, Z.; Zou, Y.; Lv, Y. Characteristics of Non-Stationary Meteorological Drought in the Hengduan Mountains Based on Copula. Res. Soil Water Conserv. 2022, 29, 213–223. [Google Scholar] [CrossRef]
Lary, D.J.; Alavi, A.H.; Gandomi, A.H.; Walker, A.L. Machine Learning in Geosciences and Remote Sensing. Geosci. Front. 2016, 7, 3–10. [Google Scholar] [CrossRef]
Hu, S.; Li, Y.; Xie, J.; Huang, K.; Gong, W.; Yang, L. Prediction of the Onset of the South China Sea Summer Monsoon Based on Machine Learning. Atmos. Ocean. Sci. Lett. 2025, 19, 100676. [Google Scholar] [CrossRef]
Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
Wan, F.; Zhang, F.; Wang, Y.; Peng, S.; Zheng, X. Study on the Propagation Law of Meteorological Drought to Hydrological Drought under Variable Time Scale: An Example from the Yellow River Water Supply Area in Henan. Ecol. Indic. 2023, 154, 110873. [Google Scholar] [CrossRef]
Han, Z.; Huang, Q.; Huang, S.; Leng, G.; Bai, Q.; Liang, H.; Wang, L.; Zhao, J.; Fang, W. Spatial-Temporal Dynamics of Agricultural Drought in the Loess Plateau under a Changing Environment: Characteristics and Potential Influencing Factors. Agric. Water Manag. 2021, 244, 106540. [Google Scholar] [CrossRef]
Blauhut, V.; Stoelzle, M.; Ahopelto, L.; Brunner, M.I.; Teutschbein, C.; Wendt, D.E.; Akstinas, V.; Bakke, S.J.; Barker, L.J.; Bartošová, L.; et al. Lessons from the 2018–2019 European Droughts: A Collective Need for Unifying Drought Risk Management. Nat. Hazards Earth Syst. Sci. 2022, 22, 2201–2217. [Google Scholar] [CrossRef]
Bachmair, S.; Svensson, C.; Hannaford, J.; Barker, L.J.; Stahl, K. A Quantitative Analysis to Objectively Appraise Drought Indicators and Modeldrought Impacts. Hydrol. Earth Syst. Sci. 2016, 20, 2589–2609. [Google Scholar] [CrossRef]
Ren, M.; Jiang, S.; Ren, L.; Yan, Y.; Cui, H.; Zhu, Y.; Du, S.; He, M.; Wang, M.; Xu, C.-Y. Nonstationary Spatiotemporal Evolution of Extreme Flood and Low Flow Affected by Climate Change and Human Activities in the Yellow River Basin. J. Hydrol. Reg. Stud. 2025, 61, 102640. [Google Scholar] [CrossRef]
Villarini, G.; Smith, J.A.; Napolitano, F. Nonstationary Modeling of a Long Record of Rainfall and Temperature over Rome. Adv. Water Resour. 2010, 33, 1256–1267. [Google Scholar] [CrossRef]
Jiang, S.; Wang, M.; Ren, L.; Xu, C.; Yuan, F.; Liu, Y.; Lu, Y.; Shen, H. A Framework for Quantifying the Impacts of Climate Change and Human Activities on Hydrological Drought in a Semiarid Basin of Northern China. Hydrol. Process. 2019, 33, 1075–1088. [Google Scholar] [CrossRef]
Li, Z.; Chen, Y.; Fang, G.; Li, Y. Multivariate Assessment and Attribution of Droughts in Central Asia. Sci. Rep. 2017, 7, 1316. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Map of the Yellow River Basin research area.

Figure 2. Boxplot comparison of GD, AIC and SBC for Models 1, 2, and 3.

Figure 3. Normal Q-Q plot and worm plot of the optimal GAMLSS model in the Yellow River Basin.

Figure 4. Time series of monthly SPI and NSPI for the Yellow River Basin from 1960 to 2024.

Figure 5. TFPW-MK and Pettitt tests for SPI12 and NSPI12 (1960–2024).

Figure 6. The distribution of drought grades estimated based on SPI and NSPI from 1960 to 2024.

Figure 7. The occurrence frequencies of meteorological droughts of different grades determined based on SPI and NSPI in different periods.

Figure 8. Changes in the duration, intensity, severity and peak of droughts in the Yellow River Basin from 1960 to 2024.

Figure 9. Spatial distribution of the average drought intensity (a), average drought duration (b), intensity peak (c), and drought frequency (d) of the non-stationary model.

Figure 10. Contour lines of joint distribution probability (A,C) and joint return period (B,D) of drought characteristic variables of SPI (A,B) and NSPI (C,D).

Figure 11. Spatial distribution of drought recurrence periods under the moderate drought (a) and severe drought (b) scenarios in the Yellow River Basin.

Figure 12. The verified NSPI sequence from 1989 to 2016 and the predicted sequence from 2017 to 2024.

Figure 13. Comparison of the actual observed and model-predicted changes in the time series sequence NSPI.

Figure 14. Scatter plot of actual observed values and predicted values.

Figure 15. Quantitative evaluation of prediction errors for exponential sequences, duration, intensity and peak value.

Table 1. Criteria for drought classification.

The Drought Level Indicates	Drought Index Value	Drought Severity Level
$D_{0}$	SPI/NSPI > −0.5	No drought
$D_{1}$	−1 < SPI/NSPI ≤ −0.5	Mild drought
$D_{2}$	−1.5 < SPI/NSPI ≤ −1	Moderate Drought
$D_{3}$	−2 < SPI/NSPI ≤ −1.5	Severe drought
$D_{4}$	SPI/NSPI ≤ −2	Extreme drought

Table 2. Climate factors for monthly precipitation series during 1960–2024 in the Yellow River Basin.

Month	Climatic Factors
Month	AOI	NAO	PDO	AMO	SOI	NPI
1	AOI₁₀	NAO₀	PDO₁₂	AMO₈	SOI₅	NPI₂
1	0.232 **	0.140	0.158	−0.067	0.279 **	0.134
2	AOI₁₁	NAO₁	PDO₀	AMO₀	SOI₆	NPI₀
2	0.220 *	0.125	−0.182 *	−0.098	0.282 **	0.157
3	AOI₈	NAO₈	PDO₁	AMO₀	SOI₇	NPI₀
3	0.227 **	0.126	−0.174 *	−0.114	0.284 **	0.156
4	AOI₁	NAO₄	PDO₁	AMO₀	SOI₈	NPI₁
4	0.195 *	−0.155	−0.187 *	−0.096	0.324 **	0.176 *
5	AOI₁	NAO₅	PDO₂	AMO₂	SOI₁₀	NPI₂
5	0.210 *	−0.187 *	−0.220 *	−0.077	0.283 **	0.267 **
6	AOI₃	NAO₆	PDO₃	AMO₄	SOI₁₁	NPI₃
6	0.173 *	−0.140	−0.188 *	−0.077	0.224 **	0.232 **
7	AOI₀	NAO₇	PDO₄	AMO₅	SOI₂	NPI₄
7	0.160	−0.184 *	−0.115	−0.079	0.252 **	0.143
8	AOI₁₁	NAO₁₂	PDO₀	AMO₀	SOI₃	NPI₁
8	0.201 *	−0.170 *	−0.133	−0.035	0.293 **	0.130
9	AOI₂	NAO₂	PDO₁	AMO₄	SOI₁	NPI₂
9	0.263 **	0.141	−0.186 *	−0.065	0.310 **	0.215 *
10	AOI₃	NAO₃	PDO₁	AMO₇	SOI₂	NPI₁₂
10	0.251 **	0.132	−0.159	−0.067	0.321 **	−0.165
11	AOI₄	NAO₄	PDO₂	AMO₈	SOI₃	NPI₁
11	0.214 *	0.137	−0.164	−0.082	0.314 **	0.134
12	AOI₅	NAO₅	PDO₃	AMO₉	SOI₄	NPI₅
12	0.224 **	0.126	−0.145	−0.091	0.288 **	0.140

Note: ** and * respectively indicate that the significance test has passed at the 0.01 and 0.05 levels. The optimal lag time is given by the number in the subscript of the climate factor (in months). For example, AOI10 denotes the AOI at a 10-month lag.

Table 3. Distribution parameter covariates of the optimal model for each month from 1960 to 2024.

Month	Optimal Model Distribution Parameter Covariates
Month	$μ$	$σ$
1	SOI₄, SOI₅, SOI₆, SOI₈, AOI₁₀	_
2	SOI₂, SOI₅, SOI₆, SOI₇, SOI₉	_
3	SOI₆, SOI₇, SOI₈, SOI₁₀, AOI₈	_
4	SOI₄, SOI₇, SOI₈, SOI₉, SOI₁₁	_
5	SOI₉, SOI₈, SOI₁₀, SOI₁₂, NPI₂	_
6	SOI₉, SOI₁₀, SOI₁₁, NPI₃, PDO₃	_
7	SOI₂, NAO₇, AOI₀, AOI₁₀	_
8	SOI₀, SOI₃, AOI₁, AOI₁₁	_
9	SOI₀, SOI₁, SOI₂, SOI₄, AOI₂	_
10	SOI₁, SOI₂, SOI₃, SOI₅, AOI₃	_
11	SOI₂, SOI₃, SOI₄, SOI₆, AOI₄	_
12	SOI₀, SOI₄, SOI₅, SOI₇	_

Note: “_” represents “not found”.

Table 4. Comparison of drought characteristic variables of typical drought years based on SPI and NSPI.

Drought Year	Start (Year-Month)	SPI				NSPI
Drought Year	Start (Year-Month)	D	S	I	Grade	D	S	I	Grade
1960	1960-12	6	4.359	−0.726	D1	6	5.800	−0.967	D1
1962	1962-10	7	6.881	−0.983	D1	7	7.537	−1.077	D2
1969	1969-10	8	5.207	−0.651	D1	2	1.426	−0.713	D1
1974	1974-08	11	11.733	−1.067	D2	11	17.935	−1.630	D3
1986	1986-09	22	32.219	−1.465	D2	12	20.214	−1.685	D3
1995	1995-11	6	3.980	−0.663	D1	6	3.925	−0.654	D1
2008	2008-10	13	6.779	−0.521	D1	9	7.618	−0.846	D1

Note: D = drought duration (months); S = drought severity (cumulative absolute NSPI/SPI value); I = drought intensity (lowest NSPI/SPI value during the event, more negative = more severe); Grade = drought class (D1 mild, D2 moderate, D3 severe).

Table 5. Evaluation of goodness of fit for Copula functions.

Drought Index	Copula Function	Parameter	AIC	RMSE
SPI	Gaussian	0.938	−54.857	0.060
	t	0.938	−52.856	0.060
	Clayton	6.160	−58.028	0.065
	Frank	16.314	−52.929	0.063
	Gumbel	3.053	−28.336	0.069
NSPI	Gaussian	0.975	−83.715	0.080
	t	0.975	−81.713	0.080
	Clayton	6.957	−66.794	0.078
	Frank	30.116	−84.071	0.079
	Gumbel	5.663	−54.678	0.086

Table 6. Comparison of optimal Copula functions and joint return periods under different selection criteria.

Drought Index	Optimal Choice Rule	Optimal Copula	Drought Recurrence Period/Year		Joint Probability Beyond
Drought Index	Optimal Choice Rule	Optimal Copula	Moderate Drought	Severe Drought	Moderate Drought	Severe Drought
SPI12	AIC	Clayton	2.87	5.92	0.8264	0.4009
SPI12	RMSE	Gaussian	2.88	5.74	0.8246	0.4136
NSPI12	AIC	Frank	3.81	9.54	0.5795	0.2316
NSPI12	RMSE	Clayton	3.41	8.40	0.6476	0.2631

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liang, L.; Hu, L.; Wang, X.; Zhu, Y.; Chao, Y.; Wang, Y.; Liu, Z. Identification of the Non-Stationarity of Meteorological Drought in the Yellow River Basin and Assessment of the Applicability of the GAMLSS Model. Sustainability 2026, 18, 6383. https://doi.org/10.3390/su18136383

AMA Style

Liang L, Hu L, Wang X, Zhu Y, Chao Y, Wang Y, Liu Z. Identification of the Non-Stationarity of Meteorological Drought in the Yellow River Basin and Assessment of the Applicability of the GAMLSS Model. Sustainability. 2026; 18(13):6383. https://doi.org/10.3390/su18136383

Chicago/Turabian Style

Liang, Li’e, Liulong Hu, Xiaohan Wang, Yonghua Zhu, Yan Chao, Yong Wang, and Ziyi Liu. 2026. "Identification of the Non-Stationarity of Meteorological Drought in the Yellow River Basin and Assessment of the Applicability of the GAMLSS Model" Sustainability 18, no. 13: 6383. https://doi.org/10.3390/su18136383

APA Style

Liang, L., Hu, L., Wang, X., Zhu, Y., Chao, Y., Wang, Y., & Liu, Z. (2026). Identification of the Non-Stationarity of Meteorological Drought in the Yellow River Basin and Assessment of the Applicability of the GAMLSS Model. Sustainability, 18(13), 6383. https://doi.org/10.3390/su18136383

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Identification of the Non-Stationarity of Meteorological Drought in the Yellow River Basin and Assessment of the Applicability of the GAMLSS Model

Abstract

1. Introduction

2. Study Area and Data

2.1. The Study Area Overview

2.2. Data Sources

3. Methods

3.1. Standardized Precipitation Index (SPI)

3.2. Non-Stationary Standardized Precipitation Index (NSPI)

3.2.1. Screening Climate Factors

3.2.2. GAMLSS

3.3. Run-Length Theory

3.4. Trend and Turning Point Test

3.5. Copula Function

3.6. Machine Learning—Random Forest (RF) + LSTM

4. Results

4.1. The Construction of Non-Stationary Models

4.1.1. Screening of Climatic Factors

4.1.2. GAMLSS

4.2. Non-Stationary Meteorological Drought Assessment

4.2.1. Analysis of the Applicability of NSPI

4.2.2. Comparative Analysis of Drought Grades in Different Decades

4.3. The Drought Characteristics of NSPI

4.4. The Recurrence Period of Drought in NSPI

4.5. Random Forest (RF) + LSTM Machine Learning Prediction Evaluation

5. Discussion

5.1. The Influence of Spatio-Temporal Patterns on Drought in the Yellow River Basin

5.2. The Sole Limitation of NSPI

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI