Future Hydrological Drought and Water Sustainability in the Sava River Basin: Machine Learning Projections Under Climate Change Scenarios

Leščešen, Igor; Josić, Milan; Gnjato, Slobodan; Petrović, Ana M.; Bajtek, Zbyněk

doi:10.3390/su18062678

Open AccessArticle

Future Hydrological Drought and Water Sustainability in the Sava River Basin: Machine Learning Projections Under Climate Change Scenarios

by

Igor Leščešen

^1,*

,

Milan Josić

²

,

Slobodan Gnjato

³

,

Ana M. Petrović

⁴

and

Zbyněk Bajtek

¹

Institute of Hydrology SAS, Dúbravská Cesta 9, 841 04 Bratislava, Slovakia

²

Department of Geography, Tourism and Hotel Management, Faculty of Sciences, University of Novi Sad, Trg Dositeja Obradovića 3, 21000 Novi Sad, Serbia

³

Faculty of Natural Science and Mathematics, University of Banja Luka, Mladena Stojanovića 2, 78000 Banja Luka, Bosnia and Herzegovina

⁴

Geographical Institute “Jovan Cvijić”, Serbian Academy of Sciences and Arts, Đure Jakšića 9, 11000 Belgrade, Serbia

^*

Author to whom correspondence should be addressed.

Sustainability 2026, 18(6), 2678; https://doi.org/10.3390/su18062678

Submission received: 26 January 2026 / Revised: 26 February 2026 / Accepted: 5 March 2026 / Published: 10 March 2026

(This article belongs to the Special Issue Impacts of Climate Change on Water Sustainability: Rivers, Floods, Droughts, and Extreme Precipitation)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Hydrological drought projections are crucial for climate-resilient water management; however, many basins lack calibrated process-based models that can readily be forced with climate scenarios. This study develops a purely data-driven framework to forecast the Streamflow Drought Index (SDI) from standardized meteorological indices and to assess future drought regimes under different emission pathways. We used a 60-year monthly record (1961–2020) of the Standardized Precipitation Index (SPI), the Standardized Temperature Index (STI), the Standardized Precipitation–Evapotranspiration Index (SPEI), and the SDI for the Sava River Basin. Correlation analysis showed that the SDI is primarily controlled by the short-lag SPI (0–1 months), whereas the STI and SPEI play a minor role. Several machine learning models were tested for one-month-ahead SDI prediction; a Random Forest (RF) with hyperparameters optimized by TimeSeriesSplit cross-validation, combined with linear-scaling bias correction, clearly outperformed XGBoost, Elastic Net, support vector regression, and a multilayer perceptron. On the independent test period (2009–2020), the RF achieved MAE ≈ 0.62, RMSE ≈ 0.83, NSE ≈ 0.49, and KGE ≈ 0.65. Using SPI/STI/SPEI projections from RCP2.6, RCP4.5, and RCP8.5, the RF produced monthly SDI projections for 2021–2050, revealing increasingly frequent, severe, and persistent streamflow droughts with higher emissions. The results demonstrate that carefully tuned ensemble tree models driven solely by standardized climate indices can provide skilful and interpretable SDI projections for drought risk assessment, supporting sustainable, climate-resilient water resources planning and adaptation in this transboundary basin.

Keywords:

Streamflow Drought Index (SDI); random forest; hydrological drought forecasting; machine learning; climate change scenarios

1. Introduction

Hydrological droughts, characterized by reduced river discharge and streamflow deficits, represent a distinct hazard from meteorological and agricultural droughts because they reflect catchment storage, routing, and human withdrawals. As a result, they determine impacts on water supply, ecosystems, navigation, and hydropower at various scales, including transboundary basins [1,2,3]. Recent global assessments indicate that extreme hydrological droughts are projected to become more frequent and severe with warming, with dominant drivers shifting from precipitation to temperature in some regions [4]. This change increases risks to water-dependent sectors and raises concerns about intergenerational exposure to water stress [5,6,7]. Comparisons of multiple drought types explicitly contrast meteorological, agricultural (soil moisture), and hydrological (runoff/discharge) droughts, revealing differing trends and underscoring the need to select indices appropriate for each drought type [8]. Because impacts such as reservoir inflows, navigation depth, and turbine heads respond to streamflow rather than precipitation deficits alone, streamflow-based indices are essential for impact-oriented drought assessment and management in international river basins [2,9].

Standardized indices remain the operational backbone of drought monitoring and climate change assessments because they enable comparability across regions and timescales. Canonical examples include the Standardized Precipitation Index (SPI), the Standardized Precipitation–Evapotranspiration Index (SPEI), and streamflow- or runoff-focused indices such as the Standardized Runoff/Streamflow Drought Index (SRI/SDI) [10,11,12]. The SDI captures cumulative streamflow deficits and hydrological persistence directly relevant to water resource impacts, making it particularly suitable for hydrological drought diagnostics and risk quantification [13,14,15]. Comparative assessments show that runoff-based indices better represent hydrological drought propagation and characteristics across scenarios than precipitation-based indices alone, especially when agricultural water withdrawals are considered [16,17]. Statistical standardized indices are easy to compute from modelled or observed series but can misrepresent hydrological impacts if evapotranspiration, land-surface processes, or human abstractions change. Physically based hydrological models can represent these processes and water use but introduce structural and parameter uncertainty and are computationally intensive in large multi-model ensembles [10,18]. Combining standardized indices with hydrological or hybrid approaches helps balance interpretability and process realism while revealing the dominant sources of projection uncertainty [19].

Hydrological drought projection studies commonly use multi-model climate forcing frameworks (GCM–RCM ensembles and EURO-CORDEX) and Representative Concentration Pathways (RCP2.6, RCP4.5, RCP8.5, or similar SSP scenarios) to examine emission-dependent changes in drought metrics, with ensemble approaches quantifying climate forcing uncertainty [20,21,22]. Regional analyses across Europe show spatially divergent low-flow trends and strong sensitivity of drought characteristics (frequency, severity, and duration) to scenario selection and climate model spread, especially for low-flow extremes, underscoring the need to propagate uncertainty from climate forcing through hydrological response [23,24,25]. Studies using large ensembles show that climate model and scenario uncertainty often dominate hydrological drought uncertainty at continental scales, while hydrological model and parameter uncertainty can be critical in certain catchments and regimes, particularly where local processes and human water use are significant [5,26,27]. Europe-scale hydrological simulation ensembles, driven by multiple Euro-CORDEX RCM combinations, demonstrate contrasting responses across subregions: Mediterranean and some continental zones face stronger low-flow intensification, while northern basins may show different responses. These results highlight the sensitivity of drought trends to model selection and internal variability [25,28,29].

Machine learning (ML) methods have recently been applied to streamflow simulation and drought projection because they can flexibly model nonlinear relationships, memory effects, and compound drivers without explicit process parameterization [30,31,32]. Random Forest, gradient boosting, and deep neural networks have been used to attribute meteorological drivers, emulate hydrological models, and project runoff or drought indices with competitive skill, especially in data-rich settings or where rapid multi-ensemble emulation is needed [5,33,34,35]. Hybrid frameworks that constrain ML projections with hydrological understanding or multi-model ensembles have proven effective at reproducing low-flow behaviour and quantifying bivariate drought characteristics (severity and duration) while reducing computational cost compared to full hydrological multi-model chains [30,32,36]. The Random Forest–Copula–Factorial Analysis (RFCFA) method, for example, integrates Random Forest with copulas to predict meteorological-to-hydrological drought propagation and reveal major drivers and uncertainties [37]. Support Vector Machine (SVM) variants optimized with metaheuristic algorithms have been used for drought-related discharge prediction and groundwater storage simulation under future RCP scenarios, demonstrating the breadth of machine learning applications in hydrological climate change studies [38,39]. However, machine learning approaches inherit uncertainties from climate forcing and can be sensitive to regime shifts in training data, so careful bias correction, cross-validation, and physical interpretability checks are required [2,40].

Europe and Central Europe show heterogeneous projected changes in hydrological drought, with Mediterranean and some continental zones experiencing stronger low-flow intensification, while northern basins may exhibit different responses, as indicated by continental assessments of low-flow indices and multi-model ensembles [25,29,41]. Many basin-scale future drought studies use deterministic hydrological modelling, variable-threshold methods, or indices derived from physically based runoff simulations. These approaches are valuable but often computationally demanding and limited in ensemble size [21,22,42]. In the Western Balkans, and specifically for the transboundary Sava River Basin—a major Danube tributary with significant roles in hydropower, navigation, and water supply—the literature does not provide a substantive record of machine learning-based SDI projections under RCP scenarios [43]. This evidentiary gap aligns with broader calls for improved ML–hydrology integration, better uncertainty decomposition in drought projections, and enhanced drought early-warning systems that use impact-relevant streamflow diagnostics and scenario-consistent climate forcing in transboundary contexts [5,44,45].

This study addresses the identified methodological and regional gap by applying a Random Forest framework to project the Streamflow Drought Index (SDI) in the Sava River Basin, using predictor indices (SPI, STI, and SPEI) derived from scenario-consistent climate projections under RCP2.6, RCP4.5, and RCP8.5 through 2050. Building on evidence that Random Forest and hybrid machine learning frameworks can capture nonlinear drought propagation and emulate multi-model behaviour [32,37], this contribution couples machine learning-based SDI forecasting with scenario-consistent GCM/RCM forcing to provide one of the first systematic machine learning assessments of future hydrological drought for this major transboundary basin. It explicitly examines low-flow extremes and uncertainty propagation from climate forcing to impact-oriented streamflow deficits [5,30,36]. The approach leverages the computational efficiency and nonlinear modelling capacity of Random Forest while maintaining interpretability through standardized drought indices, supporting climate-informed water management and drought preparedness in a data-sparse, transboundary setting [37,45,46]. By quantifying future hydrological drought trajectories under alternative emission pathways, this framework directly supports sustainable, climate-resilient water resources management and long-term adaptation planning in the Sava River Basin.

2. Data and Methods

We aimed to develop and apply a data-driven framework to forecast the Streamflow Drought Index (SDI) using meteorological and climatic drought indices derived under alternative climate change scenarios. Specifically, we used the SPI, STI, and SPEI, calculated from observed and projected temperatures and precipitation for three Representative Concentration Pathways (RCP2.6, RCP4.5, and RCP8.5), as predictors in a range of machine learning models to generate monthly SDI projections at the Sremska Mitrovica gauge. We then applied non-parametric trend and change-point tests to both historical and forecast SDI series (up to 2050) to identify statistically significant long-term trends and potential regime shifts, thus providing an integrated assessment of future drought hazards and their trajectories under contrasting climate scenarios.

We first assembled an observational hydrometeorological dataset. The regional climate model outputs from SMHI-RCA4 used in this study are publicly available through the EURO-CORDEX archive and the Earth System Grid Federation (ESGF) data portals. Station-based precipitation, temperature, and discharge data were obtained from the national hydrometeorological services of the Republic of Srpska, the Hydrometeorological Service of the Republic of Serbia, and the Croatian Meteorological and Hydrological Service. Monthly precipitation and air temperature data were collected from three meteorological stations: Sremska Mitrovica (Serbia), Slavonski Brod (Croatia), and Bijeljina (Bosnia and Herzegovina). Monthly river discharge data for the Sava River Basin were obtained for the Sremska Mitrovica hydrological station (Figure 1).

For the period 1961–2020, we calculated four monthly drought indices: the Standardized Precipitation Index (SPI), the Standardized Temperature Index (STI), the Standardized Precipitation–Evapotranspiration Index (SPEI), and the Streamflow Drought Index (SDI). At a monthly resolution, this corresponds to 720 observations per index (January 1961–December 2020) for the SPI, STI, SPEI, and SDI. For each index, we first aggregated the relevant variable over a chosen accumulation window, and then standardised it relative to its historical distribution. The Standardized Precipitation Index (SPI) is defined as:

S P I = (P_{k} - μ_{P, k}) / σ_{P, k}

(1)

where P_k is the k-month accumulated precipitation and μ_P,k and σ_P_,k are its long-term mean and standard deviation for that time scale [9]. The Standardized Temperature Index (STI) is analogously:

S T I = (T_{k} - μ_{T, k}) / σ_{T, k}

(2)

where T_k is the aggregated air temperature anomaly, allowing us to isolate thermal effects on drought. The Standardized Precipitation–Evapotranspiration Index (SPEI) uses the climatic water balance:

D_{k} = P_{k} - {P E T}_{k}

(3)

where D_k represents moisture deficit/surplus; P_k represents precipitation; and PET_k represents potential evapotranspiration, computed as:

S P E I = (D_{k} - μ_{D, k}) / σ_{D, k}

(4)

after fitting an appropriate distribution to D_k [11,19]. The Streamflow Drought Index (SDI) is defined as:

S D I = (V_{k} - μ_{V, k}) / σ_{V, k}

(5)

where V_k is the cumulative streamflow volume over k months [13]. As all indices are standardised, they share a common categorical interpretation based on thresholds along the standard normal scale, which facilitates direct comparison of drought severity and duration across meteorological, climatic, and hydrological variables [9]. We adopted the conventional classification shown in Table 1. These series provided a consistent basis for quantifying meteorological and hydrological drought conditions and exploring their temporal and spatial linkages within the study basin.

To analyse the influence of spatially distributed meteorological drought on discharge at the basin outlet, we developed a distance-weighted SPI referenced to the Sremska Mitrovica hydrological station. Using the geographical coordinates of each meteorological and hydrometric station, we first calculated great-circle (Haversine) distances. We then derived Inverse Distance Weighted (IDW) coefficients with a power parameter p = 2, normalised the weights to sum to one, and applied these coefficients to combine the individual station SPI series into a single composite index (SPIIDW). We recognise that weighting stations by their distance to the outlet overemphasises local precipitation near Sremska Mitrovica and does not accurately reflect rainfall distribution across the entire upstream catchment. In this research, the SPIIDW is used solely as a basic, exploratory measure to examine how stations affect discharge, rather than as the main factor in machine learning models, which instead depend on indices from gridded regional climate simulations. Creating basin-wide, area-weighted indices from more extensive observation networks or high-resolution gridded precipitation data will be a key future step. The SPIIDW thus represents the distance-weighted meteorological signal likely to influence discharge at Sremska Mitrovica. We quantified the strength and nature of the drought–discharge relationship by calculating Pearson and Spearman correlation coefficients between discharge and (i) each station’s SPI and (ii) SPIIDW, including lagged correlations of up to three months to capture delayed hydrological responses. To assess the combined influence of all stations and address potential multicollinearity, we fitted a multiple linear regression model using the individual SPI series as predictors of discharge and evaluated model performance through the coefficient of determination (R²) and observed–predicted scatter plots. We further visualised spatial station influence via an IDW-based weight heatmap and archived a dataset containing the original SPI series and SPIIDW. Because only three long-term meteorological stations were available for 1961–2020, the SPIIDW should be considered a first-order estimate of the basin-scale meteorological signal rather than a fully representative areal average, especially for remote headwater areas. In this study, the SPIIDW is used only for exploratory diagnostic purposes. Similar IDW-based spatial weighting as well as SPI and streamflow analyses have been applied in hydrological studies [47,48,49].

To relate meteorological and climatic drought indices to hydrological drought, we developed a data-driven modelling framework using the historical SPI, STI, SPEI, and SDI series. We first performed basic quality assurance and control by checking for missing values, calculating descriptive statistics, visually inspecting the time series, and ensuring strict chronological ordering. All predictors were processed with scikit-learn pipelines using z-score standardisation, while the target variable (SDI) was already standardised. To represent catchment memory and delayed hydrological response, we initially explored contemporaneous and lagged predictors for the SPI, STI, and SPEI with lags from 0 to 5 months and removed rows with lag-induced missing values. Exploratory cross-correlation analysis showed that SPI/STI/SPEI–SDI correlations peak within 0–3 months and become negligible beyond 5 months, with longer lags introducing redundancy and risk of overfitting [50,51]. For the final machine learning models, we therefore restricted the predictor set to lags of 0–2 months for each index and added harmonic seasonal terms (sine and cosine of calendar month) to allow a seasonally varying index–SDI relationship while avoiding information leakage, since these terms are deterministic functions of time. We quantified linear relationships between all lagged predictors and SDI using a correlation matrix, identified for each index family the lag with the highest absolute correlation with SDI, and visualised this dependence with a correlation heatmap. We then constructed a feature matrix X, comprising all lagged SPI, STI, and SPEI variables, and a target vector y for SDI. We split the series chronologically into training (approximately 80%) and test (approximately 20%) subsets to account for temporal dependence. Because the problem is formulated as continuous regression rather than classification, we did not perform any class-balancing or resampling; all models were trained on the full observed distribution of SDI values, including both wet and dry states.

To avoid imposing a priori assumptions on the functional form of the SDI response, we evaluated a broad ensemble of machine learning models implemented within scikit-learn pipelines with standardisation. We considered: (i) ordinary and regularised linear models (LinearRegression, Ridge, and Lasso) to provide an interpretable baseline, accommodate multicollinearity, and perform embedded coefficient shrinkage; (ii) a kernel method (support vector regression, SVR) to capture smooth nonlinear responses in relatively small hydrological samples; (iii) tree-based ensemble methods (RandomForestRegressor) to model complex, higher-order interactions between indices while maintaining robustness to outliers and redundant predictors; (iv) a gradient boosting model (XGBRegressor) to exploit stage-wise additive trees, which are well-suited to representing nonlinearities and extremes; and (v) a feedforward multilayer perceptron (MLPRegressor) for flexible nonlinear function approximation. For these five machine learning models (Random Forest, XGBoost, Elastic Net, SVR, and MLP), we tuned the main complexity and regularisation hyperparameters using scikit-learn’s RandomizedSearchCV with five-fold time-series cross-validation on the training period, minimising the cross-validated mean squared error. Time-series cross-validation was implemented using scikit-learn’s TimeSeriesSplit, which preserves the chronological order of the data and uses a forward-chaining scheme: for each of the five folds, the model is trained on an expanding window of the historical record and validated on the immediately subsequent block, ensuring that validation always occurs on data that are later in time than the training data and avoiding any information leakage across folds. The tuned hyperparameters and their candidate values are summarised in Table 2.

In this study, we used a Random Forest (RF) regression framework to model the relationship between meteorological drought indices and the Streamflow Drought Index (SDI). The Random Forest algorithm [33] is an ensemble learning method that combines predictions from many decision trees, each grown on a bootstrap sample of the training data with random feature subsampling at each split. This structure enables RF to flexibly represent nonlinear relationships and higher-order interactions between predictors, while reducing overfitting through averaging, and it has been shown to perform robustly in hydrological forecasting applications [52]. The predictors supplied to the RF model were the contemporaneous and lagged values (lags 0–2 months) of the SPI, STI, and SPEI, together with harmonic seasonal terms derived from the calendar month (month_sin and month_cos), which collectively describe short-term meteorological conditions and their seasonal modulation.

To achieve a parsimonious yet accurate configuration, we optimised key RF hyperparameters using a random search strategy with time-series cross-validation. Specifically, we considered the number of trees in the ensemble (n_estimators ∈ {200, 300, 500}), the maximum depth of each tree (max_depth ∈ {None, 4, 6, 8}), the minimum number of samples required at a leaf node (min_samples_leaf ∈ {1, 2, 4}), and the number of predictors considered at each split (max_features ∈ {‘auto’, ‘sqrt’, 0.5}). Hyperparameter combinations were evaluated using a five-fold TimeSeriesSplit scheme on the training period, with performance scored by the negative mean squared error to approximate minimisation of RMSE. The configuration yielding the lowest cross-validated error was then refitted on the full training dataset (1963–2009) to obtain the final global RF model. For interpretability, we also extracted feature importance rankings from the fitted RF model to assess the relative contribution of each lagged drought index and the seasonal terms to SDI prediction (reported in Section 3). To address systematic differences between simulated and observed SDI, we used a linear bias correction (Eq. 6) on each model’s output. During calibration, the mean and standard deviation of the simulated SDI were adjusted to match observations, and these parameters were then used to post-process hindcasts and future projections. Since the predictors are standardised indices (SPI, STI, and SPEI), which already reduce much of the mean and variance bias in climate forcing, this step primarily corrects residual statistical bias in the index–SDI relationship. For future scenarios, it is assumed that this bias remains roughly constant over time, a limitation we recognise given strong non-stationarity. The linear bias correction equation is as follows:

{\hat{Q}}_{t}^{*} = μ_{Q} + \frac{σ_{Q}}{σ_{\hat{Q}}} ({\hat{Q}}_{t} - μ_{\hat{Q}})

(6)

where μ_Q, σ_Q and

μ_{\hat{Q}}

,

σ_{\hat{Q}}

are the mean and standard deviation of the observed and simulated SDIs, respectively. Model skill was quantified during the independent test period (2009–2020) using MAE, RMSE, the coefficient of determination (R²), the Nash–Sutcliffe efficiency (NSE), and the Kling–Gupta Efficiency (KGE), providing complementary perspectives on error magnitude, variance explained, and hydrological realism. The bias-corrected RF model was then driven with projected SPI, STI, and SPEI time series for 2021–2050 to generate SDI forecasts for scenario analysis.

For the climate change component, we used the Swedish regional climate model, SMHI-RCA4, implemented on a 12.5 × 12.5 km grid, as the primary source of climate projections. Although this resolution is not the highest available, we selected RCA4 because it provides dynamically consistent simulations for all three forcing scenarios (RCP2.6, RCP4.5, and RCP8.5), ensuring methodological coherence in this initial evaluation. Relying on just one RCM means that uncertainty from the climate model structure and internal variability in climate forcing are not accounted for. As a result, the SDI projections shown should be viewed as conditional on the SMHI-RCA4 realization, rather than as a comprehensive ensemble of potential outcomes. Nevertheless, SMHI-RCA4 has been extensively applied and evaluated within the EURO-CORDEX and CORDEX frameworks, demonstrating robust performance for temperature and precipitation over Europe and other regions [20,53,54,55]. From these simulations, we derived future SPI, STI, and SPEI series for each RCP and used them as inputs to the selected bias-corrected model to produce scenario-specific SDI projections at Sremska Mitrovica up to 2050.

3. Results

This section first characterises the historical behaviour of the meteorological and hydrological drought indices, and then examines their statistical relationships. We next compare the skill of alternative modelling approaches, focusing on the optimised Random Forest model. Finally, we present projections of the Streamflow Drought Index under future climate scenarios and discuss associated uncertainties.

Figure 2 shows the temporal evolution of the standardized precipitation (SPI), temperature (STI), precipitation–evapotranspiration (SPEI), and streamflow drought (SDI) indices for 1961–2019. All four series display pronounced month-to-month variability and recurrent negative excursions, indicating frequent meteorological and hydrological drought episodes. Periods with strongly negative SPI and SPEI values generally coincide with negative SDI, suggesting clear propagation of meteorological deficits into streamflow drought. Conversely, positive SDI peaks often follow sustained wet anomalies in SPI and SPEI. STI shows both warm and cold extremes, but its correspondence with the SDI is visually weaker than that of the SPI and SPEI, foreshadowing the correlation and feature importance results presented below.

Table 3 shows that all indices are approximately standardised, with means near zero and standard deviations close to one. Quartiles are slightly negative, indicating a modest tendency toward drier-than-normal conditions. The relatively large positive maxima, particularly for the SPEI and SDI, highlight episodes of pronounced wetness and high flows superimposed on this generally variable regime.

The correlation analysis between the SDI and the lagged hydro-climatic indices shows a clear dominance of the SPI (Figure 3). The SDI has the strongest linear association with the SPI at a one-month lag (SPI_lag1, r ≈ 0.50), followed by the contemporaneous SPI (SPI_lag0, r ≈ 0.44) and the two-month lag (SPI_lag2, r ≈ 0.24). In contrast, correlations with the STI and SPEI at all lags remain weak (|r| ≲ 0.05), indicating that temperature-based and combined indices contribute little to the linear variability of the SDI in this basin. Overall, the results suggest that short-term precipitation anomalies with a lag of 0–1 month are the primary linear drivers of streamflow drought.

To examine how the Random Forest exploits the different predictors, we computed feature importance rankings for all lagged SPI, STI, and SPEI variables, as well as for the seasonal harmonics. The resulting feature importance rankings (Figure 4) confirm that the short-lag SPI, particularly the SPI at 1-month and contemporaneous lag, dominates the RF predictions, which is consistent with the correlation analysis. In contrast, the STI and SPEI lags contribute only marginal additional importance, and the seasonal terms play a secondary role by modulating the strength of the SPI–SDI linkage across the year.

Table 4 shows that before bias correction, all models achieve comparable MAE and RMSE, with Random Forest (RF) slightly outperforming the others and explaining the largest share of variance (R² ≈ 0.47). The key distinction emerges after bias correction is applied. RF attains the lowest errors (MAE = 0.62, RMSE = 0.83) and the highest skill (R²/NSE ≈ 0.49). Its KGE increases from 0.25 to 0.65, indicating a marked improvement in correlation, bias, and variability. XGBoost ranks second, with slightly higher errors and lower KGE (0.48). In contrast, Elastic Net, SVR, and MLP exhibit notably worse performance after bias correction, with reduced R² values and strongly negative KGE values, indicating poor reproduction of the observed distribution. The decline in KGE for Elastic Net, SVR, and MLP after bias correction reflects the Kling–Gupta Efficiency metric’s design and the error patterns of these models. Linear scaling aligns the mean and standard deviation of the simulated and observed SDI during training, thereby enhancing the variability and bias components of KGE for Random Forest, where these errors dominate. Conversely, Elastic Net, SVR, and MLP tend to have smaller mean/variance biases but more significant structural or timing errors. Applying a linear transformation trained on the training data to the independent test period can reduce the correlation component of KGE without substantial improvements in bias or variability, resulting in a lower overall KGE. On this basis, RF is clearly the most suitable model for SDI forecasting in this study, providing the best combination of accuracy, explained variance, and realistic hydrological behaviour.

The results presented in Figure 5 confirm the quantitative ranking reported in Table 3. For all models, points cluster around the 1:1 line, indicating reasonable skill in reproducing SDI variability. However, the Random Forest (RF) plot shows the tightest cluster and the least systematic deviation from the line, especially in the range −1 ≤ SDI ≤ 2, which is consistent with its lowest MAE/RMSE and highest R²/NSE and KGE after bias correction. XGBoost performs slightly worse, with a broader scatter, while Elastic Net, SVR, and MLP exhibit greater dispersion and bias at higher magnitudes. Thus, the visual diagnostics corroborate RF as the best-performing SDI forecasting model.

Figure 6 compares observed and Random Forest-predicted SDI for the independent test period (2009–2020). The model reproduces the timing and signals of most wet and dry episodes well: positive and negative bars closely align each year, indicating that Random Forest correctly captures the onset and duration of drought and recovery phases. Amplitudes of moderate events are also reasonably matched, while some of the highest positive SDI peaks and deepest negative values are slightly underestimated, reflecting the remaining unexplained variance. Overall, the visual agreement is consistent with the quantitative metrics (NSE ≈ 0.49, KGE ≈ 0.65), confirming good predictive skill.

Based on the comparative performance evaluation of all candidate algorithms, the Random Forest (RF) model emerged as the most suitable approach for Standardized Drought Index (SDI) forecasting and was therefore selected for all subsequent analyses. The final RF configuration, trained on the historical SPI, STI, and SPEI predictors, was combined with the previously estimated bias-correction parameters and then applied to the projected SPI, STI, and SPEI time series derived from climate scenarios RCP 2.6, RCP 4.5, and RCP 8.5. This procedure yielded bias-adjusted monthly SDI projections for 2021–2050, ensuring internal consistency between the training and projection phases while reducing systematic errors inherited from the driving climate simulations. The resulting SDI time series provides a robust basis for characterizing the timing, persistence, and severity of future drought conditions under alternative emissions pathways, and for supporting climate-informed water resources planning and risk management.

The SDI projections for 2021–2050 show a clear intensification of hydrological drought with increasing radiative forcing from RCP 2.6 to RCP 8.5 (Figure 7). Under the low-emission RCP 2.6 scenario, negative SDI values (SDI < 0) occur frequently, but most events remain within the mild to moderate drought range (−1 < SDI < 0), with fewer months crossing the SDI ≤ −1 threshold associated with moderate to severe hydrological drought. These deficits are typically interspersed with short recovery phases, indicating that under strong mitigation and a global warming limit of about 1.5–2 °C, the basin retains some resilience and the drought regime, while more variable than in a stationary climate, remains comparatively manageable. Under the intermediate stabilization pathway RCP 4.5, drought characteristics change markedly: negative SDI values become more persistent, and excursions below −1 and −1.5 are more frequent and tend to cluster in multi-year sequences, suggesting longer recovery times and increasing cumulative flow deficits. The high-emission RCP 8.5 pathway, representing unmitigated warming of around 5 °C by 2100, exhibits the most pronounced changes. Here, extended periods dominated by strongly negative SDI values appear, with repeated episodes below −1.5 indicating severe to extreme hydrological drought. Simultaneously, the frequency of high positive SDI peaks increases, suggesting a more volatile flow regime with amplified wet and dry extremes. These results demonstrate a strong link between rising greenhouse gas concentrations and drought hazard: as scenarios shift from RCP 2.6 to 4.5 and 8.5, droughts in the basin become more frequent, severe, and persistent, with significant implications for water resources planning, storage operation, and drought risk management under future climate conditions.

Figure 8 presents the distribution of monthly SDI values forecast by the bias-corrected Random Forest model for 2021–2050 under the three RCP scenarios. Under RCP2.6, the density is centred slightly below zero, with a long right tail, indicating that mildly dry conditions dominate, but occasional wet anomalies are projected. RCP4.5 exhibits a broader spread and a more negative mode, suggesting a shift toward more frequent and intense dry states, while still allowing for intermittent wet periods. In contrast, RCP8.5 shows a marked shift of the distribution toward positive SDI values, with the highest densities between −1 and 1, which is consistent with generally drier future conditions under this high-emissions pathway. The tails in all cases indicate non-negligible probabilities of both severe drought and wet extremes. Overall, the figure highlights substantial scenario-dependent differences in the balance between dry and wet states, underscoring the importance of emissions trajectories for future hydrological drought risk.

Figure 9 presents hydrostripes of the Streamflow Drought Index (SDI) for 2021–2050 under RCP2.6, RCP4.5, and RCP8.5, with anomalies expressed relative to the WMO reference period, 1961–1990. The visualization reveals clear scenario-dependent differences in both the frequency and magnitude of streamflow anomalies. Under RCP2.6, the SDI values fluctuate around the historical mean, with alternating wet and dry years and no persistent dominance of negative anomalies, indicating a relatively stable low-flow regime despite occasional drought years. The RCP4.5 scenario shows a similar pattern but with a slightly higher frequency of positive SDI anomalies and fewer pronounced negative departures, suggesting comparatively wetter average conditions and reduced drought severity over the projection period. In contrast, RCP8.5 exhibits markedly different behaviour, with stronger amplitudes and a higher frequency of negative SDI anomalies, especially from the late 2020s onward. This pattern indicates increased occurrence and intensity of hydrological droughts, along with greater interannual variability. The predominance of negative SDI values under RCP8.5 aligns with stronger radiative forcing, which increases atmospheric evaporative demand and alters precipitation regimes, intensifying low-flow conditions. Overall, the hydrostripes indicate that among the scenarios assessed, RCP8.5 results in the driest future conditions, while RCP4.5 moderates drought risk relative to both RCP2.6 and RCP8.5. These findings highlight the sensitivity of future streamflow drought characteristics to emission pathways and emphasize the importance of incorporating scenario-based SDI projections into long-term water resources planning and climate adaptation strategies.

4. Discussion

The strong dependence of the Streamflow Drought Index (SDI) on short-lag SPI (0–1 months) reflects rapid meteorological-to-hydrological transmission in temperate, precipitation-driven catchments and the limited influence of longer-term thermal indices. Short lags correspond to rapid runoff response and limited catchment memory in many large European basins, where soil moisture and fast flow pathways dominate early streamflow deficits [56,57,58]. Several studies report that runoff often responds within weeks to precipitation anomalies and that only a small fraction of precipitation droughts develop into prolonged hydrological droughts, while temperature-driven evaporative demand influences drought persistence rather than immediate onset [59,60,61,62]. Catchment attributes such as baseflow index and groundwater release determine memory effects and modulate short- versus long-lag relationships, explaining why the SPI at 0–1 months outperforms longer aggregation times and why indices that include potential evapotranspiration (SPEI, STI) play a secondary role in initial SDI formation in this region [19,59,63]. The RF feature importance analysis (Figure 4) reinforces these results: most predictive power comes from the SPI at lags 0–1 months, whereas the STI and SPEI contribute little additional skill. Including STI and SPEI in the predictor set therefore serves mainly to test for potential nonlinear or seasonal effects rather than to drive the forecasts. Given their limited importance, the SDI projections should be interpreted primarily as being driven by short-term precipitation anomalies (SPI), with temperature and combined indices playing at most a minor supplementary role.

Random Forest (RF) outperformed XGBoost, Elastic Net, SVR, and MLP for SDI forecasting because ensemble trees naturally capture complex nonlinearities and higher-order interactions without extensive feature engineering. RF’s bootstrap aggregation reduces variance, implicitly manages correlated predictors, and is robust to multicollinearity, which improves prediction when many lagged climate indices compete for explanatory power [33,34]. Comparable applications across European and regional basins document RF’s consistent skill advantages for streamflow and drought metrics, often exceeding or matching gradient-boosted and neural methods, especially after careful bias correction and ensemble approaches [64,65]. The achieved performance metrics align with contemporary machine learning drought studies, which report moderate explanatory power for monthly hydrological droughts in large basins, underscoring the value of tree ensembles for operational drought forecasting [30,32,62,66].

The marked improvement in Kling–Gupta Efficiency after bias correction highlights the importance of removing systematic errors from climate drivers before inputting them into statistical models. Bias correction addresses distributional mismatches that would otherwise propagate through climate impact chains, reducing conditional biases that machine learning models can amplify and improving the representation of tails and seasonal cycles that influence drought onset and recovery [25,67]. Recent impact-chain studies emphasise that bias correction or post-processing of GCM/RCM outputs materially improves hydrological low-flow representation and machine learning forecast reliability, making post-processing an essential step for credible drought projections and operational forecasting frameworks [25,56,57,68].

The scenario results showing intensified, more persistent, and more volatile SDI deficits from RCP2.6 to RCP8.5 are consistent with ensemble projections that amplify low-flow signals under higher warming. Higher emissions increase evaporative demand and shift precipitation regimes, promoting multi-year drought clustering, longer recovery times, and larger deficit volumes. This non-stationarity alters drought frequency, duration, and severity distributions, as documented across Europe and Central to Southeast Europe [23,69,70]. Several multi-model studies report a southwest–northeast contrast, with Mediterranean and southern basins experiencing amplified low flows and increased probability of multi-year events under RCP8.5, reinforcing scenario-dependent risk escalation and earlier onset of unprecedented drought conditions in vulnerable regions [23,24,27,28]. The observed multi-year clustering reflects compound feedbacks between soil moisture depletion, reduced baseflow recharge, and persistent atmospheric blocking patterns, which become more frequent under higher radiative forcing [29,41].

For transboundary basins such as the Sava, the results indicate a need to adapt reservoir operating rules, revise drought contingency plans, and strengthen international coordination to manage clustered multi-year deficits and tail risks [3]. However, key limitations persist: projections are driven by a single RCM (SMHI-RCA4), so climate forcing uncertainty and model spread are not explicitly quantified; the results therefore reflect one plausible realisation rather than a full ensemble envelope. In addition, projections depend on bias-correction choices, extreme SDI tails remain uncertain, and data-driven models cannot fully resolve process changes or unprecedented states [25,56]. Additionally, our linear-scaling bias correction for SDI predictions presumes that the bias in the learned index–SDI relationship remains constant over time. If significant future changes occur in the hydrological regime, this assumption may not hold, meaning the bias-corrected SDI projections should be viewed as conditioned on the current calibration of the mapping, rather than as an entirely accurate correction for all future distribution changes.

Furthermore, we did not include classical univariate time-series models like ARIMA or STL-based methods as primary benchmarks. Although these approaches are well-established and often competitive for short-term forecasts, they do not naturally integrate the exogenous, scenario-aligned drought indices (SPI, STI, and SPEI) that underpin our impact-chain projections. Instead, they tend to extrapolate SDI solely from its own past data. In our context, linear and regularized regression models with lagged indices, such as Elastic Net, already serve as simple, interpretable baselines that capture AR-like structures while remaining compatible with climate-scenario forcing. Nevertheless, incorporating explicit ARIMA-based benchmarks in future work could enhance the comparison and help quantify the added value of index-based machine learning models over purely autoregressive approaches.

Integrating process-based hydrological models, increasing ensemble sizes, characterising tail uncertainty through stochastic approaches, and coupling machine learning with physical constraints are priority research directions to reduce uncertainty and inform adaptive water allocation and ecosystem resilience strategies [18,36,46,57,71,72].

5. Conclusions

This study developed and evaluated a data-driven framework for forecasting the Streamflow Drought Index (SDI) using meteorological drought indices. Analysis of the 1961–2020 record showed that all indices are approximately standardised, with frequent negative excursions indicating recurrent drought episodes. Correlation and feature-based diagnostics consistently identified short-term precipitation anomalies as the dominant driver of hydrological drought: the SPI at lags of 0–1 months exhibited the strongest relationships with the SDI, whereas temperature-based and combined indices (STI, SPEI) contributed little additional explanatory power.

A suite of machine learning models was compared, including Random Forest, XGBoost, Elastic Net, support vector regression, and a multilayer perceptron. After chronological train–test splitting and linear-scaling bias correction, Random Forest clearly outperformed the alternatives, achieving the lowest errors (MAE ≈ 0.62, RMSE ≈ 0.83) and the highest skill (NSE ≈ 0.49, KGE ≈ 0.65) on the independent test period. The Random Forest model reproduced the timing and signals of most wet and dry events and captured the magnitude of moderate droughts reasonably well, although the most extreme peaks were still underestimated. These results highlight the suitability of ensemble tree methods for hydrological drought prediction when only index-based predictors are available.

The optimised and bias-corrected RF model was subsequently driven with SPI, STI, and SPEI projections from three RCP scenarios to generate monthly SDI forecasts for 2021–2050. The projections show a clear scenario dependence: under RCP2.6, mild to moderate droughts remain frequent but short-lived; under RCP4.5, negative SDI values become more persistent and cluster in multi-year episodes; and under RCP8.5, severe and prolonged hydrological droughts dominate, with increased interannual variability. These findings highlight the sensitivity of future streamflow drought regimes to emission pathways and indicate heightened drought risk under high-warming scenarios.

Despite these advances, uncertainties persist due to the reliance on SPI, STI, and SPEI predictors, climate model and bias-correction choices, and limited information on unprecedented extremes. Future work should combine process-based hydrological models with machine learning, expand climate ensembles, and explicitly quantify tail uncertainty to better support adaptive reservoir operation and transboundary drought management. Taken together, these advances would further strengthen the use of SDI projections as a tool for sustainable water allocation, drought risk reduction, and implementation of climate-resilient management strategies in the basin.

Author Contributions

Conceptualization, I.L.; methodology, I.L. and M.J.; software, I.L. and M.J.; validation, I.L. and Z.B.; formal analysis, I.L. and A.M.P.; investigation, I.L., A.M.P., and S.G.; resources, I.L., M.J., A.M.P., and S.G.; data curation, I.L.; writing—original draft preparation, I.L.; writing—review and editing, A.M.P., Z.B., and S.G.; visualization, I.L. and M.J.; supervision, I.L.; project administration, I.L.; funding acquisition, I.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the “Streamflow Drought Through Time” project funded by the EU NextGenerationEU through the Recovery and Resilience Plan of the Slovak Republic within the framework of project no. 09I03-03-V04-00186.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The regional climate model outputs from SMHI-RCA4 used in this study are publicly available through the EURO-CORDEX archive and the Earth System Grid Federation (ESGF) data portals. Station-based precipitation, temperature, and discharge data were obtained from the national hydrometeorological services of Serbia, Croatia, and Bosnia and Herzegovina under institutional data-sharing agreements and cannot be redistributed by the authors. However, the derived monthly drought indices (SPI, STI, SPEI, and SDI) are available upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Gu, L.; Yin, J.; Slater, L.J.; Chen, J.; Do, H.X.; Wang, H.M.; Chen, L.; Jiang, Z.; Zhao, T. Intensification of Global Hydrological Droughts Under Anthropogenic Climate Warming. Water Resour. Res. 2023, 59, e2022WR032997. [Google Scholar] [CrossRef]
Van Loon, A.F. Hydrological Drought Explained. Wiley Interdiscip. Rev. Water 2015, 2, 359–392. [Google Scholar] [CrossRef]
Stahl, K.; Kohn, I.; Blauhut, V.; Urquijo, J.; De Stefano, L.; Acácio, V.; Dias, S.; Stagge, J.H.; Tallaksen, L.M.; Kampragou, E.; et al. Impacts of European Drought Events: Insights from an International Database of Text-Based Reports. Nat. Hazards Earth Syst. Sci. 2016, 16, 801–819. [Google Scholar] [CrossRef]
Stoyanova, R.; Nikolova, N. Meteorological Drought in Southwest Bulgaria During the Period 1961–2020. J. Geogr. Inst. Jovan Cvijic SASA 2022, 72, 243–255. [Google Scholar] [CrossRef]
Liu, R.; Tian, J.; Yin, J.; Kang, S.; Liu, P.; Ming, B.; Slater, L. Intergenerational Inequity from Hydrological Drought in a Warming World. J. Environ. Manag. 2024, 388, 125988. [Google Scholar] [CrossRef]
Fang, W.; Ren, K.; Liu, T.; Shang, J.; Jia, S.; Jiang, X.; Zhang, J. An Evaluation of Random Forest Based Input Variable Selection Methods for One Month Ahead Streamflow Forecasting. Sci. Rep. 2024, 14, 29766. [Google Scholar] [CrossRef] [PubMed]
Wanders, N.; Van Lanen, H.A.J. Future Discharge Drought across Climate Regions around the World Modelled with a Synthetic Hydrological Modelling Approach Forced by Three General Circulation Models. Nat. Hazards Earth Syst. Sci. 2015, 15, 487–504. [Google Scholar] [CrossRef]
Newcomer, M.E.; Underwood, J.; Murphy, S.F.; Ulrich, C.; Schram, T.; Maples, S.R.; Peña, J.; Siirila-Woodburn, E.R.; Trotta, M.; Jasperse, J.; et al. Prolonged Drought in a Northern California Coastal Region Suppresses Wildfire Impacts on Hydrology. Water Resour. Res. 2023, 59, e2022WR034206. [Google Scholar] [CrossRef]
Mishra, A.K.; Singh, V.P. A Review of Drought Concepts. J. Hydrol. 2010, 391, 202–216. [Google Scholar] [CrossRef]
Mckee, T.B.; Doesken, N.J.; Kleist, J. The relationship of drought frequency and duration to time scales. In Proceedings of the 8th Conference on Applied Climatology, Anahein, CA, USA, 17–22 January 1993. [Google Scholar]
Vicente-Serrano, S.M.; Beguería, S.; López-Moreno, J.I. A Multiscalar Drought Index Sensitive to Global Warming: The Standardized Precipitation Evapotranspiration Index. J. Clim. 2010, 23, 1696–1718. [Google Scholar] [CrossRef]
Shukla, S.; Wood, A.W. Use of a Standardized Runoff Index for Characterizing Hydrologic Drought. Geophys. Res. Lett. 2008, 35, L02405. [Google Scholar] [CrossRef]
Nalbantis, I.; Tsakiris, G. Assessment of Hydrological Drought Revisited. Water Resour. Manag. 2009, 23, 881–897. [Google Scholar] [CrossRef]
Tigkas, D.; Vangelis, H.; Tsakiris, G. DrinC: A Software for Drought Analysis Based on Drought Indices. Earth Sci. Inform. 2015, 8, 697–709. [Google Scholar] [CrossRef]
Afzal, M.; Ragab, R. Assessment of the Potential Impacts of Climate Change on the Hydrology at Catchment Scale: Modelling Approach Including Prediction of Future Drought Events Using Drought Indices. Appl. Water Sci. 2020, 10, 215. [Google Scholar] [CrossRef]
Haslinger, K.; Koffler, D.; Schöner, W.; Laaha, G. Exploring the Link between Meteorological Drought and Streamflow: Effects of Climate-Catchment Interaction. Water Resour. Res. 2014, 50, 2468–2487. [Google Scholar] [CrossRef]
Noorisameleh, Z.; Khaledi, S.; Shakiba, A.; Firouzabadi, P.Z.; Gough, W.A.; Qader Mirza, M.M. Comparative Evaluation of Impacts of Climate Change and Droughts on River Flow Vulnerability in Iran. Water Sci. Eng. 2020, 13, 265–274. [Google Scholar] [CrossRef]
Prudhomme, C.; Giuntoli, I.; Robinson, E.L.; Clark, D.B.; Arnell, N.W.; Dankers, R.; Fekete, B.M.; Franssen, W.; Gerten, D.; Gosling, S.N.; et al. Hydrological Droughts in the 21st Century, Hotspots and Uncertainties from a Global Multimodel Ensemble Experiment. Proc. Natl. Acad. Sci. USA 2014, 111, 3262–3267. [Google Scholar] [CrossRef]
Stagge, J.H.; Tallaksen, L.M.; Gudmundsson, L.; Van Loon, A.F.; Stahl, K. Candidate Distributions for Climatological Drought Indices (SPI and SPEI). Int. J. Climatol. 2015, 35, 4027–4040. [Google Scholar] [CrossRef]
Jacob, D.; Petersen, J.; Eggert, B.; Alias, A.; Christensen, O.B.; Bouwer, L.M.; Braun, A.; Colette, A.; Déqué, M.; Georgievski, G.; et al. EURO-CORDEX: New High-Resolution Climate Change Projections for European Impact Research. Reg. Environ. Change 2014, 14, 563–578. [Google Scholar] [CrossRef]
Spinoni, J.; Vogt, J.V.; Naumann, G.; Barbosa, P.; Dosio, A. Will Drought Events Become More Frequent and Severe in Europe? Int. J. Climatol. 2018, 38, 1718–1736. [Google Scholar] [CrossRef]
Hanel, M.; Rakovec, O.; Markonis, Y.; Máca, P.; Samaniego, L.; Kyselý, J.; Kumar, R. Revisiting the Recent European Droughts from a Long-Term Perspective. Sci. Rep. 2018, 8, 9499. [Google Scholar] [CrossRef]
Cammalleri, C.; Naumann, G.; Mentaschi, L.; Bisselink, B.; Gelati, E.; De Roo, A.; Feyen, L. Diverging Hydrological Drought Traits over Europe with Global Warming. Hydrol. Earth Syst. Sci. 2020, 24, 5919–5935. [Google Scholar] [CrossRef]
Forzieri, G.; Feyen, L.; Rojas, R.; Flörke, M.; Wimmer, F.; Bianchi, A. Ensemble Projections of Future Streamflow Droughts in Europe. Hydrol. Earth Syst. Sci. 2014, 18, 85–108. [Google Scholar] [CrossRef]
Marx, A.; Kumar, R.; Thober, S.; Rakovec, O.; Wanders, N.; Zink, M.; Wood, E.F.; Pan, M.; Sheffield, J.; Samaniego, L. Climate Change Alters Low Flows in Europe under Global Warming of 1.5, 2, and 3 °C. Hydrol. Earth Syst. Sci. 2018, 22, 1017–1032. [Google Scholar] [CrossRef]
Giuntoli, I.; Vidal, J.P.; Prudhomme, C.; Hannah, D.M. Future Hydrological Extremes: The Uncertainty from Multiple Global Climate and Global Hydrological Models. Earth Syst. Dyn. 2015, 6, 267–285. [Google Scholar] [CrossRef]
Samaniego, L.; Thober, S.; Kumar, R.; Wanders, N.; Rakovec, O.; Pan, M.; Zink, M.; Sheffield, J.; Wood, E.F.; Marx, A. Anthropogenic Warming Exacerbates European Soil Moisture Droughts. Nat. Clim. Chang. 2018, 8, 421–426. [Google Scholar] [CrossRef]
Thober, S.; Kumar, R.; Wanders, N.; Marx, A.; Pan, M.; Rakovec, O.; Samaniego, L.; Sheffield, J.; Wood, E.F.; Zink, M. Multi-Model Ensemble Projections of European River Floods and High Flows at 1.5, 2, and 3 Degrees Global Warming. Environ. Res. Lett. 2018, 13, 014003. [Google Scholar] [CrossRef]
Laaha, G.; Gauster, T.; Tallaksen, L.M.; Vidal, J.P.; Stahl, K.; Prudhomme, C.; Heudorfer, B.; Vlnas, R.; Ionita, M.; Van Lanen, H.A.J.; et al. The European 2015 Drought from a Hydrological Perspective. Hydrol. Earth Syst. Sci. 2017, 21, 3001–3024. [Google Scholar] [CrossRef]
Nearing, G.S.; Kratzert, F.; Sampson, A.K.; Pelissier, C.S.; Klotz, D.; Frame, J.M.; Prieto, C.; Gupta, H.V. What Role Does Hydrological Science Play in the Age of Machine Learning? Water Resour. Res. 2021, 57, e2020WR028091. [Google Scholar] [CrossRef]
Rasouli, K.; Hsieh, W.W.; Cannon, A.J. Daily Streamflow Forecasting by Machine Learning Methods with Weather and Climate Inputs. J. Hydrol. 2012, 414–415, 284–293. [Google Scholar] [CrossRef]
Feng, D.; Fang, K.; Shen, C. Enhancing Streamflow Forecast and Extracting Insights Using Long-Short Term Memory Networks with Data Integration at Continental Scales. Water Resour. Res. 2020, 56, e2019WR026793. [Google Scholar] [CrossRef]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Shortridge, J.E.; Guikema, S.D.; Zaitchik, B.F. Machine Learning Methods for Empirical Streamflow Simulation: A Comparison of Model Accuracy, Interpretability, and Uncertainty in Seasonal Watersheds. Hydrol. Earth Syst. Sci. 2016, 20, 2611–2628. [Google Scholar] [CrossRef]
Mathew, S.; Mohan, S. Prediction of Groundwater Storage under Climate Change Scenarios Using Meteorological and Hydrological Drought. Int. J. Sci. Res. Eng. Technol. 2025, 5, 116–124. [Google Scholar] [CrossRef]
Shen, C. A Transdisciplinary Review of Deep Learning Research and Its Relevance for Water Resources Scientists. Water Resour. Res. 2018, 54, 8558–8593. [Google Scholar] [CrossRef]
Wang, H.; Li, Y.; Huang, G.; Zhang, Q.; Ma, Y.; Li, Y. Development of a Random-Forest-Copula-Factorial Analysis (RFCFA) Method for Predicting Propagation between Meteorological and Hydrological Drought. Natl. Sci. Open 2024, 3, 20230022. [Google Scholar] [CrossRef]
Kisi, O.; Cimen, M. A Wavelet-Support Vector Machine Conjunction Model for Monthly Streamflow Forecasting. J. Hydrol. 2011, 399, 132–140. [Google Scholar] [CrossRef]
Rahmati, O.; Falah, F.; Dayal, K.S.; Deo, R.C.; Mohammadi, F.; Biggs, T.; Moghaddam, D.D.; Naghibi, S.A.; Bui, D.T. Machine Learning Approaches for Spatial Modeling of Agricultural Droughts in the South-East Region of Queensland Australia. Sci. Total Environ. 2020, 699, 134230. [Google Scholar] [CrossRef]
Karpatne, A.; Atluri, G.; Faghmous, J.H.; Steinbach, M.; Banerjee, A.; Ganguly, A.; Shekhar, S.; Samatova, N.; Kumar, V. Theory-Guided Data Science: A New Paradigm for Scientific Discovery from Data. IEEE Trans. Knowl. Data Eng. 2017, 29, 2318–2331. [Google Scholar] [CrossRef]
Stagge, J.H.; Kingston, D.G.; Tallaksen, L.M.; Hannah, D.M. Observed Drought Indices Show Increasing Divergence across Europe. Sci. Rep. 2017, 7, 14045. [Google Scholar] [CrossRef]
Sun, C.; Zhou, X. Characterizing Hydrological Drought and Water Scarcity Changes in the Future: A Case Study in the Jinghe River Basin of China. Water 2020, 12, 1605. [Google Scholar] [CrossRef]
ISRBC. 2nd Sava River Basin Analysis Report; International Sava River Basin Commission: Zagreb, Croatia, 2016. [Google Scholar]
Kim, J.H.; Sung, J.H.; Shahid, S.; Chung, E.S. Future Hydrological Drought Analysis Considering Agricultural Water Withdrawal Under SSP Scenarios. Water Resour. Manag. 2022, 36, 2913–2930. [Google Scholar] [CrossRef]
Pozzi, W.; Sheffield, J.; Stefanski, R.; Cripe, D.; Pulwarty, R.; Vogt, J.V.; Heim, R.R.; Brewer, M.J.; Svoboda, M.; Westerhoff, R.; et al. Toward Global Drought Early Warning Capability: Expanding International Cooperation for the Development of a Framework for Monitoring and Forecasting. Bull. Am. Meteorol. Soc. 2013, 94, 776–785. [Google Scholar] [CrossRef]
Hao, Z.; Singh, V.P.; Xia, Y. Seasonal Drought Prediction: Advances, Challenges, and Future Prospects. Rev. Geophys. 2018, 56, 108–141. [Google Scholar] [CrossRef]
Leščešen, I.; Tanhapour, M.; Pekárová, P.; Miklánek, P.; Bajtek, Z. Long Short-Term Memory (LSTM) Networks for Accurate River Flow Forecasting: A Case Study on the Morava River Basin (Serbia). Water 2025, 17, 907. [Google Scholar] [CrossRef]
Lema, F.; Mendoza, P.A.; Vásquez, N.A.; Mizukami, N.; Zambrano-Bigiarini, M.; Vargas, X. Technical Note: What Does the Standardized Streamflow Index Actually Reflect? Insights and Implications for Hydrological Drought Analysis. Hydrol. Earth Syst. Sci. 2025, 29, 1981–2002. [Google Scholar] [CrossRef]
Kamruzzaman, M.; Almazroui, M.; Salam, M.A.; Mondol, M.A.H.; Rahman, M.M.; Deb, L.; Kundu, P.K.; Zaman, M.A.U.; Islam, A.R.M.T. Spatiotemporal Drought Analysis in Bangladesh Using the Standardized Precipitation Index (SPI) and Standardized Precipitation Evapotranspiration Index (SPEI). Sci. Rep. 2022, 12, 20694. [Google Scholar] [CrossRef]
Li, Z.; Huang, S.; Zhou, S.; Leng, G.; Liu, D.; Huang, Q.; Wang, H.; Han, Z.; Liang, H. Clarifying the Propagation Dynamics from Meteorological to Hydrological Drought Induced by Climate Change and Direct Human Activities. J. Hydrometeorol. 2021, 22, 2359–2378. [Google Scholar] [CrossRef]
Stagge, J.H.; Tallaksen, L.M.; Xu, C.-Y.; Van Lanen, H.A.J. Standardized Precipitation-Evapotranspiration Index (SPEI): Sensitivity to Potential Evapotranspiration Model and Parameters. In Hydrology in a Changing World: Environmental and Human Dimensions; IAHS-AISH Proceedings and Reports; Copernicus Publications: Göttingen, Germany, 2014; Volume 363. [Google Scholar]
Pham, L.T.; Luo, L.; Finley, A. Evaluation of Random Forests for Short-Term Daily Streamflow Forecasting in Rainfall- And Snowmelt-Driven Watersheds. Hydrol. Earth Syst. Sci. 2021, 25, 2997–3015. [Google Scholar] [CrossRef]
Kotlarski, S.; Keuler, K.; Christensen, O.B.; Colette, A.; Déqué, M.; Gobiet, A.; Goergen, K.; Jacob, D.; Lüthi, D.; Van Meijgaard, E.; et al. Regional Climate Modeling on European Scales: A Joint Standard Evaluation of the EURO-CORDEX RCM Ensemble. Geosci. Model Dev. 2014, 7, 1297–1333. [Google Scholar] [CrossRef]
Nikulin, G.; Jones, C.; Giorgi, F.; Asrar, G.; Büchner, M.; Cerezo-Mota, R.; Christensen, O.B.; Déqué, M.; Fernandez, J.; Hänsler, A.; et al. Precipitation Climatology in an Ensemble of CORDEX-Africa Regional Climate Simulations. J. Clim. 2012, 25, 6057–6078. [Google Scholar] [CrossRef]
Kjellström, E.; Nikulin, G.; Strandberg, G.; Bøssing Christensen, O.; Jacob, D.; Keuler, K.; Lenderink, G.; Van Meijgaard, E.; Schär, C.; Somot, S.; et al. European Climate Change at Global Mean Temperature Increases of 1.5 and 2 °C above Pre-Industrial Conditions as Simulated by the EURO-CORDEX Regional Climate Models. Earth Syst. Dyn. 2018, 9, 459–478. [Google Scholar] [CrossRef]
Papadimitriou, L.V.; Koutroulis, A.G.; Grillakis, M.G.; Tsanis, I.K. High-End Climate Change Impact on European Runoff and Low Flows—Exploring the Effects of Forcing Biases. Hydrol. Earth Syst. Sci. 2016, 20, 1785–1808. [Google Scholar] [CrossRef]
Teutschbein, C.; Grabs, T.; Giese, M.; Todorović, A.; Barthel, R. Drought Propagation in High-Latitude Catchments: Insights from a 60-Year Analysis Using Standardized Indices. Nat. Hazards Earth Syst. Sci. 2025, 25, 2541–2564. [Google Scholar] [CrossRef]
Peña-Guerrero, M.D.; Wang, Z.; Ebeling, P.; Siebert, C.; Merz, R.; Tarasova, L. Pathways of Drought Propagation in Near-Natural Catchments across Germany. In Proceedings of the EGU General Assembly 2025, Vienna, Austria, 27 April–2 May 2025. [Google Scholar]
Hao, R.; Yan, H.; Chiang, Y.M. Forecasting the Propagation from Meteorological to Hydrological and Agricultural Drought in the Huaihe River Basin with Machine Learning Methods. Remote Sens. 2023, 15, 5524. [Google Scholar] [CrossRef]
Orth, R.; Destouni, G. Drought Reduces Blue-Water Fluxes More Strongly than Green-Water Fluxes in Europe. Nat. Commun. 2018, 9, 3602. [Google Scholar] [CrossRef]
Brunner, M.I.; Chartier-Rescan, C. Drought Spatial Extent and Dependence Increase During Drought Propagation From the Atmosphere to the Hydrosphere. Geophys. Res. Lett. 2024, 51, e2023GL107918. [Google Scholar] [CrossRef]
Sutanto, S.J.; Van Lanen, H.A.J. Catchment Memory Explains Hydrological Drought Forecast Performance. Sci. Rep. 2022, 12, 2689. [Google Scholar] [CrossRef]
Schneider, R.; Karlsson Seidenfaden, I.; Hansen, M.F.T.; Hansen, M.; Koch, J.; Andreasen, M.; Nilsson, B.; Stisen, S. Validating Drought Propagation through the Entire Hydrological Cycle Simulated with an Integrated National-Scale Hydrological Model. In Proceedings of the European Geosciences Union General Assembly 2025 (EGU25), Vienna, Austria, 27 April–2 May 2025. [Google Scholar]
Niazkar, M.; Cenobio-Cruz, O.; Mozzi, G.; Di Baldassarre, G.; Pal, J. Hydrological Modelling vs. Machine Learning for Water Availability: Case Study from the Reno Basin (Italy). In Proceedings of the European Geosciences Union General Assembly 2025 (EGU25), Vienna, Austria, 27 April–2 May 2025. [Google Scholar]
Kratzert, F.; Klotz, D.; Herrnegger, M.; Sampson, A.K.; Hochreiter, S.; Nearing, G.S. Toward Improved Predictions in Ungauged Basins: Exploiting the Power of Machine Learning. Water Resour. Res. 2019, 55, 11344–11354. [Google Scholar] [CrossRef]
Deb, D.; Arunachalam, V.; Raju, K.S. Daily Reservoir Inflow Prediction Using Stacking Ensemble of Machine Learning Algorithms. J. Hydroinform. 2024, 26, 972–997. [Google Scholar] [CrossRef]
Teutschbein, C.; Seibert, J. Bias Correction of Regional Climate Model Simulations for Hydrological Climate-Change Impact Studies: Review and Evaluation of Different Methods. J. Hydrol. 2012, 456–457, 12–29. [Google Scholar] [CrossRef]
Cannon, A.J.; Sobie, S.R.; Murdock, T.Q. Bias Correction of GCM Precipitation by Quantile Mapping: How Well Do Methods Preserve Changes in Quantiles and Extremes? J. Clim. 2015, 28, 6938–6959. [Google Scholar] [CrossRef]
Soares, P.M.M.; Careto, J.A.M.; Russo, A.; Lima, D.C.A. The Future of Iberian Droughts: A Deeper Analysis Based on Multi-Scenario and a Multi-Model Ensemble Approach. Nat. Hazards 2023, 117, 2001–2028. [Google Scholar] [CrossRef]
Potopová, V.; Štěpánek, P.; Zahradníček, P.; Farda, A.; Türkott, L.; Soukup, J. Projected Changes in the Evolution of Drought on Various Timescales over the Czech Republic According to Euro-CORDEX Models. Int. J. Climatol. 2018, 38, e939–e954. [Google Scholar] [CrossRef]
Leščešen, I. Hydrological Shifts in the Carpathian Basin: Climate Change Impacts on Summer Low-Flows. Geogr. Pannonica 2025, 29, 108–120. [Google Scholar] [CrossRef]
Sutanto, S.J.; Vitolo, C.; Di Napoli, C.; D’Andrea, M.; Van Lanen, H.A.J. Heatwaves, Droughts, and Fires: Exploring Compound and Cascading Dry Hazards at the Pan-European Scale. Environ. Int. 2020, 134, 105276. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Geographical position of the research basin and the stations used in this study.

Figure 2. Historical time series of the SPI, STI, SPEI, and SDI (1961–2020).

Figure 3. Relationship between meteorological and hydrological drought indices.

Figure 4. Random Forest feature importance rankings for lagged SPI, STI, SPEI, and seasonal terms.

Figure 5. Observed vs. predicted Streamflow Drought Index (SDI) results for five machine learning models (test period).

Figure 6. Observed and Random Forest-predicted SDI results during the test period 2009–2020.

Figure 7. Projected Streamflow Drought Index (SDI) results from the Random Forest model for 2021–2050 according to RCP 2.6, RCP 4.5, and RCP 8.5 (bias-corrected) (blue bars—positive SDI; red bars—negative SDI).

Figure 8. Distribution of forecast SDI values (Random Forest, bias-corrected) for 2021–2050.

Figure 9. Hydrostripes of the forecast Streamflow Drought Index (SDI) for 2021–2050 under RCP 2.6, RCP 4.5, and RCP 8.5 scenarios. Hydrostripes illustrate monthly hydrological anomalies relative to the 1961–1990 reference period, where increasingly darker red shades indicate progressively drier-than-average conditions and darker blue shades represent wetter-than-average months.

Table 1. Category thresholds for standardized drought indices (SPI, STI, SPEI, and SDI).

Category	Index Value Range
Extremely wet	≥+2.00
Very wet	+1.50 to +1.99
Moderately wet	+1.00 to +1.49
Near normal	−0.99 to +0.99
Moderate drought	−1.00 to −1.49
Severe drought	−1.50 to −1.99

Table 2. Main hyperparameters and search ranges for the five forecasting models.

Model	Hyperparameter	Description	Candidate Values/Range
Random Forest	n_estimators	Number of trees in ensemble	200, 300, 500
	max_depth	Maximum depth of each tree	None, 4, 6, 8
	min_samples_leaf	Minimum samples at a leaf node	1, 2, 4
	max_features	Predictors considered at each split	‘auto’, ‘sqrt’, 0.5
XGBoost	n_estimators	Number of boosting stages	200, 300, 500
	max_depth	Maximum depth of individual trees	3, 4, 5, 6
	learning_rate	Shrinkage applied to each tree	0.01, 0.03, 0.05, 0.10
	subsample	Row subsampling fraction	0.7, 0.8, 1.0
	colsample_bytree	Column subsampling fraction	0.7, 0.8, 1.0
	min_child_weight	Minimum sum of instance weight in a leaf	1, 3, 5
	gamma	Minimum loss reduction for a split	0, 0.1, 0.2
Elastic Net	alpha	Overall regularisation strength	log-spaced, 0.001–10
Elastic Net	l1_ratio	L1/L2 mixing parameter	0.1, 0.2, …, 0.9
SVR	C	Penalty parameter	log-spaced, 1–1000
	gamma	RBF kernel width	log-spaced, 0.001–10
	epsilon	Insensitive loss width	log-spaced, 0.001–1
MLP	hidden_layer_sizes	Size of hidden layers	(32,), (64,), (32, 16), (64, 32)
	alpha	L2 regularisation term	log-spaced, 1 × 10⁻⁵–1 × 10⁻¹
	learning_rate_init	Initial learning rate	log-spaced, 1 × 10⁻⁴–1 × 10⁻²

Table 3. Descriptive statistics and temporal behaviour of indices.

Index	Count	Mean	Std	Min	25%	50%	75%	Max
SPI	696	0.018	0.900	−4.111	−0.481	0.116	0.602	2.408
STI	696	0.007	0.962	−2.764	−0.611	0.041	0.628	2.943
SPEI	696	−0.004	0.996	−2.122	−0.742	−0.102	0.585	4.014
SDI	696	0.006	0.996	−2.209	−0.706	−0.148	0.550	5.104

Table 4. Evaluation metrics.

Evaluation Metrics	Test Metrics BEFORE Bias Correction
Evaluation Metrics	Random Forest	XGBoost	ElasticNet	SVR	MLP
MAE	0.626	0.653	0.642	0.649	0.652
MSE	0.720	0.767	0.756	0.790	0.784
RMSE	0.848	0.876	0.870	0.889	0.885
R2	0.470	0.436	0.443	0.419	0.423
NSE	0.470	0.436	0.443	0.419	0.423
KGE	0.255	0.197	−1.145	−2.522	−1.832
Evaluation Metrics	Test metrics AFTER Bias Correction
Evaluation Metrics	Random Forest	XGBoost	ElasticNet	SVR	MLP
MAE	0.623	0.636	0.720	0.745	0.742
MSE	0.694	0.710	0.914	0.975	0.915
RMSE	0.833	0.843	0.956	0.988	0.957
R2	0.489	0.478	0.327	0.282	0.327
NSE	0.489	0.478	0.327	0.282	0.327
KGE	0.651	0.482	−3.017	−2.705	−3.193

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Leščešen, I.; Josić, M.; Gnjato, S.; Petrović, A.M.; Bajtek, Z. Future Hydrological Drought and Water Sustainability in the Sava River Basin: Machine Learning Projections Under Climate Change Scenarios. Sustainability 2026, 18, 2678. https://doi.org/10.3390/su18062678

AMA Style

Leščešen I, Josić M, Gnjato S, Petrović AM, Bajtek Z. Future Hydrological Drought and Water Sustainability in the Sava River Basin: Machine Learning Projections Under Climate Change Scenarios. Sustainability. 2026; 18(6):2678. https://doi.org/10.3390/su18062678

Chicago/Turabian Style

Leščešen, Igor, Milan Josić, Slobodan Gnjato, Ana M. Petrović, and Zbyněk Bajtek. 2026. "Future Hydrological Drought and Water Sustainability in the Sava River Basin: Machine Learning Projections Under Climate Change Scenarios" Sustainability 18, no. 6: 2678. https://doi.org/10.3390/su18062678

APA Style

Leščešen, I., Josić, M., Gnjato, S., Petrović, A. M., & Bajtek, Z. (2026). Future Hydrological Drought and Water Sustainability in the Sava River Basin: Machine Learning Projections Under Climate Change Scenarios. Sustainability, 18(6), 2678. https://doi.org/10.3390/su18062678

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Future Hydrological Drought and Water Sustainability in the Sava River Basin: Machine Learning Projections Under Climate Change Scenarios

Abstract

1. Introduction

2. Data and Methods

3. Results

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI