Next Article in Journal
Biochar–Sponge Iron Modified Bioretention System Improved Nitrogen Removal Efficiency for Aquaculture Wastewater Treatment
Next Article in Special Issue
Contrasting Water Quality Responses During Drought Development Versus Recovery in Poyang Lake
Previous Article in Journal
Spatiotemporal Evolution of Compound Dry–Hot Events and Their Impacts on Vegetation Net Primary Productivity in the Yangtze River Basin
Previous Article in Special Issue
Hydrological Analysis of the 2024 Flood in the Upper Biała Lądecka Sub-Basin in South Poland
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assessing Climate Change Impacts on Future Precipitation Using Random Forest Statistical Downscaling of CMIP6 HadGEM3 Projections in the Büyük Menderes Basin

Department of Civil Engineering, Engineering Faculty, Pamukkale University, Denizli 20160, Türkiye
*
Author to whom correspondence should be addressed.
Water 2026, 18(2), 277; https://doi.org/10.3390/w18020277
Submission received: 1 December 2025 / Revised: 8 January 2026 / Accepted: 16 January 2026 / Published: 21 January 2026
(This article belongs to the Special Issue Climate Change Adaptation in Water Resource Management)

Abstract

Climate change increasingly threatens the sustainability of regional water resources; therefore, robust station-scale precipitation projections are essential for basin-level planning. This study aims to develop and evaluate a hybrid, machine-learning-based statistical downscaling framework to generate monthly precipitation projections for the 21st century in the Büyük Menderes Basin, western Türkiye, using the HadGEM3-GC31-LL global climate model from the CMIP6. Monthly observations from 23 rainfall observation stations and ERA5 reanalysis predictors were employed to train station-specific Random Forest (RF) models, with optimal predictor sets identified through a multistage selection procedure (MPSP). Coarse-resolution general circulation model (GCM) fields were harmonized with ERA5 data using a three-stage inverse distance weighting (IDW), Delta, and Variance rescaling approach. The downscaled projections were bias-corrected using Quantile Delta Mapping (QDM) to maintain the climate-change signal. The RF models exhibited strong predictive skill across most stations, with test Nash–Sutcliffe Efficiency (NSE) values ranging from 0.45 to 0.81, RSR values from 0.43 to 0.74, and PBIAS values from −21.99% to +5.29%. Future projections indicate a basin-wide drying trend under both scenarios. Relative to the baseline, mean annual precipitation is projected to decrease by approximately 12.2, 19.6, and 33.7 mm in the near (2025–2050), mid (2051–2075), and late (2076–2099) periods under SSP2-4.5 (Shared Socioeconomic Pathway 2-4.5, a moderate greenhouse gas scenario). Under the high-emission SSP5-8.5 scenario, projected decreases are 25.2, 53.2, and 86.9 mm, respectively. Late-century reductions reach approximately 15–22% in several sub-basins. These findings indicate a substantial decline in future water availability and underscore the value of RF-based hybrid downscaling and trend-preserving bias correction for water resources planning in semi-arid Mediterranean basins.

1. Introduction

Climate change has progressively disrupted natural systems since the early stages of human civilization and has become a primary driver of global warming in recent decades. Greenhouse gas emissions associated with global economic growth and population expansion are a principal cause of this warming trend. As a result, the global hydrological cycle and the long-term sustainability of water resources face significant threats [1].
The spatial and temporal variability of water resources is highly responsive to changes in climatic conditions. This responsiveness highlights the necessity for comprehensive assessments of water resources in basins vulnerable to climate change [2]. Precipitation, a critical component of the hydrological cycle, has shown significant changes in recent decades due to climate forcing. Consequently, water resource planners and decision-makers in many regions have initiated adaptive strategies to address the projected impacts of climate change [3]. For Mediterranean-type climates, recent syntheses report increasing drought risk and water scarcity alongside shifts in precipitation regimes [4]. The effectiveness of these strategies depends on access to reliable future climate projections for the region of interest. General Circulation Models (GCMs) are currently the most advanced tools for generating climate projections of hydroclimatic variables at both global and regional scales.
The latest coordinated framework for global climate projections is the Coupled Model Intercomparison Project Phase 6 (CMIP6), organized under the World Climate Research Programme (WCRP). CMIP6 adopts a federated experimental design centered on a common set of baseline experiments (DECK) and historical simulations, complemented by a suite of endorsed Model Intercomparison Projects (MIPs) that address key scientific questions and provide a standardized basis for model evaluation and impact assessment [5]. In the Mediterranean–Türkiye hotspot context, CMIP6-based assessments have increasingly been used to characterize future hydroclimatic risks, and multi-model ensembles are often used for impact studies [6]. Future climate projections in CMIP6 are primarily generated through ScenarioMIP using Shared Socioeconomic Pathway (SSP) forcing trajectories such as SSP2-4.5 and SSP5-8.5, facilitating consistent scenario-based comparisons across models [7]. In this study, we use the CMIP6 model HadGEM3-GC31-LL (Met Office Hadley Centre), the low-resolution configuration of the HadGEM3-GC31 coupled model system [8]. The present study prioritizes development and demonstration of a reproducible station-scale hybrid SDS framework; therefore multi-model spread assessment is left for future work.
Although GCMs are valuable, their spatial resolution is insufficient to capture basin-scale climate processes. This limitation renders GCM outputs inadequate for hydrological applications that require detailed, fine-scale information [9,10]. To overcome this challenge, both dynamical and statistical downscaling methods have been developed to translate large-scale GCM simulations to local or basin scales [11]. Recent reviews highlight significant advances in statistical downscaling, particularly machine-learning approaches that better capture nonlinearity and complex terrain effects [12,13].
Dynamical downscaling utilizes physically based regional climate models to refine GCM projections; however, its use is limited by high computational requirements, model complexity, and strong dependence on GCM boundary conditions. In contrast, statistical downscaling (SDS) methods are more widely adopted due to their lower computational cost, operational simplicity, and robust predictive capabilities [3,14]. SDS approaches establish statistical transfer functions between large-scale GCM predictors and local scale predictands [15].
Despite their practical advantages, statistical downscaling methods also have limitations. They often assume that the statistical relationship between GCM-scale predictors and local climate remains approximately stable over time, a relationship that can be challenged by changing circulation and moisture regimes. Their performance may decline when applied to conditions outside the training range—such as rare events or situations strongly influenced by local processes (e.g., orographic precipitation). Finally, because SDS does not explicitly constrain solutions to physical laws, additional checks and post-processing may be required to maintain physical plausibility. Accordingly, careful predictor selection, scale harmonization, and trend-preserving post-processing are essential to improving the robustness of SDS-based projections.
The current literature shows that regression-based and ML-based models are frequently preferred due to their efficiency, ease of implementation, and strong predictive performance. Notable regression-based studies have utilized Multiple Linear Regression (MLR) [16,17,18,19,20] and Generalized Linear Models (GLMs) [21,22]. In the ML domain, Artificial Neural Networks (ANNs) have been widely and effectively applied to precipitation downscaling [23,24,25,26,27,28,29,30,31]. Support Vector Machines (SVMs) have also demonstrated strong performance in SDS applications [32,33,34,35,36].
Relevance Vector Machines (RVMs), which are sparse Bayesian analogs to SVMs, provide additional benefits such as reduced model complexity and fewer required parameters. Ghosh and Mujumdar [37] applied RVMs to downscale GCM simulations for basin-scale precipitation and streamflow, demonstrating clear performance advantages over SVMs. Subsequent research has further validated the effectiveness of RVMs for SDS applications [38,39,40]. Recent studies have also investigated hybrid and multi-method approaches, comparing or integrating Genetic Programming, ANN, SVM, and RVM techniques [41,42].
The Random Forest (RF) algorithm is another ML technique that has gained prominence due to its capacity to model nonlinear relationships and its resistance to overfitting, making it well-suited for complex hydroclimatic processes. Comparative studies indicate that RF outperforms other ML models in downscaling satellite-based precipitation datasets [43,44,45,46,47]. Recent assessments using the Coupled Model Intercomparison Project phase 6 (CMIP6), the latest generation of global climate models, confirm that RF often achieves superior downscaling performance compared to SVM across various climate regions [48]. RF has also shown strong skill in predicting daily and sub-daily precipitation from atmospheric variables [49].
In addition to GCM-based SDS, RF is widely used for downscaling precipitation and for integrating satellite data with reanalysis data. It frequently outperforms other ML methods and utilizes extra land-surface and topographic information. Recent explainable ML techniques, such as SHAP (SHapley Additive exPlanations), improve transparency by identifying main predictors and regional factors affecting rainfall [50,51].
Consistent with this evidence, recent CMIP6 case studies, including those from Türkiye, show that RF effectively downscales monthly precipitation. RF can be integrated with quantile-based bias-correction methods, either as a post-processing step or within hybrid correction frameworks [20,52,53].
Despite advances in CMIP6-based climate projections and the rising use of machine learning for statistical downscaling, several issues persist in applying these methods to basin-scale water resource management in Mediterranean semi-arid regions. First, station-scale monthly precipitation projections derived from CMIP6 simulations are still limited for many basins in Türkiye, particularly where complex topography and strong seasonal contrasts demand localized predictor–predictand relationships [6,20]. Second, many downscaling studies rely on either direct predictor–predictand mapping or a single-step bias adjustment. In contrast, fewer studies explicitly address the multi-stage inconsistency between coarse GCM fields and reanalysis-based predictors before model training. Third, while machine learning techniques like RF are increasingly popular, there is a notable lack of integrated hybrid frameworks that include (i) systematic, station-specific predictor selection, (ii) spatial reconciliation of GCM data with reanalysis at basin scales, and (iii) bias correction methods that preserve climate change signals in future projections.
To address these gaps, this study proposes a hybrid CMIP6-to-station downscaling framework for the Büyük Menderes Basin that integrates ERA5 predictors, station observations, and CMIP6 HadGEM3-GC31-LL outputs. The novelty of the approach lies in (1) implementing a station-specific multistage predictor-selection procedure to optimize RF predictors, (2) harmonizing coarse-resolution GCM fields with ERA5 through a three-stage IDW–Delta–Variance rescaling before downscaling, and (3) applying Quantile Delta Mapping (QDM) as a trend-preserving bias-correction step for SSP2-4.5 and SSP5-8.5 projections, thereby supporting consistent near-, mid-, and late-century assessments for basin-scale planning.
Decisions regarding water resources are typically made at the basin and station levels, but climate projections are often provided at coarse scales. The Büyük Menderes Basin in western Türkiye features a semi-arid, Mediterranean-type climate with marked seasonality and significant variability. Its water demand, mainly driven by irrigation, and recurring water stress heighten its sensitivity to climate-induced changes in rainfall. Consequently, accurate, station-level precipitation projections that retain the climate change signal are essential for developing resilient basin management strategies.
This study generates station-scale monthly precipitation projections for the Büyük Menderes Basin by integrating observations from 23 rain gauges, ERA5 predictors, and CMIP6 HadGEM3-GC31-LL simulations under SSP scenarios. It applies bias correction to reduce systematic errors in the projections.
The findings offer important insights into sustainable water resources management in semi-arid regions and provide a scientific basis for national climate adaptation strategies. This study constitutes a pioneering effort in Türkiye by presenting a hybrid SDS framework that integrates CMIP6 climate model outputs to assess regional hydrological impacts of climate change within the strategically significant Büyük Menderes Basin.
Accordingly, the main objectives of this study are to:
(i)
Develop and evaluate a station-specific RF-based statistical downscaling framework for monthly precipitation using ERA5 predictors and observations from 23 rain stations.
(ii)
Generate station-scale monthly precipitation projections from CMIP6 HadGEM3-GC31-LL under SSP2-4.5 and SSP5-8.5 for the near (2025–2050), mid (2051–2075), and late (2076–2099) periods.
(iii)
Quantify projected precipitation changes and their spatial variability across the basin to support water-resources planning in a semi-arid Mediterranean setting; and.
(iv)
Apply trend-preserving bias correction (Quantile Delta Mapping) to reduce systematic errors while retaining the climate-change signal in the downscaled projections.

2. Materials and Methods

2.1. Study Area

The Büyük Menderes Basin, the designated study area, is situated in western Türkiye within the Aegean Region as shown in Figure 1. Encompassing approximately 26,000 km2, the Büyük Menderes Basin accounts for about 3.2% of Türkiye’s total land area and serves as a strategically significant water regime system [54]. The Büyük Menderes River, which extends 584 km from east to west, is the longest river discharging into the Aegean Sea. The basin is essential for supporting agricultural production, hydropower generation, and domestic water supply in the region.
The basin exhibits a typical Mediterranean climate, characterized by hot, dry summers and mild, wet winters. Annual mean temperatures range from 14 °C to 18 °C, with summer temperatures often exceeding 35 °C, particularly in lowland plains. In winter, average minimum temperatures range from 2 °C to 10 °C, and frost events are occasionally observed at higher elevations. The basin’s topography varies significantly, from rugged highlands in the east to fertile alluvial plains in the west. As a result, precipitation patterns are primarily influenced by frontal systems, while orographic precipitation is common in southern and mountainous regions [55]. According to the Köppen–Geiger climate classification, the Büyük Menderes Basin is categorized as ‘Csa’, indicating a hot-summer Mediterranean climate [56].

2.2. Data Sources

Monthly precipitation data from 23 meteorological stations within the basin were collected from official records for the period 1980–2014. During this interval, annual total precipitation ranged from 210 mm to 1305 mm. In recent years, rising temperatures and declining precipitation have led to significant water-supply challenges throughout the basin, particularly for agricultural irrigation.
This study utilized monthly precipitation data from 23 meteorological stations operated by the Turkish State Meteorological Service under the Ministry of Environment, Urbanization, and Climate Change. The locations of the stations are shown in Figure 1, and detailed information on each station is provided in Table 1. The observation stations average rainfall was determined by calculating the mean of the total annual rainfall across the observation period.
The datasets used in this study include:
(i)
Historical precipitation observations from meteorological stations (1980–2014);
(ii)
Reanalysis datasets;
(iii)
GCM simulations from the CMIP6 framework representing both historical and future climate conditions.
The ERA5 reanalysis dataset, produced by the Copernicus Climate Change Service (C3S) of the European Centre for Medium-Range Weather Forecasts (ECMWF), was employed for the 1980–2014 period. In climate change research, datasets published by the ECMWF are frequently utilized in studies across the literature [57].
ERA5 reanalysis data have been widely employed in recent downscaling studies [58,59,60]. Furthermore, recent studies have shown that ERA5 performs well compared with other reanalysis products [61,62]. Data from 53 ERA5 grid points covering the study area were downloaded from the Copernicus Climate Data Store (https://cds.climate.copernicus.eu/datasets, accessed on 10 February 2024). The monthly ERA5 dataset has a spatial resolution of 0.25° × 0.25°, and the arrangement of the selected grids is shown in Figure 1.
Within the CMIP6 framework, this study employs a single GCM to demonstrate and evaluate the proposed hybrid statistical downscaling framework at the station scale. The model was selected primarily due to the completeness and internal consistency of the required variables and pressure levels, and the availability of continuous historical simulations together with SSP2-4.5/SSP5-8.5 projections from a harmonized source. In addition, a basic consistency check against ERA5 reanalysis was performed for the historical period to ensure a physically plausible baseline for the basin. To provide broader context on CMIP6 model performance for monthly precipitation over Türkiye, previous evaluation studies were also considered [63,64,65]. We acknowledge that using a single GCM does not quantify inter-model uncertainty across CMIP6; therefore, the projections should be interpreted as conditional on HadGEM3-GC31-LL. The primary contribution of this study is methodological: the proposed framework is model-agnostic and can be extended to multi-model CMIP6 ensembles in future work to explicitly assess inter-model uncertainty.
GCM generates future climate projections under various greenhouse gas emission pathways. In this study, the medium-emission scenario SSP2-4.5 (corresponding to 4.5 W/m2 radiative forcing by 2100) and the high-emission scenario SSP5-8.5 (corresponding to 8.5 W/m2 radiative forcing by 2100) were utilized. To resolve the spatial resolution mismatch between the GCM and ERA5 datasets, GCM grids were aligned with the ERA5 grid system using IDW and interpolation-based regridding.
Recent hydrological observations indicate a decline in precipitation across the basin. Consequently, the low-emission scenario SSP1-2.6 was excluded from the analysis. The historical period for monthly total precipitation from the GCM covers 1980–2014, while future SSP projections span 2025–2099. The CMIP6 GCM dataset utilized in this study was sourced from the Copernicus Climate Data Store (https://cds.climate.copernicus.eu/datasets). Details of the selected GCM are provided in Table 2.
Regarding ensemble information, the historical, SSP2-4.5 and SSP5-8.5 simulations of HadGEM3-GC31-LL were taken from a single CMIP6 variant (r1i1p1f3) on the native grid (gn). Downscaling was applied to this single realization; no ensemble mean across multiple realizations was used.
This study utilized 12 atmospheric variables from four pressure levels (surface, 200 hPa, 500 hPa, and 850 hPa) from both the ERA5 reanalysis dataset and the selected CMIP6 HadGEM3-GC31-LL general circulation model dataset. The variables used in the analysis are listed in Table 3.

2.3. Methodology

Statistical downscaling methods are widely used to derive local-scale climate information from GCM outputs by modeling the statistical relationships between large-scale atmospheric predictors and local hydrometeorological observations. Once calibrated and validated, these relationships can be used in future GCM scenario simulations to generate localized precipitation projections [66]. In this study, SDS is implemented within a hybrid framework that combines reanalysis-based predictors with station observations and CMIP6 scenario data, enabling station-scale monthly precipitation projections suitable for basin-scale impact assessments.
This section provides sufficient methodological detail to ensure reproducibility. It outlines the process for screening station-specific predictors and details the setup and assessment of the Random Forest downscaling models. It also explains how GCM scenario predictors are prepared for use, including methods to minimize scale-related inconsistencies and the post-processing steps for the downscaled outputs used in scenario-based projections. Figure 2 clearly shows the workflow for this strategy, highlighting the sequence of steps and the flow of information between datasets. The subsequent subsections provide full methodological details, parameter settings, and performance metrics.

2.3.1. Multistage Predictor Selection Procedure

The performance of SDS models is strongly influenced by the relationship between selected atmospheric predictors and local meteorological variables. Consequently, the predictor selection process was conducted with particular care to identify the most appropriate multiple-predictor combinations for each station. Previous studies indicate that optimal predictors may vary according to the location of observation stations. Any predictor can be evaluated prior to the downscaling stage, but it should only be included if it demonstrates a statistically significant correlation with the observations [67].
Predictor selection was conducted independently for each observation station using a spatially weighted correlation-based method to identify the most representative ERA5 grids. This approach employs a scoring system that simultaneously considers both the distance between the station and the grids and the correlation coefficient. As a result, predictors derived from ERA5 grids with strong statistical and spatial representativeness were identified. Previous research has established that model performance is closely associated with the physical relevance of predictors to local hydrometeorological parameters and their degree of correlation [68,69,70].
To elucidate the multivariate relationships among predictor variables and optimize SDS model performance for each station, a multi-stage predictor selection procedure (MPSP) based on the All Possible Regression (APREG) approach was implemented. The procedure included the following steps: identifying ERA5 grids with high spatial representativeness, constructing predictor datasets for each station, conducting an exhaustive subset search using APREG with 5-fold cross-validation, performing multi-criteria model scoring, and executing a sensitivity analysis. The development of these steps was guided by the principles of physical consistency, generalization capability, and parsimony as emphasized in the literature [71,72].
The most representative ERA5 grids for each station were determined by jointly evaluating the correlation coefficient (R) between the grid and station time series and the corresponding distance, to compute a spatially weighted correlation. This method identified the four most representative grids for each station. The average of these four grid time series was then used to generate predictor time series for all ERA5 variables listed in Table 3. This process improved representativeness and minimized noise.
In constructing predictor subsets, the APREG approach, which has demonstrated effectiveness in recent SDS studies, was applied [24,67]. For each station, all possible combinations of 1 to 12 predictors were generated. The primary evaluation metric was root mean square error (RMSE) using 5-fold cross-validation, with additional metrics including adjusted coefficient of determination (Adj. R2), Mallows’ Cp, and the Bayesian Information Criterion (BIC). In accordance with parsimony principles, these penalty-based metrics were weighted under alternative scenarios and analyzed through sensitivity analysis to compute a combined score coefficient. This methodology mitigated overfitting through cross-validation and simultaneously optimized model training performance, physical consistency, and generalization ability. Penalty-based criteria such as Cp and BIC were used to balance model simplicity and predictive performance. Sensitivity analysis quantified the impact of alternative weighting scenarios on model selection, ensuring the identification of stable predictor combinations [73,74,75].
The MPSP analysis was conducted for each station using precipitation data from 23 meteorological stations and the ERA5 dataset. The optimal predictor combination for each station was identified and recommended for subsequent model training. For example, Table 4 presents the MPSP results for the Banaz station, where the four-variable predictor combination achieved the highest score coefficient, whereas combinations with more predictors yielded lower scores. Therefore, the MPSP approach facilitated the use of simple yet effective predictor combinations in model training, avoiding unnecessary complexity.

2.3.2. GCM–ERA5 Datasets Harmonization Using IDW–Delta–Variance Approach

GCM typically produce outputs at coarse spatial resolutions, creating challenges for hydrological and climate-impact studies conducted at the basin scale. The mismatch between GCM resolutions and those of observational or reanalysis datasets impedes meaningful comparisons and model integration. Therefore, harmonizing GCM data with higher-resolution reanalysis datasets is a critical preprocessing requirement [76,77].
A three-stage regridding framework was implemented to adapt GCM outputs to the 0.25° × 0.25° spatial resolution of the ERA5 dataset. This procedure comprises the following steps:
1.
Inverse Distance Weighting: During the first stage, GCM fields were interpolated onto ERA5 grid points using the IDW method, which assigns weights inversely proportional to the distance between source and target grid cells. This approach is a fundamental method for spatially harmonizing datasets with different resolutions. The IDW formulation is provided in Equation (1) [78]:
ŷ x 0 = i = 1 N y i d x 0 , x i p i = 1 N 1 d x 0 , x i p
In this formulation, ŷ represents the estimated value in the target grid, y i denotes the value in the source grid, d x 0 , x i is the distance in kilometers between the target and source grids, and p is the weight parameter.
2.
Delta (Anomaly) Adjustment: To preserve projected climate change signals within the context of ERA5 baseline climatology, the delta (anomaly) method was applied. In this step, GCM future-period deviations were calculated relative to the model’s historical mean climate, and these anomalies were then added to the ERA5 climatological normals [79]. The formulation for Delta Anomaly is given in Equation (2):
Y a n o m t = Y G C M t Ȳ G C M , r e f Y r e g r i d t = Ȳ E R A 5 , r e f Y a n o m t
In this equation, Y a n o m denotes the GCM anomaly value, Y G C M is the GCM raw data, Ȳ G C M , r e f is the reference period (1980–2014) average of the GCM, Y r e g r i d is the GCM value adapted to the ERA5 scale, and Ȳ E R A 5 , r e f is the ERA5 reference period average.
3.
Variance Scaling (Variance Post-processing): In the final stage, the variability of the regridded GCM series was adjusted to match the statistical dispersion of the ERA5 series. This correction reduces systematic variance differences and improves statistical compatibility between the two datasets [76]. The details are presented in Equation (3).
Y r e g r i d t = μ + Y r e g r i d μ × σ E R A 5 σ G C M
where Y r e g r i d represents the variance-corrected regridded data, μ is the mean of the regridded series, σ E R A 5 is the standard deviation of ERA5, and σ G C M is the standard deviation of the GCM.
The combined IDW–Delta–Variance procedure provides a comprehensive preprocessing framework for harmonizing GCM projections with the high-resolution ERA5 dataset before integration into the RF–based statistical downscaling model. This approach resolves the spatial-resolution mismatch between GCM and ERA5 and preserves future climate change signals in GCM dataset through anomaly and variance adjustments.
This method advances beyond conventional interpolation techniques by simultaneously achieving spatial consistency, maintaining projected climate signals, and enhancing statistical fidelity [29,80,81]. As a result, the rescaled GCM dataset were effectively integrated into station-based Random Forest models, leading to substantial improvements in both the reliability and spatial representativeness of future precipitation projections.

2.3.3. Random Forest Method

RF is a powerful and flexible ensemble learning method that constructs a large number of decision trees on bootstrap samples and aggregates their outputs to generate robust predictions. As an ensemble-based, nonparametric machine learning approach, RF is particularly effective at capturing nonlinear relationships and complex high-order interactions among predictors [82]. Within the RF framework, each tree is trained on a randomly drawn subset of the training data, while a randomly selected subset of predictor variables is considered at each node. Each tree learns independently and produces an individual prediction; for regression tasks, the ensemble output is obtained by averaging the predictions across all trees [83].
Randomly selecting both training samples and predictor subsets reduces correlation among trees and, consequently, decreases generalization error. As demonstrated by Breiman [82], the generalization error of a random forest converges toward a limiting value as the number of trees increases. A key advantage of RF is that it does not rely on assumptions regarding the distribution of input variables and is insensitive to variable scaling. The method can therefore produce unbiased or near-unbiased predictions, while effectively handling datasets containing outliers or missing values [84].
ANNs are widely used in statistical downscaling because they can model complex nonlinear relationships between large-scale predictors and local precipitation. Nevertheless, the effectiveness of ANNs depends heavily on network architecture and training methods, often requiring careful predictor normalization and risking overfitting when the model has many parameters relative to the training data. Conversely, RF uses bootstrap aggregation and random predictor subsampling to reduce variance and improve generalization. It is less affected by predictor scaling, provides an internal out-of-bag error estimate, and includes variable metrics that aid physical interpretation. These features make RF especially suitable for station-scale precipitation downscaling, as it handles multiple correlated predictors and nonlinear, intermittent precipitation patterns effectively.
Although conceptually straightforward, RF frequently surpasses conventional classifiers and regressors such as discriminant analysis, support vector machines, and artificial neural networks, primarily due to its robustness against overfitting. Furthermore, RF implementation is straightforward, requiring specification of only two primary hyperparameters: the number of trees in the ensemble (ntree) and the number of predictors randomly sampled at each node (mtry). Model performance typically exhibits low sensitivity to these parameter choices [82], which supports the practicality and stability of RF for large-scale climate applications.
In the context of statistical downscaling, the RF algorithm has several notable advantages. Precipitation is influenced by numerous atmospheric variables, and RF can efficiently process high-dimensional predictor sets without requiring explicit feature selection [85]. These properties have contributed to the growing adoption of RF in climate and hydrological research for both regression and classification tasks [86,87,88,89,90,91]. The operational steps of the RF algorithm used in this study are summarized as follows:
1.
Determination of the number of trees (ntree): A predefined number of decision trees is specified. For each tree, a bootstrap sample is generated by sampling with replacement from the training data. Observations not included in this sample form the out-of-bag (OOB) set for that tree.
2.
Tree construction using random predictor subsets: For each bootstrap sample, an unpruned regression tree is grown. At each node, mtry predictor variables are randomly selected from the full set, and the best split among these candidates is chosen. This process ensures diversity across trees by combining randomness in both data sampling and predictor selection.
3.
Prediction aggregation: For new observations, predictions from all trees are aggregated by averaging in regression tasks or by majority voting in classification tasks to generate the final RF estimate.
4.
OOB error estimation: Each tree is used to predict the OOB observations corresponding to it. Once all OOB predictions are obtained, the out-of-bag root mean square error (OOB-RMSE) is calculated. This provides an unbiased and efficient estimate of the model’s generalization error without requiring a separate validation dataset.
5.
Model selection: The RF configuration yielding the lowest prediction error during the iterative training process is selected as the optimal model for downscaling [82]
Overall, the RF approach offers a robust, computationally efficient, and flexible framework for statistical downscaling, making it suitable for modeling precipitation variability and future climate projections.

2.3.4. Performance Metrics

To assess the performance of the RF-based statistical downscaling models developed for each station, several performance metrics were calculated. These metrics include RMSE, Nash–Sutcliffe Efficiency coefficient ( N S E ), adjusted coefficient of determination ( A d j R 2 ), percent bias ( P B I A S ), and the ratio of root mean square error to the standard deviation of observations ( R S R ), which has become increasingly prominent in hydrology and climate modeling studies [92]. While NSE was used to evaluate predictive skill, Adj R2 was employed to account for the effect of predictor dimensionality in the regression-based downscaling framework.
In addition, OOB-RMSE is directly computed from out-of-bag predictions generated during RF training, providing an internal estimate of generalization errors. The equations for these metrics are provided in Equations (4)–(8), respectively.
R M S E = 1 n   i = 1 n Y i O b s Y i M o d 2
N S E = 1 i = 1 n Y i O b s Y i M o d 2 i = 1 n Y i O b s Ȳ O b s 2
A d j R 2 = 1 ( 1 R 2 ) n 1 n P 1
P B I A S = 100 × i = 1 N Y i M o d Y i O b s i = 1 n Y i O b s
R S R = R M S E σ O b s
In the equations, Y i O b s indicates the observed values, Y i M o d is the model output, Ȳ O b s denotes the mean of the observed data, σ O b s represents the standard deviation of the observed data, where n is the number of observations and P is the number of predictors used in the station-specific RF model. Lower RMSE and RSR values indicate better performance, while NSE values close to 1 signify higher prediction accuracy. PBIAS values near 0 suggest minimal bias. Positive PBIAS indicates overall overestimation, whereas negative values indicate underestimation.

2.3.5. Bias Correction

Outputs from GCM and SDS models often exhibit systematic distributional biases due to spatial resolution limitations, structural deficiencies in physically based climate models, methodological choices in optimization and downscaling. When uncorrected, especially in highly skewed and variable fields such as precipitation, these biases can result in an inaccurate representation of mean precipitation and an erroneous estimation of the frequency and intensity of extreme events. Such inaccuracies undermine the reliability of hydrological impact assessments.
Accordingly, quantile mapping techniques are widely adopted to correct systematic distributional biases in climate model precipitation outputs. In the present hybrid framework, bias correction is applied after statistical downscaling, i.e., as a post-processing step on station-scale precipitation projections, rather than bias-correcting coarse-grid GCM precipitation prior to downscaling. This sequencing is adopted for two main reasons. First, the RF downscaling models are calibrated solely using ERA5 predictors and station observations, and GCM outputs are used only at the projection stage; consequently, applying a bias-correction procedure to GCM precipitation before downscaling is not part of the model calibration pathway and does not necessarily improve the distribution of station-scale precipitation. Second, precipitation biases are scale-dependent: correcting biases at the native GCM grid does not guarantee unbiased station-scale values after downscaling, and the downscaling step may introduce or amplify distributional inconsistencies, which could necessitate an additional correction at the target scale. Therefore, QDM is applied once to the downscaled station precipitation to correct residual distributional biases directly at the impact-relevant scale, while preserving the projected change signal.
Traditional quantile mapping techniques effectively correct discrepancies during the historical period relative to observations; however, they can artificially distort long-term trends in future model projections [93]. To address this limitation, the present study employs the Quantile Delta Mapping (QDM) method, an advanced quantile mapping approach that preserves the projected change signal of GCM scenarios when correcting biases in station-scale future precipitation projections.
Within the QDM framework, the quantile of a model-projected future value is first determined from the cumulative distribution function (CDF) of the future model period. The corresponding historical model value is then identified using the inverse CDF of the historical model distribution. The ratio of these two values quantifies the relative change, or delta, between the historical and projected model climates.
This quantile is then mapped onto the observational distribution using the inverse CDF of the historical observations. The bias-corrected future model value is obtained by multiplying the observational quantile value by the delta factor [93]. Through this process, QDM preserves the climate change signal present in GCM and corrects systematic biases. The method is expressed as follows Equations (9)–(12):
q m , f t =   F m , f t x m , f t ,       q m , f t 0,1
m t = x m , f t F m , h 1 q m , f t  
x o , h t = F o , h 1 q m , f t
x m , f t = x o , h   t × m t
In these formulations, x m , f denotes the model projection, F m , f represents the CDF of the future period model, q m , f is the quantile of the future model value, F m , h 1 is the inverse CDF of the historical model distribution,   m t shows the relative change between the projected and historical model values, x o , h is the quantile value from the observational CDF, F o , h 1 is the inverse CDF of the historical observational distribution, and x m , f denotes the bias-corrected future projection.
Owing to its ability to preserve trends, QDM is regarded as one of the most reliable bias-correction techniques for climate change impact studies [76]. It has been widely implemented in recent climate and hydrological modeling research [94,95]. Comparative evaluations of bias-correction techniques further demonstrate that QDM consistently outperforms alternative methods [96,97,98].

2.3.6. Future Precipitation Projections Under SSP Scenarios

Station-specific RF models were developed using ERA5 predictors and monthly precipitation observations. Following the split shown in Figure 3, RF hyperparameters were optimized in the training period and finalized based on out-of-sample performance in the test period.
The finalized RF models were then applied to GCM simulations to project station-scale precipitation under the SSP2-4.5 and SSP5-8.5 scenarios. For each station, the predictor sets identified by the MPSP were retained to ensure consistency between the historical calibration and scenario applications. Since the RF models were trained using ERA5-based predictors, the GCM fields were first adjusted to align with ERA5 to reduce scale discrepancies between the reanalysis and GCM datasets. Specifically, GCM grids were harmonized with ERA5 grids using the three-stage IDW–Delta–Variance rescaling procedure. The harmonized scenario predictors were then used as inputs to the trained RF models to produce raw downscaled monthly precipitation projections for the entire scenario period (2025–2099).
To correct residual distributional biases while preserving the climate-change signal, the raw projections were bias-corrected using QDM. Although the SSP simulations begin in 2015, results are summarized for three standard future windows—near future (2025–2050), mid-century (2051–2075), and late century (2076–2099)—to highlight changes relative to the historical baseline and to ensure consistency with commonly used reporting periods in climate-impact studies.

3. Results

3.1. Results of Statistical Downscaling Model

During model training, precipitation data from observation stations served as the dependent variable, while multiple atmospheric variables from the ERA5 reanalysis dataset were used as predictors. This approach enabled the capture of multivariate and nonlinear relationships between large-scale climate variables and station-scale precipitation observations. Separate RF-based downscaling models were developed for each station using MATLAB R2025b. The workflow for this strategy is illustrated in Figure 2.
The performance of the RF algorithm depends significantly on the specification of its hyperparameters. Three key hyperparameters were optimized individually for each station using Bayesian Optimization: the number of trees (ntree), which was varied from 50 to 500 to balance flexibility and generalization; the minimum leaf size (MinLeafSize), adjusted between 1 and 20 to mitigate overfitting; and the number of predictor variables at each split (mtry), explored from 1 to p to maintain model diversity and an appropriate bias–variance balance. After dynamic hyperparameter optimization, the RF models were trained for each station.

3.2. Evaluation of Statistical Downscaling Models

The observational and ERA5 reanalysis datasets were divided into two periods for training and testing the downscaling models: 50% for training (1980–1997) and 50% for testing (1998–2014). A key advantage of the RF algorithm is its capacity to internally evaluate model performance using OOB samples. In this study, model performance was assessed using both OOB samples during training and an independent test dataset. OOB samples provide an estimate of internal accuracy by evaluating the model on data not used during training, while the independent test set measures external accuracy by assessing performance on previously unseen data.
Accordingly, a two-step validation approach was employed: internal validation using OOB-RMSE and external validation using test-period performance metrics. This dual validation strategy enhances the scientific rigor of the analysis by demonstrating both the model’s resistance to overfitting and its predictive performance.
During the test period, RMSE values varied between 13.30 and 37.48 mm, while RSR efficiency for the SDS models ranged from 0.43 to 0.74. NSE scores fell between 0.45 and 0.81, and PBIAS values ranged from 5.29% to −21.99%. Additionally, OOB-RMSE values computed by the RF algorithm during model training ranged from 14.95 to 29.48.
Among the 23 observational stations evaluated, test-period NSE values of the RF models were classified as “good” to “very good” for all except five stations, while RSR values received similar ratings for all except four stations. PBIAS values were also assessed as “good” to “very good” for all but one station. These evaluations are based on performance classification criteria widely used in hydrological and climate modeling studies [99,100].
Test period performance metrics and internal validation results from the RF algorithm are presented in Table 5. An example of training and test performance for the Yatağan station is shown in Figure 3.
Comparison of OOB-RMSE and test RMSE values across stations indicates that the RF method generally performs consistently at most locations. The close agreement between OOB-RMSE and test RMSE suggests that the model does not overfit during training [82]. However, larger discrepancies observed at stations such as Station 4, Station 6, and Station 14 may result from local data quality issues, station-grid inconsistencies, or climatic anomalies. These findings emphasize the importance of station-specific evaluations and the need to account for spatial heterogeneity in SDS applications.

3.3. Downscaling of Future Precipitation and Bias Correction

Accurately estimating future climate change impacts is critical for effective planning and sustainable management of regional water resources. The limited spatial resolution of GCM, which serve as primary data sources for climate change assessments, restricts their direct application at basin or station scales. To overcome this constraint, the spatial resolution of GCM data was improved to align with that of the ERA5 dataset during the development of RF-based statistical downscaling models, as previously described.
Two scenarios from the HadGEM3-GC31-LL climate model within the CMIP6 framework were employed: SSP2-4.5, representing a moderate emissions pathway with mitigation measures such as energy efficiency improvements and carbon reduction strategies, and SSP5-8.5, reflecting a high-emission future in which such measures are absent and greenhouse gas emissions increase. Following the alignment of GCM output and ERA5 reanalysis data resolutions, statistical downscaling was performed using RF models trained on optimal predictor combinations for each of the 23 stations.
Extensive research indicates that downscaled results frequently exhibit significant biases, primarily due to model parameterizations and unresolved geophysical processes [19,101,102]. To address these biases, the QDM method introduced by Cannon et al. [93] was applied. Theoretical formulations and implementation steps for QDM have been detailed in previous sections. QDM was chosen for its proven ability to preserve the climate change signal in future projections.
The bias-correction procedure was applied to the downscaled future precipitation datasets, resulting in a substantial reduction in systematic biases. CDFs were generated for observed precipitation, uncorrected projections, and bias-corrected projections at all 23 stations. Figure 4 presents the CDFs for the Yatağan station (Station 1), along with the corresponding PBIAS values before and after bias correction. Due to space constraints, only the results for Yatağan are shown.
Following bias correction, future precipitation projections from the SSP2-4.5 and SSP5-8.5 scenarios of the HadGEM3-GC31-LL model were evaluated at the station scale for three periods: near future (2025–2050), mid-future (2051–2075), and far future (2076–2099). For all stations, mean observed annual precipitation, mean projected annual precipitation, and projected changes are summarized in Table 6 and Table 7.
Compared to the observation period, the SSP2-4.5 scenario projects basin-wide declines in annual total precipitation, averaging 12.2 mm/year in the near future, 19.6 mm/year in the mid-future, and 33.7 mm/year in the far future. Four stations display increases in annual precipitation in the near future; however, only the increase at Söke station is statistically significant. The remaining nineteen stations exhibit declines, with particularly notable reductions (greater than 20 mm/year) at Kavaklıdere, Nazilli, Kuyucak, Sivaslı, and Banaz. In the mid-future, all stations except Söke experience decreases in annual total precipitation, with the largest reduction (56.6 mm/year) at Kavaklıdere. Far-future projections indicate substantial reductions across all stations, with decreases reaching up to 81.1 mm/year. Detailed results are provided in Table 6.
Under the SSP5-8.5 scenario, reductions in mean annual precipitation are more pronounced relative to the observation period: 25.2 mm in the near future, 53.2 mm in the mid-future, and 86.9 mm in the far future. The largest near-future decline (47.8 mm/year) is projected for Nazilli station. Mid-future reductions intensify, reaching up to 94.6 mm/year at certain stations. In the far-future period, decreases approach 124 mm/year across the basin. Comprehensive results for SSP5-8.5 are presented in Table 7.
In summary, the bias-corrected RF-based downscaling results demonstrate a consistent decline in precipitation across the Büyük Menderes Basin under both emission scenarios, with a more pronounced trend under the high-emission SSP5-8.5 pathway. These findings underscore the potential for substantial future changes in the basin’s hydrological balance and water resource availability due to climate change.
Accordingly, the following section presents a detailed evaluation of the spatial distribution of future precipitation trends, temporal comparisons across projection periods, and station-specific precipitation changes throughout the basin.

3.4. Changes in Future Precipitation

Future precipitation changes in the Büyük Menderes Basin were evaluated using bias-corrected RF–based statistical downscaling under the SSP2-4.5 and SSP5-8.5 scenarios. Projections for 23 meteorological stations were analyzed across three future periods: near future (2025–2050), mid-century (2051–2075), and late century (2076–2099). The results indicate a consistent drying trend throughout the basin under both emissions scenarios.
Figure 5 presents the distribution of annual total precipitation from 1980 to 2014, the observational period, with values ranging from 210 mm to 1305 mm across the basin. Precipitation amounts are higher at western and southern stations, including Söke, Kale, Kavaklıdere, and Yatağan, while lower values are observed at inland and eastern stations such as Sarayköy, Sandıklı, Hocalar, Çivril, and Dinar. This spatial distribution reflects the influence of orographic precipitation, shaped by the basin’s topographic gradients and varying proximity to the Aegean Sea.
Figure 6 and Figure 7 display boxplots of future precipitation projections for 2025–2099 under the SSP2-4.5 and SSP5-8.5 scenarios. Compared to the observational period, SSP2-4.5 projections show a general decrease in precipitation. Precipitation remains relatively stable along the western coast, while inland areas such as Nazilli, Kuyucak, and Sarayköy experience more significant declines. These findings suggest that a basin-wide drying trend may develop even under moderate emissions scenarios.
Comparison of future projections with the observational period reveals a general decline in both median and mean annual precipitation throughout the 21st century. SSP2-4.5 results in a moderate drying signal across the basin, whereas SSP5-8.5 leads to substantial reductions in both the distribution range and mean precipitation at all stations. In the late-century period, precipitation decreases reach 15–21%, with the most pronounced declines observed in sub-basins around Nazilli, Sarayköy, Çal, and Güney.
Figure 8 and Figure 9 provide an illustrative station-level time-series view of annual precipitation projections for the Yatağan station. Equivalent annual projection plots were produced for all stations; however, only Yatağan is shown here to maintain a concise presentation. Under SSP2-4.5 in Figure 8, projected annual totals retain substantial interannual variability across the near (2025–2050), mid-century (2051–2075), and late-century (2076–2099) windows, while exhibiting a gradual downward shift from the near to the late period. This shift is reflected by a lower occurrence of years reaching the upper end of historical annual totals in the late-century window, consistent with a modest drying tendency under intermediate forcing.
Under SSP5-8.5 in Figure 9, the drying signal at Yatağan becomes stronger and more persistent, with the annual totals more frequently concentrated toward lower values and fewer late-century years approaching the higher annual amounts characteristic of the historical period. Compared with SSP2-4.5, the late-century window shows a clearer downward displacement of the annual distribution, indicating an intensification of drying under stronger radiative forcing. Together, these station-level trajectories complement the summary change statistics by showing how scenario-dependent differences manifest in the annual sequence while preserving interannual variability.
Overall, the Yatağan example illustrates that the projected changes are expressed not only as shifts in period-average precipitation but also through the year-to-year sequence, where interannual variability remains substantial while the central tendency moves toward lower annual totals with increasing forcing and time horizon. This time-series perspective provides additional context for interpreting scenario- and period-dependent differences, particularly regarding the persistence of drier conditions across successive years.
Figure 10 illustrates the spatial distribution of projected percentage changes in precipitation relative to the observational period. Furthermore, the numerical values of these percentage changes for each station are presented in Table 6 and Table 7 for the SSP245 and SSP585 scenarios, respectively. Under SSP2-4.5, marginal increases are projected for the western coastal areas in the near future, while a general drying trend is anticipated in inland and eastern regions. This trend intensifies in the mid- and late-century periods, with reductions of up to 10–15% in the central basin. These results indicate that, even under moderate emissions, precipitation is projected to decline progressively from the interior toward the east, with an increasing drying tendency.
Under the high-emission scenario, the drying signal is substantially more pronounced. Increased greenhouse gas concentrations result in reductions of approximately 10% in most parts of the basin, even in the near future. In SSP5-8.5 projections, these declines intensify to 15–22% during the mid- and late-century periods and extend across the entire basin. These results demonstrate the basin’s high sensitivity to climate change. Both scenarios project a systematic decrease in precipitation, with the drying tendency doubling under high emissions. This projected decline, significant in both magnitude and duration, is likely to have profound impacts on the basin’s future hydrological regime. The findings highlight considerable risks for sustainable water resource management, agricultural planning, and hydropower potential in the Büyük Menderes Basin under future climate change.

4. Discussion

4.1. Key Findings and Methodological Advantages

Precipitation projections were assessed for near-term (2025–2050), mid-century (2051–2075), and late-century (2076–2099) periods under SSP2-4.5 and SSP5-8.5 scenarios. The results indicate a consistent basin-wide decline in precipitation for both scenarios. SSP2-4.5 projects moderate reductions, whereas the high-emission SSP5-8.5 scenario forecasts decreases of up to 15–22%. These findings suggest substantial shifts in precipitation regimes are probable throughout much of the basin, particularly under SSP5-8.5. The most significant reductions are anticipated in sub-basins near Aydın, Nazilli, Sarayköy, Güney, and Çal.
Although the overall workflow is widely used in climate change impact studies, the present framework incorporates several features that improve physical consistency and station-scale applicability. First, predictors are selected in a station-specific manner (MPSP/APREG), rather than relying on a fixed set of large-scale variables across the basin, which supports robust local skill. Second, GCM fields are harmonized with ERA5 using the IDW–Delta–Variance approach prior to downscaling, reducing scale and distribution mismatches between the driving model and the reanalysis-based training domain. Third, the RF algorithm captures multivariate and nonlinear relationships relevant to monthly precipitation while allowing rigorous evaluation through OOB and independent testing. Finally, trend-preserving QDM is applied at the station scale to correct residual distributional biases without undermining the climate-change signal, which is a known limitation of conventional bias-correction approaches. From an applications perspective, reliable station-scale precipitation projections are also critical for reservoir operation and hydropower planning in the Büyük Menderes Basin; recent optimization and management studies for the basin emphasize that robust future inflow estimates are a key limiting factor [103,104].

4.2. Comparison with Previous Studies

Our basin-scale drying signal is largely consistent with recent CMIP6-based assessments over Türkiye that highlight increasing hydroclimatic stress, particularly across the Aegean–Mediterranean transition zone. For example, a national-scale analysis of downscaled CMIP6 projections reported increasingly dry conditions toward the late century and noted that total precipitation can decline by up to ~20% over the Aegean and Mediterranean regions under SSP5-8.5, implying stronger water-stress conditions compared with SSP2-4.5 [105]. Complementarily, aridity-focused CMIP6 analyses show that multiple aridity indices consistently indicate a shift toward drier climate regimes in Türkiye after the mid-century, with particularly notable increases projected for the inner Aegean among other regions—supporting the interpretation that precipitation reductions in western Anatolia are part of a broader, robust drying tendency [106]. At the same time, model evaluation studies emphasize that CMIP6 generally improves precipitation skill relative to CMIP5 while still exhibiting non-negligible model spread and regional differences, reinforcing that our results should be interpreted as conditional on the selected GCM and the applied downscaling/bias-adjustment chain [65]. Finally, impact-oriented studies from nearby southwestern basins such as the Burdur Basin that incorporate CMIP6 information similarly point to heightened future drought-related pressures on water systems, which is qualitatively aligned with the drying implications derived here for the Büyük Menderes Basin [107].
At the basin scale, our findings also agree with recent CMIP6-driven drought analyses specifically targeting the Büyük Menderes Basin. Using multiple CMIP6 GCMs under SSP2-4.5 and SSP5-8.5 and computing SPEI after bias correction, Rotbeei et al. projected an increase in dry-month occurrence toward the late century, supporting the likelihood of intensifying drought conditions under stronger forcing [55]. Observational analyses further contextualize this signal: trend-based evaluations over the Aegean region (including the Büyük Menderes) using multi-station records identify clear regional warming over recent decades, a key co-driver that can amplify drought risk even when precipitation changes are spatially heterogeneous [108]. Importantly, our mean/median precipitation decreases do not preclude intensification of short-duration rainfall extremes—consistent with national-scale CMIP6 downscaling work showing increased return levels of daily maximum precipitation, with substantially larger increases under SSP5-8.5 [6]. In addition, multi-model CMIP6 basin assessments employ ERA5 as a reference underscore the value of ensemble-based evaluation for quantifying inter-model uncertainty in precipitation and temperature projections, which complements our (single-model) methodological framework and motivates future extension to multi-model ensembles [60]. Finally, basin-scale hydroclimatic impact studies coupling bias-corrected climate projections with hydrological modeling in Türkiye frequently report seasonal redistribution (wetter wet seasons and drier dry seasons), emphasizing that water-management implications depend not only on annual totals but also on intra-annual shifts—an aspect that is consistent with the spatially heterogeneous changes identified in our downscaled station-scale projections [102]. Historical analyses for the Aegean region, including the Büyük Menderes Basin, also indicate pronounced drying [109].

4.3. Limitations and Future Work

A limitation arises from the coarse native resolution of HadGEM3-GC31-LL over the basin, where only a small number of GCM grid cells are available. This may smooth local gradients and under-represent orographic and coastal-transition effects, potentially affecting station-to-station contrasts and extremes. Because the basin is represented by only five native HadGEM3 grid cells, the regridded GCM predictors largely reflect an interpolated large-scale signal with limited independent spatial degrees of freedom. Therefore, fine-scale spatial contrasts among stations should be interpreted primarily as arising from the station-specific RF transfer function rather than from resolved GCM-scale gradients, and projected local differences, especially for extremes, remain more uncertain. In the proposed workflow, this scale mismatch is partly mitigated because the RF models are trained using higher-resolution ERA5 predictors together with station observations, and GCM fields are first harmonized to ERA5 statistics using the IDW–Delta–Variance approach prior to downscaling. Nevertheless, residual structural uncertainty related to coarse GCM sampling remains.
The observational dataset used in this study covers 1980–2014; post-2014 observations were not included due to data-availability constraints. Here, 2025 denotes the start of the scenario projection period rather than an observational endpoint.
In addition, projections were generated using a single CMIP6 realization (r1i1p1f3) rather than an ensemble mean, and therefore internal variability and ensemble spread are not quantified. Accordingly, the projected changes should be interpreted as conditional on the selected model configuration. In addition, future work will explore the sensitivity of projections to predictor choice and downscaling configurations, particularly for extremes. Extending the framework to multi-model and multi-member ensembles and higher-resolution simulations is a natural next step.

5. Conclusions

This study developed station-scale monthly precipitation projections for the Büyük Menderes Basin in western Türkiye using a hybrid statistical downscaling framework that combines Random Forest modeling with trend-preserving quantile delta mapping. ERA5 reanalysis predictors and station observations were used to train and evaluate the downscaling models, and CMIP6 HadGEM3-GC31-LL outputs were subsequently harmonized and downscaled to generate projections for SSP2-4.5 and SSP5-8.5. Model evaluation indicates overall good to very good performance across stations.
Across the basin, projections suggest a consistent tendency toward lower precipitation totals, with moderate reductions under SSP2-4.5 and larger decreases under SSP5-8.5 reaching approximately 15–22% by the late century. The strongest reductions are concentrated in several sub-basins, particularly around Aydın, Nazilli, Sarayköy, Güney, and Çal, indicating spatially heterogeneous but broadly coherent drying patterns. These results highlight increasing hydroclimatic stress and emphasize the need to consider climate-informed planning for water-dependent sectors in the basin. The framework is readily extensible to other semi-arid basins in Türkiye, subject to data availability and local validation.
Limitations include the use of a single driving GCM at coarse native resolution and the lack of explicit inter-model/ensemble uncertainty; therefore, the projections should be interpreted as conditional on the selected CMIP6 configuration. Future work will extend the same framework to multi-model and multi-member ensembles and higher-resolution simulations to quantify uncertainty and strengthen robustness.

Author Contributions

Conceptualization, I.A., G.G. and M.Y.; methodology, I.A., G.G. and M.Y.; software, I.A.; validation, I.A. and G.G.; investigation, I.A.; data curation, I.A.; writing—original draft preparation, I.A.; writing—review and editing, I.A., G.G. and M.Y.; visualization, I.A.; supervision, G.G. and M.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data that support the findings of this study are available from the first author upon reasonable request. Interested readers and researchers can contact the author directly at [iara08@posta.pau.edu.tr] to request access to the data. The data will be provided within a reasonable timeframe and in appropriate formats.

Acknowledgments

This study is based on research conducted as part of the first author’s Ph.D. thesis work.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Kouhestani, S.; Eslamian, S.S.; Abedi-Koupai, J.; Besalatpour, A.A. Projection of climate change impacts on precipitation using soft-computing techniques: A case study in Zayandeh-rud Basin, Iran. Glob. Planet. Change 2016, 144, 158–170. [Google Scholar] [CrossRef]
  2. Pascual, D.; Pla, E.; Lopez-Bustins, J.A.; Retana, J.; Terradas, J. Impacts du changement climatique sur les ressources en eau dans le bassin méditerranéen: Une étude de cas en Catalogne, Espagne. Hydrol. Sci. J. 2015, 60, 2132–2147. [Google Scholar] [CrossRef]
  3. Semiromi, M.T.; Koch, M. Statistical downscaling of precipitation in northwestern Iran using a hybrid model of discrete wavelet transform, artificial neural networks, and quantile mapping. Theor. Appl. Clim. 2024, 155, 6591–6621. [Google Scholar] [CrossRef]
  4. Elham, A.; Wolfgang, C. Mediterranean Region. In Climate Change 2022—Impacts, Adaptation and Vulnerability; Pörtner, H.-O., Roberts, D.C., Tignor, M., Poloczanska, E.S., Mintenbeck, K., Alegría, A., Craig, M., Langsdorf, S., Löschke, S., Möller, V., et al., Eds.; Cambridge University Press: New York, NY, USA, 2022; pp. 2233–2272. [Google Scholar] [CrossRef]
  5. Eyring, V.; Bony, S.; Meehl, G.A.; Senior, C.A.; Stevens, B.; Stouffer, R.J.; Taylor, K.E. Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization. Geosci. Model Dev. 2016, 9, 1937–1958. [Google Scholar] [CrossRef]
  6. Tugrul, T.; Oruc, S.; Gunes, B. Quantifying future rainfall extremes in Türkiye: A CMIP6 ensemble approach with statistical downscaling. Acta Geophys. 2025, 73, 3477–3494. [Google Scholar] [CrossRef]
  7. O’NEill, B.C.; Tebaldi, C.; van Vuuren, D.P.; Eyring, V.; Friedlingstein, P.; Hurtt, G.; Knutti, R.; Kriegler, E.; Lamarque, J.-F.; Lowe, J.; et al. The Scenario Model Intercomparison Project (ScenarioMIP) for CMIP6. Geosci. Model Dev. 2016, 9, 3461–3482. [Google Scholar] [CrossRef]
  8. Andrews, M.B.; Ridley, J.K.; Wood, R.A.; Andrews, T.; Blockley, E.W.; Booth, B.; Burke, E.; Dittus, A.J.; Florek, P.; Gray, L.J.; et al. Historical Simulations with HadGEM3-GC3.1 for CMIP6. J. Adv. Model. Earth Syst. 2020, 12, e2019MS001995. [Google Scholar] [CrossRef]
  9. Chen, J.; Zhang, X.J. Challenges and potential solutions in statistical downscaling of precipitation. Clim. Change 2021, 165, 63. [Google Scholar] [CrossRef]
  10. Mandal, S.; Srivastav, R.K.; Simonovic, S.P. Use of beta regression for statistical downscaling of precipitation in the Campbell River basin, British Columbia, Canada. J. Hydrol. 2016, 538, 49–62. [Google Scholar] [CrossRef]
  11. Wilby, R.; Wigley, T. Downscaling general circulation model output: A review of methods and limitations. Prog. Phys. Geogr. Earth Environ. 1997, 21, 530–548. [Google Scholar] [CrossRef]
  12. Rampal, N.; Hobeichi, S.; Gibson, P.B.; Baño-Medina, J.; Abramowitz, G.; Beucler, T.; González-Abad, J.; Chapman, W.; Harder, P.; Gutiérrez, J.M. Enhancing Regional Climate Downscaling through Advances in Machine Learning. Artif. Intell. Earth Syst. 2024, 3, 230066. [Google Scholar] [CrossRef]
  13. Lei, K.; Zhang, L.; Gao, L. Evaluation of IMERG Precipitation Product Downscaling Using Nine Machine Learning Algorithms in the Qinghai Lake Basin. Water 2025, 17, 1776. [Google Scholar] [CrossRef]
  14. Rashid, M.; Beecham, S.; Chowdhury, R.K. Statistical downscaling of rainfall: A non-stationary and multi-resolution approach. Theor. Appl. Clim. 2016, 124, 919–933. [Google Scholar] [CrossRef]
  15. Okkan, U.; Inan, G. Statistical downscaling of monthly reservoir inflows for Kemer watershed in Turkey: Use of machine learning methods, multiple GCMs and emission scenarios. Int. J. Clim. 2015, 35, 3274–3295. [Google Scholar] [CrossRef]
  16. Hessami, M.; Gachon, P.; Ouarda, T.B.; St-Hilaire, A. Automated regression-based statistical downscaling tool. Environ. Model. Softw. 2008, 23, 813–834. [Google Scholar] [CrossRef]
  17. Pahlavan, H.A.; Zahraie, B.; Nasseri, M.; Varnousfaderani, A.M. Improvement of multiple linear regression method for statistical downscaling of monthly precipitation. Int. J. Environ. Sci. Technol. 2018, 15, 1897–1912. [Google Scholar] [CrossRef]
  18. Zuo, J.; Xu, J.; Chen, Y.; Wang, C. Downscaling precipitation in the data-scarce Inland River Basin of Northwest China based on earth system data products. Atmosphere 2019, 10, 613. [Google Scholar] [CrossRef]
  19. Sachindra, D.A.; Huang, F.; Barton, A.; Perera, B.J.C. Statistical Downscaling of General Circulation Model Outputs to Precipitation—Part 1: Calibration and validation. Int. J. Clim. 2014, 34, 3264–3281. [Google Scholar] [CrossRef]
  20. Ozbuldu, M.; Irvem, A. Projecting and Downscaling Future Temperature and Precipitation Based on CMIP6 Models Using Machine Learning in Hatay Province, Türkiye. Pure Appl. Geophys. 2025, 182, 1825–1842. [Google Scholar] [CrossRef]
  21. Fealy, R.; Sweeney, J. Statistical downscaling of precipitation for a selection of sites in Ireland employing a generalised linear modelling approach. Int. J. Clim. 2007, 27, 2083–2094. [Google Scholar] [CrossRef]
  22. Asong, Z.; Khaliq, M.; Wheater, H. Projected changes in precipitation and temperature over the Canadian Prairie Provinces using the Generalized Linear Model statistical downscaling approach. J. Hydrol. 2016, 539, 429–446. [Google Scholar] [CrossRef]
  23. Wilby, R.; Hay, L.; Leavesley, G. A comparison of downscaled and raw GCM output: Implications for climate change scenarios in the San Juan River basin, Colorado. J. Hydrol. 1999, 225, 67–91. [Google Scholar] [CrossRef]
  24. Okkan, U.; Fistikoglu, O. Evaluating climate change effects on runoff by statistical downscaling and hydrological model GR2M. Theor. Appl. Clim. 2014, 117, 343–361. [Google Scholar] [CrossRef]
  25. Dorji, S.; Herath, S.; Mishra, B.K. Future climate of Colombo downscaled with SDSM-neural network. Climate 2017, 5, 24. [Google Scholar] [CrossRef]
  26. Noor, M.; Ismail, T.; Chung, E.-S.; Shahid, S.; Sung, J.H. Uncertainty in rainfall intensity duration frequency curves of Peninsular Malaysia under changing climate scenarios. Water 2018, 10, 1750. [Google Scholar] [CrossRef]
  27. Tanteliniaina, M.F.R.; Rahaman, H.; Zhai, J. Assessment of the future impact of climate change on the hydrology of the mangoky river, madagascar using ann and swat. Water 2021, 13, 1239. [Google Scholar] [CrossRef]
  28. Haji-Aghajany, S.; Amerian, Y.; Amiri-Simkooei, A. Function-Based Troposphere Tomography Technique for Optimal Downscaling of Precipitation. Remote Sens. 2022, 14, 2548. [Google Scholar] [CrossRef]
  29. Prathom, C.; Champrasert, P. General Circulation Model Downscaling Using Interpolation—Machine Learning Model Combination—Case Study: Thailand. Sustainability 2023, 15, 9668. [Google Scholar] [CrossRef]
  30. Gumus, V.; El Moçayd, N.; Seker, M.; Seaid, M. Evaluation of future temperature and precipitation projections in Morocco using the ANN-based multi-model ensemble from CMIP6. Atmos. Res. 2023, 292, 106880. [Google Scholar] [CrossRef]
  31. Ganjei, S.; Shiri, J.; Karimi, S. Evaluating climate change impacts on reference evapotranspiration using CMIP6 projections and machine learning in the Aras River Basin. Model. Earth Syst. Environ. 2026, 12, 3. [Google Scholar] [CrossRef]
  32. Tripathi, S.; Srinivas, V.; Nanjundiah, R.S. Downscaling of precipitation for climate change scenarios: A support vector machine approach. J. Hydrol. 2006, 330, 621–640. [Google Scholar] [CrossRef]
  33. Chen, S.-T.; Yu, P.-S.; Tang, Y.-H. Statistical downscaling of daily precipitation using support vector machines and multivariate analysis. J. Hydrol. 2010, 385, 13–22. [Google Scholar] [CrossRef]
  34. Sachindra, D.A.; Huang, F.; Barton, A.; Perera, B.J. Least square support vector and multi-linear regression for statistically downscaling general circulation model outputs to catchment streamflows. Int. J. Climatol. 2013, 33, 1087–1106. [Google Scholar] [CrossRef]
  35. Yazdian, H.; Salmani-Dehaghi, N.; Alijanian, M. A spatially promoted SVM model for GRACE downscaling: Using ground and satellite-based datasets. J. Hydrol. 2023, 626, 130214. [Google Scholar] [CrossRef]
  36. Shukla, R.; Khare, D.; Dwivedi, A.K.; Rudra, R.P.; Palmate, S.S.; Ojha, C.S.P.; Singh, V.P. Evaluation of statistical downscaling model’s performance in projecting future climate change scenarios. J. Water Clim. Change 2023, 14, 3559–3595. [Google Scholar] [CrossRef]
  37. Ghosh, S.; Mujumdar, P. Statistical downscaling of GCM simulations to streamflow using relevance vector machine. Adv. Water Resour. 2008, 31, 132–146. [Google Scholar] [CrossRef]
  38. Okkan, U.; Inan, G. Bayesian Learning and Relevance Vector Machines Approach for Downscaling of Monthly Precipitation. J. Hydrol. Eng. 2014, 20, 04014051. [Google Scholar] [CrossRef]
  39. Okkan, U.; Serbes, Z.A.; Samui, P. Relevance vector machines approach for long-term flow prediction. Neural Comput. Appl. 2014, 25, 1393–1405. [Google Scholar] [CrossRef]
  40. George, J.; Athira, P. A Multi-stage Stochastic Approach for Statistical Downscaling of Rainfall. Water Resour. Manag. 2023, 37, 5477–5492. [Google Scholar] [CrossRef]
  41. Sachindra, D.; Ahmed, K.; Rashid, M.; Shahid, S.; Perera, B. Statistical downscaling of precipitation using machine learning techniques. Atmos. Res. 2018, 212, 240–258. [Google Scholar] [CrossRef]
  42. Sulaiman, N.A.F.; Shaharudin, S.M.; Ismail, S.; Zainuddin, N.H.; Tan, M.L.; Jalil, Y.A. Predictive Modelling of Statistical Downscaling Based on Hybrid Machine Learning Model for Daily Rainfall in East-Coast Peninsular Malaysia. Symmetry 2022, 14, 927. [Google Scholar] [CrossRef]
  43. Shi, Y.; Song, L.; Xia, Z.; Lin, Y.; Myneni, R.B.; Choi, S.; Wang, L.; Ni, X.; Lao, C.; Yang, F. Mapping annual precipitation across Mainland China in the period 2001–2010 from TRMM3B43 product using spatial downscaling approach. Remote Sens. 2015, 7, 5849–5878. [Google Scholar] [CrossRef]
  44. Jing, W.; Yang, Y.; Yue, X.; Zhao, X. A Comparison of different regression algorithms for downscaling monthly satellite-based precipitation over North China. Remote Sens. 2016, 8, 835. [Google Scholar] [CrossRef]
  45. Li, X.; Zhang, G.; Zhu, S.; Xu, Y. Stepwise Downscaling of ERA5-Land Reanalysis Air Temperature: A Case Study in Nanjing, China. Remote Sens. 2025, 17, 2063. [Google Scholar] [CrossRef]
  46. Medrano, S.C.; Satgé, F.; Molina-Carpio, J.; Zolá, R.P.; Bonnet, M.-P. Downscaling Daily Satellite-Based Precipitation Estimates Using MODIS Cloud Optical and Microphysical Properties in Machine-Learning Models. Atmosphere 2023, 14, 1349. [Google Scholar] [CrossRef]
  47. Fu, Y.; Zhuang, H.; Shen, X.; Li, W. Assessment and prediction of regional climate based on a multimodel ensemble machine learning method. Clim. Dyn. 2023, 61, 4139–4158. [Google Scholar] [CrossRef]
  48. Paz, S.M.; Gómez, M.F.V.; Willems, P. Adapting to Climate Change with Machine Learning: The Robustness of Downscaled Precipitation in Local Impact Analysis. Water 2024, 16, 3070. [Google Scholar] [CrossRef]
  49. Diez-Sierra, J.; del Jesus, M. Subdaily rainfall estimation through daily rainfall downscaling using random forests in Spain. Water 2019, 11, 125. [Google Scholar] [CrossRef]
  50. Xu, C.; Liu, C.; Zhang, W.; Li, Z.; An, B. Downscaling and Merging of Daily Scale Satellite Precipitation Data in the Three River Headwaters Region Fused with Cloud Attributes and Rain Gauge Data. Water 2023, 15, 1233. [Google Scholar] [CrossRef]
  51. Hisam, E.; Sertel, E.; Seker, D.Z. Precipitation downscaling with the integration of multiple precipitation products, land surface data and gauge stations using explainable machine learning algorithms: A case study in the Mediterranean region of Turkiye. Sci. Total. Environ. 2025, 1002, 180540. [Google Scholar] [CrossRef]
  52. Raeesi, M.; Zolfaghari, A.A.; Kaboli, S.H.; Rahimi, M.; de Vente, J.; Eekhout, J.P.C. Using quantile mapping and random forest for bias-correction of high-resolution reanalysis precipitation data and CMIP6 climate projections over Iran. Int. J. Clim. 2024, 44, 4495–4514. [Google Scholar] [CrossRef]
  53. Liu, T.; Zhu, X.; Tang, M.; Guo, C.; Lu, D. Multi-model ensemble bias-corrected precipitation dataset and its application in identification of drought-flood abrupt alternation in China. Atmos. Res. 2024, 307, 107481. [Google Scholar] [CrossRef]
  54. General Directorate of Water Management. Preliminary Scoping Report for the Strategic Environmental Assessment of the Büyük Menderes Basin Flood Management Plan Update Project; Republic of Türkiye Ministry of Agriculture and Forestry General Directorate of Water Management: Ankara, Turkey, 2025. [Google Scholar]
  55. Rotbeei, F.; Balov, M.N.; Safari, M.J.S.; Vaheddoost, B. Projected Drought Intensification in the Büyük Menderes Basin Under CMIP6 Climate Scenarios. Climate 2025, 13, 47. [Google Scholar] [CrossRef]
  56. Peel, M.C.; Finlayson, B.L.; Mcmahon, T.A. Hydrology and Earth System Sciences Updated World Map of the Köppen-Geiger Climate Classification. 2007. Available online: www.hydrol-earth-syst-sci.net/11/1633/2007/ (accessed on 2 October 2025).
  57. Ekinci, D.D.; Fistikoğlu, O. Gördes Havzası Akışlarının Modellenmesinde Era5-Land Verilerinin Performans Değerlendirmesi. DÜMF Mühendislik Derg. 2023, 14, 507–517. [Google Scholar] [CrossRef]
  58. Skoulikaris, C.; Venetsanou, P.; Lazoglou, G.; Anagnostopoulou, C.; Voudouris, K. Spatio-Temporal Interpolation and Bias Correction Ordering Analysis for Hydrological Simulations: An Assessment on a Mountainous River Basin. Water 2022, 14, 660. [Google Scholar] [CrossRef]
  59. Andrade-Velázquez, M.; Montero-Martínez, M.J. Statistical Downscaling of Precipitation in the South and Southeast of Mexico. Climate 2023, 11, 186. [Google Scholar] [CrossRef]
  60. Yılmaz, B.; Aras, E.; Nacar, S. A CMIP6-ensemble-based evaluation of precipitation and temperature projections. Theor. Appl. Clim. 2024, 155, 7377–7401. [Google Scholar] [CrossRef]
  61. Hassler, B.; Lauer, A. Comparison of reanalysis and observational precipitation datasets including ERA5 and WFDE5. Atmosphere 2021, 12, 1462. [Google Scholar] [CrossRef]
  62. Nacar, S.; Kankal, M.; Okkan, U. Evaluation of the suitability of NCEP/NCAR, ERA-Interim and, ERA5 reanalysis data sets for statistical downscaling in the Eastern Black Sea Basin, Turkey. Meteorol. Atmos. Phys. 2022, 134, 39. [Google Scholar] [CrossRef]
  63. Yılmaz, M.; Alemdar, K.D.; Tosunoğlu, F. Evaluation of NEX-GDDP-CMIP6 to simulate precipitation using multi-criteria decision-making analysis over Türkiye. Theor. Appl. Clim. 2025, 156, 578. [Google Scholar] [CrossRef]
  64. Oruc, S. Performance of bias corrected monthly CMIP6 climate projections with different reference period data in Turkey. Acta Geophys. 2022, 70, 777–789. [Google Scholar] [CrossRef]
  65. Bağçaci, S.Ç.; Yucel, I.; Duzenli, E.; Yilmaz, M.T. Intercomparison of the expected change in the temperature and the precipitation retrieved from CMIP6 and CMIP5 climate projections: A Mediterranean hot spot case, Turkey. Atmos. Res. 2021, 256, 105576. [Google Scholar] [CrossRef]
  66. Raj, A.D.; Kumar, S.; Sooryamol, K.R. Downscaling future precipitation with shared socioeconomic pathway (SSP) scenarios using machine learning models in the North-Western Himalayan region. Model. Earth Syst. Environ. 2024, 10, 6373–6395. [Google Scholar] [CrossRef]
  67. Fistikoglu, O.; Okkan, U. Statistical Downscaling of Monthly Precipitation Using NCEP/NCAR Reanalysis Data for Tahtali River Basin in Turkey. J. Hydrol. Eng. 2011, 16, 157–164. [Google Scholar] [CrossRef]
  68. Wilby, R.; Dawson, C.; Barrow, E. Sdsm—A Decision Support Tool for the Assessment of Regional Climate Change Impacts. 2002. Available online: www.elsevier.com/locate/envsoft (accessed on 16 October 2025).
  69. Benestad, R.E.; Chen, D.; Hanssen-Bauer, I. Empirical-Statistical Downscaling; World Scientific Publishing Company: Singapore, 2008. [Google Scholar]
  70. Okkan, U.; Kirdemir, U. Downscaling of monthly precipitation using CMIP5 climate models operated under RCPs. Meteorol. Appl. 2016, 23, 514–528. [Google Scholar] [CrossRef]
  71. Fowler, H.J.; Blenkinsop, S.; Tebaldi, C. Linking climate change modelling to impacts studies: Recent advances in downscaling techniques for hydrological modelling. Int. J. Clim. 2007, 27, 1547–1578. [Google Scholar] [CrossRef]
  72. Maraun, D.; Wetterhall, F.; Ireson, A.M.; Chandler, R.E.; Kendon, E.J.; Widmann, M.; Brienen, S.; Rust, H.W.; Sauter, T.; Themeßl, M.; et al. Precipitation downscaling under climate change: Recent developments to bridge the gap between dynamical models and the end user. Rev. Geophys. 2010, 48. [Google Scholar] [CrossRef]
  73. Mallows, C.L. Some Comments on Cp. Technometrics 1973, 15, 661–675. [Google Scholar] [CrossRef]
  74. Schwarz, G. Schwarz-1978-BIC. Ann. Stat. 1978, 6, 461–464. [Google Scholar]
  75. Dormann, C.F.; Elith, J.; Bacher, S.; Buchmann, C.; Carl, G.; Carré, G.; Marquéz, J.R.G.; Gruber, B.; Lafourcade, B.; Leitão, P.J.; et al. Collinearity: A review of methods to deal with it and a simulation study evaluating their performance. Ecography 2013, 36, 27–46. [Google Scholar]
  76. Hempel, S.; Frieler, K.; Warszawski, L.; Schewe, J.; Piontek, F. A trend-preserving bias correction—The ISI-MIP approach. Earth Syst. Dyn. 2013, 4, 219–236. [Google Scholar] [CrossRef]
  77. Teutschbein, C.; Seibert, J. Bias correction of regional climate model simulations for hydrological climate-change impact studies: Review and evaluation of different methods. J. Hydrol. 2012, 456–457, 12–29. [Google Scholar] [CrossRef]
  78. Shepard, D. A two-dimensional interpolation for irregularly spaced data function. In Proceedings of the 1968 ACM National Conference, San Francisco, CA, USA, 27–29 August 1968; pp. 517–523. [Google Scholar]
  79. Hay, L.E.; Wilby, R.L.; Leavesley, G.H. A comparison of delta change and downscaled GCM scenarios for three mountainous basins in the United States. J. Am. Water Resour. Assoc. 2000, 36, 387–397. [Google Scholar] [CrossRef]
  80. Zhang, X.-C.; Liu, W.-Z.; Li, Z.; Chen, J. Trend and uncertainty analysis of simulated climate change impacts with multiple GCMs and emission scenarios. Agric. For. Meteorol. 2011, 151, 1297–1304. [Google Scholar] [CrossRef]
  81. Umugwaneza, A.; Chen, X.; Liu, T.; Li, Z.; Uwamahoro, S.; Mind’je, R.; Umwali, E.D.; Ingabire, R.; Uwineza, A. Future climate change impact on the nyabugogo catchment water balance in rwanda. Water 2021, 13, 3636. [Google Scholar] [CrossRef]
  82. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar]
  83. Zhao, N.; Chen, K. A Simple Method of Coupled Merging and Downscaling for Multi-Source Daily Precipitation Data. Remote Sens. 2023, 15, 4377. [Google Scholar] [CrossRef]
  84. Nguyen, G.V.; Le, X.-H.; Van, L.N.; Jung, S.; Yeon, M.; Lee, G. Application of random forest algorithm for merging multiple satellite precipitation products across South Korea. Remote Sens. 2021, 13, 4033. [Google Scholar] [CrossRef]
  85. Yan, X.; Chen, H.; Tian, B.; Sheng, S.; Wang, J.; Kim, J.-S. A downscaling–merging scheme for improving daily spatial precipitation estimates based on random forest and cokriging. Remote Sens. 2021, 13, 2040. [Google Scholar] [CrossRef]
  86. He, X.; Chaney, N.W.; Schleiss, M.; Sheffield, J. Spatial downscaling of precipitation using adaptable random forests. Water Resour. Res. 2016, 52, 8217–8237. [Google Scholar] [CrossRef]
  87. Yang, Y.; Cao, C.; Pan, X.; Li, X.; Zhu, X. Downscaling land surface temperature in an arid area by using multiple remote sensing indices with random forest regression. Remote Sens. 2017, 9, 789. [Google Scholar] [CrossRef]
  88. Pham, Q.B.; Yang, T.-C.; Kuo, C.-M.; Tseng, H.-W.; Yu, P.-S. Combing random forest and least square support vector regression for improving extreme rainfall downscaling. Water 2019, 11, 451. [Google Scholar] [CrossRef]
  89. Song, Y.H.; Chung, E.-S.; Shiru, M.S. Uncertainty analysis of monthly precipitation in GCMs using multiple bias correction methods under different RCPs. Sustainability 2020, 12, 7508. [Google Scholar] [CrossRef]
  90. Legasa, M.N.; Manzanas, R.; Calviño, A.; Gutiérrez, J.M. A Posteriori Random Forests for Stochastic Downscaling of Precipitation by Predicting Probability Distributions. Water Resour. Res. 2022, 58, e2021WR030272. [Google Scholar] [CrossRef]
  91. Chen, C.; He, Q.; Li, Y. Downscaling and merging multiple satellite precipitation products and gauge observations using random forest with the incorporation of spatial autocorrelation. J. Hydrol. 2024, 632, 130919. [Google Scholar] [CrossRef]
  92. Moriasi, D.N.; Arnold, J.G.; van Liew, M.W.; Bingner, R.L.; Harmel, R.D.; Veith, T.L. Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans. ASABE 2007, 50, 885–900. [Google Scholar] [CrossRef]
  93. Cannon, A.J.; Sobie, S.R.; Murdock, T.Q. Bias correction of GCM precipitation by quantile mapping: How well do methods preserve changes in quantiles and extremes? J. Clim. 2015, 28, 6938–6959. [Google Scholar] [CrossRef]
  94. Xavier, A.C.F.; Martins, L.L.; Rudke, A.P.; de Morais, M.V.B.; Martins, J.A.; Blain, G.C. Evaluation of Quantile Delta Mapping as a bias-correction method in maximum rainfall dataset from downscaled models in São Paulo state (Brazil). Int. J. Clim. 2022, 42, 175–190. [Google Scholar] [CrossRef]
  95. Lehner, F.; Nadeem, I.; Formayer, H. Evaluating skills and issues of quantile-based bias adjustment for climate change scenarios. Adv. Stat. Clim. Meteorol. Oceanogr. 2023, 9, 29–44. [Google Scholar] [CrossRef]
  96. Tegegne, G.; Melesse, A.M. Comparison of Trend Preserving Statistical Downscaling Algorithms Toward an Improved Precipitation Extremes Projection in the Headwaters of Blue Nile River in Ethiopia. Environ. Process. 2021, 8, 59–75. [Google Scholar] [CrossRef]
  97. Qian, W.; Chang, H.H. Projecting health impacts of future temperature: A comparison of quantile-mapping bias-correction methods. Int. J. Environ. Res. Public Health 2021, 18, 1992. [Google Scholar] [CrossRef]
  98. Ibebuchi, C.C.; Schönbein, D.; Adakudlu, M.; Xoplaki, E.; Paeth, H. Comparison of Three Techniques to Adjust Daily Precipitation Biases from Regional Climate Models over Germany. Water 2022, 14, 600. [Google Scholar] [CrossRef]
  99. Abbaspour, K.C.; Vaghefi, S.A.; Srinivasan, R. A guideline for successful calibration and uncertainty analysis for soil and water assessment: A review of papers from the 2016 international SWAT conference. Water 2017, 10, 6. [Google Scholar] [CrossRef]
  100. Abbaspour, K.C.; Rouholahnejad, E.; Vaghefi, S.; Srinivasan, R.; Yang, H.; Kløve, B. A continental-scale hydrology and water quality model for Europe: Calibration and uncertainty of a high-resolution large-scale SWAT model. J. Hydrol. 2015, 524, 733–752. [Google Scholar] [CrossRef]
  101. Nauman, S.; Zulkafli, Z.; Bin Ghazali, A.H.; Yusuf, B. Impact assessment of future climate change on streamflows upstream of Khanpur Dam, Pakistan using Soil and Water Assessment Tool. Water 2019, 11, 1090. [Google Scholar] [CrossRef]
  102. Baycan, A.; Sonmez, O.; Evcil, G.T. Climate Change Effects on Precipitation and Streamflow in the Mediterranean Region. Water 2025, 17, 2556. [Google Scholar] [CrossRef]
  103. Yasar, M. Optimization of reservoir operation using cuckoo search algorithm: Example of Adiguzel Dam, Denizli, Turkey. Math. Probl. Eng. 2016, 2016, 1316038. [Google Scholar] [CrossRef]
  104. Ara, I.; Yasar, M. Optimization of the Operation Rule Curves for Cascade Reservoirs Using the Cuckoo Search Algorithm. Adv. Civ. Eng. 2022, 2022, 8463358. [Google Scholar] [CrossRef]
  105. Gumus, B.; Oruc, S.; Yucel, I.; Yilmaz, M.T. Impacts of Climate Change on Extreme Climate Indices in Türkiye Driven by High-Resolution Downscaled CMIP6 Climate Models. Sustainability 2023, 15, 7202. [Google Scholar] [CrossRef]
  106. Yavaşlı, D.D.; Erlat, E. Climate model projections of aridity patterns in Türkiye: A comprehensive analysis using CMIP6 models and three aridity indices. Int. J. Clim. 2023, 43, 6207–6224. [Google Scholar] [CrossRef]
  107. Çolak, M.A.; Öztaş, B.; Özgencil, I.K.; Soyluer, M.; Korkmaz, M.; Ramírez-García, A.; Metin, M.; Yılmaz, G.; Ertuğrul, S.; Tavşanoğlu, Ü.N.; et al. Increased Water Abstraction and Climate Change Have Substantial Effect on Morphometry, Salinity, and Biotic Communities in Lakes: Examples from the Semi-Arid Burdur Basin (Turkey). Water 2022, 14, 1241. [Google Scholar] [CrossRef]
  108. Mersin, D.; Tayfur, G.; Vaheddoost, B.; Safari, M.J.S. Historical Trends Associated with Annual Temperature and Precipitation in Aegean Turkey, Where Are We Heading? Sustainability 2022, 14, 13380. [Google Scholar] [CrossRef]
  109. Tanrıkulu, A.; Guner, U.; Bahar, E. The Assessment of Precipitation and Droughts in the Aegean Region Using Stochastic Time Series and Standardized Precipitation Index. Atmosphere 2024, 15, 1001. [Google Scholar] [CrossRef]
Figure 1. Study area and location of meteorological stations in the Büyük Menderes Basin in Türkiye.
Figure 1. Study area and location of meteorological stations in the Büyük Menderes Basin in Türkiye.
Water 18 00277 g001
Figure 2. Flowchart of the methodology applied for precipitation downscaling.
Figure 2. Flowchart of the methodology applied for precipitation downscaling.
Water 18 00277 g002
Figure 3. RF model training and testing results at Yatağan station (Station 1).
Figure 3. RF model training and testing results at Yatağan station (Station 1).
Water 18 00277 g003
Figure 4. Kernel-CDF comparison of future scenarios for Yatağan station (Station 1).
Figure 4. Kernel-CDF comparison of future scenarios for Yatağan station (Station 1).
Water 18 00277 g004
Figure 5. Precipitation Distribution Observed from 1980 to 2014.
Figure 5. Precipitation Distribution Observed from 1980 to 2014.
Water 18 00277 g005
Figure 6. Precipitation Distribution from 2025 to 2099 based on SSP2-4.5 Projection.
Figure 6. Precipitation Distribution from 2025 to 2099 based on SSP2-4.5 Projection.
Water 18 00277 g006
Figure 7. Precipitation Distribution from 2025 to 2099 based on SSP5-8.5 Projection.
Figure 7. Precipitation Distribution from 2025 to 2099 based on SSP5-8.5 Projection.
Water 18 00277 g007
Figure 8. Annual Precipitation Projections under SSP2-4.5 for (a) 2025–2050, (b) 2051–2075 and (c) 2076–2099 at Yatağan station (Station 1).
Figure 8. Annual Precipitation Projections under SSP2-4.5 for (a) 2025–2050, (b) 2051–2075 and (c) 2076–2099 at Yatağan station (Station 1).
Water 18 00277 g008
Figure 9. Annual Precipitation Projections under SSP5-8.5 for (a) 2025–2050, (b) 2051-2075 and (c) 2076-2099 at Yatağan station (Station 1).
Figure 9. Annual Precipitation Projections under SSP5-8.5 for (a) 2025–2050, (b) 2051-2075 and (c) 2076-2099 at Yatağan station (Station 1).
Water 18 00277 g009
Figure 10. Heat maps showing the annual percentage change in precipitation for the Büyük Menderes Basin. (%).
Figure 10. Heat maps showing the annual percentage change in precipitation for the Büyük Menderes Basin. (%).
Water 18 00277 g010
Table 1. Meteorological stations used in the Büyük Menderes Basin.
Table 1. Meteorological stations used in the Büyük Menderes Basin.
NumberStation
Name
Station
No
Latitude
(°N)
Longitude
(°E)
Elevation
(m)
Station 1Yatağan1788637.339528.1369365
Station 2Çine1802337.594728.044770
Station 3Kavaklıdere1802237.450828.3621973
Station 4Kale1830137.435328.86081208
Station 5Tavas1806537.565029.0706992
Station 6Söke1788137.704927.382775
Station 7Aydın1723437.840227.837956
Station 8Sultanhisar1785037.884328.150473
Station 9Karacasu1802537.719428.6075580
Station 10Denizli1723737.762029.0921425
Station 11Nazilli1786037.913528.343784
Station 12Kuyucak1842937.902228.456475
Station 13Sarayköy1806837.911128.9081194
Station 14Çal1806738.089429.3953817
Station 15Dinar1786238.059730.1531864
Station 16Güney1782438.151529.0587825
Station 17Çivril1782538.260329.7120824
Station 18Eşme1782738.396128.9906809
Station 19Sivaslı1807138.498629.6953990
Station 20Hocalar1829238.581929.96971130
Station 21Sandıklı1800238.455830.23131040
Station 22Uşak1718838.671229.4040919
Station 23Banaz1806938.752529.7436987
Table 2. The Data sets used in this study and their characteristics.
Table 2. The Data sets used in this study and their characteristics.
DataSpatial
Resolution
Number of GridInstitution
HadGEM3-GC31-LL1.875° × 1.25°5Met Office Hadley Centre
ERA50.25° × 0.25°53European Centre for Medium Range Weather Forecasts.
Table 3. The common predictor variables selected from the ERA5 re-analysis and the GCM data set.
Table 3. The common predictor variables selected from the ERA5 re-analysis and the GCM data set.
NumberPredictorsDescriptionUnit
1prMonthly total precipitationmm
2tasNear surface air temperature°C
3psSurface air pressurehPa
4zg200200 hPa geopotential heightm
5hur200200 hPa relative humidity%
6ta200200 hPa air temperature°C
7zg500500 hPa geopotential heightm
8hur500500 hPa relative humidity%
9ta500500 hPa air temperature°C
10zg850850 hPa geopotential heightm
11hur850850 hPa relative humidity%
12ta850850 hPa air temperature°C
Table 4. Results of the multistage predictor selection procedure for Banaz station.
Table 4. Results of the multistage predictor selection procedure for Banaz station.
Predictor Numberprtaspszg200hur200ta200zg500hur500ta500zg850hur850ta850R2AdjRMSECpBICScore
1 0.73419.3838.5724870.00
2 0.73919.0431.1924840.35
3 0.75018.7014.2524720.69
4 0.75818.310.6024620.99
5 0.75918.350.9724670.95
6 0.75818.373.2024730.88
7 0.75818.363.9924780.83
8 0.75818.395.4124830.76
9 0.75718.387.7924900.67
10 0.75718.479.4424950.55
11 0.75718.599.0024990.45
120.75718.7311.0025050.00
Table 5. Results of performances of RF models trained for the observation stations.
Table 5. Results of performances of RF models trained for the observation stations.
NumberStation
Name
Test
RMSE (mm)
RSRNSEAdj R2PBIAS
(%)
OOB
RMSE
(mm)
Station 1Yatağan26.940.450.800.80−4.7826.83
Station 2Çine21.410.520.730.74−11.4223.73
Station 3Kavaklıdere25.250.460.790.79−3.7024.22
Station 4Kale34.700.620.610.68−21.9926.02
Station 5Tavas14.800.470.780.79−6.8517.55
Station 6Söke37.480.450.800.800.2029.48
Station 7Aydın30.490.520.730.72−3.1524.71
Station 8Sultanhisar25.380.430.810.81−7.1123.05
Station 9Karacasu27.550.520.720.73−5.2626.97
Station 10Denizli23.410.490.760.79−10.9422.69
Station 11Nazilli26.740.520.730.73−0.5325.22
Station 12Kuyucak22.030.510.740.74−4.9818.04
Station 13Sarayköy13.300.460.790.79−4.5916.76
Station 14Çal23.210.620.620.62−10.1616.75
Station 15Dinar17.860.570.670.68−11.3816.51
Station 16Güney19.950.520.730.730.8418.22
Station 17Çivril18.090.580.670.67−4.1216.66
Station 18Eşme19.370.590.650.66−10.7316.26
Station 19Sivaslı17.320.520.730.74−7.6618.42
Station 20Hocalar18.540.740.450.502.9515.60
Station 21Sandıklı16.650.710.490.495.2914.95
Station 22Uşak20.520.530.710.72−9.2216.78
Station 23Banaz18.920.510.740.75−9.9720.62
Table 6. Estimated annual total precipitation and changes at stations under the SSP2-4.5 scenario.
Table 6. Estimated annual total precipitation and changes at stations under the SSP2-4.5 scenario.
NumberStation
Name
Observed
(mm)
Near
Future
(mm)
Middle
Future
(mm)
Far
Future
(mm)
Near
Change
(mm)
Middle
Change
(mm)
Far
Change
(mm)
Near
Change
(%)
Middle
Change
(%)
Far
Change
(%)
Station 1Yatağan661.5647.9647.6611.4−13.7−13.9−50.2−2.1−2.1−7.6
Station 2Çine547.3530.1537.0501.7−17.2−10.4−45.6−3.15−1.89−8.33
Station 3Kavaklıdere729.0693.7672.5647.9−35.4−56.6−81.1−4.8−7.8−11.1
Station 4Kale767.4756.0737.5702.9−11.4−29.8−64.5−1.5−3.9−8.4
Station 5Tavas495.4499.5492.3460.94.1−3.1−34.60.8−0.6−7.0
Station 6Söke834.6855.8847.8796.621.113.2−38.12.51.6−4.6
Station 7Aydın623.7613.8614.9581.2−10.0−8.8−42.6−1.6−1.4−6.8
Station 8Sultanhisar593.2577.0581.1555.0−16.2−12.1−38.2−2.7−2.0−6.4
Station 9Karacasu605.0589.3578.3575.2−15.7−26.7−29.8−2.59−4.42−4.92
Station 10Denizli562.9555.7539.7526.2−7.1−23.2−36.6−1.27−4.12−6.51
Station 11Nazilli562.0532.3518.9517.3−29.7−43.2−44.7−5.3−7.7−8.0
Station 12Kuyucak481.4459.9458.4444.1−21.5−23.0−37.3−4.5−4.8−7.8
Station 13Sarayköy372.9356.8346.1346.2−16.2−26.8−26.8−4.3−7.2−7.2
Station 14Çal478.7467.5456.4467.2−11.2−22.3−11.5−2.3−4.7−2.4
Station 15Dinar443.1433.1431.9432.1−10.0−11.2−11.0−2.3−2.5−2.5
Station 16Güney502.4492.2471.8483.2−10.2−30.7−19.3−2.0−6.1−3.8
Station 17Çivril437.8422.9420.6408.5−14.9−17.1−29.2−3.4−3.9−6.7
Station 18Eşme456.4453.1445.6444.4−3.3−10.8−12.1−0.7−2.4−2.6
Station 19Sivaslı494.5473.5473.9468.6−21.0−20.6−25.9−4.2−4.2−5.2
Station 20Hocalar433.0433.1412.1397.60.0−20.9−35.40.0−4.8−8.2
Station 21Sandıklı406.5408.6398.5381.92.1−8.0−24.60.5−2.0−6.1
Station 22Uşak525.9507.7505.7506.2−18.2−20.2−19.7−3.5−3.8−3.7
Station 23Banaz526.9502.7501.9510.8−24.2−25.0−16.1−4.6−4.7−3.1
Table 7. Annual total precipitation estimates and changes at stations under the SSP5-8.5 scenario.
Table 7. Annual total precipitation estimates and changes at stations under the SSP5-8.5 scenario.
NumberStation
Name
Observed
(mm)
Near
Future
(mm)
Middle
Future
(mm)
Far
Future
(mm)
Near
Change
(mm)
Middle
Change
(mm)
Far
Change
(mm)
Near
Change
(%)
Middle
Change
(%)
Far
Change
(%)
Station 1Yatağan661.5632.8567.0549.5−28.8−94.6−112.0−4.4−14.3−16.9
Station 2Çine547.3519.7487.4470.9−27.6−59.9−76.4−5.05−10.95−13.96
Station 3Kavaklıdere729.0712.7634.8615.8−16.4−94.3−113.2−2.2−12.9−15.5
Station 4Kale767.4748.9701.4681.1−18.5−66.0−86.3−2.4−8.6−11.2
Station 5Tavas495.4486.9468.3432.4−8.5−27.1−63.0−1.7−5.5−12.7
Station 6Söke834.6813.4803.2710.6−21.3−31.4−124.0−2.6−3.8−14.9
Station 7Aydın623.7589.1540.0510.7−34.6−83.7−113.0−5.6−13.4−18.1
Station 8Sultanhisar593.2559.9518.6490.8−33.4−74.7−102.4−5.6−12.6−17.3
Station 9Karacasu605.0596.0536.6505.3−9.0−68.4−99.7−1.49−11.30−16.48
Station 10Denizli562.9560.6506.8474.5−2.2−56.1−88.4−0.39−9.96−15.71
Station 11Nazilli562.0514.2486.2450.1−47.8−75.9−111.9−8.5−13.5−19.9
Station 12Kuyucak481.4441.9419.8396.3−39.6−61.6−85.1−8.2−12.8−17.7
Station 13Sarayköy372.9345.2322.9302.5−27.7−50.1−70.5−7.4−13.4−18.9
Station 14Çal478.7446.2441.0377.5−32.5−37.8−101.2−6.8−7.9−21.1
Station 15Dinar443.1428.6417.1395.0−14.5−26.0−48.1−3.3−5.9−10.9
Station 16Güney502.4463.1456.2395.3−39.3−46.3−107.2−7.8−9.2−21.3
Station 17Çivril437.8412.5397.1368.1−25.3−40.7−69.6−5.8−9.3−15.9
Station 18Eşme456.4425.7422.9386.9−30.7−33.5−69.6−6.7−7.3−15.2
Station 19Sivaslı494.5461.4444.5414.9−33.1−50.0−79.6−6.7−10.1−16.1
Station 20Hocalar433.0414.0397.8362.0−19.1−35.3−71.0−4.4−8.1−16.4
Station 21Sandıklı406.5402.4379.2354.6−4.1−27.3−51.9−1.0−6.7−12.8
Station 22Uşak525.9489.3483.9455.1−36.6−42.0−70.8−7.0−8.0−13.5
Station 23Banaz526.9497.3486.2444.2−29.6−40.7−82.8−5.6−7.7−15.7
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ara, I.; Yasar, M.; Gurarslan, G. Assessing Climate Change Impacts on Future Precipitation Using Random Forest Statistical Downscaling of CMIP6 HadGEM3 Projections in the Büyük Menderes Basin. Water 2026, 18, 277. https://doi.org/10.3390/w18020277

AMA Style

Ara I, Yasar M, Gurarslan G. Assessing Climate Change Impacts on Future Precipitation Using Random Forest Statistical Downscaling of CMIP6 HadGEM3 Projections in the Büyük Menderes Basin. Water. 2026; 18(2):277. https://doi.org/10.3390/w18020277

Chicago/Turabian Style

Ara, Ismail, Mutlu Yasar, and Gurhan Gurarslan. 2026. "Assessing Climate Change Impacts on Future Precipitation Using Random Forest Statistical Downscaling of CMIP6 HadGEM3 Projections in the Büyük Menderes Basin" Water 18, no. 2: 277. https://doi.org/10.3390/w18020277

APA Style

Ara, I., Yasar, M., & Gurarslan, G. (2026). Assessing Climate Change Impacts on Future Precipitation Using Random Forest Statistical Downscaling of CMIP6 HadGEM3 Projections in the Büyük Menderes Basin. Water, 18(2), 277. https://doi.org/10.3390/w18020277

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop