1. Introduction
The Intergovernmental Panel on Climate Change (IPCC) has observed that climate variable changes associated with global warming are affecting regional or catchment scale hydrological processes [
1]. Changes in precipitation and temperature are anticipated as direct driving factors because they are the main factors that influence regional hydrological processes [
2]. The Regional Climate Models (RCMs) provide a new opportunity for climate change effects analysis since they have a higher spatial resolution and more reliable results on a regional scale compared to General Circulation Models (GCMs) [
3,
4,
5]. Numerous studies have shown that RCM outputs improve the representation of climate change information at the mesoscale by providing spatially and physically coherent outputs with observations [
4,
5]. However, the original RCM outputs still contain considerable bias, which is inherited from the forcing of GCMs or produced by systematic model error [
6,
7]. Furthermore, such biases may be amplified when climate change effects are included, such as in hydrological effect studies [
4]. Therefore, bias correction of RCM data is the prerequisite step to the data being used in any climate change effects analysis.
Many bias correction methods, ranging from simple scaling techniques to the rather more sophisticated distribution mapping techniques, have been developed to correct biased RCM outputs [
8]. The scaling approach mainly includes linear or nonlinear approaches that adjust the climatic factors based on the differences between observed and RCM means in a linear or nonlinear formula, such as the linear scaling method (LS) [
9,
10] and the power transpiration method (PT) [
11]. Distribution mapping, involving distribution-based and distribution-free quantile mapping methods, matches the statistical distribution of RCM-simulated climatic factors to the distribution of observations. Distribution-based quantile mapping is based on the assumption that climatic factors obey a certain distribution, such as Gamma and Gaussian distributions [
4,
12], while the distribution-free quantile mapping technique employs the empirical distribution [
13]. Selecting a suitable bias correction method is important for providing reliable inputs for impact analysis of a region.
Teutschbein and Seibert [
8] applied six ensemble statistical and bias correction methods to correct precipitation and temperature data from eleven different RCM outputs in five typical catchments in Sweden. The results suggest that most methods were able to correct daily mean values to a certain extent, while only higher-skill approaches, such as distribution mapping and power transformation methods, performed well in hydrological extremes. Chen, Brissette, Chaumont and Braun [
4] investigated the performance and uncertainty of two change factor and four bias correction methods in quantifying the climate change effects over Manicouagan 5 and Chickasawhay basins. It indicated that the uncertainties that result from RCM simulations are slightly greater than those from different bias correction methods. Chen, Brissette, Chaumont and Braun [
13] compared the performances of six bias correction methods in four RCM-simulated precipitation events over ten North American river basins. The results demonstrate that all bias correction methods are capable of reducing bias in raw RCM, while performances of hydrological modelling are highly dependent on the locations of the catchments and the choice of bias correction methods. Setting these recent studies aside, contributions that compare and evaluate different kinds of bias correction methods in hydrological modelling are still rare in the literature, in particular over arid mountainous areas. Moreover, only a few have provided the best combination of bias correction methods for precipitation and temperature over a specific region.
Xinjiang Uygur Autonomous Region, which is an arid and semi-arid region and located in Central Asia, is extremely vulnerable to climate change effects, since most water sources originate from the upper mountainous regions [
14,
15]. It is necessary to select suitable bias correction methods for providing credible inputs to estimate climate change effects over the region. Therefore, the Kaidu River Basin, one of the typical mountainous catchments in Xinjiang, was selected as a case study. The objective of the present study is to investigate the performance of available bias correction methods for downscaling climatic outputs of RCM and provide the best combination of bias correction methods for precipitation and temperature over an arid mountainous region. This paper compares existing bias correction methods of climate and hydrological projections. Seven precipitation correction methods and five temperature correction methods cover most of the existing bias correction methods that are included with the intention of correcting deviations in this study. The influence of bias correction methods on hydrological simulations is studied by comparing different discharge outputs. This is achieved by using SWAT hydrological modelling with original RCM data and all possible combinations of corrected precipitation and temperature data.
4. Results
4.1. Performance of RCM Outputs in Reproducing Discharges
In order to evaluate the capability of original RCM data in discharge modelling, precipitation and maximum/minimum temperature from RCM outputs are used to directly force the SWAT model. The SWAT model is calibrated and specially forced by the original RCM simulations against the observed discharges. If the original RCM output simulated discharges closely align with the observed discharges of reasonable model parameters, RCM outputs are considered to be small biases that can be overcome through model calibration. Under this circumstance, bias correction approaches will not be required. If this is not the case, original RCM outputs are seriously biased and not suitable for hydrological modelling.
The mean daily cycle results reveal that the simulation run by original RCM precipitation and temperature has obvious bias and great overestimation for discharges (
Figure 3), particularly in the wet seasons (April–September). This is to be expected because the performance of the RCM data is highly dependent upon the season. When the observed discharges are included, the original RCM-simulated discharges become less coherent. The NSE coefficient is −10.13 over the length of the relevant period, and this indicates that the original RCM outputs are not capable of being used for calibration and the bias cannot be corrected by hydrological calibration.
4.2. Validation of Original Precipitation and Temperature
The validation of the original RCM-simulated precipitation and temperature is contingent on the quality of the probabilities distributions and average daily cycle. The subsequent discussion focuses upon the performance of daily average temperature because similar results are obtained from daily maximum and minimum temperatures. The performance of the RCM outputs is also dependent upon the specific region. The distribution of RCM-simulated precipitation at the BYBLK station is less coherent than observations obtained from the BLT station (
Figure 4).
At the BYBLK station, overestimations are observed in RCM-simulated precipitation, with probabilities that fall below 0.78. When the focus is on the temporal distribution (
Figure 5), overestimated periods are mainly concentrated in the wet season. But the biases cannot be attributed to systematical error because the conditions are very different at the BLT station. RCM simulations underestimate precipitation, with probabilities below 0.04 and, when applied at the BLT station, overestimate probabilities between 0.04 and 0.80. The underestimations mainly occur during the June-July period and overestimations are concentrated in the November-December period. It is worthwhile to note that, in comparison to the dry season, precipitation is consistently less accurate in the wet season. This can possibly be attributed to the RCMs evidencing diminished capacity in the simulation of convective precipitation [
40]. The RCM-simulated temperature outputs obtained at the BYBLK station are more reliable than at the BLT station (
Figure 6). The probabilities distribution of temperature at BYBLK is more concentrated and there are obvious overestimations of low temperatures that fall beneath −19.1 °C (probabilities above 0.8). High temperatures fall in the slightly overestimated range between 18.8 °C and 7.1 °C (probabilities below 0.28). Overestimations are mainly concentrated in the Winter and Summer seasons, while underestimations are clearly evidenced in Spring once temporal variation is taken into account (
Figure 7). At the BLT station, the temperature is more decentralized and serious underestimations range between −21.9 °C and 13 °C (probabilities 0.37-1) during the November–May period. Slight overestimations fall between 13 °C to 27.1 °C (probabilities below 0.37) during the June–September period.
4.3. Validation of Corrected Precipitation and Temperature
All bias correction methods are capable of effectively improving the original RCM simulations to a certain degree. The LS and LOCI methods underestimate heavy precipitation, with probabilities falling below 0.08 and 0.06 at the BYBLK station, respectively. This is because the LS and LOCI are mean-based methods that adjust different precipitation levels by using a unique scaling factor during a specific month. Extreme precipitation is not specifically considered by the two methods. Meanwhile, overestimations are included, with probabilities falling between 0.08 and 0.33 in the LS method and 0.06 to 0.31 in the LOCI method. But the LOCI method is obviously superior when dry days (<0.1 mm) are considered. The LS method is limited in reproducing dry days and consistently overestimates probabilities that range from 0.33 to 0.79.
The DT method is able to competently adjust the higher precipitation in probabilities that fall below 0.06, but it does not take into account drizzle days corrections, and this results in consistent overestimations of probabilities between 0.31 and 0.79. It is fortunate that this limitation is fully overridden by the DBC method, which uses observations to correct the original RCM-simulated precipitation distribution to a completely uniform configuration. The DM, EQM, and PT methods reproduce precipitation quite well, with the partial exception of the PT method, which slightly overestimates the original precipitation within a probabilities range of 0.09–0.30. These corrected methods present similar capacities, both at the BLT and BYBLK station. But it is worth noticing that several outliers (extreme values) are evidenced within the DM and PT (the reason for this will be discussed at a later stage of this paper).
The performance of the corrected temperature confirms that all the temperature correction methods improve the original RCM simulations. The DM, DT, EQM, and VARI methods correct the original temperature and produce exceedance probabilities that closely resemble observations taken from the two stations. But the LS method, in particular when it is applied at the BLT station, is incapable of improving the RCM quality to the same extent as the other four methods. The poor performance of the original RCM-simulated temperature at the BLT station is still not fully overcome by the LS method, and this indicates the RCM-model dependence of the bias correction methods.
The calculated performance metrics (
Figure 8) suggest that a bias in average precipitation is no longer evidenced in the LS method, whereas original RCM simulations respectively show a bias of 1.4 mm and −0.01 mm at the BYBLK and BLT stations. But the LS method still presents large biases in other indices. The DT method is capable of reproducing the 95th percentile upon the basis of the 100-quantile corrected processes, but it does not present great improvement in other metrics and its complete oversight of drizzle days means that it even extends the error in mean. Biases in mean precipitation, probability of wet days, and wet-day intensity are not found in the LOCI method, while biases in the coefficient of variation, 95th percentiles, and standard deviation still exist. The other four correction methods evidence a level of performance that exceeds each of these three methods. No bias in these metrics can be identified in the DBC and EQM methods, while their DM and PT counterparts only evidence slight biases in some metrics.
The bias in average temperature is no longer found in all correction methods at the two stations, whereas the original RCM temperature respectively presented biases of −0.03 °C and 10.14 °C at the BYBLK and BLT stations (
Figure 9). The VARI method is the only one of the five temperature correction methods that specifically considers the correction of variance. In most cases, the correction in mean values does affect the variance, and this leads to variations in the DM, DT, and EQM methods, thereby lending coherence to the variance of observations and VARI-corrected temperature. The LS method also reduces the bias in variance, although it is unable to correct extreme temperatures (5/95th percentiles). The performance of the other four correction methods is broadly comparable and does not evidence biases in extreme temperatures.
The time-series-based metrics of these corrected precipitation and temperature data, along with original RCM simulations, are also analyzed in this paper (
Figure 6 and
Figure 7;
Table 3). For presentational reasons,
Table 3 only presents the results of the BYBLK station. The performance of RCM-simulated precipitation is very biased, as an MAE of 1.65 mm, NSE of −5.71, PBIAS of 224.2%, and R
2 of 0.59 attests (daily scale). Comparison of these time-series-based metrics suggests that all methods are capable of improving the original RCM simulations to different levels. The DT method still overestimates precipitation, with PBIAS and MAE respectively rising to 49.50% and 0.47 mm. Other corrected precipitation series suggest that the MAE, PBIAS, NSE, and R
2 respectively range between 0.27 mm/0.37 mm, −0.2%/4.2%, 0.46/0.72, and 0.56/0.72. In time-series-based metrics, the performance of mean-based methods (LS and LOCI) in precipitation is slightly better than quantile-based methods. Temperature exhibits a much better performance than precipitation at the BYBLK station, which slightly overestimates the observations with an MAE of 1.99 °C and PBIAS of 0.7%. All temperature bias correction methods improve the performance of RCM-simulated temperature and present PBIAS almost equal to zero.
4.4. The Performance of Bias Correction Methods for Hydrological Modelling
In order to evaluate the capacity of the corrected precipitation and temperature data in discharge simulations, 35 possible combinations of corrected precipitation and temperature data are applied to drive the SWAT model. The exceedance probabilities of simulated discharges are grouped in accordance with different temperature-corrected methods (
Figure 10). The simulated discharges that use observed precipitation and temperature data, as opposed to observed discharges, are used as the reference in order to prevent the model exerting undue influence. Great uncertainty corresponds to different precipitation methods, and this is mainly attributable to the poor performance of DT and LS methods. Discharges simulated by the corrected precipitation with the LS and DT methods significantly differ with the reference. In particular, simulated discharges driven by corrected precipitation with the DT method are even worse than the discharges using original RCM-simulated precipitation.
The DBC performs slightly better than the LOCI method when projecting the exceedance probabilities of discharges. The DM, EQM, and PT methods demonstrate a consistently good performance when reproducing the daily discharges, and there are no obvious differences between the three methods in this regard. Differences will be evaluated by applying statistical metrics in the following sections. In a similar manner to its immediate predecessor,
Figure 11 presents the exceedance probability of simulated discharges, which are grouped in accordance with different precipitation correction methods. No substantial difference is found between the different temperature correction methods in hydrological modelling and a relatively small variability range is identified. This indicates that the temperature correction methods, in comparison to precipitation correction methods, provide a higher level of certainty in the reproduction of discharges. But the performances of these temperature-corrected methods are also slightly different when combined with different precipitation-corrected methods. When grouped with LOCI, DBC, PT, DM, and EQM-corrected methods for precipitation, these five temperature correction methods present quite similar forms, although the DM method evidences more differences than its other four counterparts when combined with the DT and LS methods—this may conceivably be attributed to the interaction included in the different method combinations.
The cumulative runoff from 1986 to 2004 simulated by the 35 combinations is also presented in
Figure 12 for better understanding the capacities of these bias correction methods in simulating interannual variabilities of discharge. The precipitation corrected by LS and DT methods is not able to catch the cumulative runoff well. The positive deviation is getting larger and larger from 1986 to 2004. The bias correction methods without wet-day frequency correction are unable to describe the interannual variabilities of discharge as well. The precipitation corrected by DM and EQM methods exhibited an obvious advantage when compared to precipitation corrected by LOCI, DBC, and PT methods, especially combined with temperature corrected by VARI and EQM. It indicates that these two methods are able to simulate the interannual variabilities of runoff effectively. The bias correction methods of precipitation also exhibited relative greater uncertainties in interannual variabilities of runoff when compared with temperature correction methods.
The SWAT-simulated discharge characteristics, including daily flood peaks, daily low flow, mean, and 25th quantile and 75th quantile discharges during the relevant period, are also considered (
Figure 13). Simulations forced by corrected precipitation with the DT and LS methods generally seriously overestimate all these statistical indices, and this indicates that the precipitation correction methods without wet-day frequency correction are restricted to discharge simulations. Simulations conducted through corrected precipitation with the other five methods come very close to being referenced, especially when combined with temperature corrected by the DT, EQM, and VARI methods. In particular, the DM method for both precipitation and temperature corrections presents a slightly better performance in daily extreme flow simulations.
Table 4 summarizes the time-series-based metrics of simulated discharges forced by different combinations of corrected precipitation and temperature and original RCM outputs. With the exception of simulations driven by corrected precipitation with the DT method, all 35 simulations improved the statistical metrics. The simulations forced by corrected precipitation with the LS method are also very biased, with PBIAS ranging between 45.2% and 55.3%. The simulations that use corrected precipitation with the DBC, DM, EQM, LOCI, and PT methods more closely resemble simulations forced by observations with considerably reduced MAE and PBIAS. The corrected precipitation with the DM and EQM methods presents an equally excellent performance, while corrected temperature by the EQM and VARI methods performs best in the reproduction of time-series-based metrics. In the reproduction of discharges, the best combinations are the DM-corrected precipitation and VARI-corrected temperature.
5. Discussion
Several bias correction methods have been proposed to downscale RCM outputs as a prerequisite for the analysis of climate change. These methods range from simple linear scaling techniques to rather more sophisticated distribution mapping techniques. There is a clear and ongoing need to compare and evaluate their performance.
Numerous studies prove that the original RCM outputs, and in particular RCM-simulated precipitation [
41,
42,
43], are always biased. Their direct use in climate change effects is inadvisable because they might lead to misleading results. The validating results of the original RCM simulations in this study indicate that the performance of RCM outputs varies across region. They fail in hydrological modelling because of their poor performance in the Kaidu River Basin and model calibration could not even overcome their biases. What is worth noting is that precipitation presents a consistently worse accuracy in the wet season than the dry season. The possible reason for this is that the RCMs have a diminished capacity in simulating convective precipitation [
40].
All methods can improve the original RCM simulations at different levels. The LS method is the simplest bias correction method that corrects the climate data upon the basis of the difference between RCM simulations and observations. While it can adjust mean values, it cannot be used in the analysis of extreme events because the unique scaling factor in a specific month often leads to heavy precipitation being greatly underestimated [
44]. The LOCI method is the extension of the LS method that effectively corrects the precipitation frequency. No bias can be found in wet-day frequency or intensity. When compared to the LS method, heavy precipitation is also partly corrected, and this is attributable to the correction in wet-day frequency. The DBC and DT methods are both based on 100-quantile distributions and their major point of divergence originates within the correction of precipitation occurrence [
8]. The extreme events are perfectly presented within these two methods, and this feature clearly distinguishes them from the LOCI and LS methods.
However, the DT method greatly deviates the actual mean precipitation and produces large biases in discharge simulations. The EQM method is based on a point-wise empirical distribution and it takes different precipitation levels into account on an individual basis. In this study, the DBC and EQM methods perform the best in precipitation projection—this is because no bias is included in their frequency-based metrics. The nonlinear DM (based on Gamma distribution) and PT methods still evidence tiny biases in some frequency-based metrics. Meanwhile, the DM method should be used with caution because it originates within the assumption that RCM-simulated and observed time series approximate the distribution [
45]. Its use is oppositional unless the time series obeys a theoretical distribution. As has already been noted, several extreme events are found in the corrected time series when the DM and PT methods are used and these new extremes arise when the reference period is not stable [
4]. Therefore, a long and relatively stable period would be preferable as it would avoid extreme error to the greatest possible extent.
When time-series-based metrics are included, linear-based LS and LOCI methods perform best, with the nonlinear-based PT method following closely behind. Distribution-based methods evidence a poorer level of performance than other methods. A study by Fang, Yang, Chen and Zammit [
29] reached a similar conclusion—these bias correction methods are grounded within temporal structure correction [
13,
30], while linear approaches perform slightly better than the other approaches. In the case of precipitation correction, the LS method demonstrates the poorest performance in frequency-based and time-series-based metrics and is clearly distinguished from the other four methods when applied to temperature correcting. This is because the time structure problem does not exist in temperature series. The nonlinear VARI method and distribution-based DM, DT, and EQM methods compare fairly well, and present a similar performance of PBIAS equal to zero.
When the corrected precipitation and temperature are transferred to discharges by hydrological modelling, great improvements are evidenced that compare favorably to the results forced by original RCM outputs (corrected precipitation with the DT method being the one exception in this respect). Different precipitation-corrected methods present larger uncertainty when compared against various temperature-corrected methods—this is attributable to the poor performance of these methods in the absence of wet-day frequency correction. The exceedance probabilities of simulated discharges driven by corrected precipitation with the DT method greatly deviate the references, to an extent that even exceeds the simulated discharges driven by the original RCM outputs. The results can be traced back to two sources. In the first instance, simulated discharges driven by the original RCM outputs were calibrated specifically and the biases were partly overcome as a consequence.
Chen, Brissette, Chaumont and Braun [
13] arrive at a similar conclusion. The model may also be sensitive to a bias in the DT method’s corrected precipitation that has been further magnified by hydrological modelling. The corrected precipitation in the LS method for precipitation was also not acceptable for the hydrological modelling because it includes large biases. This further reiterates that the drizzle effects were not neglected and that the methods without wet-day frequency correction, such as those evidenced in the DT and LS methods, are not acceptable in discharge simulations in our study area.
These results clearly diverge from a previous study by Chen, Brissette, Chaumont and Braun [
4], which found that the drizzle effects are not significant in North America. It indicates that the DT method is highly dependent on the RCM and region. It performs better in the more humid North America than in arid Xinjiang, which has less precipitation occurrences. The other five methods are all capable of reproducing discharges to a reasonable extent. Discharges simulated by corrected precipitation with the DBC method perform slightly better when compared against discharges simulated by corrected precipitation with the LOCI method. Although correction based on 100 quantiles does affect hydrological modelling, the effects are not comparable to those evidenced for wet-day frequency correction. DM and EQM methods perform better against other methods when applied to corrected precipitation; the same applies to EQM and VARI methods in relation to corrected temperature. The performance of these correction methods in relation to discharge simulations is not completely consistent with those for climate projection; nonetheless, this is only a small difference, which attests to the robust performance of these methods.
The Kaidu River Basin is a snow-dominated basin where melting water accounts for 58.6% of the total discharges. Discharges are therefore less sensitive to the temporal structure of precipitation occurrences. Chen, Brissette, Chaumont and Braun [
13] also suggest that bias correction methods perform more poorly in precipitation-dominated regions than in the snow-dominated regions. The weak performance of the time structure of corrected precipitation may therefore be insufficiently recognized by this study. Lafon, Dadson, Buys and Prudhomme [
42] also suggest that corrected precipitation is more sensitive to the selection of a particular time period. Bias correction methods should therefore be used with caution when the studies are strongly dependent on the temporal structure of precipitation, although it should be recognized that a good temporal structure of the original RCM simulations provides considerable assurance in this respect.
6. Conclusions
This paper compares the performances of (seven precipitation and five temperature bias) RCM correction methods and their hydrological applications in China’s Kaidu River Basin. The abilities of these corrected methods are evaluated by reproducing discharges, precipitation, and temperature through the SWAT hydrological model. Several conclusions can be extracted afterwards.
Original RCM outputs are very biased, and this precludes their direct use in the analysis of climate change effects. The representation of the RCM simulations is highly dependent upon the region and season. All bias correction methods have the potential to improve the performance of reproducing precipitation and temperature, although the bias correction method great influences their final results.
The performance of different precipitation-corrected methods presented greater differences when compared against various temperature-corrected methods. These differences mainly resulted from the poor performance of the corrected methods (DT and LS methods) in the absence of precipitation occurrence correction. The distribution-based DBC and EQM methods performed best in reproducing precipitation, while all temperature-corrected methods, with the exception of the LS method, performed extremely well.
The correction in wet-day frequency is extremely important for hydrological projection of the Kaidu River Basin. The DT and LS methods, which lack wet-day frequency correction, are not suitable for being applied in discharge simulations. Biases in corrected precipitation are likely amplified when transferred to discharges. The distribution-based DM and EQM methods for precipitation performed equally well in discharge simulations. There were no substantial differences in the various temperature-corrected methods in discharge simulations, while the LS method clearly provided the poorest performance when compared against the other methods. The EQM and VARI methods for temperature did the best job in discharge simulation. The DM method for both precipitation and temperature evidenced a narrow superiority in extreme-flow projection.
In general, this paper emphasizes the importance of using several bias correction methods for crosscheck in climate and hydrological response analysis. Even the performance of bias correction methods is dependent upon the RCM model and region. The results set out in this paper are of wider significance and the procedures can accordingly be applied to other regions.