Evaluation of Grid-Based Rainfall Products and Water Balances over the Mekong River Basin

Gridded precipitation products (GPPs) with wide spatial coverage and easy accessibility are well recognized as a supplement to ground-based observations for various hydrological applications. The error properties of satellite rainfall products vary as a function of rainfall intensity, climate region, altitude, and land surface conditions—all factors that must be addressed prior to any application. Therefore, this study aims to evaluate four commonly used GPPs: the Climate Prediction Center (CPC) Unified Gauge-Based Analysis of Global Daily Precipitation, the Climate Prediction Center Morphing (CMORPH) technique, the Tropical Rainfall Measuring Mission (TRMM) 3B42, and the Global Satellite Mapping of Precipitation (GSMaP), using data collected in the period 1998–2006 at different spatial and temporal scales. Furthermore, this study investigates the hydrological performance of these products against the 175 rain gauges placed across the whole Mekong River Basin (MRB) using a set of statistical indicators, along with the Soil and Water Assessment Tool (SWAT) model. The results from the analysis indicate that TRMM has the best performance at the annual, seasonal, and monthly scales, but at the daily scale, CPC and GSMaP are revealed to be the more accurate option for the Upper MRB. The hydrological evaluation results at the daily scale further suggest that the TRMM is the more accurate option for hydrological performance in the Lower MRB, and CPC shows the best performance in the Upper MRB. Our study is the first attempt to use distinct suggested GPPs for each individual sub-region to evaluate the water balance components in order to provide better references for the assessment and management of basin water resources in data-scarce regions, suggesting strong capabilities for utilizing publicly available GPPs in hydrological applications.


Introduction
It is well recognized in the literature that precipitation is one of the key factors in hydrological application practices [1][2][3] Existing precipitation products generally include gauge observations, estimates inferred from satellite imagery, and outputs from various numerical models [4]. The strengths and weaknesses of different kinds of precipitation data vary greatly. While the data quality of in The Mekong River is the largest river in South East Asia, flowing 4909 km with a total basin area of 795,000 km 2 and an average discharge of 14,500 m 3 /s, distributed unevenly across six countries: China (21%), Myanmar (3%), Lao People's Democratic Republic (Lao PDR) (25%), Thailand (23%), Cambodia (20%), and Vietnam (8%) [32]. The MRB ranges from temperate to tropical monsoon climate. The Upper Basin in China (known as Lancang) is glaciated, starting in the Tibetan Plateau ( Figure 1). The Lower Mekong Basin starts from Yunnan province downstream (China) and flows through the Golden Triangle tripoint to the South China Sea (known as the Lower Mekong Basin), and is classified as tropical monsoon [32]. The watershed has a complex topography, with the elevations in the basin ranging from above 6000 m in the Tibetan Plateau to less than a meter (0.3-0.7 m) above sea level in the downstream river delta, and with deep-cut valleys in the high mountains [32,33]. Overall, acrisols are the most common type of soil in the MRB, comprising around 65% of the total land area. Acrisols are common to humid tropical climates, and are a weathered-type, clay-rich, and low-nutrient soil, posing limitations for agriculture and thus being commonly forested. Lithosols are the second most common type of soil in the MRB, found in the upper, steep mountainous areas and common to highly erodible grassland [32,33].
Land use can be broadly divided into three major components: agriculture (32%), forested land (39%), and grassland (9%). The majority of the MRB is covered by forest, mainly located in Lao PDR and Cambodia. Grassland dominates the upper basin and agricultural land dominates the low-lying plains of the Chi-Mun Basin in northeast Thailand, the Vientiane plain in Lao PDR, the Tonle Sap Basin in Cambodia, and the delta in southern Vietnam ( Figure 1) [32,33].
As can be seen from the spatial contrasts in rainfall, much of the monsoonal rain is captured in the Northern and Eastern Highlands, creating dry conditions across the Korat Plateau. The mean annual rainfall is 1300 mm, more than 70% of which falls in the summer season. The dry season lasts from December to May, and evapotranspiration is high during this time. The river flow has a distinct seasonal pattern, with high flows from June to November that account for 80%-90% of the total annual flow [34]. The annual flood season is especially important in the Lower Mekong Basin, where it has shaped the environment and its inhabitants. Overall, acrisols are the most common type of soil in the MRB, comprising around 65% of the total land area. Acrisols are common to humid tropical climates, and are a weathered-type, clay-rich, and low-nutrient soil, posing limitations for agriculture and thus being commonly forested. Lithosols are the second most common type of soil in the MRB, found in the upper, steep mountainous areas and common to highly erodible grassland [32,33].

Materials
Land use can be broadly divided into three major components: agriculture (32%), forested land (39%), and grassland (9%). The majority of the MRB is covered by forest, mainly located in Lao PDR and Cambodia. Grassland dominates the upper basin and agricultural land dominates the low-lying plains of the Chi-Mun Basin in northeast Thailand, the Vientiane plain in Lao PDR, the Tonle Sap Basin in Cambodia, and the delta in southern Vietnam ( Figure 1) [32,33].
As can be seen from the spatial contrasts in rainfall, much of the monsoonal rain is captured in the Northern and Eastern Highlands, creating dry conditions across the Korat Plateau. The mean annual rainfall is 1300 mm, more than 70% of which falls in the summer season. The dry season lasts from December to May, and evapotranspiration is high during this time. The river flow has a distinct seasonal pattern, with high flows from June to November that account for 80%-90% of the total annual flow [34]. The annual flood season is especially important in the Lower Mekong Basin, where it has shaped the environment and its inhabitants.

Rainfall Datasets
The ground rainfall data from 1998 to 2006 used in this study were obtained from the Mekong River Commission (MRC) and the China Meteorological Administration (CMA). Data were collected from a series of 175 rain gauges over the study area (Figure 1a). The GPPs included CPC, TRMM, and CMORPH over the period of 1998-2006, and GSMaP over the period of 2000-2006, which were selected for evaluation in the MRB (Table 1). For global land areas, the CPC data used in the current global/regional gridded datasets were mainly retrieved from the World Meteorological Organization (WMO) Global Telecommunication System (GTS) based on the optimal interpolation (OI) method [35] on a 0.5 × 0.5 • grid. It should be noted that within the MRB, the number of GTS gauges operating over the period of 1998-2006 was relatively low-30 gauges (1998)(1999)(2000)(2001)(2002) and 40 gauges (2002-2006)-compared to 175 rain gauges from the MRC and the CMA. Moreover, these satellite precipitation products (GSMaP, TRMM, and CMORTH) were calibrated with the CPC and the Global Precipitation Climatology Center (GPCC) [36] products to obtain higher accuracy.

Discharge Data
The discharge observations from six gauges along the main stream of the Mekong River provided by the MRC were used for the calibration of the SWAT model and the evaluation of the precipitation products-i.e., Chiange Saen, Luang Prabang, Nong Khai, Mukhdan, Pakse, and Stung Treng. The catchment area and location are given in Table 2 and Figure 1.

Methods
Owing to the two specific objectives of this study, the following two methods were used: (1) Statistical evaluation metrics were employed for evaluating the performance of the GPPs compared with the "actual" precipitation patterns derived from the gauge-based rainfall observations. (2) The SWAT model was used to investigate how accurately the GPPs are able to model hydrologic processes, while statistical metrics (i.e., the Nash-Sutcliffe efficiency coefficient (NSE), percent bias (PBIAS), and coefficient of determination (R 2 )) were used to evaluate the model's performance, followed by the SWAT model to analyze the water balance components in each sub-region of the MRB.

Statistical Evaluation of GPPs against Gauge Observations
In order to analyze the performance of GPPs in capturing precipitation, three types of indices were used: i.
Evaluation of the capability of the GPPs to detect rain and non-rain days, which plays an important role in hydrological applications [39]. Therefore, in this study, three indicators-including probability of detection (POD), critical success index (CSI), and false alarm ratio (FAR)-were employed ( Table 3). POD is typically used to describe the proportions of rainy days that are correctly detected by GPPs to the total observations [40]; CSI reflects the overall proportion of rainfall events that are correctly detected by GPPs; and FAR describes the proportions of rainy days that are not recorded by the rain gauges to the total observations. iii.

Statistical Metric Unit Equation Optimal
Value Note: n, number of samples; O i , observed precipitation (or observed streamflow); P i , the precipitation estimates from the evaluated products (or simulated streamflow); T, the total number of the rainy days that the gridded precipitation products (GPPs) successfully detect rain; M, the number of days that the GPPs fail to detect the observed rain; F, the number of days that the GPPs fail to detect no-rain cases (unsuccessful no-rain detection); σ, standard deviation.

Model Setup
ArcSWAT 2012, interfacing in ArcGIS 10.2, was used to simulate the hydrological properties of the MRB. The sub-basins were delineated using the SWAT watershed analysis module (watershed delineator) based on the DEM 90 × 90 m data and the stream network, resulting in 383 sub-basins in the MRB, with the average area of a sub-basin being around 2000 km 2 . The sub-basins were then further divided into hydrologic response units (HRUs)-consisting of homogeneous land use, management, topographical, and soil characteristics-by the HRU module in SWAT, using the land use, soil characteristics, slope, and stream network of 2005 (Table 1). In this way, we obtained 2850 HRUs for the MRB according to five slope classifications: 0%-2%, 2%-6%, 6%-15%, 15%-25%, and >25%. Then, the basin was divided into five elevation bands (0-500 m, 500-1000 m, 1000-1500 m, 1500-2000 m, and >2000 m) to improve model performance in the runoff simulation [11,39].
The precipitation data needed to be imported into SWAT in the form of point data. It should be noted that the rainfall data of the station closest to the centroid of each sub-basin were used, but the number of sub-basins was much smaller than the gridded precipitation data (383 sub-basins in the MRB compared to 1689 gridded points at 0.25 • × 0.25 • resolution). Therefore, the virtual precipitation station method proposed by Ruan et al. [42] was applied to estimate the precipitation for the sub-basins by taking the average value of all of the grids within the sub-basins from the GPPs. A total of 383 virtual precipitation stations were then created, and their estimated precipitation values were provided as inputs for the SWAT model.

Model Calibration
The parameters and the default range recommended by previous studies [27,[43][44][45] were first selected for sensitivity analysis and calibration in the current study. Then, one-at-a-time (OAT) in SWAT Calibration and Uncertainty Procedures (SWAT-CUP), each parameter was held constant, changing only one parameter at a time, to identify its effect on the model output or objective function. The results were used to fill out more sensitive parameters among the initial selection. The SWAT-CUP software package with the SUFI-2 algorithm was used for calibration, validation, and sensitivity analysis [45], because it is recognized as a robust tool for model calibration and validation [16,42,46]. The values of the parameters were calibrated with four iterations, using 1500 model runs for each iteration, by the SUFI-2 method.
To evaluate model performance, the NSE [47] and the coefficient of determination (R 2 ) were used. Uncertainties were quantified by two measures: P-factor and R-factor. P-factor is the percentage of observation points bracketed by 95% prediction uncertainty (95PPU) to measure the degree of uncertainty, and R-factor is the average width of the 95PPU band divided by the standard deviation of the observed values to reflect the strength of the uncertainty analysis, for which a P-factor above 0.70 and an R-factor of around 1 are considered satisfactory simulations [45]. Visual inspection of spatial distributions reveals that TRMM has the best agreement with the observation pattern (Figure 2b,g). The spatial distributions and precipitation amount of CPC and GSMaP are similar, but they are different to the observation pattern. Particularly, CPC and GSMaP show a distinct rainfall pattern with much lower precipitation over the northeast and mid-latitude regions (Figure 2c,d). This is possibly because the number of GTS gauges that provide data for CPC Visual inspection of spatial distributions reveals that TRMM has the best agreement with the observation pattern (Figure 2b,g). The spatial distributions and precipitation amount of CPC and GSMaP are similar, but they are different to the observation pattern. Particularly, CPC and GSMaP show a distinct rainfall pattern with much lower precipitation over the northeast and mid-latitude regions (Figure 2c,d). This is possibly because the number of GTS gauges that provide data for CPC is insufficient to capture the precipitation pattern of these sub-areas. CMORPH reveals a relatively similar spatial distribution pattern as that of the TRMM 3B42 at the mid-latitude and low-latitude parts of the river basin, even though apparently lower precipitation amounts are observed. The spatial precipitation patterns in the upstream part of the basin (i.e., the northern part) for both seasons using CMORPH are very distinct from the observation pattern and the other evaluated GPPs-this product's precipitation amount is extremely low compared to the others. The performances of the GPPs at the annual scale exhibit a similar pattern to the results at the seasonal scale.

Statistical Evaluation of GPPs
It is clear that the spatial patterns of precipitation need to be considered when using these GPPs; TRMM 3B42 is able to capture the temporal (at both the seasonal and annual time scales) and spatial distribution patterns of precipitation better than the other evaluated GPPs over the MRB, and can be considered as a good alternative to rain gauges for hydrological research applications at these evaluated scales.

Monthly Comparison
The monthly average precipitation time series in the basin for the GPPs during the period 1998-2006 were evaluated against the rain gauge data at different spatial scales (i.e., basin and pixel scales).
Relatively good agreements with the rain gauge data were observed in the CC indicator for all of the GPPs at the basin scale, with 0.83 for TRMM 3B42, 0.79 for GSMaP, 0.77 for CMORPH, and 0.76 for CPC. In terms of the five statistical indicators (Table 4), TRMM 3B42 had the best performance at the monthly scale, but with a slightly overestimated precipitation value, and CMORPH, CPC, and GSMaP had large errors with all indicators underestimating the precipitation. At the pixel scale, the spatial distributions of the indicators (CC, RMSD, and PBIAS) at a monthly scale over the study area are shown in Figure 3a-l). From the simulated data, it can be seen that TRMM 3B42 has the best CC values for the whole basin. The CC values of CPC and GSMaP are worst in the south-east region, ranging from 0.2 to 0.4, while those of CMORPH are relatively low in the upstream and south-west regions of the basin, in the range of 0.4-0.6 ( Figure 3a-d).  The RMSD shows a similar pattern, but to a different extent in all of the GPPs (Figure 3e-h). The RMSD of TRMM 3B42 is the lowest error (around 50-100 mm/month) at the monthly scale, compared to all of the other GPPs (250-300 mm/month). Very high RMSD values are seen in CPC and GSMaP in the south-east of the river basin, as shown in the Figure 3e-h, which is probably because this area is a high rainfall zone in the river basin. At the basin scale, TRMM 3B42 also has the smallest RMSD value (82.93 mm/month), while the RMSD values of GSMaP, CMORPH, and CPC are 91.71, 97.05, and 99.58 mm/month, respectively.
The distribution of PBIAS indicates that TRMM 3B42 performs the best compared to the others, since PBIAS shows the lowest values at both the grid scale, around 10% (Figure 3i-l), and the watershed scale, at 4% (Table 4). Considering CPC and GSMaP, their PBIAS values are consistently the worst in the east of basin-40% underestimation was seen-but the indicators are improved in the south-west of the basin, with PBIAS values of around 10% overestimation. In contrast, the PBIAS of CMORPH has no clear relationship with the gauge observations, sometimes overestimating or underestimating the observation by 20%-40%, as seen in Figure 3i-l. As expected, at the watershed scale, the PBIAS of GSMaP (-12%) and of CPC (−15%) can be considered to perform comparably better than CMORPH (−16%).
With monthly time series, it can be concluded that first, the TRMM 3B42 has the best performance at the basin scale, and CMORPH has the largest errors. Second, conducting analyses at the pixel scale further confirms the outperformance of the other models by TRMM 3B42, which is characterized by stable performance with observations in all of the indicators. In contrast, GSMaP, CPC, and CMORPH show inferior performances, especially in the south-east part of the basin.

Daily Comparison
As shown in Table 4, in terms of the daily average precipitation, TRMM 3B42 has the smallest average errors (0.08 mm/day overestimation), while GSMaP has the largest (3.52 mm/day by GSMaP compared with 4.61 mm/day by observation). Similarly, TRMM 3B42 overestimates precipitation overall, with a PBIAS value of 4%, showing the best performance, while CMORPH, CPC, and GSMaP underestimate precipitation, with a PBIAS value of −16%, −15%, and −12%, respectively. With RMSD and MAE, the TRMM product performs worst, with large overestimations for RMSD of 12.19 mm/day and MAE of 5.24 mm/day; the other products-CMORPH, CPC, and GSMaP-performed slightly better, with both RMSD and MAE being 11.75-4.87, 11.44-4.83, and 10.73-4.56, respectively. Overall, higher accuracy can be seen more at the monthly scale than at the daily scale, a possible reason for which being that the errors at the daily scale are canceled out due to aggregation. The SE value for TRMM rainfall is 0.18 mm, which is closest to the ground observation, while those for GSMaP, CPC, and CMORPH rainfall, are 0.13, 0.14, 0.15, respectively. Figure 4 plots the correlation coefficients between the GPPs and the rain observations, considering the effects of latitude. It can be seen that, in the upstream part of the basin (higher than 20 • N latitude), the CC of GSMaP is highest compared to the other GPPs, and the CPC product has better correlation than CMORPH and TRMM. In the low-and mid-latitude regions (lower than 20 • N latitude), the CC of GSMaP shows strong variation, reaching 0.9 at several stations-and even lower than 0.1 at some stations. A narrower variation in the range of [0.2,0.6] is presented in the CC of TRMM 3B42 and CMORPH. At the basin scale, the CCs of all of the selected GPPs are around 0.4; the GSMaP has the best overall performance with a CC of 0.44, while the CC of TRMM, CMORPH, and CPC is 0.42, 0.40, and 0.40, respectively. This result indicates that "calibration with rain gauge data" has the potential to improve the daily precipitation distribution for hydrometeorological applications, and the performance of GPPs, especially GSMaP, is strongly related to latitude.
The POD and CSI indicators were employed to analyze the ability to detect precipitation of all of the selected GPPs, and 0.1 mm/day was defined as the threshold of rain/no-rain detection [39]. Generally, the POD of all of the GPPs shows a good value, larger than 0.81 (Table 4); the POD of GSMaP and CPC is observed to be larger than 0.91, with an insignificant FAR of 0.44 for GSMaP and 0.41 for CPC, which indicates that the GSMaP and CPC data generally have more rain days compared to the observations. The CSI indicators of TRMM and CMORPH (0.5) are higher than those of GSMaP and CPC (0.46-0.48), which indicates that TRMM and CMORPH better detect precipitation than GSMaP and CPC. The POD and CSI indicators were employed to analyze the ability to detect precipitation of all of the selected GPPs, and 0.1 mm/day was defined as the threshold of rain/no-rain detection [39]. Generally, the POD of all of the GPPs shows a good value, larger than 0.81 (Table 4); the POD of GSMaP and CPC is observed to be larger than 0.91, with an insignificant FAR of 0.44 for GSMaP and 0.41 for CPC, which indicates that the GSMaP and CPC data generally have more rain days compared to the observations. The CSI indicators of TRMM and CMORPH (0.5) are higher than those of GSMaP and CPC (0.46-0.48), which indicates that TRMM and CMORPH better detect precipitation than GSMaP and CPC. Figure 5 shows the frequency of rainfall events (bar chart) and their relative contributions to the total accumulative rainfall from the 1998-2006 period (line chart) by different rainfall classification of the four GPPs and rain gauge observations at the daily scale. Overall, the GPPs underestimate rainfall frequency, as well as the contributions of no rain (rain ≤ 0.1 mm) and moderate/heavy rain events (rain > 20 mm) compared to the observations. Conversely, with the little and light rain intensities (0.1 < rain ≤ 20 mm), both the occurrence frequencies and their contributions of the gridded data are overestimated in comparison to the rain gauges. The contribution of light rain shows the highest contribution in the GPPs, while the highest contribution of observation is from the moderate (20,50] mm rainfall class.   The POD and CSI indicators were employed to analyze the ability to detect precipitation of all of the selected GPPs, and 0.1 mm/day was defined as the threshold of rain/no-rain detection [39]. Generally, the POD of all of the GPPs shows a good value, larger than 0.81 (Table 4); the POD of GSMaP and CPC is observed to be larger than 0.91, with an insignificant FAR of 0.44 for GSMaP and 0.41 for CPC, which indicates that the GSMaP and CPC data generally have more rain days compared to the observations. The CSI indicators of TRMM and CMORPH (0.5) are higher than those of GSMaP and CPC (0.46-0.48), which indicates that TRMM and CMORPH better detect precipitation than GSMaP and CPC. Figure 5 shows the frequency of rainfall events (bar chart) and their relative contributions to the total accumulative rainfall from the 1998-2006 period (line chart) by different rainfall classification of the four GPPs and rain gauge observations at the daily scale. Overall, the GPPs underestimate rainfall frequency, as well as the contributions of no rain (rain ≤ 0.1 mm) and moderate/heavy rain events (rain > 20 mm) compared to the observations. Conversely, with the little and light rain intensities (0.1 < rain ≤ 20 mm), both the occurrence frequencies and their contributions of the gridded data are overestimated in comparison to the rain gauges. The contribution of light rain shows the highest contribution in the GPPs, while the highest contribution of observation is from the moderate (20,50] mm rainfall class. GSMaP shows the largest discrepancy with the observations, both in frequency and contribution, in all rain intensity classes. The frequency is 27% higher than that of the observations in the little rain class (0.1,1) mm-31.3% for GSMaP and 4.3% for the observations. The rainfall of all of the GPPs in this class provides an insignificant contribution. At the light rain intensity (1,20] mm, the occurrence frequency of GSMaP is 36.3% compared to 20.7% for the observations, and the contribution is 62.5% and 35% for GSMaP and the observations, respectively, which is nearly a 30% difference. The CPC and CMORPH products illustrate the same variation, but with a narrower spread compared to GSMaP-the occurrence frequency of 27% and 20% with little rain, and 35.5% and 28.7% with moderate rain, respectively. TRMM 3B42 deviates the least compared to the observations regarding the occurrence frequencies and contributions of all rainfall intensities, as shown in Figure 5. TRMM 3B42 estimates the occurrence frequency of little rain events to be approximately 11.7%, which is approximately 7% lower than the actual occurrence frequency. TRMM 3B42 estimates the occurrence frequency of light rainfall events to be approximately 29.14%, which is approximately 8% higher than the actual occurrence frequency, and the contribution to be 45%, which is 15% higher than the observations. With moderate rain class (20,50] mm, the contribution of TRMM 3B42 even matches that of the observations-38% and 39%, respectively. At heavy intensity (>50 mm), the difference between TRMM and the observations, both in occurrence frequency and contribution, is also lowest-0.62% and 10%, respectively.
In other words, quantitatively analyzing the discrepancies between the GPPs and the rain gauge data in terms of the rain class first shows that all of the GPPs feature similar error characteristics, i.e., they tend to overestimate little and light rainfall events, but underestimate moderate and heavy ones. Significant discrepancies can be found in the GSMaP product in contrast to the narrowest spread presented in TRMM 3B42. Second, all of the GPPs perform poorly in capturing heavy rainfall (>50 mm/day) events, which suggests that local calibration with rain gauge or ground radar data should be carried out to further improve the daily precipitation estimates for further studies.
The analysis of statistical indicators proves that the TRMM 3B42 product illustrates the best ability to capture precipitation for the whole MRB at different time scales: Monthly, seasonal, and annual comparisons. It can be found that at the daily time scale, the GSMaP and CPC products show better performance than the other GPPs in the upstream region of the basin.
Evaluation of the different GPPs at different variability scales (i.e., spatial and temporal scales) using statistical indicators provides an overall picture of the GPPs' performance. To enable the guiding of users to more precisely select the appropriate product for hydrological applications, multiple GPPs should be tested in hydrological simulations [48,49]. Therefore, in this study, further simulations of the daily hydrological process in the MRB using the SWAT model were conducted.

Evaluation of Precipitation Products' Hydrological Performance using SWAT Model
This section explores the potential of the selected precipitation products for hydrological simulation over the MRB using the SWAT model. The model was calibrated against the daily discharge collected from six hydrological gauges within the MRB. Sensitive analysis and model calibration were performed using the SUFI-2 algorithm integrated into the SWAT-CUP tool. The analysis was conducted over 8 years (1998-2006) with different precipitation data at daily time scales, while the other meteorological information remained unchanged.

Evaluation of Model Performance
The 1998 data were used as a "warm-up" period to initialize the model, and the data from the rain gauge observations and the three GPPs (CPC, TRMM 3B42, and CMORPH) from 1999 to 2006 were used for model calibration. With GSMaP, the precipitation from 2001 to 2006 was used for model calibration, with the 2000 dataset used for the warm-up period, due to a lack of obtained data. This calibration procedure was performed at each individual discharge gauge and for each individual GPP. The best model parameter values obtained for each precipitation product for the calibration period are presented in Figure 6 and Table 5.  The P-factor and R-factor results for the calibration range from 0.68-0.98 and 0.67-1.55, respectively, which reveals that all of the simulations to capture streamflow discharge acquire  The P-factor and R-factor results for the calibration range from 0.68-0.98 and 0.67-1.55, respectively, which reveals that all of the simulations to capture streamflow discharge acquire reasonable uncertainties [45]. The simulated streamflow reproduced by the different GPPs at each individual station is satisfactory, with NSE > 0.67 and R 2 > 0.77 (Table 5).
The models using the GSMaP and CPC products as the precipitation data attain excellent performance for the upstream part of the MRB during the calibration period, with NSE and R 2 at Chiange Saen and Luang Prabang being in the range of 0.82 and 0.86, respectively (Figure 6),while the performances with the observation data for the calibration periods are 0.7 for NSE and 0.81 for R 2 .
These results indicate that the spatial coverage of the rainfall stations in the upstream part of the MRB does not feature a sufficient density to reflect the rainfall characteristics in this region. Calibrating the SWAT model with individual gridded precipitation further improves streamflow simulations compared to the model calibrated with gauges for the sparse-rain gauge region in the upstream part of the MRB. Although individual precipitation may possess large biases, calibration may mitigate the errors of such biases. Over the northern section of the river basin with sparse gauges, the use of GPPs is more reasonable than that of gauge-based precipitation products.
At Nong Khai station (mid-latitude region), the fitting of the simulated streamflow of the GSMaP data-forced model has relatively lower but still reasonable performance compared to the other precipitation products, with an NSE and R 2 of around 0.84 and 0.94, while those same values for CPC, Gauge, CMORPH, and TRMM 3B42 are around 0.93 and 0.97, respectively. In addition, the PBIAS values in this area vary from 3.65 to −8.76 for all of the GPPs' forced simulations, showing a large variability in the performances between the different GPPs.
At the Mukhdan, Pakse, and Stung Treng gauges (i.e., the eastern part), the best performance is attained when using the rain gauge observation data, with the NSE being around 0.94 and the R 2 being 0.98 for all three of the stations. This is simply because the highest density of rainfall is collected, which better reflects the precipitation characteristics for this sub-region ( Figure 6). All of the GPPs' forced data simulations exhibit satisfactory performance, with the NSE changing from 0.78 to 0.89. The lowest NSE can be seen for GSMaP.
The TRMM 3B42 forced simulation exhibits an almost negligible PBIAS (−5.62% to −2.01%), suggesting its promising potential to replace in situ observations in hydrologic applications, even though it produces a slightly lower streamflow than that of the observations. The simulation of GSMaP shows large variation of the PBIAS (from −26.64% to −5.93%), which suffers from serious underestimation, especially in the eastern part of the basin (Table 5). A similar tendency as that of GSMaP is visualized during the simulation of CPC, with the highest PBIAS at the Mukhdan and Stung Treng gauges being −24.54% and −10.44%, respectively.
Overall, at the daily scale, there is a large variability in hydrological performance depending on the GPP being used. In the upstream part of the basin, CPC attains the best performance, while in the mid-and low-latitude regions, the best performance can be seen by TRMM 3B42 s forced simulations.

Water Balance Components at the Sub-Region Scale
As mentioned above, the application of a GPP is specific to the location. This uncertainty in precipitation will propagate into water balance components. Therefore, for water balance analysis, the Mekong mainstream river was divided into six regions, respectively, with six hydrological stations (   Figure 8 shows the variability of precipitation (P), evapotranspiration (ET), groundwater recharge (GW), and runoff (R) for each GPP obtained from SWAT model simulations for sub-regions in the MRB. A significant variation of all water-balance components under different GPPs is observed for each sub-region. These uncertainties are also expressed with a wide range of standard errors (SE), as presented in Table 6. The variation of precipitation could be the main reason for the variations of other water-balance components. In the R1 sub-region, for example, the variation of P among GPPs is much smaller than those in other sub-regions, and similar variations are found for other water-balance components. As a result, the selection of relevant GPPs for a specific sub-region is important for the reliability of water-balance components and water-resource evaluation. Based on the PBIAS and NSE values, it is recommended that CPC products be used for R1, TRMM for R2a, R2b, R2c, R4, and gauge products for R3. The quantitative analysis of the water-balance components using the recommended rainfalls as inputs for respective sub-regions is displayed in Table 6. as presented in Table 6. The variation of precipitation could be the main reason for the variations of other water-balance components. In the R1 sub-region, for example, the variation of P among GPPs is much smaller than those in other sub-regions, and similar variations are found for other waterbalance components. As a result, the selection of relevant GPPs for a specific sub-region is important for the reliability of water-balance components and water-resource evaluation. Based on the PBIAS and NSE values, it is recommended that CPC products be used for R1, TRMM for R2a, R2b, R2c, R4, and gauge products for R3. The quantitative analysis of the water-balance components using the recommended rainfalls as inputs for respective sub-regions is displayed in Table 6.   Precipitation over the basin is highly variable, ranging from 2338 mm in R2c to 857 mm in R1 (Upper Mekong River in China). The highest rainfall regions, R2c and R4, generally correspond to the areas of high elevation along the Annamite Range and are covered primarily by forest (around 60%) with a streamflow-precipitation ratio of approximately 0.6. The distribution of total runoff depth in the MRB closely reflects the spatial patterns in rainfall, with much higher proportions of rainfall (approximately 60%) translated into streamflow in R2c and R4, and with the greatest runoff of up to 1305.7 in R2c as the annual average for the 7-year period.
The precipitation in R2a, R2b, and R3 is lower than that in R2c and R4, but is still relatively high-from 1615 mm in R2a to approximately 2000 mm in R2b and R3. This R2a and R2b area is almost entirely mountainous and is covered with natural forest (nearly 90% in R2a and around 60% in R2b). These areas have high evapotranspiration and low surface runoff due to the high subsurface flow. Since R2a is still dominated by characteristics of R1, the evapotranspiration in R2a occupies upto 60%, and the surface runoff occupies only 4% in the total runoff depth. Additionally, R2b has a wide scope for agriculture (30% of the area), therefore much higher proportions of rainfall are translated into streamflow compared with R2a (up to 53%). Agriculture dominates in R3 and is reported to account for more than 80% of the area [32,33]. Consequently, the surface runoff is high, accounting for approximately 60% of the total flow. This region also has a high evaporation rate, accounting for 47%.
The rainfall in R1 decreases as the altitude increases toward the north; in this region, the mean rainfall is only 857 mm, and evapotranspiration is around 60%. Land use in this region is mainly divided into two major components: The majority is covered primarily by grassland (50%), while a further 40% is covered primarily by forest. This region has a very low runoff coefficient, but the streamflow-precipitation ratio accounts for up 42% of the total rainfall due to the major source of water flowing into the river coming from the melting snow on the Tibetan Plateau.

Conclusions
In this study, we analyzed four GPPs (TRMM 3B42, CMORPH, GSMaP, and CPC) and their hydrological application over the whole MRB for a nine-year period (1998)(1999)(2000)(2001)(2002)(2003)(2004)(2005)(2006). We applied statistical indicators to evaluate the consistency between GPPs and the precipitation observations, and applied the SWAT model for analysis of the water balance components. The following major findings are concluded:

1.
Considering the statistical indicators and the average precipitation, TRMM 3B42 illustrated the best ability to capture precipitation at the annual, seasonal, and monthly scales. At the daily scale, GSMaP and CPC showed a better performance and should be considered for use, especially for the upstream region of the basin.

2.
With each dataset calibrated individually by the SWAT model, satisfactory performances were achieved at the daily scale for all of the GPPs and the gauge-driven models. For the ungauged or sparsely gauged regions, better performance was seen from the GPPs than the gauge-driven models, especially for the CPC product. In the downstream regions, TRMM showed the best performance, except for the gauge-driven models. These results further confirm the appropriateness of the GPPs at the daily time scale, which suggests its promising potential to replace in situ observations in hydrologic applications and its potential in the performance of all water balance components.

3.
This study attempted to use different GPPs for each individual sub-region to evaluate the water balance components, suggesting strong capabilities for utilizing the advantages of publicly available GPPs in hydrological applications. 4.
The spatial variability of water balance components was analyzed in the sub-regions. The distribution of total runoff depth is consistent with the spatial patterns in rainfall, but the landscape, soil texture, and terrain are also major factors that shape the distribution of streamflow. Forests are the major water yield for vegetation types, contributing to the baseflow, while agriculture offers factor-driven high surface runoff.
Generally, TRMM 3B42 is more robust than the other GPPs and is a reliable data source for hydrological applications in data-sparse area. The spatial variability of the water balance components should be analyzed based on the simulation with distinct suggested GPPs for each of the different sub-regions, which provides better references for the assessment and management of the basin water resources in data-scarce regions. Our findings are useful for selecting which GPP is suitable for advanced hydrological applications for each sub-region. Further studies should examine statistical distribution analysis and include extreme precipitation and the possible prediction of rainfall products over the river basin in order to add more inference and generalization to this study.