Assessing Hydrological Modelling Driven by Different Precipitation Datasets via the SMAP Soil Moisture Product and Gauged Streamflow Data

To compare the effectivenesses of different precipitation datasets on hydrological modelling, five precipitation datasets derived from various approaches were used to simulate a two-week runoff process after a heavy rainfall event in the Wangjiaba (WJB) watershed, which covers an area of 30,000 km2 in eastern China. The five precipitation datasets contained one traditional in situ observation, two satellite products, and two predictions obtained from the Numerical Weather Prediction (NWP) models. They were the station observations collected from the China Meteorological Administration (CMA), the Integrated Multi-satellite Retrievals for Global Precipitation Measurement (GPM IMERG), the merged data of the Climate Prediction Center Morphing (merged CMORPH), and the outputs of the Weather Research and Forecasting (WRF) model and the WRF four-dimensional variational (4D-Var) data assimilation system, respectively. Apart from the outlet discharge, the simulated soil moisture was also assessed via the Soil Moisture Active Passive (SMAP) product. These investigations suggested that (1) all the five precipitation datasets could yield reasonable simulations of the studied rainfall-runoff process. The Nash-Sutcliffe coefficients reached the highest value (0.658) with the in situ CMA precipitation and the lowest value (0.464) with the WRF-predicted precipitation. (2) The traditional in situ observation were still the most reliable precipitation data to simulate the study case, whereas the two NWP-predicted precipitation datasets performed the worst. Nevertheless, the NWP-predicted precipitation is irreplaceable in hydrological modelling because of its fine spatiotemporal resolutions and ability to forecast precipitation in the future. (3) Gauge correction and 4D-Var data assimilation had positive impacts on improving the accuracies of the merged CMORPH and the WRF 4D-Var prediction, respectively, but the effectiveness of the latter on the rainfall-runoff simulation was mainly weakened by the poor quality of the GPM IMERG used in the study case. This study provides a reference for the applications of different precipitation datasets, including in situ observations, remote sensing estimations and NWP simulations, in hydrological modelling.


Introduction
Numerical simulation of the rainfall-runoff process is an important way to research water cycle, flood monitoring, water resource management and environmental conservation [1][2][3][4][5].The performance of a rainfall-runoff simulation differs from the applied hydrological model, calibrated parameters and input data [6][7][8][9].Because of different theories of runoff generation and routing, different considerations for spatial variation of the underlying surface, and different scheme combinations, diverse hydrological models lead to different simulation results [10][11][12].For a specific study area, when the applied hydrological model, required parameters, and other input data are fixed, precipitation is the most important datum that affects hydrological modelling, because its accuracy will dominantly influence the subsequent calculation qualities of net precipitation, soil moisture variation, runoff generation, and runoff routing, then finally determine the success of the rainfall-runoff simulation [13][14][15][16][17].
Currently, there are three mainstream ways to obtain precipitation data.One method is the use of traditional in situ observation [18].This type of precipitation data is measured using the equipments set in precipitation stations, meteorological stations, or automatic weather stations, which are managed by different departments.The in situ observed precipitation is generally recognized as the most accurate datum that represents the true value [19,20].However, its application in hydrology is limited by its poor point-to-area representativeness, incomplete opening and sparse station network in developing areas [21,22].The second method is remote sensing estimation.This category of precipitation data emerged as the remote sensing techniques of visible, infrared and microwaves were developed.Commonly used satellite precipitation products contain the Global Precipitation Climatology Project (GPCP) [23], the Climate Prediction Center Morphing Technique (CMORPH) [24], the Tropical Rainfall Measuring Mission (TRMM) [25] and its successor, the Global Precipitation Measurement (GPM) [26].These products not only cover a nearly global area but also are freely available to the public, and their spatiotemporal resolutions are becoming finer.Moreover, there are corresponding upgraded products after post-processing [27].Nevertheless, the remotely sensed precipitation still falls short in terms of showing the consecutiveness of precipitation and detecting extreme events at high latitudes [28].The third method is obtaining precipitation from a numerical weather prediction (NWP) model.Because this atmospheric model is built on precise physical governing equations, an NWP model can describe the inherent dynamics of precipitation, thus present nearly the entire precipitation process with specific atmospheric reanalysis data [29][30][31].However, due to the incompleteness of initial and boundary conditions provided by reanalysis data, uncertainties are unavoidably included in the NWP outputs when solving the physical equations with approximations [32,33].When applied in hydrological modelling, the poor precipitation prediction of an NWP model and the scale mismatch between it and a hydrological model are two primary problems [34][35][36].To improve the accuracy of the NWP-predicted precipitation, various methods of data assimilation have been used to enhance the initial and lateral boundary conditions of an NWP model [37][38][39][40][41].The generally used NWP models include the National Meteorological Center (NMC) forecast model [42], the next-generation Weather Research and Forecasting (WRF) model [43], the operational Japan Meteorological Agency (JMA) mesoscale model [44] and the European Centre for Medium-Range Weather Forecasts (ECMWF) [45].
These three methods of collecting precipitation data have been widely used in hydrological modelling [1,[46][47][48][49][50][51][52][53][54][55].Many studies have been conducted to investigate the utilities of remote sensing estimations and the NWP simulations in hydrological modelling.For example, Wu et al. [48] evaluated the performances of nine existing precipitation products, including TRMM, CMORPH, and NLDAS-2 (Phase 2 of the North American Land Data Assimilation System), in the DRIVE model system for simulating a series of flood in Iowa.Essou et al. [1] applied global atmospheric reanalysis data to drive the lumped conceptual hydrological model HSAMI over 370 American watersheds.Liechti et al. [52] used TRMM 3B42 to drive the hydraulic-hydrologic model over the African Zambezi basin.Rasmussen et al. [49] investigated the spatial-scale characteristics of the precipitation predicted by the WRF model and its applications in hydrological modelling; they concluded that the RCM predictions had larger predictive certainty at a larger scale than at a smaller scale.Lin et al. [51] used the precipitation data generated by the Canadian atmospheric Mesoscale Compressible Community Model (MC2) to drive the Chinese Xin'anjiang hydrological model and simulated a series of flood events in the Huaihe River basin at a 5-km resolution.All of these investigations demonstrated the potential of the indirectly measured precipitation to be used in hydrological modelling.Nevertheless, there are still limited studies concerning the different effectivenesses of different precipitation datasets in hydrological modelling.Therefore, five types of precipitation datasets, incorporating one traditional in situ station observation, two satellite precipitation products, and two NWP predictions, were collected, evaluated and applied in hydrological modelling in this study.Moreover, to avoid the uncertainties caused by scale mismatch between the NWP and hydrological models [49], a 1-km resolution, which the applied NWP and hydrological models could realize, was employed in the simulations.Furthermore, as soil moisture is a crucial intermediate variable in hydrological modelling that affects water exchange, evapotranspiration estimation, runoff generation and model simulation [56][57][58][59][60][61], hydrologists have extended their attention to assess and improve the accuracy of soil moisture simulation to ensure the reasonability of hydrological simulation [62][63][64][65][66][67][68][69].Therefore, we assessed the performances of the hydrological modelling driven by different precipitation datasets via not only outlet discharge but also soil moisture.
This manuscript is structured as follows: Section 2 introduces the study area, study period and study data.Section 3 introduces the experimental design and evaluation metrics.Section 4 shows 1-km grid data obtained from different precipitation datasets, the simulated soil moisture and outlet discharges.Section 5 presents the evaluations of different precipitation datasets, the simulated soil moisture and outlet discharges.Finally, the conclusions are drawn in Section 6.

Study Area
As the NWP predictions generally have larger predictive certainty at a larger scale than at a smaller scale [35,49], a large-scale watershed covering an area of 30,630 km 2 , namely, Wangjiaba (WJB) was selected as the study area.The WJB watershed is located between 113.3 • -115.8 • E and 31.5 • -33.4 • N (Figure 1b); it is a sub-basin of the Huaihe River basin (HRB), which is one of the seven major river basins in China and lays between the Yellow and Yangtze rivers.This region has important political and economic functions because it has the highest population density in China and has 17% of the country's cultivated land [51].The WJB watershed belongs to the warm temperate and semi-humid monsoon climate.Its northern region is characterized by hot and wet summers and cold and dry winters because it is primarily controlled by the monsoon climate at mid-latitudes, while its southern part has hot and wet summers and mild and dry winters because it is dominated by a subtropical monsoon climate [70].The regional elevation in this watershed decreases from the west to the east.In the west, there are foothills and mountains, and the highest altitude is 1130 m above sea level (m.a.s.l.).The middle and eastern parts are vast plains.In this watershed, rapid flood drainage occurs with heavy precipitation because of the high regional topographic relief.Such rapid flood drainages cause the flat midstream and downstream reaches of the HRB difficult to drain, which finally result in floods.Therefore, the rainfall-runoff process in the upstream reach of the HRB, i.e., the WJB watershed, deserves further investigation.

Study Period
It is noteworthy that the temporal spans of the hydrological modelling driven by different types of precipitation datasets are different.The hydrological modelling driven by the in situ observed and remotely sensed precipitation often span periods as long as several decades once the data are available [71,72].In contrast, the time span of the hydrological modelling driven by the NWP-predicted precipitation is much shorter, as an NWP model commonly demands vast computational resources and computing time, particularly when it applies data assimilation and runs at a very fine grid spacing of 1 km [73][74][75][76].Thus, to compare the effectivenesses of these different precipitation datasets on hydrological modelling, we focused our study on one short-term rainfall-runoff process and used it as a case study over the WJB watershed.
To select the study period, the rainfall events that occurred in the WJB watershed in 2015 were analysed based on the in situ daily precipitation data, which were collected from the 215 precipitation stations of the China Ministry of Water Resources (CMWR).The contributions of the accumulated daily precipitation to the annual precipitation were summed and sorted in decreasing order for each precipitation station (Figure 2a).All the 215 precipitation stations received half of the annual amount of precipitation within a minimum of 4 days (LiJi station) and a maximum of 14 days (Sanliping station).This suggests that a single heavy precipitation event is the main contributor to the amount of annual precipitation in the WJB watershed [77,78].Because the forcing data of the applied NWP model, i.e., the final analysis (FNL) data ds083.3were just released on 8 July 2015, we selected the specific heavy precipitation event from August, which is also in the flood season (i.e., June-September) in the WJB watershed.As shown in Figure 2b, there were two days of continuous heavy precipitation on 18 and 19 August in the WJB watershed, and the mean daily precipitation of the 215 precipitation stations reached the highest monthly value (27.4 mm) on 18 August.Thus, we chose the rainfall-runoff process caused by this heavy rainfall event as our study case.Moreover, considering the spin-up problem [79][80][81] of the NWP model and the natural rainfall-runoff process, we extended the study period to include the periods before and after the heavy rainfall event.Finally, a two-week period from 17 to 30 in August 2015 was selected as the study period.
Remote Sens. 2018, 10, x FOR PEER REVIEW 5 of 28 annual amount of precipitation within a minimum of 4 days (LiJi station) and a maximum of 14 days (Sanliping station).This suggests that a single heavy precipitation event is the main contributor to the amount of annual precipitation in the WJB watershed [77,78].Because the forcing data of the applied NWP model, i.e., the final analysis (FNL) data ds083.3were just released on 8 July 2015, we selected the specific heavy precipitation event from August, which is also in the flood season (i.e., June-September) in the WJB watershed.As shown in Figure 2b, there were two days of continuous heavy precipitation on 18 and 19 August in the WJB watershed, and the mean daily precipitation of the 215 precipitation stations reached the highest monthly value (27.4 mm) on 18 August.Thus, we chose the rainfall-runoff process caused by this heavy rainfall event as our study case.Moreover, considering the spin-up problem [79][80][81] of the NWP model and the natural rainfall-runoff process, we extended the study period to include the periods before and after the heavy rainfall event.
Finally, a two-week period from 17 to 30 in August 2015 was selected as the study period.

In Situ Observed Precipitation
In this study, two types of in situ observed precipitation were used.One dataset was the precipitation measurements from the 14 meteorological stations within and near the WJB watershed (Figure 1b), which were provided by the China Meteorological Administration (CMA) (http://data.cma.cn).The daily CMA data are free to the public; they were downloaded and used in the calibrations and validations of the applied hydrological model.To improve the temporal resolution of the hydrological simulations, the hourly CMA data during the study period were also collected and used in the hydrological modelling.Another dataset was the precipitation measurements from the 215 precipitation stations in the WJB watershed (Figure 1c).This dataset was reported in the book of Annual Hydrological Report for the P.R. of China published by the CMWR (http://www.mwr.gov.cn).Although the CMWR data were only available in daily values, their observation network was much denser than that of the CMA data.Therefore, the CMWR data were not used for the hydrological modelling but rather applied in the selection of study case and the evaluation of the five precipitation datasets at a daily scale.

In Situ Observed Precipitation
In this study, two types of in situ observed precipitation were used.One dataset was the precipitation measurements from the 14 meteorological stations within and near the WJB watershed (Figure 1b), which were provided by the China Meteorological Administration (CMA) (http://data.cma.cn).The daily CMA data are free to the public; they were downloaded and used in the calibrations and validations of the applied hydrological model.To improve the temporal resolution of the hydrological simulations, the hourly CMA data during the study period were also collected and used in the hydrological modelling.Another dataset was the precipitation measurements from the 215 precipitation stations in the WJB watershed (Figure 1c).This dataset was reported in the book of Annual Hydrological Report for the P.R. of China published by the CMWR (http://www.mwr.gov.cn).Although the CMWR data were only available in daily values, their observation network was much denser than that of the CMA data.Therefore, the CMWR data were not used for the hydrological modelling but rather applied in the selection of study case and the evaluation of the five precipitation datasets at a daily scale.

Remotely Sensed Precipitation
Generally, finer satellite precipitation resolutions result in higher accuracies [82][83][84][85][86], thus the recently released Integrated Multi-satellite Retrievals for GPM (GPM IMERG) with spatiotemporal resolutions of 0.1 • and 30 min [82,87,88], and the merged CMORPH data with spatiotemporal resolutions of and 0.1 • and one hour, were selected and employed in this study.The GPM IMERG was the third-level precipitation product of GPM, which covers an area of ±60 • N/S.Tang et al. [70] concluded that, when compared with gauged observations, the Pearson's correlation coefficient (CC) values of the GPM IMERG over mainland China reached 0.53 and 0.71 at the hourly and daily timescales, respectively.The merged CMORPH was released by the CMA; it was produced by taking two algorithms of probability density function matching and optimal interpolation to merge the following two datasets: (1) the remote sensing precipitation product of the CMORPH data with spatiotemporal resolutions of 8 km and 30 min, which was released by the U.S. Climate Prediction Center [24]; and (2) the hourly in situ gauged precipitation data from more than 30,000-40,000 automatic weather stations in China after quality control.

NWP-Predicted Precipitation
In this study, the WRF model (version 3.7.1)and its WRF 4D-Var system were used to generate two types of the NWP-predicted precipitation data.The WRF model is a limited-area, non-hydrostatic, primitive-equation model with multiple options for various physical parameterization schemes.Since its release in May 2004, the WRF model has been widely used in atmospheric research and operational NWP user communities due to its advantages in terms of efficiency [89].Generally, higher grid resolution in the WRF model can capture more local characteristic of precipitation, thus reduce the prediction biases [90][91][92].Moreover, to avoid a scale mismatch between the WRF and the applied hydrological models, the nesting domain technique was used in the WRF model to dynamically downscale its input reanalysis data to a final resolution of 1 km.The nested domains were set around the WJB watershed (Figure 1a).The dominant parameters and physical configuration of the WRF model were set (Table 1), obeying the rule that the configuration should incorporate the experiences obtained from comparable atmospheric modelling studies as much as possible, especially the studies undertaken in the WJB watershed.To specify the initial state and the lateral boundary condition of the WRF model, the National Center for Environment Prediction (NCEP) FNL ds083.3dataset (http://rda.ucar.edu/)was applied as forcing data.The spatiotemporal resolutions of the FNL data are 0.25 • and 6 h; they are available from 8 July 2015 to a near-current date.This dataset is made with the same model which NCEP uses in the Global Forecast System (GFS) and obtained from the Global Data Assimilation System (GDAS), which continuously collects observational data from the Global Telecommunications System (GTS) and other sources for related analyses.

Map projection
Lambert conformal Centre point of domain 35.8Furthermore, considering the lower accuracy of the WRF precipitation simulation, the WRF 4D-Var data assimilation system [99][100][101][102] was also applied to improve the initial and boundary conditions to improve the accuracy of the precipitation prediction.The GPM IMERG was selected as the observation operator because of its recent release, wide coverage, free download and high accuracy [70].Moreover, the feasibility of assimilating the GPM IMERG into the WRF model has been demonstrated in Yi et al. [18].The GPM IMERG was accumulated into 6 h values and assimilated into the WRF 4D-Var system to improve the initial condition at every day 00 UTC.With the improved condition, a subsequent 24-h forecast was then adopted.This means that there were 14 independent WRF 4D-Var simulations during the entire 14 days of the study period.The main configuration of the WRF 4D-Var system was the same as that in the WRF model.To make the WRF 4D-Var convergence criterion more stringent, an EPS variable of 0.0001 was used.The key background error covariance matrix for the 4D-Var data assimilation was domain-specific; it was generated based on the 1-month-long (August) ensemble simulations, which were performed every 12 h using the National Meteorological Center (NMC) method [103].

Soil Moisture and Outlet Discharge
The measurement network for soil moisture was sparse in the WJB watershed and the gauged data were unavailable, so we used a remote sensing soil moisture product to assess the soil moisture simulated during the hydrological modelling [62,63].The satellite-based soil moisture was chosen obeying the following rules: choose the soil moisture data with finer resolution which are generally considered to have higher accuracy [86].Therefore, we selected the Soil Moisture Active Passive (SMAP) product with spatiotemporal resolutions of 9 km and 3 h as benchmark in the soil moisture evaluations.The SMAP mission was launched by the National Aeronautics and Space Administration (NASA); it takes advantage of the relative strengths of both active (radar) and passive (radiometer) microwave remote sensing to obtain an intermediate level of accuracy and resolution for soil moisture mapping.Among its 15 data products with different levels of data processing, the SMAP Level-4 Surface and Root-Zone Soil Moisture (L4_SM) data (version 3) were employed.L4_SM is generated by the NASA catchment land surface model, and it mainly assimilates the SMAP 9-km active-passive (AP) soil moisture product L2_SM_AP, which combines radar and radiometer measurements.It is gridded using an Earth-fixed, global, cylindrical equal-area scalable Earth grid, and version 2.0 (EASE-Grid 2.0).Reichle et al. [104] assessed the accuracy of the L4_SM product and concluded that it met the soil moisture accuracy requirements specified as an unbiased RMSE of 0.04 m 3 m −3 or better.The L4_SM data are available from 31 March 2015 to the present (within 3 days from real-time) and provide estimates of the surface (0-5 cm) and root-zone (0-100 cm) soil moisture values.Hereafter, the SMAP soil moisture mentioned in the manuscript refers to the data from the SMAP L4_SM product at the root zone.The hourly discharges observed at the watershed outlet in the study period were provided by the China Institute of Water Resources and Hydropower Research (IWHR).

Hydrological Model
To reflect the impacts of spatial characteristics of rainfall on hydrological modelling, the semi-distributed hydrological model TOPX was used in this study.The TOPX model is constructed on the basis of the topographic index (TOP) and the water balance concept of the Xin'anjiang model (X).It applies the improved simple TOPMODEL (topography-based hydrological model)-based runoff parameterization (SIMTOP) [105], and the methods of empirical unit hydrograph, linear reservoir equation, and Muskingum for its routings of overland flow, base flow, and channel flow, respectively [106,107].Apart from some physical parameters, the input data of the TOPX model including precipitation, potential evaporation, topographic index (TI) are required as grid-based data, so as to reflect of the spatial variations of precipitation, evaporation, and topography, thus facilitate reflecting their subsequent impacts on water exchange, soil moisture variation, runoff generation, runoff routing and outlet discharge simulation.
When applying the TOPX model, the 1-km grid data of precipitation and potential evaporation were obtained from the 14 CMA meteorological stations and the 10 CMWR evaporation stations (Figure 1c), respectively.The data of TI were calculated with the method posed by Yi et al. [106] which considered the impacts of both topography and soil properties on hydrological processes.The TI was computed based on the digital elevation model (DEM) that was downloaded from the official website of the United States Geological Survey (USGS) (https://www.usgs.gov/)and the Harmonized World Soil Database (HWSD) obtained from the Cold and Arid Regions Sciences Data Center at Lanzhou (http://westdc.westgis.ac.cn/).The results of the used 1-km grid TI data and the soil type classification from the HWSD in the WJB watershed are shown in Figure 3.To achieve better compatibilities of the TOPX model in the WJB watershed, the model was calibrated and validated with the available daily data using the trial-and-error method.During calibration and validation, both the long-term rainfall-runoff process and the short-term flood process were simulated, as the latter can revise the parameters that are determined based on the former [108][109][110].The long-term rainfall-runoff processes covered the periods from 2001 to 2005 and from 2014 to 2015; the 7 short-term flood events were selected from the long-term rainfall-runoff processes.As shown in Figure 4, the simulated recession curves of several selected short-term flood events decay faster than those of the observations.These fast-decay recession curves and their subsequent shorter flood durations were mainly related to the lack of consideration of the reservoir storage capacity in the TOPX model.Because we focused on the differences of model performance caused by the different precipitation inputs, such limitations of the TOPX model were not considered in the investigation.The statistical results showed that the Nash-Sutcliffe coefficient (NS) [111] of the TOPX model in the WJB watershed was as low as 0.700 (Table 2).

Experimental Design
When assessing the effectivenesses of the different precipitation datasets, the studied two-week long rainfall-runoff process was simulated with the TOPX model at an hourly scale.Therefore, a slight tuning of the model parameters calibrated at a daily scale was done to adapt the hourly simulations.The final parameters of the TOPX model used in the simulations are shown in Table 3.To simulate the study case at spatiotemporal resolutions of 1 km and one hour, the hourly CMA observations were interpolated to 1-km grid data using the inverse distance weighted (IDW) algorithm, since the IDW method can furthest reflect the impact of each station observation on the interpolated point through distance weighting [112].The 30-min GPM IMERG data were firstly accumulated to hourly grid data, then resampled from 0.1 • to a 1-km resolution with the operationally used algorithm of bilinear interpolation [113].The hourly merged CMORPH data were also resampled from 0.1 • to a 1-km resolution with the bilinear interpolation algorithm.The 1-km grid data of precipitation obtained from the WRF model and the WRF 4D-Var system data were accumulated from 6 s to hourly values.Moreover, to unify the time zones of these precipitation datasets and make them consistent with the observed outlet discharges, all the employed precipitation data were adjusted to the same time zone as that of the observed outlet discharge data, i.e., Beijing time.According to the different precipitation inputs, five experiments of rainfall-runoff simulation were performed and labelled as P_CMA, P_GPM, P_CMOR PH, P_WRF and P_4D-Var, respectively (Table 4).Before the five hydrological modelling experiments, the precipitation input of the TOPX model, i.e., the 1-km grid data obtained from the five different precipitation datasets, were evaluated with the CMWR in situ data.Firstly, the hourly precipitation values were extracted from the grid points nearest to the 215 CMWR precipitation stations and accumulated to daily values, then compared to the daily CMWR station observations.For this point-scale evaluation, we used the error scores of the mean error (ME), relative error (RE), root mean square error (RMSE) and CC (Table 5), which describe the errors, deviations and correlation between the simulated data and the reference data, respectively.Secondly, as the studied rainfall-runoff process was triggered by the heavy precipitation on 18 and 19 August, the accuracies of the different heavy precipitation accumulated in these two days were also evaluated.The daily 1-km grid data of the CMWR data processed with the IDW method were accumulated in the two days.The 1-km grid data of the CMA, GPM IMERG, merged CMORPH, WRF model and WRF 4D-Var were accumulated in those two days as well and compared to the accumulated CMWR data.For this field-scale evaluation, the skill scores of the bias score (BIAS), false alarm ratio (FAR), probability of detection (POD) and threat score (TS) (Table 5) were used.These skill scores were constructed based on the "contingency table" [114].BIAS is an indicator of how well the estimation covers the number of occurrences of an event.FAR is the fraction of "yes" estimation that turns out to be wrong.POD is the ratio of correct estimations to the number of times the event occurred; it is commonly known as the hit rate.TS is one of the most frequently used and comprehensive skill scores for summarizing square contingency tables, and this metric combines the characteristics of hints and random detections.The equations and the perfect values of these evaluation indices are listed in Table 5.
Bias score (BIAS) False alarm ratio (FAR) Threat score (TS) * P P,i and P O,i denote the simulated and observed values, respectively, of the i grid, and P P,i and P O,i are their respective means.A represents the precipitation predicted by the WRF and observed by the reference data; B represents the precipitation predicted by the WRF but not observed by the reference data; C represents the precipitation not predicted by the WRF but observed by the reference data; D represents the precipitation not predicted by WRF and not observed by the reference data.Q obs,i and Q sim,i are the observed and simulated values at time i; Q obs is the average value of the observations during the simulation period.
Because the soil area defined in the TOPX model extends to the root zone, we used the data in the root zone of L4_SM for the soil moisture evaluation.To compare with the SMAP soil moisture, the hourly soil moisture simulated by the P_CMA, P_GPM, P_CMORPH, P_WRF and P_4D-Var experiments were accumulated every 3 h and interpolated with kriging method to a 9-km resolution, which were the same with the spatiotemporal resolutions of the SMAP.The units of the soil moisture obtained from the TOPX model and the SMAP product are different, the former applies the depth of the water column (mm), and the latter employs the water volume content (mm 3 /mm 3 ).Therefore, we indirectly evaluated the simulated soil moisture with the SMAP data from the perspective of relativity, which is a frequently used method in the evaluation of soil moisture [115,116].The CC and standard deviation were employed to investigate the relationship between the simulated soil moisture and the SMAP soil moisture, and t-test was used to determine the statistical significance (p) of the correlation.In this study, we defined the CC as significant when its corresponding p was less than 0.05 [117].The NS, CC and RE were applied to evaluate the simulated outlet discharge.

1-km Grid Data of Different Precipitation Datasets
The 1-km grid of the hourly data obtained from the CMA, GPM IMERG, merged CMOPRH, WRF model and WRF 4D-Var system were accumulated to the amount of total precipitation during the entire study period, the daily CMWR data were also accumulated in this way.As portrayed in Figure 5, the spatial heterogeneity of the CMWR data was very obvious (Figure 5a) because its observation network was the highest.In contrast, because of much fewer observation stations, the spatial variation of the CMA data was relatively homogeneous (Figure 5b), which was also reflected in its lowest standard deviation (9.0).The downscaled results of the GPM IMERG (Figure 5c) and the merged CMORPH (Figure 5d) still showed evident grid characteristics because of their coarser resolution of 0.1 • .The watershed mean values of the latter were obviously higher than that of the former.Having the finest spatial resolution (1 km), the distributions of the precipitation simulated by the WRF model and the WRF 4D-Var system were the most continuous.Figure 5e shows that the WRF-predicted precipitation is focused in the middle and south of the WJB watershed, and its maximum (185.9 mm) and average (68.2 mm) watershed values are the highest among the studied precipitation datasets.After data assimilation with the GPM IMERG, the heavy precipitation predicted by the WRF 4D-Var system moved from the middle to the south (Figure 5f), and the average precipitation amount in the watershed decreased from 68.2 mm to 32.2 mm.
The 1-km grid of the hourly data obtained from the CMA, GPM IMERG, merged CMOPRH, WRF model and WRF 4D-Var system were accumulated to the amount of total precipitation during the entire study period, the daily CMWR data were also accumulated in this way.As portrayed in Figure 5, the spatial heterogeneity of the CMWR data was very obvious (Figure 5a) because its observation network was the highest.In contrast, because of much fewer observation stations, the spatial variation of the CMA data was relatively homogeneous (Figure 5b), which was also reflected in its lowest standard deviation (9.0).The downscaled results of the GPM IMERG (Figure 5c) and the merged CMORPH (Figure 5d) still showed evident grid characteristics because of their coarser resolution of 0.1°.The watershed mean values of the latter were obviously higher than that of the former.Having the finest spatial resolution (1 km), the distributions of the precipitation simulated by the WRF model and the WRF 4D-Var system were the most continuous.Figure 5e shows that the WRF-predicted precipitation is focused in the middle and south of the WJB watershed, and its maximum (185.9 mm) and average (68.2 mm) watershed values are the highest among the studied precipitation datasets.After data assimilation with the GPM IMERG, the heavy precipitation predicted by the WRF 4D-Var system moved from the middle to the south (Figure 5f), and the average precipitation amount in the watershed decreased from 68.2 mm to 32.2 mm.The watershed average values of the different hourly precipitation datasets are portrayed in Figure 6a.They generally showed similar rainfall tendencies.Their main differences existed in terms of the times that the peaks occurred and the magnitudes of the peak flow.Figure 6b  The watershed average values of the different hourly precipitation datasets are portrayed in Figure 6a.They generally showed similar rainfall tendencies.Their main differences existed in terms of the times that the peaks occurred and the magnitudes of the peak flow.Figure 6b clearly shows that the watershed mean values of the daily precipitation data from the CMA, merged CMORPH, WRF model and WRF 4D-Var system present concentrated rainfall on 19 August, but the datasets of the CMWR and the GPM IMERG present the rainfall event on 18 and 19 August.In terms of the heavy rainfall, the watershed average values of the daily GPM IMERG showed evident underestimations.In contrast, the daily WRF simulations showed obvious overestimations.The WRF 4D-Var-predicted daily precipitation was more sharply reduced than the WRF-predicted precipitation after assimilating the GPM IMERG.
WRF model and WRF 4D-Var system present concentrated rainfall on 19 August, but the datasets of the CMWR and the GPM IMERG present the rainfall event on 18 and 19 August.In terms of the heavy rainfall, the watershed average values of the daily GPM IMERG showed evident underestimations.In contrast, the daily WRF simulations showed obvious overestimations.The WRF 4D-Var-predicted daily precipitation was more sharply reduced than the WRF-predicted precipitation after assimilating the GPM IMERG.

The Simulated Soil Moisture
The 3-h averaged soil moisture simulated by the TOPX model with different precipitation datasets is shown in Figure 7.It was clear that the spatial distributions of these soil moisture all kept accordance with their corresponding precipitation fields (Figure 5).This suggested that the spatial variation of precipitation dominantly influenced the spatial distribution of soil moisture.The soil moisture generated in the P_CMORPH experiment was generally higher than that generated in the P_GPM experiment (Figure 7c,d), which was consistent with the results that the watershed average precipitation from the CMORPH was higher than that from the GPM IMERG (Figure 6).Because of the reduction of the WRF 4D-Var-predicted precipitation after assimilating the GPM IMERG, its simulated soil moisture decreased as well compared to that of the P_WRF experiment (Figure 7e,f

The Simulated Soil Moisture
The 3-h averaged soil moisture simulated by the TOPX model with different precipitation datasets is shown in Figure 7.It was clear that the spatial distributions of these soil moisture all kept accordance with their corresponding precipitation fields (Figure 5).This suggested that the spatial variation of precipitation dominantly influenced the spatial distribution of soil moisture.The soil moisture generated in the P_CMORPH experiment was generally higher than that generated in the P_GPM experiment (Figure 7c,d), which was consistent with the results that the watershed average precipitation from the CMORPH was higher than that from the GPM IMERG (Figure 6).Because of the reduction of the WRF 4D-Var-predicted precipitation after assimilating the GPM IMERG, its simulated soil moisture decreased as well compared to that of the P_WRF experiment (Figure 7e,f   The watershed average values of the 3-h and daily soil moisture were calculated based on the hourly simulations of the five experiments.Figure 8 clearly shows that the SMAP soil moisture reflects a good response to the CMA-recorded precipitation process.After two days of continuous rainfall on 18 and 19 August, the watershed mean value of the SMAP soil moisture started to increase, reached its maximum on 20 August and subsequently decreased as the rainfall ceased.Compared to the SMAP soil moisture, the simulated soil moisture generally showed similar variation tendencies.Because the different precipitation peaked at different times, the different simulated soil moisture reached the highest values at different hours.The watershed mean soil moisture simulated by the P_CMA, P_CMORPH and P_WRF experiments was very similar.For the overestimation of the WRF-predicted precipitation, the soil moisture simulated by the P_WRF experiment showed the highest peak value.The soil moisture simulated by the P_GPM experiment exhibited very low values because the GPM IMERG precipitation was generally underestimated.With the assimilation of the GPM IMERG, the soil moisture simulated by the P_4D-Var experiment was evidently lower than that simulated by the P_WRF experiment, and became much closer to the simulated results of P_GPM.
Remote Sens. 2018, 10, x FOR PEER REVIEW 15 of 28 was evidently lower than that simulated by the P_WRF experiment, and became much closer to the simulated results of P_GPM.

The Simulaed Outlet Discharge
The simulated hourly discharges and the accumulated daily discharges at the WJB watershed outlet from the five experiments suggested similar tendencies with the observed discharges (Figure 9a,b).The peaks of the hourly simulated discharges appeared at different time due to the different occurrences of the heavy precipitation.The discharges simulated by the P_WRF experiment were evidently larger than other observed and simulated results, as the WRF-predicted precipitation was overestimated.In contrast, the discharges simulated by the GPM IMERG were generally lower than the observed discharges for its underestimation.After assimilating the GPM IMERG, the P_4D-Var experiment yielded better simulations than the P_WRF experiment.The accumulated discharges of the five experiments are shown in Figure 9c,d.It was shown that the P_CMORPH experiment generated an obviously better simulation of the outlet discharges than the P_GPM experiment.As time passed, the accumulated discharges from the P_4D-Var experiment were closer to the accumulated observations than that from the P_CMORPH experiment; this may be caused by that the overestimations of the P_4D-Var experiment after the rainfall supplemented its underestimations before the rainfall.The discharges extracted from the P_CMA experiment were the closest to the observations because of better precipitation.For the hourly accumulated results, the total simulated discharges during the entire study period for the P_CMA, P_GPM, P_CMORPH and P_4D-Var experiments were 0.328%, 13.680%, 5.667%, and 0.973% lower than the total observed discharge, respectively, and 4.849% higher for the P_WRF.

The Simulaed Outlet Discharge
The simulated hourly discharges and the accumulated daily discharges at the WJB watershed outlet from the five experiments suggested similar tendencies with the observed discharges (Figure 9a,b).The peaks of the hourly simulated discharges appeared at different time due to the different occurrences of the heavy precipitation.The discharges simulated by the P_WRF experiment were evidently larger than other observed and simulated results, as the WRF-predicted precipitation was overestimated.In contrast, the discharges simulated by the GPM IMERG were generally lower than the observed discharges for its underestimation.After assimilating the GPM IMERG, the P_4D-Var experiment yielded better simulations than the P_WRF experiment.The accumulated discharges of the five experiments are shown in Figure 9c,d.It was shown that the P_CMORPH experiment generated an obviously better simulation of the outlet discharges than the P_GPM experiment.As time passed, the accumulated discharges from the P_4D-Var experiment were closer to the accumulated observations than that from the P_CMORPH experiment; this may be caused by that the overestimations of the P_4D-Var experiment after the rainfall supplemented its underestimations before the rainfall.The discharges extracted from the P_CMA experiment were the closest to the observations because of better precipitation.For the hourly accumulated results, the total simulated discharges during the entire study period for the P_CMA, P_GPM, P_CMORPH and P_4D-Var experiments were 0.328%, 13.680%, 5.667%, and 0.973% lower than the total observed discharge, respectively, and 4.849% higher for the P_WRF.

Evaluation of the Different Precipitation Datasets
The results of the point-scale evaluation between the daily data from the five different precipitation datasets and the daily CMWR in situ observations are shown in Table 6.It is clear that the ME and RE of the WRF precipitation predictions are 0.159 mm and 0.037, respectively, while the MEs and REs for the other precipitation obtained from the CMA, GPM IMERG, merged CMORPH and WRF 4D-Var are all negative.This indicated that the daily precipitation predicted by the WRF model was overestimated.The WRF predicted daily precipitation also had the highest RMSE (13.411), which presented the highest deviation from the CMWR data.The CC value of the WRF daily precipitation is only 0.343.The daily GPM IMERG has the best CC value (0.493).The CC values of the daily precipitation from the CMA, merged CMORPH and WRF 4D-Var system all passed the level of 0.4.

Evaluation of the Different Precipitation Datasets
The results of the point-scale evaluation between the daily data from the five different precipitation datasets and the daily CMWR in situ observations are shown in Table 6.It is clear that the ME and RE of the WRF precipitation predictions are 0.159 mm and 0.037, respectively, while the MEs and REs for the other precipitation obtained from the CMA, GPM IMERG, merged CMORPH and WRF 4D-Var are all negative.This indicated that the daily precipitation predicted by the WRF model was overestimated.The WRF predicted daily precipitation also had the highest RMSE (13.411), which presented the highest deviation from the CMWR data.The CC value of the WRF daily precipitation is only 0.343.The daily GPM IMERG has the best CC value (0.493).The CC values of the daily precipitation from the CMA, merged CMORPH and WRF 4D-Var system all passed the level of 0.4.The results of the field-scale evaluation between the two-day accumulated heavy precipitation of the five different precipitation and that of the CMWR data are shown in Figure 10.It clearly shows that as the precipitation threshold increases, the skill scores deviate further from their perfect values.This indicated that all the five applied datasets had great difficulty in estimating heavy precipitation.The CMA data had equivalent values with the CMWR data as the BIAS was near 1 before the threshold of 40 mm; however, it then started to decrease which indicated that the CMA data underestimated the heavy rain.For the GPM IMERG data, before the threshold of 70 mm, the rainfall estimations were always lower than those of the CMWR data; however, after the threshold of 70 mm, the rainfall estimations began to be overestimated, and then the bias began to decrease after the threshold surpassed 95 mm.This reflected that the GPM IMERG data were underestimated for most of the thresholds.The merged CMORPH data had the lowest deviation from the CMWR data; the BIAS values were generally lower and stayed nearest to 1 before the threshold of 90 mm.The underestimations of the heavy rainfall from the GPM IMERG and the merged CMORPH were related to the weak ability of detector to measure heavy precipitation under complicated atmospheric conditions, and the homogenization of heavy rainfall by coarse grids [82,118].The POD, FAR and TS values of the merged CMORPH were generally higher than the GPM IMERG; this showed better estimation of the heavy precipitation, possibly related to its gauge correction.For the WRF prediction, its BIAS values showed a significant deviation from the reference value of 1, and an obvious precipitation overestimation can be found above the threshold of 50 mm.However, after assimilating the GPM IMERG, the overestimations of the WRF model for the heavy rain were well controlled, and its BIASs were obviously reduced.The skill scores of the WRF 4D-Var-predicted precipitation were closer to the scores of the GPM IMERG data.This indicated that the assimilated data were the key factor to affect the final assimilation results.The generally underestimations of the heavy rain in the GPM IMERG resulted in the weaker detection of heavy rain in the WRF 4D-Var system.At the threshold of 50 mm, the TS values for the heavy precipitation obtained from the CMA, GPM IMERG, merged CMORPH, WRF model and WRF 4D-Var system were 0.267, 0.096, 0.216, 0.364, and 0.032, respectively.Based on the point-scale and field scale evaluation of the five precipitation datasets, it was concluded that although having coarser spatial resolution, the CMA data comprehensively had the best accuracy since they were in situ gauged observations.Because of gauge correction, the accuracy of the hourly merged CMORPH was better than the 30-min GPM IMERG.Although having the highest spatiotemporal resolutions, the accuracy of the WRF-predicted precipitation was worst because of uncertainties primarily introduced from its incomplete forcing data.The 4D-Var data assimilation could effectively improve the accuracy of the WRF prediction.However, the WRF 4D-Var-predicted precipitation was still not good, because the quality of the assimilated GPM IMERG was poor during the entire study period in the WJB watershed.

Evaluation of the Simulated Soil Moisture
The evaluated results of the 3-h averaged simulated soil moisture for the five experiments are shown in Figure 11.It is shown that most CC values between the simulated soil moisture and the SMAP soil moisture surpassed 0.6.The lower CC values of each experiment were mainly concentrated in the middle part of the WJB watershed, which was near the outlet, and this possibly resulted from the convergence scheme of the TOPX model.There were 412 grids with a 9-km resolution joined the correlation statistics in the WJB watershed.Except for the P_GPM and P_4D-Var experiments, 100% of the total grids for the other three experiments had CC values that were statistically significant at the 0.05 level.The 12 grids with non-significant CC values in the P_GPM experiment (Figure 11b) may be caused by the precipitation underestimation of the GPM IMERG.Based on the point-scale and field scale evaluation of the five precipitation datasets, it was concluded that although having coarser spatial resolution, the CMA data comprehensively had the best accuracy since they were in situ gauged observations.Because of gauge correction, the accuracy of the hourly merged CMORPH was better than the 30-min GPM IMERG.Although having the highest spatiotemporal resolutions, the accuracy of the WRF-predicted precipitation was worst because of uncertainties primarily introduced from its incomplete forcing data.The 4D-Var data assimilation could effectively improve the accuracy of the WRF prediction.However, the WRF 4D-Var-predicted precipitation was still not good, because the quality of the assimilated GPM IMERG was poor during the entire study period in the WJB watershed.

Evaluation of the Simulated Soil Moisture
The evaluated results of the 3-h averaged simulated soil moisture for the five experiments are shown in Figure 11.It is shown that most CC values between the simulated soil moisture and the SMAP soil moisture surpassed 0.6.The lower CC values of each experiment were mainly concentrated in the middle part of the WJB watershed, which was near the outlet, and this possibly resulted from the convergence scheme of the TOPX model.There were 412 grids with a 9-km resolution joined the correlation statistics in the WJB watershed.Except for the P_GPM and P_4D-Var experiments, 100% of the total grids for the other three experiments had CC values that were statistically significant at the 0.05 level.The 12 grids with non-significant CC values in the P_GPM experiment (Figure 11b) may be caused by the precipitation underestimation of the GPM IMERG.7. It is shown that at the 3-h timescale, the P_CMORPH experiment had the highest mean value of 0.885 and the highest maximum value of 0.992, while the CC of the P_GPM showed the lowest value of 0.695.This indicated that the merged CMORPH precipitation data could yield better soil moisture simulations than could the GPM IMERG, because of the accuracy improvement of the merged CMORPH for merging gauged data.The negative values of the minimum CC appeared in the P_GPM and P_4D-Var experiments, and their mean CC values were also lower than those of the other experiments.These differences were caused by the obvious underestimation of the GPM IMERG.At the daily scale, the minimum values of the CC were all significantly improved except for that of the P_WRF, and the maximum and mean values of the CC in all five experiments showed the same variation as that at the 3-h scale.The good correlations between the simulated soil moisture and the SMAP soil moisture ensured the rationalities of the subsequent simulations of runoff and outlet discharge.The statistics of the CC values of the 3-h and daily mean soil moisture for the five experiments are listed in Table 7.It is shown that at the 3-h timescale, the P_CMORPH experiment had the highest mean value of 0.885 and the highest maximum value of 0.992, while the CC of the P_GPM showed the lowest value of 0.695.This indicated that the merged CMORPH precipitation data could yield better soil moisture simulations than could the GPM IMERG, because of the accuracy improvement of the merged CMORPH for merging gauged data.The negative values of the minimum CC appeared in the P_GPM and P_4D-Var experiments, and their mean CC values were also lower than those of the other experiments.These differences were caused by the obvious underestimation of the GPM IMERG.At the daily scale, the minimum values of the CC were all significantly improved except for that of the P_WRF, and the maximum and mean values of the CC in all five experiments showed the same variation as that at the 3-h scale.The good correlations between the simulated soil moisture and the SMAP soil moisture ensured the rationalities of the subsequent simulations of runoff and outlet discharge.

Evaluation of the Simulated Outlet Discharge
The evaluated results of the simulated hourly and the accumulated daily outlet discharge from the five rainfall-runoff experiments are shown in Table 8.Compared to the evident differences among the five precipitation datasets, the discrepancies among the five hydrological modelling experiments driven by them were narrowed.This may be influenced by the watershed size, because larger watershed generally had a lower magnitude-of-difference compared to the smaller watershed [119][120][121][122]. Table 8 shows that the CC values between the simulated discharges and the observed discharges are all above 0.787 and 0.796 at the hourly and daily timescales, respectively.Except for the RE of the P_WRF experiment, the REs of the other experiments were all negative, which means that the simulated outlet discharges were underestimated overall.Considering the comprehensive evaluation index NS, the P_CMA, P_GPM, P_CMORPH, P_WRF and P_4D-Var experiments reached 0.658, 0.576, 0.596, 0.464 and 0.547, respectively, on an hourly scale.These NS values indicated that for the rainfall-runoff simulations in the WJB watershed, the in situ CMA precipitation data had the best effectiveness in driving the TOPX model.As for the general underestimations of the heavy precipitation from the GPM IMERG and the merged CMORPH, their simulated discharges were lower than the observed values, there RE values were both negative.Because of better accuracy in estimating precipitation, especially for heavy precipitation, the P_CMORPH experiment performed better than the P_GPM experiment.The NS of the P_4D-Var experiment was 0.083 (hourly) and 0.061 (daily) higher than that of the P_WRF experiment, and it was clear that the 4D-Var data assimilation with the GPM IMERG improved the hydrological modelling performance through enhancing the accuracy of the WRF 4D-Var-predicted precipitation.However, the performance of the P_4D-Var experiment was only higher than the P_WRF experiment; this mainly resulted from the poor quality of the GPM IMERG over the study period in the WJB watershed.It was concluded that the model performance was affected by the precipitation accuracy to a large extent; the higher the precipitation accuracy, the better the model performed.

Conclusions
Precipitation is a very important component in water cycle.In order to investigate the effectivenesses of different precipitation datasets on hydrological modelling, five different precipitation datasets were used to simulate a two-week runoff process after a heavy rainfall event in the WJB watershed (30,000 km 2 ).The five precipitation datasets contained one traditional in situ observation, i.e., the CMA data, two satellite precipitation products, i.e., the GPM IMERG and the merged CMORPH, and two NWP-predicted precipitation data, i.e., predictions from the WRF model and the WRF 4D-Var system.According to the requirement of the applied TOPX model, the five precipitation datasets were processed to 1-km grid data and evaluated with the daily CMWR data at point and field scales.The evaluated results suggested that the accuracies of the precipitation datasets from the CMA data, merged CMORPH, GPM IMERG, WRF 4D-Var system and WRF model generally decreased in sequence.The methods of gauge correction and 4D-Var data assimilation could improve the accuracies of the merged CMORPH and the WRF 4D-Var-predicted precipitation.
The soil moisture generated in the hourly rainfall-runoff simulations was evaluated to guarantee the rationalities of the hydrological modelling.In comparisons with the SMAP soil moisture, the watershed average CC values of the 3-h mean soil moisture from the P_CMA, P_GPM, P_CMORPH, P_WRF and P_4D-Var experiments reached 0.825, 0.695, 0.885, 0.841 and 0.774, respectively.The evaluations suggested that the spatiotemporal variations of the soil moisture were closely related to the variations of precipitation.Finally, the hourly simulated and daily accumulated outlet discharges were assessed.The NS values for the hourly simulated outlet discharges from the P_CMA, P_GPM, P_CMORPH, P_WRF and P_4D-Var experiments were 0.658, 0.576, 0.596, 0.464 and 0.547, respectively.These investigations demonstrated that the accuracy of precipitation data was a crucial factor to influence the performance of hydrological modelling.For this study case, all five precipitation datasets could yield reasonable hydrological simulations.The traditional in situ-observed precipitation was still the optimum dataset to simulate the studied rainfall-runoff process.The remotely sensed precipitation products of the GPM IMERG and the merged CMORPH were the secondary options.As the accuracy of the merged CMORPH was better than the GPM IMERG, the P_CMORPH experiment performed better than the P_GPM experiment.The performances driven by the NWP-predicted precipitation were the worst.As the WRF 4D-Var-predicted precipitation was obviously improved by the 4D-Var data assimilation method, the performance of the P_4D-Var experiment outperformed the P_WRF experiment, but because of the poor quality of the assimilated GPM IMERG over the study period in the WJB watershed, the performance of the P_4D-Var experiment was still not good.Despite lower effectivenesses in hydrological modelling, the precipitation datasets of the remotely sensed and the NWP-predicted are undoubtedly valuable and deserve further research, as the accuracies of the two datasets have been improved with the development of remote sensing, data merging, numerical simulation and data assimilation technologies.Moreover, the remotely sensed precipitation data are particularly indispensable in un-gauged areas, and the NWP model can forecast precipitation in the future, thus realize flood warning and other related water resource management in a real time.
For future studies, other data assimilation methods that are not as time-consuming as the 4D-Var can be used to simulate long-term rainfall-runoff processes or more short-term flood events.Moreover, many other remote sensing precipitation products can be assimilated into the WRF model because the GPM IMERG generally underestimates precipitation.In addition, other hydrological models constructed on the different bases of runoff generation and routing can be applied in related studies.

Figure 2 .
Figure 2. (a) Contribution (%) of the accumulated daily precipitation to the annual total precipitation for each precipitation station in the WJB watershed in 2015; the data were provided by the China Ministry of Water Resources (CMWR); (b) box plot * of daily precipitation from the 215 CMWR precipitation stations.* The lower and upper edges of the central box represent the first and third quartiles (25% and 75%, respectively), and the band inside the box represents 50%.

Figure 2 .
Figure 2. (a) Contribution (%) of the accumulated daily precipitation to the annual total precipitation for each precipitation station in the WJB watershed in 2015; the data were provided by the China Ministry of Water Resources (CMWR); (b) box plot * of daily precipitation from the 215 CMWR precipitation stations.* The lower and upper edges of the central box represent the first and third quartiles (25% and 75%, respectively), and the band inside the box represents 50%.

Figure 3 .
Figure 3. (a) The 1-km grid data of the topographic index (TI) of the WJB watershed; and (b) the applied soil type classification of the Harmonized World Soil Database (HWSD) used in the calculation of TI.

Figure 4 .
Figure 4. Long-term calibration (a) and validation (b) of the TOPX model in the WJB watershed and its short-term calibration (c-g) and validation (h,i).Discharge (mm 3 /s) in the figure indicates discharge at the WJB hydrology station; precipitation (mm) denotes the average precipitation in the WJB watershed that was calculated based on the CMA observations.

Figure 4 .
Figure 4. Long-term calibration (a) and validation (b) of the TOPX model in the WJB watershed and its short-term calibration (c-g) and validation (h,i).Discharge (mm 3 /s) in the figure indicates discharge at the WJB hydrology station; precipitation (mm) denotes the average precipitation in the WJB watershed that was calculated based on the CMA observations.

Figure 5 .
Figure 5. 1-km grid data of the total precipitation (mm) obtained from the precipitation datasets of the CMWR (a), CMA (b), GPM IMERG (c), merged CMORPH (d), WRF model (e) and WRF 4D-Var system (f).The minimum (Min), maximum (Max), mean values and the standard deviation (Std dev.) of the total precipitation are listed in the upper right corner of each graph.

Figure 5 . 1 -
Figure 5. 1-km grid data of the total precipitation (mm) obtained from the precipitation datasets of the CMWR (a), CMA (b), GPM IMERG (c), merged CMORPH (d), WRF model (e) and WRF 4D-Var system (f).The minimum (Min), maximum (Max), mean values and the standard deviation (Std dev.) of the total precipitation are listed in the upper right corner of each graph.

Figure 6 .
Figure 6.Averages of the hourly (a) and daily (b) precipitation (mm) in the WJB watershed obtained from different precipitation datasets during the study period.

Figure 6 .
Figure 6.Averages of the hourly (a) and daily (b) precipitation (mm) in the WJB watershed obtained from different precipitation datasets during the study period.
).The WJB watershed mean values of the 3-h soil moisture simulated by the P_CMA, P_GPM, P_CMORPH, P_WRF and P_4D-Var experiments were 95.927 mm, 87.110 mm, 92.321 mm, 94.513 mm and 86.461 mm, respectively, and the watershed mean value of the SMAP soil moisture was 0.286 mm 3 /mm 3 .

Figure 7 .
Figure 7. 9-km grid data of the average 3-h soil moisture in the WJB watershed extracted from the SMAP product (a) and the P_CMA (b), P_GPM (c), P_CMORPH (d), P_WRF (e), and P_4D-Var (f) experiments during the study period.

Figure 8 .
Figure 8.The 3-h (a) and daily (b) mean values of the WJB watershed soil moisture obtained from the SMAP (mm 3 /mm 3 ) and the different rainfall-runoff experiments based on the TOPX model (mm).

Figure 8 .
Figure 8.The 3-h (a) and daily (b) mean values of the WJB watershed soil moisture obtained from the SMAP (mm 3 /mm 3 ) and the different rainfall-runoff experiments based on the TOPX model (mm).

Figure 9 .
Figure 9.The results of the simulated hourly discharges (a), daily discharges (b), accumulated hourly discharges (c) and accumulated daily discharges (d) at the WJB watershed outlet.

Figure 9 .
Figure 9.The results of the simulated hourly discharges (a), daily discharges (b), accumulated hourly discharges (c) and accumulated daily discharges (d) at the WJB watershed outlet.

Figure 10 .
Figure 10.Skill scores of BIAS (a), POD (b), FAR (c) and TS (d) for the grid-scale evaluation of the accumulated heavy precipitation between the CMWR data and the other data from the CMA, GPM IMERG, merged CMORPH, WRF model and WRF 4D-Var system.

Figure 10 .
Figure 10.Skill scores of BIAS (a), POD (b), FAR (c) and TS (d) for the grid-scale evaluation of the accumulated heavy precipitation between the CMWR data and the other data from the CMA, GPM IMERG, merged CMORPH, WRF model and WRF 4D-Var system.

Figure 11 .
Figure 11.Pearson's correlation coefficient (CC) between the SMAP soil moisture and the soil moisture simulated by the rainfall-runoff experiments of P_CMA (a), P_GPM (b), P_CMORPH (c), P_WRF (d) and P_4D-Var (e) at 3-h intervals.The point denotes the CC value, which is not statistically significant at the level of 0.05.

Figure 11 .
Figure 11.Pearson's correlation coefficient (CC) between the SMAP soil moisture and the soil moisture simulated by the rainfall-runoff experiments of P_CMA (a), P_GPM (b), P_CMORPH (c), P_WRF (d) and P_4D-Var (e) at 3-h intervals.The point denotes the CC value, which is not statistically significant at the level of 0.05.

Table 1 .
The main configuration of the WRF model.

Table 2 .
Statistical results of the simulated daily discharges in the WJB watershed during the calibrations and validations of the hydrological model TOPX *.
* The indices of NS, CC and RE denote Nash-Sutcliffe coefficient, Pearson's correlation coefficient and relative error, respectively.

Table 2 .
Statistical results of the simulated daily discharges in the WJB watershed during the calibrations and validations of the hydrological model TOPX *.

Table 3 .
Main parameters in the hydrological model TOPX.

Table 4 .
Experimental design for the rainfall-runoff simulations with different precipitation datasets.
3.3.Evaluation of Precipitation, Soil Moisture and Outlet Discharge

Table 5 .
Statistical metrics applied in the evaluations *.

Table 6 .
Error scores of mean error (ME), relative error (RE), root mean square error (RMSE) and Pearson's correlation coefficient (CC) for the point-scale precipitation evaluations between the daily CMWR data and the daily grid data with 1-km resolution from different precipitation data.

Table 6 .
Error scores of mean error (ME), relative error (RE), root mean square error (RMSE) and Pearson's correlation coefficient (CC) for the point-scale precipitation evaluations between the daily CMWR data and the daily grid data with 1-km resolution from different precipitation data.

Table 7 .
Statistics of Pearson's correlation coefficient between the SMAP soil moisture and the soil moisture simulated by the rainfall-runoff experiments at 3-h and daily time scales.

Table 7 .
Statistics of Pearson's correlation coefficient between the SMAP soil moisture and the soil moisture simulated by the rainfall-runoff experiments at 3-h and daily time scales.

Table 8 .
Evaluation results of the hourly and daily outlet discharges simulated by the different rainfall-runoff experiments *.