Dissecting Performances of PERSIANN-CDR Precipitation Product over Huai River Basin, China

: Satellite-based precipitation products, especially those with high temporal and spatial resolution, constitute a potential alternative to sparse rain gauge networks for multidisciplinary research and applications. In this study, the validation of the 30-year Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks-Climate Data Record (PERSIANN-CDR) daily precipitation dataset was conducted over the Huai River Basin (HRB) of China. Based on daily precipitation data from 182 rain gauges, several continuous and categorical validation statistics combined with bias and error decomposition techniques were employed to quantitatively dissect the PERSIANN-CDR performance on daily, monthly, and annual scales. With and without consideration of non-rainfall data, this product reproduces adequate climatologic precipitation characteristics in mean total positive false the daily large underestimation of high percentile data overestimating percentile (error from high precipitation), of timescale, total Mean-Square-Error ( TMSE HB daily 1 mm / of biases HRB the of grids, percentile-based biases positive below 0.09 mm / day (4%) between P0 and P4 . After P4 , biases become negative, and magnitudes and spatial variability become larger as percentiles These suggest that low daily precipitation, while high daily precipitation is underestimated, for precipitation above P8, with biases lower than − 0.20 mm / day − 5%).

significant regional differences [14]. In China, the Huai River Basin (HRB) is a very important food production region-where the arable area and crop yields are about 11.7% and 17.4% of the nation's total, respectively [56]. However, this basin frequently suffers from droughts, with drought-affected arable areas exceeding 20,000 km 2 in 14 years during the period of 1949-2000 and local and regional droughts occurring nearly every year. Statistics have indicated that the accumulative arable area affected by drought disasters in the HRB has increased by 53% between 1991 and 2000 relative to the period of [1981][1982][1983][1984][1985][1986][1987][1988][1989][1990], implying that droughts have intensified and occurred more frequently [57]. In addition, the HRB is a typical water-deficient region in China: Its water resources per capita are lower than a quarter of the national average [58]. Therefore, reasonably planning water resource management and agricultural irrigation schedules are very important for regional and even national food security and food production capacity. In short, reliable, long-term (records generally longer than 30 years), continuous, and relatively high spatiotemporal resolution precipitation information is of critical significance for practical applications in the HRB; and then the PERSIANN-CDR precipitation product is selected here for evaluation based on a high density (182) of observation sites. Many studies have assessed PERSIANN-CDR across the world, including some regions of China [59][60][61][62][63][64][65]. However, satellite-based precipitation products usually vary in biases, errors, and detection capabilities among regions, timescales, and rainfall intensities [38]. As a result, several questions should be answered before the use of PERSIANN-CDR in the HRB: (1) How does this product perform according to various continuous and categorical metrics? (2) Does PERSIANN-CDR performance differ among temporal scales (i.e., daily, monthly, and annual) and precipitation thresholds (i.e., from the first to ninth percentiles)? (3) To what extent do bias and error sub-components, which are obtained by related decomposition techniques, play a role in continuous metrics? The analyses of this study will serve as a valuable reference for decision-makers to mitigate the adverse impacts of water deficit in the HRB and even for algorithm developers of PERSIANN-CDR.

Study Region
The study region is the HRB (between 30-39 • E and 111-123 • E), located in eastern China between the Yellow River and the Yangtze River (Figure 1). Covering a drainage area of approximately 33,000 km 2 , the HRB mainly includes the northern parts of Jiangsu and Anhui, a small part of Hubei, and most of Shandong and Henan. As one of the ten water resource regions in China, the HRB has a vast plain with many lakes and depressions, and is moderately mountainous (general elevation ranging from 1000 to 2000 m above sea level) near the western boundary, mid-eastern part, and Shandong peninsula. A typical semi-humid monsoon climate with warm temperatures prevails in this basin, with regional average annual temperature and precipitation of about 14 • C and 806 mm, respectively. In the HRB, precipitation is concentrated in the Meiyu rain season of China (from mid-May to mid-October) and has large annual and seasonal variability [66]. Because of the unique climate and specific surface conditions, the HRB often suffers from flooding, waterlogging, drought, and storm surge disasters, especially in its middle and lower parts [66].

Data
Observed daily precipitation data from 1960 to 2012 at more than 200 gauge sites were collected from the China Meteorological Administration (CMA) to validate the PERSIANN-CDR precipitation product. Two data quality issues-missing values and inhomogeneity-were first resolved. Days with missing values were determined for each year and each site. Sites with data available for less than 330 days per year were removed, and missing values of the remaining sites were filled with data from nearby sites using the bilinear regression. Subsequently, the analyses of time series homogeneity were performed using the Pettitt test [67], and the sites with time series not passing the significance test (p< 0.05) were eliminated. After the data quality control, records from 182 sites remained (Figure 1).
To match the gridded PERSIANN-CDR data, we followed Katiraie-Boroujerdy et al. [68] and processed the 182 sites into 176 grid boxes of 0.25 • × 0.25 • longitude by latitude. The final observational value for a certain grid box was computed by averaging the daily records of the gauge site(s) within the given grid. The sites and their corresponding grid boxes are shown in Figure 1. The PERSIANN-CDR precipitation product is a new retrospective satellite-based dataset that was developed for the U.S National Climatic Data Center (NCDC) Climate Data Record program in NOAA [29]. This dataset provides near-global (60 • S-60 • N) daily precipitation estimates from 1983 to the present with a spatial resolution of 0.25 • × 0.25 • . The PERSIANN-CDR dataset is available for public use from the U.S NOAA National Centers for Environment Information (NCEI) at https:// www.ncdc.noaa.gov/cdr/atmospheric/precipitation-persiann-cdr and the Centre for Hydrometeorology and Remote Sensing (CHRS) Data Portal at http://chrsdata.eng.uci.edu. For more details about the PERSIANN-CDR algorithm, readers can refer to Ashouri et al. [29]. In this study, the analysis time period is 1983-2012, when both the PERSIANN-CDR product and direct observations are available.

Continuous metrics
Total bias (TB): This metric measures the average tendency of PERSIANN-CDR precipitation to be larger or smaller than the observation (OBS), and it has an optimal value of 0. Positive and negative indicate overestimation and underestimation, respectively.
Total Mean-Square-Error (TMSE): For conveniently decomposing error, TMSE is used here. This indicator, which fluctuates between 0 (an optimal value) and +∞, describes the accuracy of precipitation estimates with the same efficiency as Root-Mean-Square-Error [69].
R measures the strength of the linear relationship between PERSIANN-CDR and OBS precipitation, with a range between −1 and 1 (a perfect value).
Kling-Gupta Efficiency (KGE; [69]): KGE is a relatively new indicator that is used to measure the overall performance of PERSIANN-CDR. KGE can comprehensively reflect the product's capability of reproducing temporal dynamics (measured by R) and the volume and distribution of precipitation (measured by α and β, respectively). This index ranges between −∞ and 1, of which KGE = 1 implies a perfect overall performance.

Categorical metrics
Probability of Detection (POD): POD gives the fraction of precipitation occurrences that are correctly detected by the PERSIANN-CDR product. It ranges from 0 to an optimal value of 1.
False alarm ratio (FAR): FAR gives the fraction of events for which PERSIANN-CDR detects precipitation that is not observed. Its ranges are between 0 (a perfect value) and 1.
Equitable Threat Score (ETS): The ETS metric indicates the fraction of observed and/or estimated events that are correctly predicted, adjusted by the frequency of hits that would be expected to occur simply by random chance. ETS is bounded by −1/3 and 1 (a perfect score). A value of 0 represents no skill, but negative values indicate that the chance estimation of an event is better than that of the selected product [70].
Heidke Skill Score (HSS): HSS, ranging from −∞ to 1 (an optimal score), shows the accuracy of estimates while accounting for matches, due to random chance. A negative value implies that random chance is better than the selected product's estimates, and a value of 0 means that the product has no skill.

Hanssen-Kuipers Discriminant (HK):
HK measures how well the satellite estimates discriminate between precipitation and no-precipitation events. This metric has a range between −1 and 1, of which a perfect performance and no skill correspond to 1 and 0, respectively.
Frequency Bias Index (FBI): FBI reveals systematic differences between precipitation event frequency in OBS and PERSIANN-CDR. It can indicate whether there is a tendency to underestimate (i.e., FBI < 1) or overestimate (i.e., FBI > 1) precipitation events and ranges from 0 to infinity, with a perfect score of 1.
Note: S i is the PERSIANN-CDR precipitation value of the ith data pair at a given temporal (i.e., daily, monthly, or yearly) scale, and O i is the OBS value. α = S/O and β = σ S /σ O , where S and σ S (O and σ O ) are the mean and standard deviation of PERSIANN-CDR precipitation (OBS), respectively. A = number of hits, C = number of false alarms, D = number of misses, E = number of corrective negatives, N = number of data pairs, and Ar = (A + C)·(A + D)/N represents hits that could occur by chance. The definitions of A, C, D and E can be found in Table 2.
Note: In view of satellite-based precipitation estimates' higher uncertainties at low precipitation rates, the threshold is usually considered to be between 1 and 2 mm/day. In accordance with Ebert et al. [36] and Lo-Conti et al. [32], the threshold here is specified to be 1 mm/day, below which all estimated precipitation is considered to be 0 or non-rainfall.

Bias Decomposition
To determine the impacts of the PERSIANN-CDR product's detection capability, we first decomposed the TB into three sub-components: The bias in successful detections (i.e., hits; HB or percent HB (pHB)), the bias, due to rainfall misses (MB or percent MB (pMB)), and the bias, due to false detections (FB or percent FB (pFB)), i.e., where n H , n M , and n F are the number of samples corresponding to conditions of S i > 0 mm/day, and O i > 0 mm/day, S i = 0 mm/day and O i ≥ 0 mm/day, and S i ≥ 0 mm/day and O i = 0 mm/day, respectively. HB can be further divided into two parts, HB 0-1 and HB ≥1 ; they reflect hit biases in successful detections of rainfall events with daily precipitation of 0-1 mm/day and ≥1 mm/day, respectively.
Additionally, the non-false bias (i.e., NFB = HB ≥1 + MB) can also be separated into several components corresponding to different bins. This is useful for the in-depth understanding of biases from its distribution among various rain intensities. In this study, we selected daily and monthly OBS precipitation percentiles as the partition points of each bin, such as the bin between ith and (i+1)th percentiles (named [Pi, Pi+1)). This process is named percentile-based bias decomposition, and the sub-component for a certain bin can be expressed as where B Pi,i+1 represents bias within [Pi, Pi+1), whose number of samples is n Pi,i+1 . Notably, the daily percentiles are obtained using data in which OBS daily precipitation ≥1 mm/day (so, the daily P0 value is 1 mm/day). The percent values of HB, MB, FB, NFB, and B Pi,i+1 (named pHB, pMB, pFB, pNFB, and pB Pi,i+1 ) can be calculated by dividing by the OBS mean (i.e., O).Error Decomposition According to previous studies (e.g., Reference [35]), the TMSE includes two components, i.e., systematic (MSE s ) and random errors (MSE r ), which are calculated by Similar to bias decomposition, we can also get detection-based and percentile-based error sub-components.
Detection-based Error Sub-components: whereŜ j is defined by the least-square linear regression, i.e.,Ŝ j = a + b·O j , with a and b denoting the intercept and slope, respectively. It is noteworthy that the regression equation for each grid box is established using all daily, monthly, or annual samples. Following HB decomposition, HMSE is also broken into HMSE 0-1 and HMSE ≥1 for respectively reflecting hit errors in successful detections of rainfall events with 0-1 mm/day and ≥1 mm/day. Then, non-false MSE (i.e., NFMSE) can be expressed as the sum of HMSE ≥1 and MMSE.
Percentile-based Error Sub-components: where HMSE, MMSE, and FMSE are errors in successful detections, errors, due to rainfall misses, and errors, due to false detection, respectively. MSE Pi,i+1 represents the error for the bin of [Pi, Pi+1), as defined as above. The percent values of HMSE, MMSE, FMSE, NFMSE, and MSE Pi,i+1 (named pHMSE, pMMSE, pFMSE, pNFMSE, and pMSE Pi,i+1 ) can be obtained by dividing its value by TMSE. Notably, in this study, we calculated these selected validation metrics at 176 grid boxes, and utilize their means to detect PERSIANN-CDR performances in the HRB. Moreover, the box-plots for each validation metric in the following text are drawn using their estimates at all the grid boxes.

Climatologic Characteristics of Precipitation
As a first step, the 30-year mean daily, monthly, and annual P all of OBS and PERSIANN-CDR were calculated by averaging over the HRB (  Figure 2(a1,a2) illustrates the results using the multi-year mean daily (monthly) data points. The OBS result reveals obvious seasonality, with higher values of P all (>3.50 mm/day or >105.00 mm/month) from June to August, accompanied by greater spatial variability (larger spread between the first and third quantiles), and a peak in July (6.39 mm/day or 185.00 mm/month). The remaining months correspond to lower values (<3.00 mm/day or <71.00 mm/month) and relatively smaller spatial variability. Intuitively, PERSIANN-CDR can effectively reproduce the intra-annual changes in P all , with a maximum (7.00 mm/day or 192.51 mm/month) in July and greater spatial variability. Figure 3(a1,b1,c1) show the spatial patterns of OBS multi-year mean P all on daily, monthly, and annual scales, respectively. It is evident that despite different temporal scales, OBS P all is generally characterized by a decrease from the southeast to northwest of the HRB. Comparison between OBS and PERSIANN-CDR (Figure 3(a1) vs. Figure 3(a2), Figure 3(b1) vs. Figure 3(b2), and Figure 3(c1) vs. Figure 3(c2)) suggests that PERSIANN-CDR can well capture the spatial distributions of climatological P all on daily, monthly, and annual scales, with a spatial R larger than 0.94. Also, the spatial pattern of daily or monthly P all for PERSIANN-CDR was compared with that of OBS for each month (not shown here). Except for August, when there is a reduction in OBS P all from the coast to inland, P all decreases from the southeast to northwest in the other moths. In each month, the spatial distribution of PERSIANN-CDR P all matches well with that of OBS on daily or monthly scales, with a spatial R between 0.71 and 0.96. This indicates that this product performs well in capturing the intra-annual cycle of daily and monthly P all in space. Note: P all values are calculated using all data (including data with rainfall and non-rainfall events), while P >0 (P ≥1 ) are obtained using data corresponding to OBS daily precipitation of >0 mm/day (≥1 mm/day). The values in parentheses refer to the number of selected data samples averaged over the HRB. We also compared multi-year means of OBS daily, monthly, and annual P >0 (P ≥1 ) averaged over the HRB with those of PERSIANN-CDR (Table 3). Relative to HRB P all , OBS multi-year mean daily P >0 (P ≥1 ) increases because of the decreased number of days with OBS daily precipitation larger than 0 (1 mm/day) and unchanged (slightly changed) accumulative precipitation values, which can be reflected by OBS multi-year mean monthly and annual P >0 (P ≥1 ; Table 3). Although PERSIANN-CDR multi-year mean daily P >0 (P ≥1 ) also increases compared with P all , its monthly and annual values decrease. This is mainly because that PERSIANN-CDR has lower daily estimates, and it missed rainfall events corresponding to OBS daily precipitation above 0 (1 mm/day). Similar to the intra-annual cycles of P all on daily and monthly scales, P >0 values ( Figure 2(a2,b2); P ≥1 , Figure 2(a3,b3)) are characterized by evident seasonality, i.e., an increase starting in January, a peak in July, and larger spatial variability in June-August. Moreover, OBS multi-year mean daily and monthly P >0 (P ≥1 ) values are larger than the PERSIANN-CDR values for each month. In terms of the spatial distributions of multi-year mean daily, monthly, and annual P >0 (Figure 3(a3) vs. Figure Figure 3(c6)), OBS and PERSIANN-CDR share similar features, i.e., P >0 (P ≥1 ) decreasing from the southeast to northwest, with a spatial R above 0.65. Moreover, PERSIANN-CDR can also better reproduce the spatial patterns of daily and monthly P >0 (P ≥1 ), with spatial R values ranging from 0.75 to 0.95 throughout the year (not shown).  Figure 4(a1,b1) reveals that more than 75% of grids show a positive daily TB (pTB), with an HRB mean of 0.25 mm/day (11%), which suggests the overestimation of PERSIANN-CDR on the daily scale across this study region. The values of HB, FB, and MB (Figure 4(a1); pHB, pFB, and pMB, Figure 4(b1)) are always negative, positive, and negative for each grid, respectively, with a regional mean of −0.55, 1.02, and −0.22 mm/day (−24%, 45%, and −10%). Consequently, PERSIANN-CDR daily TB (pTB) can be attributed to larger and positive FB (pFB). To further explore the sources of HB (pHB), we decomposed this metric into two sub-components, i.e., HB 0-1 (pHB 0-1 ) and HB ≥1 (pHB ≥1 ). Positive HB 0-1 (pHB 0-1 ) indicates PERSIANN-CDR overestimation for the bin of daily precipitation lower than 1 mm/day, while negative HB ≥1 (pHB ≥1 ) suggests PERSIANN-CDR underestimation for the bin of daily precipitation above 1 mm/day. According to the magnitudes of HB 0-1 (0.06 mm/day; pHB 0-1 , 3%) and HB ≥1 (−0.61 mm/day; pHB ≥1 , −27%), negative daily HB is mainly caused by larger underestimations for daily precipitation above 1 mm/day. In Figure 4(a2,b2), the results of percentile-based biases are presented. For the whole HRB and the overwhelming majority of grids, percentile-based biases are positive and below 0.09 mm/day (4%) between P0 and P4. After P4, biases become negative, and magnitudes and spatial variability become larger as percentiles increase. These findings suggest that PERSIANN-CDR overestimates low daily precipitation, while high daily precipitation is underestimated, especially for precipitation above P8, with biases lower than −0.20 mm/day (−5%).  Larger TB values (>0.20 mm/day) are found between January and July, especially in June and July, with a maximum of >0.40 mm/day, while lower TB (<0.20 mm/day) occurs in the remaining months ( Figure 4(c1)). For the detection-based sub-components for each month, HB, FB, and MB are negative, positive, and negative, respectively, corresponding to larger magnitudes of above 0.70, 1.20, and 0.20 mm/day between May and September. Despite the same signs (i.e., positive and negative), the characteristics of intra-annual cycles of pTB and its sub-components evidently differ from those of TB (Figure 4(d1)), mainly because of apparent differences in daily precipitation magnitudes among months (Figure 2(a1)). For example, pTB is characterized by larger values (>15%) in January-April and December and lower values (<15%) in the other seven months, with the maximum (44%) and minimum (3%) occurring in January and August, respectively. The pFB values in January-May and October-December are above 50%, while pHB (pMB) exhibit magnitudes larger than 30% (10%) in April-June (January-March and September-December). Comparison of HB, FB, and MB (Figure 4(c1); pHB, pFB, and pMB, Figure 4(d1)) indicates that the positive false biases are responsible for the overestimation of PERSIANN-CDR daily precipitation in each month. Throughout the year, HB 0-1 (pHB 0-1 ), always with a positive value below 0.10 mm/day (15%), indicates that the PERSIANN-CDR product overestimates daily precipitation below 1 mm/day, but daily precipitation above 1 mm/day is underestimated. This can be seen from negative HB ≥1 (pHB ≥1 ) ranging between −1.39 mm/day (−37%) and −0.13 mm/day (−22%). As shown in Figure 4(c2,d2)), PERSIANN-CDR overestimates daily precipitation below P4 for more than six months, with biases larger than 0.10 mm/day (around 2%) in July and August. Conversely, negative biases of daily precipitation above P4 are found in most (>6) months, and there is a larger magnitude above 0.24 mm/day or 16% (below 0.16 mm/day or 14%) for daily precipitation larger than P9 (between P6 and P8) between April-October (other months). In general, the negative daily non-false biases of the PERSIANN-CDR product in each month are due to its underestimation of higher precipitation. Figure 5 presents the results of error analyses using daily data. As depicted in Figure 5(a1), TMSE ranges from 35 to 73 (mm/day) 2 at half of the grids, with a regional mean of 62.14 (mm/day) 2 . For the systematic and random error sub-components (left of Figure 5(a1,b1)), MSE s (33.41 (mm/day) 2 ; pMSE s , 53%) is slightly larger than MSE r (28.73 (mm/day) 2 ; pMSE r , 47%). From the perspective of detection-based error sub-components (right of Figure 5(a1,b1)), HMSE (pHMSE) makes the largest contribution of 49.19 (mm/day) 2 (79%) to TMSE (pTMSE), followed by the secondary contributor, FMSE (pFMSE), with a value of 9.58 (mm/day) 2 (16%). In addition, the HMSE 0-1 (pHMSE 0-1 ) value is negligible, while the HMSE ≥1 (pHMSE ≥1 ) value is comparable to HMSE (pHMSE), suggesting that the PERSIANN-CDR error is caused by the larger error of precipitation estimates above 1 mm/day. As seen in Figure 5(a2,b2)), the percentile-based error increases as percentiles increase, and is generally characterized by lower values (<10 (mm/day) 2 ; <10%) below P9 and a maximum of 35.53 (mm/day) 2 (57%) above P9. Figure 5(c1) shows that the TMSE and its systematic and random sub-components, as well as detection-based error exhibit seasonality. They increase from January, peak in July, and then decrease. Although there exists seasonality for pMSE s and pMSE r (left of Figure 5(d1)), their major characteristics markedly different from those of the TMSE, i.e., larger pMSE s (>60%) in April-June, but larger pMSE r (>40%) in January-March and August-December. The pHMSE shows an intra-annual cycle similar to the TMSE, but with a maximum of 83% in June (right of Figure 5(c1,d1)), while pFMSE (pMMSE) values above 20% (10%) are detected in January-April, November, and December (January, February, and October-December). Except for in November, MSE s (pMSE s ) always contributes more to the error. Comparisons among the detection-based error sub-components for each month suggest that the largest error proportions exist in HMSE (pHMSE) because of the largest error of PERSIANN-CDR estimations of daily precipitation above 1 mm/day. Between March and October ( Figure 5(c2,d2)), the percentile-based error sub-components become larger as percentile increases; however, in the following four months, these variables first increase and then decrease, with the maximum arising between P7 and P9.

R and KGE Analyses
R all (calculated using all data) values are generally between 0.48 and 0.59 at the overwhelming majority of grids, and the HRB mean is 0.54 (Figure 6(a1)). This suggests that the PERSIANN-CDR product can generally capture the temporal variability of daily precipitation across this region. Excluding non-rainfall data points, we find that the values of R (i.e., R >0 and R ≥1 , which are estimated using data with OBS daily precipitation of >0 and >1 mm/day, respectively) are comparable to R all , implying that the effects of these non-rainfall samples have limited impact on R-based performance of the PERSIANN-CDR product on the daily scale. Also, we calculated R between OBS and PERSIANN-CDR ( Figure 6(b1)) using samples above given OBS percentiles (i.e., P0, . . . , and P9). It is evident that as the percentile threshold increases, R decreases, and its spread between the maximum (or the third quantile) and the minimum (or the first quantile) values widens. These results indicate that the ability of PERSIANN-CDR to represent the temporal variability of higher daily precipitation is relatively lower and exhibits larger spatial differences. Among all of the months (Figure 6(c1)), larger values of R all (regional mean > 0.45) occur in March-November, with a maximum of 0.58 for the HRB mean in June, while lower values of R all appear in other three months with a regional mean of around 0.40. With the precipitation threshold above a certain OBS percentile (Figure 6(d1)), the R values of each month using daily data show intra-annual cycles the same as R all 's, with larger values in March-November. Interestingly, the relationship between R values and percentile thresholds reveals that R in January, May-September, and December decreases with percentile thresholds, but R increases for the other five months. Figure 6. R (KGE) and its values using data above a given OBS daily precipitation threshold and their intra-annual cycles. (a1,b1) illustrates R all , R >0 , and R ≥1 (KGE all , KGE >0 , and KGE ≥1 ), while (a2,b2) shows R for each percentile threshold; (c1,d1) (c2,d2) are for the intra-annual cycles of R and its values that correspond to each percentile threshold, respectively. NAN in (d1,d2) suggests that the PERSIANN-CDR product at some grids does not have a rainfall event above a given OBS daily precipitation threshold; here, if the number of grids is more than 10, this index is not shown and labeled NAN. Figure 6(a2-d2) show the KGE results, which represent PERSIANN-CDR's overall performance skill. The regional mean KGE all (calculated using all data), KGE >0 (calculated based on data with OBS daily precipitation > 0 mm/day), and KGE ≥1 (calculated with OBS daily precipitation > 1 mm/day), are between 0.30 and 0.40, implying that PERSIANN-CDR has a moderate skill in reproducing daily precipitation (Figure 6(a2)). With an increased daily precipitation percentile threshold, the corresponding regional mean KGE decreases; it is lower than 0.20 after P5 and even below 0 after P8 (Figure 6(b2)). In Figure 6(c2), the KGE values are larger than 0.30 in 10 months, particularly in June-August and November, with a value above 0.40. However, PERSIANN-CDR data have KGE values of around 0.30 in February and December and even near 0 in January. The intra-annual cycles of KGE for each percentile threshold (Figure 6(d2)) show characteristics similar to KGE all 's, but with lower magnitudes in each month (excluding January). In January, February, October, November, and December, KGEs for each percentile threshold is always lower than 0.15 and even negative, indicating that PERSIANN-CDR lacks the skills to represent daily precipitation in these months. In March-September, KGEs are generally larger than 0.20 for the lower percentile thresholds, implying that this product has some skills in capturing lower precipitation, but no skills in reproducing higher precipitation.

Rainfall Detection Analyses
To characterize the detection capabilities and limitations of the PERSIANN-CDR product, we calculated POD, FAR, ETS, HK, HSS, and FBI values at given precipitation thresholds. These metrics, together with a function of increasing precipitation thresholds, are very useful for measuring the PERSIANN-CDR algorithm performance for different precipitation rates (Figure 7, [57]). It is evident that, in general, POD linearly decreases with increased precipitation thresholds, followed by good (>0.50) and poor performance (<0.50) for lower (<P6) and higher (≥P6) precipitation values, respectively (Figure 7(a1)). The behavior of FAR (Figure 7(a2)) shows an increase from the lowest threshold (P0), a decrease after P0, and an increase after P7. FAR generally fluctuates within a narrow spread between 0.53 and 0.65, indicating that more than 50% of the precipitation detections for each threshold are incorrect, particularly for low thresholds (< P4). ETS shows that the maximum detection skill (a value of 0.26) is achieved at P5, and the skill is limited for precipitation between P0 and P1 or exceeding P9 (Figure 7(a3)). Comparing Figure 7(a4) with Figure 7(a3) shows that, although HSS has larger values, similar behavior is detected for this categorical metric; i.e., the maximum detection skill (0.41) is found at P5. The values of HK (Figure 7(a5)) at most thresholds are above 0.40, and the maximum of 0.49 is observed at P2. These results suggest that the PERSIANN-CDR estimates can generally discriminate between rainfall events at a given threshold. Performance evaluation using FBI scores are above 2.00 (Figure 7(a6)), indicating an overestimation of precipitation totals in thresholds below P7, especially at the thresholds P0, P1, and P2. By contrast, an FBI of below 1.00 shows an underestimation of precipitation totals in thresholds above P6.
As plotted in Figure 7(b1), the behavior of POD in each month shows a decrease as the precipitation threshold increases. Basically, the PERSIANN-CDR product throughout the year corresponds to good performance for precipitation below P4, particularly in June-August, with POD larger than 0.50 continuing until P7. At a given threshold, the corresponding POD shows obvious seasonality, with a peak always occurring in July. FAR (Figure 7(b2)) indicates that more than 50% of precipitation detections are incorrect at almost all thresholds for each month, particularly for precipitation in January-June and September-December below P7, with FAR values larger than 60%. This metric's fluctuations in each month are characterized by an initial increase, followed by a decrease, and finally, an increase (except for March-May) with increasing thresholds. The monthly pattern of this categorical indicator is similar to that evaluated using all daily data (Figure 7(b2)). Figure 7(b3) shows that the ETS values are all smaller than 0.30. Moreover, the maxima occurs in June-August and the remaining months correspond to precipitation thresholds above and below P5, respectively. HSS (Figure 7(b4)) values do not exceed 0.50, but in each month, the precipitation threshold corresponding to the largest HSS is different, i.e., P1-P4 for January-April and September-December, and P4-P6 for May-August. Except in July and August, when HK values are large than 0.50 for precipitation thresholds between P2 and P6 (Figure 7(b5)), this indicator's values are all below 0.50 for other months. In particular, the HK values for thresholds above P6 (P3) in January-April and September-November (December) and above P8 in the remaining months are lower than 0.30, which suggests poor PERSIANN-CDR performance. Overall, FBI behavior generally shows a decrease with increasing percentiles for each month (Figure 7(b6)). However, different tendencies to underestimate or overestimate precipitation events are found among the percentiles. For example, PERSIANN-CDR generally overestimates precipitation below P4 in January-May and October-December, especially for precipitation below P1, with an FBI larger than 3.00. However, it underestimates other precipitation events, e.g., precipitation above P6 having an FBI of less than 0.50. In June-September, most precipitation events are overestimated by the PERSIANN-CDR product. Categorical metrics using different OBS daily precipitation thresholds (a1-a6) and their intra-annual fluctuations (b1-b6). The metrics with a subscript (i.e., >"x") represent PERSIANN-CDR product's capacity in detecting OBS daily precipitation above "x" mm/day. NAN in b1-b6 suggests that the PERSIANN-CDR product at some grids does not have rainfall events above a given OBS daily precipitation threshold; here, if the number of grids is more than 10, the index is not shown and labeled NAN.

Performance Based on Monthly Data
The monthly data shows that the TB (pTB) values are positive at almost all the sites, with a mean of 7.49 mm/month (11%) in the HRB. Biases for precipitation below P9 (Figure 8(a1,b1)) are positive at the overwhelming majority of or even all the grids, and the regional mean is between 0.69 mm/month (0.5%) and 1.50 mm/month (2%). However, performance for PERSIANN-CDR above P9 is distinct, with negative biases for more than 75% of the sites, and a mean of −3.03 mm/month (−4%) for the HRB. If all monthly data are used, the regional mean TMSE is 1587.04 (mm/month) 2 . The error increases with increasing percentiles (Figure 9(a1,a2)), followed by smaller (<150 (mm/month) 2 or <10%) and larger (>150 (mm/month) 2 or >10%) values for precipitation below and above P7, respectively. Figure 8(a2,b2) illustrate the intra-annual cycle of biases from monthly data. Positive values of HRB biases indicate that this product overestimates precipitation in each month, and the same is true at more than 75% of the grids in all months (except for August). As depicted in Figure 8(a2,b2)), the regional mean TB (pTB) values are larger than 10 mm/month (20%) in June and July (January-April and December). TMSE, MSE s , and MSE r show obvious seasonality, which apparently increases after January and peak in July, followed by a decrease (Figure 9(b1-b3)). Because of differences in TMSE magnitude among months, the seasonal characteristics of pMSE s (pMSE r ) differ from those of MSE s (MSE r ) (Figure 9(c1,c2)). Generally, pMSE s (pMSE r ) first decreases (increases), then increases (decreases), and finally decreases (increases). For each month, comparison between pMSE s and pMSE r illustrates that, except for June and July (which have larger pMSE s values), the error can be attributed to larger random error sub-components in the remaining ten months, and the largest pMSE r (>70%) is found in November and December show. Figure 8. Percentile-based biases using monthly data (a1,b1) and intra-annual cycles of TB (a2) and pTB (b2). B Pi,i+1 represents B within [Pi, Pi+1), and its percentage expression is labelled with a header of "p". Figure 10(a1) plots changes in R with increasing precipitation thresholds. In general, PERSIANN-CDR well represents the temporal variability of monthly precipitation, with the HRB means higher than 0.50 for all precipitation thresholds. Interestingly, regional mean R values decrease with increasing precipitation percentiles, and this implies that the product's ability to represent temporal variability is relatively weaker for higher precipitation. The intra-annual cycle of R for monthly data is presented in Figure 10(b1). Obviously, PERSIANN-CDR's ability to represent the temporal variability of monthly precipitation varies from month to month, i.e., better performance (R =~0.90) in January-March and October-December compared with April-September (excluding July), for which R is below 0.80. The HRB KGE values (Figure 10(a2)) at the precipitation thresholds below P6 are generally constant (~0.75), and then they sharply decrease, ranging from 0.68 at P7 to 0.37 at P9. These findings indicate a comparable and better overall performance of this product in representing monthly precipitation below P6, but the worst overall performance for monthly precipitation occurs above P9. KGE values vary among months (Figure 10(b2)), with larger values (>0.60) in March and September-December, moderate values (between 0.60 and 0.50) in February and April-June, and lower values (<0.50) in January and July. Figure 9. Percentile-based Mean-Square-Error (a1,a2); intra-annual cycles of Mean-Square-Error (b1) and its systematic and random sub-components (b2,b3,c1,c2), which are calculated using monthly data. MSE Pi,i+1 represents Mean-Square-Error within [Pi, Pi+1), with its percentage expression labelled as a header of "p". . R (a1) and KGE (a2), which are calculated using data above OBS monthly precipitation percentile thresholds, and intra-annual cycles of R (b1) and KGE (b2) estimated with all monthly data. Table 4 illustrates the PERSIANN-CDR product performance based on annual data. The annual precipitation estimates are higher than the OBS values at the majority (>75%) of the grids, with a regional mean TB of 89.87 mm/year or a pTB of 11%. For the systematic and random sub-components of annual errors, the HRB mean MSE s and MSE r (pMSE s and pMSE r ) are 19896.57 and 10459.86 (mm/year) 2 (61% and 39%), respectively, and the former is always larger than the latter at most (>78%) grids. This finding indicates that the PERSIANN-CDR error results mainly from the systematic error sub-component on the annual scale. With some exceptions (<25% of grids), the vast majority of the grids show higher values of annual R (>0.69; KGE > 0.59), with a regional mean of 0.75 (0.63), which indicates that PERSIANN-CDR data can well reproduce the temporal variability of annual precipitation (i.e., the data has a better overall performance).

Uncertainties in the PERSIANN-CDR Product
In this study, we comprehensively evaluated the PERSIANN-CDR precipitation product over the HRB by comparison with direct measurements at 176 grid boxes. It should be noted that this product has some shortcomings that may have various origins, including PERSIANN's conceptual foundations and specifications, as well as inputs (including datasets for calibrating and running the algorithm and adjusting precipitation estimates). Therefore, we will discuss possible sources of this product's uncertainties in the following subsections.

Uncertainties from PERSIANN's Conceptual Foundations and Specifications
Richards and Arkin [73] first reported that convective clouds with cold Tb are more likely to produce rain than clouds with warmer cloud-top temperatures. After that, this finding has become a critical conceptual foundation for estimating precipitation with satellite VIS/IR (e.g., PERSIANN developed by Hsu et al. [39]) and MW data. Tb data from GEO satellites generally has a high sampling frequency (i.e., full globe scans are currently available every 15 min), and is selected to infer the position and cloud-top temperature of cloud masses (and, implicitly, cloud-top height). Connections between cloud-top temperature and surface rainfall rates are established using different ground-based precipitation observations as references. However, Tb data can only reflect the cloud's bulk properties rather than the sophisticated characteristics of microphysics and morphology, and thus does not always correlate well with rainfall. Therefore, these VIS/IR algorithms lack a strong physical basis and are empirical in nature [28,[74][75][76]. For example, an actual precipitating region may not be perfectly consistent with and even be smaller than the cold cloud shield in a precipitating system. Sometimes, no rainfall occurs under the coldest section [28,77,78]. This implies that cold clouds (i.e., lower Tb) are neither a sufficient nor necessary condition for rainfall. In this study, higher FAR values, especially for the heavy precipitation and wintertime precipitation (Figure 7(a2,b2)), could be partly due to PERSIANN mistakes cold, but non-precipitating clouds in precipitating systems [76,[79][80][81]. Specifically, in convective storms, the relationship between Tb and precipitation rate varies with spatiotemporal storm evolutions [81]. Heavy precipitation actually appears at the edges of a cloud patch, often with higher Tb values relative to the inner core of this patch. As a result, the precipitation rates estimated using fitted parameters of the relationship between Tb and precipitation rate over the whole precipitation area, may have larger negative biases and larger errors [77,78,82]. Similarly, if fitted parameters are employed throughout a storm's life cycle, VIS/IR algorithms are likely to frequently underestimate rainfall during the early stages, during which the system is mainly controlled by convective cores at high precipitation rates [81]. PERSIANN indeed extracts some useful precipitation information (i.e., statistics of mean and standard deviation of nearby pixels) from IR images and combines MW calibrators [29], although these processes are not enough to provide complete information about a storm's spatiotemporal evolutions (e.g., higher Tb values at the edges of a cloud patch and early stages of a storm's life cycle). The two issues above are possible explanations for the larger underestimations of the daily and monthly HRB precipitation at higher percentiles (Figure 4(a2-d2), Figure 5(a2-d2), Figure 8(a1-b1), and Figure 9(a1,a2)).
In PERSIANN and other VIS/IR algorithms, a few critical physical processes that closely relate to precipitation (e.g., sub-cloud evaporation, cloud types, and topography) are not fully considered, potentially introducing uncertainties to precipitation estimates [3,15,16,28,78,[83][84][85][86][87]. Kuligowski et al. [84] found significant false alarm rainfall in the TRMM precipitation product in the central part of the U.S. (dry regions). Their further work suggested that this issue is caused largely by the evaporation of hydrometeors below the cloud base [85]. Because IR images do not directly include information about the evaporation of hydrometeors below the cloud base, sub-cloud evaporation effects on precipitation are likely to be partly responsible for the FAR of the PERSIANN algorithm. Because of the close association of cloud types with rain intensities, their delineations are important for accurately estimating precipitation at different intensities, which has been pointed out by numerous studies [15,16,78,88]. Despite that, recognizing types of clouds, which are related to complex three-dimensional structures of water vapor, is difficult based on only the limited information conveyed by Tb in each pixel. Therefore, uncertainties in VIS/IR-based precipitation estimates can come from incomplete and inaccurate information about cloud types. In addition, topographically caused precipitation and warm precipitation clouds (e.g., stratiform) may result in precipitation over a region, for example, northeastern Brazil [3,87]. Current VIS/IR algorithms scarcely consider the topographical impacts, possibly leading to underestimating warm cloud precipitation.
Besides precipitation regime-related issues, VIS/IR-based precipitation product accuracy is likely to be influenced by algorithms' specifications. The specifications for PERSIANN-CDR include identification of non-rain events, adjustment of 3-hourly precipitation estimates using monthly weight parameters, aggregation of 3-hourly precipitation estimates into daily values, and so on. A slight problem in the PERSIANN model is that 3-hourly precipitation maps have many noisy pixels, which are generally associated with an observed values of zero, but very small non-zero values from neural network model estimates [4,39]. Undoubtedly, this algorithm shortcoming is bound to impact categorical indexes, e.g., high FAR (Figure 7(a3,b3)). Before data aggregation, PERSIANN 3-hourly precipitation is adjusted using monthly weighted parameters, which are ratios between monthly values of the Global Precipitation Climatology Project (GPCP) and PERSIANN estimates. Because this process does not consider weight fluctuations among days, daily precipitation estimates are impossible to be perfectly adjusted. By contrast, adjusted monthly and annual precipitation values are closer to observations, and this is reflected by PERSIANN-CDR's better performance (i.e., continuous statistics and KGE) on the monthly and annual scales in the HRB. Aggregation of PERSIANN-CDR 3-hourly precipitation estimates may lead to either good or poor daily performance, which is dependent on whether the uncertainties of the 3-hour estimates in one day are offset or accumulated [8,78,[89][90][91][92].

Uncertainties from Inputs
Inputs for PERSIANN-CDR mainly consist of the NCEP Stage IV hourly radar precipitation data used for initially training the PERSIANN model, gridded satellite IR data (GridSat-B1) from the International Satellite Cloud Climatology Project (ISCCP) used for running the established model and estimating precipitation, and monthly GPCP data used for adjusting the preliminary product [29]. First, inherent uncertainties in the source data can be propagated into PERSIANN-CDR precipitation estimates. The NCEP Stage IV data, which is high-resolution Doppler next-generation radars (NEXRADs) merged with hourly rain gauge data over the continental U.S. [29,85], has been evaluated over the past several years (e.g., References [72,93,94]). For instance, Nelson et al. [94] summarized this dataset's disadvantages, including: (1) Bad rain gauge measurements merged in the radar-based precipitation estimation algorithm, especially for hourly estimates; (2) spatial discontinuities of hourly data, e.g., in the Northwest River Forecast Center (i.e., not available), the coast of South Carolina and Georgia from compositing data in these overlapping regions, as well as some River Forecast Centers (RFC)s because of different algorithms and processes (e.g., identifying bad gauge measurements or radar estimates); and (3) evident underlying issues related to radar-based precipitation estimation (e.g., beam blockage, bright band contamination) in the Stage IV product. The GridSat-B1 IR window (IRWIN) Tb data from 1983 to the present, with spatial and temporal resolutions of 10 km and 3 h, is used to drive the PERSIANN model for estimating precipitation [29]. In the process of producing the GridSat-B1 IRWIN data by merging the ISCCP B1 IR data, three sets of calibration (i.e., on-board calibration, ISCCP inter-calibration, and HIRS (high-resolution IR radiation sounder) inter-calibration) have been conducted to derive a consistent and more accurate radiance dataset in space and time [95][96][97][98]. However, some factors, such as view zenith correction, gaps in coverage, and differences in instrument spectral response functions, can also adversely impact the GridSat-B1 IRWIN data quality and then decrease the accuracy of PERSIANN-CDR precipitation estimates [97,99]. Since GPCP products (including monthly, pentad, and daily merged analysis) were developed, they have been widely evaluated with different reference data and used for various relevant studies across the globe [100][101][102][103][104]. For example, GPCP generally captures the spatial patterns of annual and seasonal precipitation over most of the world, but substantial discrepancies appear in some regions (e.g., equatorial West Africa) as a result of low gauge densities [100]. Meanwhile, larger disagreements in spatial and temporal patterns for inter-annual to decadal variations are observed in South America, Africa, and Australia [101,105].
In addition to the propagation of these inherent uncertainties in the source data, Stage IV and GridSat-B1 data can indirectly influence PERSIANN precipitation estimates. For example, Stage IV data is produced by merging the high-resolution Doppler next-generation radars (NEXRADs) and hourly rain gauge data of the continental U.S. References [93,94] rather than the globe, so its representativeness is limited. PERSIANN calibration parameters with this data may not be reasonably employed to the regions outside the continental U.S., especially for those with different atmospheric circulation systems [8]. The East Asian Monsoon prevails in the HRB, and extrapolation of calibrated parameters potentially leads to the local uncertainties in precipitation estimates. Since the duration of some precipitation events may be less than three hours, not all precipitation events are captured by the PERSIANN preliminary product, whose temporal scale of three hours is determined by the GridSat-B1 IR data. For example, convective rainfall events happen in tens of minutes and disappear in a similar period. Despite a shorter duration and smaller spatial extent, these rainfall events may have larger contributions to precipitation totals on various temporal scales (e.g., References [7,106]). Particularly in summer with unstable atmospheric conditions, convective rainfall events regularly take place regularly and likely result in larger uncertainties of precipitation estimates. As a result, inability to capture the short-period (i.e., <3 h) precipitation processes may be a reason for the larger negative biases at higher precipitation percentiles in our study (Figure 4(a2-d2).

Uncertainties of Rain Gauge Data
Commonly, rain gauge observations are taken as ground truth values for validating various precipitation products (e.g., radar-and satellite-based) on different temporal and spatial scales across the globe, since they have specific advantages, such as accessibility [7,106] (e.g., simple data acquisition and processing), direct and relatively accurate measurements, and long timespans [107][108][109][110][111]. Nonetheless, there are still uncertainties in gauge data, for instance, the inherent difference between point-scale gauge measurements and areal satellite precipitation, i.e., a point of space in time accumulation vs. a snapshot of time in space aggregation. The spatial resolution of PERSIANN-CDR is 0.5 • × 0.5 • , over which estimated precipitation is averaged, while the footprint of a gauge is much smaller than the coverage of the PERSIANN-CDR pixel. Given the larger variability of precipitation over a small spatial extent, a sparse gauge network may not detect meso-/microscale weather system-related precipitation (e.g., convective precipitation), so gauge precipitation may be lower in magnitude and occurrence than the ground truth values in a target pixel. Overall, mismatches between representatives of satellite-based and gauge precipitation are likely to have an important effect on the accuracy and precision of qualitative and quantitative assessments of satellite-based precipitation products [4,15,16,[111][112][113][114]. Moreover, as suggested in numerous studies (e.g., References [8,37,90,114]), a sufficient number of gauges should be collected for the corresponding pixel to decrease the uncertainties induced by gauge representatives. For example, to estimate areal rainfall (for a pixel of about 200 km 2 ) within 20% of its true value, over 25, around 25, 15, and 4 gauges are necessary at the 15 min, hourly, 3-hourly, and daily scales, respectively [114].
Rain gauges are also subject to measurement uncertainties, which are closely associated with flaws in calibration, wind-related undercatch, and wetting-evaporation losses (e.g., References [24,72,107,109,[114][115][116][117][118][119][120][121]). Gauges are calibrated (e.g., tipping-bucket rain gauges calibrated using static (volumetric) and dynamic methods), but measured values still contain uncertainties. Shedekar et al. [116] evaluated the accuracies of precipitation observed by three tipping-bucket rain gauges, which had been calibrated in advance. They found that the three devices still significantly underestimated precipitation when compared with actual rainfall depths, especially for greater rainfall intensities. Due to the impacts of flow deflection and the associated eddies and turbulence around gauges (namely, wind-related undercatch effect), wind generally causes some raindrops (especially smaller ones) to miss the funnel or fall at an inclination, which can influence the catch efficiency of the gauge (e.g., References [115,[120][121][122][123]). Finally, uncertainties induced by wind effects are dependent on ambient wind speed, raindrop size distribution, and gauge design. Sieck et al. [122] pointed out that wind-exposed aboveground gauges likely observe less rainfall than collocated buried gauges, with a typical undercatch of 2-10% and sometimes more. Wetting losses, defined as water adhering to the inside walls of the gauge and evaporating (or sublimating) without being recorded, generally cause gauge-recorded precipitation to be lower and vary among gauge configurations (e.g., number of times the gauge is emptied) and precipitation types [119]. For example, wetting loss experiments at a gauge site in Russia reported that the average wetting loss was 0.2 mm per record for rainfall observation, but 0.15 mm per record for both snow and mixed precipitation [124]. Rain gauges are exposed to the atmosphere, within which water is always lost through evaporation (namely, evaporation losses). Magnitudes induced by evaporation losses show evident variations among gauge types, climatic zones, and seasons and are typically between 0.1 and 0.8 mm/day or 0-4% of gauged precipitation [108]. Usually, the aggregate impact caused by influential factors of rain gauge measurements is a net underestimation of precipitation [119]. Yang and Ohata [119] developed a bias-corrected (i.e., the aforementioned gauge measurement uncertainties were generally eliminated) climatological precipitation model over Siberia and concluded that after bias correction the annual precipitation increased by 30-330 mm or 10-65% relative to the gauge measurements. This implies that PERSIANN-CDR performance in the HRB based on TB (Figures 4 and 8, and Table 4) is underestimated for daily, monthly, and annual scales, but that derived from percentile-based bias are underestimated and overestimated for lower and higher percentiles, respectively. Moreover, precipitation measurements have an inhomogeneity issue that is imposed by changes in gauge instruments, station relocation, station environment, etc. [30,67,92,125,126]. These factors can have adverse impacts, especially for climatic studies using long-term time series. In this study, we tried our best to reduce the related uncertainties by removing sites with inhomogeneous observations detected by the Pettitt test (because of a lack of metadata for the selected sites); however, there is no guarantee that no inhomogeneity issue exists in the remaining sites, potentially weakening the confidence level of our results.

Conclusions
A comprehensive evaluation of a long-term, continuous, and high spatial resolution satellite-based daily precipitation product (PERSIANN-CDR) during the period 1983 to 2012 is conducted in this study by comparing the estimates with rain gauge measurements over the HRB. A suite of statistical measures and techniques were implemented to characterize the differences and agreements between PERSIANN-CDR estimates and rain gauge observations on different time scales. The conclusions are summarized as follows, with some suggestions for the PERSIANN team in improving algorithms and/or corrections in the future and potential users of the PERSIANN-CDR in the HRB.

•
Climatologic characteristics: Despite the differences in magnitudes between PERSIANN-CDR and OBS in the HRB, PERSIANN-CDR can well reproduce intra-annual cycles and spatial characteristics of climatologic precipitation calculated using all data or non-rainfall data.

•
Bias analyses: There are daily, monthly, and annual overestimations (with a regional mean pTB of 11%) in the PERSIANN-CDR precipitation data over the HRB. According to detection-based and percentile-based bias compositions, daily overestimations can be attributed to larger and positive FB compared to MB, HB or their sum, followed by negative NFB, which are mainly due to larger underestimations for higher bins. On the monthly scale, overestimations for bins below P9 are likely responsible for larger PERSIANN-CDR estimates; this is also closely associated with daily false biases.

•
Error analyses: Regardless of whether daily, monthly, or annual scales are examined, PERSIANN-CDR errors always come from systematic sub-components of errors. Also, daily and monthly errors are largely the result of larger contributions from errors for bins with higher precipitation. • R and KGE analyses: For each timescale, R and KGE respectively indicate that PERSIANN-CDR can well capture the temporal variability of precipitation and has a moderate or high overall performance in producing precipitation. Moreover, these capabilities increase from the daily to annual scale, but decrease with the given precipitation thresholds.

•
Detection capability: POD suggests that PERSIANN-CDR has a good (poor) performance in detecting rainfall events with lower (higher) precipitation thresholds. According to FAR for each threshold, this product incorrectly detects more than 50% of precipitation. ETS and HSS both indicate that PERSIANN-CDR has some skill for detecting precipitation between P2 and P8. Based on HK, PERSIANN-CDR can generally discriminate between rainfall events at a given threshold. FBI scores show an overestimation and underestimation of precipitation totals for thresholds below and above P7, respectively. • Intra-annual cycles of these validation metrics: Continuous and categorical metrics calculated using daily and monthly data reveal the presence of intra-annual cycles.
Briefly, the comprehensive analyses of PERSIANN-CDR performance in this study provide some significant suggestions for PERSIANN team and the potential users of this precipitation product in the HRB. For example, the analyses of bias and detection capability on various timescales indicate that PERSIANN team can try to develop specific algorithms and/or correction procedures for reducing FB and NFB (especially for rainfall events within higher bins), and thus increasing the overall performance of the PERSIANN-CDR product. The results from error analyses provide a possible solution for the PERSIANN team to decrease PERSIANN-CDR errors, e.g., removing the systematic sub-components using a simple linear regression method established based on the estimates and the observations. Meanwhile, the intra-annual cycles of these selected validation metrics should be involved in the improvements of PERSIANN algorithm and/or correction procedures. For the potential users who focus on the HRB, PERSIANN-CDR can be used for studying water resource from perspectives of intra-annual allocation, spatial distribution, and temporal variability of precipitation, while attention should be paid to the magnitude biases. The relatively poor performance of PERSIANN-CDR at higher precipitation bins indicates that this data has limited capability in capturing heavy rainfall events and potential floods in the HRB. Moreover, this product should be applied with caution for studies related to monthly (seasonal) precipitation, because the intra-annual differences of PERSIANN-CDR performance may cause different uncertainties of the results in each month (season).