1. Introduction
Gridded precipitation datasets are developed and used because they provide detailed knowledge about temporal and spatial representations of precipitation for applications, such as model validation, input for land surface models, as well as extreme rain-event characterization, especially on the global scale. The intensity of the global gridded daily precipitation amount and its spatiotemporal distribution is essential for the global water cycle, the hydrological cycle, the climatic effects of human activities, water resources assessment, and the droughts and floods forecast [
1,
2].
Gauge observations, satellite estimations, and numerical simulations are three existing principal sources of daily precipitation at the global scale [
3]. A series of global gridded daily or sub-daily precipitation datasets have been designed and produced based on these sources for different applications [
4]. Several datasets are based on data sources from sole gauge or satellite observations. Climate Prediction Center (CPC) Unified gauge-based analysis [
5], for example, collects gauge observations from more than 30,000 stations over the global land areas, while the Tropical Rainfall Measuring Mission (TRMM) Multi-satellite Precipitation Analysis (TMPA) [
6] and CPC’s morphing technique (CMORPH) [
7] combine precipitation estimates originating from passive microwave (PMW) and infrared (IR) observations from multiple satellite sensors. These datasets reveal the nature of daily precipitation to a certain extent, but they suffer from incomplete global coverage, especially over vast ocean areas and high-latitude areas for gauge- and satellite-based datasets, respectively. The global reanalysis dataset provides an effective way to obtain daily scale precipitation data with complete global coverage. In the several latest global reanalysis datasets, such as the Interim European Center for Medium-Range Weather Forecasts (ECMWF) Re-Analysis (ERA-Interim) [
8], the 55 year Japanese Reanalysis Project (JRA-55) [
9], and the National Centers for Environmental Prediction (NCEP) Climate Forecast System Reanalysis (CFSR) [
10], precipitation is usually produced by a short-range forecast from the operational model, assimilating a wide variety of atmospheric observations but not precipitation measurements. The numerical forecast of precipitation performs well at middle and high latitudes but performs poorly at low latitudes owing to the accumulation of model errors caused by various physical processes [
11].
The deficiencies in individual precipitation sources have led to attempts to combine the complementary information of each source to exploit the advantages of each for producing optimal estimations of global fine-scale precipitation with complete global coverage. Two multiple-source merged daily gridded precipitation datasets are available for the study of the global fine-scale water cycle. A widely available global daily precipitation dataset produced by The Global Precipitation Climatology Project (GPCP), called GPCP one-degree daily (GPCP-1DD) [
12], is the first product developed for estimating daily precipitation with complete global coverage from multiple sources (daily precipitation information from satellites and monthly precipitation information from gauge analyses). The GPCP-1DD product has been widely used in many fields such as weather process diagnostic analyses, hydrologic streamflow model validation, flood monitoring and forecasting, and climate change applications [
13,
14,
15,
16,
17].
The Beijing Climate Center of the China Meteorological Administration (BCC/CMA) has crafted a global daily precipitation dataset known as BCC Merged Estimation of Precipitation (BMEP). This dataset amalgamates daily-scale information from gauges, satellites, and reanalysis sources [
18]. Notably, BMEP stands as a global gridded precipitation dataset on a daily scale, incorporating inputs from all three primary data sources. Given the advantage of each data source in understanding of the temporal and spatial variability of fine time scale precipitation on a global scale, it is essential to merge information from multiple sources to take advantage of the strengths offered by each type.
At present, there have been numerous comparative evaluations conducted for various gridded precipitation datasets from different sources. Due to the limited availability of daily gridded precipitation with complete global coverage, most comparisons between such datasets have been conducted at regional scales [
19,
20,
21,
22]. However, comprehensive comparisons between daily gridded precipitation datasets from different sources, particularly multiple-source merged datasets, have been scarce. Beck et al. have utilized original global station precipitation observations for validation [
23]. However, due to the uneven distribution of these stations around the world and challenges in assembling the data, these analyses often miss out on regions with sparse or no station coverage. Consequently, they struggle to provide a clear picture of precipitation patterns across the entire global land area. Furthermore, it is worth noting that in previous comparative assessment studies of various global gridded precipitation datasets, there has been limited exploration of products originating from China’s developed global grid-based precipitation dataset, e.g., the BMEP dataset. The few available studies have primarily focused on comparing the monthly-scale BMEP product within China’s region [
15]. Limited attention has been dedicated to assessing its performance on a global scale and at the daily resolution.
In this study, we extensively compare five gridded daily precipitation datasets (BMEP, GPCP-1DD, JRA-55, ERA-I, CFSR) with complete global coverage and evaluate them using gauge-based analysis of daily precipitation over global land areas (CPC-Unified, referred to as CPC-U). The objective is to assess their reliability in representing the spatial and temporal variability of global daily precipitation and provide recommendations regarding the quality and utilization of these datasets for a comprehensive understanding of global water and hydrological cycles. As BMEP is a recently developed dataset, it is crucial to inform the community about its quality analysis and differences from other global precipitation datasets. Moreover, this evaluation can potentially offer valuable insights for developers of these global precipitation datasets for enhancing their products. By harnessing the strengths of each dataset, researchers and practitioners can enhance the precision and comprehensiveness of global fine-scale precipitation assessment, enabling more effective flood forecasting and simulation capabilities.
3. Results
3.1. Spatial Distribution of Annual Means
Figure 1 illustrates the spatial distribution of multi-year mean daily precipitation across the globe from 2003 to 2016. The gauge-based CPC-U data show that the highest annual mean precipitation areas over global land are in the Indonesian Islands (over 7 mm/d) situated within the western tropical Pacific ITCZ region. Additionally, the Amazon (over 5 mm/d) and Equatorial Africa (over 3 mm/d) represent two distinct rainfall centers influenced by the expansion of the eastern Pacific ITCZ. The eastern coastal regions of Asia (over 4 mm/d) and North America (over 3 mm/d) exhibit high precipitation levels associated with mid-latitude storm tracks. All reanalysis datasets and combined datasets successfully capture the overall spatial distribution of CPC-U’s annual mean precipitation across the globe. However, GPCP-1DD, ERA-I, CFSR, and JRA-55 show a significant overestimation of precipitation over the Amazon rainfall center. GPCP-1DD and CFSR also overestimate precipitation over the Kalimantan Islands, while BMEP, ERA-I, and JRA-55 underestimate it. Other gridded datasets, except BMEP, substantially overestimate rainfall centers over New Guinea Island, eastern coastal regions of China, and the Bay of Bengal. Although all gridded datasets detect the rainfall belt over Equatorial Africa, there are notable differences in the locations and magnitudes of the strong rainfall center among these precipitation estimates.
3.2. Statistical Indices
3.2.1. Bias
The spatial distributions of the multi-year mean of daily precipitation bias from each gridded dataset against CPC-U data are presented in
Figure 2. Notably, a consistent tendency to overestimate the annual mean precipitation emerges in specific regions. Equatorial Africa, for instance, exhibits an overestimation exceeding 3 mm/d, while the Amazon and the Indo-China Peninsula both show overestimations exceeding 2 mm/d. Conversely, in numerous parts of northern Eurasia (above 30°N), northern Africa, Australia, and North America, the biases are smaller in magnitude, ranging between −0.5 mm/d and 0.5 mm/d. Remarkably, except for ERA-I, reanalysis products exhibit more extensive large-scale patterns of bias, a trend that extends further when contrasted with the merged products of BMEP and GPCP-1DD. This is especially pronounced in areas characterized by lower annual mean daily precipitation, as illustrated in
Figure 1.
3.2.2. Time Correlation Coefficient (TCC)
Figure 3 presents the spatial distributions of the multi-year mean of the daily precipitation TCC for each gridded dataset in comparison to the CPC-U data. Notably, BMEP exhibits a notably enhanced performance in terms of spatial patterns and TCC values when juxtaposed with other gridded precipitation products, particularly in regions endowed with a more densely distributed network of gauge observations [
18]. In contrast, GPCP-1DD, despite relying on gauge-based precipitation observations [
12], exhibits diminished TCC values compared to the three reanalysis datasets that do not encompass direct precipitation observations. This could be attributed to the fact that BMEP uniquely integrates daily gauge precipitation information during its creation process [
18], thereby enhancing its ability to capture the fine-scale temporal variations in precipitation. On the other hand, the three reanalysis datasets adjust precipitation predictions through the assimilation of various atmospheric observational data from their respective systems [
8,
9,
10]. However, in the case of GPCP-1DD, the monthly gauge-based precipitation analysis data sourced from GPCP-SG [
12] falls short in capturing the intricacies of daily-scale precipitation variability, thereby underscoring the limitations of its performance in this context.
3.2.3. RMSE
The spatial distributions of the multi-year mean of daily precipitation RMSE from each gridded dataset against CPC-U data are meticulously detailed in
Figure 4. It is worth noting that the spatial patterns of RMSE observed across all gridded datasets (as illustrated in
Figure 2) are in accord with those of annual means (as depicted in
Figure 1). This alignment echoes findings from previous studies [
22,
36] that have underscored the inherent correlation between the magnitude of precipitation and the associated error in fine-scale precipitation representations. It is particularly noteworthy that the prominently high RMSE region over the Antarctic continent does not arise from inaccuracies in rainfall intensity estimates derived from gridded precipitation datasets. Notably, the notable area of elevated RMSE values across the Antarctic continent is not the result of inaccuracies in rainfall intensity estimation stemming from the gridded precipitation products themselves. Instead, it can be attributed to the original interpolation errors present in the CPC-U data, a crucial context not explicitly depicted in the figure. These results enhance our comprehension of the intricate factors contributing to spatial discrepancies and errors within gridded precipitation datasets.
Table 1 summarizes the globally averaged statistics for each dataset. The annual mean precipitation ranges from 1.65 mm/d (BMEP) to 1.94 mm/d (CFSR) among the gridded datasets, indicating a 9% (BMEP) and 28% (CFSR) higher estimate compared to CPC-U data. Among the datasets, GPCP-1DD and BMEP exhibit the smallest overall bias. Notably, BMEP exhibits an overall advantage over other gridded datasets in other statistical measures, with the smallest RMSE and the largest TCC. This highlights the significant improvements achieved by merging data from multiple sources in estimating daily precipitation. Conversely, GPCP-1DD demonstrates inferior TCC and RMSE performance compared to BMEP, ERA-I, and JRA-55, despite also being a merged dataset. The results for reanalysis products are mixed, with CFSR exhibiting higher bias and RMSE errors compared to all other datasets, while ERA-I and JRA-55 demonstrate more favorable TCC (both 0.35) and RMSE (both around 5.07 mm/d) performances, even surpassing the merged data from GPCP-1DD.
3.3. Time Series of Regional Means
The time series of the global averaged daily precipitation intensity between gridded datasets and CPC-U over global, northern hemisphere, and southern hemisphere land areas are shown in
Figure 5. The interannual variability of daily precipitation is consistent among all datasets with the annual cycle of CPC-U observations, which could be caught by all five gridded datasets over the global land area (
Figure 5a). The global land average and the Northern Hemisphere land average exhibit congruent annual cycles of daily precipitation. The peak of precipitation is observed during the months of July and August, while the lowest values occur around January. In contrast, the annual cycle in the Southern Hemisphere land regions presents a precisely opposite pattern, with peak precipitation occurring around January and trough values appearing in July and August. Analyzing the consistency between different precipitation datasets, specifically the CPC-U precipitation and other products, reveals that the datasets demonstrate superior consistency in the average time series of both hemispheres compared to the global land average. Notably, the dispersion among the datasets is significantly greater in the global average than in the hemisphere-specific results.
Throughout the evaluation period, the temporal curves of precipitation in the BMEP dataset closely align with those of the CPC-U product in the Southern Hemisphere. Conversely, alternative precipitation products such as GPCP-1DD, ERA-I, JRA-55, and CFSR display noticeable positive biases in precipitation intensity when compared to both BMEP and CPC-U data throughout the entire period. This positive bias is not particularly pronounced during the annual precipitation troughs but becomes more distinct during the periods of the annual precipitation peak. Also, the positive bias is primarily concentrated within the Equatorial region (the figure has not been presented). The overestimation of precipitation over the Equatorial region in reanalysis data, such as ERA-I, may be attributed to an overestimation of the deep convection and moisture flux convergence over the tropical land [
37]. This issue is likely a common challenge across other reanalysis precipitation products. The overestimation of Equatorial precipitation in the GPCP-1DD dataset could potentially stem from the incorporation of the monthly average GPCP-SG gridded data for the correction of daily precipitation values during its production process [
12]. It is generally recognized that monthly average precipitation data possess greater spatial integrity compared to daily precipitation data. Consequently, utilizing monthly averages to amend daily averages may inadvertently result in an overestimation of cumulative daily precipitation values.
In the Northern Hemisphere, both BMEP and ERA-I products exhibit positive biases compared to CPC-U data. A possible explanation for this phenomenon is that CPC-U data rely solely on station observations, whereas BMEP or ERA-I data incorporate satellite or reanalysis data as well. In regions with sparse station coverage, CPC-U assigns a value of 0 to grid points without station observations, whereas BMEP or ERA-I data often incorporate satellite-derived or reanalysis-based precipitation in those grid points, leading to values greater than 0. Consequently, in areas with limited station coverage, the values of BMEP and ERA-I data tend to be higher compared to the CPC-U data.
It is noteworthy that the CFSR data exhibit a notable positive trend from 2003 to 2016, particularly after 2011, which is not observed in other datasets. This divergence can be attributed to the transition from CFSR data to CFSv2 data. While the modeling and assimilation systems used in CFSv2 are quite similar to those of CFSR, there are still certain differences between CFSv2 as an operational system and the CFSR reanalysis system [
38]. On one hand, CFSv2 employs a new gravity wave parameterization scheme [
38], based on cumulus convection-induced gravity wave drag, which might lead to an excessive simulation of convective precipitation. On the other hand, being an operational system in real-time, CFSv2 exhibits weaker completeness and quality control of observational data collection compared to the comprehensive reanalysis system of CFSR, which to some extent affects the accuracy of precipitation forecasting. These two factors could potentially result in noticeable differences in precipitation in the CFSR data after the period of 2010. This abnormal bias in the CFSR data aligns with previous research indicating that the renewal of modeling systems and data assimilation systems can introduce spurious trends in precipitation forecasts, impacting the accuracy of reanalysis-based precipitation assessments [
20,
39].
3.4. Spatial Biases
Figure 6 illustrates the time series of spatial biases between various grid-based daily precipitation products and the CPC-U precipitation data across global land areas as well as two hemispheric land areas. The time series of spatial bias exhibit distinct annual cyclic characteristics, mirroring those of the daily precipitation patterns themselves. Furthermore, the annual cyclic features of the spatial bias align closely with the corresponding annual cyclic characteristics of the regional daily precipitation. Specifically, the peak values of the spatial bias curve correspond to the peak values of the average precipitation in each area, while the trough values align with the corresponding minimum precipitation values. The spatial bias of the JRA-55 data is lower than that of the CFSR data but higher than that of other grid-based precipitation datasets. Its larger spatial bias values, compared to other datasets, primarily occur during peak periods (approximately 1.0 mm/d) and remain relatively stable throughout the entire statistical period, without exhibiting the increasing trend observed in the later stages, as observed in the CFSR data.
Among these grid-based daily precipitation datasets considered, the BMEP precipitation showcases the smallest spatial bias characteristics across global land areas and two hemispheric land areas (
Table 2). Specifically, within global land areas, the BMEP data exhibit an average spatial bias reduction of over 0.13 mm/d compared to other datasets. Notably, in the Southern Hemisphere land areas, the average spatial bias of the BMEP data demonstrates a substantial reduction of approximately 1 mm/d in comparison to the CFSR precipitation. Within global land regions and Northern Hemisphere land regions, the advantages of the BMEP data over alternative precipitation datasets primarily manifest during the trough periods of the time series, particularly in winter seasons, where the regional average spatial bias remains near zero. However, in the Southern Hemisphere, the spatial bias of the BMEP data does not exhibit a similar characteristic. Nonetheless, it is worth emphasizing that the BMEP data consistently exhibit the smallest regional average spatial bias among these grid-based precipitation products.
3.5. Spatial Correlation Coefficients
Figure 7 illustrates the temporal variations of spatial correlation coefficients between different grid-based precipitation products and the CPC-U precipitation data in global land areas and the two hemispheric land areas. Notably, unlike spatial bias, the temporal curves of spatial correlation coefficients exhibit diverse patterns without consistent annual cyclic characteristics. In the global land area (
Figure 7a), the spatial correlation coefficients of grid-based precipitation products display minimal evidence of an annual cyclic pattern. However, in the Northern Hemisphere land area, BMEP, ERA-I, JRA-55, and CFSR data demonstrate a coherent yet relatively weak annual cyclic feature in their spatial correlation coefficient curves. Conversely, GPCP-1DD data deviate from this trend, displaying an inconsistent annual cyclic characteristic and exhibiting the lowest spatial correlation coefficient compared to other grid-based precipitation datasets. Due to data gaps in the GPCP-1DD dataset during certain periods in the high-latitude regions of the Northern Hemisphere (40°N–90°N), this leads to significantly lower spatial correlation coefficients between GPCP-1DD and CPC-U during those time intervals. This discrepancy, in turn, leads to a change in the annual cycle for those years on the average curve of the GPCP-1DD spatial correlation coefficient in the North Hemisphere. In the Southern Hemisphere land region, all precipitation datasets exhibit a relatively consistent annual cyclic pattern in the spatial correlation coefficients, with GPCP-1DD data aligning well with the other datasets, demonstrating good consistency.
The BMEP data demonstrate a clear advantage in terms of spatial correlation coefficients compared to other grid-based precipitation products. On a global land average, the spatial correlation coefficients of BMEP data remain consistently above 0.6, significantly higher than those of JRA-55 (0.52), ERA-I (0.48), GPCP-1DD (0.42), and CFSR (0.44). In both the global land area and the Northern Hemisphere land area, JRA-55 precipitation data exhibit higher spatial correlation coefficients compared to ERA-I, CFSR reanalysis data, and GPCP-1DD data. Meanwhile, in the Southern Hemisphere, JRA-55 and ERA-I data display similar levels of spatial correlation coefficients, outperforming GPCP-1DD and CFSR data, with CFSR data exhibiting the poorest performance in terms of spatial correlation coefficients.
The above results indicate that, in terms of the spatial pattern of global daily precipitation, the BMEP precipitation exhibits the highest spatial consistency with CPC-U precipitation data based on daily station observations. This consistency is evident across the entire global land area and both hemispheric regions at different temporal scales, demonstrating robustness. We speculate that this coherence is attributed to the incorporation of daily station-based precipitation observations in the BMEP data, which is unique among these analyzed datasets. The introduction of more accurate precipitation observations at the daily scale plays a crucial role in improving the temporal variability of grid-based precipitation, thereby positioning BMEP precipitation as the superior product among these examined grid-based precipitation datasets.
3.6. Spatial RMSEs
Figure 8 illustrates the temporal evolution of the spatial RMSE (Root Mean Square Error) for various grid-based precipitation products in comparison to the CPC-U precipitation data across the global land area and the two hemispheric land areas. It is evident that each daily grid-based precipitation dataset exhibits pronounced annual cyclic variations in spatial RMSE within the hemispheric land areas. Notably, the Northern Hemisphere land area and the Southern Hemisphere land area exhibit precisely opposite patterns in their annual cyclic behavior. In the Northern Hemisphere, the peak values of spatial RMSE, relative to the CPC-U data, are observed during the months of July and August, while the trough values occur around January. Conversely, the Southern Hemisphere land region shows an inverse behavior, with peak spatial RMSE values occurring around January and trough values occurring around July and August. This distinct annual cyclic characteristic of spatial RMSE coincides remarkably with the annual cyclic patterns of precipitation in these areas, emphasizing that the magnitude of spatial RMSE between different grid-based precipitation datasets and the CPC-U data is primarily governed by the intensity of their precipitation amounts.
The spatial RMSE of BEMP precipitation data is significantly smaller than that of other grid-based daily precipitation datasets in all these analyzed land areas. For the global land average, the time-mean spatial RMSE of BEMP data remains stable at around 4 mm/d (average of 4.27 mm/d,
Table 2), which is notably lower than other reanalysis or satellite merged grid-based precipitation datasets. Moreover, the advantage of BEMP data over other grid-based precipitation datasets is consistently evident throughout the entire study period, indicating the robustness of this advantage in terms of spatial RMSE. This is partly attributed to the incorporation of station-based daily precipitation observations in BEMP precipitation data and partly to the implementation of an objective error estimation algorithm that dynamically adjusts the error structure of each merged data source adaptively [
29]. Consequently, the errors are effectively reduced, resulting in an outstanding performance of BEMP data compared to other grid-based precipitation datasets.
Additionally, it is noteworthy that despite being a reanalysis-based precipitation product, JRA-55 data demonstrate a spatial RMSE performance that is only surpassed by that of BEMP data. Its average spatial RMSE over the global land, Northern Hemisphere, and Southern Hemisphere land areas is 4.90 mm/d, 4.09 mm/d, and 6.74 mm/d, respectively (
Table 2). These values are significantly better than those of ERA-I and CFSR, two widely used reanalysis datasets, and also outperform the satellite-derived merged GPCP-1DD (with average spatial RMSE values of 5.59 mm/d for the global land, 4.81 mm/d for the Northern Hemisphere land, and 7.47 mm/d for the Southern Hemisphere land, as shown in
Table 2). These findings indicate that, when it comes to daily precipitation estimation on a global scale, the advantages of GPCP-1DD precipitation based on satellite data are not distinctly evident compared to reanalysis precipitation. Despite the reanalysis precipitation products offering six-hour forecasted precipitation, insufficient handling of satellite precipitation errors in GPCP-1DD can potentially result in greater daily-scale precipitation RMSE compared to reanalysis precipitation products.
3.7. Categorical Statistics
Figure 9 presents the results of POD, FAR, and ETS for five different gridded products across varying daily precipitation thresholds. As the thresholds increase, all gridded products consistently exhibit a decrease in POD and an increase in FAR values. This finding aligns with earlier studies [
22,
35,
40] indicating that identifying heavy rainfall is more challenging than detecting light rain for both merged and reanalysis precipitation products. When considering thresholds ranging from 0.1 to 10 mm/day, BMEP, JRA-55, and CFSR clearly outperform GPCP-1DD in detecting rain occurrence (
Figure 9a). Furthermore, the skill of GPCP-1DD deteriorates at a slower rate compared to that of BMEP, JRA-55, and CFSR, resulting in higher POD values for thresholds exceeding 20 mm/d. The FAR of BMEP remains relatively stable around 0.55, significantly lower than that of other datasets for thresholds exceeding 5 mm/d. Conversely, the FAR values of GPCP-1DD and CFSR rapidly escalate, exhibiting substantially higher levels than other gridded data (
Figure 9b). These findings highlight BMEP’s ability to achieve the lowest false alarm ratios in detecting medium to high precipitation rates across global land areas. To assess the overall capability of detecting rain events, the ETS is evaluated. BMEP exhibits an increasing trend in ETS scores for light rainfall events, reaching its peak at a threshold of 5 mm/d. Moreover, BMEP consistently maintains significantly higher ETS scores than other products, gradually declining as the threshold increases (
Figure 9c). The superior performance of BMEP in ETS scores can be attributed to its substantially lower number of false alarms. This indicates that BMEP excels in successfully identifying daily precipitation, particularly for moderate and heavy rain events.
4. Conclusions
In this study, the performance of five daily-scale gridded precipitation datasets with complete global coverage including BMEP, GPCP-1DD, ERA-interim, JRA-55, and CFSR over a global land area have been systematically compared with the CPC-Unified gauge-based data during the period of 2003–2016 in terms of the annual mean, daily time series, spatial and temporal characteristics of bias, CC, and RMSE, as well as the POD, FAR, and ETS of the occurrence of rainfall events for different daily precipitation thresholds.
Gauge-based CPC-U data show that the Indonesian Islands have the highest annual mean precipitation over global land. The Amazon, Equatorial Africa, eastern coastal regions of Asia, and North America also experience significant rainfall. Reanalysis and merged datasets capture the overall precipitation patterns, but some datasets exhibit overestimation or underestimation in specific regions.
Gridded datasets tend to overestimate precipitation in Equatorial Africa, the Amazon, and the Indo-China Peninsula. Among the reanalysis products, ERA-I and JRA-55 perform well in terms of correlation coefficients and RMSE. GPCP-1DD and BMEP exhibit the smallest overall bias. BMEP exhibits an overall advantage over other gridded datasets in other statistical measures, with the smallest RMSE and the largest TCC. Conversely, GPCP-1DD underperforms compared to BMEP, ERA-I, and JRA-55. Overall, BMEP shows substantial advantages, while CFSR exhibits notable biases and RMSE errors.
The interannual variability of daily precipitation is consistent across all datasets, with the annual cycle captured by the five gridded datasets over global land areas. Peak precipitation occurs in July and August in the Northern Hemisphere, while the Southern Hemisphere exhibits an inverse pattern with peak precipitation in January. The datasets demonstrate better agreement within each hemisphere compared to the global average. BMEP and CPC-U data exhibit remarkable consistency, with BMEP demonstrating superior performance compared to other gridded products on a global scale and in both hemispheres. However, GPCP-1DD, ERA-I, and JRA-55 display noticeable positive biases in precipitation intensity compared to CPC-U. The CFSR data show a notable positive trend after 2011, attributed to the transition from CFSR data to CFSv2 after the year 2010. CFSv2’s new gravity wave parameterization for cumulus convection-induced drag may cause excess convective precipitation, and its real-time operational nature affects observational data quality, potentially impacting precipitation accuracy after 2010. This finding aligns with previous research suggesting that updates of modeling systems and data assimilation systems can introduce spurious trends in precipitation forecasts, impacting the accuracy of reanalysis-based assessments.
The spatial bias in daily precipitation exhibits distinct annual cyclic patterns, reflecting the true precipitation dynamics. Among the analyzed datasets, BMEP demonstrates the smallest global and hemispheric spatial bias, surpassing others by a notable margin. Moreover, BMEP showcases exceptional spatial consistency with CPC-U precipitation data, boasting superior global and hemispheric spatial correlation coefficients. This remarkable performance is attributed to the incorporation of daily station-based precipitation observations, enhancing the temporal variability of grid-based precipitation estimates. Additionally, BMEP consistently outperforms its counterparts in terms of spatial RMSE, maintaining a stable value of around 4 mm/d for the global land average. Notably, satellite-based GPCP-1DD precipitation does not exhibit distinct advantages over reanalysis precipitation for global daily estimation, and caution should be exercised during the use of these datasets. JRA-55 stands out as a reliable alternative that exhibits competitive performance, outshining widely used reanalysis datasets and the satellite-derived GPCP-1DD.
The performance of gridded precipitation products varies with precipitation thresholds. As the thresholds increase, the probability of detecting rainfall decreases (lower POD), while the false alarm ratio increases (higher FAR). BMEP, JRA-55, and CFSR outperform GPCP-1DD in detecting rain occurrence for thresholds ranging from 0.1 to 10 mm/d. Among them, BMEP consistently exhibits the lowest FAR, especially for thresholds above 5 mm/d, indicating its superiority in detecting moderate to heavy precipitation over global land areas. The ETS analysis demonstrates that BMEP achieves higher scores for light rainfall events and maintains superior performance across different thresholds. This is primarily due to its significantly lower false alarm rate, highlighting BMEP’s effectiveness in accurately detecting daily precipitation, particularly for moderate and heavy rain events.
This study demonstrates that BMEP precipitation, which incorporates daily station observations, exhibits significant advantages over other global gridded products at the daily scale, providing a more reasonable and accurate characterization of global precipitation. The findings of this study provide an initial assessment of the temporal and spatial error characteristics of widely used global daily precipitation gridded datasets. These results can offer valuable feedback to developers for improving their precipitation products and serve as useful references for users of these datasets.
We suggest that further work should address this area. Specifically, detailed evaluations should be conducted on the performance of daily gridded precipitation at smaller scales, as well as its performance in numerical modeling, hydrological monitoring and forecasting, flood prediction, and early warning systems. Understanding the strengths and weaknesses of various reanalysis and merged daily precipitation datasets in terms of these aspects will contribute to better support for global flood simulation, hydrological monitoring, and geological hazard early warning. Additionally, future studies could also explore the potential integration of other observational datasets, such as remote sensing and radar data, to further improve the accuracy and coverage of global precipitation estimates. Continuous efforts in advancing data merged techniques and improving quality control procedures would also enhance the reliability and usability of global gridded precipitation datasets.