Analysis of the IMERG-GPM Precipitation Product Analysis in Brazilian Midwestern Basins Considering Different Time and Spatial Scales

Abstract: Precipitation products derived from satellites have emerged as a promising approach for obtaining precipitation estimates, enabling accurate long-term observations and describing the water cycle dynamics from a global scale to a local scale. The quality of these products has improved significantly in the last decades, especially with the emergence of the TRMM mission and its successor, GPM. The objective of this study was to evaluate the daily, monthly and annual precipitation estimates provided by IMERG version 05 of the GPM against the data observed by the rainfall stations of the Brazilian Agency of Water and Sanitation (ANA) in the basins of the Brazilian midwest. In order to compare the data, the rainfall station data were spatialized by means of the ordinary kriging technique, interpolating the data to grids of 0.1° × 0.1° that match the grid of the GPM product. The data were evaluated quantitatively by means of statistical metrics. The GPM satellite precipitation product performed relatively well on a daily scale for regions with smooth topography, and was able to describe the rainfall regime on larger time scales, regardless of the terrain conditions. However, the satellite retrievals were unable to reproduce rainfall extremes in virtually all situations, which may limit their application in frequency analyses.


Introduction
Understanding the time-space variability of rainfall is paramount for hydrological applications. In fact, precipitation information at suitable resolutions, both in time and space, is necessary, for instance, for forecasting extreme flooding events; for continuous hydrological simulation that may provide streamflow estimates for the management and operation of hydropower reservoirs and water supply systems; for landslide warnings; and for forcing irrigation models, particularly for agricultural activities in semi-arid environments [1]. In this sense, the definition of strategies for environmental sustainability, flood and drought risk mitigation as well as water resources management is inherently related to the proper stochastic characterization of the precipitation process across a region of interest.
Traditionally, measurements of rainfall amounts are directly obtained from ground-based gauges, and these have constituted the main source of information for hydrological studies. However, rainfall gauging station networks are often sparse and unevenly distributed across space, which imposes difficulties for properly capturing the spatial variability of precipitation systems [2]. In addition, precipitation samples obtained from ground-based gauges are frequently corrupted by long periods of missing data, which may hinder their use for continuous rainfall-runoff modeling and, accordingly, for the indirect estimation of streamflow-related variables [3].
Water 2022, 14, 2472
Precipitation information retrieved from satellites has remained a promising approach for characterizing the precipitation process on spatial scales that range from that of the catchment to a near-global scale [4]. Research efforts throughout the last decade have demonstrated potential uses of satellite precipitation products in a variety of applications in hydrology, such as in modeling extreme precipitation events [5,6], rainfall frequency analysis [7], flood frequency analysis [8,9], drought monitoring [10,11] and forecasting [12,13], rainfall-runoff simulation [14–16], and in the planning and management of water resources systems [17]. Previous works have also indicated that resorting to satellite estimates may be advantageous for characterizing the precipitation process in regions where ground-based networks are insufficient, and where other estimation approaches, such as those involving radars, are unable to provide reliable estimates of rainfall amounts [6,18].
The accuracy of precipitation products has considerably increased in recent years [19], particularly since the launch of the Tropical Rainfall Measuring Mission (TRMM) and of its successor, the Global Precipitation Measurement (GPM). The main purpose of these missions is to provide high-quality and high-resolution global precipitation estimates [20], which could be utilized for real-time monitoring as well as for short-term weather forecasting [21]. GPM, which originated from a joint initiative of the National Aeronautics and Space Administration (NASA) and the Japan Aerospace Exploration Agency (JAXA), was launched in February 2014. It comprises a large group of international space agencies that includes the Indian Space Research Organization (ISRO), the National Oceanic and Atmospheric Administration (NOAA), and the European Organization for the Exploitation of Meteorological Satellites (EUMETSAT), among others [22,23]. By improving the estimation of precipitation on a global scale, the GPM mission may enhance knowledge of precipitation systems and of the resulting variability of other components of the water cycle. In addition, short-term weather forecasting and 4-dimensional reanalyses, such as measurements of space-time variabilities in global precipitation, permit us to better understand the following: (i) storm structures, (ii) water/energy balance, (iii) freshwater resources, and (iv) interactions between precipitation and other climate parameters. These fields may all benefit from the information provided by GPM [24].
In general terms, rainfall amounts are indirectly estimated from satellites by resorting to retrieval algorithms that integrate information from distinct sensors [25]. For the GPM mission, such a combination is performed by the Integrated Multi-satellitE Retrievals for GPM (IMERG), which merge and interpolate data from a set of passive microwave sensors from the GPM constellation, as well as information stemming from infrared counterparts; these provide a precipitation product with a spatial resolution of 0.1° × 0.1° and a sampling frequency of 30 min across the globe [22].
Although the main objective of the GPM mission is to provide a high-quality precipitation product, it has been widely acknowledged in the literature that the use of retrieval algorithms, no matter how complex, always introduces bias to precipitation estimates [6]. Such bias usually manifests itself distinctly with respect to precipitation amounts, and may furthermore be amplified by climate [26,27] and complex terrain conditions [6]. Moreover, retrieval errors may strongly depend on the aggregation time scales; these errors are usually larger for shorter time scales (e.g., hourly or daily) and considerably smaller for longer ones (e.g., monthly or annual) [19]. These facts have prompted a plethora of studies that assessed the performances of distinct retrieval algorithms in different parts of the world by comparing satellite estimates with ground-based measurements (which are, more often than not, erroneously assumed to be error-free), as well as studies that developed mathematical models for bias correction [6,28–35].
With respect to performance assessment, the IMERG-GPM algorithm has been highlighted for having good overall agreement with ground information, particularly for short time scales, compared to other established precipitation products. In fact, a study by Tang et al. (2016) [36] indicated that the IMERG product outperformed its TRMM multisatellite precipitation analysis (TMPA) 3B42V7 and 3B42RT counterparts, for both daily and sub-daily time scales, in Chinese catchments. These findings were supported by Sharifi, Steinacker and Saghafian (2016) [37], who compared the IMERG and TRMM products on a daily scale in Iran. Nonetheless, since the performances of retrieval algorithms often vary with climate and topography [6], these conclusions cannot be readily generalized. As a result, suitability assessments for distinct precipitation products should be carried out for each particular study region.
In view of the foregoing, the objective of this study is to evaluate the performance of IMERG-GPM version 05, on daily, monthly and annual scales, for the Brazilian midwestern region. In order to conduct this evaluation, we compare the satellite retrievals with information derived from the rainfall gauging network operated by the Brazilian Agency of Water and Sanitation (Agência Nacional de Águas e Saneamento Básico, ANA), after first performing spatialization to match the satellite resolution for the Meia Ponte and the Bois River catchments. The Brazilian midwestern region is a relatively poorly gauged area [29,35], with a complex climate that is influenced by a variety of atmospheric systems [38]. The area presents marked seasonal features that are known to affect the performance of satellite precipitation products [6]. Previous research on this study region [29,35] has suggested that the TRMM products, which are frequently used as data sources in tropical areas, may present significant biases during the wet season, which in turn provides some justification for performing similar evaluations with the IMERG-GPM counterpart. The remainder of this paper is organized as follows: Section 2 presents the material and methods, with a brief description of the study area and the data, as well as the methods utilized for data quality checking, for interpolating the ground-based rainfall amounts and for performance assessment. Section 3 comprises the main results as well as a discussion of them with respect to previous research. Finally, in Section 4, conclusions and research developments are addressed.

Study Area
The study area encompasses the Meia Ponte River catchment, located in the central region of the Brazilian state of Goiás, and the Bois River catchment, located in the southern portion of the state (Figure 1). The Meia Ponte River catchment drains an area of 14,819 km², amounting to approximately 3.6% of the territory of Goiás. It is a densely populated region, with about 3.131 × 10⁶ inhabitants concentrated in the municipalities of Goiânia, Aparecida de Goiânia, Anápolis, Senador Canedo and Itumbiara. The catchment is characterized by a tropical savanna climate, with a dry season spanning from April to September and a wet season between October and March. The temperature ranges from 17 °C to 31 °C, whilst the mean annual rainfall varies from 1400 to 1600 mm [39]. The Bois River catchment, in turn, amounts to an area of 35,435 km², which corresponds to 9% of the area of Goiás. Forty-three municipalities, which comprise 651,391 inhabitants, are partially or entirely contained in this catchment. According to Santos, Bayer and Carvalho (2008) [40], this is also a region with marked seasonality, having a dry period between May and September and a wet counterpart from October to April. Mean annual rainfall amounts range from 1400 mm to 1800 mm.

Data from Ground-Based Rainfall Gauging Stations
Daily rainfall amounts were obtained from the digital platform of the Brazilian Agency of Water and Sanitation (Agência Nacional de Águas e Saneamento Básico, ANA). A collection of 37 gauging stations with periods of record beginning in 1988 was initially selected for this study. In order to fill missing data, which amounted to 31 days for the 1750000 gauging station, 13 days for the 1750001 gauging station and a single day for the 1750008 gauging station, we resorted to simple linear regression. The procedure, which was performed with the "hyfo" R package [41], is as follows: for a given gauging station with data to be filled, the candidate neighboring gauges are ranked based on their correlation coefficients; next, simple linear regression equations are derived by using the data from each candidate as explanatory variables; finally, for each day with missing data, the rainfall amount estimate is derived from the most correlated candidate with available data. We acknowledge that simple linear regression may not be the most accurate alternative for filling missing precipitation data on a daily scale [42], and that the predictive abilities of the obtained regression models are relatively low (R² < 0.30). However, since the number of missing values is small, we believe that the use of this simplified tool did not strongly affect the performance assessment.
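The gap-filling steps described above can be sketched as follows. This is a minimal illustration in plain Python, not the "hyfo" implementation used in the study: the function names, the requirement of at least three overlapping days, and the clamping of negative predictions to zero are all assumptions made for the example.

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5 if sxx > 0 and syy > 0 else 0.0


def fit_line(x, y):
    """Least-squares slope and intercept of y on x."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx
    return slope, my - slope * mx


def fill_gaps(target, neighbours):
    """Fill None entries of `target` from the best-correlated neighbour.

    target: list of daily rainfall values (float or None).
    neighbours: dict mapping a gauge name to a list of the same length.
    """
    models = []
    for name, series in neighbours.items():
        # Use only days on which both gauges have data.
        pairs = [(s, t) for s, t in zip(series, target)
                 if s is not None and t is not None]
        if len(pairs) < 3:  # assumed minimum overlap
            continue
        xs, ys = zip(*pairs)
        if len(set(xs)) < 2:  # constant predictor, no regression possible
            continue
        models.append((pearson(xs, ys), name, fit_line(xs, ys)))
    # Rank candidates from most to least correlated.
    models.sort(key=lambda m: m[0], reverse=True)

    filled = list(target)
    for i, value in enumerate(target):
        if value is not None:
            continue
        for _, name, (slope, intercept) in models:
            x = neighbours[name][i]
            if x is not None:
                # Assumption: rainfall cannot be negative, so clamp at zero.
                filled[i] = max(0.0, slope * x + intercept)
                break
    return filled
```

A day is left as `None` when no candidate gauge has data for it, which keeps the gap visible instead of silently inventing a value.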
Additional data quality checks comprised excluding gauging stations with more than 20% of missing data in addition to those that received annual rainfall amounts larger than 2500 mm or smaller than 1000 mm, which are deemed unreasonable values for the study region on the basis of the 30-year average annual precipitation.
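The screening rule above amounts to a simple filter per gauging station. The sketch below is illustrative only; the function name and the use of the station's mean annual rainfall (rather than each individual year) are assumptions, while the 20%, 1000 mm and 2500 mm thresholds come from the text.

```python
def passes_quality_check(daily, mean_annual_mm,
                         max_missing=0.20, low_mm=1000.0, high_mm=2500.0):
    """Return True when a station is retained for analysis.

    daily: list of daily rainfall values, with None marking missing days.
    mean_annual_mm: the station's mean annual rainfall (assumed input).
    """
    missing_fraction = sum(v is None for v in daily) / len(daily)
    if missing_fraction > max_missing:
        return False
    # Totals outside this window are deemed unreasonable for the region.
    return low_mm <= mean_annual_mm <= high_mm
```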
After the data quality check and the filling of missing data, four gauging stations which presented anomalous behaviors with respect to the mean annual rainfall (values less than 1000 mm) were discarded. The gauges that were retained for analyses are shown in Table 1. Finally, for assessing the performance of the GPM precipitation product, whose data are available from 2014 onwards, the daily rainfall amounts recorded in the water year of September 2016-August 2017 were utilized. We note that this decision resulted from the large amount of missing data prior to 2016 and from 2017 onwards in most of the rainfall gauging stations utilized in our study; this could introduce high levels of bias in the comparison. On the other hand, 2017 presented an annual rainfall amount relatively close to the long-term average (1339 mm and 1500 mm, respectively, for the Meia Ponte River catchment), which may, at least to some extent, attenuate the effects of low data availability in subsequent analyses.

Data from the GPM Precipitation Product
Satellite precipitation estimates were retrieved with IMERG, the algorithm developed by the GPM team to provide the precipitation product. The algorithm's fifth version (level 03), which provides rainfall estimates with a spatial resolution of 0.1° × 0.1° and a 30-min sampling frequency, was utilized. The algorithm was designed to combine information from multiple international satellites and develop long-term precipitation records in uniformly distributed pixels across the globe [43].
For our analyses, the IMERG-GPM product, provided in HDF5 format by NASA, was initially imported into the software ArcGIS (version 10.6, Esri, Redlands, CA, USA). Next, precipitation estimates were extracted for the water year of 2017 and for those pixels located between 52° and 19° S and 19° and 16° W, which entirely enclosed the study area. The satellite retrievals were then aggregated for daily, monthly, and annual time scales and compared to spatialized and raw ground-based rainfall measurements.
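The temporal aggregation step can be illustrated for a single pixel. The sketch below is not the ArcGIS workflow used in the study; it assumes that the half-hourly values are precipitation rates in mm/h, so that each step contributes rate × 0.5 h of depth, and it ignores any offset between the satellite time stamps and the gauge reading hour. The function name is hypothetical.

```python
from collections import defaultdict
from datetime import datetime, timedelta

HALF_HOUR_IN_HOURS = 0.5  # each step covers 30 minutes


def aggregate(rates_mm_per_h, start):
    """Aggregate one pixel's half-hourly rates to daily, monthly, annual depths.

    rates_mm_per_h: consecutive half-hourly rates (assumed to be in mm/h).
    start: datetime of the first half-hourly step.
    Returns (daily dict, monthly dict, total depth over the series), in mm.
    """
    daily = defaultdict(float)
    monthly = defaultdict(float)
    for k, rate in enumerate(rates_mm_per_h):
        stamp = start + timedelta(minutes=30 * k)
        depth = rate * HALF_HOUR_IN_HOURS  # mm accumulated in this step
        daily[stamp.date()] += depth
        monthly[(stamp.year, stamp.month)] += depth
    return dict(daily), dict(monthly), sum(daily.values())
```

For a full water year, `start` would be 1 September 2016 and the series would hold 48 values per day.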

Data Interpolation
Data derived from rainfall gauges were interpolated in order to form uniform grids with the same spatial resolution as the GPM satellite retrievals (i.e., 0.1° × 0.1°, or approximately 11 km). The interpolation was performed in ArcGIS (version 10.6) by using the ordinary kriging technique, a method that assumes a spatial Gaussian field with a covariance function defined by a semivariogram [44]. Ordinary kriging is a widespread interpolation technique that frequently outperforms alternative methods when dealing with precipitation data [45]. For fitting the theoretical Gaussian field to the empirical sample points, a spherical semivariogram was utilized with a range of 150 km and a cutoff point of 300 km. Then, the rainfall amounts were estimated at the vertices of each pixel defined by the IMERG-GPM product and averaged over these points in order to match the satellite's resolution.
Once the precipitation time series were interpolated, spatialized average values were computed for both the catchments' drainage areas and for each of the gridded elements with ground-based gauges; this allows the identification of those regions across the study area in which the satellite retrievals present larger deviations with respect to the measured rainfall amounts. The comparisons comprised the period of record spanning from 1 September 2016 to 31 August 2017.
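To make the interpolation step concrete, the sketch below implements ordinary kriging with a spherical semivariogram in plain Python. It is not the ArcGIS routine used in the study. Only the 150 km range comes from the text; the unit sill, the zero nugget, the use of planar coordinates in km, and all function names are assumptions for illustration. With a zero nugget the kriging weights do not depend on the sill, so the placeholder sill does not affect the estimates.

```python
import math


def spherical(h, sill=1.0, range_km=150.0, nugget=0.0):
    """Spherical semivariogram; sill and nugget are placeholder values."""
    if h == 0.0:
        return 0.0
    if h >= range_km:
        return nugget + sill
    ratio = h / range_km
    return nugget + sill * (1.5 * ratio - 0.5 * ratio ** 3)


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def distance(p, q):
    """Euclidean distance between two (x, y) points given in km."""
    return math.hypot(p[0] - q[0], p[1] - q[1])


def ordinary_kriging(points, values, target, variogram=spherical):
    """Return (estimate, weights) at `target` from gauges at `points`."""
    n = len(points)
    # Semivariances between gauges, bordered by the unbiasedness constraint.
    a = [[variogram(distance(points[i], points[j])) for j in range(n)] + [1.0]
         for i in range(n)]
    a.append([1.0] * n + [0.0])
    b = [variogram(distance(p, target)) for p in points] + [1.0]
    weights = solve(a, b)[:n]  # last unknown is the Lagrange multiplier
    return sum(w * z for w, z in zip(weights, values)), weights


def pixel_average(points, values, corners):
    """Average the kriged estimates at a pixel's vertices, as in the text."""
    estimates = [ordinary_kriging(points, values, c)[0] for c in corners]
    return sum(estimates) / len(estimates)
```

The constraint row forces the weights to sum to one, which is what distinguishes ordinary kriging from simple kriging with a known mean.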

Comparison of Precipitation Amounts
The performance assessment of the IMERG-GPM algorithm was based on the computation of the goodness-of-fit metrics presented in Table 2 [46], which are intended to provide a comprehensive evaluation of the satellite product. Absolute metrics such as the mean absolute error (MAE) and the root mean square error (RMSE) assess the overall agreement between ground-based and satellite estimates, whereas the mean error (ME) and the percent bias (PBIAS) disclose the existence of systematic errors which result in under- or overestimation. Finally, we also utilized as benchmarks the Nash-Sutcliffe efficiency criterion (NSE) as well as the coefficient of persistence (CP); the former is a common metric in hydrological applications, although it is sometimes criticized for being prone to misinterpretation [47]; the latter, which uses the previous day's observed precipitation as an alternative to the satellite information, provides a naïve yet usually more robust benchmark. Some of these indexes, such as the MAE and RMSE, provide similar information regarding model performance; however, marked distinctions among them might indicate a more general lack of fit or large deviations solely with respect to higher-order statistics [48].

Table 2. Goodness-of-fit metrics used for performance assessment. Recoverable entries: Mean Error (ME), in mm; "Expresses the uncertainty in a measurement"; "Evaluates the predictive ability of hydrological models"; "Compares the performance of the model being used and the performance of the persistence model".
Note(s): n is the number of sample points, O denotes the ground-based rainfall measurements, and E corresponds to the IMERG-GPM retrievals.
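For reference, the metrics named in the text can be computed as in the sketch below. The formulas are the commonly used textbook definitions and are assumptions here, since the table's own equations are not reproduced in the text. Errors are taken as E − O, so that positive ME and PBIAS indicate overestimation by the satellite, in line with how the results are interpreted in this paper. The CP follows the usual persistence definition, with the previous day's observation as the benchmark. The function name is hypothetical.

```python
from statistics import mean


def goodness_of_fit(obs, est):
    """Compute ME, MAE, RMSE, PBIAS, NSE, CP and CC for paired series.

    obs: ground-based rainfall (O); est: satellite retrievals (E).
    Assumes obs is not constant and does not sum to zero.
    """
    n = len(obs)
    errors = [e - o for o, e in zip(obs, est)]
    o_mean, e_mean = mean(obs), mean(est)
    sse = sum(d * d for d in errors)

    # Persistence benchmark: predict each day with the previous observation,
    # so the first day is excluded from both sums.
    sse_tail = sum(d * d for d in errors[1:])
    sse_persist = sum((obs[i] - obs[i - 1]) ** 2 for i in range(1, n))

    cov = sum((o - o_mean) * (e - e_mean) for o, e in zip(obs, est))
    var_o = sum((o - o_mean) ** 2 for o in obs)
    var_e = sum((e - e_mean) ** 2 for e in est)

    return {
        "ME": sum(errors) / n,                   # mm
        "MAE": sum(abs(d) for d in errors) / n,  # mm
        "RMSE": (sse / n) ** 0.5,                # mm
        "PBIAS": 100.0 * sum(errors) / sum(obs),  # %
        "NSE": 1.0 - sse / var_o,                # benchmark: observed mean
        "CP": 1.0 - sse_tail / sse_persist,      # benchmark: previous day
        "CC": cov / (var_o * var_e) ** 0.5,
    }
```

Under these definitions, NSE and CP equal 1 for a perfect match, 0 when the satellite is no better than the benchmark, and are negative when the benchmark is preferable.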
In addition to the interpolated rainfall, we also evaluated the performances of the retrieval algorithm with the raw data extracted from the gauges as a means of assessing potential benefits or shortcomings of the kriging procedure. Furthermore, we computed the metrics for those ground-based rainfall amounts that equaled or exceeded the 95th-quantile data from each gauging station (from the raw data set) in order to assess the goodness-of-fit of the IMERG-GPM product regarding daily rainfall extremes.
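The selection of daily extremes can be sketched as below. The quantile uses linear interpolation between order statistics, which is only one of several conventions, and it is computed over all days in the series; whether dry days were included in the study's quantile is not stated, so both choices are assumptions. The function names are hypothetical.

```python
def empirical_quantile(values, q):
    """Empirical quantile with linear interpolation between order statistics."""
    data = sorted(values)
    pos = q * (len(data) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(data) - 1)
    return data[lo] + (pos - lo) * (data[hi] - data[lo])


def extreme_pairs(obs, est, q=0.95):
    """Keep the (observed, satellite) pairs whose observation is >= quantile q."""
    threshold = empirical_quantile(obs, q)
    pairs = [(o, e) for o, e in zip(obs, est) if o >= threshold]
    return threshold, pairs
```

The retained pairs can then be passed to any goodness-of-fit routine, so that the metrics describe only the days with large observed rainfall.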

Daily Time Scale
A comparison of satellite retrievals and rainfall gauging measurements averaged over the catchments' drainage areas is presented in Figure 2 for the Meia Ponte River catchment, and in Figure 3 for the Bois River catchment. It is possible to note that, for the former, the temporal coherence of rainfall events is reasonably reproduced by the satellite product, as demonstrated by the relatively high correlation coefficient (CC), albeit with a tendency to overestimate; this is detected from the values of the ME and PBIAS. The value of the RMSE, in turn, is almost twice that of the MAE, which suggests that the higher-order statistics are not properly reproduced by the satellite retrievals. This may constitute a major limitation for utilizing the IMERG-GPM product for both block-maxima and peaks-over-threshold frequency analysis. Finally, the benchmarks NSE and CP indicate that the satellite product has larger predictive skills than the observed mean and the observed previous day's precipitation data, respectively, even though their values are relatively close to zero.
For the Bois River catchment, the overall tendency of overestimation is even more pronounced, with higher values for the ME and PBIAS, as compared to the Meia Ponte data set. Moreover, a poorer representation of the temporal dynamics of the observed rainfall is perceived, with some lag between GPM retrievals and observed events throughout most of the period of record. Finally, the distinctions among the RMSE and MAE values are also noticeable, suggesting an unsuitable description of rainfall extremes by the satellite retrieval; additionally, both benchmarks are negative, indicating that the mean and/or the previous day's observed precipitation are preferable for prediction. On the other hand, the dry season was reasonably described in both catchments, which suggests, at least for the study area, that the GPM product is sufficiently accurate for detecting non-rainfall events. We note, however, that this is a poorly gauged catchment, in which many of the rainfall gauging stations are located in areas with more complex terrain and stronger topographic gradients (Figure 1). At least to some extent, this fact may explain the poorer performance of the IMERG-GPM product in the Bois River catchment. In effect, it is well established that most satellite products are unable to properly reproduce rainfall regimes in regions with complex topographies [6].
Figure 4 depicts scatterplots for the GPM precipitation estimates and the observed rainfall amounts. For the Meia Ponte River catchment (left panel), a linear functional form may be visualized, despite the large dispersion of the errors for precipitation amounts larger than 10 mm, which entailed a value of 0.61 for the coefficient of determination R², and some degree of deviation from the 1:1 line. For the Bois River catchment (right panel), on the other hand, the linear association is much weaker, with R² = 0.29, which indicates that the satellite retrievals are unable to explain the variation in the observed rainfall in this area.
We also compared the daily precipitation of each rainfall gauging station, after interpolation and with raw data, with those obtained from the corresponding grid of the GPM product. Results are summarized in Table 3 for the Meia Ponte River catchment, and in Table 4 for the Bois River catchment. One may notice that for the former, the metrics across gauges for the interpolated rainfall are somewhat similar, with the exception of the PBIAS, which presented mostly low positive values, but indicated a slight tendency to underestimate in the Meia Ponte gauging station. Hence, the performance of the GPM product did not present marked spatial variability in this catchment. We also note that, despite entailing similar values for most metrics, using the raw data affected the systematic biases either by changing their signs or by increasing their values (in absolute terms). This fact would favor the interpolation approach. On the other hand, the benchmarks suggest that satellite retrievals are closer to the raw data, whose use entailed substantial improvements in the values of both the NSE and CP; for the interpolated rainfall, these are mostly negative or close to zero.
Inspection of Table 4, in turn, indicates a much larger variation in the values of the goodness-of-fit metrics, and an overall worse performance with respect to the interpolated rainfall in the Bois River catchment. In effect, whilst most values of the CCs ranged from 0.39 to 0.50, suggesting a poorer description of the temporal dynamics of the observed rainfall, the values of the RMSE exceed, in many cases, those in the Meia Ponte catchment by more than 20%. High levels of variation are also verified for the PBIAS, and the tendency of overestimation is much stronger in the Bois River catchment; in some cases, systematic errors larger than 15% were verified. Of course, this may have stemmed from the inaccurate spatialization of the observed rainfalls in some portions of the catchment, as well as from the locations of the rainfall gauging stations in areas with complex terrain. However, our results suggest that the GPM product was unable to retrieve the real evolution of the daily precipitation activities across this entire region, and this certainly calls for further investigation in regard to potential causes of this phenomenon.
As for the comparison between raw and interpolated data, similar remarks to those in the Meia Ponte River catchment can be made: the PBIAS is strongly affected by the sampling approach, and the use of raw data increased the values of the benchmarks. However, compared to the Meia Ponte River catchment, such increases in the NSE and CP were less noticeable. Overall, the interpolation procedure did not seem to be beneficial, which could be at least to some extent anticipated, based on the large distances between the rainfall gauging stations. The incorporation of covariates such as topographic features to the kriging procedure may improve the interpolation results, and this will be addressed in future research.
Finally, the results of the goodness-of-fit assessment regarding observed daily extreme events, as materialized by the empirical 95th-quantile data from each rainfall gauging station, are shown in Tables 5 and 6 for the Meia Ponte and the Bois River catchments, respectively. It is generally possible to note that the values of metrics such as RMSE and MAE are close to the empirical quantiles themselves, which indicates a strong disagreement between satellite retrievals and ground-based information for large rainfall amounts. Moreover, the benchmarks are mostly negative, and the values of CCs are low, which may be due to poorer performance of the retrieval algorithm for extreme rainfall conditions, or because such extreme events are not being recorded on the same days by the satellite and the gauges. Overall, as previously hypothesized, the IMERG-GPM product was unable to reproduce daily rainfall extremes, which might limit its use for frequency analysis and risk assessment.

Monthly Time Scale
Figure 5 depicts the comparison of the monthly precipitation for the rainfall gauging stations located in the Meia Ponte River catchment; the values of the goodness-of-fit metrics are presented in Table 7. A similar plot is shown in Figure 6 for the Bois River catchment, with the metrics being provided in Table 8. One may observe that, as expected for larger time scales which smooth out strong variations in precipitation activity on a daily or sub-daily scale, the performance of the IMERG-GPM product considerably improves. In effect, the values of CCs were larger than 0.90 in all situations, although the tendency to overestimate still persisted in some gauging stations, such as Montividiu (PBIAS = 25.18%). In all cases, the values of the NSE and CP indicate that the satellite product has considerably greater predictive skills as compared to the benchmarks. Again, we note that a higher level of variability in the goodness-of-fit metrics was verified for the Bois River catchment, and the values of the RMSE and MAE present marked distinctions for this area, which may suggest that the GPM product could not properly capture the spatial patterns of variability and describe the rainfall behavior during the wet season, even on a monthly time scale, for this region.
On the other hand, when precipitation amounts are averaged over the entire areas of the catchments, the performance of the GPM product is suitable and similar for both geographical regions. In fact, as depicted in Figure 7, the satellite retrievals present good agreement with spatialized gauge data for all months, and are able to explain 98% (left panel) and 99% (right panel) of the latter's variability for the Meia Ponte and the Bois River catchments, respectively. In other words, the averaging procedure across large extensions smoothed out the larger variations in particular locations of the catchment that were verified in the previous analyses, hence improving GPM performance overall.

REVIEW
patterns of variability and describe the rainfall behavior during the wet season, even on a monthly time scale, for this region.
On the other hand, when precipitation amounts are averaged over the entire areas of the catchments, the performance of the GPM product is suitable and similar for both geographical regions. In fact, as depicted in Figure 7, the satellite retrievals present good agreement with spatialized gauge data for all months, and are able to explain 98% (left panel) and 99% (right panel) of the latter's variability for the Meia Ponte and the Bois River catchments, respectively. In other words, the averaging procedure across large extensions smoothed out the larger variations in particular locations of the catchment that were verified in the previous analyses, hence improving GPM performance overall. As for the other goodness-of-fit metrics, the GPM product presented values of 10.9 mm for the MAE, 13.51 mm for the RMSE, 4.31% for the PBIAS, 4.81 mm for the ME, 0.97 for the NSE and 0.94 for the CP in the Meia Ponte River catchment; and 9.55 mm for the MAE, 14.64 mm for the RMSE, 7.46% for the PBIAS, 7.5 mm for the ME, 0.97 for the NSE and 0.92 for the CP in the Bois River catchment, when compared to gauging measurements. These results indicate a considerable enhancement with respect to the pixel-based analyses, which is again indicative of the potential advantages of averaging the precipitation amounts over larger geographical areas.
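
As a minimal sketch of how goodness-of-fit metrics of this kind could be computed for two paired series (for instance, catchment-averaged monthly totals), the following Python function may be useful. It is offered under stated assumptions: the name `goodness_of_fit` and its arguments are hypothetical, the formulas are the commonly used textbook definitions and may differ in detail from those listed in Table 2, PBIAS is taken as positive when the satellite overestimates (consistent with the positive values reported alongside overestimation), and CP is omitted because its definition is not given in this section.

```python
import math


def goodness_of_fit(sat, obs):
    """Return common goodness-of-fit metrics for paired series.

    sat: satellite estimates (e.g. mm per month), hypothetical argument name
    obs: ground-based reference values in the same units and order
    Assumption: textbook definitions; sign convention is sat minus obs.
    """
    n = len(obs)
    if n == 0 or n != len(sat):
        raise ValueError("sat and obs must be non-empty and of equal length")

    errors = [s - o for s, o in zip(sat, obs)]
    mean_obs = sum(obs) / n
    mean_sat = sum(sat) / n

    # Sums of squares reused by RMSE, NSE and CC
    ss_res = sum(e * e for e in errors)
    ss_obs = sum((o - mean_obs) ** 2 for o in obs)
    ss_sat = sum((s - mean_sat) ** 2 for s in sat)
    cov = sum((s - mean_sat) * (o - mean_obs) for s, o in zip(sat, obs))

    return {
        "ME": sum(errors) / n,                        # mean error
        "MAE": sum(abs(e) for e in errors) / n,       # mean absolute error
        "RMSE": math.sqrt(ss_res / n),                # root mean square error
        "PBIAS": 100.0 * sum(errors) / sum(obs),      # percent bias
        "NSE": 1.0 - ss_res / ss_obs,                 # Nash-Sutcliffe efficiency
        "CC": cov / math.sqrt(ss_sat * ss_obs),       # Pearson correlation
    }
```

The function assumes that the reference series is not constant and does not sum to zero; otherwise the NSE, CC or PBIAS denominators vanish. Applying it separately to pixel-based and catchment-averaged series would allow the two spatial scales to be compared on the same footing.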

Annual Time Scale
The spatial distribution of the annual rainfall amounts for the water year of 2017 is depicted in Figure 8; the spatialized ground-based measurements are shown in the left panel and those of the GPM product in the middle panel. As with the other time scales, the GPM product overestimated the precipitation amounts for both catchments; for the maximum values of annual rainfall, the ground-based gauges accumulated 1400 mm while the satellite retrievals obtained 1600 mm. In addition, the satellite product was not able to reproduce the spatial pattern of variability in the observed rainfall. Whereas some variability is verified for the ground-based rainfall data, particularly for the northeastern portion of the Bois River catchment, which presents a more complex topography, the satellite estimates are relatively homogeneously distributed across the entire study region. Such a condition resulted in a noticeable gradient in the errors in the southeastern-northwestern direction (right panel of Figure 8). We again hypothesize that the spatial interpolation may play a large role in the rougher behavior verified in the left panel of Figure 8. However, our results suggest that bias correction would be challenging for annual rainfall amounts in the study area, since the regional distinctions in rainfall distribution might not be readily explained by the usual covariates, such as altitude.
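
The accumulation of daily amounts into water-year totals can be illustrated with a short Python sketch. It assumes that a water year runs from September to August and is labeled by the calendar year in which it ends, which matches the study period of September 2016 to August 2017 being referred to as the water year of 2017; the function names `water_year` and `annual_totals` are hypothetical and are not taken from the original processing chain.

```python
from collections import defaultdict
from datetime import date


def water_year(day, start_month=9):
    """Label a date with its water year (assumed to start in September).

    Dates from start_month onward belong to the water year that ends in
    the following calendar year.
    """
    return day.year + 1 if day.month >= start_month else day.year


def annual_totals(daily):
    """Sum daily rainfall (mm) per water year.

    daily: iterable of (date, rainfall_mm) pairs for one pixel or station.
    """
    totals = defaultdict(float)
    for day, rainfall_mm in daily:
        totals[water_year(day)] += rainfall_mm
    return dict(totals)
```

Applying `annual_totals` to each pixel of the gauge-based and satellite grids, and then subtracting one annual total from the other, would give a pixel-wise difference field comparable in spirit to the right panel of Figure 8.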
As a final remark, we note that previous research demonstrated that the GPM product has suitable abilities in describing spatial precipitation patterns, but the rainfall intensities and spatial variability, which are closely linked to seasonality, have some influence on the capability of the GPM retrievals to capture local precipitation patterns [48]. Our results are at least to some extent in agreement with these conclusions. In fact, in many situations, the spatial distribution of rainfall was not properly described by the IMERG satellite retrievals, with a tendency of generating smoother surfaces as compared to the data captured by ground-based information. Nonetheless, as may be inferred from Melo et al., 2015, and Moraes and Gonçalves, 2021 [29,35], the IMERG-GPM may be a preferable alternative for our study region after spatial averaging, as it more properly described the rainfall amounts on a daily time scale. Hence, despite the inaccurate descriptions of daily rainfall extremes, which are not considerably improved under bias correction [6], the small size of our sample, and the relatively poor representation of spatial patterns, we still believe that the IMERG-GPM product may be a useful data source for the Brazilian midwestern region, as compared to well-established alternatives such as TRMM, mainly for continuous rainfall-runoff simulation based on a daily time step, and drought management, which requires data at monthly or longer time scales.

Conclusions
The GPM mission has provided a new generation of high-resolution precipitation products that could be utilized in several fields, such as hydrology and climatology. In this paper, the performance of the fifth version of the IMERG retrieval algorithm for the GPM constellation was assessed by comparing its satellite retrievals with ground-based information, on daily, monthly and annual scales throughout the period spanning from September 2016 to August 2017. The study evaluated the precipitation fields and the rainfall amounts, averaged at the catchment scale, for the Meia Ponte and the Bois River catchments, both located in the state of Goiás in the Brazilian midwestern region.
Our results indicated that for the Meia Ponte River catchment, a reasonable agreement between satellite retrievals and ground-based measurements in the precipitation fields was obtained, with a tendency for the satellite precipitation product to overestimate at all time scales. For the Bois River catchment, which is located in a region with more complex terrain and is less densely gauged, the performance of the satellite product was considerably worse, with a disruption in temporal coherence for daily data, and a stronger positive systematic bias, as compared to its performance for the Meia Ponte River catchment. When averaged over the catchment area, the precipitation estimates were reasonable for monthly and annual scales, indicating that the averaging procedure smoothed out the largest deviations in some areas of the catchments. Nonetheless, the spatial rainfall patterns were more often than not misrepresented by the IMERG retrievals, which generated oversmoothed surfaces and did not capture local features of the observed rainfall fields. Furthermore, at both spatial scales the most extreme events on a daily scale were not properly reproduced by the satellite product.
Satellite precipitation products are advantageous for practical applications, since they capture in a more effective manner the space-time patterns of precipitation events and are not affected by missing data. Nonetheless, as shown in this study and in several others, due to the indirect mechanisms for estimating rainfall amounts, satellite products are biased, mainly for the most extreme events, which are paramount for design and risk assessment. Hence, despite the technological development applied to the GPM mission for providing high-quality precipitation products, some research effort is still necessary to develop more effective techniques for bias correction. This is envisaged as our next research objective.

Figure 2. Comparison of the daily precipitation, as obtained from the GPM and the ground rainfall gauging network, averaged over the area of the Meia Ponte River catchment.

Figure 3. Comparison of the daily precipitation, as obtained from the GPM and the ground rainfall gauging network, averaged over the area of the Bois River catchment.

Figure 4. Scatterplots of the GPM estimates and the observed precipitation amounts for the Meia Ponte River catchment (left panel) and the Bois River catchment (right panel).

Figure 5. Monthly rainfall amounts as obtained from the GPM product and ground-based gauges for the Meia Ponte River catchment.

Figure 6. Monthly rainfall amounts as obtained from the GPM product and ground-based gauges for the Bois River catchment.

Figure 7. Scatterplots of monthly rainfall amounts for the GPM retrievals and ground-based gauges in the Meia Ponte (left panel) and the Bois River catchments (right panel).

Figure 8. Annual rainfall amounts for the water year of 2017, as obtained from the ground-based rainfall gauging network (left panel), from the GPM product (middle panel) and the spatial distribution of bias (right panel).

Figure 1. Locations of the Meia Ponte and Bois River catchments, rainfall gauging stations, elevations and sampling points utilized in this study. Black squares represent the 33 gauging stations (shown as reference numbers; see Table 1) utilized in the comparison with the 0.1° × 0.1° GPM pixels.

Table 1. Rainfall gauging stations utilized for the estimation of spatialized precipitation and for comparison with the GPM data. Columns: Number in Figure 1; Code; Rainfall Gauging Station; Longitude; Latitude; Elevation (m); Mean Annual Rainfall (mm).

Table 2. Goodness-of-fit metrics utilized in the comparison of the GPM precipitation product and the ground-based measurements.

Table 3. Goodness-of-fit metrics for interpolated and raw (in parentheses) daily precipitation data from four gauging stations located in the Meia Ponte River catchment.

Table 4. Goodness-of-fit metrics for interpolated and raw (in parentheses) daily precipitation data from 12 gauging stations located in the Bois River catchment.

Table 5. Goodness-of-fit metrics for daily rainfall amounts above the 95th empirical quantiles from four gauging stations located in the Meia Ponte River catchment.

Table 6. Goodness-of-fit metrics for daily rainfall amounts above the 95th empirical quantiles from 12 gauging stations located in the Bois River catchment.

Table 7. Goodness-of-fit metrics for monthly precipitation data from four gauging stations located in the Meia Ponte River catchment.

Table 8. Goodness-of-fit metrics for monthly precipitation data from 12 gauging stations located in the Bois River catchment. Columns: Gauging Station; CC; MAE (mm); RMSE (mm); PBIAS (%); ME (mm); NSE; CP.