Evaluation of TRMM / GPM Blended Daily Products over Brazil

The precipitation estimates from the Tropical Rainfall Measuring Mission (TRMM) Multi-satellite Precipitation Analysis (named TMPA and TMPA-RT for the near real-time version) are widely used both in research and in operational forecasting. However, they will be discontinued soon. The products from the Integrated Multi-satellite Retrievals for Global Precipitation Measurement (IMERG) and The Global Satellite Mapping of Precipitation (GSMaP) are analyzed as potential replacements for TMPA products. The objective of this study is to assess whether the IMERG and/or GSMaP products can properly replace TMPA in several regions with different precipitation regimes within Brazil. The validation study was conducted during the period from 1st of April, 2014 to the 28th of February, 2017 (1065 days), using daily accumulated rain gauge stations over Brazil. Six regions were considered for this study: five according to the precipitation regime, plus another one for the entire Brazilian territory. IMERG-Final, TMPA-V7 and GSMaP-Gauges were the selected versions of those algorithms for this validation study, which include a bias adjustment with monthly (IMERG and TMPA) and daily (GSMaP) gauge accumulations, because they are widely used in the user’s community. Results indicate similar behavior for IMERG and TMPA products, showing that they overestimate precipitation, while GSMaP tend to slightly underestimate the amount of rainfall in most of the analyzed regions. The exception is the northeastern coast of Brazil, where all products underestimate the daily rainfall accumulations. For all analyzed regions, GSMaP and IMERG products present a better performance compared to TMPA products; therefore, they could be suitable replacements for the TMPA. This is particularly important for hydrological forecasting in small river basins, since those products present a finer spatial and temporal resolution compared with TMPA.


Introduction
Knowledge of the spatial and temporal distribution of precipitation is of key importance for planning a wide range of socio-economic activities such as agriculture, livestock grazing, energy generation, etc.The availability of accurate and consistent precipitation data is then paramount for a proper assessment of such activities.However, traditional rain gauge measurements are relatively scarce and poorly distributed over the surface of the globe, particularly over remote areas or in developing countries.In the last three decades, satellite-derived precipitation estimate products have been developed using multi-satellites and multi-sensors.Such products provide an effective way of estimating precipitation data in areas where measurements are scarce, such as deserts [1], forests [2] and oceans [3].Accordingly, they have been widely used in research and applications worldwide [4][5][6][7].
The first approaches to employ remote sensing techniques for estimating precipitation were performed during the 70s.During that time, satellite images were not digitized.Barret [8] was one of the pioneers in developing a method for estimating monthly precipitation, using the visible channel.This method was called "cloud index", and was later improved by Follansbee [9], which included estimations of daily precipitation rates.Almost two decades later, at the end of 1997, the Tropical Rainfall Measuring Mission (TRMM) [10] was launched jointly by NASA (National Aeronautics and Space Administration) and JAXA (Japan Aerospace Exploration Agency), aiming to improve precipitation estimates in tropical and subtropical regions.One of the most successful products generated from this mission is the Multi-Satellite Precipitation Analysis research version (TMPA) and the real-time version of the same product (TMPA-RT).Besides combining precipitation estimates derived from several satellites, the TMPA algorithm [4,11,12] is also able to incorporate observed precipitation data [13].Over the last decade, those retrievals have been greatly improved with different versions of those products.For this study, the version of TMPA launched in 2012 (version 7 or TMPA-V7 hereafter) will be used.
The TMPA product has been used in scientific research and operational activities which lead to outstanding socio-economic gains, such as studies of extreme precipitation events [14][15][16], forecasting of natural disasters [17,18], water resources management planning [19], performance of numerical models [20,21], among others.Due to this, TMPA validation studies have been performed in several regions around the globe, and those results show great agreement between TMPA products and surface data [17,[22][23][24][25][26].Nevertheless, particularly over Brazilian territory, systematic bias are still observed for some precipitation regimes associated with shallow convection systems near the coast of northeastern Brazil (underestimation of precipitation) [27,28].Conversely, in Southern Brazil (close to the border with Argentina and Paraguay), this algorithm overestimates the observed rainfall [20,26].According to Laing and Fritsch [29], one of the largest and most active mesoscale convective complexes (MCCs) in the world is observed in this region.
In the beginning of 2014, the Global Precipitation Measurement (GPM) mission was launched to improve global estimates of precipitation and snow in low and mid latitudes.Moreover, GPM is a natural replacement of the successful TRMM mission [30].The precipitation estimate algorithm created to replace TMPA is the IMERG (Integrated Multi-satellite Retrievals for GPM), made publicly available in the beginning of 2015 in the NASA portal [12].This suite of products, namely IMERG-Early, IMERG-Late and IMERG-Final, is considered the next-generation of satellite-derived precipitation products, since it brings together resources from the existing: (1) TMPA [4], (2) CMORPH (Climate Prediction Center Morphing) [5], and (3) Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks-Cloud Classification System (PERSIANN-CCS) [6,31].Because they are a recent effort, the products generated from GPM require urgent evaluation in order to be available for use, compared to other precipitation estimate products.
On the other hand, the Global Satellite Mapping of Precipitation (GSMaP) product, developed by a consortium of Japanese institutions and operated by JAXA [7,32], also offers a global coverage of rainfall with several versions: the real time version (GSMaP_NRT), the standard version (GSMaP_MVK) and the standard version with gauge correction (GSMaP-Gauge) [33].The basic idea of the GSMaP algorithm is to find the optimal precipitation for which the brightness temperatures (TBs) calculated by the radiative-transfer model (RTM) fit best with observed TBs [7].
In view of the need to evaluate the accuracy of the new suite of GPM products in different regions of the globe, this study aims to assess the IMERG-Final and GSMaP-Gauge retrievals, in order to replace the TMPA research version in several regions with different precipitation regimes in Brazil.This paper is organized as follows: Section 2 provides details of the study area, the criteria used for the division of sub-areas according to precipitation regimes, and the datasets and statistics used; Sections 3 and 4 present the main results of this research and a discussion.The main conclusions are provided in Section 5.

Area of Interest and Characterization of the Precipitation Regimes
Brazil, due to its continental dimensions (8,515,759 km 2 of territorial area), features a great diversity of landscapes, topography, biodiversity and climates, as well as of precipitation regimes.In order to evaluate the effectiveness of satellite-derived precipitation products in the country, it is necessary to contextualize the main precipitation regimes in Brazil.In order to do that, 18 years of precipitation data (1998-2016) from the MERGE product [34] were used.This product combines daily precipitation from rain gauge stations (see observed data section) with the TMPA_RT product.According to this study, the MERGE product has proven to be a valuable analysis tool for model evaluation, outperforming gauge analysis for those regions with low rain gauges density.
Figure 1 shows the spatial distribution of the precipitation climatology over Brazil based on MERGE data.This climatology was calculated for the whole country and divided into grid cells with 2 × 2 degrees.This spatial distribution fits very well with the study of Reboita et al. [35] based on gauge analysis.Considering these results and the Figure 3 of Reboita et al., five different precipitation regimes were identified within Brazilian territory.Region R1, located in southern Brazil, presents year-round well distributed precipitation, and high total precipitation: 1050-1750 mm/year.The main systems influencing this region are cold fronts, mesoscale convective complexes [36], the South Atlantic Convergence Zone (ZCAS) [37] and the low-level jet stream [38].Region R2, with a clear monsoon regime [39], covers most of the Brazilian territory, where the accumulated precipitation is higher during summer (DJF) and lower during winter time (JJA).Region R3, the driest region of the Brazilian territory, is located inland over northeastern Brazil and also presents a maximum of precipitation during summer and a minimum during winter, with totals between 200 and 500 mm/year.The main systems influencing this region are the Intertropical Convergence zone (ITCZ) and upper level cyclonic vortexes [40].Region R4 (located in the northeastern coast of Brazil) presents maximum precipitation in winter and minimum in summer.The main systems are the ITCZ, tropical mesoscale convective systems, the Trade Winds, upper level cyclonic vortexes, easterly waves and sea breeze circulation [41].Region R5 comprises the northern Amazonian region; the main influencing systems are the ITCZ, the tropical squall lines [42] and the trade winds.

Observed Data
The 24-h accumulation of rain gauge data used in this study are received daily, in near real time, by the Centre for Weather Forecasting and Climatic Studies of Brazil (CPTEC/INPE), and cover the period 12 UTC to 12 UTC.Main sources of precipitation data are composed of the global telecommunication system (GTS), the automatic platform for collecting data (PCDs), and the regional meteorological centers in Brazil.Most surface stations are located in eastern Brazil, near the coast.Towards the center of the continent, the network density decreases sharply.A quality control of gauge data is performed in two stages: the first is objective and the second is subjective.The data is verified in real time, that is, at the time of storage in database systems.At this stage, validity, consistency (internal and spatial) and control (temporal and climatological) checks are performed.In the validity check, acceptable values must belong to a predefined interval (or be within a tolerance limit).Internal consistency performs validations of variables focused on internal relationships, considering a single weather station.The limits of the variables (precipitation, in this case) are reevaluated in a spatial consistency process, considering different climatic regions.This process also compares variables from different meteorological stations (for example, the precipitation of a station is compared with the precipitation of other stations within a distance of up to 25 km).The temporal control verifies the differences of the variables over a given period (for example, precipitation is acceptable if the difference of its current value in relation to the previous value does not exceed 175 mm).At the end of the first stage, the data are classified according to quality, by means of a descriptor, whose values are: suspect or correct.In the subjective stage, verification is performed by a meteorologist, who evaluates variables which have been identified as suspicious in the first stage.After the subjective analysis, suspect variables can be classified as valid or invalid.This process was carried out over 35 months, from the 1st of April, 2014, to the 28th of February, 2017; on average, 3400 daily observations were used in this study.

TMPA Products
TMPA is responsible for two products: the research V7 version (TMPA), and the real-time version (TMPA-RT).Both are used extensively by a large community of users [11].The products combine the estimates of precipitation from several satellite sensors.The biggest difference between them is that TMPA (TMPA-V7 hereafter) incorporates rainfall monthly data from the global rain gauge network (GPCC, Global Precipitation Climate Center) to scale the final product.These products are developed with a temporal resolution of 3 h, and a horizontal resolution of 0.25 • , covering most of the globe (between latitudes 50 • N-S).TMPA products were accumulated over 24 h, according to guidelines from the World Meteorological Organization.This is done in order to standardize the time of synoptic observations around the world, according to universal time: the total precipitation from 12 UTC of a given day up to 12 UTC of the following day is used.TMPA data were obtained via ftp from the US National Aeronautics and Space Administration (NASA) (https://disc2.gesdisc.eosdis.nasa.gov/data).In order to use the final version of each product, TMPA-V7 will be used in this comparison.

IMERG Product
The Global Precipitation Measurement Core Observatory Satellite (GPM), launched on the 27th of February, 2014, aims to provide the next generation of precipitation products, continuing with the first-rate products provided by TMPA.Its algorithm, The Integrated Multisatellite Retrievals for GPM (IMERG), is similar to the algorithm of the TMPA products.It was built to calibrate, combine and interpolate satellite-derived precipitations (microwave, infrared) and worldwide observed data.IMERG is executed in near real-time for operational purposes and with two months' delay for the research version.It provides two near real-time precipitation estimates data options: Early and Late.Early provides a quick estimate, with a 4 h lag, taking into account only data which has been collected at that moment.Late has a 12-h lag (after more data has arrived), and is therefore obviously more precise.For the IMERG research product, estimates are combined with monthly observed data (similar procedure applied for TMPA-V7), and made available two months later (called IMERG-Final, hereafter IMERG-F).
IMERG-F product (version V05) used in this study were obtained from ftp://arthurhou.pps.eosdis.nasa.gov/gpmallversions/V05/with a temporal resolution of 30 min and 0.1 • × 0.1 • of horizontal resolution.IMERG-F covers most of the globe: all surface area between latitudes 60 • N and 60 • S, corresponding to 87% of the Earth's surface.According to WMO guidelines, IMERG-F data have also been accumulated over 24 h, as well as TMPA-V7 data.The high spatial and temporal resolution, together with the expressive area of operation, make IMERG-F a potentially valuable tool for the scientific community.In this study, IMERG-F data (the research version) have been used for comparisons with other algorithms.More details on the GPM products and the IMERG algorithm can be found in [12].

GSMaP Product
Development of the precipitation map algorithm, including microwave radiometer/sounder algorithms, has been continued in cooperation with the members of Global Satellite Mapping of Precipitation (GSMaP) project [32] in Japan.Since the GSMaP project targeted the production of the "best" precipitation estimates, and they did not consider real-time operation and/or data availability, JAXA has developed and has operated a global rainfall map production system in near-real-time since October 2008; hourly and 0.1-degree resolution binary data and images are available via the internet (http://sharaku.eorc.jaxa.jp/GSMaP/)four hours after observation.Core algorithms of the system are based on those provided by the GSMaP project; microwave radiometer rainfall retrieval algorithm [7], microwave sounder rainfall retrieval algorithm [43], microwave imager/sounder rainfall retrieval algorithm [44], microwave-infrared (IR) merged algorithm [45] and Gauge calibrated rainfall algorithm [33].
GSMaP-Gauge (hereafter GSMaP-G) is a product that adjusts the microwave-infrared (IR) merged algorithm (hereafter GSMaP_MVK) with the global gauge analysis (CPC Unified Gauge-Based Analysis of Global Daily Precipitation) supplied by NOAA.The product also has a spatial and temporal resolution of 0.1 degree and 1 h.The 24-h accumulation product (12 UTC to 12 UTC) and version 7, released in 2017, is used for this study.

Statistical and Categorical Indexes
Statistical and categorical indexes [46] are used to evaluate the TMPA-V7 (version 7), IMERG-F (version 5) and GSMaP-G (version 7) products.Tables 1-3 give short descriptions of those statistic parameters, while an intensity rain classification based on daily thresholds is presented in Table 4.

Gauge Rain Gauge No-Rain Total
Satellite

Standardization of Data
Data used in this validation study are generated in different formats and spatial resolutions.GSMaP-G, IMERG-F and TMPA-V7 products are regularly spaced, although with degree resolutions of 0.10 for the first two and 0.25 for the third.Observations (OBS) are measured in fixed points (latitude, longitude and precipitation value), not following a regular spatial pattern, which requires a standardization of the dataset.In this study, we chose to evaluate the products of precipitation estimates in a coarser resolution grid (0.25 degrees).Standardization was performed following these steps: (a) Using the position (latitude and longitude) of each station, satellite-based precipitation retrievals are extracted from TMPA-V7, IMERG-F and GSMaP-G products using the nearest neighbor approach (the closest center of the correspondent grid point is selected).This approach is the same as that used in [11] to retain the original retrieved value of each algorithm.In this case, the maximum distance between the center of the grid point and the gauge is approximately seven kilometers for IMERG-F and GSMaP-G, and eighteen kilometers for TMPA-V7 (below the nominal spatial resolution of the respective products); (b) A table is built with the latitude and longitude of the station, observed precipitation and estimated precipitation for the three products following the procedure described in the paragraph above; (c) From this table, and using the same regularly spaced grid as that of the TMPA-V7 (0.25 • × 0.25 • ), three grids with the averages of existing precipitations inside each grid point are calculated for IMERG-F, GSMaP-G and OBS.In the case of the TMPA-V7, the original value is preserved.These values represent the average precipitation at each grid point.Grid points with no existing gauges are flagged as invalid.Additionally, the average of the brightness temperature of GOES-13 channel 4 (10.8 microns) is also performed for those grid points with at least one gauge station.This variable, which represents the temperature of the top of the cloud, is used as a proxy to identify, in a very general way, the mean depth of the clouds; (d) In order to perform a statistically robust study, only grid points with 50% or more of rain gauge data frequency, using the entire time series, were considered.The spatial distribution of points which satisfy this criterion is shown in Figure 2. Table 5 shows the amount of valid grid points per region.

Temporal Evolution
Figure 3 shows the temporal evolution of the daily averages of TMPA-V7, IMERG-F, GSMaP-G observed precipitation (gauges), and brightness temperature from GOES-13 (Geostationary Operational Environmental Satellite) for Brazil and its five precipitation sub-regions.Brightness temperature is used here as a proxy in order to identify cloud top types (cold or warm) in each region.In order to smooth the higher frequencies of the time-series of each variable, a 10-day moving average was applied to all aforementioned variables.In R1 region (Figure 3a), considering the 271 valid grid points, TMPA-V7 and IMERG-F show similar behavior: both algorithms overestimate the accumulated daily precipitation.The overestimation is more evident from 2014 to the end of 2016, compared to that of the rest of the period.During this period, brightness temperature is highly correlated (negatively) with observed and estimated precipitation.The periods with larger overestimation of precipitation correspond to the lowest brightness temperature values.This behavior suggests the presence of deep convection systems (with large amount of ice on its structure), where larger amount of rainfall is estimated than is actually occurring.In this case, both products have the same RMSE value (1.31 mm/day), while the ME associated with TMPA-V7 (0.99 mm/day) is slightly lower than IMERG-F (1.01 mm/day).In the case of the GSMaP-G, the mean value is always below TMPA-V7 and IMERG-F retrievals, and close to observed values.This result suggests that the CPC Unified Gauge-Based Analysis of Global Daily precipitation used to adjust the bias in GSMaP-G (Ushio et al., 2013) is performing better than the monthly accumulation used to scale the NASA final products (TMPA-V7 and IMERG-F).For this region, ME is close to 0 and RMSE is 0.86 mm/day for GSMaP-G.
R2 region (Figure 3b), with 892 valid grid points, is the largest evaluated region.The behavior of maximum precipitation in summer and minimum in winter is estimated efficiently by all products.However, overestimation is also present along the studied period (mainly during summer time) for TMPA-V7 and IMERG-F, while GSMaP represents that peak better.As within R1 region, R2 region is influenced by deep convective systems, and the same behavior was seen: larger precipitation values were observed during periods with minimum brightness temperatures (generally summer months).In this region, RMSE (0.68 mm/day) and ME (0.52 mm/day) values for TMPA-V7 were slightly lower than RMSE (0.73 mm/day) and ME (0.59 mm/day) values from IMERG-F, while GSMaP-G has the lower values for all algorithms: 0.26 mm/day for RMSE and −0.06 mm/day for ME.
R3 region (Figure 3c), with 270 valid grid points, presents the lowest precipitation totals among all regions.All products showed similar behavior and are able to effectively estimate the precipitation regime in most of the analyzed period, except for a general tendency for overestimation during the wet season by TMPA-V7 and IMERG-F, and underestimation by GSMaP-G.However, in terms of ME, IMERG-F and TMPA-V7 have the lowest values, with 0.03 and 0.02 mm/day respectively.GSMaP-G has a negative bias of −0.31 mm/day, due to underestimation during the transition from the wet to the dry season (approximately April-August each year).RMSE values show that IMERG-F is slightly better that GSMaP-G (0.35 an 0.37 mm/day respectively) while TMPA-V7 has the worst performance (0.42 mm/day).
R4 (with 222 valid grid points) present the least accurate estimate of the precipitation regime of all products (Figure 3d).As observed in other regions, the behavior of all products is similar.During most of the period, there is an underestimation in precipitation values from all products, especially in days when brightness temperatures are higher (above 285 K).In this situation, precipitation is associated with systems with warm cloud tops.However, for precipitation systems characterized by cold cloud tops (relative minimum values of brightness temperature), better results are obtained.In this region, TMPA-V7 and IMERG-F show the same RMSE value (0.94 mm/day), but the ME is slightly lower for IMERG-F (−0.49mm/day).For GSMaP-G, RMSE is 0.87 mm/day, the best value among all algorithms, whereas ME is −0.74 mm/day, the largest (negative) bias for all products.
R5 region (Figure 3e), located in the extreme north of the country, presents the lowest coverage of observed data (124 valid grid points).The annual precipitation cycle, with maximum values in March/April and minimum values in October, is relatively well represented, although with some overestimation for TMPA-V7 and IMERG-F and underestimation for GSMaP-G.In this particular region GSMaP-G has the best performance in terms of RMSE: 30% and 18% better than IMERG-F and TMPA-V7 respectively.In this region, RMSE (0.94 mm/day) and ME (0.62 mm/day) values for TMPA-V7 were slightly lower than RMSE (1.09 mm/day) and ME (0.84 mm/day) values from IMERG-F, while GSMaP-G has lower values for all algorithms: 0.77 mm/day for RMSE and −0.61 mm/day for ME.A negative correlation between brightness temperature and the precipitation estimate is quite evident in this region.
Considering the whole Brazilian territory (Figure 3f), results are quite similar to those found in region R2, with maximum precipitation totals in January and minimum totals in July.This occurs due to the largest extent of region R2 in area and valid grid point of observations (about 59% of the total).

Quantitative Precipitation Forecast (QPF)
In this section, quantitative results will be presented for eight precipitation thresholds-0.5,2, 5, 10, 15, 20, 35 and 50 mm, based on a contingency table (Table 3).As with the previous section, only grid points with at least 50% of observed data are considered in this analysis.
Figure 4 shows the Equitable Threat Score (ETS) for both evaluated products.ETS analysis for region R1 (Figure 4a) shows that GSMaP-G presented the best performance for all precipitation thresholds.The lowest ETS values, as expected, are observed during intense precipitation episodes (more than 50mm).In general, the ETS for GSMaP-G is about 20% higher than IMERG-F, and 35% better than TMPA-V7.Considering that ETS measures the fraction of observed grid points for a given threshold that were correctly estimated by a given algorithm, adjusted for hits associated with random chance, this score is showing that daily gauge adjustment used by GSMaP-G is adding some extra value compared with monthly-adjusted algorithms (TMPA-V7 and IMERG-F).Because ETS allows scores to be compared more fairly across different regimes, it is possible to conclude that R1 presents the best performance for all products.
R2 region (Figure 4b) presents ETS values slightly lower than in R1, showing that precipitation estimates in region R1 are more accurate.The GSMaP-G product exhibits better performance for all precipitation thresholds.Above 2 mm threshold, all products show a considerable decrease in performance.In most of the thresholds, the performance of GSMaP-G is about 30% better than IMERG-F, while TMPA-V7 is always on the low end for all thresholds.
Inner northeastern Brazil, represented by region R3 (Figure 4c), shows different behavior compared to that of regions R1 and R2.While GSMaP-G has an almost constant value up to 5 mm, an increase in ETS values from the 0.5 to the 5.0 mm threshold is observed for TMPA-V7 and IMERG-F.However, there is a decrease in performance, as expected.In all thresholds, GSMaP-G is superior to IMERG-F and TMPA-V7.This is more evident for lower rain rates (below 5 mm/day).The performance of all products in this region is considerably lower than that in regions R1 and R2.The Eastern coast of northeastern Brazil, corresponding to region R4 (Figure 4d), shows the worst performance for all products among all regions.The performance of GSMaP-G is higher in this region for all thresholds, with a similar pattern to region R3.This might be attributed to the presence of warm clouds system.Palhiarini and Vila (2017) concluded, using 17 years of TRMM-PR (TRMM-precipitation radar) data, that shallow convection is the predominant cloud type system during the rainy season in this region.The lack of ice in precipitating clouds is this region makes it very difficult for microwave sensors to retrieve rainfall using high frequency channels (Braga, 2014).
In region R5 (Figure 4e), the ETS values for GSMaP-G are higher than those of NASA products.When considering the whole country (Figure 4f), the ETS values, as expected, are higher for GSMaP-G for all precipitation thresholds.
Figure 5 presents the performance diagram [47] of IMERG-F (blue), GSMaP-G (green) and TMPA-V7 (red) products.The circles represent the precipitation thresholds.The smallest circle represents the rain-no rain threshold, and the largest circle, the threshold above 50 mm.This diagram makes it easier to analyze the results, as it represents several dichotomic (yes/no) quality measures simultaneously, such as the POD, BIAS, CSI and FAR.Dashed lines represent BIAS, and solid lines the CSI.Thus, the best estimates are located in the upper right part of the diagram.The CSI for R1 region (Figure 5a) indicates better performance for the GSMaP-G product, followed by the GMP_F, and finally by TMPA-V7 for all thresholds analyzed.CSI for GSMaP-G is around 13% higher than the IMERG-F, and 20% higher than the TMPA-V7.In terms of FAR, it is observed that for thresholds below 2.0 mm, the results are similar among all three products, but for higher thresholds, FAR values for GSMaP-G are significantly lower.When this result is combined with a BIAS score below 0.8, indicates that GSMaP-G is missing high precipitation events in this region, while the relatively higher FAR value for NASA products, combined with BIAS around 1.5 is related with overestimation of the area with heavy precipitation rates (20-50 mm/day).
A similar analysis could be done for R2 (Figure 5b): GSMaP-G has better performance in terms of CSI, followed by IMERG-F and TMPA-V7.However, GSMaP-G is missing moderate and heavy rain events (low FAR and BIAS < 1), while IMERG-F and TMPA-V7 slightly overestimate the area for moderate and light rain rates.
For R3 region (Figure 5c), as was observed for R1 and R2, GSMaP-G exhibits higher CSI values for all thresholds when compared to the other two products.However, the underestimation of the area with rainfall (BIAS < 1) is present in GSMaP-G for all thresholds.The BIAS for IMERG-F and TMPA-V7 are close to 1, which means that the area with precipitation above a given threshold is correctly estimated.Low POD and high FAR values for high rain rates suggest that those high impact events are not correctly placed for NASA products.
Among all the analyzed regions, the R4 region (Figure 5d) shows the lowest performance.The same behavior observed in other regions is observed here.The main difference is the underestimation of the area for IMERG-F and TMPA-V7 for moderate and light rain rates.
R5 region (Figure 5e) has a similar patter than R2, the largest analyzed region.GSMaP-G has better performance in term of CSI, followed by IMERG-F and TMPA-V7.However, GSMaP-G is missing moderate and heavy rain events (low FAR and BIAS < 1), while IMERG-F and TMPA-V7 slightly overestimate the area for all rainfall threshold.
It is not surprising that Brazil, as a whole region (Figure 5d), shows a similar pattern to that of the R2 region.Because the largest number of gauges are found in R2, the weight in the final result is larger than other regions.However, the behavior of the analyzed categorical indexes is quite similar along all different rainfall regimes.

Discussion
The performance of precipitation estimate products, obtained using three state-of-the-art algorithms for GPM era, namely IMERG-F and TMPA-V7 from NASA and GSMaP-G from JAXA, were evaluated in five Brazilian regions with different precipitation regimes, and the whole Brazilian territory, during the period from April 2014 to February 2017.From a broad perspective, and considering the amount of rainfall, GSMaP-G has the lowest ME and RMSE when compared with NASA products for most of the regions, except for R4.This is also true for categorical indexes like CSI and ETS.However, when the area of rainfall above a certain threshold is evaluated, GSMaP tends to miss moderate and heavy precipitation events in almost all areas.These results are in a good agreement with other studies like [48], where IMERG-F overestimates extreme precipitation indices, but GSMaP-G shows a significant underestimation in several basins in China.
In this section, the possible reasons for this behavior will be discussed, considering the characteristics of each database.In this study, gauge bias-adjusted versions were selected as the final products for each algorithm to make a fair comparison.However, while for IMERG-F and TMPA-V7 this procedure is done using monthly totals from GPCC [13], in the case of GSMaP, a daily gauge analysis (CPC Unified Gauge-Based Analysis of Global Daily Precipitation) supplied by CPC/NOAA is used.
In the first case, the hourly accumulated rainfall (or 3-hourly, in case of TRMM-V7) is obtained using the monthly GPCC precipitation gauge analysis (over land) in a three-step process.First, the gauge analysis is adjusted by multiplying the monthly precipitation values with the corresponding month's gridbox climatological adjustment ratios.Second, the multi-satellite estimate is adjusted to the large-scale mean of the gauges.Finally, the adjusted multi-satellite and gauge fields are combined using weighting by inverse estimated error variance [4].In the case of GSMaP-G, the CPC Unified Gauge-Based Analysis of Global Daily Precipitation is applied based on the optimal theory which adjusts the GSMaP-Gauge hourly rain rate, so that the sum of the 24 h GSMaP-G rain rate is roughly same as the gauge measurement where those gauges are available [33].
The observational database used as ground truth, as described in Section 2.2, is composed of the global telecommunication system (GTS), the automatic platform for collecting data (PCDs), and the gauge data from regional meteorology centers in Brazil.Because NOAA receives some of those gauges in real time, they are also included in the CPC Unified Gauge-Based Analysis of Global Daily Precipitation product.In such cases, the observational and the GSMaP-G database are not completely independent.During the period of this study, the mean number of gauges used for validation was around 3400 per day, while CPC Unified Gauge-Based Analysis of Global Daily Precipitation uses approximately 1000 gauges per day (on average).This could explain the lower ME and RMSE of this algorithm compared to those of NASA products for the regions with largest amount of gauges (R1 and R2).However, this fact does not exclude this product from the comparison, because, even considering this limitation, this algorithm shows some characteristics which are interesting to consider: (i) it represents state-of-theart of satellite rainfall retrievals at JAXA (a partner of GPM program) with continuous developments and reprocessing cycles; (ii) it does not reproduce for all times, all regions and all periods, the evolution of the observed precipitation, which means that some degree of independent data remains in the database (Figure 3), and (iii) this gauge-adjusted product is the only one which is available in near real-time (4 days), compared with the two months' latency for IMERG-F and TMPA-V7.This last issue is quite important for some applications, such as hydrological forecasting (dams management) for hydropower and irrigation, and other users where the latency plays a vital role in the decision making process and for which the accuracy of retrieval should be above certain threshold which cannot be reached using the satellite-only versions (real time versions).
In the case of NASA products, the mean number of gauges available in the GPCC database used for bias correction in the final products over Brazil for the period 2014-2017 is around 300.This number is less than 10% of gauge data available for validation (~3400).Even those databases (TMPA-V7 and IMERG-F) are also not completely independent; a larger 'degree of independence' is achieved when compared with GSMaP-G.
Future validation studies should include no gauge-adjusted versions of these algorithms and fully independent observed data (i.e., radar estimates), to come to more conclusive results about the performance of these algorithms.

Conclusions
The performance of three satellite-based rainfall estimation products were evaluated in five Brazilian regions with different precipitation regimes, and the whole Brazilian territory, during the period from April 2014 to February 2017.Generally speaking, all products are able to estimate, with different degrees of accuracy, the levels of precipitation over the Brazilian territory.While overestimations are present in most of the studied regions for NASA products, GSMaP-G tend to slightly underestimate the observed rainfall.The most noticeable estimation errors for all products occur over the eastern coast of northeastern Brazil (region R4), where large underestimation for all products occurred during precipitation episodes caused by warm clouds.The quantitative analysis (ETS and CSI) shows that the GSMaP-G product presents better performance in all regions and all precipitation thresholds, while large underestimation of the area covered with heavy rainfall (rain rates > 100mm/day) is also observed for this algorithm (Figure 5).IMERG-F and TMPA-V7 show similar behavior in terms of CSI, ETS, POD, FAR and BIAS with a better performance for IMERG-F.
This study shows that GSMaP-G and IMERG-F precipitation products exhibit better performance compared to the current TMPA-V7, besides the finer horizontal and temporal resolution of the new generation products.In this context, GSMaP-G and IMERG-F algorithms are a great replacements for TMPA-V7 products in the Brazilian territory, characterized by high density of river basins throughout its territory, where flood and landslide events are common, with negative social and environmental impacts.However, the choice of a given product will depend on the user's needs: GSMaP-G has a lower latency and tends to represent better to total amount of rainfall, while IMERG-F is more accurate for the retrieval of moderate and heavy rainfall events in terms of frequency (area).

Figure 1 .
Figure 1.Spatial distribution of precipitation climatology (1998-2016) based on MERGE data [34] for the five identified regions, for each grid box of approximately 2 degrees.
rain a = Hit b = false alarm E = (a + b) Satellite no-rain c = miss d = correct negative (c + d) Total O = (a + c) (b + d) (a + b + c + d)

Figure 2 .
Figure 2. Spatial distribution of grid points which present rain gauge data frequency of at least 50% in the studied period, and in regularly spaced grids of 0.25 • .

Figure 5 .
Figure 5. Performance diagram [47] summarizing the SR, POD, BIAS, and CSI for regions R1 (a), R2 (b), R3 (c), R4 (d), R5 (e) and BRAZIL (f).Dashed lines represent BIAS scores with labels on the outward extension of the line, while labelled solid contours are CSI.Circles represent the eight precipitation thresholds.The smallest circle represents the rain/no rain threshold (0.5 mm), and the largest circle represents the threshold above 50 mm.

Table 4 .
Rain classification and thresholds.

Table 5 .
Number of valid grid points for each region.