Evaluation of Satellite Precipitation Products for Hydrological Modeling in the Brazilian Cerrado Biome

: This study investigates the applicability of Satellite Precipitation Products (SPPs) in streamﬂow simulations performed in the Brazilian Cerrado biome, which is one of the world’s biodiversity hotspots. Local data from ground observations were used as a reference for evaluating the Tropical Rainfall Measuring Mission (TRMM) Multi-satellite Precipitation Analysis (TMPA) and Integrated Multi-Satellite Retrievals for Global Precipitation Measurement (IMERG). The Soil and Water Assessment Tool (SWAT) was used to simulate the streamﬂow in a subbasin of the Tocantins river basin. Statistical precision metrics showed that both SPPs presented a satisfactory performance for precipitation monitoring on a monthly scale, in which IMERG performed better than TMPA. The Nash–Sutcli ﬀ coe ﬃ cient and Kling–Gupta e ﬃ ciency obtained for both calibration and validation period were greater than 0.82 and 0.79, respectively, demonstrating that both SPPs were able to simulate the hydrological regime adequately. However, the bias indicated that the SPPs overestimated the observed streamﬂow. The r-factor and p-factor values showed that both TMPA and IMERG presented low uncertainty in streamﬂow simulations. SPPs o ﬀ er a great alternative for monitoring the precipitation and hydrological studies in the Brazilian Cerrado biome, and presented better simulation results than rain gauges.


Introduction
Precipitation is one of the main components of the hydrological cycle due to the significant role that it plays in various socio-economic activities. It involves complex physical processes and displays high spatial and temporal variability [1][2][3]. Accurate estimates of precipitation are crucial for a wide range of climate and hydrology applications [4]. Despite its importance, in several locations around the world, the monitoring network of precipitation has limitations related to density, frequency of observations, and infrastructure [2].
The rain gauges provide data for calibration and validation for other precipitation data sources [5]. They are considered the reference data source for precipitation [3] and the most adequate data for hydrological modeling. However, rain gauge measurements have limitations, such as spatial coverage and point measurement, mainly in tropical developing regions, where high variability makes precipitation one of the most difficult weather variables for estimating [1]. Furthermore, only a few countries can afford a dense rainfall monitoring network [6]. Currently, Brazil has an average density of one rain gauge per 720 km 2 [7], which is below that recommended by the World Meteorological

Study Area
The Tocantins-Araguaia river basin (TARB) is the largest basin entirely inserted in the Brazilian territory [6] (Figure 1), accounting for 9% of the country's area [1]. It has a drainage area of the 764,000 km 2 . Furthermore, approximately 64% of the runoff comes from regions where the Cerrado biome is predominant [33]. The TARB is essential for water supply, irrigation, and hydroelectricity generation, being the third-ranked Brazilian basin in installed hydropower, with 13.14 GW comprising 15% of the installed capacity of the country [34]. However, the TARB has the lowest number of rain gauges among the Brazilian basins [35], which undermines the appropriate capture of the spatial distribution of the precipitation and can introduce unknown uncertainties into the estimated spatial precipitation due to the low density of the rain gauge network.
Water 2020, 12, x FOR PEER REVIEW 3 of 19

Study Area
The Tocantins-Araguaia river basin (TARB) is the largest basin entirely inserted in the Brazilian territory [6] (Figure 1), accounting for 9% of the country's area [1]. It has a drainage area of the 764,000 km 2 . Furthermore, approximately 64% of the runoff comes from regions where the Cerrado biome is predominant [33]. The TARB is essential for water supply, irrigation, and hydroelectricity generation, being the third-ranked Brazilian basin in installed hydropower, with 13.14 GW comprising 15% of the installed capacity of the country [34]. However, the TARB has the lowest number of rain gauges among the Brazilian basins [35], which undermines the appropriate capture of the spatial distribution of the precipitation and can introduce unknown uncertainties into the estimated spatial precipitation due to the low density of the rain gauge network. The study area comprises the upstream drainage area of the Serra da Mesa Hydropower Plant (SMHPP) (Figure 1) in the TARB. The Serra da Mesa Basin (SMB) has a drainage area of 51,238 km 2 and is responsible for generating 1275 MW, approximately 10% of the total hydroelectricity in the TARB [36]. The SMHPP reservoir has 1784 km 2 of surface area and is the largest in Brazil in amount of dammed water [37].
According to Koppen's climate classification, the climate of the basin is predominantly Aw (tropical Savanna) with a dry winter [38]. During a long-term climate observed from 1981 to 2010, the average annual precipitation was 1400 mm, the maximum and minimum average annual temperatures were 31 and 18 °C, respectively, and the average annual potential evapotranspiration was 1550 mm. Regarding the topography, the elevation of the SMB ranges between 405 m and 1673 m. The study area comprises the upstream drainage area of the Serra da Mesa Hydropower Plant (SMHPP) (Figure 1) in the TARB. The Serra da Mesa Basin (SMB) has a drainage area of 51,238 km 2 and is responsible for generating 1275 MW, approximately 10% of the total hydroelectricity in the TARB [36]. The SMHPP reservoir has 1784 km 2 of surface area and is the largest in Brazil in amount of dammed water [37].
According to Koppen's climate classification, the climate of the basin is predominantly Aw (tropical Savanna) with a dry winter [38]. During a long-term climate observed from 1981 to 2010, the average annual precipitation was 1400 mm, the maximum and minimum average annual temperatures were 31 and 18 • C, respectively, and the average annual potential evapotranspiration was 1550 mm. Regarding the topography, the elevation of the SMB ranges between 405 m and 1673 m.

Ground-Based Precipitation
Daily precipitation data were obtained from the Brazilian National Institute of Meteorology (INMET) and the Brazilian National Water Agency (ANA-Hidroweb). Precipitation data from 30 rain gauges ( Figure 1) were analyzed for consistency evaluation. The criteria adopted for the selection of rain gauges was that they should have less than 10% of missing data in the precipitation time series from June 2000 to December 2018 [6].

Satellite Precipitation Products (SPPs)
The SPP data were obtained from the NASA Goddard Space Flight Center (https://gpm.nasa. gov), from which the Tropical Rainfall Measurement Mission (TRMM) and the Global Precipitation Measurement (GPM) products were used ( Figure 2). The TRMM is a low-Earth orbit satellite resulting from a joint project between the Japan Aerospace Exploration Agency (JAXA) and the National Aeronautics and Space Administration (NASA) launched in late 1997. The TRMM orbits at an angle of 35 • to Ecuador and provides precipitation products between the latitudes of 50 • N and 50 • S, with temporal resolution varying from three hours to one month and spatial resolutions from 0.25 • to 5 • [11]. TRMM Multi-satellite Precipitation Analysis (TMPA) is based on the calibration of the products from Infrared (IR), Microwave Passive (MP) wavelength measurements of various satellites, and bias correction of the remote sensing precipitation using rain gauges [11]. In this study, the 7th version of the TMPA (3B42V7) dataset, with a spatial resolution of 0.25 • × 0.25 • and daily temporal resolution, was used.

Ground-Based Precipitation
Daily precipitation data were obtained from the Brazilian National Institute of Meteorology (INMET) and the Brazilian National Water Agency (ANA-Hidroweb). Precipitation data from 30 rain gauges ( Figure 1) were analyzed for consistency evaluation. The criteria adopted for the selection of rain gauges was that they should have less than 10% of missing data in the precipitation time series from June 2000 to December 2018 [6].

Satellite Precipitation Products (SPPs)
The SPP data were obtained from the NASA Goddard Space Flight Center (https://gpm.nasa.gov), from which the Tropical Rainfall Measurement Mission (TRMM) and the Global Precipitation Measurement (GPM) products were used ( Figure 2). The TRMM is a low-Earth orbit satellite resulting from a joint project between the Japan Aerospace Exploration Agency (JAXA) and the National Aeronautics and Space Administration (NASA) launched in late 1997. The TRMM orbits at an angle of 35° to Ecuador and provides precipitation products between the latitudes of 50° N and 50° S, with temporal resolution varying from three hours to one month and spatial resolutions from 0.25° to 5° [11]. TRMM Multi-satellite Precipitation Analysis (TMPA) is based on the calibration of the products from Infrared (IR), Microwave Passive (MP) wavelength measurements of various satellites, and bias correction of the remote sensing precipitation using rain gauges [11]. In this study, the 7th version of the TMPA (3B42V7) dataset, with a spatial resolution of 0.25° × 0.25° and daily temporal resolution, was used. The Global Precipitation Measurement (GPM) started with the launch of the GPM Core Observatory in early 2014. GPM is a satellite mission designed to provide a new generation of rainfall observations that are more accurate than the TRMM [10,39]. The GPM Core sensors extend the measurement range attained by the TRMM and provide information globally [39]. Moreover, the Integrated Multi-Satellite Retrievals for GPM (IMERG) provide rainfall and snow information products at 0.1° × 0.1° (spatial) and half-hour (temporal) resolutions to global coverage. IMERG compensates for the low frequency of MP sensors onboard satellites in low-Earth orbits by combining as many satellites as possible with IR sensors in geocentric orbit [12]. The IMERG dataset used in this The Global Precipitation Measurement (GPM) started with the launch of the GPM Core Observatory in early 2014. GPM is a satellite mission designed to provide a new generation of rainfall observations that are more accurate than the TRMM [10,39]. The GPM Core sensors extend the measurement range attained by the TRMM and provide information globally [39]. Moreover, the Integrated Multi-Satellite Retrievals for GPM (IMERG) provide rainfall and snow information products at 0.1 • × 0.1 • (spatial) and half-hour (temporal) resolutions to global coverage. IMERG compensates for the low frequency of MP sensors onboard satellites in low-Earth orbits by combining as many satellites as possible with IR sensors in geocentric orbit [12]. The IMERG dataset used in this study was the IMERG Final Run Version 6 (IMERGF-V6) at a 0.1 • × 0.1 • (spatial) and daily (temporal) resolution.

Hydrological Model Description
In this study, the Soil and Water Assessment Tool (SWAT-http://swat.tamu.edu) in its 2012 version was used to simulate the streamflow of the SMB. The SWAT is a basin-scale, physically based, semi-distributed, long-term, and continuous-time hydrological model that operates on a daily time step. The SWAT was developed by the US Agricultural Research Service (ARS) to predict the impact on water, sediment production, and agricultural chemicals from land use and management practices [40,41].
The SWAT divides the watershed into multiple subbasins connected by the drainage network based on a Digital Elevation Model (DEM). Each subbasin is further divided into Hydrologic Response Units (HRU) that consist of lumped land comprising unique soil characteristics, land use, management, and topography [40,41].
The water balance, which is the driving force behind the SWAT simulation, is computed for each HRU and accumulated to obtain the total for the subbasins [42]. Equation (1) describes the water balance adopted by the swat model through which the main processes of the hydrological cycle, such as surface and subsurface flows, infiltration, percolation, plant uptake, evapotranspiration, and soil moisture, are obtained.
where SW t is the final soil water content (mm H 2 O), SW o is the initial soil water content on day i (mm); t is the time (days), R day,i is the amount of precipitation on day i (mm), Q surf,i is the amount of surface runoff on day i (mm), E a,i is the amount of evapotranspiration on day i (mm), W seep,i is the amount of water entering the vadose zone from the soil profile on day i (mm), and Q gw,i is the amount of return flow on day i (mm). The Potential Evapotranspiration (PET) can be computed in the SWAT by the Priestley and Taylor [43], Hargreaves [44], or Penman-Monteith methods [45,46]. The surface runoff is modeled using either the Green-Ampt method [47] or a modification of the Soil Conservation Service (SCS) curve number method (CN-SCS) [48]. The peak flow is obtained by a modified rational method and the flow is routed through the channel using either a variable storage coefficient method or the Muskingum routing method. In this study, the Penman-Monteith, CN-SCS, and variable storage coefficient methods were applied. The SWAT theoretical documentation (Neitsch et al. [49], Arnold et al. [41], and Srinivasan et al. [50]) provide the model equations and detailed description.

Model Setup
Streamflow simulation in the SWAT requires different types of input data such as the Digital Elevation Model (DEM), soil maps, land use maps, weather, and streamflow data. The river basin delineation, creation of streams, characteristics of the subbasins, and stream network were computed through an Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) DEM with a spatial resolution of 30 m downloaded from the USGS (United States Geological Survey) (Figure 3c).
The soil map ( Figure 3a) with a 1:5,000,000 scale used in this study was derived from the Brazilian Agricultural Research Corporation [51]. The soil types in the SMB (    (Figure 1). In the SWAT model, all weather data were categorized into subbasins according to the "nearest-distance" principle [53]. Therefore, a SWAT subbasin will use the weather data from the weather station that is closest to its centroid. The weather data were used in the model simulations for computing the potential evapotranspiration through the Penman-Monteith method. The partition of the precipitation between infiltration and surface runoff was computed using the SCS curve number method and the variable storage coefficient method was used for routing in the channel.
The grids containing the SPP data (TMPA and IMERG) were delimited using the boundaries of the basin and transformed into points representing the centroid of each pixel. The historical series of daily precipitation obtained at each point of the SPPs were used as input in the SWAT. For the hydrological simulation, 460 and 79 points were used within the basin area from IMERG and TMPA, respectively. Similar to the weather data, both SPP and rain gauge data were categorized into subbasins according to the "nearest-distance" principle.
Due to the presence of the reservoir in the basin, the naturalized streamflow data from the SMHPP were obtained from the National Electric System Operator (ONS) and used for comparison between the SWAT model simulation results. Naturalized streamflows are those that occur if there is no reservoir in a basin [54], and they are derived from an integrated system that incorporates 96 stochastic models for forecasting the streamflows in each hydropower plant [55]. Naturalized streamflows have been used in several studies of hydrological modeling in Brazil, such as those by Cassalho et al. [13], Nóbrega et al. [56], and Oliveira et al. [55].

Calibration and Validation of the SWAT Model
The calibration, validation, and uncertainty analysis were carried out automatically using the program SWAT Calibration and Uncertainty Procedures (SWAT-CUP) [57]. Among different calibration algorithms in SWAT-CUP, the Sequential Uncertainty Fitting algorithm (SUFI-2) was used in this study with Nash-Sutcliffe efficiency (NSE) as an objective function.
The SUFI-2 algorithm [53,58] expresses all uncertainties, such as uncertainty in the conceptual model, drive variables, and measured data, within the parameter ranges, and attempts to capture most of the measured data within the 95% prediction uncertainty (95PPU) of the model in an iterative process [59]. The 95% prediction uncertainty is calculated at 2.5% and 97.5% levels of the cumulative distribution of an output variable obtained by sampling in a Latin hypercube. Both p-factor  (Figure 1). In the SWAT model, all weather data were categorized into subbasins according to the "nearest-distance" principle [53]. Therefore, a SWAT subbasin will use the weather data from the weather station that is closest to its centroid. The weather data were used in the model simulations for computing the potential evapotranspiration through the Penman-Monteith method. The partition of the precipitation between infiltration and surface runoff was computed using the SCS curve number method and the variable storage coefficient method was used for routing in the channel.
The grids containing the SPP data (TMPA and IMERG) were delimited using the boundaries of the basin and transformed into points representing the centroid of each pixel. The historical series of daily precipitation obtained at each point of the SPPs were used as input in the SWAT. For the hydrological simulation, 460 and 79 points were used within the basin area from IMERG and TMPA, respectively. Similar to the weather data, both SPP and rain gauge data were categorized into subbasins according to the "nearest-distance" principle.
Due to the presence of the reservoir in the basin, the naturalized streamflow data from the SMHPP were obtained from the National Electric System Operator (ONS) and used for comparison between the SWAT model simulation results. Naturalized streamflows are those that occur if there is no reservoir in a basin [54], and they are derived from an integrated system that incorporates 96 stochastic models for forecasting the streamflows in each hydropower plant [55]. Naturalized streamflows have been used in several studies of hydrological modeling in Brazil, such as those by Cassalho et al. [13], Nóbrega et al. [56], and Oliveira et al. [55].

Calibration and Validation of the SWAT Model
The calibration, validation, and uncertainty analysis were carried out automatically using the program SWAT Calibration and Uncertainty Procedures (SWAT-CUP) [57]. Among different calibration algorithms in SWAT-CUP, the Sequential Uncertainty Fitting algorithm (SUFI-2) was used in this study with Nash-Sutcliffe efficiency (NSE) as an objective function.
The SUFI-2 algorithm [53,58] expresses all uncertainties, such as uncertainty in the conceptual model, drive variables, and measured data, within the parameter ranges, and attempts to capture most of the measured data within the 95% prediction uncertainty (95PPU) of the model in an iterative process [59]. The 95% prediction uncertainty is calculated at 2.5% and 97.5% levels of the cumulative distribution of an output variable obtained by sampling in a Latin hypercube. Both p-factor (percentage Water 2020, 12, 2571 7 of 18 of observation within the 95PPU band) and r-factor (which corresponds to the average thickness of the 95PPU band divided by the standard deviation of the measured data) were used to evaluate the uncertainty of the simulation. Although there are no exact numbers for what these two factors should be, Abbaspour et al. [59] suggested a p-factor greater than 0.7 (>70%) and an r-factor around 1 for streamflow simulation.
The streamflow simulation was performed for the period from June 2000 to December 2018. In large basins, the water resources management and planning are carried out monthly, and the observed hydrological data, which are then more readily available, are used for calibrating and validating hydrological models [55]. The first two years were used as a warm-up period to reduce the uncertainties regarding the initial conditions of the surface domain [60]. The period from January 2002 to December 2010 was used for calibration. The initial ranges of the parameters used in calibration were selected based on previous studies conducted in the Cerrado biome, such as Oliveira et al. [27] and Rodrigues et al. [61]. Subsequently, the parameters were adjusted for the study area through two iterations with 500 simulations each through the SUFI-2 algorithm implemented in SWAT-CUP. The validation consisted of running the model from January 2011 to December 2018 using the parameters adjusted in the calibration period.

Meteorological and Hydrological Evaluation
The performance of the SPPs was divided into precipitation assessment (comparison with rain gauges) and hydrological assessment (comparing the validation of SWAT streamflow simulations and observed streamflow). As the pixel-to-pixel approach requires a high density of rain gauges [62], uncertainties may arise from the interpolation of data due to the low density of rain gauges [63]. Furthermore, the spatial resolutions of TMPA and IMERG are different, which can affect the comparison. For this reason, the point-to-pixel approach was used to compare the precipitation rain gauges against SPPs in this study [62][63][64][65]. In this approach, the rain gauges closest to the centroid of the SPP grid pixel were compared with SPPs on a daily and monthly basis. The performance of the SPPs was assessed through statistical metrics (Table 1), including the root mean square error (RMSE), percentage bias (Pbias), correlation coefficient (r), and Kling-Gupta efficiency (KGE) [66]. Table 1. Statistical metrics for hydrological and meteorological assessment.

Statistical Metrics Equation Perfect Value Unit
Root mean square error O are the average of the estimated and observed variable, respectively; P i and O i are the i th estimated and observed variable, respectively; n is the number of observations; and σ p and σ p are the standard deviation of the estimated and observed variable, respectively.
The hydrological assessment of the SPPs was carried out by comparing the SWAT streamflow simulation using the different precipitation inputs (rain gauges, TMPA, and IMERG). In addition to the uncertainty analysis performed using the p-factor and r-factor described in Section 2.5, the Pbias, NSE, and KGE (Table 1) were adopted to evaluate the model's performance regarding the calibration and validation periods. The evaluation criteria considered in this study was proposed by Moriasi et al. [67]: "very good" (NSE > 0.75; PBIAS < ±10), "good" (0.65 < NSE < 0.75; ±10 < PBIAS < ±15), "satisfactory" (0.50 < NSE < 0.65; ±15 < PBIAS < ±25), and "unsatisfactory" (NSE < 0.50; PBIAS > ±25). In addition, the KGE closest to 1 indicates the best fit of the model [66]. Figure 4 shows the box plots of the statistical metrics as well as the points representing the result of the comparison between SPPs against rain gauge over the SMB on a daily and monthly basis. Overall, IMERG and TMPA presented similar results at both time scales. The average Pbias indicates that IMERG presented more overestimation than TMPA on a daily (Pbias of −3.13% and −0.34%, respectively) and monthly time scale (Pbias of −3.56% and −0.75%, respectively). The highest overestimations for IMERG and TMPA were 40.20% and 37.80%, respectively, at both time scales. On the other hand, the underestimation reached 28.7% and 33.3% for IMERG and TMPA, respectively.  Figure 4 shows the box plots of the statistical metrics as well as the points representing the result of the comparison between SPPs against rain gauge over the SMB on a daily and monthly basis. Overall, IMERG and TMPA presented similar results at both time scales. The average Pbias indicates that IMERG presented more overestimation than TMPA on a daily (Pbias of −3.13% and −0.34%, respectively) and monthly time scale (Pbias of −3.56% and −0.75%, respectively). The highest overestimations for IMERG and TMPA were 40.20% and 37.80%, respectively, at both time scales. On the other hand, the underestimation reached 28.7% and 33.3% for IMERG and TMPA, respectively. The magnitude of error between SPPs and rain gauges presented by the RMSE showed similar values, mainly daily time scale, in which the average RMSEs for IMERG and TMPA were 12.65 and 12.54 mm, respectively. However, a better performance for the average RMSE was obtained on a monthly time scale by IMERG (58.09 mm) in comparison to TMPA (59.61 mm). In addition, the coefficient of correlation (r) highlighted the greater performance of IMERG. Both TMPA and IMERG showed a lower correlation with gauges on a daily time scale. Despite varying at similar magnitudes, the monthly average r value of IMERG was greater than TMPA, being 0.91 and 0.90, respectively. As a result, the precipitation derived from IMERG presented a slightly better agreement with rain gauges than TMPA. For KGE, the results ranged from 0.09 to 0.24 for IMERG, and from 0.09 to 0.22 for TMPA, when the comparison was performed on a daily scale. Similar to the r values, average KGE values increased on a monthly scale, ranging from 0.50 to 0.92 for IMERG and from 0.48 to 0.92 for TMPA. The average KGE values of IMERG were slightly better than TMPA. Table 2 presents the parameters that were used in the SWAT model with their respective initial ranges, which were selected from the literature review of hydrological modeling studies in the Brazilian Cerrado biome [27,61], and the range of the parameters obtained by SUFI2. It is interesting to highlight that the simulations driven by IMERG and rain gauges resulted in similar parameter values. A detailed description of the parameters used in this study can be obtained in Neitsch et al. [49]. The magnitude of error between SPPs and rain gauges presented by the RMSE showed similar values, mainly daily time scale, in which the average RMSEs for IMERG and TMPA were 12.65 and 12.54 mm, respectively. However, a better performance for the average RMSE was obtained on a monthly time scale by IMERG (58.09 mm) in comparison to TMPA (59.61 mm). In addition, the coefficient of correlation (r) highlighted the greater performance of IMERG. Both TMPA and IMERG showed a lower correlation with gauges on a daily time scale. Despite varying at similar magnitudes, the monthly average r value of IMERG was greater than TMPA, being 0.91 and 0.90, respectively. As a result, the precipitation derived from IMERG presented a slightly better agreement with rain gauges than TMPA. For KGE, the results ranged from 0.09 to 0.24 for IMERG, and from 0.09 to 0.22 for TMPA, when the comparison was performed on a daily scale. Similar to the r values, average KGE values increased on a monthly scale, ranging from 0.50 to 0.92 for IMERG and from 0.48 to 0.92 for TMPA. The average KGE values of IMERG were slightly better than TMPA. Table 2 presents the parameters that were used in the SWAT model with their respective initial ranges, which were selected from the literature review of hydrological modeling studies in the Brazilian Cerrado biome [27,61], and the range of the parameters obtained by SUFI2. It is interesting to highlight that the simulations driven by IMERG and rain gauges resulted in similar parameter values. A detailed description of the parameters used in this study can be obtained in Neitsch et al. [49]. The results of the statistical metrics used to evaluate SWAT performance are shown in Table 3. The criteria presented by Moriasi et al. [67] were used to interpret the SWAT's performance. Regarding the NSE values, the performance of the model forced with rain gauges (ground precipitation), IMERG, and TMPA was considered "very good" for both calibration and validation. In terms of Pbias values, simulations driven by the SPPs performed better than the simulations driven by rain gauges. Both calibration and validation periods driven by SPPs presented performances classified as "good", except for the validation using IMERG that was classified as "very good". On the other hand, simulations based on rain gauges presented inferior performances. Although the simulation driven by rain gauges was classified as "good" in the calibration, the performance of the model in the validation period was lower and classified only as "satisfactory" (Pbias of −23.6%). As a result, the SWAT showed a more significant overestimation of the observed data when driven with rain gauges.

Monthly Streamflow Simulation Using SPP and Rain Gauges
KGE results closest to 1 indicate the perfect adjustment between observed and simulated data. The simulations with TMPA showed better performance both in the calibration and validation periods ( Table 3). The simulation driven by IMERG presented similar KGE values for calibration and validation, whereas simulations forced by rain gauges obtained higher KGEs in calibration than in the validation period (0.82 and 0.75, respectively).
Through the p-factor and r-factor (Table 3), it is possible to observe that all simulations presented low uncertainties according to the criteria suggested by Abbaspour et al. [59]. The results showed that the simulation using TMPA presented more reliable results than other precipitation inputs, whereas the validation of the simulation using rain gauges presented the highest uncertainty among the inputs, in which the p-factor was slightly lower than 0.7.
Although IMERG has higher statistical coefficients, the uncertainty analysis and scatters plots ( Figure 5) indicate a better performance of TMPA for water resource management concerning recession streamflow. Figure 5 shows that the simulation based on IMERG underestimates peak streamflow, indicating that the use of this product in flood mitigation studies could lead to errors.
Through the p-factor and r-factor (Table 3), it is possible to observe that all simulations presented low uncertainties according to the criteria suggested by Abbaspour et al. [59]. The results showed that the simulation using TMPA presented more reliable results than other precipitation inputs, whereas the validation of the simulation using rain gauges presented the highest uncertainty among the inputs, in which the p-factor was slightly lower than 0.7.
Although IMERG has higher statistical coefficients, the uncertainty analysis and scatters plots ( Figure 5) indicate a better performance of TMPA for water resource management concerning recession streamflow. Figure 5 shows that the simulation based on IMERG underestimates peak streamflow, indicating that the use of this product in flood mitigation studies could lead to errors. The results depicted in Figure 6 show that the recession periods in the simulations driven by TMPA were better represented than both IMERG and rain gauges. This fact becomes clearer in the monthly streamflow duration curve (Figure 7), in which the streamflows with exceeding frequencies (duration) above 70% using TMPA were closer to the observed streamflow than other rainfall sources. The results depicted in Figure 6 show that the recession periods in the simulations driven by TMPA were better represented than both IMERG and rain gauges. This fact becomes clearer in the monthly streamflow duration curve (Figure 7), in which the streamflows with exceeding frequencies (duration) above 70% using TMPA were closer to the observed streamflow than other rainfall sources.

Meteorological Evaluation
The critical aspect of the SPP evaluation was the difference in spatial resolution between SPPs and rain gauges. Although Tan et al. [62] found no difference between the two approaches when comparing point-to-pixel and pixel-to-pixel in a basin located in Malaysia, the point-to-pixel approach may have been compromised due to the spatial difference between precipitation data sources and the low density of the rain gauges in the SMB, as shown in Figure 2. Furthermore, since satellite measurements represent the average value within a pixel observed in the atmosphere, whereas rain gauges generally collect point precipitation observed on the ground [62], the limitations of point-to-pixel can increase these differences [68].
RMSE, PBias, KGE, and r statistic metrics were used to assess the ability of TMPA and IMERG to represent the rainfall. Overall, both IMERG and TMPA were able to estimate rainfall in the Cerrado biome region on a monthly time scale more accurately than on a daily scale (Figure 4). Since the final versions of both SPPs were used in this study, the monthly bias corrections applied to these products before being made available may explain the best performance when used on a monthly scale. For statistical metrics referring to bias and magnitude of error, the behavior was similar to the daily data.
Gadelha et al. [7] evaluated IMERG over Brazil on a monthly basis and obtained r values higher than 0.80 in 90% of the cells on a monthly time scale, whereas a large number of cells presented r values ranging from 0.1 to 0.6 on a daily scale in central-western Brazil, where the SMB is located. The results confirmed that the use of longer time averages such as a monthly scale are more representative of the comparison between SPPs and rain gauges in Brazil. Melo et al. [15] also obtained better statistical metrics for TMPA over Brazil when evaluating on a monthly scale. Better results are expected on a monthly time scale since both IMERG and TMPA are corrected with ground precipitation data to remove monthly bias [11].
Despite improvements presented from the removal of bias, the results of Pbias for both SPPs showed an overestimation of the observed precipitation in both time scales. Similar results were obtained in other regions of the world. Su et al. [69], in a study performed with IMERG, found a slightly higher overestimation of the precipitation over Mainland China. Results obtained by Rozante et al. [70] indicate IMERG and TMPA products overestimate precipitation over Brazil.
Although TMPA presented an average Pbias value closer to zero, and showed better performance than IMERG, TMPA also showed an overestimation of precipitation over the SMB. Other studies also point to the overestimation of the precipitation estimated from TMPA. Falck et al. [1] observed overestimations of precipitation obtained from TMPA when compared to rain gauges over the Tocantins-Araguaia river basin. A similar result was obtained by Melo et al. [15] in an assessment of the general quality of TMPA 3B42 (versions 6 and 7, available at https://disc.gsfc.nasa.gov/) estimates over Brazil. The authors found a high level of uncertainty in the estimates of the precipitation in central-western Brazil. Despite the improvements in the 6th version compared to the 5th, it should be noted that TMPA still presented difficulties in estimating the precipitation of the Cerrado biome region. RMSE, r, and KGE indicate that IMERG performed slightly better than TMPA in the SMB. This result can be attributed to the increasing number of passive microwave samples for measurements compared to TMPA [39]. Microwave passives samples are more accurate than infrared, and they are visible because of their sensitivity to the concentrations of ice particles or droplets associated with precipitation [71]. In addition, IMERG is more sensitive in light rainfall than TMPA, since the DPR in GPM has a sensitivity of 0.2 mm h −1 in the Ka-band, whereas the Precipitation Radar (PR) in the TRMM is 0.5 mm h −1 . Furthermore, according to Prakash et al. [4], the DPR can detect and estimate extreme precipitation more precisely than PR. The best performance demonstrated by IMERG rather than TMPA was also observed in other studies [8,63,69]. In Brazil, Rozante et al. [70] also showed that IMERG presented results that were superior to TMPA.
The r and KGE metrics showed similar results for IMERG and TMPA. However, a greater correlation was obtained between IMERG and the observed precipitation. Thus, since the TMPA data were interrupted in 2019, IMERG has presented as a potential substitute for satellite precipitation monitoring in the Cerrado biome.

SWAT Performance Evaluation
Based on the statistical metrics NSE and Pbias, IMERG performed better among the precipitation data sources. The results obtained in this study were more accurate than those found by Monteiro et al. [6] in the Tocantins river basin, in which the SWAT was driven using remote sensing products from climatic reanalysis, Climate Forecast System Reanalysis (CFSR), and ERA-Interim. Both TMPA and IMERG performed better than the reanalysis products for the validation period.
The NSE values (Table 3) indicate that both IMERG and TMPA were able to simulate streamflow in the SMB. However, greater NSE values were obtained from IMERG. As in the Cerrado biome, studies highlighted the hydrological relevance of IMERG as well as of TMPA. Le et al. [8] simulated the monthly streamflow in six representative river basins in Vietnam, covering six sub-climate zones, using the SWAT and the SPPs TMPA, IMERG, Climate Hazards Group Infrared Precipitation with Station (CHIRPS), and Precipitation Estimation form Remotely Sensed Information using Artificial Neural Networks (PERSIANN). The median NSEs obtained for all basins were 0.77 and 0.74, using IMERG and TMPA, respectively. Jiang et al. [72] performed the hydrological simulations using IMERG in comparison with TMPA over the mid-latitude humid Mishui basin in southern China. They concluded that, overall, IMERG demonstrated the best performance (NSE of 0.78), presenting a potential alternative for water resource planning and management.
Hydrological simulation with the three products overestimated the streamflow presenting negative Pbias values. The inferior performances of the simulations were observed when rain gauges were used. IMERG performed better during the calibration period, whereas the results were close to those obtained with TMPA during the validation. TMPA was able to better simulate the minimum streamflows (Figures 6 and 7) and peak streamflows ( Figure 5), and thus was more suitable for the calculation of minimum streamflows (which are very important for water use in Brazil) as well as flood studies. Furthermore, since KGE offers interesting results in model performance evaluation because it uses correlation, variability, and bias [73], its values showed the best SWAT performance when coupled with TMPA.
Overall, SWAT streamflow simulations using SPPs and rain gauges presented satisfactory performances regarding uncertainty analysis. Zhu et al. [74] highlighted that uncertainty analysis is necessary to evaluate the hydrological applications of SPP datasets, since the influence of parameter uncertainty on model simulation may be greater than the effects of input data accuracy. Despite presenting adequate r-factor values (Table 3), the simulation driven by rain gauges presented higher uncertainty when compared to the SPPs, presenting a p-factor slightly lower than 0.7 for the validation period. According to Cassalho et al. [13], the local characteristic of rain gauges often limits their ability to represent the spatial structure of precipitation events. Therefore, the low density of rain gauges may be the cause of the greater uncertainties in the simulations [75].
The p-factor values for the simulations driven by TMPA were slightly larger than other precipitation sources (Table 3). It is possible to assume from the p-factor values that the 95PPU band of the simulation using TMPA (Figure 6) was capable of capturing 88% and 78% of the observed streamflows for calibration and validation, respectively. Furthermore, the r-factor values both for calibration and validation (0.84 and 0.80, respectively) were lower than those obtained with other precipitation data. Consequently, a narrower 95PPU band was generated, and less uncertainty in the fitted parameters was obtained from TMPA ( Figure 6).
A higher p-factor and consequently a larger width of 95PPU could be achieved if the r-factor were high. However, greater uncertainties in outputs may arise. Hence, a balance should be reached between the two factors [59]. In this sense, a study conducted by Silva et al. [76] in three basins in the Brazilian Cerrado biome obtained larger uncertainties in streamflow simulations using the SWAT driven by rain gauges. In the validation period, the r-factor values were 1.74, 2.35, and 3.44, whereas p-factor values were 0.87, 0.86, and 0.81, respectively [76]. This result emphasizes the reliability of SPPs for hydrological simulation in the SMB when compared with rain gauges.
The 95PPU band from the simulations driven by IMERG was capable of capturing 73% and 76% of the observed values for calibration and validation, respectively (Table 3), and presented greater uncertainties than TMPA. Such uncertainties were greater in the simulation of the recession phase of the hydrographs (dry periods), both in calibration and validation periods ( Figure 6). It is possible to observe that the 95PPU band did not include the recessions that occurred in the years 2015 and 2016 the observed data. In these years, Junqueira et al. [77] observed one of the most severe droughts in the basin for the 1987-2017 period, which may have affected SWAT performance.
An anomaly in the simulations driven by SPPs was observed in October 2006 ( Figure 6). Since an anomaly did not occur in the simulation driven by rain gauges, and the only difference between the model setups was the precipitation data inputs, this occurrence may have been due to false alarms (i.e., when there is no precipitation recorded in rain gauges, but the satellite product records a value). As noticed by Prakash et al. [4] in southeast peninsular India, and Gadelha et al. [7] in Brazil, false alarms are probably caused by low numbers of rainy days in certain periods of the year, a situation that is normal from May to September (dry season) in the SMB.
IMERG offers a great potential for monitoring rainfall and hydrological studies in the Brazilian Cerrado biome, and is an alternative to TMPA. The high spatial resolution provided by SPPs is a significant improvement for hydrological simulations in the region with a deficient ground base rainfall monitoring. Furthermore, an improvement in IMERG data for hydrological studies can be achieved with the application of rainfall data merging methods. Neirini et al. [78] compared two non-parametric rainfall data merging methods for optimizing the hydrometeorological performance of SPPs over a tropical watershed and obtained improvements in hydrological performance over the original SPP. Thus, it is expected that the rainfall data merging methods will increase the performance of IMERG in hydrological applications and the reliable usage in the management of water resources in the Cerrado biome.

Conclusions
The low density of rain gauges used for validating SPPs may have been one of the reasons for some errors obtained in the study area. A greater number of gauges would provide information for a more adequate assessment of the ability of SPPs to capture the spatial variability of precipitation in the study area. Both SPPs showed similar performances in monitoring rainfall. However, the statistical metrics highlighted a slightly better performance for IMERG on daily and monthly time scales. The bias removal performed in both the final TMPA and IMERG versions was the main reason for the strong agreement between SPPs and rain gauges explained by KGE and r.
Overall, SPPs and rain gauges were able to simulate the hydrological regime in the SMB. The simulations driven by rain gauges showed higher overestimations than SPPs. The better performance of SPP products is associated with the finer spatial resolution in precipitation measurements, since the rainfall monitoring in the ground is sparse in the SMB. TMPA showed less uncertainty in the hydrological simulations (p-factor and r-factor) and more adequately captured hydrological behavior than IMERG. IMERG showed less ability than TMPA in simulating peak streamflows, which may compromise its application for some purposes such as flood forecasting. The rainfall data merging methods may assist in overcoming this limitation and could improve SPP performance.
IMERG demonstrated a strong potential for replacing TMPA in the estimation of precipitation over the SMB since TMPA was discontinued in 2019. In addition, SPPs present a potential application for hydrological modeling and water resource management in the Brazilian Cerrado biome.