1. Introduction
Climate change has significantly altered rainfall regimes at local, regional, and global scales [
1]. These alterations manifest in the distribution and intensity of precipitation, directly affecting ecosystem processes, increasing environmental vulnerability, and impacting human activities [
2,
3]. Accurate and reliable precipitation estimation, as a key meteorological variable, is therefore essential for understanding and predicting these climate variations [
4].
Historically, ground-based meteorological stations have been the sole source of precipitation data for studies and forecasting. However, the scarcity of monitoring stations, limited accessibility, inconsistent data quality, lack of extreme event records, and low spatial and temporal coverage have constrained the availability of reliable data for integrated water resources management [
5,
6,
7,
8].
Currently, the availability and continuity of satellite-based precipitation records are transforming the way scientific and societal issues related to this variable are addressed, due to their rapid and easy accessibility through various data portals [
9]. Several climate datasets have been developed at different spatiotemporal scales based on in situ observations. For instance, the Global Historical Climatology Network (GHCN) is an integrated database with approximately 31,000 stations and observations spanning the entire 20th century [
10]. Satellite-based algorithms have enabled significant advances in estimating climate variables such as precipitation, temperature, and albedo, especially in data-scarce regions. However, these are indirect estimates that rely on cloud-top properties, utilizing infrared (IR) algorithms, and cloud liquid and ice content, utilizing polar microwave (PM) algorithms [
11]
Quantifying the uncertainty and accuracy associated with each estimate permits users to infer the reliability of satellite precipitation products (SPPs) and improve their applicability across disciplines. Errors are assessed through validation studies that compare satellite-based precipitation estimates with ground truth, using rain gauges or radar observations [
12].
Integrated Multi-satellite Retrievals for GPM (IMERG) [
13] and Climate Hazards Group InfraRed Precipitation with Station data (CHIRPS) are two widely used SPPs [
14]. These tools have proven valuable for supplementing sparse ground station networks, especially in remote or poorly monitored areas, even though each product has advantages and limitations depending on its spatial and temporal resolution. Thus, combining the strengths of each product can enhance their overall performances [
15,
16]. CHIRPS is a high-resolution, quasi-global precipitation dataset spanning over 35 years, developed by the Climate Hazards Group at the University of California, Santa Barbara. It merges infrared satellite data with ground-based observations to generate accurate precipitation estimates in regions with limited station coverage [
14]. IMERG, on the other hand, is a global precipitation dataset produced through the integration of multiple satellite sensors, developed as part of the Global Precipitation Measurement (GPM) mission, which is a collaboration between NASA and the Japan Aerospace Exploration Agency (JAXA) [
13].
These products have been evaluated in various studies across different regions and scales. In China, daily and annual precipitation from nine satellite and reanalysis datasets were assessed between 2000 and 2020. The results indicated that CHIRPS performed best in the arid and semi-arid areas of the study region [
17].
In South America, Benitez et al. [
18] assessed the ability of satellite data to estimate rainfall across the Southeastern South America (SESA) region, covering Argentina, Uruguay, Brazil, and Paraguay. They compared four satellite products—IMERG V.06 Final Run, PERSIANN, PERSIANN CCS-CDR, and PDIR-NOW—against observations from 118 meteorological stations from 2001 to 2020. The results showed that IMERG and CCS-CDR better captured precipitation patterns at annual and seasonal scales. All products had difficulties estimating winter and summer rainfall, and accuracy was generally higher in humid regions and lower in dry zones. Although IMERG showed the highest accuracy (with correlation coefficients up to 0.95), all products struggled to capture extreme events and data-sparse areas.
In Bolivia, Mattos et al. [
19] compared CHIRPS and IMERG satellite estimates with rain gauge measurements from 2002 to 2020 across three distinct topographic zones—highlands, valleys, and lowlands. IMERG performed better in detecting precipitation, particularly in the highlands, whereas CHIRPS provided more accurate precipitation measurements across all regions, with lower random errors and relative biases below 10%. IMERG, however, tended to overestimate rainfall, reaching up to 75% overestimation in the Bolivian Altiplano [
19].
In Perú, Salas et al. [
20] evaluated the performance of GPM IMERG_F v6 by comparing monthly satellite rainfall estimates with data from eight SENAMHI stations in the Madre de Dios basin. Using statistical techniques such as Nash–Sutcliffe Efficiency (NSE) and Root Mean Square Error (RMSE), the authors found that, while satellite data are generally useful, they can exhibit over- or underestimation depending on the season, with an average RMSE of 146.48 mm. In some cases, such as the Iñapari station, NSE values fell below 0.65, indicating poor model performance in specific contexts. The study also highlights the benefits of combining satellite data with data from other sources to optimize water management and evaluate climate phenomena in areas with limited infrastructure.
In Ecuador, Ballari et al. [
21] validated monthly precipitation estimates from the TRMM 3B43 product using 14 rain gauges across Ecuador’s coastal plains, the Andes, and the Amazon. The study underlined the importance of rainfall monitoring for water availability, especially in mountainous areas with sparse measurements. Results showed that TRMM 3B43 captured precipitation seasonality well, with better performance in coastal and Amazon regions than in the Andes. The authors recommended further calibration and bias correction for satellite data in ungauged catchments.
Recently, Huber et al. [
22] compared gridded global datasets of temperature and precipitation (IMERG, CHIRPS, ERA5, GLDAS) with data from a local automatic weather station network initiated in 2014. The findings emphasize that, while satellite products can help fill data gaps, their performance varies depending on location and precipitation type. IMERG and CHIRPS were found to be more effective under certain conditions, yet the lack of in situ data in regions such as the Amazon limits the accuracy of satellite estimates. Expanding the ground station network is therefore essential in improving validation and supporting water planning and climate monitoring in Ecuador.
The above studies demonstrate that, while satellite precipitation products are valuable tools for rainfall estimation, their performances depend on factors such as topography, station density and distribution, climate conditions, cloud characteristics, and temporal resolution. Therefore, evaluating their performances in different geographical and hydrological contexts remains essential.
The Ambato River Basin, located in Ecuador’s Sierra region, is part of the upper Pastaza River system. It plays a critical role in agricultural production in Tungurahua Province and is home to approximately 75% of the province’s population. The basin faces high water demand and contributes significantly to downstream pollution, affecting neighboring catchments such as the Patate River and tributaries of the Pastaza [
23]. In 2013, the provincial government of Tungurahua deployed a network of automatic weather stations to monitor key meteorological variables and provide reliable data for risk prevention and water planning. However, complex topography leads to high climatic variability, and the existing stations are not sufficient to fully capture local precipitation dynamics [
24,
25].
In this context, the present study aims to evaluate the accuracy of satellite precipitation products CHIRPS and IMERG in the Ambato River Basin. The goal is to assess detection capacity using categorical and continuous statistics, as well as to calculate accuracy indicators through bias decomposition metrics to characterize the spatiotemporal rainfall patterns in the basin. This will contribute to improving the quality of available data for integrated water resources management and enhance our understanding of SPP applicability in high Andean catchments.
3. Results
3.1. Daily Rainfall Detection
Across the basin, IMERG detects daily rainfall more consistently than CHIRPS at all gauges. POD values for IMERG were around 0.4 at every station (often >0.4), whereas CHIRPS remains <0.3 in most cases; FAR is comparable between products (≈0.28;
Table 2).
Spatially, the strongest agreement for IMERG occurs at the eastern station (P. Fermín Cevallos) and the northwestern station (Chiquiurco). In the northwestern sector (Mula Corral, Chiquiurco), CHIRPS shows the weakest daily rainfall detection, while IMERG still outperforms CHIRPS though with more moderate skill than at the eastern/northeastern sites. The northern station (Quisapincha) exhibits intermediate performance, with IMERG consistently ahead of CHIRPS.
The frequency bias further indicates systematic under-detection by CHIRPS (FBI < 1 across most stations), whereas IMERG is closer to the ideal (FBI → 1); for example, CHIRPS attains its highest FBI at P.F. Cevallos (~0.68) yet remains below unity. HSS follows the same pattern, with IMERG higher at nearly all gauges. Overall, these daily occurrence results point to basin-wide superiority of IMERG and limited categorical skill for CHIRPS, particularly in the northwest (
Figure 1;
Table 2).
3.2. Daily Precipitation Totals (Continuous Validation)
At the daily scale, IMERG and CHIRPS show broadly comparable performance, with IMERG exhibiting a slight advantage in bias and index of agreement (
d), and marginally higher MAE than CHIRPS (
Table 3). IMERG tends to overestimate daily totals on average, whereas CHIRPS generally underestimates them.
Spatially, the largest daily discrepancies for both products occur at the northwestern station Chiquiurco, reflected in elevated MAE and RMSE. In the north sector (Quisapincha), performance is intermediate; IMERG achieves the highest d observed across the network at Quisapincha (≈0.65). The central western station Calamaca and the eastern station P.F. Cevallos show comparatively better agreement patterns, consistent with the station-wise means.
Bias results indicate contrasting tendencies: IMERG averages bias ≈ 1.13, i.e., an overestimation of ~13% in daily totals, while CHIRPS averages bias ≈ 0.84, i.e., an underestimation of ~16%. Station-wise, IMERG is close to unity at Mula Corral and Chiquiurco (≈0.95 and ≈0.86, northwest), whereas CHIRPS shows pronounced underestimation across sites; at Mula Corral (northwest) its bias is ~0.43
IMERG provides slightly higher daily agreement (higher d) but with a tendency to overestimate amounts, while CHIRPS yields slightly lower error magnitudes on average (RMSE) yet systematically underestimates daily totals; this is particularly evident at the northwestern station Chiquiurco and Mula Corral.
3.3. Daily-Scale Bias Decomposition
The bias decomposition analysis at daily time scale for the IMERG and CHIRPS products was conducted to evaluate their detection performance, as well as the magnitude of undetected precipitation and false detections. The overall bias was disaggregated into three components: Hit Bias (HB), Miss Bias (MB), and False Bias (FB) (
Table 4).
Network-wide, CHIRPS exhibits a lower HB on average (+14.2%) than IMERG (+18.5%), indicating less overestimation on coincident rainy days. In contrast, MB is substantially less negative for IMERG (≈−43.3%) than for CHIRPS (≈−62.9%), implying that IMERG misses a smaller fraction of gauge-observed daily rainfall. For FB, CHIRPS is slightly lower on average (38.25%) than IMERG (39.17%), indicating fewer false-positive rainfall detections.
At the station scale, patterns align with basin-relative location. On the eastern side (P.F. Cevallos), both products exhibit their largest Hit Bias (HB; CHIRPS 51.42%, IMERG 39.63%) together with elevated False Bias (FB; CHIRPS 92.99%, IMERG 69.73%), indicating strong overestimation on coincident wet days and a high frequency of false-positive rainfall detections. In the northwestern sector (Mula Corral, Chiquiurco), CHIRPS shows the most negative Miss Bias (MB; down to −67.97% at Mula Corral and −66.95% at Chiquiurco), evidencing substantial missed rainfall, while FB remains low (≈15%). IMERG in this sector reduces the magnitude of MB (≈−43% at both stations), with modest positive HB (7–13%) and low-to-moderate FB (≈20%). In the northeast (Quisapincha, Ambato Airport), IMERG attains its lowest FB at Quisapincha (16.33%) but records its most negative MB within its range at Ambato Airport (−49.36%); CHIRPS also underestimates at both sites (MB −62.91% and −62.5%, respectively), with FB of 17.99% and 49.42%. In the central west (Calamaca), IMERG presents one of its least negative MB values (−41.3%), consistent with comparatively better daily detection there; CHIRPS shows moderate HB (15.27%), moderate FB (37.42%), and strongly negative MB (−64.33%).
Overall, the decomposition reveals complementary behaviors. IMERG reduces the fraction of missed rainfall (less-negative MB) across the network but tends to overestimate on coincident wet days (higher HB); and, on average, it incurs slightly higher FB than CHIRPS. CHIRPS, in turn, produces fewer product-only wet days (lower FB) and lower HB, yet misses a larger share of gauge-observed rainfall (more-negative MB), particularly in the northwest. These station-wise contrasts are consistent with the categorical findings and help explain the product-specific tendencies reported elsewhere in
Section 3 (
Table 4).
3.4. Monthly Precipitation Totals (Continuous Validation)
At the monthly scale, interpreted here as calendar aggregates for climatological/volumetric evaluation, IMERG generally shows tighter agreement with gauges than CHIRPS, with higher index of agreement (
d) and, on average, lower or comparable MAE/RMSE (
Table 5). Bias indicates opposing tendencies: IMERG overestimates monthly totals by ~15% on average (bias ≈ 1.15), whereas CHIRPS underestimates by ~15% (bias ≈ 0.85).
Spatially, the largest monthly discrepancies for both products occur at the northwestern station, Chiquiurco (elevated MAE/RMSE). In the north sector, IMERG attains the highest d across the network at Quisapincha (≈0.65), while CHIRPS reaches its best d at Ambato Airport (≈0.55). An outlier appears at the eastern station, P.F. Cevallos, where both products show positive bias (IMERG ≈ 1.72, CHIRPS ≈ 1.79).
The gauge–satellite scatter plots (
Figure 2) corroborate these patterns: IMERG (blue) points cluster more tightly around the 1:1 line (red dashed line), whereas CHIRPS (black) points more often fall below 1:1, reflecting underestimation; Ambato Airport is a notable exception where CHIRPS aligns relatively well with observations.
Notably, IMERG showed stronger concordance with observations at the Quisapincha, Calamaca, and Mula Corral stations, which aligns with the results from the continuous validation metrics (
Table 4). Conversely, CHIRPS tends to underestimate monthly precipitation values when compared to gauge data, as evidenced by most scatter points lying below the 1:1 line. An exception to this pattern is observed at the Ambato Airport station, where CHIRPS demonstrates relatively better agreement, consistent with the continuous validation results: bias = 0.93 and
d = 0.55.
Monthly climatology data (2014–2023;
Figure 3) indicate that both products reproduce the seasonal cycle only at two stations—Calamaca (central west) and Quisapincha (north)—and show limited skill at the remaining four gauges. Consistent with the metrics, IMERG tends to lie above the observed monthly totals, while CHIRPS tends to lie below them.
3.5. Interannual Spatiotemporal Precipitation Patterns
To investigate the spatiotemporal behavior of rainfall across the study area, annual mean precipitation was calculated for both satellite-based products: IMERG and CHIRPS (
Figure 4). IMERG estimates ranged from 788 mm to 1577 mm per year (
Figure 4a), while CHIRPS estimates varied between 333 mm and 1187 mm annually (
Figure 4c).
The coefficient of variation (CV), used to assess interannual variability, further revealed spatial differences between the two products. For IMERG, the highest CV reached 25% (
Figure 4b), mainly concentrated in the northeastern sector of the Ambato River basin. The rest of the basin exhibited relatively low CV values, suggesting stable annual precipitation over the study period. In the case of CHIRPS, the maximum CV was slightly lower at 21% (
Figure 4d), located to the northwest of the catchment, outside the basin boundaries. Despite these differences, both products revealed similar spatial patterns of interannual variability within the core area of the basin.
Within the study area, three distinct rainfall regimes can be observed. The first corresponds to the inter-Andean corridor spanning the northeastern to southeastern sectors of the catchment. In this corridor, IMERG estimates show a gradual increase in annual precipitation from around 800 mm in the northeast to 1000 mm in the southeast of the catchment. CHIRPS, in contrast, reports lower values ranging from approximately 500 mm to 900 mm.
The second region encompasses the eastern slopes of the Western Andes in the northwest part of the basin, an area of higher elevations with páramo ecosystems. IMERG indicates mean annual precipitation of around 1000 mm, whereas the CHIRPS estimates are significantly lower, near 600 mm.
The third region lies in the southwest sector along the eastern flank of the Western Andes. This area overlaps substantially with the Chimborazo and Carihuairazo volcanoes. This is the largest inter-product discrepancy in the basin (
Figure 4e), with IMERG exceeding CHIRPS by ~900 mm.
These spatial contrasts further support the patterns revealed in the validation analyses, highlighting the consistent overestimation by IMERG and the underestimation by CHIRPS across various sectors of the basin.
4. Discussion
Satellite precipitation products such as IMERG and CHIRPS have become indispensable tools for hydrological analysis in regions with sparse ground-based data, particularly in mountainous and forested catchments. However, their performance varies depending on topographic complexity, data processing algorithms, and the temporal resolution of evaluation. For instance, in a study conducted across Bolivia’s diverse terrain, IMERG showed superior skill in detecting daily rainfall (POD > 0.5) compared to CHIRPS, especially in high-altitude areas, but suffered from a high false alarm ratio and rainfall overestimation of up to 75% in the highlands [
19]. Similarly, in the Madre de Dios basin in Peru, Salas-Choquehuanca et al. [
20] reported that IMERG_F v6 data presented notable deviations from observed values, with RMSEs averaging 146 mm and Nash–Sutcliffe Efficiency (NSE) values often below acceptable thresholds (< 0.65). Although the Pearson correlation coefficients (r > 0.8) suggested a generally consistent trend, systematic over- and underestimations persisted across the stations. These results collectively emphasize that, while IMERG can capture general rainfall patterns, it tends to overestimate intensities and may underperform at finer scales or in specific microclimates.
In the Ambato River catchment, characterized by steep topography, the categorical validation metrics (POD, FAR, FBI, HSS) confirmed the superior performance of IMERG over CHIRPS in detecting daily rainfall. IMERG achieved higher POD and HSS values, reflecting its greater ability to correctly identify precipitation occurrences, while both products showed similar FAR values. This aligns with findings from Benítez et al. [
18], who highlighted IMERG’s capability to detect rainy days (POD > 0.7) and its high agreement with observed precipitation in the wetter regions of southeastern South America. However, as also observed in Bolivia [
19], our analysis revealed that IMERG exhibited a higher False Bias Index (FBI), suggesting the presence of false rainfall detections not supported by gauge observations.
In terms of continuous validation at a daily scale, both products produced comparable results for MAE and RMSE. Nevertheless, IMERG showed a slight advantage in MAE and presented a lower bias index, suggesting less systematic deviation from observed rainfall. Interestingly, IMERG tended to overestimate precipitation, whereas CHIRPS consistently underestimated it—a pattern also reported by [
19,
20]. At the monthly scale, this behavior became more pronounced, with IMERG consistently providing higher rainfall totals. The scatterplot analysis of the monthly estimates confirmed this trend, as IMERG’s predictions clustered more closely around the 1:1 line, whereas CHIRPS values tended to fall below it, indicative of underestimation. One factor that may have influenced the monthly validation results is the selection of only those months with complete datasets, whereas daily validation included time series that may have come from months with missing data.
In this context, the findings of López-Bermeo et al. [
35] offer relevant evidence, as they conducted a robust validation of CHIRPS using 75 rain gauge stations across the diverse topography of Antioquia, Colombia. Their results revealed that CHIRPS generally performs well at annual and interannual scales, particularly in Andean regions, but tends to overestimate precipitation in most stations and underestimates in warmer, lowland subregions. These biases become more pronounced at the daily scale, where CHIRPS exhibited limited accuracy. This supports our current analysis, which also shows that, while CHIRPS captures the general precipitation patterns, its reliability varies by subregion and time scale, and local overestimation or underestimation must be accounted for when interpreting results against station data.
Quispe et al. [
36] assessed the performance of GPM IMERG products (early, late, and final versions) across the Lake Titicaca Basin and found that accuracy improved with coarser temporal scales, with the monthly IMERG-F (final) product consistently outperforming its early and late versions. This finding aligns with our results, in which IMERG showed stronger agreement with rain gauge data, particularly in monthly accumulations.
The bias decomposition analysis further clarified the strengths and limitations of each product. CHIRPS exhibited better performance in terms of Hit Bias (HB), meaning that, on average, its estimations more closely matched the number of actual rainy days. However, IMERG had a better Miss Bias (MB), with fewer undetected precipitation events compared to CHIRPS. Regarding False Bias (FB), CHIRPS again outperformed IMERG, supporting previous findings that IMERG’s overestimation is primarily driven by spurious rainfall detections, particularly in regions of complex terrain and variable surface emissivity [
19].
Spatial analysis revealed further discrepancies. CHIRPS produced lower annual average precipitation estimates than IMERG, especially in the southwestern region of the basin, where CHIRPS suggested values below 400 mm, while IMERG exceeded 1300 mm. This contrast is critical, as prior studies by Hunink et al. [
27] suggest that annual rainfall in this area should be closer to 1000 mm. Therefore, while IMERG may overestimate rainfall amounts, its estimates may still offer a closer approximation to reality in certain high-precipitation subzones.
The implications of these findings are particularly relevant for hydrological modeling, flood forecasting, and water resource planning in the Ambato River basin and similar Andean catchments. The tendency of CHIRPS to underestimate precipitation may lead to underestimation of runoff and water availability in rainfall-runoff models, potentially compromising water allocation and drought risk assessments. Conversely, the overestimation by IMERG—especially in the highland zones—could inflate simulated streamflow values, influencing flood hazard assessments and infrastructure design. Given these trade-offs, the selection of an appropriate product should be context-specific: CHIRPS may be more suitable for long-term climatological analyses and drought studies, where minimizing false rainfall is crucial, while IMERG might be better suited for applications requiring accurate detection of rainfall occurrence, such as early warning systems and daily hydrological modeling. Integrating bias-correction techniques or combining both products using ensemble or fusion methods may further enhance their applicability in operational settings.
Nonetheless, these results should be interpreted considering certain limitations that may affect their generalizability and accuracy. Although six automatic rain gauge stations were used, their spatial distribution may be insufficient for fully representing the heterogeneity of rainfall patterns in such a topographically complex Andean basin. This limited density may reduce the precision of ground-based validation, especially in areas with strong orographic influence. In addition, although a strict quality control process was applied to ensure the reliability of the data, missing records from some stations were not gap-filled. This decision was taken to preserve the integrity of the validation analysis, avoiding artificial bias. While this approach favors methodological rigor, it may also restrict the completeness of the comparison with satellite estimates. Despite these constraints, the study offers valuable insight for hydrometeorological applications in data-scarce regions, where ground-based networks are sparse and satellite precipitation products serve as essential tools. Future research could further enhance validation robustness by integrating complementary data sources and expanding the observational network.
The present comparative assessment of CHIRPS and IMERG reveals that, while both products exhibit valuable capabilities, they also present significant limitations that must be accounted for in hydrometeorological applications. IMERG outperforms CHIRPS in rainfall detection and correlation with observed data, but suffers from higher false detections and systematic overestimation in certain zones. CHIRPS provides more conservative and spatially coherent estimates, though it underrepresents precipitation in high-rainfall regions. These findings reinforce the importance of regional validation of SPEs and support the use of multi-source strategies to improve rainfall estimation accuracy in topographically complex and data-scarce Andean environments.
5. Conclusions
This study provides a comprehensive evaluation of the performance of CHIRPS and IMERG satellite precipitation products against rain gauge observations in the Ambato River catchment, a mountainous Andean basin characterized by steep topography and variable climate conditions. The findings demonstrate that both products capture general precipitation patterns but differ markedly in their accuracy, bias structure, and suitability depending on the temporal scale and intended hydrometeorological application.
IMERG outperformed CHIRPS in rainfall daily rainy-day detection (POD = 0.44 vs. 0.24), and monthly Mean Absolute Error (MAE = 38.12 vs. 42.03), making it more reliable for rainfall-driven hydrological applications. These results suggest that IMERG is more reliable for daily hydrological modeling, flood forecasting, and early warning systems, especially when accurate detection of rainfall occurrence and intensity is required. However, its tendency to overestimate rainfall and produce false alarms (FAR = 0.44) may compromise accuracy in high-resolution or risk-sensitive applications.
Conversely, at a daily time scale, CHIRPS showed better performance in Hit Bias (HB) and False Bias (FB), along with more conservative rainfall estimates, making it potentially more suitable for long-term climatological assessments or drought monitoring, where underestimations are more acceptable than false detections. Its stability under dry conditions further supports this use case.
These results highlight the importance of regional validation before operational use of satellite precipitation estimates. For hydrological modeling, drought assessment, or flood forecasting in the Andes, product selection should be context-specific. Future research should explore bias-correction techniques and multi-sensor integration to improve rainfall representation in complex terrains. Ultimately, while satellite products are indispensable in data-scarce regions, their limitations must be explicitly accounted for to ensure accurate and reliable hydrometeorological analyses.