1. Introduction
The effects of climate change on future precipitation remain uncertain [
1]. However, climate model predictions simulate yearly decreases in semi-arid regions of the Mediterranean [
2]. Mountain areas are particularly vulnerable, where the cryosphere is directly affected by global warming, which consequently leads to altered seasonal runoff patterns [
3]. Thus, hydrological cycles will gradually shift from being dominated by snow and ice to being determined by rainfall [
4]. Accurate precipitation measurements at different spatial and temporal scales are of great significance for validating numerical weather and climate models, managing water resources and predicting natural disasters.
However, quantitative estimates of precipitation often have significant uncertainty [
5]. Rain gauges, which are the world’s most common method of obtaining accurate and reliable measurements at high temporal resolutions, provide point-scale measurements. This makes them unable to fully capture the spatial variability of the precipitation or to capture extreme local events in many areas wherein the instrument density is low. Ground-based radar-derived estimates are another feasible method, but due to their poor global coverage, the effects of terrain blockage [
6] and the difficulties associated with estimating mixed and solid phase precipitation [
5,
7], there are limitations to obtain reliable estimates. Thus, satellite precipitation estimates (SPEs) offer an excellent way to compensate for some of these limitations and, although they have their own shortcomings, can be considered a complement to other methods [
8].
Based on the success of the previous Tropical Rainfall Measurement Mission (TRMM), the Global Precipitation Measurement Mission (GPM) core satellite plus a constellation of satellites from partner countries provide one of the most accurate and fine-grained spatio-temporal resources for global precipitation measurements [
9]. GPM has advanced sensors such as the GPM dual frequency precipitation radar (DPR) and microwave imager (GMI), which quantify precipitation more accurately and detect light and solid precipitation [
10]. The associated processing, Integrated Multisatellite Retrievals for GPM (IMERG), incorporates, fuses and intercalibrates several infrared, microwave (MW) and gauge observations to provide precipitation estimates at relatively high spatial (0.1° × 0.1°) and temporal (30 min) resolutions [
9].
Since the launch of the GPM (February 2014), the chronology of publications evaluating the performance of the IMERG reflects a growing trend of research interest in the subject [
11]. Most of the works that take a country or region of study stratify the results of the validation process according to different time scales [
12,
13,
14,
15,
16,
17,
18], topographic features [
15,
19,
20,
21,
22,
23,
24], climatic conditions [
23,
25,
26,
27] and in terms of precipitation intensity [
19,
20,
28,
29,
30,
31,
32,
33]. In this way, a more specific description of IMERG behaviour under different conditions is obtained, leading to the choice of a more suitable use for its application.
Several investigations with this approach have been developed in Mediterranean countries. In Greece, Kazamias et al. [
34] explored the performance of IMERG Final across the country at daily, seasonal and annual scales, in different elevation zones and rainfall intensities. Caracciolo et al. [
35] studied the influence of morphology and land–sea transition on the reliability of IMERG Final at hourly and daily scales, while Chiaravalloti et al. [
36] evaluated and compared the IMERG Early, Late and Final products over complex terrain in southeastern Italy. Tapiador et al. [
37] introduced for the first time the results of a validation in Spain based on a comparison with a high-resolution grid of daily precipitation derived from the records of approximately 2300 rain gauges covering the Iberian Peninsula and the Balearic Islands. The study at annual, seasonal and daily resolutions also analysed the spatial structure of precipitation and considered different precipitation thresholds for the three IMERG products. Similarly, Navarro et al. [
38] validated the IMERG at the south of the Pyrenees and the Ebro valley according to four parameters: altitude, climate type, seasonality and quality of surface observations. Finally, Tapiador et al. [
39] selected the IMERG Late product to evaluate the consistency of ground observations and satellite data during the Storm ‘Filomena’ in January 2021. Pradhan et al. [
11] recently reviewed validation studies of IMERG and identified the most common limitations in this type of work, offering some suggestions to solve them. An important pending issue is the evaluation of IMERG products at multiple time scales, including sub-daily periods, to understand the errors associated with temporal aggregation. Further analysis in mountainous regions, over different climatic regimes, geographical conditions and assessing the effect of rainfall intensity on their accuracy still require the attention of the scientific community.
Based on these research gaps, the objectives of this work are twofold: (1) To evaluate the precipitation estimates obtained from the three IMERG runs (Early, Late and Final) at different time scales (half-hourly, hourly, daily, monthly, seasonal and annual) simultaneously taking as reference the automatic stations of the Meteorological Service of Catalonia and (2) To validate the IMERG estimates at the highest temporal resolution (30 min) according to different orographic features (valley, flat, ridgetop), different climatic conditions (BSk, Csa, Cf, Df) (see
Appendix B) and according to different precipitation intensity thresholds (light, moderate, heavy, very heavy, torrential). The study considers the period from 2015 to 2020, so that full calendar years of the GPM core satellite data are employed. We focus on the region of Catalonia, northeast of the Iberian Peninsula, being one of the first studies to evaluate the behaviour of IMERG at a sub-daily temporal resolution in this region. This complements the previous studies done in IP and other areas with complex orography.
Section 2.1 and
Section 2.2 provide a description of the study area and details of the methodology, data and assessment metrics employed.
Section 3.1 and
Section 3.2 compare the rain gauge observations and estimates of the three IMERG products at different time scales simultaneously.
Section 3.4 and
Section 3.5 focus on the semi-hourly scale, considering different orographic and climatic conditions as well as different precipitation intensity thresholds, respectively. The most significant results are discussed in
Section 4, and finally a summary with the most relevant aspects is given in
Section 5.
3. Results
3.1. Mean Annual Precipitation 2015–2020
A comparison of mean annual precipitation amounts was made between IMERG products and XEMA data.
Figure 4 shows the spatial distribution of the mean annual precipitation of IMERG products compared with rain gauge recorded from 2015 to 2020. In addition, the probability of occurrence of annual precipitation and the kernel density estimation (KDE) curve associated with the distribution of each dataset are plotted (lower panel).
According to rain gauge data, the average annual rainfall in Catalonia during this period varies between 300 mm and 1600 mm/year. The lowest records are observed in the Central Depression, where they do not exceed 450 mm/year, followed by the coastal areas with values around 600 mm/year. In contrast, stations close to the Pyrenees have accumulations usually exceeding 900 mm/year and those located above 2000 masl typically are over 1600 mm/year, which represents the maximum values for the region under study, and also some of the highest of the Iberian Peninsula. This high spatial variability is consistent with previous precipitation climatologies [
61,
62,
63] in the studied region, which guarantees the representativeness of the selected sample.
The comparative analysis between the products shows a very similar performance between IMERG_E and IMERG_L, while in IMERG_F there is evidence of the unbiased effect thanks to the calibration with GPCC rainfall. It is also worth noting that the three IMERG products broadly reproduce the spatial rainfall pattern in the region, characterized by a marked latitudinal gradient that decreases from north to south. However, there are discrepancies in magnitude that are substantial. IMERG_E and IMERG_L overestimate precipitation by over 20% in almost all the territory with biases of 160 and 140 mm/year, respectively. This overestimation is notable in the areas of the Central Depression, characterized by a dry continental climate with low pluviometric values. Similar results were detected by Kazamias et al. [
34], wherein the
IMERG_unCal show the largest discrepancies in the areas of Greece with low annual accumulations. Similarly, Navarro et al. [
38] also found a general overestimation of precipitation over the Ebro Delta river, and Tapiador et al. [
37] reported an underestimation in the Pyrenees mountain massif.
Although the tendency of IMERG_E and IMERG_L to overestimate is shown in the same way at the pre-coastal, coastal and Ebro basin areas, the correction carried out in IMERG_F is effective and generally reflects annual mean values very similar to the rain gauge records (
Figure 4, bottom panel). However, IMERG_F generally reduces and smooths the precipitation field over the Pyrenees and some high-altitude stations show an increased bias exceeding 600 mm/year.
3.2. Continuous Verification Scores for Different Time Scales
Table 4 shows a summary of various statistics calculated at half-hourly, hourly, daily, monthly and annual scales considering all valid records between 2015 and 2020. The
Bias,
MAE and
RMSE are standardized to the mean of the observations at the different time scales, which allows for comparisons to be made between them.
In terms of Rbias, IMERG_E and IMERG_L present an overestimation of precipitation close to 20% at all time scales, except at the seasonal and yearly levels. In contrast, this behaviour only occurs in IMERG_F at monthly and annual scales, although it does not exceed 2%. At daily and sub-daily scales, IMERG_F slightly underestimates precipitation relative to observations, with values ranging between −0.05 mm/h and −0.12 mm/day, which is relatively small compared to the mean of the observations at these scales (2.16 mm and 8.56 mm, respectively).
The analysis of the average error (
Bias) reflects a significant improvement in the IMERG_F at all scales, although much more appreciable at the monthly and annual scales. At the latter, the
Bias decreases to 9.65 mm compared to the 159.22 mm recorded by IMERG_E, which means a reduction of the error close to 90%. At the monthly scale, the error value decreases by about 16 times compared to the Early and Late products. While this significant error reduction could represent a good indicator of the improvement in precipitation estimation with IMERG_F, we must consider the limitations of this statistic and its relationship with the possible cancellation of positive and negative errors [
64] between IMERG_F and ground-based observations.
As expected, as the temporal resolution decreases, there is a decrease in the
MAE and the normalised
RMSE regarding the mean for all products, with few differences at sub-daily scales. This behaviour is most evident in IMERG_F, in which the
MAE decreases from 0.87 mm at 30 min to 0.22 mm at the annual scale, and the
RMSE decreases from 1.73 mm to 0.31 mm. This improvement with a lower scale can be seen in the Taylor plot shown in
Figure 5a, which displays the
STD,
CC and centred
RMSE statistics normalised to the standard deviation of the three products for all temporal resolutions. A clear improvement in IMERG_F is observed at the monthly and annual scales, with values close to the benchmark (correlation and standard deviation equal to 1). The worst results are shown at the sub-daily scales with low correlation values and in the Early and Late products, with standard deviations higher than the benchmark unit. These differences between IMERG_F and the rest of the products, which grow with the increasing scale, highlight a gradual improvement as more information is integrated into the algorithm. Finally, while this product is expected to provide the most reliable estimates for research [
47], the other two products can also be used for related to low latency applications [
65].
In the seasonal analysis (
Figure 5b), IMERG_E and IMERG_L overestimate the precipitation values substantially. These errors are more noticeable during the summer period with a systematic error of over 97 mm and
MAE and
RMSE values around 105 mm and 143 mm, respectively. Interestingly, in this period, IMERG_F introduces significant improvements that reduce the overestimation to less than 10%, but it is still the season of the year wherein the worst results are obtained. Precipitation in the summer months is low throughout the Iberian Peninsula and Catalonia, but local storms with convective development usually occur, wherein the amount of precipitation fallen is not adequately captured by IMERG.
The values of the errors in autumn, although lower than in summer, also show overestimates of precipitation in all products and MAE and RMSE values, which, even with the unbiasing of the Final product, remain relatively high (MAE equal to 52.09 mm and RMSE equal to 70.80 mm). A similar behaviour is observed in the rest of the seasons of the year, although the RMSE values practically double the MAE values, which may be caused by the occurrence of extreme phenomena and bring into play the sensitivity of this statistic in such records.
IMERG_F reproduces the annual cycle of precipitation relatively accurately, identifying the spring and autumn months as those that make the overall greatest contribution to the annual cycle amount, while the winter and summer months show the lowest accumulations with very few differences between them. On the other hand, IMERG_E and IMERG_L represent the summer period as the second highest contribution with an average of approximately 215 mm, higher than that recorded in the observations (118 mm), which is consistent with the overestimation made by these products during this period.
The correlation coefficient calculated at the different time scales showed statistical significance at 95% of confidence in all cases. In
Figure 5a, at sub-daily scales, similar correlation values are shown among all products, and although a slight improvement appears in IMERG_F, it does not exceed 0.37. IMERG_E and IMERG_L at scales higher than daily show moderate linear correlations close to 0.6, and it is IMERG_F that represents high correlations, higher than 0.8. Similarly, there is a decrease in the standard deviation, closer to the reference point (
STD = 1), as a result of the unbiasing to which it is subjected. The performance shown demonstrates that this product would be the most suitable for the analysis of precipitation at seasonal and annual scales.
3.3. Categorical Verification Scores for Different Time Scales
Figure 6 shows a summary of the contingency table verification score at the different time scales. For each dataset shown, a threshold greater than or equal to the mean of the observations recorded at each time step is applied.
As shown in
Figure 6, IMERG_F has a higher ability to detect correct negatives with values close to 50% at all scales, although IMERG_L and IMERG_E are also very similar at sub-daily and daily scales. The percentage of hits tends to increase at scales higher than daily, while the percentage of misses decreases. According to the selected thresholds, the ability of IMERG to estimate precipitation is affected by the detected false alarms. These represent the highest percentage during the summer period in IMERG_E and IMERG_L.
Figure 7 provides the performance of the
POD, and the
FAR values at different time scales for different precipitation thresholds. The error associated with the calculation of the statistic at each point, as outlined by Jolliffe and Stephenson [
64], is also shown. The figure shows a clear improvement of IMERG estimates as time scale increases. At a half-hourly scale, the ability of IMERG to estimate events at different thresholds is remarkably poor.
If we consider the cut-off point between the POD and FAR line as a limit from which the satellite shows some decay in its event detection ability, we observe that it increases considerably with decreasing temporal resolution. At daily scales, this cut-off point occurs at around 20 mm/day, a value well above the 75th percentile of the sample, which indicates a much better performance compared to the estimation of events at the half-hourly scale, where a value slightly lower than the mean of the recorded observations at this scale is observed (1.4 mm/30 min).
Similarly, for the analysis at monthly and annual scales with cut-off points above 100 mm and 1000 mm, respectively, it ensures the correct identification of rainfall events up to this threshold. As higher thresholds of rainfall in the domain are assessed, the ability of the satellite decreases, highlighting the difficulty of IMERG to detect rainfall extremes at any scale.
Although the three products behave similarly, IMERG_F gives worse results, especially on scales above the monthly scale, wherein the
POD values decrease faster than in the rest of the products, which is associated with the unbiasing effect induced by the calibration of GPCC stations. These results are consistent with Shawky et al. [
66], which found no significant improvement of IMERG_F over IMERG_E in the arid environment of Oman. This result is in line with Sharifi et al. [
16], Behrangi et al. [
67] and Gosset et al. [
68] when positing that the gauge adjustment product (IMERG_F) can change the precipitation amounts, but it cannot modify the occurrence of precipitation.
3.4. Half-Hourly IMERG Products for Different Terrain and Climate Conditions
This section will test the abilities and shortcomings of the three IMERG products at a high temporal resolution (30 min). In addition, differences in the estimation of precipitation by satellite products will be analysed when considering the terrain over which they are estimated and under different climatic conditions.
Figure 8 shows the differences over each station between IMERG and the rainfall records of the XEMA network. In valley areas, the analysis of the systematic error shows a marked underestimation of precipitation in IMERG_F, with mean values of −0.15 mm/30 min, which represents an underestimation of 10% regarding the rain gauges. IMERG_L and IMERG_E show a tendency to overestimate the accumulated values and show
MAE and
RMSE values even higher than 100% relative to the mean.
There is a more marked tendency in the behaviour of IMERG in areas representing ridgetops. While IMERG_E and IMERG_L overestimate precipitation, and this could be verified in all time scales, the effect of the calibration incorporated in IMERG_F causes a significant smoothing, such that the
Rbias reaches critical values lower than −30% sometimes, as in the Bonaigua station (Z1) (
Figure 2) at 2266 masl This marked underestimation and change in behaviour from one product to another is probably related to the low density of GPCC reference stations in high altitude areas for calibration. The
CC shows a pattern in all three products with poor values, barely exceeding 0.3.
The largest errors occur in the stations in flat areas (Flat) with an average bias higher than 0.4 mm/30 min and
Rbias values higher than 30% in IMERG_E and IMERG_L (
Figure 8). Although IMERG_F significantly decreases the error, the tendency to overestimate the values is still maintained, and under this terrain classification the highest
MAE and
RMSE values regarding the mean are obtained (higher than 100% and 200%, respectively). In these areas, 43% of the automatic stations are located and analysed, which corresponds to the entire central inland part of the region of Catalonia, the coastal strip, the Ebro basin and the north-western part of the territory. This plays a significant role in the global results regardless of terrain classification.
Figure 9 evaluates the
Rbias of the three IMERG products under different climatic classifications. For example, IMERG_E presents a large overestimation over the BSk stations and IMERG_F shows a high underestimation over the Df stations. Overall, a clear improvement in bias reduction is found for BSk and Csa stations when IMERG_F is compared to IMERG_E. The improvement is not so evident for the Cf stations, and in contrast, there is a clear bias increase for the Df stations. These results obtained for 30 min records coincide with previous studies by Navarro et al. [
38] in the Ebro basin for seasonal and annual scales.
3.5. Intensity
Table 5 shows a summary of the statistics obtained in the validation process of three IMERG products, considering five categories of rainfall intensity recorded in 30 min. The categories of light, moderate, intense, very intense and torrential rain were scaled from a previous classification of rainfall intensity in 1 h, according to sources from the Spanish Meteorological Agency (AEMET) [
59].
The results obtained show substantial overestimation discrepancies for all rainfall intensity categories and in all IMERG products. Light rainfall, represented by the highest number of records, is overestimated by twice as much
Mbias by IMERG_F and nearly three times as much by the rest of the products. This implies a relative error rate (
Rbias) higher than 100% in all cases and a systematic error significantly higher than the mean of the observations. The best performance based on the
MAE and
RMSE is obtained by IMERG_F, although they are still quite high compared to the average of the studied records. Such indicators of overestimation in this category have been reported in previous studies [
23,
27].
On the contrary, at precipitation thresholds above 1 mm/30 min (moderate, heavy, very heavy and torrential), IMERG shows a tendency to underestimate precipitation, which becomes more significant as the intensity of precipitation increases (
Figure 10). For the classes heavy, very heavy and torrential, the satellite shows errors ranging between −60% and −90% of the deficit in relation to the rain gauges. The systematic errors in these groups are similar in magnitude to the mean absolute errors and to the mean of the values recorded by the stations in each of the corresponding thresholds, which register a more realistic, significant underestimation.
Among the three products, IMERG_F provides the worst results, while IMERG_L presents the best values, although these differences are not marked. These results are in agreement with studies by Mazzoglio et al. [
69] and show the challenge of detecting precipitation extremes at this resolution. Many of these extremes occur in the form of short and local intense rainfall, so they cannot be correctly captured due to the spatial and temporal resolution of satellite sensors. Precipitation at the daily and sub-daily scales is much more variable than monthly precipitation, and regional effects such as topography and local circulation play an important role in rainfall occurrence and distribution [
16].
4. Discussion
In line with previous work, IMERG roughly reproduces the spatial pattern and temporal variability of rainfall in the region of study [
17,
24,
70]. However, there are differences in the magnitude estimation for the different run types: Early, Late and Final. While there is a tendency to overestimate the accumulations in the Early and Late products across the whole territory, IMERG_F reduces the errors and shows a better ability to estimate the amount of precipitation at all time scales, with higher accuracy at monthly, seasonal and annual scales. However, for this product, there is again a tendency to underestimate in areas with complex topography, i.e., high mountain areas such as the Pyrenees. This result is reasonable and has been reported in other high mountain areas [
17]. Navarro et al. [
38] and Tapiador et al. [
37] suggested that this may be due to the lack of rain gauges contributing to the GPCC in high altitude areas, as well as to the low resolution of the GPCC grid (1° × 1°), which makes detection difficult in areas wherein precipitation is highly variable at small scales. Finally, Navarro et al. [
38] also mentioned the reduced detection capacity of IMERG in the identification of convective orographic rainfall, mainly related to mesoscale factors.
At the seasonal scale, a similar underestimation is observed in the Final product at all temporal scales. However, in the Early and Late products, significant errors appear during the summer with a tendency to overestimate the cumulates producing high
MAE and
RMSE errors. This differs from the studies of Moazam and Najafi, [
13] and Navarro et al. [
71], wherein the worst results were obtained mainly during winter, when the ground surface is covered with snow and ice [
24]. However, our results are in line with Retalis et al. [
15], in which the best results were obtained in the rainy seasons (winter and autumn). In semi-arid areas, the summer period is represented by low precipitation values, which makes detection by satellite sensors difficult [
17]. Another important issue to consider is that precipitation can be affected by a high rate of evaporation, where some of the liquid water evaporates during the fall process and is no longer part of the effective precipitation [
26,
31,
72], a virga being the extreme case wherein no precipitation reaches the ground. This phenomenon, coupled with the fact that satellite retrievals of precipitation are based on the structure of cloud systems [
73] and may not adequately account for the level of evaporation, may lead to the overestimation of precipitation in arid regions.
The effect of not accounting for evaporation in semi-arid areas further explains that, in terms of precipitation event detection, the error in IMERG is dominated by the occurrence of false alarms, especially in summer. In the occurrence of typical deep convective clouds with relatively cold cloud tops (anvils) and, with the absence of PMW measurements, the IR algorithm may falsely assign precipitation to pixels with cold brightness temperature values [
74]. Furthermore, in terms of IMERG’s ability to detect events given a continuous threshold of cumulates, no significant improvement of one product over the other is observed. In fact, IMERG_E and IMERG_L offer better performance as the thresholds grow with more stable
POD and
FAR values and lower uncertainty in the statistics. This is related to the inability of IMERG_F to detect extremes, similarly associated to the calibration of GPCC.
Few studies include the validation of IMERG at the highest temporal resolution (30 min). Even so, the authors of [
26,
75] agree on the decreased estimation capability of the three products with increasing temporal resolution. The repetition time of the GPM and the downscaling and interpolation procedures to 30 min [
76] are some of the main causes of the errors obtained. At this scale, the largest errors occur in flat areas, which coincides with the BSk climate, with a tendency to overestimate. The authors of [
38] found that in these areas, IMERG tended to overestimate precipitation equally. These regions, mainly represented in our study by inland depressions (Ebro valley) and coastal areas, are affected by extreme precipitation events occurring at local scales. Orographic factors and mesoscale conditions generate an uneven distribution of precipitation over the territory, resulting in a very spatially uncorrelated precipitation field [
37] and therefore an added challenge for satellite estimates.
Finally, the overestimation of lightprecipitation associated with the detection of false alarms and the underestimation of precipitation extremes reflects a similar behaviour to that found in the Tibetan Plateau [
27]. Along these lines, it is important to be aware of the limitations of the assessment procedure, which may influence the accuracy of the results. Firstly, it is worth mentioning that the rain gauge data used were not corrected for the effect of wind, so the measurements may suffer from systematic biases caused by wind-induced evaporation loss and the underestimation of trace values [
24]. On the other hand, in terms of the pixel-to-point method, although it has advantages over other methods [
70], it is very difficult for a (point-scale) rain gauge to represent the actual precipitation situation in an IMERG pixel-scale range. These inherent differences between the rain gauge estimate and the precipitation in the satellite area can directly influence the high values of false alarms, as well as the detection of extreme precipitation events occurring at the local scale. Especially in a region like Catalonia, characterised by its orographic complexity and climatic variability, more rain gauges per IMERG cell may provide better results.
5. Conclusions
The main purpose of the current study focused on a comprehensive evaluation of IMERG precipitation estimates in its three Early, Late and Final runs based on information from 186 automatic weather stations, managed by the Meteorological Service of Catalonia (NE Spain). The evaluation was carried out at different time scales (semi-hourly, hourly, daily, monthly, seasonal and annual) over a period of 6 years (2015–2020), based on the analysis of several metrics that quantify the error in precipitation accumulations. Similarly, the behaviour of IMERG was evaluated at a high resolution (30 min) under different topographic conditions (valley, flat, ridgetop), climatic conditions (BSk, Csa, Csb, Dfb) and under different precipitation intensity thresholds (light, moderate, heavy, very heavy, torrential). The main findings of the study are:
IMERG generally captures the spatial–temporal pattern and variability of annual mean precipitation. However, discrepancies appear in the estimation of the magnitude. While IMERG_E and IMERG_L overestimate precipitation by 20% in practically the whole territory, IMERG_F reduces the error significantly, yielding only 2%. The calibration performance in this run may even cause an underestimation of precipitation in areas of complex orography such as the Pyrenees.
The calculated statistics showed a significant improvement with decreasing temporal resolutions, with the monthly, seasonal and annual scales showing the best results in the estimation of precipitation accumulations. In contrast, the sub-daily scales showed high Bias values and very low correlation values, indicating the remaining challenge for satellite sensors to estimate precipitation at very high temporal resolutions. IMERG_F showed much better error statistics compared to IMERG_E and IMERG_L, wherein a generalised overestimation was evident and especially marked during the summer period.
Similarly, the analysis of the POD and FAR showed a greater ability of IMERG to identify precipitation events at scales greater than daily, wherein a stable behaviour of the statistics is observed well above the mean values, although with deficiencies in the identification of extreme events at all scales. The proportion of false alarms is a problem for IMERG especially during the summer, which is mainly associated with the detection of false precipitation in the form of lightrainfall (which is likely influenced by evaporation processes not assimilated by the algorithm), as well as the underestimation of locally occurring heavy precipitation.
The worst results were obtained on a semi-hourly scale represented by flat areas and under a BSk climate, wherein IMERG shows a tendency to overestimate rainfall.
IMERG tends to overestimate light precipitation, while it tends to underestimate accumulated precipitation in the rest of the intensity thresholds studied, especially those marked by high intensity precipitation. Associated with these errors is the fundamental role of taking rainfall gauges on a point scale that may not represent the spatial and temporal variability of rainfall in a region where this variable is spatially uncorrelated.
The evaluation of IMERG products presented here, although not the first one in Spain, is the first to address in detail the orographic and climatic factors at high temporal resolutions. Furthermore, we attempted to cover some of the most common weaknesses of this type of research by extending the analysis simultaneously to different temporal resolutions and by emphasising the analysis at high temporal resolutions. This study can be used by other researchers and developers involved in the IMERG algorithm to introduce improvements in future versions. Additionally, although with the limitation of latency, time observation and monitoring could be considered in operational work. For more applications based on the results presented here, and to try to answer some of the questions raised, in future work we intend to study in greater depth the capacity of IMERG to detect extreme events and to identify the specific behaviour of IMERG contributing sensors such as MW and IR.