Spatial-Temporal Assessment of Satellite-Based Rainfall Estimates in Di ﬀ erent Precipitation Regimes in Water-Scarce and Data-Sparse Regions

: Accurate precipitation measurement is very important for socio-hydrological resilience in the face of frequent extreme weather events such as cyclones. This study evaluates the performance of two satellite products: the Tropical Rainfall Measuring Mission (TRMM 3B43V7) Multi-satellite Precipitation Analysis (TMPA, hereafter: TRMM) and the Integrated Multi-satellite Retrievals for GPM (IMERG, Final Run V06, hereafter: GPM) in the Sultanate of Oman. Oman is an arid country that generally has few rainy days, but has experienced signiﬁcant ﬂash ﬂoods, tropical storms and cyclones recently, leading to the loss of lives and millions of dollars in damage. Accurate precipitation analysis is crucial in ﬂood monitoring, hydrologic modeling, and the estimation of the water balance of any basin, and the lack of a su ﬃ cient weather monitoring network is a barrier to accurate precipitation measurement. Satellite rainfall estimates can be a reliable option in sparse network areas, especially in arid and semi-arid countries. This study evaluated monthly rainfall (hereafter: OBSERVED) levels at 77 meteorological stations from January 2016 to December 2018. The capacity of the TRMM and GPM satellite products to detect monthly rainfall amounts at varying precipitation thresholds was also evaluated. Findings included (1) overall and across the 11 Governorates of Oman, both satellite products show di ﬀ erent spatial variability and performance to the OBSERVED at the monthly, seasonal, and annual temporal scales; (2) from the perspective of precipitation detection and frequency bias, GPM showed a similar performance to TRMM at detecting low precipitation (2 mm / month) but was poorer at detecting high precipitation ( > 30 mm / month) across the entire country as well as in the Northern, Interior, and Dhofar regions; (3) both products show similarities to the OBSERVED through the partitioning of their seasonal time series into a distinct number of homogenous segments; and (4) both products had di ﬃ culty reproducing OBSERVED levels in the Dhofar and Interior regions, which is consistent with studies conducted in mountainous and coastal regions. With the aim of reproducing the spatial and temporal structure of OBSERVED in a rugged terrain, the study shows that both satellite products can be used in areas of sparse rain gauges or as additional observation for studies of extreme weather events. Overall, this study suggests that for Oman, both satellite products can be used as proxies for OBSERVED with appropriate bias corrections and GPM is also a reliable replacement for TRMM as a precipitation satellite product. The ﬁndings will be useful to the country’s ﬂood resilience and mitigation e ﬀ orts, especially in areas where there is sparse rain gauge coverage.


Introduction
The Sultanate of Oman (henceforth "Oman") is an arid country with several mountainous and coastal areas. The country is prone to short-duration, high-intensity rainfall that often leads to flash floods and contributes to the loss of lives and millions of dollars in damage. The country's total area Xu [17], Zhang et al. [18], etc. While the objective of this research is not to provide a review, some studies have tested these two products in landscape areas similar to Oman. Lelis et al. [15] compared GPM and TRMM in the eastern region of Sao Paulo, Brazil, and reported similar performance between the two products. Rozante et al. [16] compared the products across the entirety of Brazil, reporting that both tended to overestimate precipitation compared to gauged precipitation. Xu et al. [17] evaluated the products in the Huang-Huai-Hai region of China, concluding that GPM was superior to TRMM (3B42V7) in that GPM has a better probability of precipitation detection for both trace and solid precipitation. Zhang et al. [18] reported greater bias and root mean square error (RSME) in estimates by GPM compared to TRMM and noted that GPM was limited in its ability to capture small-scale and isolated deep convective precipitation. Furthermore, the authors [18] reported that GPM underestimated precipitation by up to 16% at high elevations. Several of these studies were based on two methods of evaluation: continuous and categorical. In this study, the technique of change point (cp) analysis will also be used. This is a technique that can be used to compare similarities, abrupt changes, and aberrations in both gauged precipitation and satellite products [19]. The methods can be used in both non-parametric [20] or Bayesian domains [21]. This evaluation technique will help in looking for abrupt changes and aberrations and assist in comparing time series values at corresponding rain-gauge stations for both gauged precipitation and satellite products.
The primary objective of this research is to test the application of the satellite products in rugged terrain such as that found in Oman. Socio-hydrological resilience can be defined as the ability of any society to adapt to any abrupt changes in the face of both biophysical and hydrological changes [22]. Therefore, this study seeks to contribute to the effective management of water and precipitation extremes in an arid climate. To the authors' knowledge, no previous studies have compared these two products in Oman. This study intends to fill this gap. The study will also provide an update to previous studies using satellite products as a proxy for gauged observations in arid countries. In other words, the results from this study will provide users with the satellite products' accuracy and also act as feedback for the algorithm's developers. Figure 1 shows the position of Oman in the Arabian Peninsula. The country is located within 17.01-26.17 • N and 54.09-59.52 • E. Oman is arid and receives spatially variable annual rainfall that fluctuates based on each area's physiographic properties. The country's rugged landscape patterns include mountain ranges (Al Hajar Mountains and Qara Mountains), coastal plains (Salalah Plain and Al Batinah) and interior areas (located between the north and south) [23]. These landscapes play an important role in influencing the country's precipitation patterns. For instance, Kwarteng et al. [23] found a positive correlation (0.82) between annual rainfall and topography. Using 27 years of rainfall data, Kwarteng et al. [23] found that the highest level of rainfall occurred in Saiq, which has an elevation of 1950 m above sea level, while the lowest level occurred in the interior region, which has an elevation of about 300 m.

Study Region and Precipitation Patterns in the Three Sub-Divisions
Oman suffers from long dry spells, with an average annual rainfall of only 12 days [23]. This equates to an average of one wet day (rainy day) per month. Oman's precipitation patterns are heterogeneous and controlled by four factors: convective rainstorms, cold frontal troughs, tropical cyclones and on-shore southwesterly monsoon currents [23]. The convective rainstorms in Oman occur mostly during the summer months and are associated with localized convection, which can occur at any time of the year [23]. The cold frontal troughs mostly influence the rainfall pattern of the northern Oman Mountain, which originates in the North Atlantic and can bring an average annual rainfall (known as "seif") of up to 300-400 mm in the Al Jabal Al Khadar mountain region [23]. In contrast, the Dhofar plain/coastal region's rainfall pattern is controlled by the on-shore southwesterly monsoon season, which lasts from June to September and is characterized by humid conditions and frequent drizzle rainfall per year, monthly rainfall amounts arise solely from only a few data points. In other words, it is possible that similar conclusions would be reached when considering daily rather than monthly observations. Nevertheless, studies such as those by Adler et al. [27], Chiu et al. [28], Nicholson et al. [29], and Liu [30], have evaluated monthly satellite observations. Evaluating monthly datasets is important in understanding the timing, number and seasonal variability of precipitation events. These are crucial in hydrological applications [19] including estimating the monthly and annual water budgets for water resource management purposes such as irrigation and groundwater recharge, flood prediction design [31], understanding the number of rainy days for crop production, [32] and trend analysis of disaster management. Due to the low spatial density of publicly available rain gauge stations in Oman (Figure 2), only three subdivisions were evaluated in the current analysis. The influence of the number and spatial distribution of stations has been well documented elsewhere [33]. Precipitation amounts from weather stations within each region tend to be more homogeneous compared to those in other regions. Therefore, to illustrate the accuracy and reliability of the satellite products as proxies for gauged precipitation, the following subdivisions of Oman's governorates were considered: (1) Northern region: Al Buraymi, Muscat, Ad Dakhliyah, Al Batinah North and South, Al Dhahira, and Ash Shargiyah North and South Governorates (2) Interior region: Al Wusta Governorate Tropical cyclones affect Oman's coastal areas, including Muscat and the Dhofar governorates. These cyclones originate at the Arabian Sea and can occur from May-June or October-November [23]. A cyclone has been reported to occur at least once every three years in the Dhofar Governate and once every 10 years in the Muscat Governorate, giving rise to extremely severe storms [23]. Notably, the cyclone of 2007 (Cyclone "Gonu") reached up to 610 mm of rainfall and led to 50 deaths and approximately USD 4.2 billion in associated damages [24,25]. Similarly, the cyclone of 2018 (Cyclone "Mukenu") reached up to 617 mm of rainfall, led to seven deaths, and cost more than USD 1.5 billion [26].
Due to the absence of daily records in the NCSI's web-portal, the evaluations and assessments described in this study were performed monthly. However, given the limited number of days of rainfall per year, monthly rainfall amounts arise solely from only a few data points. In other words, it is possible that similar conclusions would be reached when considering daily rather than monthly observations. Nevertheless, studies such as those by Adler et al. [27], Chiu et al. [28], Nicholson et al. [29], and Liu [30], have evaluated monthly satellite observations. Evaluating monthly datasets is important in understanding the timing, number and seasonal variability of precipitation events. These are crucial in hydrological applications [19] including estimating the monthly and annual water budgets for water resource management purposes such as irrigation and groundwater recharge, flood prediction design [31], understanding the number of rainy days for crop production, [32] and trend analysis of disaster management.
Due to the low spatial density of publicly available rain gauge stations in Oman (Figure 2), only three subdivisions were evaluated in the current analysis. The influence of the number and spatial distribution of stations has been well documented elsewhere [33]. Precipitation amounts from weather stations within each region tend to be more homogeneous compared to those in other regions. Therefore, to illustrate the accuracy and reliability of the satellite products as proxies for gauged precipitation, the following subdivisions of Oman's governorates were considered:

Tropical Rainfall Measuring Mission (TRMM)
The Tropical Rainfall Measuring Mission (TRMM) Multi-satellite Precipitation Analysis (TMPA) research product (TMPA/3B43 Rainfall Estimate L3 1 month 0.25° × 0.25° V7) is a combination of satellite rainfall estimates and gauged precipitation. The TMPA (3B43V7) is created from the TRMM adjusted merged PMW, IR, and gauged precipitation from selected stations in the Global Precipitation Climatology Centre (GPCC) rain gauge analysis [13]. The algorithm producing this product is run monthly to produce a seamless best estimate precipitation rate (mm per h). This monthly TMPA (3B43V7) precipitation estimate (hereafter: "TRMM") was obtained from the National Aeronautics and Space Administration (NASA) Giovanni website (https://disc.gsfc.nasa.gov/datasets/TRMM_3B43_7/summary).

GPM Integrated Multi-Satellite Retrievals for GPM (IMERG)
According to Hou et al. [34], the aim of the GPM Integrated Multi-satellite Retrievals for GPM (IMERG, 0.10° × 0.10° hereafter: GPM) mission is to provide a seamless precipitation surface estimate worldwide. An additional objective is to extend TRMM; GPM is a combination of a dual-frequency phased array precipitation radar and a microwave imager. GPM produces three levels of data. Level 1 is broken down into categories A, B, and C, which represent unprocessed data at full resolution, radiometrically corrected and geolocated data, and inter-calibrated data, respectively. The Level 2 data are geographical dataset products derived from Level 1, and the Level 3 datasets are composite datasets derived from Level 2 [35]. IMERG version 06 (GPM IMERG Final Precipitation L3 1 month 0.1° × 0.1° V06, Table 1) was used in the current study. The IMERG-Final monthly satellite product was obtained from NASA's Giovanni website (https://disc.gsfc.nasa.gov/datasets/GPM_3IMERGM_06/summary?keywords=%22IMERG%20final %22).

Gauged Precipitation
The gauged precipitation (hereafter: OBSERVED) datasets (Table 1) were obtained from the NCSI portal (https://data.gov.om/bixytwb/weather?regions=1000000-oman). Data are available for the 77 stations, but observations are reported on a monthly basis, and there are considerable amounts of missing data. To avoid introducing bias into the current study, data from three years of data were used. Figure 2 shows the NCSI OBSERVED' spatial distribution of the national-level

Tropical Rainfall Measuring Mission (TRMM)
The Tropical Rainfall Measuring Mission (TRMM) Multi-satellite Precipitation Analysis (TMPA) research product (TMPA/3B43 Rainfall Estimate L3 1 month 0.25 • × 0.25 • V7) is a combination of satellite rainfall estimates and gauged precipitation. The TMPA (3B43V7) is created from the TRMM adjusted merged PMW, IR, and gauged precipitation from selected stations in the Global Precipitation Climatology Centre (GPCC) rain gauge analysis [13]. The algorithm producing this product is run monthly to produce a seamless best estimate precipitation rate (mm per h). This monthly TMPA (3B43V7) precipitation estimate (hereafter: "TRMM") was obtained from the National Aeronautics and Space Administration (NASA) Giovanni website (https://disc.gsfc.nasa.gov/datasets/TRMM_3B43_7/ summary).

GPM Integrated Multi-Satellite Retrievals for GPM (IMERG)
According to Hou et al. [34], the aim of the GPM Integrated Multi-satellite Retrievals for GPM (IMERG, 0.10 • × 0.10 • hereafter: GPM) mission is to provide a seamless precipitation surface estimate worldwide. An additional objective is to extend TRMM; GPM is a combination of a dual-frequency phased array precipitation radar and a microwave imager. GPM produces three levels of data. Level 1 is broken down into categories A, B, and C, which represent unprocessed data at full resolution, radiometrically corrected and geolocated data, and inter-calibrated data, respectively. The Level 2 data are geographical dataset products derived from Level 1, and the Level 3 datasets are composite datasets derived from Level 2 [35]. IMERG version 06 (GPM IMERG Final Precipitation L3 1 month 0.1 • × 0.1 • V06, Table 1) was used in the current study. The IMERG-Final monthly satellite product was obtained from NASA's Giovanni website (https://disc.gsfc.nasa.gov/datasets/GPM_3IMERGM_06/ summary?keywords=%22IMERG%20final%22).

Gauged Precipitation
The gauged precipitation (hereafter: OBSERVED) datasets (Table 1) were obtained from the NCSI portal (https://data.gov.om/bixytwb/weather?regions=1000000-oman). Data are available for the 77 Atmosphere 2020, 11, 901 6 of 26 stations, but observations are reported on a monthly basis, and there are considerable amounts of missing data. To avoid introducing bias into the current study, data from three years of data were used. Figure 2 shows the NCSI OBSERVED' spatial distribution of the national-level ground-based observation stations. It is important to note that the spatial locations of the rain gauge stations in Oman used by NASA to calibrate the satellite products are unavailable for public use. However, an approximate number of stations within a certain grid of the calibrated satellite precipitation product can be estimated [36]. According to DGMAN [6], the Global Climate Observing System (GCOS) network includes only three stations from Oman. GPCC contributes to the GCOS network [37], meaning that there are likely only three weather stations included in Oman's GPCC networks. No available information to affirm this. The inclusion of such a limited number of stations may bias the satellite products towards the OBSERVED. However, the current study aimed to evaluate the accuracy of these satellite products as proxies for OBSERVED, especially in areas in Oman that have few weather stations. As such, it is possible that more than 95% of the stations used in this study were excluded from the GPCC gauge-calibrating processes of the TRMM and IMERG satellite observations. The evaluation occurred between January 2016 and December 2018 (Figure 3), which represented a period of overlap between the satellite products as well as a period with relatively less missing OBSERVED observations from the NCSI in Oman. As shown in Figure 3, only the years 2016-2018 have missing values less than or equal to 20% (left panel). Additionally, the right panel ( Figure 3) shows several types of missing patterns and their corresponding ratios (e.g., only~6.6% of the datasets are not missing any information). The missing and observed (present) values are represented by yellow and navy-blue ( Figure 3), respectively. These monthly precipitation data were further accumulated into the season time scale as: spring (MAM: March, April, and May), summer (JJA: June, July, and August), and fall (SON, September, October, and November).
The nearest grid points of the satellite products were compared with OBSERVED data. There was no downscaling or upscaling to avoid station error representativeness [3] since both OBSERVED stations and satellite products did not cover the same area [3].
Atmosphere 2020, 11, x FOR PEER REVIEW 6 of 26 ground-based observation stations. It is important to note that the spatial locations of the rain gauge stations in Oman used by NASA to calibrate the satellite products are unavailable for public use. However, an approximate number of stations within a certain grid of the calibrated satellite precipitation product can be estimated [36]. According to DGMAN [6], the Global Climate Observing System (GCOS) network includes only three stations from Oman. GPCC contributes to the GCOS network [37], meaning that there are likely only three weather stations included in Oman's GPCC networks. No available information to affirm this. The inclusion of such a limited number of stations may bias the satellite products towards the OBSERVED. However, the current study aimed to evaluate the accuracy of these satellite products as proxies for OBSERVED, especially in areas in Oman that have few weather stations. As such, it is possible that more than 95% of the stations used in this study were excluded from the GPCC gauge-calibrating processes of the TRMM and IMERG satellite observations. The evaluation occurred between January 2016 and December 2018 ( Figure 3), which represented a period of overlap between the satellite products as well as a period with relatively less missing OBSERVED observations from the NCSI in Oman. As shown in Figure  The nearest grid points of the satellite products were compared with OBSERVED data. There was no downscaling or upscaling to avoid station error representativeness [3] since both OBSERVED stations and satellite products did not cover the same area [3].

Statistical Verification Techniques
Three different verification methods were applied in this study: continuous verification methods, categorical verification methods (i.e., dichotomous "forecast" (satellite product) verification (Table 2) and change point analysis.

Gauge Rain Gauge No-Rain Total
Satellite products * H signifies that the satellite products predicted rain to occur, and it did occur ("Hit"); FA signifies that the satellite products predicted rain to occur, but it did not occur ("False Alarm"); M signifies that the satellite products predicted rain not to occur, but it did occur ("Miss"); and CN signifies that the satellite products predicted rain not to occur, and it did not occur ("Correct Negatives").

Continuous Verification Index
This type of verification is used to assess how the values of the satellite observations (F) differ from those of OBSERVED observations (O). Table 3 shows the continuous verification methods examined in this study.

Categorical Verification Index
This type of verification involves converting the satellite and OBSERVED into a dichotomous "forecast" (satellite product) system where the possible outcomes are "yes, an event with a certain threshold will occur" or "no, the event will not occur" [38]. Categorical verification indices involve the construction of a contingency table (see Table 2) that provides "yes" or "no" predictions and occurrences [38]. The four combinations of the satellite product (rain or no rain) and OBSERVED (rain or no rain) are represented, as shown in Table 2. Five different precipitation thresholds (2, 5, 10, 20, and 30 mm/month) were used in this study. This follows the WMO criteria for precipitation intensity classifications with slight modifications [39]. These thresholds will vary from one country to another as reported by Llasat [40]. It is also worth pointing out that these precipitation thresholds defined for the monthly time scale may also hold true for daily time steps since the accumulated monthly rainfall amounts arise solely from only a few daily data points (~one per month). These threshold levels are defined to test the ability of the satellite products in detecting different levels of rainfall intensity.
All the categorical indices can be represented in a performance diagram [41] in which they are combined into a single plot. This has been implemented in verification packages of R Statistical Software [42]. It is then simple to make deductions about the performance of the satellite products using POD, CSI, BIA, and SR. In an optimal performance situation, the perfect performance should lie in the upper right-hand corner of the plot [41]. Table 3. Continuous and categorical verification metrics for TRMM and GPM satellite observations.

Correlation (r)
This test measures the linear relationship or phase error between the satellite observations and OBSERVED or how well both products correspond, expressed as where F and O are average values of satellite and OBSERVED, respectively. Ranged from -1 to 1 with a perfect score equal to 1 Mean Absolute Error (MAE) MAE can be described as the average difference between satellite and OBSERVED. In other words, this metric estimates the closeness of the satellite products to the OBSERVED. MAE can range from 0 to α with an optimal value approaching 0.
RMSE measures the average error, which is weighted based on the square of the error. RMSE also indicates sample standard deviation between the satellite observation and OBSERVED. The optimal value for RSME is 0.

Probability of Detection (POD)
POD is also known as the "hit rate." It is an index that combines both "misses" and "hits" from the contingency table. The optimal value is 1.
FAR is an index that quantifies the failure of the satellite products to mismatch the OBSERVED no rain occurrence. It is always used in conjunction with POD, having a perfect value of FAR = 0 and POD = 1. In the performance diagram examined herein, the success ratio (SR; equal to 1-FAR) is plotted against POD. CSI quantifies the fraction of OBSERVED and/or satellite precipitation that was correctly predicted [31]. The index ranges from 0 to 1, with a perfect score of 1.

Bayesian and Nonparametric Multiple Change Point Analyses
It is important to characterize the abrupt changes or aberrations between the gauging stations using precipitation estimates between observation and satellite products on seasonal time steps. For an area with complex terrain like Oman, this evaluation will help in locating stations that change within a season or from one season to another. Erdman and Emerson [21] implemented the Bayesian change point method technique in statistical package (hereafter: ecp) in R [21]. In their implementation, a series can have a change point based on the Markov Chain Monte Carlo Metropolis-Hasting method with a transition probability, ρ, and the conditional probability of change at location j + 1, which is defined in the following equation [19,21,43]: Atmosphere 2020, 11, 901 9 of 26 where N is the number of seasonal observations, Y is the seasonal precipitation, U is a partition block, w is the ratio of signal error to error variance, and w 0 , B 0 , w 1 , and B 1 are within-and between-block sums of squares derived when U i = 0 and U i = 1, respectively. More information about this technique can be found in Erdman and Emerson [21].
The other change point method is the non-nonparametric multiple change point analysis developed by James and Matheson [20]. This method has also been implemented as an ecp statistical R package and can be used for both univariate and multivariate datasets. In this study, the satellite products and OBSERVED were treated as independent univariate variables. This method involves nonparametric estimation of the total number of change points and the location at which they occur [20]. The technique is based on Euclidean distance between sample points without any assumption of a change in distribution except the existence of the αth absolute moment (i.e., for some α ∈ (0, 2)) [20]. Furthermore, the estimation of the change point is based on hierarchical clustering using divisive and agglomerative algorithms. The "E-Divisive" method used in this study sequentially identifies change points through a bisection algorithm [20,44] and a divergence measure (distributional changes through energy statistics) derived from Szekely and Rizzo [45]. The E-Divisive method for multiple change points involves performing multiple iterations to locate a change point. In each iteration, another change point is estimated that divides the initial segment. This repetitive procedure can be described as a diagrammed binary tree where the root node can be classified as no change points (which also contain the whole length of the time series), whereas all other non-root nodes can be a copy of their parent or new segments that are formed by the addition of a change point to the parent [20]. The mathematical formulation of this non-parametric technique can be found in James and Matheson [20].
The combinations of the posterior probability of change point & parameter uncertainty quantification using bcp and the non-assumption of normality in the ecp method will strengthen the identification of timing and the number of changes in the precipitation time series.

Results and Discussion
To assess the accuracy and reliability of the satellite products at estimating precipitation in Oman, the following factors were considered: (1) The distributional characteristics of the satellite products compared to the OBSERVED on monthly, seasonal, and annual time scales and also across all 11 governorates of Oman. Spatial distributions of the satellite products on an annual time scale were also considered. (2) Assessments of GPM and TRMM using continuous verification indices over the entire country and its subdivisions.  Table 4 shows the descriptive statistics for both satellite products and OBSERVED for 2016-2018. For 2016, the average annual precipitation for OBSERVED, TRMM, and GPM was 84.8 mm, 118.9 mm, and 121.9 mm, respectively. This shows that the satellite product overestimated the annual average precipitation for 2016. A similar trend was observed in 2017. For 2018, the annual average precipitation for OBSERVED, TRMM, and GPM was 179.9 mm, 103.4 mm, and 98.3 mm, respectively. Table 4 also shows that in 2016, there are coefficients of variation (CV) of 78.0%, 50.6%, and 49.6% for OBSERVED, TRMM, and GPM, respectively. This shows the spatial heterogeneity and variability of rainfall patterns in a complex terrain like that of Oman. A similar trend is also reproduced in 2017 and 2018. The maximum annual precipitation was 2031.8 mm, 426.2 mm, and 406.5 mm for OBSERVED, TRM, and GPM, respectively, for 2018. The large OBSERVED value is due to the large extreme events of 2018, which the TRMM and GPM could not match in terms of magnitude. Overall, these similar results suggest that the satellite products overestimated the annual average precipitation in 2016 and 2017, and underestimated it in 2018, but there is a need for further quantitative metrics providing indicators of consistency and accuracy.  Figure 4 shows the characteristics of the satellite products and OBSERVED on a monthly basis. In 2016, the highest amount of monthly precipitation was observed in March, with almost 50, 63, and 62 mm for OBSERVED, TRMM, and GPM, respectively. This high magnitude of rain is consistent with what was reported when heavy rain fell in different parts of Oman on both the first and second week of March, resulting in the loss of eight lives, damage to property, and overflowing dams and wadis (rivers). Throughout 2016, both TRMM and GPM consistently overestimated the OBSERVED. In 2017, March was also the month with the highest level of precipitation; however, there were also high levels in May, July, and December. The high value in May 2018 was due to the Mekunu cyclone, which formed and dissipated on 21 May 2018 and 27 May 2018, respectively. It is interesting to see that both TRMM and GPM captured this event. However, the satellite products underestimated the tropical storm "Hikka" that made landfall in Oman on 24 September 2018 [46]. Similarly, TRMM and GPM underestimated the tropical storm "Luban", which occurred on 10 October 2018 in the Dhofar Governorate. Both TRMM and GPM underestimated the June-October 2018 precipitation. Apart from this exception, TRMM and GPM responded to the timing and monthly changes in precipitation patterns in Oman. This shows that both products are useful as proxies for OBSERVED. This is especially true in locations where rain gauges are sparse or where satellite products could be used as an additional data source for OBSERVED. The monthly variability observed during the study period also confirms the inconsistency of rainfall patterns in the region, which was captured by both satellite products. This indicates that over the three-year study period, both TRMM and GPM agreed with OBSERVED timing (except June-October 2018) when examined on a monthly basis. Since monthly observations are an aggregate of daily values from the number of rainy days in any given month, this agreement between OBSERVED and satellite products would likely be achieved using accumulated daily precipitation observations. Monthly accumulated precipitation is useful for understanding the water balance of local and regional water basins using a hydrologic model. The findings of this evaluation suggest that satellite products are reliable for water resource modeling and predictions. understanding the water balance of local and regional water basins using a hydrologic model. The findings of this evaluation suggest that satellite products are reliable for water resource modeling and predictions.  Figure 5 shows the radar chart of the regional monthly average estimates of OBSERVED, TRMM, and GPM. The quantitative axes show the months, and the values on the axes represent the precipitation values (mm). Figure 5a shows the monthly variability for the entire country. It is clear that both TRMM and GPM overestimated the winter months, while they underestimated the summer and fall months. It can be seen that both satellite products replicate the same shape formed by OBSERVED. In the Dhofar region (Figure 5b), the highest precipitation recorded (~70 mm) by OBSERVED was in May, and both satellite products reproduced this; however, both products underestimated the precipitation for the summer and fall months. Both TRMM and GPM followed a similar shape formed as that of OBSERVED. This similar performance and trend was also noticed for other regions, except in the Interior region, where both TRMM and GPM overestimated winter and spring months and did not follow the same shape formed by OBSERVED.  Figure 5 shows the radar chart of the regional monthly average estimates of OBSERVED, TRMM, and GPM. The quantitative axes show the months, and the values on the axes represent the precipitation values (mm). Figure 5a shows the monthly variability for the entire country. It is clear that both TRMM and GPM overestimated the winter months, while they underestimated the summer and fall months. It can be seen that both satellite products replicate the same shape formed by OBSERVED. In the Dhofar region (Figure 5b), the highest precipitation recorded (~70 mm) by OBSERVED was in May, and both satellite products reproduced this; however, both products underestimated the precipitation for the summer and fall months. Both TRMM and GPM followed a similar shape formed as that of OBSERVED. This similar performance and trend was also noticed for other regions, except in the Interior region, where both TRMM and GPM overestimated winter and spring months and did not follow the same shape formed by OBSERVED. Figure 6 shows the seasonal variability of TRMM and GPM with the OBSERVED for (a) 2016, (b) 2017, and (c) 2018. Both satellite products overestimated the OBSERVED consistently for the 2016 season. This trend was not as obvious in 2017, as both TRMM and GPM underestimated the OBSERVED during the winter, spring and autumn. In 2018, TRMM and GPM overestimated the OBSERVED during the spring and summer seasons, respectively. In general, the spring season had the highest amount of precipitation across the three years, except in October (autumn) 2018, when the tropical storm "Lubia" made landfall in Dhofar. Figure 7 shows the box and whisker plot of the satellite products and OBSERVED across the 11 governorates of Oman for (a) 2016, (b) 2017, and (c) 2018. The governorates received different amounts of precipitation during the study period; as such, Figure 7 shows clear spatial variability and heterogeneity in terms of the frequency and amount of rain received by each governorate. It is notable that although the governorates have different climatic regions (coastal, mountainous, desert plain, etc.), across the country, the spring season had the highest amount of precipitation and the largest variability. The only exception was the Dhofar Governorate, where a large amount of precipitation occurred in the summer during the Kharif season mentioned above. Atmosphere 2020, 11, x FOR PEER REVIEW 12 of 26     Table 5 shows the RMSE and AME values for the entire country and the three precipitation regimes. TRMM and GPM have an RMSE of 37.13 and 36.68 mm/month, respectively. Both products also have a similar MAE of approximately 9 mm/month. This similarity and trend were noticed in all the three different precipitation regimes of Oman. Notably, the RMSE values of both products in the Dhofar region were higher than the entire country. This region experienced a cyclone and tropical storms in two of the years (2017 and 2018). These large values may be a result of both TRMM and GPM underestimating these extreme events.      Table 5 shows the RMSE and AME values for the entire country and the three precipitation regimes. TRMM and GPM have an RMSE of 37.13 and 36.68 mm/month, respectively. Both products also have a similar MAE of approximately 9 mm/month. This similarity and trend were noticed in all the three different precipitation regimes of Oman. Notably, the RMSE values of both products in the Dhofar region were higher than the entire country. This region experienced a cyclone and tropical storms in two of the years (2017 and 2018). These large values may be a result of both TRMM and GPM underestimating these extreme events. Figure 9 shows the correlation matrix of the monthly average precipitation of TRMM, GPM, and OBSERVED for year (a) 2016, (b) 2017, and (c) 2018 and for subdivisions (d) Northern, (e) Dhofar, and (f) Interior of Oman. The estimates of both GPM and TRMM were correlated with the OBSERVED, with values of r = 0.76 and 0.75 for TRMM and GPM, respectively, for 2016, demonstrating good agreement. The GPM and TRMM estimates were also correlated (r > 0.97), suggesting that GPM is a suitable replacement for TRMM and also confirming the similarities in their underlying algorithm. For cyclone years (2017 and 2018), there were weak correlations that suggest there is a misalignment in the 1:1 relationship between the OBSERVED and satellite products. Figure 9d-f are scatterplots showing the correlation between satellite products and OBSERVED for three subdivisions of Oman. As shown in Figure 9d, in the Northern region of Oman, the two satellite products showed good agreement with OBSERVED estimates, with correlation values of r = 0.64 and 0.62 for TRMM and GPM, respectively. In contrast, the performance of both satellite products was weak in the Dhofar region (Figure 9e). Indeed, correlation with the OBSERVED was weaker, with values of r = 0.43 and 0.45 for TRMM and GPM, respectively. The distribution of the station data around the correlation line was also highly variable. It is important to note that the Dhofar region includes both mountainous and coastal areas. Further research may be needed to better understand the performance of satellite observations for these distinct climatic regions. Weaker correlations were also shown for the Interior region with r = 0.39 and 0.40 for TRMM and GPM, respectively.   The GPM and TRMM estimates were also correlated (r > 0.97), suggesting that GPM is a suitable replacement for TRMM and also confirming the similarities in their underlying algorithm. For cyclone years (2017 and 2018), there were weak correlations that suggest there is a misalignment in the 1:1 relationship between the OBSERVED and satellite products. Figure 9d-f are scatterplots showing the correlation between satellite products and OBSERVED for three subdivisions of Oman. As shown in Figure 9d, in the Northern region of Oman, the two satellite products showed good agreement with OBSERVED estimates, with correlation values of r = 0.64 and 0.62 for TRMM and GPM, respectively. In contrast, the performance of both satellite products was weak in the Dhofar region (Figure 9e). Indeed, correlation with the OBSERVED was weaker, with values of r = 0.43 and 0.45 for TRMM and GPM, respectively. The distribution of the station data around the correlation line was also highly variable. It is important to note that the Dhofar region includes both mountainous and coastal areas. Further research may be needed to better understand the performance of satellite observations for these distinct climatic regions. Weaker correlations were also shown for the Interior region with r = 0.39 and 0.40 for TRMM and GPM, respectively.

Performance Assessments of GPM and TRMM using Change Point Analysis
The change point plots in Figures 10-12 show the partitioning of the original seasonal datasets (solid lines) and location of change and aberrations with high posterior probability (dash red lines) identified by both the bcp and ecp methods. In Figure 10, change point methods clearly differentiate the seasonal pattern in OBSERVED, TRMM, and GPM for 2016, except the extract change points at locations 142 and 173 identified in OBSERVED. This may be a result of "more noise" in OBSERVED compared to TRMM and GPM, which were from gridded products. Interpolated/gridded surfaces tend to smoothing values through the underestimation of high values and overestimation of low values. In general, it is clear that TRMM and GPM have similar internal structures compared to the OBSERVED time series from the perspective of the seasonal changes. For 2017 (Figure 11), it is clear that the satellite products show similar characteristics compared to the OBSERVED, as there were change points that did not follow the original seasonal changes. The timing and locations of changes in both observed and satellite products look similar. This also shows that satellite products exhibit similar seasonal properties of the OBSERVED for 2017. In Figure 12, OBSERVED and satellite products show similar change points that did not follow the original seasonal partitioning. The

Performance Assessments of GPM and TRMM Using Change Point Analysis
The change point plots in Figures 10-12 show the partitioning of the original seasonal datasets (solid lines) and location of change and aberrations with high posterior probability (dash red lines) identified by both the bcp and ecp methods. In Figure 10, change point methods clearly differentiate the seasonal pattern in OBSERVED, TRMM, and GPM for 2016, except the extract change points at locations 142 and 173 identified in OBSERVED. This may be a result of "more noise" in OBSERVED compared to TRMM and GPM, which were from gridded products. Interpolated/gridded surfaces tend to smoothing values through the underestimation of high values and overestimation of low values. In general, it is clear that TRMM and GPM have similar internal structures compared to the OBSERVED time series from the perspective of the seasonal changes. For 2017 (Figure 11), it is clear that the satellite products show similar characteristics compared to the OBSERVED, as there were change points that did not follow the original seasonal changes. The timing and locations of changes in both observed and satellite products look similar. This also shows that satellite products exhibit similar seasonal properties of the OBSERVED for 2017. In Figure 12, OBSERVED and satellite products show similar change points that did not follow the original seasonal partitioning. The posterior probabilities of change at each of the changing locations are similar for both OBSERVED and the satellite products. The seasonal response and timing of any precipitation products are paramount as properties to be considered in seasonal forecast, hydrologic modeling and river basin water balance analysis.
Atmosphere 2020, 11, x FOR PEER REVIEW 18 of 26 posterior probabilities of change at each of the changing locations are similar for both OBSERVED and the satellite products. The seasonal response and timing of any precipitation products are paramount as properties to be considered in seasonal forecast, hydrologic modeling and river basin water balance analysis.      Figure 13 shows a performance diagram of TRMM and GPM at different precipitation thresholds for the year (a) 2016, (b) 2017, and (c) 2018 and for subdivisions (d) Northern, (e) Dhofar and (f) Interior of Oman. This diagram provides several categorical indexes (POD, BIAS, CSI, and SR (1-FAR)) together on a single page. The five different precipitation thresholds are considered and represented from the smallest (2 mm/month) to the largest (>30 mm/month). The solid contour lines show the CSI values, while the BIAS shows the dash lines with extended labels on the upper (2nd) axes of x and y. In all the considered scenarios, it is clear that TRMM (red) and GPM (green) produced high POD (>0.8) at low precipitation thresholds. POD decreased as precipitation thresholds increased, except for the case shown in Figure 13a. It is interesting to see that TRMM and GPM performed well in 2016 (Figure 13a). This is the only year without any major tropical storm or cyclone events. In 2016, both TRMM and GPM detected large precipitation (>30 mm/month), low CSI, and BIAS of 1. Tropical cyclones are always characterized by high winds, intense rainfall, warm temperatures, and low atmospheric pressures. It seems both TRMM and GPM have problems detecting these features compared to a period when there are no such events. This result is consistent with performance using other techniques such as change point and correlation. In Figure 13b,c, it is shown that TRMM and GPM have problems detecting large amounts of precipitation and also underestimate large precipitation thresholds. For the three regions considered, TRMM and GPM show similar performance of overestimation of low precipitations and underestimation of large precipitations. In terms of BIAS (frequency bias), TRMM slightly performed better than GPM at low and high precipitation thresholds. TRMM's CSI was similar to GPM at low and high thresholds (>30 mm/month). In terms of SR (i.e., 1-FAR), the two products performed similarly. Figure 13 shows a performance diagram of TRMM and GPM at different precipitation thresholds for the year (a) 2016, (b) 2017, and (c) 2018 and for subdivisions (d) Northern, (e) Dhofar and (f) Interior of Oman. This diagram provides several categorical indexes (POD, BIAS, CSI, and SR (1-FAR)) together on a single page. The five different precipitation thresholds are considered and represented from the smallest (2 mm/month) to the largest (>30 mm/month). The solid contour lines show the CSI values, while the BIAS shows the dash lines with extended labels on the upper (2nd) axes of x and y. In all the considered scenarios, it is clear that TRMM (red) and GPM (green) produced high POD (>0.8) at low precipitation thresholds. POD decreased as precipitation thresholds increased, except for the case shown in Figure 13a. It is interesting to see that TRMM and GPM performed well in 2016 (Figure 13a). This is the only year without any major tropical storm or cyclone events. In 2016, both TRMM and GPM detected large precipitation (>30 mm/month), low CSI, and BIAS of 1. Tropical cyclones are always characterized by high winds, intense rainfall, warm temperatures, and low atmospheric pressures. It seems both TRMM and GPM have problems detecting these features compared to a period when there are no such events. This result is consistent with performance using other techniques such as change point and correlation. In Figure 13b,c, it is shown that TRMM and GPM have problems detecting large amounts of precipitation and also underestimate large precipitation thresholds. For the three regions considered, TRMM and GPM show similar performance of overestimation of low precipitations and underestimation of large precipitations. In terms of BIAS (frequency bias), TRMM slightly performed better than GPM at low and high precipitation thresholds. TRMM's CSI was similar to GPM at low and high thresholds (>30 mm/month). In terms of SR (i.e., 1-FAR), the two products performed similarly. Figure 13e shows the performance diagram of the satellite products for the Dhofar Region. TRMM and GPM performed similarly in detecting precipitation at the defined precipitation thresholds. The performance of both TRMM and GPM are shown in Figure 13f for the Interior Region. In this region, both products had difficulty detecting high precipitation levels. However, it is important to note that this region has the fewest rain gauge stations, which may have an influence on or inflate this "similar" result.  The circles represent different monthly precipitation thresholds, ranging from low (<2 mm) to high (>30 mm) rain. Solid contours represent the CSI, while dashed lines represent the BIAS. Figure 13e shows the performance diagram of the satellite products for the Dhofar Region. TRMM and GPM performed similarly in detecting precipitation at the defined precipitation thresholds. The performance of both TRMM and GPM are shown in Figure 13f for the Interior Region. In this region, both products had difficulty detecting high precipitation levels. However, it is important to note that this region has the fewest rain gauge stations, which may have an influence on or inflate this "similar" result.

Performance Assessments of GPM and TRMM Using Categorical Verification Indices
This study evaluated the strength and weaknesses of two satellite products for Oman. In general, both satellites use the same algorithm and therefore not surprising the similarities in performance. From the literature, previous studies such as those by Zhang et al. [18] and Tan and Duan [47] did not observe any significant improvement of GPM over TRMM. The result from this study is also consistent with findings from African coastal [48] and high topography [49] areas, where issues with the performance of satellite observations have been reported. While TRMM will be completely discontinued, GPM has better spatial and temporal resolutions with potential applications such as in operational cyclone monitoring in Oman. This study is very important and relevant for regions with sparse rain gauge networks. The steps applied for this evaluation may be applied to other areas but care must be taken when applying to regions with a different number of rain gauge networks and contrast climatic characteristics.

Importance of Precipitation Estimates and Analysis in Socio-Hydrologic Resilience
For a country like Oman, which is prone to frequent extreme events such as cyclones, sea-level rise, flash floods, etc., the importance of precipitation measurement and analysis cannot be over-emphasized. These events cannot be stopped from occurring, but they can be adapted to. In the face of these frequent threatening events, decision-makers, farmers, and all stakeholders need a decision support tool that can provide timely information about precipitation values in near real-time. There is a need to utilize currently available satellite precipitation products and OBSERVED in an integrated way for sustainable water management, flood and seasonal forecasting, hydrologic modeling, etc. Beyond the usefulness of satellite observations, there is a need for pragmatic efforts in providing a platform for precipitation analysis that has both finer spatial and temporal resolutions. Creating this platform is possible through the assimilation of climatic forecast (as a first guess), OBSERVED, satellite and radar rainfall observations, similar to the Canadian Precipitation Analysis [8,50,51]. With an appropriate bias correction framework such as that which is implemented in Friesen et al. [52], this method will correct and adjust the precipitation intensity and systematic bias. In other words, for real-time or near real-time applications, the average monthly accumulated OBSERVED from rain gauges through the entire country (from the previous 30 days) can be used for bias correction. The evaluated satellite products can be used as an additional data source or as a background field. The rainfall analysis could be produced either at 6-h or daily time steps. Having this type of product would strengthen Oman's adaptive strategies and socio-hydrologic resilience, especially in combating frequent extreme events.

Conclusions
This study evaluated the accuracy of precipitation estimates produced by two satellite products: TRMM (3B43V7) and GPM (IMERG V6) in the Sultanate of Oman. For continuous statistics verification performance, indices such as correlation (r), RMSE, and MAE were used to compare the association between the satellite products and OBSERVED from 77 weather stations. In the categorical verification domain, statistics such as CSI, SR (i.e., 1-FAR), POD, and BIAS (frequency bias) were used to assess the performance of the two products. In addition, change point methods from the Bayesian and non-parametric perspectives were also used to determine the timing and number of changes in the time-series. Study findings indicated that both GPM and TRMM compared well with OBSERVED observations in 2016, when there was no tropical storm or cyclone. It is recommended that future studies investigate the performance of early runs of GPM for real-time extreme events such as cyclone monitoring in Oman. However, before using these products for operational applications, relevant bias correction procedures may be needed. Additionally, more study years should be evaluated if OBSERVED from more rain-gauge stations is available. For hydrological forecasting or agricultural applications such as capturing near real-time cyclones, the satellite products should be assessed at shorter temporal scales such as sub-hourly, hourly, or daily accumulated rates.
In the current study, the following concluding remarks were drawn: (1) There was generally a weak linear relationship between GPM and TRMM in all the regions and years considered for the entire period of study, except for the good correlation observed in the Northern region and also in 2016 (for the entire country). The year 2016 was without any cyclone event and was generally dry except for heavy rain in March 2016. The weak performances seen in the Dhofar region may be due to the two distinct physiographic features in the areas (mountain and coastal climates). The Dhofar region may need to be analyzed separately in future studies to more accurately assess the performance of the satellite products in the region. Examining the products across all 11 governorates of Oman, there was consistent spatial variability in the satellite products, especially in regions where there are relatively few gauged networks. Bias corrections may be needed to be performed on the products before being used for any real-time applications. (2) In Oman, both products can detect low to medium precipitation thresholds; however, both have difficulty detecting precipitations at higher thresholds. In terms of BIAS, both products overestimated precipitation compared to OBSERVED at low precipitation thresholds, but underestimated precipitation levels at high thresholds. Of the three regions considered, there was a similar performance of the satellite products in all the regions, confirming the similarities in the algorithms used to produce them. For all the regions, the weak correlation performances noted above also reflect in the categorical performance. (3) From the change point analysis perspective, TRMM and GPM compared well with OBSERVED in their ability to detect seasonal changes or aberrations in both annual and seasonal scales.
Although there were differences in terms of the posterior mean obtained from bcp, there were similarities in change points or aberrations in the rain-gauge station locations across all the years. In general, TRMM and GPM showed similar characteristics with the OBSERVED in the number and timing of seasonal changes computed across the years. (4) In terms of measuring annual precipitation, the two satellite products demonstrated the same spatial pattern as the OBSERVED over the three study years (except 2018, when there was a tropical storm and a cyclone). In all the years, there was strong agreement between TRMM and GPM in terms of spatial distribution and magnitude.
Overall, this study assessed the performance of the GPM and TRMM satellite products in Oman. Both showed promising results (which can further be improved with appropriate bias corrections) in terms of acting as proxies for OBSERVED. There were no marked differences between the two products, suggesting that GPM can act as a reliable replacement for TRMM.