Evaluation of Satellite Rainfall Estimates for Meteorological Drought Analysis over the Upper Blue Nile Basin, Ethiopia

The objective of this study was to evaluate the performance of satellite rainfall estimates (Climate Hazards Group Infrared Precipitation with Stations version 2 (CHIRPSv2) and Multi-Source Weighted-Ensemble Precipitation version 2 (MSWEPv2) from 1981 to 2018 for monthly meteorological drought analysis over the Upper Blue Nile (UBN) basin. The reference for the performance evaluation was rainfall measured in situ selected with reference to the elevation zones of the basin: Highland, midland, and lowland. Both the measured and estimated rainfall datasets were aggregated by month at a spatial resolution of 10 km × 10 km with a temporal coverage of 38 years from 1981 to 2018 and evaluated with respect to raw precipitation statistics and the standardized precipitation index (SPI). The values of SPI were validated with reference to documented meteorological drought records of the country. The mean bias, correlation coefficient, probability of bias (PBias, %), mean error (ME, mm), and root mean square error (RMSE, mm) values across the elevation zones for CHIRPSv2 were found to be 1.07, 0.91, 6.75, 7.74, and 122.34, respectively. The corresponding values were 1.19, 0.87, 18.56, 19.54, and 130.26 for MSWEPv2. Based on this result, CHIRPSv2 was employed to analyze the magnitude of drought in the different elevation zones of the UBN. The magnitude (SPI) of monthly meteorological drought over the entire UBN basin from 1981 to 2018 ranged from 0 to −3.74. The strongest negative SPI value (−3.74) was observed in August 1984 in midland areas. The highest magnitude of drought was−3.0 in July 2015 over the highland and−3.03 in June 2015 over the lowland during 2014–2017. The observed drought was characterized by extreme, severe, and moderate levels. The mean frequency of severe/extreme meteorological drought in the 38-year period over the highland, midland, and lowland parts of the UBN ranged from 7% to 11%. The average of severe/extreme drought events in each of the elevation zones of the basin was 9%, that is, drought occurred almost every 10 years for all elevation zones of the basin. Over the 38-year period, severe/extreme drought occurred at the onset and/or offset time of rainy season over all elevation zones of the basin. The UBN is characterized as a drought-prone basin. However, the frequency and magnitude of drought could neither be described as a decreasing nor as an increasing linear trend. Thus, the farming practices in the basin need to be enhanced with an improved early warning system and drought-resistant seed technologies.


Introduction
Most of the water used in the lowlands between Ethiopia and the Mediterranean Sea originates in the Ethiopian Highlands. The Blue Nile basin alone contributes 60-70% of the water in the River Nile flowing through Sudan and Egypt [1]. According to the authors of [2], in Sudan and Egypt, up to 95% of the water used is blue water from the Nile. Until recently, more than 95% of the agricultural area in the headwaters depended almost exclusively on green water from precipitation [3]. Smallholder-based rainfed agriculture is the backbone of the Ethiopian economy [4]. The effect of climate change is expected to be more severe in developing economies, as their livelihood depends on subsistence-based rainfed agriculture [5]. In this regard, Ethiopia's economy is subject to the direct impact of climate change and meteorological drought [6][7][8][9]. In Ethiopia, drought is a frequently recurring phenomenon that is often accompanied by serious and diversified impacts on human lives and the environment [10,11]. The authors of [12], in an investigation on country economies versus rainfall deficit, confirmed that national gross domestic product (GDP) consistently depends on rainfall deficit and meteorological drought.
In line with the finings mentioned above, rainfall is one of the most important parameters for the characterization of water cycle and drought. In Africa, the assessment, planning, and management of water resources are often constrained by a lack of reliable rainfall data [8,13]. The density and spatial distribution of rain gauges in the Upper Blue Nile Basin is uneven and time-varying [8,14,15]. To solve problems caused by the shortage of spatiotemporal precipitation data for drought analysis, scientific communities have generated satellite-based products [16,17]. According to the performance assessment study conducted in the last decade in Ethiopia, various researchers have suggested different rainfall products.
The Climate Hazards Group Infrared Precipitation with Stations (CHIRPS) product has been shown to be significantly better than the African Rainfall Climatology version 2 (ARC2) product, having higher skill and low or no bias [18]. CHIRPS was also found to be slightly better than the latest Tropical Applications of Meteorology using Satellite data (TAMSAT) products on monthly timescales. The authors of [8] chose and used the CHIRPS rainfall product to assess the spatial and temporal variability of meteorological drought.
As reported by the authors of [19], event detection showed that Multi-Source Weighted-Ensemble Precipitation (MSWEP) outperformed both adjusted and unadjusted satellite-based products of the Tropical Rainfall Measuring Mission (TRMM) Multisatellite Precipitation Analysis (TMPA), the National Oceanic and Atmospheric Administration (NOAA) Climate Prediction Center (CPC) Morphing Technique (CMORPH), Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks (PERSIANN), and European Center for Medium-Range Weather Forecast (ECMWF) ERA Interim reanalysis. The MSWEP performed better in rainfall event detection. The authors of [19] performed TMPA, CMORPH, PERSIANN, and ECMWF ERA interim reanalysis and evaluated MSWEP estimates.
The Climate Hazards Group Infrared Precipitation, CHIRPS [16], Multi-Source Weighted-Ensemble Precipitation, MSWEP [17], Climate Prediction Center morphing method, CMORPH [20], Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks, PERSIANN [21], Tropical Applications of Meteorology Using Satellite Data and Ground-Based Observations, TAMSAT [22], the Tropical Rainfall Measuring Mission, Multisatellite Precipitation Analysis real-time product 3B42RT, TMPA-RT [23], and Global Satellite Mapping of Precipitation, GSMaP [24] are among the sources of multisatellite precipitation products. The unique nature of MSWEPv2 and CHIRPSv2 products is the availability of data since 1979 and 1981, respectively, at high spatial resolution. To date, no research has assessed the performance of the two available satellite-based rainfall products (MSWEPv2 and CHIRPSv2) mentioned above over the various elevation zones of the Upper Blue Nile (UBN) basin in the northwestern highlands of tropical Africa. Accordingly, the objective of the study was to evaluate the performance of satellite rainfall estimates (CHIRPSv2 and MSWEPv2) from 1981 to 2018 in meteorological drought analysis over the various elevation zones of the UBN basin, Ethiopia.
The research tried to answer the question: Which satellites rainfall estimates, MSWEPv2 or CHIRPSv2, were more accurate from 1981 to 2018 for meteorological drought analysis over the elevation zones of Upper Blue Nile (UBN)?

Background of the Study Area
The UBN basin, locally called Abbay in Ethiopian, is located in the northwestern region of the country within 7.5 • to 12 • N and 34 • to 40 • E ( Figure 1). The basin contributes a large share of water to the main Nile (above 60% of the Nile total flow) and covers a total area of 176,000 km 2 upstream of the Ethiopia-Sudan border [1]. The geological formations of the basin are dominated by volcanic rock and Precambrian basement rock, and small areas are covered by sedimentary rock [1]. Highlands with rugged mountainous areas in the central and eastern part of the basin and the lowlands in the western part of the basin are the two major topographic features. The altitude in the basin ranges from~490 m above sea level (a.s.l.) in the lowlands (near the Ethio-Sudan border) and up to~4300 m a.s.l. in the highlands (central part of Ethiopia). The dominant soil types in the UBN are leptosols, alisols, nitosols, vertisols, and cambisols [25]. Dry land crop, pastures, savannah, grassland, woodland, water bodies, and sparsely vegetated plants [26,27] dominate the land cover in the UBN.
Geosciences 2020, 10, x 3 of 22 question: Which satellites rainfall estimates, MSWEPv2 or CHIRPSv2, were more accurate from 1981 to 2018 for meteorological drought analysis over the elevation zones of Upper Blue Nile (UBN)?

Background of the Study Area
The UBN basin, locally called Abbay in Ethiopian, is located in the northwestern region of the country within 7.5° to 12° N and 34° to 40° E ( Figure 1). The basin contributes a large share of water to the main Nile (above 60% of the Nile total flow) and covers a total area of 176,000 km 2 upstream of the Ethiopia-Sudan border [1]. The geological formations of the basin are dominated by volcanic rock and Precambrian basement rock, and small areas are covered by sedimentary rock [1]. Highlands with rugged mountainous areas in the central and eastern part of the basin and the lowlands in the western part of the basin are the two major topographic features. The altitude in the basin ranges from ~490 m above sea level (a.s.l.) in the lowlands (near the Ethio-Sudan border) and up to ~4300 m a.s.l. in the highlands (central part of Ethiopia). The dominant soil types in the UBN are leptosols, alisols, nitosols, vertisols, and cambisols [25]. Dry land crop, pastures, savannah, grassland, woodland, water bodies, and sparsely vegetated plants [26,27] dominate the land cover in the UBN.
The annual rainfall of the UBN basin is estimated as 787-2200 mm. The UBN basin is characterized by bimodal rainfall seasons, locally called Belg and Kiremit. Belg is a short rainfall season from March to May, whereas Kiremit is the main rainfall season from June to September [28]. Following the seasonality of the rainfall, 82% of the annual flow occurs during the main rainy season in June to September [28]. The average temperature in the basin is 21 °C [29].

Satellite Rainfall Estimates
(1) Multi-Source Weighted-Ensemble Precipitation Version 2 (MSWEPv2) As described by the authors of [17], MSWEPv2 is a fully global, historic precipitation dataset (1979-present) with a three-hour temporal and 0.1° (~11 km at the equator) spatial resolution. This satellite rainfall is produced from satellite estimates, reanalysis data sources, and gridded gauge. It is The annual rainfall of the UBN basin is estimated as 787-2200 mm. The UBN basin is characterized by bimodal rainfall seasons, locally called Belg and Kiremit. Belg is a short rainfall season from March to May, whereas Kiremit is the main rainfall season from June to September [28]. Following the seasonality of the rainfall, 82% of the annual flow occurs during the main rainy season in June to September [28]. The average temperature in the basin is 21 • C [29]. As described by the authors of [17], MSWEPv2 is a fully global, historic precipitation dataset (1979-present) with a three-hour temporal and 0.1 • (~11 km at the equator) spatial resolution. This satellite rainfall is produced from satellite estimates, reanalysis data sources, and gridded gauge. It is validated at the global scale using observations from~70,000 gauges and hydrological modeling for~9000 catchments, with daily gauge corrections.
As stated by the authors of [30], MSWEPv1 relies exclusively on reanalysis and gauge data during the pre-TRMM era (before 1998). For MSWEPv2, the reanalysis and gauge data were supplemented with rainfall estimates based on cloud-top IR temperatures during the pre-TRMM era to improve the accuracy in convection-dominated regions. The authors of [17] identified the following several unique aspects of the product: (i) Full global coverage, including all land and oceans (most satellite-based datasets are limited to 50 • or 60 • latitude); (ii) high spatial (0.1 • ) and temporal (three-hour) resolution, increasing the local relevance of the precipitation estimates; (iii) optimal merging of a wide range of gauge, satellite, and reanalysis precipitation datasets to obtain the best possible P estimates at any location; (iv) correction for distributional biases to eliminate spurious drizzle and restore attenuated peaks; (v) correction of systematic terrestrial P biases due to gauge undercatch using observed river discharge from 13,762 catchments worldwide; (vi) corrections using daily observations from 76,747 gauges across the globe; and (vii) a gauge correction scheme that accounts for gauge reporting times.
(2) Climate Hazards Group InfraRed Precipitation version 2 (CHRIPSv2) The Climate Hazards Group Infrared Precipitation (CHIRP) and CHIRP, combined with station observations (CHIRPS), are recently produced satellite-based rainfall products with relatively high spatial and temporal resolutions and quasi-global coverage [18]. As described by the authors of [16], CHIRPS represents 35+ years of quasi-global geostationary thermal infrared (TIR) satellite observations. It spans 50 • S-50 • N (and all longitudes). CHIRPS incorporates in-house climatology, Climate Hazards Center's Precipitation Climatology (CHPclim), 0.05 • resolution satellite imagery, and in situ station data to create gridded rainfall time series for trend analysis and seasonal drought monitoring. The satellite rainfall estimates were produced by the Climate Hazards Center of the Department of Geography of the University of California Santa Barbara in collaboration with the United States Geological Survey (USGS) EROS Center. The estimates focused on combining models of terrain-induced precipitation enhancement with interpolated station data. Recently, new observations such as gridded satellite-based precipitation estimates from NASA and NOAA have been leveraged to build high-resolution (0.05 • ) gridded precipitation climatologies.
The retrieval algorithm of CHIRPS (i) is built around a 0.05 • climatology that incorporates satellite information to represent sparsely gauged locations; (ii) incorporates pentad, and monthly data from 1981 to present a spatial resolution with a grid size of 0.05 • . Cold Cloud Duration (CCD)-based precipitation estimates (i) blend station data to produce a preliminary information product with a latency of about two days and a final product with an average latency of about three weeks, and (ii) use a novel blending procedure incorporating the spatial correlation structure of CCD-estimates to assign interpolation weights. Version 2.0 of CHIRPS was completed and made available to the public as of 12 February 2015 [16].
A summary of the descriptions of the two rainfall products is presented in Table 1. In the current study, the monthly rainfall products of MSWEP v2 and CHIRPS v2 with a spatial resolution of 0.1 • (~11 km) grid size and monthly time scale for the period 1981-2018 were examined.

Observed Dataset
The observed dataset was used as a reference to validate the performance of satellite rainfall estimates. The spatial distribution of the gauge stations that were used in the current study is presented ( Figure 2). As stated by the authors of [15], the Ethiopian National Meteorological Agency (NMA) performs routine quality checks for the data in their archives. The NMA has recently updated its station coordinates using GPS. However, we found a mismatch among a few reported GPS readings. Additional quality checks such as identifying suspicious rainfall values were completed. The most challenging and time-consuming task was ensuring the accuracy of station coordinates.

Observed Dataset
The observed dataset was used as a reference to validate the performance of satellite rainfall estimates. The spatial distribution of the gauge stations that were used in the current study is presented ( Figure 2). As stated by the authors of [15], the Ethiopian National Meteorological Agency (NMA) performs routine quality checks for the data in their archives. The NMA has recently updated its station coordinates using GPS. However, we found a mismatch among a few reported GPS readings. Additional quality checks such as identifying suspicious rainfall values were completed. The most challenging and time-consuming task was ensuring the accuracy of station coordinates.

Performance Evaluation of Satellite Rainfall Estimates
The gauge stations selected as a reference for the performance evaluation were assumed to represent the traditional agroclimatic zones of the UBN ( Figure 2). The three major zones of the UBN are as follows: Lowland (warm semiarid) ranges from 500 m a.s.l. to 1500 m a.s.l. with an average temperature of 27.5 °C, midland (cool subhumid) ranges from 1500 m a.s.l. to 2300 m a.s.l. with an average temperature of 20 °C, and highland (cool and humid) ranges from 2300 m a.s.l. to 3200 m a.s.l. with an average temperature of 17.5 °C. The same zonation was used to select grid cells of

Performance Evaluation of Satellite Rainfall Estimates
The gauge stations selected as a reference for the performance evaluation were assumed to represent the traditional agroclimatic zones of the UBN ( Figure 2). The three major zones of the UBN are as follows: Lowland (warm semiarid) ranges from 500 m a.s.l. to 1500 m a.s.l. with an average temperature of 27.5 • C, midland (cool subhumid) ranges from 1500 m a.s.l. to 2300 m a.s.l. with an average temperature of 20 • C, and highland (cool and humid) ranges from 2300 m a.s.l. to 3200 m a.s.l. with an average temperature of 17.5 • C. The same zonation was used to select grid cells of satellite estimates to run the performance evaluation. The summer season was the focus of the study due to the fact that the season has the majority of rainy time for cereal crop production in the study basin. The Pearson correlation coefficient (r), bias, probability of bias (PBias), mean error (ME), and root mean square error (RMSE) were the pairwise comparison statistics techniques that were employed in the process of evaluating the performance of satellite rainfall estimates of CHIRPSv2 and MSWEPv2. The description of the comparison statistics employed in the current study is presented in Equations (1)-(5). The same techniques were employed as in previous scientific papers [8,18,19,31].
r is used to measure the goodness of fit and linear association between two variables. It measures the linear association and how well the satellite rainfall product corresponds to the observed rainfall (Equation (1)). Its value ranges between −1 to 1, where 1 indicates a perfect score.
where r is the correlation coefficient, O is the gauge rainfall measurement, O is the average gauge rainfall measurement, S is the satellite rainfall estimate, S is the average satellite rainfall estimate, and n is the number of data pairs. Bias reflects how well the mean of the satellite rainfall corresponds with the mean of the observed rainfall (Equation (2)). A bias value closer to 1 indicates that the cumulative satellite rainfall estimate is closer to the cumulative observed rainfall. A bias value of 1 is a perfect score. The bias ratio is an error metric measuring the systematic error component with a perfect score of 1, whereas a value less or greater than 1 shows underestimation and overestimation, respectively. The probability of bias (PBias) is defined in Equation (3).
where S is the mean of the satellite estimate, and O is mean of the observed rainfall.
where PBias is the probability of bias, S is mean of the satellite estimates, and O is mean of the observed rainfall. ME (mm) estimates the average error (Equation (4)). A positive value indicates an overestimate of the satellite rainfall, whereas a negative value indicates an underestimate as compared to the observed rainfall. An ME value of zero is a perfect score.
where ME is the mean error in mm, O is the gauge rainfall measurement, and S is the satellite rainfall estimate. RMSE (mm) was used to measure the average magnitude of the estimated errors between the satellite rainfall and the observed rainfall (Equation (5)). A lower RMSE value means greater central tendencies and small extreme error. An RMSE value of zero is a perfect score.
Geosciences 2020, 10, 352 where RMSE is the root mean square error in mm, O is the gauge rainfall measurement, and S is the satellite rainfall estimate. As a measure of goodness-of-fit, coefficient of model accuracy (CMA) and alternative coefficient of model accuracy (ACMA) comprise the product of nonparametric sample correlation and model bias [32]. Accordingly, newly introduced goodness-of-fit metrics were also employed in the current study (Equations (6a)-(6b)). The value of CMA and ACMA varied over the range 0-1. CMA or ACMA equal to one indicates a perfect model (no errors). However, CMA equal to zero indicates that the model is not better than the comparison baseline (such as the mean of observed data).
where CMA is the coefficient of model accuracy, f is the normalized trend slope, β is beta, θ is the theatre. Beta is a measure of covariation of the two series being compared. Theatre is the measure of the differences between the squared means of observed and modeled series. Lamda is the measure of the differences between the squared standard deviations of observed and modeled series.

Meteorological Drought Index
The Standardized Precipitation Index (SPI) was first presented to the international scientific community by the authors of [33]. This index is one of the most widely used meteorological drought indicators across the world [34][35][36][37]. According to the report by the authors of [38], SPI is widely used for monitoring Eastern African droughts because it is more easily adapted to the local climate, has modest data requirements, can be computed for at almost any time scale, has no theoretical upper or lower bounds, and is easy to interpret. By identifying SPI as an index for broad use, the World Meteorological Organization (WMO) provided direction for countries aiming to establish a level of early warning for drought. In terms of its effectiveness, the WMO recommended SPI for member countries as it is the best drought index [36]. For meteorological drought, the SPI is a highly valuable estimator of drought severity [39]. The authors of [38] recommended SPI for Eastern African countries. The authors of [40] also suggested SPI for drought assessment.
The SPI index is a probability (statistical) index that provides a representation of abnormal wetness and dryness on a given timescale from the long-term average precipitation. This index can be calculated on as little as 20 years of data, but ideally, the time series should have a minimum of 30 years of data, even when missing data are considered. The index can track drought on multiple timescales, including 1, 3, 6, 9, and 12 months. The SPI methodology does not change whether using daily, weekly, or monthly data [33]. In the case of the current study, the timescale was limited to one month because the study was conducted in a basin where the single-season cereal crop production is the mainstay of the economy. These crops are grown during the 3-4 month rainy season [28], and during this period, it is recognized that drought on the order of 4 weeks duration at any stage of the crops' growth can have an adverse effect on crop yield.
The SPI gives the number of standard deviations by which the observed value would deviate from the long-term mean for a normally distributed random variable [33,41]. The SPI calculation involves the selection of a distribution model that fits best with the season to be studied. The commonly used statistical probability distributions are gamma, normal, and log-Pearson [42]. Gamma distribution is often used when calculating the SPI [33,41,43,44] because it has a lower bound at zero, which agrees with the observed distribution. The gamma distribution fitted the rainfall record well in the majority of the stations in Ethiopia [45]. Accordingly, the gamma distribution was used in the current study (Equations (7a)-(7c)). Details on how to run the gamma distribution are presented by the authors of [41,45]. Overall, this standardization provides the advantage of consistent values in space and time for the frequency of extreme dry and wet events. As stated by the authors of [41], the gamma distribution is defined by its frequency or probability density function: where a and β are the shape and scale parameters, respectively; x is the precipitation amount; and Γ(a) is the gamma function. Parameters α and β of the gamma probability density function (pdf) were estimated for each station and for each time scale of interest (1, 3, 6, 9, 12 months, etc.). The maximum likelihood estimations of α and β are: where n is the number of observations and ln is the natural logarithm function. The resulting parameters were then used to find the cumulative probability of an observed precipitation event for the given month and time scale for the location in question. Since the gamma function is undefined for x = 0 and a precipitation distribution may contain zeros, the cumulative probability (the value of SPI) becomes: where q is the probability of zero precipitation and G(x) is the cumulative probability of the incomplete gamma function. If m is the number of zeros in a precipitation time series, then q can be estimated by m/n. The cumulative probability H(x) is then transformed to the standard normal random variable z with mean zero and variance of one, which is the value of the SPI. For this study, to assess the drought over the UBN, sample grid data were extracted from relatively accurate rainfall estimates. At this point, it would be possible to employ both rainfall estimates for meteorological drought assessment. The sample grids were selected based on their geographic location. Whether highland, midland, or lowland, geographic location was considered. The number of points considered at each elevation were six, seven, and seven in the highland, midland, and lowland respectively. The value of a point/grid refers to mean value of 3 × 3 pixels. The SPI index was computed employing (Equations (7a)-(7c)) with the help of Drought Indices Calculator version 2 (DrinCv2). DrinCv2 is a software package developed for computing drought indices including SPI. Its application in several locations, especially in arid and semi-arid regions, shows that it is a useful research tool [41]. In the current study, the SPI of all elevation zones was analyzed separately following all the procedures required by the software.
As stated by the authors of [33], based on SPI values, meteorological droughts are classified into levels of magnitude: extreme, severe, moderate, near normal/mild, and no drought ( Table 2). The ratio of the number of severe/extreme meteorological drought months over the 38-year period to the total number of given rainy months over the period was calculated to obtain the value of the frequency of severe/extreme meteorological drought (Equation (8)). The frequency (%) of drought was analyzed by month over the highland, midland, and lowland elevation zones of the UBN basin in the period of 1981-2018: where Pi is a grid within the region, n is the number of drought months over a given years, and N is the length of the data series or the number of rainy months over a given year. The performance of the SPI (rainfall deficit) was evaluated with respect to documented historic drought events. The information on historic droughts considered in this study was reviewed from previous published articles [7] and the Emergency Events Database (EM-DAT) International Disaster Database, Centre for Research on the Epidemiology of Disasters (CRED) [46].
Mapping SPI Values: Assuming that meteorological droughts are regional in nature and commonly cover large areas, the properties of regional meteorological droughts can be studied by analyzing the spatial pattern of at-site (point) droughts [47,48]. Considering the presence of climate change and variability, mapping the most recent spatial distribution of drought was assumed to be more useful for agricultural development practices. In this regard, we considered sample grid (point) values of SPI extracted in the elevation zone of the basin to index and map the magnitude of the meteorological drought that occurred from 2014 to 2017. The value of a point/grid refers to mean value of 3 × 3 pixels. In addition to elevation, the sample grids were extracted to capture the possible effect of direction on the spatial distribution of meteorological drought over the basin.
SPI is all about measuring the abnormal dryness/wetness of a given specific place at a specific time. Please note that to detect the correct mean SPI (drought months), it was essential to exclude values with nondrought seasons/months. Thus, if we aggregated the value of SPI for the whole study period, it would not possible to show months/seasons that could face extreme/severe meteorological drought. In relative terms, various corners of the basin experienced severe drought in 2014-2017. At the same time, this period could show us the most recent drought phenomenon over the basin.

Performance Evaluation of the Satellite Rainfall Estimates
The mean values of the bias, r, PBias (%), ME (mm), and RMSE (mm) for CHIRPSv2 over the all elevation zones of the UBN were found to be 1. Relatively, compared to other parameters, the bias values of both products in the highland, midland, and lowland elevation zones were found to be close. Similarly, closer RMSE values were observed in the highland part of the basin for both products. In relative terms, mean values of under-estimation were observed in the highland elevation zone for CHIRPSv2. The performance evaluation of the satellite rainfall estimates for each parameter is presented below.
Correlation coefficient (r): The mean value of r across the elevation zones of the UBN were 0.90-0.92 and 0.86-0.88 for CHIRPSv2 and MSWEPv2, respectively. The mean r values of CHIRPSv2 and MSWEPv2 were found to be 0.91 and 0.87, respectively. The value of r did not show a meaningful difference across the highland, midland, and lowland parts of the UBN for CHIRPSv2 and MSWEP v2 products. The mean value of r was nearly equivalent to the perfect score in both CHIRPSv2 and MSWEPv2, that is, the presence of a linear association between the measured and estimated rainfall datasets was observed. However, in relative terms, CHIRPSv2 showed a better performance in the case of UBN. As presented in the scatterplot (Figure 3), the value of R-Squared (R 2 ) was higher in the case of CHIRPSv2. Similarly, the value of CMA and ACMA became 0.6 and 0.71, respectively, for CHIRPS, whereas the values were 0.63 and 0.58 for MSWEP. Accordingly, CHIRPS could be taken as a relatively perfect model. Bias: The mean bias values of CHIRPSv2 and MSWEPv2 for the different elevation zones of the UBN were 0.97-1.13 and 1.15-1.20, respectively. The mean bias values of CHIRPSv2 and MSWEPv2 were found to be 1.07 and 1.19, respectively, where a value less than one shows underestimation, a value greater than one shows overestimation, and the perfect value is one. The bias value indicated the presence of almost the same level of error across the highland, midland, and lowland parts of the UBN for both rainfall estimates. The mean values of PBias (%) of the two products were 6.75 and 18.56, respectively. The mean value of bias was near to the perfect score in both CHIRPSv2 and MSWEPv2, that is, the mean of the satellite rainfall corresponded with the mean of the observed rainfall. In other words, the cumulative satellite rainfall estimate was found to be close to the cumulative observed rainfall. However, in relative terms, CHIRPSv2 performed better for the UBN. Similarly, the authors of [18] found that the CHIRPSv2 product showed low or no bias. ME (mm): The mean values of ME ranged from −3.74 to 13.99 and −14.72 to 25.52 across the elevation zones for CHIRPSv2 and MSWEPv2, respectively. The result showed the presence of overand underestimation for both products. The average ME values of CHIRPSv2 and MSWEPv2 were found to be 7.74 and 19.54, respectively. The ME value showed the presence of almost the same level of error across the highland, midland, and lowland parts of the basin for both rainfall estimates. However, in contrast to MSWEPv2, a mean value of ME and PBias indicating underestimation was observed in the highland part of the basin for CHIRPSv2. Both products showed overestimation according to the average ME values. However, in relative terms, CHIRPSv2 showed a better performance in the case of the UBN.
RMSE indicates the random error component. The mean values were 97.20-145.98 and 79.61-198.14 across the elevation zones for CHIRPSv2 and MSWEPv2, respectively. The mean RMSE values of CHIRPSv2 and MSWEPv2 were found to be 122.34 and 130.26, respectively. The mean RMSE value revealed the presence of almost the same level of error across the highland, midland, and lowland parts of the basin for both rainfall products. The result shows the presence of a lower-level magnitude of the estimated errors between the satellite rainfall and the observed rainfall, and greater central tendencies and small extreme error for CHIRPSv2 in the case of the UBN. Overall, CHIRPSv2 was found to be superior to MSWEPv2 with respect to essential parameters of performance assessment. Above all, the ME and RMSE between the satellite rainfall and the observed rainfall were smaller in the case of CHIRPSv2. In accordance with the result of the current study, the existence of a good agreement between the weather stations and satellite estimates over the UBN was reported by the authors of [8,19] for CHIRPSv2 and MSWEPv2 products, respectively. Similar to the results found by the authors of [8,18], the existence of a good agreement between the weather stations and satellite estimates over Eastern Africa was found. As a result, the current study used CHIRPSv2 to analyze the magnitude of monthly meteorological drought in the last 38 07 and 1.19, respectively, where a value less than one shows underestimation, a value greater than one shows overestimation, and the perfect value is one. The bias value indicated the presence of almost the same level of error across the highland, midland, and lowland parts of the UBN for both rainfall estimates. The mean values of PBias (%) of the two products were 6.75 and 18.56, respectively. The mean value of bias was near to the perfect score in both CHIRPSv2 and MSWEPv2, that is, the mean of the satellite rainfall corresponded with the mean of the observed rainfall. In other words, the cumulative satellite rainfall estimate was found to be close to the cumulative observed rainfall. However, in relative terms, CHIRPSv2 performed better for the UBN. Similarly, the authors of [18] found that the CHIRPSv2 product showed low or no bias. ME (mm): The mean values of ME ranged from −3.74 to 13.99 and −14.72 to 25.52 across the elevation zones for CHIRPSv2 and MSWEPv2, respectively. The result showed the presence of overand underestimation for both products. The average ME values of CHIRPSv2 and MSWEPv2 were found to be 7.74 and 19.54, respectively. The ME value showed the presence of almost the same level of error across the highland, midland, and lowland parts of the basin for both rainfall estimates. However, in contrast to MSWEPv2, a mean value of ME and PBias indicating underestimation was observed in the highland part of the basin for CHIRPSv2. Both products showed overestimation according to the average ME values. However, in relative terms, CHIRPSv2 showed a better performance in the case of the UBN.
RMSE indicates the random error component. The mean values were 97.20-145.98 and 79.61-198.14 across the elevation zones for CHIRPSv2 and MSWEPv2, respectively. The mean RMSE values of CHIRPSv2 and MSWEPv2 were found to be 122.34 and 130.26, respectively. The mean RMSE value revealed the presence of almost the same level of error across the highland, midland, and lowland parts of the basin for both rainfall products. The result shows the presence of a lower-level magnitude of the estimated errors between the satellite rainfall and the observed rainfall, and greater central tendencies and small extreme error for CHIRPSv2 in the case of the UBN. Overall, CHIRPSv2 was found to be superior to MSWEPv2 with respect to essential parameters of performance assessment. Above all, the ME and RMSE between the satellite rainfall and the observed rainfall were smaller in the case of CHIRPSv2. In accordance with the result of the current study, the existence of a good agreement between the weather stations and satellite estimates over the UBN was reported by the authors of [8,19] for CHIRPSv2 and MSWEPv2 products, respectively. Similar to the results found by the authors of [8,18], the existence of a good agreement between the weather stations and satellite estimates over Eastern Africa was found. As a result, the current study used CHIRPSv2 to analyze the magnitude of monthly meteorological drought in the last 38 years over the UBN. Consistent with the current study, the authors of [8] reported that CHIRPS could be used as a source of information in developing the grid-based drought monitoring tools for the basin that could help in developing early warning systems.

Meteorological Drought in the UBN Basin
According to the result of the current study, CHIRPSv2 was found to present a relatively more accurate satellite estimate of monthly rainfall in the case of the UBN basin. Thus, the monthly rainfall estimate of CHIRPSv2 was used to analyze the magnitude (SPI) and frequency of meteorological drought in the last 38 years over the highland, midland, and lowland elevation zones of the basin. At this point, it is necessary to note that employing the other rainfall estimate for drought analysis in the basin may not result in a significant difference. The spatial distribution of grids (CHIRPSv2) extracted from the elevation zones of the basin to analyze the meteorological drought over the UBN is shown (Figure 4). The sample grids were selected based on their geographic location. Whether highland, midland, or lowland, geographic location was considered.

Meteorological Drought in the UBN Basin
According to the result of the current study, CHIRPSv2 was found to present a relatively more accurate satellite estimate of monthly rainfall in the case of the UBN basin. Thus, the monthly rainfall estimate of CHIRPSv2 was used to analyze the magnitude (SPI) and frequency of meteorological drought in the last 38 years over the highland, midland, and lowland elevation zones of the basin. At this point, it is necessary to note that employing the other rainfall estimate for drought analysis in the basin may not result in a significant difference. The spatial distribution of grids (CHIRPSv2) extracted from the elevation zones of the basin to analyze the meteorological drought over the UBN is shown (Figure 4). The sample grids were selected based on their geographic location. Whether highland, midland, or lowland, geographic location was considered.  The magnitude (SPI) of monthly meteorological drought over the entire UBN basin from 1981-2018 ranged from 0 to −3.74. The highest SPI (−3.74) was observed in August 1984 over the midland, in the southern part of the UBN ( Figure 5). In 2014-2017, the highest magnitude of drought was −3.0 in July 2015 over the highland and −3.03 over the lowland in June 2015. The highest SPI value in the last six years was also found to be −3.03 over the midland, recorded in July 2015 over the southern part of the UBN. Over the 38-year period, severe/extreme drought occurred at the onset and/or offset time of rainy season over all elevation zones of the basin. The result implies the presence of varied levels of exposure to meteorological drought in different elevation zones of the UBN. According to the guidelines described by the authors of [33], the observed magnitude of meteorological drought over the elevation zones of the basin could be characterized by extreme, severe, and moderate levels. The prevalence of moderate, severe, and extreme magnitudes of meteorological drought in the basin was also reported by the authors of [46].
In accordance with the result of the current study, Reference [7], and the EM-DAT International Disaster Database, Centre for Research on the Epidemiology of Disasters (CRED), the authors of [46] revealed the occurrence of severe drought in various years and places of Ethiopia, including provinces situated in the UBN in the 38-year-period. For instance, consistent with the report by the authors of [7], 1982, 1984, 1987, 1997, 2002, 2014, 2015, and 2016 could be taken as severe/extreme drought years in the UBN (Figures 4 and 5). However, while most parts of Ethiopia were exposed to severe/extreme drought in 2000, 2008, and 2009 [7], limited corners of the UBN were found to be exposed to moderate drought in the current study. The authors of [14,49,50] also showed the presence of climate variability in the UBN.
To map the monthly magnitude of meteorological drought over the UBN, we had to compute the SPI of the most recent years, 2014-2017, for the selected grids/places situated in the elevation zones of the basin. According to the monthly SPI analysis result done for selected grids and years (2014-2017) employing CHIRPSv2 rainfall estimates, the mean SPI values over the highland, midland and lowland varied both in space and in time. The aggregated mean SPI value from 2014-2017 became −0.74 in the highland, −0.57 in the midland, and −0.62 in the lowland (Table 5 and Figure 6). Both the mean and aggregated mean SPI values over the elevation zones did not show the occurrence of extreme/severe meteorological drought for the UBN for the period 2014-2017. The highest mean SPI values over the last four years, −1.63 in July 2015 and −1.1 in June 2015, were observed in the highland and lowland parts of the basin, respectively. These mean SPI values show the presence of severe and moderate levels of meteorological drought in the corresponding elevation zones. Due to the fact that meteorological drought index can explain more than one specific drought category to a certain degree [46], the presence of rainfall deficiency over the study period in the UBN implies the prevalence of agricultural drought. In 2014-2017, specific places/grids experienced extreme meteorological drought with mean SPI values of −3.0 over the midland and highland in July 2015, and −2.27 over the lowland in June 2015. More importantly, considering the mean value of SPI over 2014-2017, the average suggests that these four years were somewhat dry, indicating that the SPI in this four-year period was slightly lower than the long-term average ( Figure 5). As indicated by the authors of [45], the drought years identified by this SPI analysis for the UBN basin are known for their devastating impact.  (Table 5 and Figure 6). Both the mean and aggregated mean SPI values over the elevation zones did not show the occurrence of extreme/severe meteorological drought for the UBN for the period 2014-2017. The highest mean SPI values over the last four years, −1.63 in July 2015 and −1.1 in June 2015, were observed in the highland and lowland parts of the basin, respectively. These mean SPI values show the presence of severe and moderate levels of meteorological drought in the corresponding elevation zones. Due to the fact that meteorological drought index can explain more than one specific drought category to a certain degree [46], the presence of rainfall deficiency over the study period in the UBN implies the prevalence of agricultural drought. In 2014-2017, specific places/grids experienced extreme meteorological drought with mean SPI values of −3.0 over the midland and highland in July 2015, and −2.27 over the lowland in June 2015. More importantly, considering the mean value of SPI over 2014-2017, the average suggests that these four years were somewhat dry, indicating that the SPI in this four-year period was slightly lower than the long-term average ( Figure 5). As indicated by the authors of [45], the drought years identified by this SPI analysis for the UBN basin are known for their devastating impact.   Regarding the computation of the frequency (%) of severe/extreme meteorological drought by month for the selected grids/places in the UBN basin, the ratio of the observed number of severe/extreme meteorological drought months to the total number of years considered for the analysis was calculated (Table 6). Here, the total number of years considered for drought analysis in the basin was 38 years, 1981-2018. Regarding the monthly analysis result, a varied frequency of severe/extreme drought was observed in different parts of the basin for the 38 years. The frequency (%) of severe/extreme meteorological drought over the selected grids/sites for the months of June, July, and August were 0-13.5, 5.4-16.2, and 2.5-13.5, respectively ( Table 6). The widest frequency range occurred in the month of June, which is the onset time of the rainy season.
Spatial variability in severe/extreme drought frequency was also notable. The highest frequency of severe/extreme meteorological drought months was observed in the northeast and southwest lowland parts of the basin, whereas the lowest frequency was found in the southeastern part of the UBN. Similarly, as reported by the authors of [1], annual rainfall over the basin decreased from the southwest (>2000 mm) to the northeast (around 1000 mm). Considering each site separately, there was a specific place/grid in the midland that faced extreme/severe drought six times (16.2%) in the month of July over the 38 years. There were also specific places and sites that had a zero frequency of severe/extreme drought in the month of June over the highland part of the basin. Regarding the computation of the frequency (%) of severe/extreme meteorological drought by month for the selected grids/places in the UBN basin, the ratio of the observed number of severe/extreme meteorological drought months to the total number of years considered for the analysis was calculated (Table 6). Here, the total number of years considered for drought analysis in the basin was 38 years, 1981-2018. Regarding the monthly analysis result, a varied frequency of severe/extreme drought was observed in different parts of the basin for the 38 years. The frequency (%) of severe/extreme meteorological drought over the selected grids/sites for the months of June, July, and August were 0-13.5, 5.4-16.2, and 2.5-13.5, respectively ( Table 6). The widest frequency range occurred in the month of June, which is the onset time of the rainy season.
Spatial variability in severe/extreme drought frequency was also notable. The highest frequency of severe/extreme meteorological drought months was observed in the northeast and southwest lowland parts of the basin, whereas the lowest frequency was found in the southeastern part of the UBN. Similarly, as reported by the authors of [1], annual rainfall over the basin decreased from the southwest (>2000 mm) to the northeast (around 1000 mm). Considering each site separately, there was a specific place/grid in the midland that faced extreme/severe drought six times (16.2%) in the month of July over the 38 years. There were also specific places and sites that had a zero frequency of severe/extreme drought in the month of June over the highland part of the basin.
The range of extreme drought frequencies seen for different months and different sites shows the highly temporally variable and spatially localized nature of climate variability in this basin. The SPI values were calculated using distributions fit to all monthly data for each elevation zone. Deviations from the mean severe/extreme drought frequency for June thus indicated that this is a particularly variable month relative to July and August. This is consistent with June having a higher percentage of rainfall variability, including high-impact variability at the onset of the rainy season [51]. Spatial variability either indicates that drought-relevant climate variability occurs at a finer scale than the defined elevation zones or simply that, in a 38-year record, site-specific noise can lead to substantial differences in drought frequency. The authors of [45] reported the absence of statistical evidence for a conclusive decreasing or increasing trend in the severity and frequency of meteorological droughts in the basin. As reported by the authors of [52], various empirical and modeling studies proved that climate change is very likely to increase the magnitude, frequency, and duration of droughts over some parts of the world in the coming decades.
estimates over the elevation zones of the basin. The meteorological drought observed in the 38-year period over the UBN was characterized by moderate, severe, and extreme magnitudes. The magnitude of monthly drought over the basin varied spatially and temporally. Exploring the reason for the presence of variation in meteorological drought among the elevation zones could be a potential future research area. The frequency of severe/extreme drought months in the basin also varied spatially. The drought magnitude could be described neither as a decreasing nor as an increasing linear trend. However, the UBN could be characterized as a drought-prone basin. Accordingly, since the livelihood of farmers depends on rainfed agriculture, the farming practices in the basin need to be enhanced with an improved early warning system, surface water use, and drought-resistant seed technologies. Funding: Bahir Dar University covered the cost of conducting the study, whereas the publication fee was provided by Belmont Forum, via the NSF award ICER-1624335.