Hydroclimatic Extremes Evaluation Using GRACE/GRACE ‐ FO and Multidecadal Climatic Variables over the Nile River Basin

: Hydroclimatic extremes such as droughts and floods triggered by human ‐ induced climate change are causing severe damage in the Nile River Basin (NRB). These hydroclimatic extremes are not well studied in a holistic approach in NRB. In this study, the Gravity Recovery and Climate Experiment (GRACE) mission and its Follow on mission (GRACE ‐ FO) derived indices and other standardized hydroclimatic indices are computed for developing monitoring and evaluation methods of flood and drought. We evaluated extreme hydroclimatic conditions by using GRACE/GRACE ‐ FO derived indices such as water storage deficits Index (WSDI); and standardized hydroclimatic indices (i.e., Palmer Drought Severity Index (PDSI) and others). This study showed that during 1950 – 2019, eight major floods and ten droughts events were identified based on standardized ‐ indices and GRACE/GRACE ‐ FO ‐ derived indices. Standardized ‐ indices mostly underestimated the drought and flood severity level compared to GRACE/GRACE ‐ FO derived indices. Among standardized indices PDSI show highest correlation (r 2 = 0.72) with WSDI. GRACE ‐ /GRACE ‐ FO ‐ derived indices can capture all major flood and drought events; hence, it may be an ideal substitute for data ‐ scarce hydro ‐ meteorological sites. Therefore, the proposed framework can serve as a useful tool for flood and drought monitoring and a better understanding of extreme hydroclimatic conditions in NRB and other similar climatic regions.


Introduction
The Nile is the worldʹs longest river with a drainage area of about 3.2 million km 2 , which is nearly 10% of the African continentʹs landmass. It flows across 11 countries from South to North, which crosses highly diverse landscapes and climatic zones, and mostly, the region depends hugely on rain-fed agriculture for its livelihood. Consequently, the agricultural system [1,2], economic development [3,4], food security, and increasing population [5] makes the basin extremely vulnerable. Furthermore, NRB is highly generalized and seldom or relatively infrequent in scientific discussion, doubtless due to its large size, poor hydrological records (i.e., spatial and temporal coverage) supplemented with limited access owing to hydro-political [1] and development [5] interest among NRB countries. Thus, the basin demands a suitable method to determine its spatiotemporal patterns of floods and droughts.
Despite the efforts mentioned above, a comprehensive study of long-term flood and drought over the whole NRB is missing. To our knowledge, this is the first study as a framework that comprises multidecadal flood and drought event identifications over the NRB. It is based on multidecadal standardized hydroclimatic extreme event and GRACE/GRACE-FO derived indices. This frame comprises groundwater, meteorological, and agricultural drought conditions. Thus, this studyʹs nobility or scientific contribution is to use GRACE/GRACE-FO temporal (2002 to 2019) TWS data in line with the multidecadal climatic variables to produce enhanced flood and drought estimates in the NRB. This study has three specific objectives: (1) provide a framework about NRB flood and drought event characteristics based on GRACE/GRACE-FO derived and standardized indices; (2) assess NRB multidecadal hydroclimatic extreme event impacts during 1950-2019 and their categorization; and (3) evaluate the feasibility of GRACE/GRACE-FO products based on drought and flood capturing capacity over NRB.

Study Area
The NRB is the world's longest river basin covering approximately 3,046,334 km 2 , and comprises 11 countries (see Figure 1). By the year 2030, the population is projected to rise to 700 million, and it raises concerns about sustainable/equitable management of the basinʹs resources [49]. Based on the hydroclimatic condition, we divided the NRB into four critical areas of interest: (1) The Blue Nile Region (BNR), (2) Lake Victoria Region (LVR), (3) The Bahr-el-Ghazal Region (BER), and (4) Main Nile Region (MNR). All NRB sub-basins (regions) exhibit more considerable spatial coverage that enables us to detect changes in TWS GRACE Satellite data. In this study, the term region is used to represents sub-basins of NRB interchangeably. Particularly, BNR mostly belongs to the Ethiopian Highlands, including both the Upper and Lower Blue Nile basin that contains Lake Tana ( Figure 1) and contributes about 76% [4] of the total flow of NRB. LVR comprises Lake Albert, Kyoga, Victoria, George, and Semilik sub-basins that contribute 24% [4] flows of NRB and forms the White Nile headwaters. BER is found in Congo-Nile River boundary tributaries and a large area of Sudanese plain with a low slope and supplies the Sudd-wetlands (marshes) that are thought to be where a substantial amount of the White Nileʹs water is lost due to evaporation [20]. MNR is mainly located in the Egyptian desert region and the northern part of Sudan consisting of Lake Nasser and High Aswan Dam and understood that intensive irrigation is used and water lost through evaporation [3].
The NRB's ecosystem is under severe stress [5] shared by more than 400 million people who depend heavily on agriculture. Nearly half of the NRB countries are projected to live below the water scarcity level (i.e., 1000 m 3 /person/year, by 2030) [5]. Furthermore, groundwater shows depletion across four sub-regions [20], particularly MNR noted depletion at a rate of −13.45 km 3 /year (i.e., 3/2006-3/2008) [3]. This depletion may significantly impact people living within the basin, particularly during the drought period, and may increase the already high water stress level.
In NRB satellite altimetry-derived [3] and in situ-measured data [5,34] shows the highest sensitivity of the surface water fluctuations (lake level decreasing) and river flow declining [5] during drought period that is triggered by natural and human-induced climate change.

GRACE/GRACE-FO Data
GRACE and GRACE-FO are twin-satellite missions. The mass change product from these two missions is temporal gravity field models. This study used GRACE/GRACE-FO products to estimate WSDI, CCDI, and GGDI. We evaluated both products based on their ability to reproduce the WSDI, CCDI, and GGDI indices. Both GRACE/GRACE-FO gridded products are acquired from the Center for Space Research (CSR) [50] at the University of Texas, German Research Center for Geoscience (GFZ) [51], and the Jet Propulsion Laboratory (JPL). We filled missed data (i.e., caused by satellite batteries and instrument failure) for all GRACE/GRACE-FO products using the cubic spline interpolation method. In this study, the missing gap between GRACE and GRACE-FO (i.e., July 2017 to May 2018) is not filled as presented in Section 3.3. This study used GFZ SH, JPL mascon, and CSR mascon GRACE/GRACE-FO data.
Each GRACE product is expected to represent the true value plus a certain amount of error. The arithmetic mean ensemble model was model was suggested by Sakumura et al. [52] owing to a much higher level of reliability than a stand-alone model. In this study, the ensemble model is produced by combining the arithmetic mean of three GRACE/GRACE-FO solutions and is used to calculate GRACE derived hydroclimatic extreme indices (see Sections 2.3.3 and 2.3.4)

Rainfall and Temperature Data
Understanding precipitation is crucial for studying regional floods and droughts. We used Tropical Rainfall Measuring Mission (TRMM) 3B43 precipitation V7 data (1998)(1999)(2000)(2001)(2002)(2003)(2004)(2005)(2006)(2007)(2008)(2009)(2010)(2011)(2012)(2013)(2014)(2015)(2016) to compare with GRACE derived indices. TRMM satellite precipitation data is available at https://trmm.gsfc.nasa.gov/ (accessed on 31 December 2020) from the NASA website. We used the TRMM-3B43 (see Table 1) Level 3 satellite-gauge (SG) combination data to evaluate the GRACE-derived TWSA and characterize drought in the NRB. Furthermore, long-term multidecadal rainfall and temperature data acquired from the University of East Anglia gridded Climatic Research Unit (CRU) Time-series (TS) are available at https://climatedataguide.ucar.edu/climate-data (accessed on 31 December 2020). In this study, CRU data are used to understand the multidecadal (i.e., 120 years) pattern of Rainfall and temperature (see Section 3.1), while TRMM is used to compute Standardized Soil Moisture Index (SSI), SPI, and CCDI. We applied the areal mean estimation techniques by averaging all grids within the basin owing to the equal spatial resolution of all grids in the study area for further analysis at the sub-basin (regions) and basin levels.

Soil Moisture Data
We used the soil moisture data from Global Land Data Assimilation System Version 2 and 2.1 (GLDAS-NOAH M2.0 and M2.1), Modern Era Retrospective-analysis for Research and Applications (MERRA-2), and ERA5-Land models, as presented in Table 1 in more detail about spatiotemporal resolution. Data from three different sources were merged using Triple Collocation Analysis (TCA) for reliable estimates of soil moisture [53] changes over the NRB, as shown in Section 2.3.1.

Drought Indices Data
This study used SPEI monthly gridded data at 0.5° spatial resolution retrieved from https://spei.csic.es/map/maps.html (accessed on 31 December 2020) for January 1950 to December 2019 to evaluate GRACE derived indices. Similarly, the self-calibrated monthly Palmer Drought Severity Index (sc-PDSI) had a spatial resolution of 0.5×0.5° and accessed from https://crudata.uea.ac.uk/cru/data/drought/ (accessed on 31 December 2020) from January 1950 to December 2018.

Methods
The methodological flow and data processing techniques used in this study are presented in (Figure 2). The GRACE derived WSDI, CCDI, and GGDI computations based on severity and respective standardized indices are described in Sections 2.3.3-2.3.6. We also decomposed all seasonal data with Seasonal and Trend decomposition using Loess (STL; see Section 2.3.2).

Soil Moisture Changes
Soil moisture data from GLDAS [54], MERRA-2 [55], and ERA5-Land [56] models are merged using the Triple Collocation Analysis (TCA) over the NRB (see Figure 2). TCA reliably estimates soil moisture changes and offers an alternative method for estimating random error variances [57,58] in the absence of ground reference data. TCA analysis was conducted in four steps: (1) scaling each soil moisture data to the main reference set; (2) estimating the error variance via pair-wise multiplication; (3) determine the corresponding weights, and (4) merging datasets. TCA is applied here to merge soil moisture ( ) estimated as presented in Equation (1), where w , w , and w are the relative weights of soil moisture , , and , respectively. TCA has limitations caused by soil moisture datasets seasonality [59]. To overcome this limitation, time series of mean-zero anomalies computed by first subtracting off long-term seasonal soil moisture climatology per the recommendation of Yilmaz et al. [59].
Detailed TCA description and technical approaches are shown in Nigatu et al. [20] and Yilmaz et al. [59]. Soil moisture data computed by the TCA method is used to estimate the Standardized Soil Moisture Index, as presented in Section 2.3.6.

Time Series Decomposition and Trend Analysis
This study decomposed the GRACE-TWSA monthly time series data into three components [60,61] by applying the seasonal trend decomposition using Loess (STL) that exhibited similar results with harmonic analysis [22]. We used the STLplus approach of LOESS (LOcal regrESSion) smoothing that is modified by Hafen [61] after [62] that is adaptable and robust to decompose time series. STLplus provides enhancements (i.e., handling missing values, higher-order loess smoothing with automated parameter choices, and frequency component smoothing)) over the STL method [61].
where the original signal ( ) is represented as the sum of a long-term part ( ), a seasonal cycle ( ), and the remaining sub-seasonal residuals ( ). The longterm component ( ) further could be divided into long-term linear trends and long-term nonlinear (inter-annual) variability. The residuals reflect both sub-seasonal signal and noise [63]; these high-frequency residuals are anticipated to be a combination of both the noise and a real signal that represents sub-seasonal water storage variability that is presented in the GRACE data. Details of the STL decomposition approach are presented in Lu et al. [64]. Time series decomposed data is used to compute GRACE/GRACE-FO indices, as presented in Sections 2.3.3-2.3.5.

GRACE/GRACE-FO Derived Water Storage Deficit (WSD)
WSD is the difference between the GRACE TWSA time series and the monthly mean of TWSA values adopted from Thomas et al. [25], as follows: where , is the GRACE-inferred TWSA time series for the j th month and respective year i; is the long-term mean (i.e., January 2003 to December 2019) of TWSA for the same j th month. The positive value represents surplus water storage, while a negative value implies deficits in land water storage compared to its monthly mean values. Thomas et al. [25] designated drought events as WSDs lasting for three or more consecutive months. We normalized this parameter using the zero mean normalization method into the WSDI to characterize better droughts based on WSD and to compare WSD with other drought indices, as follows: (4) where μ is the mean and σ is the standard deviation of the WSD time series. The WSDI time series represents the average seasonal deviation from the average conditions, and its magnitude indicates the drought intensity. Furthermore, Thomas et al. [25] introduced another method to assess drought severity, which the following formula can describe: (5) where Se is the event severity; t denotes drought number events (i.e., vary from 1 to n number of drought events in the study area); M is the average deficit since the onset of the deficit period, and D is the duration of a drought event. Se combines WSDs with event duration and indicates the TWS deficit (TWSD) of a definite drought. Simultaneously, the WSDI reveals the relative monthly deficits of water storage during the study period. TWSD is a relevant indicator only for a particular drought event confirmed by the WSDI, and the value of the previous month represents the severity of the drought event.

Combined Climatologic Deviation Index (CCDI)
Drought conditions of NRB was characterized using a combination of terrestrial and atmospheric water data to attain a comprehensive understanding of the hydrosphere. The drought conditions were defined by defined by Sinha et al. [26] based on the Z-score of the sum of TWSA outliers and the monthly Precipitation Anomaly (PA): , , where , is the precipitation anomaly (PA) in year i and month j; , represents the amount of precipitation in year i (i = 2003,2004, …,2019) and month j (j = 1,2,3, …,12), is the monthly average precipitation.
, , where , is the PA residual in year i and month j, and is the average PA in month j.
, , where , indicates the TWSA residual in year i and month j, , is the TWSA in year i and month j, and is the monthly average TWSA anomaly in the month j.
, , , where , denotes the combined precipitation and TWSA deviations in year i and month j.
where , indicates the Combined Climatologic Deviation Index (CCDI) in year i and month j, is the monthly average CD, and is the standard deviation of CD. CCDI values less than −0.28 for three consecutive months showed the occurrence of drought events; henceforth, we used Equation (11) to calculate droughty severity.
The indices were consistently classified to confirm a reliable definition and classification of drought levels ( Table 2)   Normal We computed groundwater storage changes ( ∆ ) by using terrestrial water storage anomalies (∆ ) from GRACE/GRACE-FO products and soil moisture change (∆ ) and surface water storage (∆ ) change from GLDAS, as indicated in equation 12 [3,23]. In NRB, ice/snow and canopy water storage contribution are assumed to be negligible [3,60] and not considered in groundwater storage computation.
To compute ∆ from ∆ and ∆ , surface runoff is used as a proxy for ∆ [20,60] from GLDAS to isolate the GWS component from GRACE. Detail of soil moisture change estimation approach is presented in 2.3.1.
In this study, we adopted the GRACE groundwater drought index (GGDI) from Wang et al. [23] to investigate the characteristics of groundwater drought based on a monthly climatology ( ) calculated as follows (Equation (13)) to capture seasonality in the groundwater records.
Then, computed monthly climatology ( ) is subtracted from the basin mean groundwater storage anomaly to obtain a groundwater storage deviation (GSD based on Thomas et al. [65] recommendation, indicating groundwater storageʹs net deviation. Lastly, the GSD normalized by removing the mean ( and dividing by the standard deviation ( : where , is the groundwater drought index used to reflect the drought situation derived based on normalized net deviation in groundwater storage volumes.

Multivariate Standardized Drought Index (MSDI)
The precipitation and soil moisture are integrated to provide a composite agricultural and meteorological drought conditions computed based on MSDI [66]. The MSDI computed as , where is the standard function of normal distribution that uses joint probability of precipitation and soil moisture to compute either a parametric (MSDI ) [67] or an empirical(MSDI ) [16] method. Finally, MSDI (Equation (15)) is computed based on the accumulated precipitation (AP) and soil moisture (AS) from long-term records: The n-ensemble member of predicted MSDI (i.e.,MSDI , MSDI , MSDI ) obtained from the observed soil moisture and precipitation in historical records n-years and nmonths [66]. Detailed procedures and explanations about both MSDI and MSDI are presented in Hao and AghaKouchak [16,67].
Similarly, the Standardized Soil Moisture Index (SSI) [67] highlights soil moistureʹs nature that mainly leads to agricultural drought. The SSI is a drought index computed from a lognormal distribution function using the Standardized Drought Analysis Toolbox (SDAT). SDAT is a nonparametric framework that can be applied to different hydrological variables (i.e., soil moisture, precipitation, and relative humidity [68]. It designates droughts at diverse temporal scales that are SSI-3 (3-month SSI), SSI-6(6-month SSI), and SSI-12 (12-month SSI) in the same way as the SPI and SPEI.
Four wetness and drought levels [6] indicate different drought magnitude categories ranging from mild to regions with extreme wetness or drought. The severity level marked ʺNoʺ category designates those areas experiencing no wetness/ drought condition [69]. In this study, the wetness (W3-W4) category indicates flood conditions due to flood situations mostly preceded by high soil moisture content, peaking floodplains inundation [70].

Hydroclimatic Extreme Analysis Based on Standardized Indices
The framework proposed in this study allows combining multiple data sets for joint (multivariate) evaluation of drought and flood based on multiple input variables for a more extended period. The SDAT technique adopts different drought-related parameters to calculate the MSDI. The MSDI nonparametric and parametric 12-month outputs of the SDAT that is derived from soil moisture and rainfall data. SDAT is a flexible tool used to estimate 3-, 6-, 12-, 24-, and 48-month runs. The MSDI combines information and provides composite information on SPI (meteorological drought) and SSI (agricultural drought).
The moving average [71,72] method is mainly used with time-series data to exclude the short-term fluctuations and focus on longer trends. We used a 5-year moving average window (Figure 3) that shows a steady rise up to 1940 and declined up to 1963 for both temperature and rainfall. After 1965, temperature shows a strictly increasing pattern while rainfall shows a slight decreasing pattern (i.e., rainfall show dips in 1983-1984, then increased slightly) with fluctuations due to inter-annual variability. Thus, strictly increasing temperature and slight decreasing rainfall pattern likely caused droughts as indicated in Section 3. Furthermore, it may worsen future droughts of NRB owing higher rate of evapotranspiration that will result from NRB warming associated with global warming.  From 1983 to 2008, NRB noted a weak trend toward increasing soil moisture ( Figure  5c). Since 2008, the soil moisture shows a strictly decreasing trend, while precipitation shows an increasing trend. The decreasing soil moisture trend highlights the potential for increasing evapotranspiration (ET) and thereby indirectly escalating the regional hydrologic cycle. The impact of soil moisture on climate warming may not be monotonic [73]. Relatively, it is likely that in some regions, soil moisture might first increase in response to cumulative precipitation but then decrease because ET may rise quicker than precipitation as temperature increases. The SSI and MSD disagree with SPI drought conditions after 2011; SPI shows more wet conditions while others show more drought conditions. Accordingly, precipitation increases are assumed to have been more than compensated for increased losses due to increases in ET, as indicated in terms of SPEI (Figure 5a). This ET increase is due to an increasing temperature pattern (Figure 3) as expected to decrease soil moisture. Figure 5 shows the SPI, MSDIp, MSDIe, and SPEI time series over NRB and subbasins. All indices show agreement from1950 to 1998, and then after that, they differ in both magnitude and pattern direction until the study period ends. The SPI shows a decreasing droughts pattern due to the increased precipitation, while the MSDIp, MSDIe, and SPEI were increasing due to the temperature effect. This discrepancy highlights that the SPI shows a decrease in drought severity. In contrast, the SPEI increases droughts severity attributed to a strict and higher increasing temperature pattern than fluctuating precipitation (Figure 3).

Evaluation of TWS Deficits and Surplus
In drought and wetness characterization, the terrestrial water storage deficit and the surplus are crucial due to groundwater, surface water, and soil moisture water storage of GRACE TWS integration. GRACE/GRACE-FO captured hydroclimatic extreme events accurately (Figure 6a) (Figure 6c,d).
The WSD (Table 2) identified individual drought events observed on the cumulative WSD profile displaying a marked reduction during drought events. Even though 2012 to 2019 showed an overall advance in the cumulative WSD, this period similarly encompassed flood and drought events leveled (event numbers 2-4 and 9-11). Generally, most extreme event epochs reported in this study (Table 2: WSDI and GGDI) corresponded with the results described by the drought records from respective NRB countriesʹ government-issued bulletins of flood and drought data (see Section 3.6). Therefore, quantitative WSD and GGDI are more useful for characterizing flood and drought events. In contrast, CCD is not suitable for the NRB case due to overestimation ( Table 2) and low correlation (Table 3). Furthermore, CCDI shows extreme event occurrence time mismatch with other indices (Figure 6a,b) and historical records (see Section 3.6) of extreme events that may be associated with a time lag effect of precipitation incorporated during estimation.

Evaluation of Groundwater Drought and Surplus
Hydroclimatic extreme event evaluation using GRACE/GRACE-FO and GLDAS data is ultra-efficient compared to traditional drought and flood monitoring indices. Figure 6d illustrates the temporal pattern of GGDI characteristics in the NRB. The GGDI showed a downward pattern with different change characteristics in NRB and each subzone, demonstrating that the GGDI-identified drought was increasing in the NRB during 2003-2019. GGDI time series demonstrates that droughts have become more frequent in recent five years, with the most severe drought episode during June/2017 and August/2018. Over the entire NRB, the extreme drought (i.e., the lowest GGDI) stretched -2.89 during the 9/2016-05/2017 drought event. Extreme groundwater drought occurred in November 2017 with a minimum GGDI value of -2.9. In the meantime, in the later and previous periods of this drought event, the average GGDI reached -0.82 to 0.31, then after GGDI became more positive in 2019.
As shown in Figure 6d (2004)(2005)(2006) can be about half attributed to a drought in the Lake Victoria Basin and about half to an enhanced outflow [74,75], underlining the sensitivity of the LVR to human-induced activity (i.e., controlled by dam operations). Similarly, over BER, drought frequency and intensity are increasing that might be associated with Sudd-wetland (i.e., a place where White Nile's waters are lost). The wetland area is diminishing [76] due to groundwater depletion.
The most severe drought in BNR, LVR, BER, and MNR occurred in February 2003, February 2006, and August 2018, with GGDI values ranging from -2.98 to -1.09. Thus, these results indicate that GGDI is efficient in identifying drought events as it reflects direct evidence of a deficit in groundwater storage. NRB is sensitive to climate variability and human-induced drought event [77]. However, in LVR, surface water storage is abundant; it is sensitive to the drought associated with climate change and humaninduced effect. Furthermore, human water consumption intensifies hydrological drought, groundwater, and surface-water interactions [74,78].

GRACE Derived Drought Indices Feasibility for NRB Drought Identification
We evaluated GRACE-derived CCDI, GGDI, and WSDI feasibility by comparing with other standardized hydroclimatic extreme event indicator indices (Table 3; Figure 6). The correlation coefficient (R 2 ) between the GRACEs derived GGDI, WSDI, and CCDI with the other indices presented in Table 3 at a significant probability level (p ≤ 0.05). The processes used to calculate each hydroclimatic extreme event (i.e., drought and flood) index affect their relationships [26]. For example, the close associations of SPEI and SPI with the amount of evapotranspiration and precipitation can clarify the high spatiotemporal fluxes of these indices along with the close relationship between the SPI and CCDI (and the nonexistence of a clear association with the SPEI). The strong correlation between GGDI and SSI and weak correlation between the SSI and CCDI indicates that the higher dependence of groundwater on soil moisture showing consistency with arid regionsʹ hydrological processes. Similarly, the SPEI and GGDI show a high correlation highlighting the important effect of precipitation and evapotranspiration on soil moisture and groundwater extreme events.
Similarly, Figures 5 and 6a illustrated the comparison between WSDI and commonly used indices (PDSI, SPI, SPEI, SSI, and MSDI) over the NRB from 2003 to 2019 (Table 4). The characteristics detected for the WSDI and its response to the hydroclimatic indices agreed well. It showed a higher correlation with PDSI, SPI, and SSI and MSPI than SPEI, with similar peaks and troughs. Nevertheless, among indices, specific differences are observed owing to differences in formulation methodologies and variables. For instance, as noted in Figure 6a,b and Table 4, the GRACE/GRACE-FO based indices (i.e., WSDI, CCDI) are higher than standardized drought indices during the 12/2019 wet event and 12/2005 drought event. Generally, in agreement with them in other years that possibly associated with human-induced effect [74,79] as explained in Section 3.3. In 2005/2006, 2007, and 2010/2011, all of the standardized drought and wetness indices exhibited enormous troughs when the WSDs were the most substantial, which might be associated with the anthropogenic effect. Some studies [80,81] show that 2010-2011 drought events in NRB countries are likely due to anthropogenic influences. These human-induced droughts could emerge due to urbanization, deforestation, reservoir construction [82], water withdrawals for domestic use, irrigation, manufacturing, and mining [83].  "Event #" denotes event number, and "Range" is based on Table 2. Table 3 presented the correlations among the hydroclimatic extreme event indices that indicate a significant (p-value <0.05) correlation among GRACE derived and standardized indices. The correlation between PDSI and WSDI (r = 0.71) is higher than other standardized indices, which is possibly due to a series of water balance parameters included in PDSI computations. Furthermore, a strong correlation was exhibited between PDSI and the other drought indices (Table 3), showing PDSI is a comprehensive drought index. PDSI comprises water supply and demands deduced from meteorological and hydrological parameters [84].
The lower correlation between SPEI and WSDI (0.51) than that between SPI and WSDI (0.67) highlights that precipitation is more responsible for land WSD than the change between evapotranspiration and precipitation in the NRB during 2003-2018. As indicated in Figure 5a, after 2000/01, the SPEI drought shows an extraordinary increase in intensity and frequency due to increasing temperature over NRB; meanwhile, the only difference between SPEI and SPI is evapotranspiration [13]. The SPI is primarily a meteorological drought index [11] based on long-term climatic records and fitted to a probability distribution interpreted at various time scales in both short-term and longterm applications. For example, soil moisture conditions react to precipitation anomalies on a moderately short scale, and that is why SPI shows a high correlation (0.81-0.97) with SSI, MSDIe, and MSDIp.
The correlation coefficients of SSI and MSDI with WSDI (0.66 and 0.69) are higher than SSI and MSDI with the SPEI index, which evinces that flood and droughts events are more dependent on land moisture characteristics because of the NRBʹs large arid area. Owing to different principles and computation algorithms [9,13,23] of various drought indices, differences in behavior among the indices are expected.

Analysis Hydroclimatic Extreme Event Severity Levels
The overall drought and flood severity level of NRB from 01/2003 to 12/2019(i.e., within the 204 months) presented in Figure 6 and Table 4. The higher GGDIs, CCDIs, and WSDIs with longer durations represent the more severe event. Among the nine drought events detected across the NRB during the study period, the drought events 01/2004-06/2005 and 1/2009-06/2011 were the most widespread deficit periods noticeably, lasting 18 and 24 months with the highest total hydrological severities. Moreover, several other slight droughts are temporary episodes of dryness during the study period, namely the drought events 1/2012-06/2011 and 09/2016-01/2017, characterized by more than 4-month prolonged deficits only slight to moderate severities.
Commonly, drought and flood intensity is characterized by using drought indices divided into different drought severity levels, as displayed in Table 4, based on the criteria defined in Table 2. In some flood and drought years (event # 2, 5, 7, 9, and 11), different drought indices estimated different drought levels for the same drought events. Specifically, flood and drought severity levels in events #4 and #6 are classed as severe (W4 and D3) according to WSDI, whereas PDSI, SPEI, and other indices are estimated as mild drought and flood. The drought severity levels of the two drought events for the rest of the indices (i.e., MSDI, SSI, and SPI) were also different. SPI shows no drought condition in event #10, while the rest of the indices are classified as mild to moderate (D1-D2) drought.
Similarly, in the event, #11 and11, SPI shows no drought while the rest indices show mild to extreme (D1-D4) drought conditions. Overall, nevertheless, it is clear that the flood and drought severity levels of the eleven events identified by these indices conveyed apparent inconsistencies. The inconsistencies and differences detected among indices are possibly attributed to the principal differences in time scales employed, the type of the method, and data used in the computation of the various indices.
The spatio-temporal variations of drought-affected areas (Figures 4-6) characterize the regions suffering from droughts of different severities and drought development processes. For example, in the 1984-1985 drought years, the severe drought conditions were recorded in all seasons across the NRB, except no drought conditions in the southwestern part of LVR (i.e., in winter) and northeast part of NBR during autumn. The four severity levels ( Figure 4) are denoted by different degrees: red for drought and green for wetness.

Hydroclimatic Extremes Impact on Livelihood
In NRB countries, the hydroclimatic extreme event affected peopleʹs lives (Table 5), mainly due to frequent drought and flood events [85,86]. In Sudan and Ethiopia ( Figure  4), during 1985-1986, severe drought caused around 450,000 people death [87,88]. Around 14 million people were affected by famine during 2002-2003 (i.e., in Ethiopia only), and over 13 million people were affected in NRB countries during the 2008-2010 droughts years. The most extreme drought event (2010-2011) led to severe food emergencies and starvation, heartrending around 12 million people over NRB upstream countries [66], and then 2016-2017 is categorized in a moderate drought year. Additionally, flooding instantaneously tailed the drought (2007,2013,2017, and 2019) the abrupt conjunction of these events impaired the losses ( Figure 6).  In NRB, extreme flood events recorded (Table 5 and Figure 5) in 1978 /9, 1988, 1998, 2007, 2008, 2012/13, 2015/16, and 2019 [87]. The BER (South Sudan and Sudan), MNR (Egypt), BNR (western part of Ethiopia), LVR (equatorial lakes) regions are prone to floods due to highly variable river flows and extensive river floodplains area. It causes overwhelming effects on the property (i.e., infrastructure destruction) and lives (i.e., killing, poverty, and food insecurity). For example, the annual average damage from flooding is over US$25 million [89] in the riparian of the BNR and the MNR. The floods spreading in BNR (western part of Ethiopia) and BER (Sudan) displaced 242,000 people and resulted in 700 deaths in 2006/2007. During the high rainfall period, the river flows from BNR cause devastation in the floodplains of Sudan and Ethiopia that comes from BNR that is a contributor of 85% of the total NRB.

Discussion
Climate studies [90] showed an increase in temperature by more than 1°C and 1.5°C in NRB countries that leads to 1.5°C and 2°C warming levels, respectively. Likewise, according to Touma et al. [91], due to the more significant influence of temperature changes, the changes in drought characteristics using the SPEI index are substantial than the SPI changes. Osima et al. [90] also specified that NRB countries faster warming than the global mean, which further reinforces the role of temperature in the study area. Thus, a strict increase of SPEI droughts severity is the global warming context that has enhanced NRB and drought severity in the sub-basins.
GRACE/GRACE-FO TWS derived indices are more powerful than standardized indices to detect drought events attributed to natural and human-induced drought events. This strength is due to the GRACE/GRACE-FO capacity to integrate surface to deep aquifer water storage change vertically. Similarly, studies presented by Wang et al. [23] in Northern China plain and Thomas et al. [65] in northern California central valley reported declining groundwater storages due to droughts exacerbation, and GGDI is capable of capturing these phenomena. GRACE/GRACE-FO data convey spatially distributed information about flood and drought-related parameters promptly and efficiently as a unique source of information in ungauged basins, where reliable historical records of precipitation and discharge are missing.
Outstandingly, GRACE satellites are exceptional in their ability to monitor changes in TWS that encompasses the land surface to the deepest aquifers that vertically integrate [22] water storage changes. TWSA comprises groundwater storage, soil moisture storage, and surface water storage [92]; therefore, GRACE TWS gives an alternative method to monitor drought from an integrated approach [28]. Thus, the GRACE satellite can detect the loss or gain of deep soil water as well as groundwater [93]; it effectively characterizes surface to underground water surplus and deficit with practical understanding into integrated hydroclimatic extreme event indices.
These standardized drought indices are generally dependent on hydrologic fluxes and meteorological variables that only consider a limited centimeter on the top surface. Simultaneously, subsurface WSDs changes might also play a critical role in the hydroclimatic extreme formation. These conditions apply mainly to extreme-severity and long-term (flood and drought) events. On the contrary, GRACE/GRACE-FO based indices can quantify the actual amount of water deficit or surplus from the storage [22] because it assimilates the effects of various subsurface and surface hydrologic processes. Therefore, particular region actual hydrological conditions and the associated hydroclimatic extreme events are better explained by GRACE/GRACE-FO derived indices.
Furthermore, ground-based point measurements acquired from hydrometeorological stations may not accurately describe the regions of interestʹs spatiotemporal characteristics, particularly large-extent areas like NRB. WSDI is integrated land water storage variations observed from space, less complicated in terms of numerical and statistical computations than the standardized drought indices. Generally, GRACE/GRACE-FO derived hydroclimatic extreme indices have a substantial application potential, specifically for large-scale and scarce hydro-meteorological recording regions.
GRACE/GRACE-FO derived indices are associated with certain limitations owing to the entire dependence on GRACE-observed TWSA that imbue the WSDI infers. For instance, GRACE/GRACE-FO by itself cannot decouple different hydrologic store contributions [94] from monthly estimates of TWSA. Moreover, WSDI is more effective over large spatial scales inheriting the GRACE/GRACE-FO accuracy challenge [21]. The new GRACE-FO and GRACE mascon solutionsʹ spatial resolution is higher than that of the previous solution. Besides, as Thomas et al. [25] reported, time series of at least 30 years in length is preferable while GRACE-TWSA has 18 years of monthly TWSA, making the GRACE/GRACE-FO derived indices hydroclimatic extreme evaluation and estimation challenging. However, to overcome spatiotemporal resolution, we recommend reconstructing more than 30 years of TWSA from in situ/remote sensing data and downscaling the GRACE/GRACE-FO data by integrating with high-resolution remote sensing data. As the GRACE-FO observations updated by increasing the new observations, GRACE-FO derived indices method can be updated continuously.
GRACE/GRACE-FO TWSA data are typically less useful for some mild droughts and flood events caused by a precipitation deficit/surplus; subsequently, the surface water storage ruins within normal conditions. Even though GRACE/GRACE-FO resolution is improved, solo GRACE-FO data is still not sufficiently accurate to characterize hydroclimatic events in smaller basins (i.e.,≤ 200,000 km 2 ) owing to the GRACE footprint limit of nearly 200,000 km 2 [95]. Thus, we recommend station-derived observation supplementary methods with GRACE/GRACE-FO to characterize small-extent for less extreme hydroclimatic events.

Conclusions
This study proposed a framework that comprises hydroclimatic extreme indices using GRACE/GRACE-FO time series and standardized indices over NRB during 1950-2019. This framework incorporates groundwater, agricultural, and meteorological droughts and flood severity based on respective indices. Our results indicate that the GRACE/GRACE-FO derived indices can adequately capture the drought and flood events that agree reasonably well with PDSI, SPI, SPEI, SSI, MSDIp, MSDIe, and, though differences occur owing to inherent differences among indices.
The results of this study showed that the NRB drought and flood event severity and frequency increased after 1988. From 1950 to 2019, eight floods and ten droughts were identified based on the standardized indices (1950-2019) and GRACE/GRACE-FO derived (i.e., 2003-2019), mostly in agreement with those reported by different institutions. Out of these, four were severe flood events (1988,2007,2016, and 2019), and five severe drought events (1984, 2005/6, 2009/10, 2011/12, 2014/15). Some of the flood and drought events (e.g., event # 2, 5, 10, and 11) are categorized in different severity levels by different drought indices; they may be endorsed to differences in the data, category standards, time scales, and the computation technique of indices. The correlation between PDSI and WSDI (r 2 = 0.72) is higher than other standardized indices, which is undoubtedly due to a series of water balance parameters included in PDSI computations. Furthermore, a strong correlation exhibits between WSD and PDSI, showing PDSI as a comprehensive drought index (i.e., deduced from meteorological and hydrological variables).
GRACE/GRACE-FO captured hydroclimatic extreme events accurately except for severity level variations among the solutions; the time series of WSDI show a similar fluctuations pattern between 2004 and 2012. Contrarily, the variation increases after 2013, likely misrepresent drought and flood events severity level. Overall, the GRACE/GRACE-FO based indices captured all major flood and drought events and agreed with standardized indices; hence, it may be an ideal substitute for scarce hydro-meteorological sites. Standardized indices rely on vast site observations awkward to implement. The proposed framework can serve as a useful tool for integrated flood and drought situation monitoring and better understand extreme hydroclimatic conditions in NRB and other similar climatic regions.

Conflicts of Interest:
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.