1. Introduction
Drought is a natural phenomenon without a clear onset which makes it difficult to recognize. It is the world’s costliest natural disaster and can provide impacts in a global perspective, not restricted to places with low average precipitation amounts [
1,
2,
3]. In Europe, the total cost of drought damages recorded from 1976 to 2006 amounted to 100 billion € [
4]. Therefore, the continuous monitoring of the probability of drought events is crucial to deploy short term emergency measures and to mitigate the social, environmental and economic costs and losses associated with those events.
Considerable disagreement exists about the definition of drought. However, all the definitions relate the event to below-average precipitation over a period of time. If the event persists long enough, it can progressively affect soil moisture, water resources, and, consequently, economic and social development. According to its impacts, the droughts can be classified into four categories: meteorological, agricultural, hydrological, and socioeconomic [
1,
5].
Drought indices are the most suitable tool for drought monitoring and evaluation. Many different indices have been proposed in the last decades. Among the hydrological variables adopted to detect and characterize drought occurrences, precipitation is the most widely used, not only due to its intrinsic link with the phenomenon and its consequences but also because precipitation is widely monitored and there are relatively long historical records.
In 2009, the World Meteorological Organization (WMO) recommended the standardized precipitation index (SPI) [
6] to monitor meteorological drought conditions. Since then, the SPI is used worldwide to detect anomalous precipitation over different time scales. The SPI has the advantage of being independent of the magnitude of the mean precipitation, because it is a standardized index, and hence, able to compare droughts in different climatic zones. However, because it provides a standardized numerical value, it is difficult to connect it expeditiously to precipitation deficits, and, consequently, to use it to recognize or to predict drought events.
The droughts have traditionally been studied in a univariate context, mostly aiming at identifying and describing their occurrences. However, as many of the hydrological phenomena, they are characterized by multiple aspects some of them expectably correlated. Since a univariate approach ignores the dependence structure among those aspects, it may result in a poorer representation of the phenomenon.
Before copulas approach, some multivariate techniques were introduced in hydrological studies, such as in the case of the analysis of floods, droughts, and storms. Those techniques contribute to improving the accuracy of the estimates and provide information about the dependence structure among the characteristics. Most of them used bivariate probability distributions, such as bivariate gaussian, exponential, gamma, and extreme value distributions. The disadvantage of such approaches is that the marginals must have the same probability distribution and extensions to more than the bivariate case are not clear [
7,
8]. However, copulas can overcome such difficulties [
9]. The advantages of using copulas to model complex relationships among variables are (1) flexibility in choosing arbitrary marginals and structures of dependence, (2) capability to model more than two variables, and (3) splitting of marginal and dependence structure analysis [
7,
8,
10,
11].
Multiple dependent random variables need more advanced and complex copulas than the common ones that are applied to the bivariate case. An example is the vine-copulas, which are able of coupling multiple variables into a pair-to-pair manner [
12].
The application of copulas in hydrology has gained some relevance in the last decades. Regarding drought analysis, one of the main uses of copulas is to model the frequency analysis, combining different characteristics of the drought events (e.g. intensity, duration, magnitude, and spatial distribution) [
13,
14,
15,
16,
17]. Another important use of copulas is its application to the joint modeling of drought indexes of different drought categories, such as meteorological, hydrological and agricultural [
18,
19,
20].
In climatic regions like Mainland Portugal, insufficient precipitation during the short-duration and well-defined rainy season is the main trigger of the drought events. An innovative use of copulas could be its application in a multivariate context to monitor the evolution of the drought probability during that season, based on the continuous updating of precipitation that already occurred and the one that needs to occur, so that there is (or not) a drought by the end of the rainy season (a typical conditional probability problem). The conditional probability theory coupled with copulas is frequently used in hydrological applications to analyze multivariate dependence [
17,
21,
22] and will be applied in this study.
Aiming at exploring copula’s forecasting capabilities in a drought monitoring context, the concept of a precipitation threshold for drought recognition developed by [
23,
24,
25] was used. In each rain gauge, if the cumulative precipitation in a given timespan falls below the precipitation threshold for that timespan, a drought with a severity (from moderate to extreme), defined by the threshold, will occur.
Figure 1 exemplifies the application to Mainland Portugal of the precipitation surface threshold concept applied to recognize moderate to extreme droughts from October to March (the rainy season). If in a certain location the precipitation registered falls below the value given by one of the maps, then the location experienced a drought, with the intensity given by the threshold to which the map relates.
The application of the precipitation threshold concept in the scope of the present study can be formulated as follows: for a season of months to which the time scale of the SPI and the precipitation threshold, , refer, let denote the observed precipitation in the first months () and the total seasonal precipitation.
A drought will occur by the end of the
-month period if
added to the (unknown) precipitation in the remaining (
) months,
, is not enough to meet the threshold,
, i.e.,:
Consider, for example, the six-month season,
= 6, from October to March (during which most of the precipitation in Portugal falls) and, that at the end of December of a given year, an estimate of the probability of a moderate, severe or extreme drought occurrence is envisaged. Given the observed precipitation from October to December (
with
= 3), the problem to be addressed for each drought intensity can be stated as what is the probability the precipitation from January to March (still unknown) added to the precipitation from October to December being below the drought threshold? The solution is the drought probability given by the following equation:
By coupling a copulas approach with the precipitation threshold concept, the main objectives of this study were as follow: 1) to develop a methodology for a Continuous Drought Probability Monitoring System, CDPMS, 2) to evaluate the performance of CDPMS, and 3) to apply the CDPMS to a study area. To demonstrate the methodology, Mainland Portugal and its rainy season (from October to March) were selected. In Portugal, the precipitation regime is characterized by very pronounced seasonality, in average with 80% of the precipitation occurring from October to March, which makes it relevant to be able to anticipate if drought conditions are expected by the end of that period.
4. CDPMS for Mainland Portugal: Definition and Performance
The following items describe the stepwise development and the performance assessment of the CDPMS developed to continuously monitor the drought probability over Mainland Portugal during the rainy season, based on the precipitation records at the 45 rain gauges of
Figure 3. The application to a specific site is presented in
Section 5.
4.1. Precipitation Thresholds for Drought Recognition
Table 4 presents the precipitation thresholds,
, in the 45 rain gauges obtained by inverting the SPI for the time span of 6 months (SPI6), from October to March, for the different drought intensities (moderate, severe and extreme). By the end of March, a drought with a given intensity occurs in a given rain gauge whenever the precipitation for the period is smaller than the precipitation threshold for that intensity.
4.2. Copula Fitting
In each rain gauge, the bivariate model was constructed by coupling the precipitation in the rainy season, for N = 6, with the precipitation in the initial n months of that season, (i.e., the precipitation in October, for n = 1, from October and November, for n = 2 and so on until n = 5, from October to February). The length of each coupled (, ) series is equal to the length of the recording period (x = 100 years). The parameters were estimated by the MPL method, and the candidate copula families were selected based on the AIC.
For the 45 rain gauges, the frequency of the copula families chosen for each value of
n is presented in
Figure 4. Considering, for example,
n = 1, the percentage of rain gauges where the different types of families where selected is as follows: 31% copula Frank, a symmetric Archimedean copula, 38% Clayton copula, an asymmetric copula with lower tail dependence, 4% Survival Clayton copula, a 180 degrees rotated Clayton with upper tail dependence, 18% Gumbel copula, an asymmetric copula with upper tail dependence, and 9% Gaussian copula, a symmetric Meta-Elliptical copula.
As n increases, the percentage of rain gauges where Frank copula is selected decreases from 31 (n = 1) to 7% (n = 5). This is explained by the increase of the linear dependence between variables as the precipitation in additional months of the rainy season is progressively known and provided to the model. The same applies to the Meta-Elliptical copulas, Gaussian and t Student: less than 10% of the rain gauges for n = 1 to approximately 60% of the rain gauges for n from 3 to 5.
4.3. Drought Risk Monitoring
After the bivariate model has been set up for each rain gauge, the CDPMS was applied to estimate the drought probability, i.e., the temporal evolution of the conditional probability of drought occurrence as new precipitation records are progressively acquired (Equation (2)).
Figure 5 shows the spatial distribution of the moderated drought probability thus achieved for the rainy season of October 2012 to February 2013, chosen as an example. The probability surfaces were obtained by averaging the results at the 45 rain gauges according to the inverse distance weighting (IDW) method with exponent 2.
It is possible to see that, for n = 1 (that is, by the end of October), two regions presented a higher probability of drought occurrence: northwest and southeast regions. For n = 3 (end of December), the probability of drought in those regions increased and even expanded into some of the central areas. As the precipitation in the following months is progressively known, only a few regions of the country have a drought risk smaller than 50%. This could justify issuing an alert regarding a possible drought at the end of March—with caution in December (n = 3), for sure in January (n = 4) and definitely in February (n = 5). Such an early warning could raise the awareness of water resources managers and of civil protection authorities, urging the implementation of some anticipatory measures aiming at mitigating the consequences of a possible scenario of drought and water scarcity.
The last map of
Figure 5 shows what happened by the end of March. The almost perfect match between the areas where the drought probability progressively increased with
n (areas shaded from blue to orange and red) and those that in fact experienced drought by the end of March (red circles) clearly indicates the ability of the model to identify areas of increasing drought probability.
4.4. CDPMS Performance Assessment
The performance of the CDPMS was assessed based on the BSS computed for each one of the 45 rain gauges and values of
n, according to the LOOCV methodology (
Section 2.2).
The results achieved for predicted moderate droughts are presented in
Table 5. The values closer to 1 indicate better model performance, and the negative values indicate that the reference forecast outperformed the CDPMS. The ability of the CDPMS to predict the drought probability increases as the number of months,
n, with known precipitation increases: for
n = 1 the average performance for the complete set of rain gauges is 0.03 while for
n = 5 is 0.70.
The rain gauges where CDPMS had a better performance for n = 1 were RG01, RG06, RG33, RG38, RG41, RG42, and RG43 indicating a higher correlation between the precipitation in October and from October to March (BBS ≥ 0.07). Only for n = 1 and for seven rain gauges (RG02, RG09, RG13, RG19, RG26, RG30, and RG37) did the CDPMS perform worse than the reference forecast (BBS < 0). This means that for these rain gauges the knowledge of the precipitation in October does not allow accurate forecasts of the probability of having or not having drought by the end of March. However, the CDPMS performance increases every month, indicating sustained improvement in the monitoring capabilities as new precipitation data is being collected and provided to the model.
Figure 6 shows the spatial distribution and the evolution of the
BSS values for moderate droughts as a function of
n, allowing to identify the areas where the CDPMS has better monitoring capabilities (higher values of
BSS). The spatial interpolation technique used was also the inverse distance weighting (IDW) with exponent 2.
In order to further analyze the variance of the CDPMS performance, box plots were drawn based on the values of
BSS for all rain gauges (
Figure 7). The whiskers have a maximum length of 1.5 × IQR (interquartile range), and the values outside the whiskers are outliers.
Figure 7 shows that the variance of the performance measure,
BSS, increases in the last two months (
n = 4 and
n = 5) indicating that, in some rain gauges, a high probability may not result in a drought event as there is still precipitation to fall. The opposite is also true, low risk does not mean drought cannot happen, because the above threshold precipitation tendency presented in the initial months of the rainy season may not be enough to counterbalance the deficits during the last months. However, both specific cases are less likely to occur.
Drought occurrence in mainland Portugal is associated with a substantial interannual and decadal variability, strongly linked to precipitation shortage during the rainy season [
38,
39]. The dynamic interactions among weather types associated with mainland Portugal due to its location between the Atlantic Ocean and the Mediterranean Sea, strong orographic influence and small size contribute to explain high spatial variability and relative disconnection from general circulation [
39,
40,
41,
42]. The complex interactions between different weather types during the rainy season, also as a result of the different geographic conditions, might explain the variability between the model performances.
Overall, due to copula’s high flexibility, a great variety of copula families can be chosen to model the temporal dependence structure of the precipitation in each specific rain gauge, and, by this way, to represent its spatial variability.
The increasing performance of the CDPMS over time means that it consistently learns with the addition of the precipitation in the following months. It also consistently outperforms the reference forecasts, indicating that it can be a valuable source for assessing drought probability.
The application of the LOOCV excluded any bias in the performance verification by not choosing specific years that could best fit the expected performance, such as very dry or very wet years. Therefore, the CDPMS proved to be a valuable tool for drought probability monitoring. However, its application to Mainland Portugal to monitor under real-time conditions the probability of drought by the end of the rainy season requires a continuous updating of the precipitation records at the rain gauges of
Figure 2, which may not be an easy task. Alternatively, the CDPMS can be applied to a specific site, as exemplified in the next item.
5. CDRMS Applied to a Single Site
This item refers to the CDPMS development and application to a single site in Mainland Portugal aiming at exemplifying how the system can be operated as a drought monitoring, but also forecasting tool. For this purpose, the rainy season of the hydrological year of 2017/2018 (October 2017 to March 2018) at the rain gauge of Santa Marta da Montanha (RG07) was selected. It was assumed that the precipitation was progressively recorded and provided to the model until February 2018 aiming at estimating the drought probability by the end of the rainy season. As already said, such knowledge is extremely important to develop anticipatory actions and to mitigate impacts related to water scarcity.
5.1. CDRMS Development for Santa Marta da Montanha
The dependence structure for each coupled (
,
) precipitation series in RG07 rain gauge was modeled by the Archimedean (Frank) and Meta-Elliptical copulas (Gaussian and t Student).
Table 6 presents the bivariate models selected for each
n, from October (
n = 1) to February (
n = 5). The parameters
(for the Archimedean copulas) and
and
(for the Meta-Elliptical copulas) were estimated using MPL, and the copula families were selected based on the AIC, as described in
Section 2.1.2. The Kendall’s Tau was applied to verify the non-linear dependence between
and
modeled by the copulas. A hypothesis test was applied to determine whether
and
presented a relevant dependence structure, a small
p-value provides strong evidence against the null hypothesis that they are independent, for the 95% confidence level.
The values of the empirical Kendall Tau correlation coefficient presented in
Table 6 indicate that as the precipitation in the rainy season is progressively recorded and introduced in the model, the dependence between
and
becomes stronger. In fact, for
n = 1 the precipitation in October explains only 19% of the precipitation of the rainy season, while for
n = 2 it explains 40%, and for
n = 5, 83%. The small differences between model and empirical Kendall’s Tau values show that the dependence between
and,
was properly modeled by the copulas.
The bivariate model adopted for each coupled (
,
) series in Santa Marta da Montanha rain gauge is presented in
Figure 8. The axes in the figure were graduated in terms of the standard normal deviates that correspond to the non-exceedance probability given by the marginal distributions. The figure shows that as n increases the copulas become narrower due to stronger correlations between
and
.
5.2. CDRMS Application—Drought Risk Monitoring
After the bivariate model has been established, the CDPMS was applied to monitor the drought probability in Santa Marta da Montanha rain gauge during the rainy season of 2017/2018. The results obtained are exemplified in the table of
Figure 9 for the three categories of droughts, moderate, severe, and extreme droughts. The figure includes the precipitation thresholds,
, for the different droughts categories, the precipitation that fell along the season (monthly and cumulative precipitations) and the historical average monthly precipitations.
From October 2017 to March 2018, the total precipitation was 697.6 mm, i.e., below the threshold for moderate drought and above the thresholds for the other drought categories, meaning that a moderate drought event really occurred by the end of March.
In what concerns the moderate droughts, the CDPMS proved to the able to detect the increasing probability of drought: 31% (October 2017), 57% (November 2017), 56% (December 2017), 71% (January 2018), and 97% (February 2018) which could justify issuing a drought alert, at least in January and for sure in February. Despite the considerable above-the-average precipitation that occurred in March, there was a moderate drought, confirming the prediction of CDPMS.
The CDPMS also identified an increased risk of severe drought by the end of March, though with much smaller probability, only 31% in January and 61% in February. However, there was not a severe drought event, which suggests a poorer performance of the CDPMS. This circumstance can be explained by the anomalous and unforeseeable precipitation that took place in March that dampened the expectations of a severe drought.
The last row of the table of
Figure 9 indicates very small probabilities of having an extreme drought by the end of March, which was confirmed.
This example shows the capability of CDPMS in detecting moderate droughts. However, the model was not able to distinguish the intensity of the event, once severe and extreme droughts are very sensitive to an individual precipitation event. Precipitation thresholds for the droughts with higher intensity are lower and can be easily exceeded by a few millimeters of precipitation.
6. Discussion and Conclusion
Drought is a harsh natural disaster with onsets difficult to perceive. Therefore, it is relevant and challenging to develop a trustful tool able to recognize its occurrences and to initiate actions aiming at mitigating its impacts. This study developed such a tool, based on copulas applied to the continuous monitoring of the drought probability, using only precipitation data, the CDPMS.
Such a model uses a kind of stepwise procedure applied to each specific location where the drought probability evaluation is required. It starts with the computation of drought-triggering precipitation thresholds, which enable assigning precipitations to the drought categories given by the SPI [
23]. The next step refers to the setting up of the copula-based bivariate model that, by using historical monthly precipitation data, “couples” the seasonal precipitation of the rainy season (
) with the precipitation until the last but one month of such season (
), according to the dependence structure between
and
.
The last step relates to the application of the CDPMS under current conditions to monitor the drought probability during the rainy season aiming to answer the following question: will there be a drought by the end of the rainy season? Once the precipitation in each month of the current rainy season is progressively known and incorporated into the CDPMS, the model returns the drought probability, that is, the probability of the precipitation being smaller than the one required to avoid drought conditions by the end of the season. Based on that probability, drought warnings can be issued and anticipatory drought mitigation and adaptation measures implemented. The application based on a single rain gauge was exemplified for the rainy season from October 2017 to March 2018.
The CDPMS can also be applied to monitor the evolution of the drought probability during the rainy season in a region, instead of a single site, based on the continuous updating of the precipitation deficits or surplus in the region. Such an innovative application was demonstrated by the application of the CDPMS to Mainland Portugal to monitor the drought probability during the rainy season from October 2012 to March 2013, based on 100-year of precipitation data at 45 rain gauges evenly distributed over the country. The application demonstrated that the CDPMS is able to anticipate the regions that later experienced, in fact, drought conditions.
The study showed that the continuous drought probability monitoring system has the ability to detect drought events simply based on precipitation data. However, it has lower confidence in distinguishing among the drought intensities, probably because the differences among precipitation thresholds for the different intensities are too small and can be easily exceeded by also small, but unforeseeable, amounts of precipitation during the rainy season. The dynamic interactions among weather types associated with Mainland Portugal due to its location between the Atlantic Ocean and the Mediterranean Sea, the strong orographic influence in the precipitation spatial and temporal patterns and its small area may result in the CDPMS performing better in some regions than in others.
Notwithstanding, the CDPMS can help decision-makers to anticipate actions and strategies to decrease potential negative impacts, based on the assignment of a quantitative measure (the probability) to the imminence of a drought event. It also contributes to a systematic warning for water managers and civil protection authorities, allowing them to gradually adjust the public awareness as the threat of a possible drought event becomes more reliable.
The marked seasonality of the rainfall regime in Mainland Portugal makes the precipitation shortages during the rainy season a fundamental trigger of droughts, which explains the good performance of CDPMS despite only based on precipitation data. However, previous studies suggest that, due to the location of the country, the addition of other variables linked to climate, such as teleconnection indexes (North Atlantic Oscillation and sea surface temperature), may improve the drought forecasting capabilities [
43,
44]. In addition, other climatic and hydrological variables such as temperature and runoff could also be incorporated into the model. Further studies could also try to implement a time-varying copula model for bivariate modeling precipitation (
and
) designed to address the nonstationary behavior of some of the hydrological variables that is expected to result from climate change [
45].