Adjustment of the Standardized Precipitation Index (SPI) for the Evaluation of Drought in the Arroyo Pechel í n Basin, Colombia, under Zero Monthly Precipitation Conditions

: The evaluation of the meteorological drought is fundamental for the management of the water resource. One of the most used indices to evaluate the drought is the standardized precipitation index (SPI) due to its practicality and evaluation in a variety of time scales, however, this uses precipitation as the only variable, depending on the deviations in the precipitation values. This is important when evaluating the SPI, because in some ecosystems close to the equatorial zone, there are very warm periods with low rainfall, in which a large proportion of the data collected by the meteorological stations corresponds to zero. In this research, the SPI was calculated in the Pechel í n basin located in Colombia, in which there is zero precipitation in a large proportion of the data, registering zero precipitation in the month of January and February in 67% and 70% respectively. As a result, the SPI values increased to “wet” ranges, only when the amount of data with zero precipitation exceeded half of the total data; this means that the SPI determines wrong values when it is calculated with zero-precipitation data in large proportions. Based on this ﬁnding, this study aims to modify the index by typing the distribution (using a correction factor K), ﬁnally correcting the SPI values, this correction was called SPI-C. The results indicate that the SPI-C improved the identiﬁcation of drought, obtaining corresponding values that better represent the high frequency of zero precipitation existing in the study area.


Introduction
The SPI is frequently used for drought evaluation, the World Meteorological Organization considers the SPI as a universal drought index because of its capacity to be estimated for various reference periods adapting to the different response times of typical hydrological characteristics to precipitation [1]. Some investigations of the SPI as [2][3][4][5][6] show the practicality of the index to characterize drought events with few resources and at different time scales; this is an advantage over other methods of assessing drought. Although the SPI has been in existence for only a decade, it has been used with notable success in various applications, particularly in describing and monitoring drought conditions. It can be used as an indicator of drought severity or excessive wetness, and in the design of drought/flood contingency plans [7]. The SPI constitutes one of the basic elements to develop strategic planning, which allows the decision-making required in the medium and long term to efficiently face the different drought events that may arise [8].
Some research [9][10][11][12][13][14] indicates the temporal versatility of the index is useful in determining the beginning and end of dry events, it allows the analysis of the impacts of

The Standardized Precipitation Index (SPI)
In the study carried out by [2] in 1997, it was recommended that the SPI be calculated for a continuous monthly precipitation data set of at least 30 years at different time stages. Each of the data sets that are determined from the data from previous months adjusted to the gamma function to define the probability relationship with the precipitation from which the probability of any observed data point is obtained, and the deviation is calculated of precipitation for a normally distributed probability density with a mean of zero and unit standard deviation [21].
The cumulative probability of observed precipitation for the given month and time scale of a given station is given by: As the range of the gamma function is defined by values greater than zero, however, in the data we can find m precipitation values equal to zero, then the probability that this value occurs is m/n; where n is the number of years to evaluate, this probability is called q, then an adjustment is made for when one or more precipitation values equal to zero are found, this is expressed as follows: The value of SPI, for the normal distribution function with zero mean and one variance, is calculated as follows: Atmosphere 2022, 13, 236 The values obtained are classified in a scale proposed by [2], these range from the range of "extremely drought" to "extremely wet". Below is shown the proposed scale used to classify the SPI, see Table 1.

The Standardized Precipitation Index Corrected (SPI-C)
Previously, the SPI was evaluated under precipitation conditions of 0 mm/month (no rain). It was identified that the index calculates "wet", provided that the data with 0 exceed more than half of the total data recorded in the station. This is demonstrated in the research results.
For the adjustment and correction of the SPI values, a new model called SPI-C was proposed, which consisted in the adjustment of the standardized normal distribution employing typing, establishing a K factor that corrects the SPI values. To adjust the SPI values, in which there are more than 50% of the monthly precipitation data with zeros, the variable must be typified in such a way that the maximum SPI value remains the mean or zero, shifting the mean to the maximum value of zeros present in the station, this is done by establishing the value of the coefficient K and subtracting it from the value of the calculated SPI, the SPI-C is described below: When the probability of occurrence of precipitation is zero, the gamma distribution tends to zero, therefore the probability of occurrence of the event remains a function of the probability of zero precipitation.
When the probability of occurrence of zero is greater than 50%, the positive event 0.5 < H(x) ≤ 1 is assumed, which establishes that dry periods are being evaluated as wet periods.
The model is typed as a K coefficient, to transform the SPI into SPIC, the typing is only performed when the probability of occurrence of zeros is greater than 50% of the available data, adjusting the maximum and unique value of the series to the mean (µ = 0), shifting the graph to the left.
The typing, transforming the SPI into SPI-C, adjusts the maximum and unique value of the series to the mean (µ = 0), shifting the graph to the left as shown in Figure 1.

Study Site
The countries located in the equatorial belt generally present vulnerability to drought events due to their climatic characteristics of high temperatures [22][23][24][25][26][27] and had SPI in other ecosystems with warm environments, such as the Colombian Caribbean and the Pechelín stream basin, showing periods with low rainfall. Colombia has various meteorological characteristics depending on its geographical location and its orography, presenting warm climates in the regions of lower elevation for sea level. In [28] it was identified that the extreme or severe periods tend to be more intense, while the mean values have remained more or less constant. In other words, for the December to February quarter, the droughts in Colombia are intensifying; on the contrary, the months of May and June, the humid periods are more marked.
The Pechelín stream basin is in the Caribbean region, it is characterized mainly by its warm climate and low rainfall in the first quarter and extends to low rainfall in the first half. For the evaluation of the drought in the tropical zone, the investigation was carried out in the Arroyo Pechelín basin, in the Colombian Caribbean region, at latitude 9°31′0″ N and longitude 75°34′60″ W, in the department of Sucre in the Caribbean Littoral and Central Caribbean subregions.
The study area has a type of dry tropical climate Aw or As, with two seasons, a humid one with rains and a dry one with very warm temperatures throughout the year, with a temperature that oscillates between 19.7 °C and 35.3 °C, and low precipitation in its driest period. The main municipalities that are supplied with water from the basin or affected by runoff are Colosó, Chalán, Ovejas, Santiago de Tolú, and Toluviejo, which have agricultural, livestock and mining activities.
To calculate the SPI the basin was divided into different zones, low, medium and high, for the orography and the meteorological behavior present in the study area, as can be seen in Figure 2.

Study Site
The countries located in the equatorial belt generally present vulnerability to drought events due to their climatic characteristics of high temperatures [22][23][24][25][26][27] and had SPI in other ecosystems with warm environments, such as the Colombian Caribbean and the Pechelín stream basin, showing periods with low rainfall. Colombia has various meteorological characteristics depending on its geographical location and its orography, presenting warm climates in the regions of lower elevation for sea level. In [28] it was identified that the extreme or severe periods tend to be more intense, while the mean values have remained more or less constant. In other words, for the December to February quarter, the droughts in Colombia are intensifying; on the contrary, the months of May and June, the humid periods are more marked.
The Pechelín stream basin is in the Caribbean region, it is characterized mainly by its warm climate and low rainfall in the first quarter and extends to low rainfall in the first half. For the evaluation of the drought in the tropical zone, the investigation was carried out in the Arroyo Pechelín basin, in the Colombian Caribbean region, at latitude 9 • 31 0" N and longitude 75 • 34 60 W, in the department of Sucre in the Caribbean Littoral and Central Caribbean subregions.
The study area has a type of dry tropical climate Aw or As, with two seasons, a humid one with rains and a dry one with very warm temperatures throughout the year, with a temperature that oscillates between 19.7 • C and 35.3 • C, and low precipitation in its driest period. The main municipalities that are supplied with water from the basin or affected by runoff are Colosó, Chalán, Ovejas, Santiago de Tolú, and Toluviejo, which have agricultural, livestock and mining activities.
To calculate the SPI the basin was divided into different zones, low, medium and high, for the orography and the meteorological behavior present in the study area, as can be seen in Figure 2.

Obtaining Data
The information used is of a secondary nature, monthly precipitation data series, supplied by the existing meteorological stations of the Institute of Meteorology, Hydrology and Environmental Studies (IDEAM). Thirty-five years of records were selected and no more than 10% of data were missing, as seen below in Table 2. The missing data were recovered, using the normal proportion method, because the differences in the annual normal rainfall from the station (missing data station) and the data from the neighboring station (complete data station) differ by more than 10%. Figure 3 shows the summary of the data from the study area.

Obtaining Data
The information used is of a secondary nature, monthly precipitation data series, supplied by the existing meteorological stations of the Institute of Meteorology, Hydrology and Environmental Studies (IDEAM). Thirty-five years of records were selected and no more than 10% of data were missing, as seen below in Table 2. The missing data were recovered, using the normal proportion method, because the differences in the annual normal rainfall from the station (missing data station) and the data from the neighboring station (complete data station) differ by more than 10%. Figure 3 shows the summary of the data from the study area.

Obtaining Data
The information used is of a secondary nature, monthly precipitation data series, supplied by the existing meteorological stations of the Institute of Meteorology, Hydrology and Environmental Studies (IDEAM). Thirty-five years of records were selected and no more than 10% of data were missing, as seen below in Table 2. The missing data were recovered, using the normal proportion method, because the differences in the annual normal rainfall from the station (missing data station) and the data from the neighboring station (complete data station) differ by more than 10%. Figure 3 shows the summary of the data from the study area.  Exploratory analysis of the monthly precipitation data series per station was carried out, using tools belonging to descriptive statistics, (number of data, missing data recovery, mean, standard deviation) to identify possible trends in the data, characteristics in their structure and outliers or outliers, which indicate that there are no outliers and the data behave randomly in the stations under study.
With the data obtained, the data consistency analysis was performed using double mass analysis, t (Student), and Fisher's F tests, establishing that the series are homogeneous, with a probability of 95% for which reason no requires correcting the meteorological data. The nonparametric Mann-Whitney U test (Shapiro-Wilk) was applied, verifying that the precipitation series fit a normal distribution. It is determined that the sample data come from a normal distribution, establishing that the p Value (p Value) is greater than 0.05 adjusting to a normal distribution with a confidence level of 95%.

Time Series of the Standardized Precipitation Index in the Study Basin
A historical series for the calculation of the SPI was carried out for each of the stations present in the study basin in the period between 1985 and 2019, to identify the months for each year with the greatest drought, in which the divergence in drought events can be observed, in which they establish periods of greater drought mainly in the months of January, February and March of each season. This time series allows a general analysis of the intensity of the drought in the basin of the study area, as is shown in Figures 4-9. Exploratory analysis of the monthly precipitation data series per station was carried out, using tools belonging to descriptive statistics, (number of data, missing data recovery, mean, standard deviation) to identify possible trends in the data, characteristics in their structure and outliers or outliers, which indicate that there are no outliers and the data behave randomly in the stations under study.
With the data obtained, the data consistency analysis was performed using double mass analysis, t (Student), and Fisher's F tests, establishing that the series are homogeneous, with a probability of 95% for which reason no requires correcting the meteorological data. The nonparametric Mann-Whitney U test (Shapiro-Wilk) was applied, verifying that the precipitation series fit a normal distribution. It is determined that the sample data come from a normal distribution, establishing that the p Value (p Value) is greater than 0.05 adjusting to a normal distribution with a confidence level of 95%.

Time Series of the Standardized Precipitation Index in the Study Basin
A historical series for the calculation of the SPI was carried out for each of the stations present in the study basin in the period between 1985 and 2019, to identify the months for each year with the greatest drought, in which the divergence in drought events can be observed, in which they establish periods of greater drought mainly in the months of January, February and March of each season. This time series allows a general analysis of the intensity of the drought in the basin of the study area, as is shown in Figures 4-9.      1985  1985  1986  1987  1988  1989  1990  1990  1991  1992  1993  1994  1995  1995  1996  1997  1998  1999  2000  2000  2001  2002  2003  2004  2005  2005  2006  2007  2008  2009  2010  2010  2011  2012  2013  2014  2015  2015  2016  2017  2018 1985  1985  1986  1987  1988  1989  1990  1990  1991  1992  1993  1994  1995  1995  1996  1997  1998  1999  2000  2000  2001  2002  2003  2004  2005  2005  2006  2007  2008  2009  2010  2010  2011  2012  2013  2014  2015  2015  2016  2017  2018

Zero Precipitation Frequency (0 mm/Month)
To evaluate the SPI in critical conditions of absence of precipitation, the number of data that have zero precipitation for each month was determined, establishing the frequency with which it occurs. It was identified that in the lower area of the basin, the months of January and February have a precipitation frequency of 0 mm/month in more than 50% of the data; these months have zero monthly rainfall in more than half of the available data below, as is shown in Figure 10.

Zero Precipitation Frequency (0 mm/Month)
To evaluate the SPI in critical conditions of absence of precipitation, the number of data that have zero precipitation for each month was determined, establishing the frequency with which it occurs. It was identified that in the lower area of the basin, the months of January and February have a precipitation frequency of 0 mm/month in more than 50% of the data; these months have zero monthly rainfall in more than half of the available data below, as is shown in Figure 10.

Zero Precipitation Frequency (0 mm/Month)
To evaluate the SPI in critical conditions of absence of precipitation, the number of data that have zero precipitation for each month was determined, establishing the frequency with which it occurs. It was identified that in the lower area of the basin, the months of January and February have a precipitation frequency of 0 mm/month in more than 50% of the data; these months have zero monthly rainfall in more than half of the available data below, as is shown in Figure 10.

SPI Calculation
For each of the areas of the basin, the value of the monthly average SPI was determined through the analysis of data in the period between 1985 and 2019 grouped in monthly intervals. The upper and middle zone did not present extremely wet or dry values. However, for the lower zone, the months with less precipitation (January, February and March) were those that presented wet SPI values. This occurred due to the high frequency of zero-precipitation values in the months of January, February and March, as shown in Figures 11-13. 50% Figure 10. Precipitation frequency 0 mm/month in the low zone, Pechelín basin.

SPI Calculation
For each of the areas of the basin, the value of the monthly average SPI was determined through the analysis of data in the period between 1985 and 2019 grouped in monthly intervals. The upper and middle zone did not present extremely wet or dry values. However, for the lower zone, the months with less precipitation (January, February and March) were those that presented wet SPI values. This occurred due to the high frequency of zero-precipitation values in the months of January, February and March, as shown in  It is necessary to clarify that having zero precipitation (0 mm/month) in more than half of the available historical months should be an indication of meteorological drought. However, when calculating the SPI, these values rise, as can be seen in the low zone (January and February) concerning the months in which they have more precipitation (rest of the year). This condition can be observed in Figure 13.
The precipitation data in the basin under study begin to rise to "moderately wet", according to the classification of SPI values table suggested by [2], in the months January and February for the low zone. With these data it is shown that in the months with zero monthly precipitation in more than half of all the available historical data (p > 0.5), the standardized precipitation index (SPI) instead of forecasting negative values (drought), begins to generate higher positive values (wet), meaning a significant deviation in the SPI calculation. This evidences the need to correct the SPI values when they are calculated under zero-precipitation conditions.  It is necessary to clarify that having zero precipitation (0 mm/month) in more than half of the available historical months should be an indication of meteorological drought. However, when calculating the SPI, these values rise, as can be seen in the low zone (January and February) concerning the months in which they have more precipitation (rest of the year). This condition can be observed in Figure 13.
The precipitation data in the basin under study begin to rise to "moderately wet", according to the classification of SPI values table suggested by [2], in the months January It is necessary to clarify that having zero precipitation (0 mm/month) in more than half of the available historical months should be an indication of meteorological drought. However, when calculating the SPI, these values rise, as can be seen in the low zone (January and February) concerning the months in which they have more precipitation (rest of the year). This condition can be observed in Figure 13.
The precipitation data in the basin under study begin to rise to "moderately wet", according to the classification of SPI values table suggested by [2], in the months January

SPI-C Calculation (Corrected)
The corrected values were determined from the adjustment in the classification of the distribution presented previously, in which it is established that the correction or SPI-C is made in the values in the lower basin, in the months of January, February and March. (Probability of occurrence of zero precipitation in more than 50% of the data). The SPI-C values are shown in Figures 14-16. and February for the low zone. With these data it is shown that in the months with zero monthly precipitation in more than half of all the available historical data (p > 0.5), the standardized precipitation index (SPI) instead of forecasting negative values (drought), begins to generate higher positive values (wet), meaning a significant deviation in the SPI calculation. This evidences the need to correct the SPI values when they are calculated under zero-precipitation conditions.

SPI-C Calculation (Corrected)
The corrected values were determined from the adjustment in the classification of the distribution presented previously, in which it is established that the correction or SPI-C is made in the values in the lower basin, in the months of January, February and March. (Probability of occurrence of zero precipitation in more than 50% of the data). The SPI-C values are shown in Figures 14-16.  In the middle and high zone, the SPI obtained the same values as the SPI calculated initially. This is because the correction is made for months that exceed the zero precipitation in more than half of the available data.
The SPI-C substantially reduced these values of the index, falling in the month of January from Z = 0.693 → Z = 0.036 and in the month of February from Z = 0.686 → Z = 0.007, classified as normal in the classification table of the SPI suggested by [2]. There was a considerable reduction in the SPI-C values, in the months in which zero was presented in more than half of the monthly precipitation data, as can be seen in Figure 17. To check the validity of the correction, the correlation between the SPI and the SPI-C, for the month of January, February and March was determined through Pearson's linear In the middle and high zone, the SPI obtained the same values as the SPI calculated initially. This is because the correction is made for months that exceed the zero precipitation in more than half of the available data.
The SPI-C substantially reduced these values of the index, falling in the month of January from Z = 0.693 → Z = 0.036 and in the month of February from Z = 0.686 → Z = 0.007, classified as normal in the classification table of the SPI suggested by [2]. There was a considerable reduction in the SPI-C values, in the months in which zero was presented in more than half of the monthly precipitation data, as can be seen in Figure 17. In the middle and high zone, the SPI obtained the same values as the SPI calculated initially. This is because the correction is made for months that exceed the zero precipitation in more than half of the available data.
The SPI-C substantially reduced these values of the index, falling in the month of January from Z = 0.693 → Z = 0.036 and in the month of February from Z = 0.686 → Z = 0.007, classified as normal in the classification table of the SPI suggested by [2]. There was a considerable reduction in the SPI-C values, in the months in which zero was presented in more than half of the monthly precipitation data, as can be seen in Figure 17. To check the validity of the correction, the correlation between the SPI and the SPI-C, for the month of January, February and March was determined through Pearson's linear determination coefficient, this is a measure of linear dependence between two quantitative To check the validity of the correction, the correlation between the SPI and the SPI-C, for the month of January, February and March was determined through Pearson's linear determination coefficient, this is a measure of linear dependence between two quantitative random variables; this means that the mathematical model used to approximate the relationship between the variable (SPI-C) and the independent variable (SPI) reflects a high correlation. Figures 18 and 19 and Table 3 show the correlation between SPI and SPI-C.

Conclusions
SPI was evaluated with information from the study area. The Pechelín basin has critical weather conditions, registering high temperatures and low precipitation. The months of December, January, February and March were identified as the months with the least rainfall during the year. In addition, there are periods of rain during June and July. The identification of periods with low rainfall was important to evaluate the SPI and its effectiveness in evaluating drought. It was identified that, for January, February and March, the SPI began to show increasingly higher "moderately wet" values in the lower area of the basin. This happened only because these months had zero precipitation (0 mm/month) in more than 50% of the available data. In this way, it is shown that it is necessary to correct the SPI values when subjected to these conditions, due to the deviations in the values obtained.
Deviations in SPI values when calculated with a high frequency of zero precipitation, means that the SPI does not correctly represent the drought under these conditions. In

Conclusions
SPI was evaluated with information from the study area. The Pechelín basin has critical weather conditions, registering high temperatures and low precipitation. The months of December, January, February and March were identified as the months with the least rainfall during the year. In addition, there are periods of rain during June and July. The identification of periods with low rainfall was important to evaluate the SPI and its effectiveness in evaluating drought. It was identified that, for January, February and March, the SPI began to show increasingly higher "moderately wet" values in the lower area of the basin. This happened only because these months had zero precipitation (0 mm/month) in more than 50% of the available data. In this way, it is shown that it is necessary to correct the SPI values when subjected to these conditions, due to the deviations in the values obtained.
Deviations in SPI values when calculated with a high frequency of zero precipitation, means that the SPI does not correctly represent the drought under these conditions. In

Conclusions
SPI was evaluated with information from the study area. The Pechelín basin has critical weather conditions, registering high temperatures and low precipitation. The months of December, January, February and March were identified as the months with the least rainfall during the year. In addition, there are periods of rain during June and July. The identification of periods with low rainfall was important to evaluate the SPI and its effectiveness in evaluating drought. It was identified that, for January, February and March, the SPI began to show increasingly higher "moderately wet" values in the lower area of the basin. This happened only because these months had zero precipitation (0 mm/month) in more than 50% of the available data. In this way, it is shown that it is necessary to correct the SPI values when subjected to these conditions, due to the deviations in the values obtained.
Deviations in SPI values when calculated with a high frequency of zero precipitation, means that the SPI does not correctly represent the drought under these conditions. In addition, the correction of the values carried out by the SPI-C reduced the "wet" values, determining values that tend to the "drought" and "normal" range, according to the classification table suggested by [2].
The SPI-C only corrected the values when there is zero monthly precipitation (0 mm/month) in more than half of the data. The SPI-C reduced the values for January in the low zone of the Pechelín basin from Z = 0.693 → Z = 0.036 and for the month of February from Z = 0.686 → Z = 0.007. These results obtained by the SPI-C better represent the high frequency of data with zero precipitation.
The results obtained in the investigation show that the calculated "wet" values when the proportion of zero precipitation exceeds 50% of the data. On the other hand, the SPI-C determines values equal to the SPI when half the data with zero is not exceeded. Therefore, this study concludes that the correction factor (correction factor K) should be used only when the amount of data with zero precipitation exceeds half of the total available data.
The SPI-C improves the identification of drought, due to the correction in the deviations of the values produced by the frequency of zero; This makes it possible to organize and manage the prevention plan for periods of drought in the study area, for the benefit of the inhabitants and the biodiversity associated with the hydrographic basin. From the collected evidence, the SPI-C is a methodology that can be applied in different tropical areas that have similar precipitation characteristics, it is suggested to carry out its evaluation in different areas of the world that present this meteorological condition.

Data Availability Statement:
The data presented in this study are available on request from the corresponding authors.