1. Introduction
In the semi-arid region of Northeastern Nigeria, seasonal rainfall patterns are very important for continuous supply of water for domestic use, because rainfall leads to surface and sub-surface recharge, and for rain-fed agricultural production [
1,
2,
3]. Variability in rainfall over the last 60 years has included repeatedly late onset of rainfall, short dry spells, and sometimes droughts lasting several years. These events are the main factors determining crop failure and water scarcity in the region. Large inter-annual variations in rainfall amounts and prolonged periods of droughts have been recorded in 1967–1973 and 1983–1987 [
4] with a negative impact on the region’s agricultural output and severe consequences for the socio-economic situation of the people [
5].
Rainfall in Northeastern Nigeria is controlled by the West African Monsoon (WAM) [
6], and each year, during the boreal summer, this air mass dominates the region in response to a low pressure belt developing in North Africa with a parallel high pressure belt existing off the Gulf of Guinea. Once the monsoon has commenced, rainfall events are mostly a product of instability and convection cells developing in the lower atmosphere influenced by the nature and characteristics of the land surface [
3]. The amount and frequency of rainfall in Nigeria generally increases from southwest to northeast with some variations over short distances [
7]. In the southernmost part of the region, the wet season commences in late April, while the onset date of rains is delayed northwards with rainfall commencing in July in the northernmost part.
A combination of different factors is believed to be responsible for recent climatic anomalies and the seasonal and inter-annual variability of rainfall in the region. They include: regional and global sea surface temperature (SST) changes [
8,
9], the depth and strength of the seasonal WAM winds [
6], the propagation of the African Easterly Jet (AEJ) [
10,
11] Saharan/Sahel aerosol dust concentrations [
12,
13], and bio-geophysical feedback mechanisms [
14,
15,
16,
17,
18]. Authors in [
19] associated these changes to global warming caused by anthropogenic emission of greenhouse gasses and the gradual expansion of the tropics.
Several studies have analyzed rainfall trends and characteristics in Northeastern Nigeria. Authors in [
1,
4] identified a negative trend in yearly rainfall totals in the region from 1961–1990 with an increase in dry spells (dry episodes) during the wet season. They found a reduction of 8 mm·yr
−1 over the period. Authors in [
20] analyzed total annual rainfall, rainfall intensity, number of rain days, onset and cessation of the wet season in six locations in the region over a 65 years period (1931–1996). Their studies suggest a significant decrease in the annual rainfall from 1965 onwards mainly caused by a reduction of high intensity rainfall of 25 mm or more per day at the peak of the wet season in the months of August and September. There was a sign of a recovery [
21] and a significant increase in annual rainfalls in the 1990s compared to previous decades [
22], but annual rainfall means were only slightly above the long term mean [
3].
There is a relationship between seasonal rainfall regimes and the availability of surface and ground water resources in the region. Reference [
2] pointed out that surface and ground water sources in the semi-arid region of Northeastern Nigeria can be very depleted during the period, signaling the onset of the wet season. Therefore, delays in the commencement of the wet season could cause serious water shortages.
Several rainfall indices are used in planning for agricultural and water resources management [
20]; of those monthly rainfall and number of rain days in a month are analyzed here because the two can provide an insight in the variability of dry spells for each month during the wet season at inter-annual and multi-decadal time scales.
In this paper, these variations are analyzed in a spatially distributed data set,
i.e., for each grid cell of a latitude/longitude grid. An understanding of the magnitude of the temporal scale of rainfall anomalies will contribute to better planning for adaptation to water shortages, timing of planting crops, and the nature and variety of crops to be cultivated. Short-term climate forecasting was used in this study for the same reason. Reference [
3] used spatially averaged time series data of the southern sub-humid and northern semi-arid parts of the region to predict monthly rainfall and number of rain days in a month. Similarly, in this study we used an ARIMA model to make the same predictions spatially explicitly for each ½ degree pixel. The performance of the model in predicting monthly rainfall and number of rain days (days with ≥1 mm rainfall) in a month is also presented and analyzed in spatially distributed form.
The main objective of the study is to evaluate the temporal variation and trends in monthly rainfall and number of rain days in a month for six months (May–October) over a 27-year period. Most of this six-month period (May–October) is in the wet season in the region under the influence of the WAM.
The study also aimed to contribute to the study of ARIMA time series prediction models for monthly rainfall prediction and evaluates the model performance in an area of transition between the tropical sub-humid and semi-arid regions.
The data and methods used in the study are presented in the next section. Subsequently the results, discussion and conclusions are presented in later sections.
3. Results
Maps of spatial-temporal variability in monthly rainfall and number of rain days in a month of the NCEP-CRU data for six months (May–October) for the 27-year period (1980–2006) are presented in
Figure 1. Variability in both rainfall variables increases northwards and this is more obvious north of latitude 10°. Those months signaling the onset (May and June) and cessation (October) of the wet season in most parts of the study area have the largest CV. In May CV is 100% and above from 12° northwards. The months with the highest CV is October. The CV is lowest in the months at the peak of the wet season (July–August) when the CV is less than 20% for most areas south of latitude 10° north. The less than 20% CV for number of days with rainfall in a month also extends into the months of September for areas lower than 10° Latitude.
Figure 1.
Maps of the CV of (a) monthly rainfall over a 27-year period (1980–2006) and (b) the frequency (number of rain days) in a month from May–October. Both are shown as percent values. The axes show the latitude (North) and longitude (East) coordinates in degrees.
Figure 1.
Maps of the CV of (a) monthly rainfall over a 27-year period (1980–2006) and (b) the frequency (number of rain days) in a month from May–October. Both are shown as percent values. The axes show the latitude (North) and longitude (East) coordinates in degrees.
Results of the regression slopes used to study the temporal trends in monthly rainfall and number of rain days in a month (May–October) over the 27-year period are presented in
Figure 2. For the months of May there is a reduction in rainfall over the years with a maximum of −3 mm·yr
−1 between 10° and 11° latitude north. An exception to this trend is found in some areas in the central and northernmost parts of the study area, which have less than 1 mm or 0 increases in rainfall. There is a negative trend in the number of rain days in May, south of the 10° Latitude, with a maximum reduction of −0.3 days·yr
−1. This negative trend moves northwards for the month of June, although, the maximum reduction of −0.2 days·yr
−1 is less than the value obtained in the south for the month of May. Generally, the amount and frequency of rainfall reduces northwards. South of the 10° latitude there is a reduction in the amount of rainfall in the months of June and July (maximum of −2 mm yr
−1) with the changes in frequency of rainfall ranging from 0 to less than −0.1 days·yr
−1. The month of August is the peak of the wet season when all parts of the study area are under the influence of the WAM and are consistently experiencing rainfall. Over the 27-year period, the rainfall in August in the southernmost part of the study area shows no trend, but the trend increases northwards reaching a maximum positive trend of 4 mm·yr
−1 in some areas north of the 10° latitude. However, the increasing trend in the frequency of rainfall is most prominent north of the 10° latitude with a maximum of 0.3 days·yr
−1. In the month of September, there is a corresponding increasing trend in the amount (maximum of 3 mm·yr
−1) and frequency (maximum of 0.4 days·yr
−1) of rainfall centred around 9°–12° latitude North. By October the influence of the WAM is receding, marking the end of the wet season in the study area. In this month rainfall becomes erratic and is mostly restricted to the southern part of the study area (south of latitude 10°N). Rainfall amounts and frequency in October are showing increasing trends in the region south of latitude 10° north. In this part of the study area, rainfall amounts increased by 1 mm·yr
−1 and the number of rain days in a month increased by a maximum of 0.2 days·yr
−1.
Figure 2.
Regression slopes over a 27-year period (1980–2006) of (a) the monthly rainfall amounts in mm·yr−1 and (b) the frequency represented by number of rain days in a month in days·yr−1 from May–October. The axes show the latitude and longitude coordinates in degrees.
Figure 2.
Regression slopes over a 27-year period (1980–2006) of (a) the monthly rainfall amounts in mm·yr−1 and (b) the frequency represented by number of rain days in a month in days·yr−1 from May–October. The axes show the latitude and longitude coordinates in degrees.
The usefulness of the 24-month (January 2005–December 2006) ARIMA model predictions was evaluated using NCEP-CRU data for the same period using a Kolmogorov-Smirnov test to test whether they follow the same statistical distribution. These tests were carried out for each grid cell using the 24-month prediction (i.e., time series from January 2005 to December 2006) and the reference data. The null hypothesis defined states that there is no significant difference between the prediction and the NCEP-CRU data. If p < 0.05 the null hypothesis is rejected. The results of the Kolmogorov Smirnov test p > 0.05 (not presented in the paper) was obtained in areas south of 9° northern latitude and Lake Chad in the north, therefore, the null hypothesis was accepted that there is no significant difference between the distribution of the values between the 24-month predictions and the reference data. However, most parts of the study area especially north of latitude 9°N have p < 0.05, and therefore, the null hypothesis was rejected for those areas. A similar result was obtained for rainfall frequency. Following the same procedure tests were carried for selected months in the wet season (May–October) with the significance of the agreement increasing for each months from July reaching a peak in September (September 2005 and September 2006) where p > 0.05 was obtained for most areas.
The degree of agreement between the NCEP-CRU data and the prediction on monthly basis (May–October) is presented in
Figure 3. A wider range between the limits is acceptable for higher the bias between the NCEP-CRU data and the prediction. Thus, a range between the limits of 101.8 mm of rainfall in the month of August is of less significance than a range of 163.2 mm of rainfall in the month of in July because of the difference in the amounts of rainfall experienced in those two months.
Figure 3.
Bland-Altman scatter plot of spatially aggregated averages of prediction and NCEP-CRU data plotted against their differences for the months of May-October. (a) Monthly rainfall. (b) Monthly rainfall frequency.
Figure 3.
Bland-Altman scatter plot of spatially aggregated averages of prediction and NCEP-CRU data plotted against their differences for the months of May-October. (a) Monthly rainfall. (b) Monthly rainfall frequency.
The prediction throughout the study area shows a slight underestimation of the precipitation in the dryer months and overestimation in the much wetter months and variability is always a little underestimated (
Figure 4a). The number of rain days is always underestimated and the estimated variability tends to be too large compared to the 24 months of NCEP-CRU data (
Figure 4b).
The months of May and June have the highest RMSE (
Figure 5a,b). The predicted precipitation for July, August and September has the lowest RMSE suggesting that while these wet months tend to be over predicted, the prediction is actually quite good (
Figure 5a) because of the significant agreement between the prediction and the reference data. The general underestimation of the number of rain days is reflected in a relatively high RMSE in particular for the dryer months May, June and October (
Figure 5b).
Figure 4.
Monthly means and variabilities throughout the study area of the 25 years (1980–2004), 2005 and 2006 NCEP-CRU data and 2005–2006 model predictions of (a) monthly rainfall in mm and (b) the frequency of the number of rain days in a month (in days) from May–October. The first box (25 yr) shows the 25-year data used in making the 24 months prediction into the future, “24 mon” is the 24 months reference data and “24 mon pred” is the prediction. The values are inclusive of all grids for the months plotted.
Figure 4.
Monthly means and variabilities throughout the study area of the 25 years (1980–2004), 2005 and 2006 NCEP-CRU data and 2005–2006 model predictions of (a) monthly rainfall in mm and (b) the frequency of the number of rain days in a month (in days) from May–October. The first box (25 yr) shows the 25-year data used in making the 24 months prediction into the future, “24 mon” is the 24 months reference data and “24 mon pred” is the prediction. The values are inclusive of all grids for the months plotted.
Figure 5.
RMSE between the 24-month NCEP-CRU and 24-month predictions of (a) monthly rainfall in mm and (b) the frequency of the number of rain days in a month (in days) from May–October. The axes show the latitude and longitude coordinates in degrees.
Figure 5.
RMSE between the 24-month NCEP-CRU and 24-month predictions of (a) monthly rainfall in mm and (b) the frequency of the number of rain days in a month (in days) from May–October. The axes show the latitude and longitude coordinates in degrees.
Figure 6.
P-values of a Student’s t-test between standard normal deviates of the prediction and standard normal deviates of the means of the long-term monthly data used for the prediction. The months used for this test at each grid scale were August–October.
Figure 6.
P-values of a Student’s t-test between standard normal deviates of the prediction and standard normal deviates of the means of the long-term monthly data used for the prediction. The months used for this test at each grid scale were August–October.
Figure 7.
The F-skill values of (a) monthly rainfall in mm and (b) the frequency of the number of rain days in a month (in days) from May–October. The axes show the latitude and longitude coordinates in degrees.
Figure 7.
The F-skill values of (a) monthly rainfall in mm and (b) the frequency of the number of rain days in a month (in days) from May–October. The axes show the latitude and longitude coordinates in degrees.
The study wanted to know parts of the study area where there are significant differences between the prediction and long-term means of the monthly data used for the prediction and where they are not. A Student’s
t-test was used to determine whether the means of the monthly standard normal deviates of the prediction was different from the means of the standard normal deviates of the long term monthly means of the data used for the prediction. The formulated null hypothesis states that there is no significant difference between the means of the standard normal deviates of the prediction and the standard normal deviates of the long term mean. The null hypothesis will be rejected in favor of the alternative hypothesis if
p < 0.05. The Student’s
t-test was carried out at grid level using the months that fall within the wet season (May–October), however, in the process of carrying out the test the months of August–October gave the best results (
Figure 6). Because of the shortness of the prediction it was not possible to make the student’s
t-test on a selected month for instance June or July only in each year as there will be only two variables in each case to test. Results of the grid level test for the months of August–October for monthly rainfall amounts show
p < 0.05 in many parts of the study area. However, for monthly rainfall frequency fewer grid cells have
p < 0.05.
The grid differences between the prediction and the reference data and the performance of the prediction in comparison with the long term mean was carried out using the forecast skill (
Figure 7). Areas where the prediction performed better than the long term mean have positive values, areas where the prediction underperformed have negative values and where the prediction replicated the long term mean have values of zero.
4. Discussion
In this study it was shown that spatial and temporal variability of rainfall increases northwards in each of the months in the wet season (May–October) similar to the findings by [
7,
17]. The CV and regression slopes obtained here show that the decrease in monthly rainfall amounts and frequency in the wet season is mostly in the months following the onset of rainfall (May, June, and July), which is similar to the results of previous studies on rainfall trends and dry spells in the study area. The main difference from previous studies is that in this study the decreasing trend over the 27-year period was mostly confined to the sub-humid part of the study area south of latitude 12° North. The reduction in rain days per month during this period found here is consistent with the findings of [
1,
4], who showed an increasing length of dry spells in the study area. The decreasing and erratic frequency and amount of rainfall at the onset of the wet season found here could be related to the rapid land use and land cover change experienced in the West African sub-region since the early turn of the 20th century as investigated by [
16]. A high inter-annual variation of rainfall in the months after the onset of rainfall has serious implications for rain-fed agriculture in relation to the timing of planting of crops. An increasing likelihood of water shortages both from surface and ground water sources in the study area was reported by [
2]. The highest CV found by [
2] was in October. Increasing trends in the amount and frequency of rainfall in October were less than the trends for those months at the peak of the wet season (August–September) [
2].
The results of Kolmogorov Smirnov test (plots not shown) for the significance in the agreement between the prediction and the reference data increases as the wet season progresses, reaching a peak in the month of September. The significance of the agreement also has a latitudinal gradient generally reducing northward. This exposes the weakness of the model in making prediction in highly varying conditions typical in the region especially at the onset of the wet season. The performance of the prediction was further evaluated in order to determine periods where the prediction fitted the observations better and whether the model is doing much better than simply replicating the long term mean. The Bland-Altman plots suggest much narrower differences between the predictions and the reference data at the peak of the wet season in August. Larger differences between the reference NCEP-CRU data and the ARIMA model’s prediction are in months at the onset of the wet season and in months with high temporal variability (like July). In particular, the comparison of the RMSE of the ARIMA prediction and the climatology, as well as the evaluation of the standard deviates show that the ARIMA model did well in predicting monthly rainfall amounts and to a lesser degree frequency in most of the months in the wet season in the study area in the sense that it provided a better prediction than the long term mean. It is also an indication that the model has memory. However, the model was least successful in predicting the amount of precipitation in the months of May, June, and October at the onset and towards the end the wet season. Those three months have the highest CV. The low performance of the model in making predictions in the dryer months of the wet season and months with higher variability in rainfall amounts and frequency indicates a weakness of the model in making predictions under such conditions.
Similar to other findings in this study, the level of agreement between the NCEP-CRU and predictions generally has a latitudinal gradient and increases southwards.