1. Introduction
Malaria poses the biggest threat with about 40% of the world’s population at risk of infection among other vector-borne diseases [
1]. Malaria is responsible for about 300–500 million cases and about one million deaths annually, with a larger percentage of the cases and deaths occurring in sub-Saharan Africa [
2]. According to [
3], about 4.9 million of the South African population representing 10% of the total population live in the malaria-endemic area. Malaria is majorly endemic in three provinces, namely Limpopo, Mpumalanga, and KwaZulu-Natal, although occasionally few major occurrences are sighted in the Northern Cape and North-West provinces along the Orange and Molopo Rivers as a result of the provision of suitable breeding habitats for mosquitoes to survive (Department of Health, South Africa, 2007).
Plasmodium falciparum accounts for about 95% of the total malaria infections in South Africa through
Anopheles arabiensis as the major local vector.
Potential risk factors for malaria transmission such as population movement, population immunity, availability of suitable vector habitats, malaria control measures, social and economic status (a reflection of housing types and housing conditions), environmental factors (land use/cover; vegetation, water body, irrigation/farmlands and elevation) and climatic factors (rainfall, temperature, relative humidity) have been shown to have a significant impact on the transmission of the disease [
4,
5,
6]. According to [
7], the severity of health risks such as malaria, associated with climate and its changes, will depend on the ability of public health and safety systems to address or prepare for these risks, as well as factors such as an individual's behavior, age, gender, and economic status. Impacts will vary based on where a person lives, how sensitive they are to health risks, how much they are exposed to climate impacts, and how well they and their community are able to adapt to these impacts.
With the recent surges in malaria morbidity, there is no better time to address the urgent need for the development of operational malaria early warning systems to predict when, where, and what magnitude of malaria epidemics might occur, with sufficient lead-time to target scarce resources for effective control measure. More often than not, meteorological conditions, such as high rainfall and/or high temperature, are cited as the causing factors for malaria epidemics [
8,
9,
10]. Many efforts have been made to predict malaria epidemics by using climatic variables on a local and/or global scale [
5,
6,
11,
12]. However, several studies have reported variations in the relationship between climatic factors and malaria occurrences from one geographic space to another. This suggests that one or more climate factors are more important than others for malaria prediction. This has led to subtle agreement about the relative importance and predictive value of different climatic factors. For instance, temperature was reported as a strong predictor of transmission of the malaria parasite
Plasmodium falciparum over the entire continent of Africa [
12]. On the other hand, rainfall has been reported to be the major climatic factor driving malaria incidence in India [
5], Mozambique [
6], Ethiopia [
10], and in Niger Sahel [
13].
Studies in other malaria endemic regions of South Africa (i.e., Kwazulu-Natal and Mpumalanga Provinces), have shown that the incidence of malaria is related to local climatic variables. In this regard, temperature and rainfall are the most common climate variables correlated against malaria cases [
14,
15,
16,
17,
18]. KwaZulu-Natal and Mpumalanga Provinces have received relatively more research attention compared to the neglected Limpopo province, particularly the rural remote north-eastern area, which continually records a high number of malaria cases.
Previous work by [
19] showed that the incidence rates in Limpopo was related to both temperature and rainfall and hence laid the foundation for relating climatic variables with malaria cases in the province. Therefore, in view of developing an operational malaria early warning system, it is imperative to critically examine the association and lag structure between climatic variables and malaria incidence in the Mutale-Vhembe region of Limpopo Province (
Figure 1). Therefore, the present study is designed with the objective of determining the role of each of the local climatic variables (monthly: total rainfall (Rain), mean maximum temperature (Tmax), mean minimum temperature (Tmin), and mean temperature (Tavg), as well as mean relative humidity (RH)) on malaria occurrence in the Mutale-Vhembe region of the South African Limpopo Province in order to model the effects of the variables on malaria occurrence and predict future cases of malaria in the Mutale local municipality.
3. Results
During the period under investigation, January 1998 to May 2017, a total of 15,739 malaria cases were recorded in Mutale local municipality. This number of cases accounts for 27.1% of the total malaria cases in the Vhembe district municipality (57,974 malaria cases). Within this period, a total of 11,989 (76.2%) malaria cases were locally transmitted, 538 (3.4%) were imported malaria, 1995 (12.7%) were source not indicated, 1057 (6.7%) were untraceable, and cases with an uncompleted investigation totalled 160 (1%). 14,462 (91.9%) of the cases were detected by passive surveillance and 1275 (8.1%) by active surveillance; 2 cases had no indication for surveillance type. Gender wise, the male gender account for 8566 (54.4%) and female 7170 (45.6%) of the cases. Three cases had no indication of gender type.
Figure 2a shows the annual incidence of malaria in Mutale local municipality from January 1998 to May 2017.
There was a remarkable peak in incidence in 2000 with a remarkable decline afterward.
Figure 2b indicates that there was monthly variation in the incidence of malaria in Mutale, with incidence rising from September and droping after May. Peak malaria incidence is noticeable in January and little incidences are recorded in June, July, and August (JJA), which are austral winter months.
Figure 3 shows box plots of malaria cases and climate variables with the quantile values for minimum, first quartile, median, third quartile, and maximum in Mutale for the period under investigation. At a monthly Tavg of 26.6 °C, December is the hottest month of the year. At 18.1 °C on average, July is the coldest month of the year. Between the driest and wettest months, the difference in rainfall is 110 mm. The variation in annual temperature is around 8.2 °C. In addition, over the period under investigation, as shown in
Figure 4, the monthly mean average of Tmax was 29.5 °C (Standard Deviation (SD) = 3.15), the monthly mean average of Tmin was 16.2 °C (SD = 3.49), monthly average Tavg was 22.8 °C (SD = 2.92), monthly average rainfall was 49.8 mm (SD = 5.45), and monthly mean average of RH was 61.7% (SD = 8.86). In addition, over the period considered in the study area, the highest mean Tmax received is 36.0 °C, highest mean Tmin is 23.0 °C, highest mean Tavg is 30.0 °C, highest mean RH is 95%, and the highest total rainfall received is 515 mm, recorded in February 2000.
As shown in
Figure 5, both climatic variables’ time series and malaria cases series from 1998 to 2017 exhibited seasonal patterns. Climatic variables are shown in blue lines and malaria cases in red line. All series showed a number of peaks and fluctuations. The peaks in the series seem to be separated by more than a few months, indicating a seasonal pattern in the variable. From the series, an increasing trend in temperature, with more observations of higher than normal temperatures since 2003, is noticeable. A similar trend is exhibited by both Tmax and Tmin. However, Tmax exhibits a much steeper trend with a slight increase in Tmax observed between November and February (see
Figure 4). The mean RH lies well above 61%. High and low values of RH are recorded during the summer months with the peak in February and in the winter months with the lowest in August, respectively. Total rainfall amount also followed a strongly seasonal pattern during the period under investigation. Between 1998 and 2017, the months of November, December, and January recorded the highest amount of rainfall during the year.
The cumulative rainfall amount per month in these months is above 460 mm. Furthermore, the analysis revealed a strong association between years of high malaria cases and years of above normal rainfall, which significantly increased RH to an average of about 65% and both Tmin and Tmax values at an average of 18 °C and 26 °C, respectively. For the period under investigation, positive correlations are found in two to three months after the tropical storm must have occurred. Four major periods of high malaria incidences found are the 1999–2000, 2007–2008, 2010–2011, and the recent 2016–2017 with an average of (
R = 0.79;
p < 0.001) (see
Figure 5d). Shown in
Figure 6 is the time series of total annual rainfall over the entire Limpopo Province from 1904 to 2016. Previous study has shown that the years of above normal rainfall are associated with positive sea surface temperature anomalies, La Niña conditions in the Pacific Ocean as well as tropical cyclones from the Mozambique Channel [
27]. In summary, at an Tavg of 26.6 °C, December is the hottest month of the year. At 18.1 °C on average, July is the coldest month of the year. Between the driest and wettest months, the difference in precipitation is 110 mm. The variation in annual temperature is around 8.2 °C.
Spearman’s correlation analysis indicates that statistically significant association between all climatic variables (Tmax, Tmin, Tavg, Rain and RH) and monthly malaria cases (
Figure 7). Tmin shows the highest correlation (
R = 0.39;
p < 0.001), followed by Rain and Tavg (r = 0.35;
p < 0.001, r = 0.35;
p < 0.001), followed by RH and Tmax with (
R = 0.29;
p < 0.001, r = 0.25;
p < 0.001), respectively. The relationship between monthly malaria cases and the climatic variables examined at zero to three months lagged-periods show different effects. At zero-month lag time, climatic variables did not show a strong relationship with malaria cases. However, strong, statistically significant correlations were seen between the climatic variables and monthly malaria cases when the climatic variables time-series lagged malaria time series by at least one month (
Table 1). Rainfall is negatively and significantly associated with malaria cases at lags 0 and 1-month. Most significant associations were observed at two-month lag time. Rainfall shows the highest association at two-month lag time (
R = 0.49;
p < 0.001), followed by Tmin (
R = 0.43;
p < 0.001), Tavg (
R = 0.42;
p < 0.001), and RH (
R = 0.39;
p < 0.001). Different combinations of climatic variables with malaria cases were performed. Total monthly rainfall and monthly mean minimum temperature, with a two-month lagged effect, were found to be the most significant climatic variables for malaria transmission in Mutale municipality (
R = 0.55;
p < 0.001).
With the transformation, a stabilized variance is achieved with no trend overtly observed in the series. The ADF test result indicated that the transformed time series of monthly malaria cases is stationary, i.e., the p-value is less than 0.05 meaning that there is no unit root. Hence, the ADF test rejects the null hypotheses of non-stationarity. The plot of the transformed series (
Figure 8) shows no visible strong trend, suggesting that differencing order of 0 term is sufficient for the model. The 95% significance boundaries are plotted as dotted blue lines. Significant spikes in the auto and partial correlation at lag 1, 2, 11, and 12 can be noticed. The spikes at lag 12 might suggest the presence of a seasonal pattern, perhaps as months of the year. The plots suggest SARIMA (2,1,2) (1,1,1)
12 confirmed with the autoarima function as the best forecasting model that fit the training dataset well. The AIC values for the SARIMA models fitted to the malaria cases are shown in
Table 2. From the coefficients obtained, the return equation can be written as:
The diagnostic plots of the fitted model are given in
Figure 9. With all lags of the ACF and PACF within the 95% significance boundaries, the residuals, therefore, suggest that the correlations are close to zero and are normally distributed centered at 0. This implies that the residuals do not deviate significantly from the zero mean white noise process. Hence, this indicates that the model passed the Ljung Box test approach test, model fits the data well, and it is therefore suitable to make a forecast.
Out-of-sample prediction malaria cases from January 2015 to May 2017 using the SARIMA (2,1,2) (1,1,1)
12 model was performed and compared with the 20% observation testing data left out of the modelling procedure (January 2015 to May 2017) (See
Figure 10 and
Table 3). The plots illustrate the predicted malaria cases fitted with observed malaria cases without climatic variables (
Figure 10A) and with climatic variables (
Figure 10B) as exogenous variables. The predictions made with climatic variables, either over-prediction (predictions greater than observed) or under-prediction (predictions less than observed), are within 10% of the observed malaria cases. The dashed black line is the observed cases while the blue line is the predicted cases. The dark gray shaded area is the 80% prediction interval and the light gray shaded area is the 95% prediction interval.
The comparison between the predicted and observed number of malaria cases is shown in
Table 3. The predicted values are relatively close to the observed values; this result indicates that the model provides an acceptable fit to predict the number of malaria cases. The prediction shows continued high amount of malaria cases further down till July, which is normally a low malaria-transmission period, indicating a shift in the malaria season. The performance measures of the prediction accuracy are shown in
Table 4.
4. Discussion
Malaria transmission is influenced by many factors, which include the abundance of malaria vector, the survival rate and longevity of the mosquitoes, the parasite’s development rate in the mosquitoes, the mosquitoes biting rate, and human susceptibility to parasites governed by human behaviour and immunity. Other factors include economic and social factors such as population movement, housing conditions, sanitation condition, and malaria control measures. However, because of its dependence on the environment and climate, its variability is considered as a major determinant of malaria transmission. This is because of climate impacts on the incubation rate of the Plasmodium and the breeding activity of Anopheles.
Increase in temperature is expected to increase the prevalence as well as transmission of malaria because it shortens the interval between mosquitoes’ blood meals, thus reducing the incubation period of the parasite in mosquitoes leading to a reduced time for the production of new mosquito generations. Temperatures range between 20 °C and 30 °C, shortening the extrinsic incubation period of the parasite. At 16 °C, larval development may take more than 45 days leading to a reduction in the number of mosquitoes [
12]. Hence, temperatures below 16 °C and above 30 °C negatively impact on the survival of the mosquitoes. On the other hand, rainfall influences the aquatic stage of the mosquitoes’ life cycle. The laying of mosquito eggs, the development to larvae, and the development into adults require aquatic breeding sites created through rainfall. Rainfall, in addition, increases relative humidity to sustain the longevity of the adult mosquito. Consequently, temperatures between 20 °C and 30 °C, RH above 65%, and adequate amount of rainfall (cumulative average of about 400 mm) are optimal for the survival of
Anopheles to acquire and transmit the parasite. Hence, with a monthly mean of Tmin as 17.45 °C, monthly highest of Tmin as 23 °C, and monthly lowest of Tmin as 11 °C recorded over the period of study, it can be implied that the temperature ranges create an ideal location for malaria vector breeding.
Positive correlations between monthly malaria cases and all climatic variables were found in Mutale local municipality. Mean Tmin has the highest correlation coefficient with malaria cases. This indicates that Tmin influences the transmission of malaria more than any other climatic variables in the Mutale local municipality. The combination of total monthly rainfall and monthly mean minimum temperature, with a two-month lagged effect, were found to be the most significant climatic variables for predicting malaria transmission in Mutale municipality with an
R2 = 0.65 compared to
R2 = 0.54 (mean maximum temperature and rainfall),
R2 = 0.51 (mean average temperature and rainfall), and
R2 = 0.49 (mean relative humidity and rainfall). This result is similar to what was reported by [
15], in the study in Mpumalanga Province in South Africa.
The negative but statistically significant association between rainfall and malaria cases at lag 0 and 1-month may be due to breeding sites being washed off by high rainfall (flooding) [
28]. The significant relationships noticed at time lag above 1-month between the occurrence of malaria and the associated climatic variables, such as rainfall and minimum temperature at two months, would suggest that climatic conditions in a given year (onset of early rainfall particularly those associated with La Niña conditions and tropical cyclones) may affect malaria transmission in the following year. The periods of high rainfall coupled with low temperature and high relative humidity favour saturated soil moisture content leading to longer life span for pockets of water, which could lead to the persistence of larval habitats [
29,
30]. Although La Niña years and tropical cyclones which are associated with flooding could wash away mosquito breeding sites, pockets of standing water created after the flood can generate new breeding sites. This implies that ideal climatic and environmental conditions could result in increased vector population with potential for malaria transmission within few weeks. However, habitat preference of the local vector species is a strong determinant [
31]. The result of the two-month lagged effect is realistic in view of the malaria cycle consisting of three components: (i) the growth of the
Anopheles female mosquito from egg to adult to parasite transmission; (ii) the development of the
Plasmodium parasites (gametocyte to sporozoites) that are able to infect humans; and (iii) the incubation period in the human host from infection to malaria symptoms [
32]. Hence, malaria occurrence can be expected to be at a peak at about one and a half and two months after the onset of rain. Similar results were found in Mozambique [
6,
33], Zimbabwe [
34], and Ethiopia [
11].
The SARIMA (2,1,2) (1,1,1)
12 model was found to be the best fit model for a reliable forecast. Two models were developed for this study; the output of models is shown in
Figure 9. The model with the inclusion of all climatic variables shows a better prediction performance of accuracy of 72%, while the model without climatic model has an accuracy of 51%. This implies that 71% of the variance in malaria cases can be explained by variance in the climatic variables. The result is comparable with results in India [
5], Mozambique [
6], and Burundi [
35]. However, this result is far lower than what was reported in world malaria report, where 90% of the variance in malaria cases can be explained by environmental factors. Other contributing factors may include economic and social factors such as population movement, housing conditions, sanitation condition, and malaria control measures and environmental factors such as land use type and topography, which are not considered in this present study.
5. Conclusions
Although malaria transmission shows seasonality in accordance with climate in South Africa, only a few studies have been conducted in assessing the relationship between malaria cases and climatic variables at a local level, particularly in the north-eastern part of South Africa. Therefore, the aim of this study was to find the relationship between historical malaria cases and climatic variables and hence develop the SARIMA model to predict the expected number of malaria cases per month based on observed malaria cases and climatic variables as the predictors. The study found that total monthly rainfall and monthly mean minimum temperature, with a two-month lagged effect, are the most significant climatic variables for malaria transmission in the Mutale municipality (r = 0.55; p < 0.001). In addition, this study found a relationship between years of high malaria cases and La Niña events, as well as tropical cyclones from the Mozambique Channel. A detailed analysis for comprehensive reporting of this association is in view by the same authors.
The lag-time effect between malaria occurrence and climatic events is essential for the prediction of malaria cases and can be used for planning and implementing malaria control and interventions [
36]. The model can be improved by incorporating interventions such as insecticide residual spraying, land use/cover characteristics, migration pattern, housing conditions, and biology of the vector in preparation for the development of an integrated malaria early warning.
Therefore, in view of the above and the national department of health’s target of 2020 as the malaria elimination year, an integrated multidisciplinary and coordinated response is required to identify all epidemiological and ecological factors driving the persistence of malaria in the northern region of South Africa/the southern region of Zimbabwe.