Next Article in Journal
Fuzzy Evaluation Model for Critical Components of Machine Tools
Previous Article in Journal
New Improvements of the Jensen–Mercer Inequality for Strongly Convex Functions with Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modeling and Analysis of Monkeypox Outbreak Using a New Time Series Ensemble Technique

by
Wilfredo Meza Cuba
1,
Juan Carlos Huaman Alfaro
1,
Hasnain Iftikhar
1,2,* and
Javier Linkolk López-Gonzales
1
1
Escuela de Posgrado, Universidad Peruana Union, Lima 15468, Peru
2
Department of Statistics, Quaid-i-Azam University, Islamabad 45320, Pakistan
*
Author to whom correspondence should be addressed.
Axioms 2024, 13(8), 554; https://doi.org/10.3390/axioms13080554
Submission received: 9 June 2024 / Revised: 24 June 2024 / Accepted: 27 June 2024 / Published: 14 August 2024
(This article belongs to the Special Issue Modeling and Analysis of Complex Network)

Abstract

:
The coronavirus pandemic has raised concerns about the emergence of other viral infections, such as monkeypox, which has become a significant hazard to public health. Thus, this work proposes a novel time series ensemble technique for analyzing and forecasting the spread of monkeypox in the four highly infected countries with the monkeypox virus. This approach involved processing the first cumulative confirmed case time series to address variance stabilization, normalization, stationarity, and a nonlinear secular trend component. After that, five single time series models and three proposed ensemble models are used to estimate the filtered confirmed case time series. The accuracy of the models is evaluated using typical accuracy mean errors, graphical evaluation, and an equal forecasting accuracy statistical test. Based on the results, it is found that the proposed time series ensemble forecasting approach is an efficient and accurate way to forecast the cumulative confirmed cases for the top four countries in the world and the entire world. Using the best ensemble model, a forecast is made for the next 28 days (four weeks), which will help understand the spread of the disease and the associated risks. This information can prevent further spread and enable timely and effective treatment. Furthermore, the developed novel time series ensemble approach can be used to forecast other diseases in the future.
MSC:
03H10; 34A08; 34A55; 37N40; 62P20; 65L09; 68T09; 91G15; 92D25

1. Introduction

Monkeypox (Mpox) is a contagious zoonotic disease that has experienced significant spread worldwide, with symptoms such as fever, fatigue, and rash as key indicators [1]. This disease has raised concern in Latin America and Chicago due to endemic transmission and low vaccination coverage, highlighting the importance of accessible vaccination strategies and ongoing surveillance to control the disease [2,3]. Consequently, there has been notable interest in recent years due to outbreaks occurring in various regions, leading to hundreds of research studies [4,5]. Therefore, developing and developed countries have had to prepare response plans and continuously monitor Mpox disease since it has significantly impacted international territories since 2022 [6,7,8]. However, it was necessary to emphasize the importance of genomic surveillance to understand and control the spread of emerging infectious diseases, emphasizing the need for a rapid and coordinated response at national and international levels [9,10]. Similarly, the need to highlight the importance of reliable information sources to understand and address this disease effectively was evident [11,12]. Additionally, it was relevant to highlight the assessment of fear associated with Mpox to design customized education and prevention programs, with more significant implications for the development of strategies considering psychosocial factors, such as epistemic credulity and media perception [13,14,15], as well as the critical need to improve awareness strategies and preventive measures to mitigate the risk of Mpox transmission and other potential public health threats to empower individuals in making informed decisions about their well-being [16,17]. Therefore, a rapid and collaborative international response was necessary to control and prevent the spread of the virus to develop effective detection strategies, emergency management, and advancements in antiviral drugs and vaccines to address this public health threat [18,19].
Short-term forecasts of its trajectory at various geographic levels can assist in developing policy and intervention measures for any fast-spreading new illness. However, there are few opportunities to evaluate predicting performance and improve models during a public health crisis [20]. Fortunately, as of September 2022, instances were rapidly declining globally, with non-endemic nations reporting a total of 90,574 cases and 170 fatalities as of 27 December 2023 [21]. Given the diverse effects of the epidemic on different geographical scales and the dramatic drop in Mpox cases, it is critical to retroactively review forecasting approaches to better prepare for future public health catastrophes [22]. In response to this concern, research began to understand the outbreak of Mpox. One study indicated that since May 2022, 108 countries with Mpox outbreaks have been identified, with the disease primarily affecting homosexual and bisexual men [23]. It was suggested that risk factors for contracting Mpox include being a young man, having sex with other men, having unprotected sex, being HIV positive, and having a history of sexually transmitted infections [24]. Furthermore, in November 2022, over 850,000 English tweets using the keyword “Monkeypox” were analyzed, revealing initially negative emotions towards a new global outbreak. It was observed that tweets helped disseminate information such as vaccination locations, global case quantification, symptoms, and prevention methods. However, they were also prone to providing misinformation [25]. On the other hand, the evolution of Mpox entails a significant risk of severe outcomes in terms of hospitalization, with significant differences between the recent outbreak and historical ones, suggesting a possible variation in disease severity at different periods [26].
Various researchers in the year 2022 studied the behavior of Mpox, proposing statistical models that could predict cases of contagion and death with different levels of success. In some cases, they suggested linear regression models for forecasting Mpox outbreaks [27]. In contrast, in another case, they proposed a convolutional neural network model to detect and predict Mpox contagion cases [28]. In 2023, a hybrid technique for predicting Mpox infection and death yielded notable results. In this regard, time series have emerged as a valuable tool for predicting the spread of infectious diseases and improving response capacity to outbreaks. The use of predictive statistical models in the healthcare domain has experienced a significant increase in recent times. These models serve as a crucial link between statistics and medical practice, offering valuable support in decision-making and facilitating the creation of various systems and tools to mitigate uncertainties, improve performance, and establish effective control measures to combat diseases [29].
Other studies addressed the growing threat of Mpox in a post-COVID-19 context, using neural networks to predict its spread in the USA, Germany, the UK, France, and Canada, showing high accuracy in outbreak prediction. The effectiveness of the artificial neural network (ANN) model compared to other methods such as LSTM and GRU was highlighted, emphasizing the importance of deep learning in predicting and controlling emerging diseases like Mpox [30]. Furthermore, classification systems based on neural networks and explainable AI tools were proposed, trained using a dataset of images and achieving over 98 percent accuracy [31]. Challenges such as data availability and quality, biases in datasets, and interpretability in this field were identified, emphasizing the importance of periodically updating the dataset with new images of infected patients for future research [32]. Similarly, time series models, such as ARIMA, have been used to understand the dynamics of infectious disease outbreaks and predict their spread, focusing on developing an effective prediction model to understand short-term behavior [33]. Likewise, a decrease in nucleotide mutation rates was observed, maintaining a balance between bidirectional rates through time series analysis by predicting Mpox virus mutation using deep learning models such as LSTM [34]. Other approaches, such as the innovative filtering and combination technique, accurately forecasted cumulative daily confirmed cases of Mpox using time series and machine learning models, demonstrating the forecasting system’s efficiency and accuracy [35]. Similarly, machine learning techniques and time series analysis allowed for identifying key patterns and trends in disease spread, showing that convolutional neural networks perform better in analysis [36,37]. Likewise, the effectiveness of the stacked ensemble learning approach in predicting transmission rates, especially in Europe, where the pandemic was severe, was evidenced [38]. Furthermore, other studies highlighted the superiority of machine learning approaches over traditional time series models for predicting Mpox, showing that the multilayer perceptron model outperformed ARIMA with lower mean squared error, recommending the application of methods such as extreme learning machine and support vector machine for better future adaptation [39].
Short-term forecasting of infectious diseases has become vital for health policy-making and improving the population’s standards in specific or general localities. In this regard, new contributions should be encouraged by proposing different forecasting tools to provide an extensive range of forecasting models that can be applied to specific or general areas for analysis and study. Hence, the main aim of this research work is two-fold: first, propose a new ensemble time series technique, and second, apply the proposed method to attain precise and efficient short-term Mpox infectious disease forecasting for the world’s four most influential countries (Brazil, France, Spain, and the USA) and the world. Therefore, this approach involved processing the first cumulative confirmed case time series to address variance stabilization, normalization, stationarity, and a nonlinear secular trend component. After that, five single time series models including autoregressive, simple exponential smoothing, autoregressive integrated moving averages, nonlinear autoregressive, and the Theta model, were used to ensure the cleaning (i.e., free from variance stabilization, normalization, stationarity, and seasonality issues), and three proposed ensemble models were used to estimate the filtered confirmed case time series. The proposed ensemble models are based on the weighting technique, such as equal weight to single models, in-sample-based weighing (training), and out-of-sample (validation). However, four different accuracy average errors, such as the mean absolute error, the mean absolute percent error, the root mean squared error, and the root mean log squared error, and a statistical equal forecast test, the Diebold–Marino test, are determined to check the performance of the proposed novel time series ensemble forecasting technique. Furthermore, the developed novel time series ensemble approach can be used to forecast other diseases in the future.
The remainder of this manuscript is structured as follows: Section 2 outlines the general framework of the proposed time series ensemble approach. In Section 3, the proposed time series forecasting approach is applied to the daily cumulative confirmed cases series from the four countries: Brazil, the USA, Spain, and France, as well as the total cases worldwide. Using the best ensemble model within the proposed forecasting approach in this paper, a projection has been made for the next twenty-eight days, equivalent to four weeks. To understand the spread of the disease and associated risks in the four countries with the highest number of infections, as well as the total cases worldwide, a comprehensive discussion is presented in Section 4. Lastly, Section 5 concludes by discussing the study’s limitations and proposing directions for future research.

2. The Proposed Ensemble Approach for Time Series

This section explains the proposed time series ensemble technique for short-term Mpox infectious disease forecasting for the four most influential countries in the world. In the proposed time series ensemble technique, the cumulative time series of confirmed cases is first preprocessed by variance stabilization, trend seasonality, and stationarity concerns. Then, five base models are employed: the autoregressive, the simple exponential smoothing, the autoregressive moving average, the nonlinear autoregressive, and the Theta model. These are utilized to implement the three proposed ensemble models, resulting in a filtered cumulative time series of confirmed cases. Further information about these steps is provided in the subsequent subsections.

2.1. Preparation of Data

This study aims to attain precise and efficient short-term Mpox infectious disease forecasting for the world’s four most influential countries (Brazil, France, Spain, and the USA) and the world. To accomplish this objective, this work employed a novel methodology to model and forecast the cumulative Mpox-confirmed cases. The intent is to discern intricate features inherent in the cumulative Mpox-confirmed case dynamics over time. The anticipated features include a nonlinear secular long-run trend component, high variance and standard deviation, non-normality, and non-stationarity. First, let the time series of the cumulative confirmed new cases of Mpox be donated by C d k ; the super subscript k (k = 1 , 2 , , 5 ) shows the country series, while d shows the dth day data point; and take the natural logarithm (log( C d k )) for all the series to normalize and stabilize the variance and standard deviation. The dynamics of the log daily cumulative confirmed times series, log( C d k ), may be described as:
log ( C d k ) = τ d k + c d k
That is, the log( C d k ) is divided into two major components: a nonlinear secular trend component ( τ d k ) and a residual component ( c d k ). The ( τ d k ) component is a function of the series ( 1 , 2 , 3 , , d ) and is estimated by the cubic regression splines method, and the regression coefficients are estimated by the ordinary least square method. Hence, the final estimated equation may be written in the following manner:
τ ^ d k = α ^ 1 ( τ d 1 k ) + α ^ 2 ( τ d 2 k ) + α ^ 3 ( τ d 3 k ) + c d k
It is worth mentioning that many authors in the literature capture the nonlinear trend in a time series using regression splines [40,41,42]. On the other hand, c d k is a residual component that describes the random series and is obtained as
c d k = l o g ( C d k ) τ ^ d k
Thus, once the cumulative confirmed series are preprocessed (normalize and stabilize the variance and standard deviation; long-run trend component), the next step is to model the remaining residual c d k series; the current work considers five single-time series models and three proposed ensemble models. Hence, all forecasting models are described in the coming subsection.

2.1.1. Autoregressive Model

The autoregressive (AR) model is a time series framework that evidences the interdependence between a given observation and multiple lagged observations (preceding time steps) for predictive purposes. Within an AR model, the forthcoming value of a variable is articulated as a linear combination of its past values, where the model determines the coefficients associated with these past values. The order of an AR model, denoted as ’p’, specifies the count of preceding values considered in predicting the subsequent value. For instance, an AR (1) model utilizes the antecedent value of the variable to forecast its future value, while an AR (2) model integrates the two preceding values into its predictive equation.
C d = ϕ 1 C d 1 + ϕ 2 C d 2 + + ϕ p C d p + ε d
where:
ϕ 1 : Coefficient for C d 1 ϕ 2 : Coefficient for C d 2 ϕ p : Coefficient for C d p C d : Current value at time d C d 1 : Value at time d 1 C d 2 : Value at time d 2 C d p : Value at time d p
The current work used an AR(3) model for the Brazil, France, and Spain cases, while an AR(4) model was used for the USA and the World cases.

2.1.2. Exponential Smoothing Model

The Exponential Smoothing Model (ESM) encompasses a repertoire of forecasting models characterized by exponentially diminishing weights allocated to antecedent observations. Functioning as a dedicated time series forecasting model, it employs a weighted average of historical observations to prognosticate the forthcoming value of a variable. The foundational premise of the ESM model lies in positing that the future value of a variable is intricately tied to its historical values, with a discernibly higher weighting accorded to more recent observations relative to their older counterparts. The ES model can be formally expressed as follows:
C d + 1 = α · C d + ( 1 α ) · C d 1
where:
C d + 1 : predicted value of the variable at time d + 1 C d : observed value of the variable at time d C d 1 : observed value of the variable at time d 1 α : smoothing parameter that determines the weight assigned to the most recent observation

2.1.3. Autoregressive Moving Average Model

The ARMA model integrates autoregressive and moving average components, drawing upon past observations and antecedent forecast errors to project forthcoming values. This model is parameterized by two variables, p, and q, wherein p denotes the order of the AR model, and q signifies the order of the Moving Average (MA) model. The AR component of the model captures the linear relationship between the current variable value and its antecedent values. In contrast, the MA component captures the linear association between the present variable value and its preceding errors. In formal terms, the ARMA model is expressed as follows:
C d = c + ϕ 1 C d 1 + ϕ 2 C d 2 + + ϕ p C d p + ε t + θ 1 ε d 1 + θ 2 ε d 2 + + θ q ε d q
where:
C t : Current value of the variable c : Constant ϕ 1 : AR coefficient for C d 1 ϕ p : AR coefficient for C d p C d 1 : Past values of the variable up to order p ε t : Error term at time d θ 1 , θ q : MA coefficients
In our case, for the selection of the model, we used the auto.arima function using an R-package, version 2023.09.1 forecast.

2.1.4. Nonparametric Autoregressive Model

The nonparametric autoregressive model (NAR) presents an alternative to conventional parametric models, departing from the latter’s reliance on specific mathematical equations to elucidate the relationship between past and future values. In contrast, NAR models employ flexible and adaptive techniques, such as kernel regression or spline functions, to capture dynamic patterns in the data without explicit parameter estimation. These models are distinguished by their flexibility, absence of predefined parameters, emphasis on local relationships, and reliance on data-driven structures to address intricate and nonlinear dependencies within time series data. In this model, the association between C d and its previous terms lacks a specific parametric form, allowing for potential nonlinearities. This relationship is expressed as:
C d = q 1 ( C d 1 ) + q 2 ( C d 2 ) + + q n ( C d n ) + ε d
Here, q j ( j = 1 , 2 , , n ) denotes smoothing functions describing the association between C d and its previous values. In this specific study, cubic regression splines represent the functions q i , and lags 1, 2, and 3 are employed for NPAR modeling.

2.1.5. Theta Model

The Theta model is a forecasting methodology employed to predict future values by analyzing the average change discerned within the time series data. This methodology encompasses the computation of the mean alteration between successive time points, subsequently extrapolating these findings into future periods. The mathematical representation of the Theta model is the following:
C d + 1 = 1 d C d + C d 1 + + C d m + 1
where:
C d + 1 : Predicted value of the variable at time d + 1 m : Number of past values used in the average C d : Past value of the variable at time d C d 1 : Past value of the variable at time d 1 C d m + 1 : Past value of the variable at time d ( m 1 )
The Theta model, characterized by its simplicity and user-friendly attributes, is an apt methodology for short-term forecasting within the domain of stationary time series devoid of discernible trends or seasonality.

2.1.6. The Proposed Ensemble Models

At its core, an ensemble technique integrates outcomes from various models, each meticulously calibrated before unity. This approach capitalizes on the inherent strengths of individual models while compensating for their inherent limitations. Within the scope of this study, ensemble techniques are initially employed to compute weights for the results derived from individual models. Consequently, the proposed ensemble encompasses three distinct weighting strategies: (a) equitable distribution of weight among all single models, denoted as EnsE; (b) weight assignment based on training average accuracy errors (Table 1), designated as EnsT; and (c) weight assignment based on validation mean accuracy measures, denoted as EnsV. The model allocates greater weight to the ensemble model for training and validation datasets with lower mean accuracy errors, while models exhibiting higher mean accuracy errors contribute comparatively less weight to the ensemble. Notably, the model weights assume small positive values, and their accumulation equates to one, signifying the percentage of reliance or anticipated performance from each model.
After estimating the nonlinear curve trend component using the regression spline method discussed above, the next step is forecasting the remaining part (c) using five single and three proposed ensemble models. Once this is performed, we can obtain the daily cumulative confirmed cases of Mpox for the next day as follows:
C ^ d k = e x p ( τ ^ d k + c ^ d k )

2.2. Evaluation Criteria

This study examines two evaluation criteria for the proposed time series ensemble forecasting technique: accuracy average errors and an equal forecast accuracy test. Primarily, Table 1 presents the accuracy average errors, outlining the formulas for computing each metric. The metrics encompass the mean absolute error (MAE), the mean absolute percent error (MAPE), the root mean squared error (RMSE), and the root mean log squared error (RMSLE).
In the given formulations, C d denotes observed values, while C ^ d represents forecasted daily cumulative confirmed cases for the dth observation ( d = 1 , 2 , , 126 = D ). Consequently, diminishing values for MAE, MAPE, RMSE, and RMSLE generally signify heightened predictive accuracy of the model.
Second, a statistically equal forecast test, the Diebold–Marino (DM) test [43], is performed to evaluate the forecasting ensemble time series proposed approach. In the literature on forecasting models, the DM test was mostly used to evaluate time series forecasting models, determining whether the forecast errors from one model are statistically different from those from another model [44,45,46]. To perform the DM test, the forecast errors of each model are calculated using a loss function. Then, a statistical value is computed by comparing the errors of each model. The test statistic is based on the difference between the mean squared errors of the two models. Suppose the test statistic is above a certain threshold and the p-value is below a significance level ( α = 0.05 ). In that case, the forecasts from one model are significantly better than the other model. For instance, calculate the forecast errors for both models. Forecast errors ( e t = C t C ^ t ) are the differences between the observed values ( C t ) and the forecast values ( C ^ t ). Compute the mean difference ( d ¯ ) of the forecast errors: d ¯ = 1 T t = 1 T ( e 1 t e 2 t ) . Where: e 1 t and e 2 t are the forecast errors from Model 1 and Model 2 at time t, respectively, and T is the number of observations. Next, calculate the variance of the differences, such as σ d 2 = 1 T t = 1 T ( e 1 t e 2 t d ¯ ) 2 . Thus, the Diebold–Mariano test statistic DM = d ¯ σ d 2 . Finally, the null and alternative hypotheses are generally stated as H 0 : There is no difference in forecast accuracy between the two models (H0: d ¯ = 0) vs. HA: The two models differ in forecast accuracy (HA: d ¯ 0 ). Hence, the null hypothesis implies that there was not a statistically significant difference in forecast accuracy between the models. In contrast, the alternative hypothesis suggests a significant difference in forecast accuracy between the two models.
To conclude this section, the main steps, including the proposed time series ensemble forecasting technique, are listed in bullet points below, and the flowchart is presented in Figure 1.
  • In the first step, the cumulative Mpox-confirmed cases time series ( C m ) is preprocessed (to address the issue of stabilizing the variance and standard deviation and remove the nonlinear long-run trend component), discussed in detail in Section 2.1.
  • In the second step, we divide the stochastic (short-run dynamic) residual component ( c n ) into three parts: training, validation, and testing datasets. Let { c n , n = 1 , 2 , . . . , N ( 507 ) } represent the residual series of the cumulative Mpox-confirmed cases time series. Then, training dataset: { c w , w = 1 , 2 , . . . , W ( 254 ) } ; validation dataset: { c v , v = 1 , 2 , . . . , V ( 127 ) } ; testing dataset: { c d , d = 1 , 2 , . . . , D ( 126 ) } ; where N ( N = W + V + D ) is the total number of data points.
  • In the third step, model the train data using single models, i.e., AR, ARMA, ESM, NAR, and Theta models.
  • In the fourth step, calculate the one-day-ahead forecast using the expanding window technique. The forecast values, c ^ N ( W + V + d ) j for j = 1, 2, 3, 4, 5, are obtained by the models listed in step 3.
  • In the fifth step, the output of a basic ensemble method is mathematically described by Equation (10).
    c ^ N ( W + V ) + d l = l = 1 5 Ω l c ^ N ( W + V ) + d l
    where Ω l , l = 1 , 2 , , 5 are obtained by three weighting strategies: (a) Equal weight to all single models and denoted by the EnsE; (b) Weight assigned based on training, mean accuracy measures (MAE, MAPE, RMSE, and RMSLE), and denoted by (EnsT); (c) Weight assigned based on validation mean accuracy measures and denoted by (EnsV). The lower accuracy means the error model assigns more weight to the ensemble model in training and validation datasets. In contrast, the highest accuracy means the error model has less weight than the ensemble model. However, the model weights are small, positive values, and the sum of all weights equals one, indicating the percentage of trust or expected performance from each model.
  • In the sixth step, obtain the one-day-ahead forecast values using equations for the EnsE, the EnsT, and the EnsV models.
  • In the last step, evaluate the model based on accuracy and average errors (see Table 1).

3. Case Study Results

This work aims to provide a short-term forecast of the cumulatively infected cases of Mpox using the four most influential countries (Brazil, France, Spain, and the USA) worldwide and for the whole world. The Mpox datasets (daily cumulative confirmed cases) were taken from the official website of “Our World in Data” from 1 June 2022 to 30 April 2023. The graphical presentation and the descriptive statistics of cumulative confirmed cases for all countries and the world can be seen in Figure 2 and Table 2. Figure 2 shows the cumulative Mpox-confirmed cases and an increasing nonlinear curve in all cases. However, this figure shows that the world has the most confirmed Mpox cases, while the USA shows the highest confirmed cases among the most affected countries. On the other hand, Spain had the most confirmed counts at the start, but after September 2022, Brazil obtained the second-highest number of confirmed cases of Mpox, and Spain had the third-highest number of confirmed cases. In the same way, France had higher infected cases at the start, but after August 2022, there were more confirmed new cases than in Spain and Brazil. The current situation is that amongst all countries, the USA has the most new confirmed counts, while Brazil and Spain are in the second and third positions, and France is the fourth most affected country among all countries until 30 April 2023.
In contrast to the graphic presentation, the descriptive statistics, such as minimum, 25% (first quartile), 50% (second quartile or median), 75% (third quartile), arithmetic mean, variance, standard deviation, skewness, kurtosis, and maximum statistics for Brazil, France, Spain, the USA, and the entire world, using original and natural logarithm cumulative confirmed cases time series, are tabulated in Table 1. It is clearly confirmed from this table that the natural logarithm effect on all considered cumulative time series stabilizes the variance and standard deviation as well. Due to this effect, this work will proceed with a log series for all cases for further analysis. Therefore, the complete datasets for all considered countries and the entire world of the daily cumulative confirmed cases covering 334 days were divided into three parts as follows: 1 June to 20 January 2023 (234 days) was used for model estimation (training part), 21 January 2023 to 11 March 2023 (50 days) was used for model validation (hold-out sample), and 12 March 2023 to 30 April 2023 (50 days) was used for model testing (out-of-sample) the one-day-ahead cumulative confirmed cases forecasts.
As confirmed by the previous discussion, all series have an increasing nonlinear trend component. This work extracts the nonlinear curve trend component using the regression spine method to achieve this. The graphical representation of the nonlinear curve trend component along with the original log confirmed cumulative series is shown in Figure 3. Clearly, it can be seen that in all cases, such as sky blue (the whole world), blue (the USA), green (Spain), black (Brazil), and red (France), the nonlinear curve trend component is extracted very well. Once the nonlinear curve trend component is removed, this work moves ahead with further modeling and forecasting with clean cumulative confirmed case time series. The remaining filtered series (clean cumulative confirmed case time series) for the four most affected countries and the entire world case are shown in Figure 4.
Before modeling and forecasting time series data, it is essential to check the stationarity property of the dataset. To do this, this work performed the augmented Dickey–Fuller test and reported the results (statistics and p-values) for the original and clean (taking the natural logarithm and removing the trend component) cumulative confirmed cases time series for all considered countries and the entire world case in Table 3. This table indicates the original cumulative established case time series of the four most affected countries and the world as a whole, all nonstationary. In contrast, the clean cumulative confirmed cases time series (with natural logarithm and removing the trend component) for all considered countries have a higher negative statistic value. They are mostly minuscule (less than 0.05), indicating that the series is stationary at a 5% significance level. Once the dataset has been preprocessed, confirmed cumulative case series are modeled and forecast. To this end, this work uses five single time series models, including the autoregressive model, the exponential smoothing model, the autoregressive moving averages model, the nonlinear autoregressive model, and the Theta model, and the three proposed ensemble models (the EnsE, the EnsT, and the EnsV). Therefore, in the proposed time series ensemble forecasting approach, compare nine total models within the two contexts, such as comparing single model performance, the proposed ensemble models, and single verse ensemble models.
Hence, for all nine models for the four most affected countries and the world case, one-day-ahead out-of-sample forecast outcomes (MAP, MAE, RMSE, and RMLSE) are listed in Table 4. From Table 4, it is concluded that the EnsV produced the best forecasting results compared to all nine forecasting models within the proposed time series ensemble forecasting approach in all four most affected countries and the entire world case. For instance, the average accuracy errors for these locations are the following: Brazil (MAPE = 0.0000111, MAE = 0.1681917, RMSLE = 0.0000992, RMSE = 1.1861); France (MAPE = 0.00000019, MAE = 0.00000199, RMSLE = 0.00000191, RMSE = 0.00000553); Spain (MAPE = 0.00003314, MAE = 0.2417941, RMSLE = 0.00019113, RMSE = 1.498126); the USA (MAPE = 0.00010156, MAE = 2.996107, RMSLE = 0.00039027, RMSE = 11.99131); and the entire world (MAPE = 0.00021311, MAE = 0.00079817, RMSLE = 21.92421, RMSE = 70.9131). However, the EnsT model shows the second-best forecasting results among all nine forecasting models in all four most affected countries and the entire world, while the third-best forecasting accuracy average error results are given in the following manner: Brazil (the Theta model; MAPE = 0.0000156, MAE = 0.1711713, RMSLE = 0.0001092, RMSE = 1.196843); France (the ARMA model; MAPE = 0.00000025, MAE = 0.00000309, RMSLE = 0.00000216, RMSE = 0.00000649); Spain (the ESM model; MPAE = 0.00004015, MAE = 0.3040281, RMSLE = 0.00020946, RMSE = 1.586085); and the entire world case (the Theta model; MAPE = 0.00010858, MAE = 3.306309, RMSLE = 0.00042027, RMSE = 12.79745). Therefore, it is seen that within all nine forecasting models, the proposed ensemble models (the EnsT and the EnsV models) generally perform better than single models; however, within the single models, different countries have different single best models, as mentioned previously. Note that the best model is an EnsV or equivalent model for all four countries most affected by Mpox and the world. Also, using the proposed ensemble learning leads to a marked reduction in extreme errors (see Table 1). The proposed ensemble learning approach, thus, proves to be particularly effective in forecasting new cumulative confirmed cases of Mpox diseases.
Table 5 gives the p-values for the hypothesis of equal forecast accuracy according to the Diebold and Mariano (DM) test. The DM test has been applied to the series obtained by joining the 50 one-day-ahead forecast errors for each country and each pair of forecasts. Each element of the table is the p-value of a hypothesis system, assuming no difference in the accuracy of the forecasters in the row or column compared to the alternative that the model in the row is more accurate than the model in the queue. Focusing on the EnsV model, in all considered countries and the entire world case, it is statistically significant in terms of accuracy average errors (MAPE, MAE, RMSLE, and RMSE; see Table 4) and statistically not different in terms of the DM test (see Table 5). On the other hand, if we restrict ourselves to single models for all considered countries and the entire world, the best model varies from country to country. Therefore, to conclude this section, from the accuracy average errors (MAPE, MAE, RMSLE, and RMSE) and an equal forecast statistical test (the DM test), we can conclude that the proposed time series ensemble learning forecasting approach is highly efficient and accurate for one day ahead of confirmed cumulative new cases of Mpox for the four most affected countries as well as for the entire world case. In addition, within the proposed time series ensemble learning approach, the proposed EnsV model produces more precise forecasts when compared with the alternative ensemble models and single time series models.

4. Discussion

Once the best models were assessed through average accuracy errors (MAPE, MAE, RMSLE, and RMSE) and an equal forecast statistical test (the DM test), this work proceeded to future forecasting with the superior model (the EnsV). The current work used the ENMV for the confirmed cases of Mpox and forecast from 1 June to 28 June 2023 (four weeks) for the cumulative confirmed cases. The predicted and actual values of the Mpox cumulative confirmed cases are tabulated in Table 5. As seen from this table, the cumulative confirmed cases gradually increased throughout the forecasted span in the case of Brazil. However, in France’s case, only the first two days increased in the cumulative confirmed cases and then became constant, which means there was no evidence of the spread of Mpox within 26 days. In the case of Spain, the same as in the case of France, the first two days increased in the cumulative confirmed cases and then were constant, which means no evidence of the spread of Mpox within 26 days. On the other hand, in the USA case, the cumulative combined cases gradually increased from 30,156 to 30,228. In the same way, in the entire world case, the Mpox increased in confirmed cumulative cases from 87,320 to 87,748. Thus, it is concluded that from this analysis, the spread of Mpox in Brazil, the US, and the entire world has been an increasing trend, while on the other hand, from France and Spain’s points of view, there is no evidence of an increase in the virus.
Finally, to demonstrate the superiority of the proposed final best (EnsV) model forecasting ability, we compared the cumulative infected cases of Mpox to the predicted cases by the suggested best (EnsV) model. To achieve this, we computed the percentage forecast error (PFE), defined as PEF = (|forecasted value - actual value|/|real value|)×100. The PFE values are presented in the final column of Table 6. This column demonstrates that the anticipated values were reasonably close to the actual values regarding low PFE. As a result, our findings provide policy-makers with useful information for guiding future resource allocation and informing mitigation actions. Furthermore, the forecasting exercise will aid in understanding the spread and risk, which may be used to avoid future spread and provide prompt and effective treatment.

5. Conclusions

This work mainly aimed to forecast the short-term transmission rate of the Mpox infection disease in the most infected countries, such as the USA, Brazil, France, Spain, and the world. To this end, this work proposes a unique time series ensemble approach to analyze and predict the spread of Mpox in the top four countries with high infection rates. This approach involved processing the first cumulative confirmed case time series to address variance stabilization, normalization, stationarity, and a nonlinear secular trend component. After that, five single-time series models and three of their proposed ensemble models were used to forecast the clean, confirmed-case time series. The accuracy of the models is evaluated using average accuracy errors (MAE, MAPE, RMSE, and RMSLE) and an equal forecasting accuracy statistical test (the DM test). Based on the results, it is found that the proposed time ensemble forecasting approach is an efficient and accurate way to forecast the cumulative confirmed cases for the top four countries on the globe and the entire world. In addition, using the best ensemble model, a forecast is made for the next 28 days (four weeks), which will help understand the spread of the disease and the associated risks. This information can prevent further spread and enable timely and effective treatment. Furthermore, the developed novel time series ensemble approach can be used to forecast other diseases in the future.
The study only used a cumulative Mpox dataset from the four most affected countries and the whole world. Still, it could be expanded to include other variables, such as the number of new daily cases and daily and cumulative death counts. This would help evaluate the effectiveness of the proposed time series ensemble forecasting approach. Furthermore, it could forecast short-term daily and cumulative COVID-19 confirmed cases, death counts, and recovered cases. However, the proposed forecasting methods only employed single-time series models. In the future, machine learning models such as random forest, support vector regression, Xboost gradient algorithm, etc., will be integrated to enhance the forecasting technique.

Author Contributions

Conceptualization, methodology, and software, H.I.; validation, formal analysis, and investigation, W.M.C. and J.C.H.A. and H.I.; resources, data curation, and writing—original draft preparation, H.I., W.M.C., J.C.H.A. and J.L.L.-G.; writing—review and editing, visualization, and supervision, H.I. and J.L.L.-G.; project administration, funding acquisition, H.I. and J.L.L.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data used in this study are available at https://ourworldindata.org/monkeypox (accessed on 8 January 2024).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Asadi Noghabi, F.; GRizk, J.; Makkar, D.; Roozbeh, N.; Ghelichpour, S.; Zarei, A. Managing Monkeypox Virus Infections: A Contemporary Review. Iran. J. Med. Sci. 2024, 49, 1–9. [Google Scholar] [CrossRef] [PubMed]
  2. Leonard, C.M.; Poortinga, K.; Nguyen, E. Brote de Mpox: Condado de Los Ángeles, California, del 4 de mayo al 17 de agosto de 2023. MMWR Morb. Mortal. Wkly. Rep. 2024, 73, 44–48. [Google Scholar] [CrossRef] [PubMed]
  3. Alqahtani, R.T.; Musa, S.S.; Inc, M. Modeling the role of public health intervention measures in halting the transmission of monkeypox virus. Aims Math. 2023, 8, 14142–14166. [Google Scholar] [CrossRef]
  4. Bustanji, Y.; Shihab, K.H.A.; El-Huneidi, W.; Semreen, M.H.; Abu-Gharbieh, E.; Alzoubi, K.H.; Alqudah, M.A.Y.; Abuhelwa, A.Y.; Abu-Rish, E.Y.; Bajes, H.; et al. Analysis and mapping of global scientific research on human monkeypox over the past 20 years. Vet. World 2023, 16, 693–703. [Google Scholar] [CrossRef] [PubMed]
  5. Usman, S.; Adamu, I.I. Modeling the transmission dynamics of the monkeypox virus infection with treatment and vaccination interventions. J. Appl. Math. Phys. 2017, 5, 2335. [Google Scholar] [CrossRef]
  6. Ipinnimo, T.M.; Adeniyi, I.O.; Ehizibue, P.E.; Dan-Ugbomoiko, O.S. Monkeypox Outbreak—Is This Another Pandemic? Niger. J. Parasitol. 2023, 44, 78–86. [Google Scholar] [CrossRef]
  7. Çolakoğlu, Ö.; Kamran, M.; Bonyah, E. M-Polynomial and NM-Polynomial of Used Drugs against Monkeypox. J. Math. 2022, 2022, 9971255. [Google Scholar] [CrossRef]
  8. Zhu, C. An Adaptive Agent Decision Model Based on Deep Reinforcement Learning and Autonomous Learning. J. Logist. Inform. Serv. Sci. 2023, 10, 107–118. [Google Scholar]
  9. Vazquez, C.; Fonseca, V.; de la Fuente, A.G.; Gonzalez, S.; Fleitas, F.; Lima, M.; Guimarães, N.R.; Iani, F.C.M.; Rojas, A.; Alfonso, T.; et al. Exploring the Genomic Dynamics of the Monkeypox Epidemic in Paraguay. Viruses 2024, 16, 83. [Google Scholar] [CrossRef]
  10. Addai, E.; Ngungu, M.; Omoloye, M.A.; Marinda, E. Modelling the impact of vaccination and environmental transmission on the dynamics of monkeypox virus under Caputo operator. Math. Biosci. Eng. 2023, 20, 10174–10199. [Google Scholar] [CrossRef]
  11. León-Figueroa, D.A.; Barboza, J.J.; Valladares-Garrido, M.J. Sources of information on monkeypox virus infection. A systematic review with meta-analysis. BMC Public Health 2024, 24, 276. [Google Scholar] [CrossRef] [PubMed]
  12. Qurashi, M.A.; Rashid, S.; Alshehri, A.M.; Jarad, F.; Safdar, F. New numerical dynamics of the fractional monkeypox virus model transmission pertaining to nonsingular kernels. Math. Biosci. Eng. 2023, 20, 402–436. [Google Scholar] [CrossRef]
  13. Nimbi, F.M.; Baiocco, R.; Giovanardi, G.; Tanzilli, A.; Lingiardi, V. Who Is Afraid of Monkeypox? Analysis of Psychosocial Factors Associated with the First Reactions of Fear of Monkeypox in the Italian Population. Behav. Sci. 2023, 13, 235. [Google Scholar] [CrossRef] [PubMed]
  14. Sudsutad, W.; Thaiprayoon, C.; Kongson, J.; Sae-dan, W. A mathematical model for fractal-fractional monkeypox disease and its application to real data. Aims Math. 2024, 9, 8516–8563. [Google Scholar] [CrossRef]
  15. Wang, Q.; Jiang, Q.; Yang, Y.; Pan, J. The burden of travel for care and its influencing factors in China: An inpatient-based study of travel time. J. Transp. Health 2022, 25, 101353. [Google Scholar] [CrossRef]
  16. Crosato, V.; Formenti, B.; Gulletta, M.; Odolini, S.; Compostella, S.; Tomasoni, L.R.; Matteelli, A.; Castelli, F. Perception and Awareness about Monkeypox and Vaccination Acceptance in an At-Risk Population in Brescia, Italy: An Investigative Survey. Aids Behav. 2024, 28, 1594–1600. [Google Scholar] [CrossRef] [PubMed]
  17. Khan, A.; Sabbar, Y.; Din, A. Stochastic modeling of the Monkeypox 2022 epidemic with cross-infection hypothesis in a highly disturbed environment. Math. Biosci. Eng. 2022, 19, 13560–13581. [Google Scholar] [CrossRef] [PubMed]
  18. Kumar, P.; Chaudhary, B.; Yadav, N.; Devi, S.; Pareek, A.; Alla, S.; Kajal, F.; Nowrouzi-Kia, B.; Chattu, V.K.; Gupta, M.M. Recent Advances in Research and Management of Human Monkeypox Virus: An Emerging Global Health Threat. Viruses 2023, 15, 937. [Google Scholar] [CrossRef]
  19. Liu, K.; Nie, G.; Jiao, S.; Gao, B.; Ma, H.; Fu, J.; Wu, G. Research on fault diagnosis method of vehicle cable terminal based on time series segmentation for graph neural network model. Measurement 2024, 237, 114999. [Google Scholar] [CrossRef]
  20. Alzubaidi, A.M.; Othman, H.A.; Ullah, S.; Ahmad, N.; Alam, M.M. Analysis of Monkeypox viral infection with human to animal transmission via a fractional and Fractal-fractional operators with power law kernel. Math. Biosci. Eng. 2023, 20, 6666–6690. [Google Scholar] [CrossRef]
  21. Al-Shomrani, M.M.; Musa, S.S.; Yusuf, A. Unfolding the transmission dynamics of monkeypox virus: An epidemiological modelling analysis. Mathematics 2023, 11, 1121. [Google Scholar] [CrossRef]
  22. Alshehri, A.; Ullah, S. Optimal control analysis of Monkeypox disease with the impact of environmental transmission. Aims Math. 2023, 8, 16926–16960. [Google Scholar] [CrossRef]
  23. Minhaj, F.S.; Singh, V.; Cohen, S.E.; Townsend, M.B.; Scott, H.; Szumowski, J.; Hare, C.B.; Upadhyay, P.; Reddy, J.; Alexander, B.; et al. Prevalence of Undiagnosed Monkeypox Virus Infections during Global Mpox Outbreak, United States, June-September 2022. Emerg. Infect. Dis. 2023, 29, 2307–2314. [Google Scholar] [CrossRef] [PubMed]
  24. Spirito, F.; Guida, A.; Caponio, V.C.A.; Lo Muzio, L. Monkeypox: A New Challenge for Global Health System? Life 2023, 13, 1250. [Google Scholar] [CrossRef] [PubMed]
  25. Cooper, L.N.; Radunsky, A.P.; Hanna, J.J.; Most, Z.M.; Perl, T.M.; Lehmann, C.U.; Medford, R.J. Analyzing an Emerging Pandemic on Twitter: Monkeypox. Open Forum Infect. Dis. 2023, 10, ofad142. [Google Scholar] [CrossRef] [PubMed]
  26. Gaertner, F.; Preissner, S.; Petri, W.A., Jr.; Atolani, O.; Heil, M.; Nahles, S.; Preissner, R.; Hertel, M. Comparison of the risk of hospital admission, need for ventilation, sepsis, pneumonitis, and death among the recent monkeypox outbreak and historical outbreaks. BMC Infect. Dis. 2023, 23, 610. [Google Scholar] [CrossRef] [PubMed]
  27. Yasmin, F.; Hassan, M.M.; Zaman, S.; Aung, S.T.; Karim, A.; Azam, S. A Forecasting Prognosis of the Monkeypox Outbreak Based on a Comprehensive Statistical and Regression Analysis. Computation 2022, 10, 177. [Google Scholar] [CrossRef]
  28. Hussain, S.; Ghouse, S. Detection and prediction of monkey pox disease by enhanced convolutional neural network approach. Int. J. Public Health Sci. 2023, 12, 673. [Google Scholar] [CrossRef]
  29. Iftikhar, H.; Daniyal, M.; Qureshi, M.; Tawaiah, K.; Ansah, R.K.; Afriyie, J.K. A hybrid forecasting technique for infection and death from the mpox virus. Digit. Health 2023, 9, 20552076231204748. [Google Scholar] [CrossRef]
  30. Manohar, B.; Das, R. Artificial Neural Networks for the Prediction of Monkeypox Outbreak. Trop. Med. Infect. Dis. 2022, 7, 424. [Google Scholar] [CrossRef]
  31. Muñoz-Saavedra, L.; Escobar-Linero, E.; Civit-Masot, J.; Luna-Perejón, F.; Civit, A.; Domínguez-Morales, M. A Robust Ensemble of Convolutional Neural Networks for the Detection of Monkeypox Disease from Skin Images. Sensors 2023, 23, 7134. [Google Scholar] [CrossRef]
  32. Eliwa, E.H.I.; El Koshiry, A.M.; Abd El-Hafeez, T.; Farghaly, H.M. Utilizing convolutional neural networks to classify monkeypox skin lesions. Sci. Rep. 2023, 13, 14495. [Google Scholar] [CrossRef] [PubMed]
  33. Munir, T.; Khan, M.; Cheema, S.A.; Khan, F.; Usmani, A.; Nazir, M. Time series analysis and short-term forecasting of monkeypox outbreak trends in the 10 major affected countries. BMC Infect. Dis. 2024, 24, 16. [Google Scholar] [CrossRef] [PubMed]
  34. Pathan, R.K.; Uddin, M.A.; Paul, A.M.; Uddin, M.I.; Hamd, Z.Y.; Aljuaid, H.; Khandaker, M.U. Monkeypox genome mutation analysis using a time series model based on long short-term memory. PLoS ONE 2023, 18, e0290045. [Google Scholar] [CrossRef] [PubMed]
  35. Iftikhar, H.; Khan, M.; Khan, M.S.; Khan, M. Short-Term Forecasting of Monkeypox Cases Using a Novel Filtering and Combining Technique. Diagnostics 2023, 13, 1923. [Google Scholar] [CrossRef]
  36. Priyadarshini, I.; Mohanty, P.; Kumar, R.; Taniar, D. Monkeypox Outbreak Analysis: An Extensive Study Using Machine Learning Models and Time Series Analysis. Computers 2023, 12, 36. [Google Scholar] [CrossRef]
  37. Iftikhar, H.; Khan, M.; Khan, Z.; Khan, F.; Alshanbari, H.M.; Ahmad, Z. A comparative analysis of machine learning models: A case study in predicting chronic kidney disease. Sustainability 2023, 15, 2754. [Google Scholar] [CrossRef]
  38. Dada, E.G.; Oyewola, D.O.; Joseph, S.B.; Emebo, O.; Oluwagbemi, O.O. Ensemble Machine Learning for Monkeypox Transmission Time Series Forecasting. Appl. Sci. 2022, 12, 12128. [Google Scholar] [CrossRef]
  39. Qureshi, M.; Khan, S.; Bantan, R.A.R.; Daniyal, M.; Elgarhy, M.; Marzo, R.R.; Lin, Y. Modeling and Forecasting Monkeypox Cases Using Stochastic Models. J. Clin. Med. 2022, 11, 6555. [Google Scholar] [CrossRef]
  40. Shah, I.; Iftikhar, H.; Ali, S. Modeling and forecasting medium-term electricity consumption using component estimation technique. Forecasting 2020, 2, 163–179. [Google Scholar] [CrossRef]
  41. Iftikhar, H.; Bibi, N.; Canas Rodrigues, P.; López-Gonzales, J.L. Multiple novel decomposition techniques for time series forecasting: Application to monthly forecasting of electricity consumption in Pakistan. Energies 2023, 16, 2579. [Google Scholar] [CrossRef]
  42. Shah, I.; Iftikhar, H.; Ali, S. Modeling and forecasting electricity demand and prices: A comparison of alternative approaches. J. Math. 2022, 2022, 3581037. [Google Scholar] [CrossRef]
  43. Diebold, F.X.; Mariano, R.S. Comparing predictive accuracy. J. Bus. Econ. Stat. 2002, 20, 134–144. [Google Scholar] [CrossRef]
  44. Iftikhar, H.; Khan, M.; Turpo-Chaparro, J.E.; Rodrigues, P.C.; López-Gonzales, J.L. Forecasting stock prices using a novel filtering-combination technique: Application to the Pakistan stock exchange. Aims Math. 2024, 9, 3264–3288. [Google Scholar] [CrossRef]
  45. Alshanbari, H.M.; Iftikhar, H.; Khan, F.; Rind, M.; Ahmad, Z.; El-Bagoury, A.A.A.H. On the implementation of the artificial neural network approach for forecasting different healthcare events. Diagnostics 2023, 13, 1310. [Google Scholar] [CrossRef] [PubMed]
  46. Iftikhar, H.; Zafar, A.; Turpo-Chaparro, J.E.; Canas Rodrigues, P.; López-Gonzales, J.L. Forecasting day-ahead brent crude oil prices using hybrid combinations of time series models. Mathematics 2023, 11, 3548. [Google Scholar] [CrossRef]
Figure 1. Monkeypox infections forecasting: A complete proposed time series ensemble approach layout.
Figure 1. Monkeypox infections forecasting: A complete proposed time series ensemble approach layout.
Axioms 13 00554 g001
Figure 2. Comparison of daily confirmed monkeypox cases in four most affected countries from 1 June 2022 to 31 July 2023.
Figure 2. Comparison of daily confirmed monkeypox cases in four most affected countries from 1 June 2022 to 31 July 2023.
Axioms 13 00554 g002
Figure 3. Comparison of daily cumulative confirmed monkeypox cases with superimposed the nonlinear trend component in four most affected countries from 1 June 2022 to 31 July 2023.
Figure 3. Comparison of daily cumulative confirmed monkeypox cases with superimposed the nonlinear trend component in four most affected countries from 1 June 2022 to 31 July 2023.
Axioms 13 00554 g003
Figure 4. Residual series after extracting the nonlinear trend component in the four most affected countries and the entire world case within the proposed forecasting technique.
Figure 4. Residual series after extracting the nonlinear trend component in the four most affected countries and the entire world case within the proposed forecasting technique.
Axioms 13 00554 g004
Table 1. Evaluation average errors.
Table 1. Evaluation average errors.
S.NoErrorFormula
1MAE 1 D d = 1 D | C d C ^ d |
2MAPE 1 D d = 1 D C d C ^ d C d
3RMSE d = 1 D ( C m C ^ d ) 2 D 0.5
4RMSLE 1 D d = 1 D [ log ( C m + 1 ) log ( C ^ d + 1 ) ] 2 0.5
Table 2. Descriptive statistics.
Table 2. Descriptive statistics.
EstimatorMinimum25%50%Mean75%MaximumVarianceStandard DeviationSkewnessKurtosis
Brazil1.003705.509606.007418.4210,758.0010,915.0017,131,328.254139.00−0.862.06
France17.002889.004102.003329.434128.004144.001,848,070.371359.44−1.493.59
Spain142.006284.007392.006155.197531.757549.005,487,833.982342.61−1.644.13
USA25.0016,965.0028,809.0022,030.5229,933.0030,154.00125,647,213.4411,209.25−1.112.52
World756.0047,108.7580,054.5063,284.5485,658.7587,294.00894,676,742.0329,911.15−1.062.53
log (Brazil)0.008.219.178.059.289.305.672.38−2.257.10
log (France)2.837.978.327.848.338.331.131.06−2.528.73
log (Spain)4.968.758.918.508.938.930.850.92−2.447.89
log (USA)3.229.7410.279.3710.3110.313.341.83−2.116.24
log (World)6.6310.7611.2910.7211.3611.381.291.14−2.036.12
Table 3. Nonstationary outcomes: ADF test statistic and p-values for all considered series.
Table 3. Nonstationary outcomes: ADF test statistic and p-values for all considered series.
TestNonstatinay Outcomes
SeriesStatisticp-value
Brazil1.3999−0.9900
France−13.3770−0.0086
Spain−1.7358−0.6881
USA−15.9970−0.0061
World−2.2683−0.4635
log (Brazil)−17.6770−0.0041
log (France)0.3030−0.9910
log (Spain)−14.6470−0.0049
log (USA)0.1666−0.9919
log (World)−19.83−0.0056
Table 4. Accuracy mean errors.
Table 4. Accuracy mean errors.
Brazil
ModelMAPEMAERMLSERMSE
AR0.0000370.40552470.00012161.332314
ARIMA0.000414.49305020.00050715.557751
ESM0.00007980.8740990.00018071.98015
NPAR0.00029753.26041580.00038334.199699
Theta0.00001560.17117130.00010921.196843
EnsE0.000167981.84085220.000260382.8533514
EnsT0.00001210.17089170.00010021.18923
EnsV0.00001110.16819170.00009921.1861
France
ModelMAPEMAERMLSERMSE
AR0.000023610.000318090.000001070.00086441
ARIMA0.000000250.000003090.000002160.00000649
ESM0.000016080.065758630.000022430.09174509
NPAR0.000401451.641943090.000409241.67457669
Theta0.000000340.001379470.000002620.01071651
EnsE0.0000883260.3419232740.0000871040.355713038
EnsT0.000000230.000002190.000002010.00000581
EnsV0.000000190.000001990.000001910.00000553
Spain
ModelMAPEMAERMLSERMSE
AR0.000069450.52586390.000261011.977389
ARIMA0.000079030.59844670.000253691.921771
ESM0.000040150.30402810.000209461.586085
NPAR0.000112780.85400590.000311972.362472
Theta0.000057830.43785650.000216671.640763
EnsE0.0000718480.544040220.000250561.897696
EnsT0.000037580.28457810.000204261.546656
EnsV0.000033140.24179410.000191131.498126
USA
ModelMAPEMAERMLSERMSE
AR0.000222636.7781240.0004547113.8483
ARIMA0.0003369910.2594370.0004785414.57806
ESM0.0006404619.5074430.0008527726.00396
NPAR0.002634620.1429950.0009349333.12454
Theta0.000108583.3063090.0004202712.79745
EnsE0.00078865211.99886160.00062824420.070462
EnsT0.000102193.1001010.0004019112.12132
EnsV0.000101562.9961070.0003902711.99131
World
ModelMAPEMAERMLSERMSE
AR0.0003913434.986120.0008588776.997
ARIMA0.000930283.323060.00161852145.44294
ESM0.0002648223.635970.0008084872.37218
NPAR0.00188915168.466510.0024785221.42033
Theta0.0004573440.919390.000939184.17509
EnsE0.0007865770.266210.001340694120.081508
EnsT0.0002596123.174510.0008009971.71362
EnsV0.0002131121.924210.0007981770.9131
Table 5. The DM test outcomes (p values) for all considered models.
Table 5. The DM test outcomes (p values) for all considered models.
Brazil
ModelARARIMAESMNPARThetaEnsEEnsTEnsV
AR00.90380.865760.902610.118220.892950.12060.02124
ARIMA0.096200.0950.094970.096340.094410.096350.04636
ESM0.134240.90500.904970.131730.900730.132060.03214
NPAR0.097390.905030.0950300.097650.093710.097690.0377
Theta0.881780.903660.868270.9023500.892480.194050.08491
EnsE0.107050.905590.099270.906290.1075200.107640.0767
EnsT0.87940.903650.867940.902310.805950.8923600.16451
EnsV0.878760.903640.867860.90230.815090.892330.835490
France
ModelARARIMAESMNPARThetaEnsEEnsTEnsV
AR00.15280.893550.909090.80880.908910.161570.05157
ARIMA0.847200.893550.909090.808920.908910.168140.05214
ESM0.106450.1064500.90910.10590.909060.106450.04645
NPAR0.090910.090910.090900.090910.09090.090910.02091
Theta0.19120.191080.89410.9090900.908910.190980.01098
EnsE0.091090.091090.090940.90910.0910900.091090.0109
EnsT0.838430.831860.893550.909090.809020.9089100.10876
EnsV0.838430.831860.893550.909090.809020.908910.891240
Spain
ModelARARIMAESMNPARThetaEnsEEnsTEnsV
AR00.316860.162520.864380.177230.213560.163230.05276
ARIMA0.6831400.14350.850990.155140.096680.145850.04703
ESM0.837480.856500.853150.897170.84920.171610.04404
NPAR0.135620.149010.1468500.151090.144760.147710.04814
Theta0.822770.844860.102830.8489100.833560.121320.03136
EnsE0.786440.903320.15080.855240.1664400.152760.0332
EnsT0.836770.854150.828390.852290.878680.8472400.15806
EnsV0.837240.852970.835960.851860.868640.846680.841940
USA
ModelARARIMAESMNPARThetaEnsEEnsTEnsV
AR00.876140.903920.882120.09280.889230.092590.0135
ARIMA0.1238600.899140.874770.106210.852320.096280.02522
ESM0.096080.1008600.820320.094960.091670.095690.03581
NPAR0.117880.125230.1796800.114990.120340.115660.04574
Theta0.90720.893790.905040.8850100.896750.175060.04038
EnsE0.110770.147680.908330.879660.1032500.105560.04586
EnsT0.907410.903720.904310.884340.824940.8944400.14791
EnsV0.90650.904780.904190.884260.829620.894140.852090
World
ModelARARIMAESMNPARThetaEnsEEnsTEnsV
AR00.879490.09090.901170.881170.886530.091560.0228
ARIMA0.1205100.117440.906470.120650.131340.117560.01748
ESM0.90910.8825600.901440.899730.890380.143140.02083
NPAR0.098830.093530.0985600.098490.096620.098610.04863
Theta0.118830.879350.100270.9015100.887310.101360.00168
EnsE0.113470.868660.109620.903380.1126900.109850.00983
EnsT0.908440.882440.856860.901390.898640.8901500.08811
EnsV0.907720.882520.879170.901370.898320.890170.891890
Table 6. Monkeypox virus dataset: the forecasted and actual daily cumulative confirmed cases of the Mpox virus using the best proposed model over four weeks.
Table 6. Monkeypox virus dataset: the forecasted and actual daily cumulative confirmed cases of the Mpox virus using the best proposed model over four weeks.
AreaBrazil  France  Spain  USA  World  
DateActualForecastedPFEActualForecastedPFEActualForecastedPFEActualForecastedPFEActualForecastedPFE
1 May 202310,91510,9180.02290414441450.02413754975500.0132530,15430,1560.0066387,30587,3200.01718
2 May 202310,91510,9180.02749414641460.00000755175520.0132430,15430,1570.0099587,36787,2750.10530
3 May 202310,92010,9220.01832414641470.02412755175540.0397330,15430,1580.0132787,37387,2960.08813
4 May 202310,92010,9240.03663414641480.04824755175540.0397330,15430,1580.0132787,42587,4190.00686
5 May 202310,92010,9240.03663414641490.07236755175540.0397330,15430,1580.0132787,42587,4280.00343
6 May 202310,92010,9240.03663414641490.07236755175540.0397330,15430,1580.0132787,43187,4390.00915
7 May 202310,92010,9240.03663414641490.07236755175540.0397330,15430,1580.0132787,43187,4480.01944
8 May 202310,92010,9250.04579414641490.07236755175540.0397330,15430,1580.0132787,43887,4520.01601
9 May 202310,92010,9250.04579414641490.07236755175540.0397330,18830,1600.0927587,50987,4720.04228
10 May 202310,92910,9300.00915414641490.07236755175540.0397330,18830,1690.0629487,51087,4890.02400
11 May 202310,92910,9300.00915414641490.07236755175540.0397330,18830,1760.0397587,55687,5240.03655
12 May 202310,92910,9300.00915414641490.07236755175540.0397330,18830,1810.0231987,55787,5500.00799
13 May 202310,92910,9310.01830414641490.07236755175540.0397330,18830,1890.0033187,57187,5590.01370
14 May 202310,92910,9310.01830414641490.07236755175540.0397330,18831,0922.9945787,57687,5680.00913
15 May 202310,92910,9310.01830414641490.07236755175540.0397330,18830,1940.0198887,57687,5790.00343
16 May 202310,92910,9320.02745414641490.07236755175540.0397330,19430,1960.0066287,61587,5990.01826
17 May 202310,94110,9380.02742414641490.07236755175540.0397330,19430,2000.0198787,61587,6120.00342
18 May 202310,94110,9380.02742414641490.07236755175540.0397330,19430,2040.0331287,63787,6220.01712
19 May 202310,94110,9420.00914414641490.07236755175540.0397330,19430,2060.0397487,64287,6390.00342
20 May 202310,94110,9420.00914414641490.07236755175540.0397330,19430,2080.0463787,64287,6500.00913
21 May 202310,94110,9430.01828414641490.07236755175540.0397330,19430,2080.0463787,64287,6610.02168
22 May 202310,94110,9440.02742414641490.07236755175540.0397330,19430,2100.0529987,66787,6720.00570
23 May 202310,94110,9450.03656414641490.07236755175540.0397330,22530,2150.0330987,71087,6890.02394
24 May 202310,94110,9450.03656414641490.07236755175540.0397330,22530,2190.0198587,71187,7040.00798
25 May 202310,94110,9450.03656414641490.07236755175540.0397330,22530,2130.0397087,73387,7250.00912
26 May 202310,94110,9450.03656414641490.07236755175540.0397330,22530,2180.0231687,73387,7390.00684
27 May 202310,94110,9450.03656414641490.07236755175540.0397330,22530,2230.0066287,73387,7410.00912
28 May 202310,94110,9450.03656414641490.07236755175540.0397330,22530,2280.0099387,73387,7480.01710
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cuba, W.M.; Huaman Alfaro, J.C.; Iftikhar, H.; López-Gonzales, J.L. Modeling and Analysis of Monkeypox Outbreak Using a New Time Series Ensemble Technique. Axioms 2024, 13, 554. https://doi.org/10.3390/axioms13080554

AMA Style

Cuba WM, Huaman Alfaro JC, Iftikhar H, López-Gonzales JL. Modeling and Analysis of Monkeypox Outbreak Using a New Time Series Ensemble Technique. Axioms. 2024; 13(8):554. https://doi.org/10.3390/axioms13080554

Chicago/Turabian Style

Cuba, Wilfredo Meza, Juan Carlos Huaman Alfaro, Hasnain Iftikhar, and Javier Linkolk López-Gonzales. 2024. "Modeling and Analysis of Monkeypox Outbreak Using a New Time Series Ensemble Technique" Axioms 13, no. 8: 554. https://doi.org/10.3390/axioms13080554

APA Style

Cuba, W. M., Huaman Alfaro, J. C., Iftikhar, H., & López-Gonzales, J. L. (2024). Modeling and Analysis of Monkeypox Outbreak Using a New Time Series Ensemble Technique. Axioms, 13(8), 554. https://doi.org/10.3390/axioms13080554

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop