Next Article in Journal
Hierarchical Bayesian Modeling and Randomized Response Method for Inferring the Sensitive-Nature Proportion
Next Article in Special Issue
New Methodological Approach to Classify Educational Institutions—A Case Study on Romanian High Schools
Previous Article in Journal
Searching for a New Model of Governance in the High Seas: Game Theory Applied to International Commons Management
Previous Article in Special Issue
Hybrid Model for Unemployment Impact on Social Life
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Experiments with Fuzzy Methods for Forecasting Time Series as Alternatives to Classical Methods

by
Bogdan Oancea
1,*,
Richard Pospíšil
2,
Marius Nicolae Jula
1 and
Cosmin-Ionuț Imbrișcă
1
1
Department of Applied Economics and Quantitative Analysis, University of Bucharest, Bucharest 050663, Romania
2
Department of Economic and Managerial Studies, Palacky University of Olomouc, Olomouc 779 00, Czech Republic
*
Author to whom correspondence should be addressed.
Mathematics 2021, 9(19), 2517; https://doi.org/10.3390/math9192517
Submission received: 13 July 2021 / Revised: 16 September 2021 / Accepted: 5 October 2021 / Published: 7 October 2021
(This article belongs to the Special Issue Analysis and Mathematical Modeling of Economic - Related Data)

Abstract

:
Even though forecasting methods have advanced in the last few decades, economists still face a simple question: which prediction method gives the most accurate results? Econometric forecasting methods can deal with different types of time series and have good results, but in specific cases, they may fail to provide accurate predictions. Recently, new techniques borrowed from the soft computing area were adopted for economic forecasting. Starting from the importance of economic forecasts, we present an experimental study where we compared the accuracy of some of the most used econometric forecasting methods, namely the simple exponential smoothing, Holt and ARIMA methods, with that of two new methods based on the concept of fuzzy time series. We used a set of time series extracted from the Eurostat database and the R software for all data processing. The results of the experiments show that despite not being fully superior to the econometric techniques, the fuzzy time series forecasting methods could be considered as an alternative for specific time series.

1. Introduction

In terms of production and exchange relations, economic forecasts at a certain historical level have existed for many centuries. Scottish scientist and genius Adam Smith was among the first to take them seriously at the level of economics as a science. He defined the main economic categories, subjects and relationships in the form that we basically use them today. This concerns economic forecasts, economic models and their use and interpretation [1].
However, the need for economic forecasting was strongly shown by the great economic crisis of the 1930s, which again accelerated research in this area and reached a level that is still valid nowadays. After this catastrophe, economists, politicians and academic scientists focused on gaining an understanding of the real functioning of the economy and where the economy is heading. All this has led to the emergence, development and use of a wider range of statistical and analytical techniques, which have resulted in economic forecasts.
The principle of economic forecasting consists of a qualified effort to predict the future state of the economy using appropriately selected and reliably reported economic variables. They are one of the starting points for government decisions on investment, hiring, spending and other important policies that affect aggregate economic activity [2]. A correct estimate of future economic developments is essential for politicians and government officials, as it helps them to determine how to set the basic fiscal and monetary bases and instruments. Economists working in public administration play a crucial role in policy-making and in setting expenditure and tax parameters. In the area of fiscal policy, these are most often economic forecasts in relation to the amount of tax allocation depending on the forecasted GDP growth and public expenditure. Economic forecasting is important for the creation of plans and budgets in the medium and long term. In contrast, in the area of monetary policy, the main goal of central banks is to maintain a high degree of price stability, and this is primarily influenced by the development of demand and costs. Coyle [3] empirically compared the accuracy of model predictions with equilibrium correction mechanisms and models in differences (or growth rates) in a central bank inflation-targeting environment. As he noted, inflation forecasts for one or two years ahead are the primary source of information for monetary policy decisions in countries that use the inflation-targeting regime.
Thus, economic forecasts involve the creation of statistical models with the input of selected crucial variables, usually in an effort to focus on the future values for predicted economic indicators. These predicted economic indicators often include the inflation rate, interest rates, and the exchange rate as indicators for monetary policy, as well as industrial production, consumer confidence, labor productivity, retail sales and unemployment rate for the areas of fiscal and business policy.
The challenges and subjective aspects of people’s behavior in economic forecasts do not only affect the public sector. The private sector, commercial banks, investors, economists, academics, and also central banks have all issued economic forecasts that were partially or even wildly out of touch with reality [4].
Time series models that use historical economic data are popular methods of economic forecasting, while practice has verified that they are meaningful and competitive, mainly due to econometric systems of equations, especially in their multivariate forms.
Economic forecasting and prediction are often mentioned as ambiguous sciences. Due to the limited sources of data and their credibility, as well as the variously chosen methods, economic forecasting has its limits, which must be considered in practice and in scientific work [5]. Looking back, this is evident, for example, in assessing the reliability of crisis forecasting. The inability to record immediate declines reflects pressure on economists and forecasters to predict it sharply. Many of them prefer not to deviate from the mainstream because they keep in mind that bold projections could damage their reputation and the model’s future application [6].
Managers, business owners, investors and others should also not neglect the subjective nature of results and outputs within economic forecasts. Forecasts are strongly affected by the type of economic theory that forecasters use. Projections can vary very widely. One economist can believe in monetary policy, meaning that GDP growth is primarily influenced by the supply of money, and a second can argue that huge or even deficit spending is harmful for the economy. Thus, some of the outputs and conclusions do not come from qualified and credible economic analysis. On the contrary, they are usually determined on the basis of personal motives and assumptions about the functioning of markets and market forces. This inevitably means that the results of particular policies will be assessed differently [7].
The limits of the accuracy of economic forecasts also follow from the adoption of a certain economic theory as a basis for drawing conclusions. Economic theory can determine the background and basic features of a forecast, and economic knowledge and education can often play a key role. The educated economist should consider that certain conditions of time and place are specific and that the forecast provided by selected statistical methods may be adjusted while taking into account specific important circumstances. This is particularly important if a fundamentally uneconomic event also has significant economic consequences. If possible, the exact conclusion is not based purely on economic values; it is also necessary to consider uneconomic factors, including their future development. Predictions-based forecasts made only on the basis of subjective judgments cannot meet the strict requirements for the accuracy of outputs obtained by using objective techniques. As a result, the most reliable and the most trustworthy predictions are mostly based on fundamental and upgraded economic knowledge and standard statistical techniques.
The most frequent ant the most common form of errors in economic forecasting is probably still theoretical. Additionally, knowledge about the functioning of economic entities is relatively limited. Effective and accurate forecasting is also spoiled by the fact that economic conditions, economic policies and institutions change over time.
Time series represent consecutive series of observations measured at equal time intervals, and they are one of the most important concepts used to analyze economic processes and to make forecasts.
While the classical theory of econometric time series is well understood now [8,9], new techniques coming from the soft computing area have more recently been adopted for economic forecasts. The use of fuzzy sets in the area of analyzing and predicting time series came up based on the ability of fuzzy models to approximate functions and on the ease to understand the rules using linguistic variables by non-experts. Fuzzy time series methods have the advantage that they can deal with uncertainty or imprecision [10,11,12,13,14], which can be the case even for economic indicators computed based on data collected through probabilistic samples, or estimated using statistical models when the data are not available, or periodically updated based on the availability of new data sources (the revision policy for the datasets used in our experiments are given by the European Statistics Code of Practice, adopted by the European Statistical System Committee in 2011, and the European Statistical System Guidelines on the revision policy of the Principal European Economic Indicators), thus incorporating a certain degree of uncertainty.
Fuzzy time series were introduced by Song and Chisson [10], who developed the fuzzy time series model and applied it to some enrollment data series of University of Alabama, and later developed by Chen [11], who simplified the fuzzy time series prediction method using only arithmetic operations instead of max-min composition operations. Chen [12] also proposed a forecasting method based on high-order fuzzy time series of various orders: second, third, fourth and fifth. His experiments showed that the forecasts generated using the third-order model had better accuracy than other existing models.
Singh [13,15] proposed a new and computationally simplified method of fuzzy time series forecasting based on difference parameters. He tested his method on the same data series regarding students’ enrollment at the University of Alabama and compared the forecasted values obtained with his method against the results obtained by other existing methods to argue in favor of his method. Singh introduced a difference parameter as the fuzzy relation for forecasting and developed a forecasting algorithm of linear complexity.
Huarng [16] proposed a new heuristic model for time series forecasting using heuristic increasing and decreasing relations in order to increase the forecast accuracy. He tested his method on the same enrollments data and on Taiwan Future Exchange data.
Chen and Hsu [17] designed a first-order and time-variant forecast method starting form a division of the universe of discourse into intervals, and in a later stage re-dividing them into four, three, two and one depending on the statistical distribution of the data in each interval. The linguistic values represented by fuzzy sets are defined based on the intervals divided in the second stage, and then the data are fuzzified and the logical relationships are established. Finally, Chen’s method uses a set of rules to determine whether the forecasted data have an increasing or decreasing trend.
Recently, Liu et al. [18] proposed an ultra-short-term prediction approach based on the Takagi–Sugeno (T–S) fuzzy model, which has the advantage of a great linearization ability, by describing complex nonlinear systems by a number of linear or nearly linear systems and applied this model for wind power and wind speed forecast. The authors compared the results of their method with the results of some machine learning algorithms, showing the superiority of their method in increasing the forecasting accuracy.
Compared with classical time series forecasting methods, specific features of fuzzy time series add certain advantages: they do not require long series, and they can work with both crisp and fuzzy values. An extended overview of the fuzzy time series forecasting models was presented in Bose and Mali [14].
In this paper, using a set of time series with economic indicators, we compared the accuracy of some well-known econometric forecasting methods with that of two new methods based on the concept of fuzzy time series.
The next sections of the paper are organized as follows. In Section 2, we describe the data that we used in our experiments, the methods and the software used. In Section 3, we discuss the results that we obtained, and in the last section, we present the main conclusions and directions for future work.

2. Data and Methods

We used data series from the Eurostat database to test all the methods used in this study. Thus, we selected 3 monthly economic indicators that we considered important to assess the economic activity of a country from the Eurostat database with values from January 2017 to February 2020:
  • The volume index of production in construction (2015 = 100), seasonally and calendar-adjusted, for 20 European countries;
  • The volume index of production in the industry for electricity, gas, steam and air conditioning supply (2015 = 100), seasonally and calendar-adjusted, for 29 European countries;
  • The index of deflated turnover in retail trade, except for motor vehicles and motorcycles (2015 = 100), seasonally and calendar-adjusted, for 28 European countries.
From the available data series in the Eurostat database, we eliminated those countries with missing values for the time series, because we did not want the results to be influenced by an external factor such as the imputation method. The econometric methods used in our analysis are sensitive to missing values. In Figure 1, Figure 2 and Figure 3, we show a graphical representation of the datasets, while in Table 1, Table 2 and Table 3, we present descriptive statistics and the order of integration for each economic indicator selected and each country. We checked all the time series for seasonality using the Webel–Ollech overall seasonality test (WO test) [19], and none have a seasonal component, as we expected.
To test whether fuzzy time series models can be used as an alternative for time series forecasting and based on an overview study [20] that indicates exponential smoothing, moving average and autoregressive methods as the most widely used econometric forecasting techniques, we used the simple exponential smoothing, Holt and the ARIMA methods, and then compared the accuracy of predictions with the results obtained by using fuzzy methods.
The exponential smoothing method has been developed and used for time series forecasting since the 1950s [21,22], but only recently some procedures for automatic model selection were designed [23,24]. Exponential smoothing is a set of methods that was classified initially by Pegels [25], and then by Gardner [26], and Taylor [27]. In our study, we used the simple exponential smoothing method (sometimes called single exponential smoothing) [20] and the Holt method [28].
The simple exponential smoothing method (SEM) is usually applied for forecasting data, which do not exhibit a clear trend or seasonal pattern. For yt time series, the one-step-ahead forecast for time t + 1 through the SEM method is:
y ^ t + 1 | t = α y t + 1 α y ^ t | t 1 )
which means that one time ahead prediction is a weighted average between the most recent observation and the previous predicted value. We can write:
y ^ t + 1 | t = y ^ t | t 1 + α y t y ^ t | t 1
or
y ^ t + 1 | t = y ^ t | t 1 + α e t
where y ^ t + 1 | t is the smoothed value of the data series carried out at time t for t + 1, y ^ t | t 1 is the previous forecast, e t = y t y ^ t | t 1 is the forecasting error at time t, and 0 ≤ α ≤ 1 is the smoothing parameter. That is, the series forecasted value for time t + 1 is calculated as the forecasted value for time t, corrected with a proportion (α) of the forecast error. The smoothing parameter α is either exogenous or is calculated to minimize the in-sample sum of squared errors.
The simple exponential smoothing is generally appropriate for series that are random walk with or without drift. The Holt exponential smoothing method is a better forecasting method for data that reveal a linear trend, as it can be described as follows:
Forecast   equation :   y ^ t + h | t =   L t +   hb t
Level   equation :   L t =   α y t + 1   α L t 1 +   b t 1
Trend   equation :   b t =   β L t   L t 1 + 1   β b t 1
where y ^ t + h | t is the forecast carried out at time t for period t + h (h = 1, 2, …), Lt and L t 1 are the smoothed values of the level at time t and t − 1, respectively, bt and b t 1 denote the smoothed values for the trend, and 0 ≤ α ≤ 1 and 0 ≤ β ≤ 1 are the smoothing parameters for the level (α) and the trend (β), respectively. The smoothing parameters are exogenous, or they are calculated by using the sum of squared errors criterion (SSE) (or, alternatively the mean squared error (MSE), and root mean square errors (RMSE)).
Auto Regressive Integrated Moving Average (ARIMA) models [29,30] are widely used methods in the analysis and prediction of time series. In addition to exponential smoothing methods, ARIMA models aim to identify the stationarity and the autocorrelations in the data. The models are usually denoted as ARIMA(p, d, q), using p as the order (number of time lags) of the autoregressive part of the model, d to denote the degree of differencing and q for the order of the moving-average part. For a times series yt (possibly non-stationary in the sense of mean, but not non-stationary in variance/autocovariance), the ARIMA(p, d, q) model can be written as:
1 ϕ 1 L   ϕ 2 L 2     ϕ p L p 1   L d y t = 1 + θ 1 L + θ 2 L 2 + + θ q L q ε t
where L is the lag (backshift) operator (so that Ly t =   y t 1 ), ϕ 1 ,   ,   ϕ p are the p   coefficients in the autoregressive part of the model, θ 1 ,   ,   θ q are the q coefficients in the moving average part of the model, and εt is white noise. In polynomial notation, the ARIMA model can be written as:
Φ L 1 L d y t = Θ L ε t
where Φ(L) is a polynomial function of order p (the autoregressive part of the model), Θ(L) is a polynomial function of order q (the moving average part), and d is the order of integration (degree of differencing involved). A detailed discussion about ARIMA models can be found in Hamilton [9].
Besides classical methods of time series analysis, new, unconventional approaches have been developed based on the theory of fuzzy sets [31]. Among some of the most well-known time series analysis and forecasting methods based on fuzzy set theory, we can mention the Song and Chissom [10], Chen [11,12], Singh [13,15], heuristic [16], Chen–Hsu [17], Abbasov–Mamedova [32], and NFTS [33] methods. The first five methods cited above are used to fit/forecast time series even when historical values change over time. This could occur when the values of the time series are updated or changed due to different reasons, for example when new data are available to compute them. The last two methods allow users to make predictions for various time horizons.
Below, we present a very short introduction to the fuzzy time series concept. For more details, an interested reader can find a comprehensive review of fuzzy time series forecasting models in Bose and Mali [14].
Assuming that:
U   = u 1 ,   u 2 ,   ,   u n
is the universe of discourse where ui, i = 1 … n are the linguistic values, we can define the fuzzy set of linguistic variables Ai of U as:
A i =   μ A i u 1 / u 1 +   μ A i u 2 / u 2 + + μ A i u n / u n
where μ A i is the membership function of the fuzzy set Ai, μ A i :   U 0 , 1 , and μ A i u j represents the degree of belonging of u j to the fuzzy set Ai.
Let us assume that Z t   t   = 0 ,   1 ,   2 ,   is a subset of and it is also the universe of discourse of the fuzzy sets y i t ,   i   = 0 ,   1 ,   2 ,   . If Y(t) is the collection of y 1 t ,   y 2 t ,…, then Y(t) is a fuzzy time series defined over Zt. If Y(t) is caused only by Y(t − 1), which we will denote by Y t 1 Y t , then a fuzzy relationship is established between Y(t) and Y(t − 1), and it can be written as the following fuzzy relational equation:
Y t =   Y t 1 R t ,   t 1
where is the max-min composition operator and R is a relation named the first-order model of Y(t). If R t ,   t 1 is independent of time, R t 1 ,   t 2 1 =   R t 2 ,   t 2 1 for t 1 t 2 then Y(t) is called time invariant fuzzy time series.
If Y(t) is determined by several fuzzy sets Y t n ,   Y ( t n + 1 ,   ,   Y t 1 , then the fuzzy relationship is denoted as: A i 1 ,   A i 2 ,   ,   A in A j where A i 1 = Y t n ,   , A in = Y t 1 . This is called nth-order fuzzy time series model.
In cases when Y(t) is simultaneously determined by Y t 1 ,   Y t 2 ,   ,   Y t k ,   k > 0 and the relations are time variant, then Y(t) is called the time variant fuzzy time series and we have:
Y t =   Y t 1 R w t ,   t 1
where w > 1 is a time parameter by which Y(t) is affected, and there are several methods to compute the relation R w t ,   t 1 ranging from the max-min composition operations to more simplified rules.
The foundation of fuzzy time series forecasting was set by Song and Chissom [10] and later developed by various researchers. Thus, the Chen method [11] is basically an adapted version of the method presented in [10], but it is more efficient because it simplifies the arithmetic operations. Chen [11] provides a detailed description of the fitting procedure and showed that it can make robust predictions even in cases when the historical data are not accurate.
According to Chen, fuzzy time series analysis and forecasting consist of the following processing stages:
  • Set the universe of discourse
  • Divide the universe of discourse into several intervals
  • Fuzzify the datasets
  • Setting the fuzzy logical relationship and group
  • Defuzzification.
Huarng [16] proposed heuristic forecasting models through the integration of problem-specific heuristic knowledge with Chen’s model to obtain predictions that are more accurate. The empirical studies conducted by the author showed that the heuristic approach better reflects the fluctuations in fuzzy time series and obtained better overall forecasting results compared with previously developed methods.
The Chen–Hsu forecasting method [17] belongs to the first order and time-variant class of methods, and improves Chen’s method, obtaining a better prediction accuracy measured with MSE.
Singh [13,15] developed a forecasting method based on fuzzy time series which can efficiently cope up with cases when there are large fluctuations in consecutive values of the time series. Singh tested the method on time series from crop production and compared its accuracy with other existing methods, showing the superiority of his approach. His method can be summarized as:
  • Define U as the universe of discourse starting from the range of time series data, setting U   = Y min   D 1 ,   Y max   D 2 , with D1 and D2 being two convenient positive numbers
  • Divide U into equal length intervals u1, u2, … uk. The number of intervals should be equal to the number of linguistic variables A1, A2, … Ak
  • Build the fuzzy sets Ai according to the previously set intervals and apply triangular membership rule to each one
  • Fuzzify the available data and set the fuzzy logical relations A i A j where Ai corresponds to the fuzzified value at time step n and Aj to the time step n + 1.
  • Define the rules for forecasting. An interested reader is invited to consult [8,13] for a detailed description of the rules.
For our forecasting experiments, we chose the Abbasov–Mamedova and NFTS methods because they easily allow us to make predictions for a future time horizon, and they have efficient implementations.
Starting from a population forecasting problem, Abbasov and Mamedova [32] proposed a new method of time series forecasting that can even use incomplete, fuzzy input data. They successfully applied their procedure to demographic data, showing the effectiveness of the proposal. Their proposal can be described as follows:
  • Define the universe of discourse (U) starting from the range of time series data;
  • Divide the set U into equal-length intervals
  • Determine the respective values of linguistic variable
  • Fuzzify the input data
  • Select the parameter W > 1, and compute the fuzzy relationships matrix
  • Defuzzify the obtained results.
In the first step, the universal set U is set as U   = Y min   D 1 ,   Y max   D 2 , where D1 and D2 are two positive numbers, and then this set is divided into several equal intervals. The default number of intervals is 7, and the fuzzy set A 1 ,   A 2 ,   ,   A 7 is defined on the universal set U by using the following equation: μ A i = 1 1 + C · U u m i 2 , where u m i is the corresponding interval. After selecting a value for the w parameter, the R w relationship matrix is computed, and then the forecasted value can be calculated. For a detailed description of this method, an interested reader can consult [32].
The NFTS [33] method can interpolate historical data to make predictions. It normalizes the original data, and automatically computes the number of clusters and finds the fuzzy relationships of each element in series to the clusters. The convergence of the method is theoretically demonstrated and experimentally tested by the authors. The accuracy of predictions was compared with other existing models using several well-known datasets, showing the advantages of their proposal. The novelty of this method starts with the procedure of computing the number of clusters for the universal set U. While other methods set this number to a constant, the NTFS method proposes an automatic iterative algorithm to determine the optimal number of clusters (suitable number of clusters (SNC)). Then, to determine the elements in each cluster and also the fuzzy relation of each element to the cluster, one can use an automatic iterative procedure. The authors of this method demonstrated the convergence of the proposed iterative procedures for all times series.
We used already existing implementations of the above-mentioned forecasting methods, namely the AnalyzeTS R package [34] for the Abbasov–Mamedova and NFTS fuzzy methods.
We employed a standard technique and divided each times series used for experimentation in two parts: a first part used for model fitting (training) and a second part for testing and measuring out-of-sample prediction errors. The second part contains 2, 4 or 6 values for each time series and methods tested, thus measuring the performance of forecasting methods for 2, 4 and 6 steps ahead. The accuracy of forecasting models was evaluated using the mean absolute error (MAE), which reflects the amplitude of errors, the mean absolute percent error (MAPE), which is equivalent to standardized MAE and reflects absolute corresponding degree, the root of mean square error (RMSE), which reflects the closeness of the error distribution, and the mean absolute scaled error (MASE), a measure proposed in Hyndman and Koehler [35]. The formulas for computing these accuracy measures are given in Equations (13)–(16). We mention that MAPE is particularly important for time series with strong trends. MAE and RMSE are considered scale-dependent errors, MAPE is a percentage accuracy measure and is consequently unit free, and MASE is a scaled accuracy measure, considered a good alternative to percentage errors. MAE is very popular, being easy to understand and compute, and minimizing MAE will lead to forecasts of the median, while minimizing the value of the RMSE will lead to forecasts of the mean. The RMSE is also extensively used in forecasting applications, although it implies computations that are more complex.
MAE y pred ,   y = t = 1 n y pred , t y t n
MAPE y pred ,   y = t = 1 n y pred , t y t y t n
RMSE y pred ,   y = t = 1 n y pred , t y t 2 n
MASE y pred ,   y = 1 n t = 1 n y pred , t y t 1 T 1 t = 2 T | Y t Y t 1 |
where y pred , t denotes the forecasted value at the time instant t, yt denotes the actual value, and n denotes the prediction time horizon. In the case of MASE, the denominator is the MAE of the one-step naive forecast method on the training set.
All the calculations were performed using the R software system [36], and the script used to process the data together with the datasets are freely available [37]. The number of partitions of the universe of discussion was set to 6 for both fuzzy methods. This value was obtained by using the empirical formula given by Efendi et al. [38]:
n   = 1 + 1.33   · log 10 T
where T is the forecast horizon length. Forecasting was done using the fuzzy.ts2() function from the AnalyzeTS R package, which implements the Abbasov–Mametova and NFTS forecasting methods.
The implementations of the econometric times series forecasting methods used in our experiments, the simple exponential smoothing, Holt, and the ARIMA method, were those provided by the forecast R package [39,40], which also has an automatic procedure for setting the optimal parameters of them.

3. Results

The time series processing was performed by using the R script prelTS.R which is freely available [37]. We first fitted all models using the training datasets and then computed the two, four, and six points ahead predictions for each time series using the simple exponential smoothing, Holt, ARIMA, Abbasov–Mametova and NFTS methods. Then, we computed the out-of-sample prediction accuracy measures MAE, MAPE, RMSE, MASE using the test datasets.
The AnalyzeTS R package provides a function to compute the optimal value of the C parameter [34] for the fuzzy.ts2() method, and we called this function for all-time series and used the corresponding value for fitting and forecasting. The rest of the parameters of the fuzzy.ts2() function were left to their default values.
We present the values of the accuracy measures in the Supplementary Materials, Tables S1–S9. Then, we counted how many times the corresponding accuracy measure is lower for a fuzzy time series forecasting method than for a classical one. The results are presented in Table 4, which summarizes all 77 time series under consideration and all 3 prediction horizons.
We observed that all four accuracy indicators showed approximately the same results. Considering the RMSE, on average, in 37% of the cases for h = 2, 47% of the cases for h = 4 and 39% of the cases for h = 6, fuzzy forecasting methods had better results than the econometric ones for the volume index of production in construction. For the volume index of production in industry, this percentage was 52% for h = 2, 45% for h = 4 and 25% for h = 6, while for the index of deflated turnover in retail trade, it was 25% for h = 2, 34% for h = 4, and 28% for h = 6.
We also observed that, on average, increasing the prediction horizon from two to four resulted in better results for both fuzzy methods, while going to a prediction horizon of six periods resulted in a drop in accuracy, which is expected, with the length of the forecasting time horizon negatively influencing the accuracy of the forecasts for other several methods [41].
To support our affirmation, we computed the average value for the accuracy measures for the volume index of production in construction for both fuzzy time series methods and plotted then against the prediction horizon in Figure 4 and Figure 5.
Although not superior to the classical econometric time series methods, the fuzzy time series forecasting methods performed sufficiently well to be considered as an alternative. Nevertheless, the experimental results encourage us to say that with further improvements and parameter setting, the fuzzy methods tested in this study could be considered as alternatives to other forecasting methods. We could not detect any relationship between the stationarity/non-stationarity of the time series and the accuracy of the predictions.

4. Conclusions and Future Work

Times series forecasting is very important in decision-making in several economic domains, such as finance, macroeconomics, agriculture and tourism, as well as in other scientific areas such as the global climate or earthquake prediction. One of the biggest challenges for researchers in the field of time series forecasting is the selection of the best method to compute predictions.
Classical econometric techniques for time series analysis and forecasting are well understood now, but new methods coming from the soft computing area have emerged in this field. In this paper, we investigated whether fuzzy time series forecasting methods can be used for economic indicators. We compared three econometric time series methods, namely the simple exponential smoothing, Holt, and ARIMA with two fuzzy time series forecasting methods, Abbasov–Mamedova and NFTS, using 77 time series extracted from the Eurostat database. We ran the five above mentioned methods for two, four and six points ahead predictions and computed several accuracy indicators: root of mean square error, mean absolute error, mean absolute percent error, and mean absolute scaled error. Our experimental results show that fuzzy time series forecasting methods, while not yet superior in terms of prediction accuracy to econometric methods, could be considered, with further improvements, potential alternatives to the econometric techniques, especially for relatively short time series. We also noticed that the forecast horizon influences the prediction accuracy in the case of the fuzzy time series methods; an increase from 4 to 6 months in the forecast horizon leads to an important decrease in the accuracy. The accuracy indicators used in this study showed approximately the same results, and thus any of them could be used in assessing the prediction performance. During our experiments, we noticed that the prediction accuracy is greatly influenced by the parameters of the two fuzzy time series methods. We used only the optimal value of the C parameter provided by the AnalyzeTS R package, with the other parameters being set to their default value. Based on our results, it becomes clear that a grid search procedure to detect the optimal value of each parameter is necessary. The grid search procedure is computationally expensive, so a parallelization of the computations is also needed.
Other soft computing methods have been successfully used for time series predictions, such as the recurrent neural networks, long short-term memory networks or SVM-based methods [42]. Together with the fuzzy methods used in this study, they can be considered as good alternatives to the classical econometric time series methods when these later methods fail to provide good predictions. As future work, we intend to further investigate how the accuracy of the predictions change with the length of the time series and the integration order and with other features of the time series, as well as to test other methods based on fuzzy sets theory. We also intend to widen the spectrum of datasets used for experimentation, including time series with different features. To optimize the performance of fuzzy methods, we intend to develop an efficient automatic procedure to select the optimal values for all parameters of the fuzzy time series forecast methods. There are other studies that show a better prediction accuracy for forecasting methods coming from the soft computing area than the econometric methods for specific time series [43]. By widening the spectrum of the time series investigated, we intend to detect patterns for which fuzzy methods are better than classical econometric methods in terms of prediction accuracy.

Supplementary Materials

The following are available online at https://doi.org/10.6084/m9.figshare.14973867.v1. Table S1: The accuracy measures for the 2 points ahead predictions of the volume index of production in construction, Table S2: The accuracy measures for the 2 points ahead predictions of the volume index of production in industry for electricity, gas, steam and air conditioning supply, Table S3: The accuracy measures for the 2 points ahead predictions of the index of deflated turnover in retail trade, except for motor vehicles and motorcycles, Table S4: The accuracy measures for the 4 points ahead predictions of the volume index of production in construction, Table S5: The accuracy measures for the 4 points ahead predictions of the volume index of production in industry for electricity, gas, steam and air conditioning supply, Table S6: The accuracy measures for the 4 points ahead predictions of the index of deflated turnover in retail trade, except for motor vehicles and motorcycles, Table S7: The accuracy measures for the 6 points ahead predictions of the volume index of production in construction, Table S8: The accuracy measures for the 6 points ahead predictions of the volume index of production in industry for electricity, gas, steam and air conditioning supply, Table S9: The accuracy measures for the 6 points ahead predictions of the index of deflated turnover in retail trade, except for motor vehicles and motorcycles.

Author Contributions

The authors contributed equally to the writing and approved the final manuscript of this paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data used in this study are available at https://doi.org/10.6084/m9.figshare.14971707.v1, (accessed on 13 July 2021).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Allen, P.G.; Fildes, R. Econometric forecasting strategies and techniques. In Principles of Forecasting: A Handbook for Researchers and Practitioners; Armstrong, J.S., Ed.; Kluwer Academic Press: Norwell, MA, USA, 2001; pp. 1–45. [Google Scholar]
  2. Diebold, F.X. Elements of Forecasting; South-Western College Publishing: La Jolla, CA, USA, 1998. [Google Scholar]
  3. Coyle, D. Making sense of published economic forecasts. In Understanding Economic Forecasts; Hendry, D.F., Ericsson, N.R., Eds.; MIT Press: Cambridge, MA, USA, 2001; pp. 54–67. [Google Scholar]
  4. Granger, C.W.J.; Newbold, P. Forecasting Economic Time Series, 2nd ed.; Academic Press: Cambridge, MA, USA, 1986. [Google Scholar]
  5. Whitley, J.D. A Course in Macroeconomic Modeling and Forecasting; Harvester Wheatsheaf: London, UK, 1994. [Google Scholar]
  6. Ghosal, V.; Loungani, P. The Differential Impact of Uncertainty on Investment in Small and Large Businesses. Rev. Econ. Stat. 2000, 82, 338–343. [Google Scholar] [CrossRef]
  7. Liberto, D. Strategic analysis for investors. Invest. Chron. 2019, 127, 169–188. [Google Scholar]
  8. Enders, W. Applied Econometric Time Series; Wiley: New York, NY, USA, 1995. [Google Scholar]
  9. Hamilton, J. Time Series Analysis; Princeton University Press: Princeton, NJ, USA, 1994. [Google Scholar]
  10. Song, Q.; Chissom, B.S. Forecasting enrollments with fuzzy time series—Part I. Fuzzy Sets Syst. 1993, 54, 1–9. [Google Scholar] [CrossRef]
  11. Chen, S.-M. Forecasting enrollments based on fuzzy time series. Fuzzy Sets Syst. 1996, 81, 311–319. [Google Scholar] [CrossRef]
  12. Chen, S.M. Forecasting enrollments based on high-order fuzzy time series. Cybern. Syst. Int. J. 2002, 33, 1–16. [Google Scholar] [CrossRef]
  13. Singh, S.R. A computational method of forecasting based on fuzzy time series. Math. Comput. Simul. 2008, 79, 539–554. [Google Scholar] [CrossRef]
  14. Bose, M.; Mali, K. Designing fuzzy time series forecasting models: A survey. Int. J. Approx. Reason. 2019, 111, 78–99. [Google Scholar] [CrossRef]
  15. Singh, S.R. A simple method of forecasting based on fuzzy time series. Appl. Math. Comput. 2007, 186, 330–339. [Google Scholar] [CrossRef]
  16. Huarng, K. Heuristic models of fuzzy time series for forecasting. Fuzzy Sets Syst. 2001, 123, 369–386. [Google Scholar] [CrossRef]
  17. Chen, S.M.; Hsu, C.C. A New method to forecast enrollments using fuzzy time series. Int. J. Appl. Sci. Eng. 2004, 12, 234–244. [Google Scholar]
  18. Liu, F.; Li, R.; Dreglea, A. Wind Speed and Power Ultra Short-Term Robust Forecasting Based on Takagi–Sugeno Fuzzy Model. Energies 2019, 12, 3551. [Google Scholar] [CrossRef] [Green Version]
  19. Ollech, D.; Webel, K. A Random Forest-Based Approach to Identifying the Most Informative Seasonality Tests; Bundesbank Discussion Paper 55/2020; Deutsche Bundesbank: Frankfurt am Main, Germany, 2020. [Google Scholar]
  20. Armstrong, J.S. Selecting Forecasting Methods. 2009. Available online: https://ssrn.com/abstract=1941247 (accessed on 5 May 2021). [CrossRef] [Green Version]
  21. Brown, R.G. Exponential Smoothing for Predicting Demand; Arthur D. Little Inc.: Cambridge, MA, USA, 1956; p. 15. [Google Scholar]
  22. Brown, R.G. Statistical Forecasting for Inventory Control; McGraw-Hill: New York, NY, USA, 1959. [Google Scholar]
  23. Ord, J.K.; Koehler, A.B.; Snyder, R.D. Estimation and Prediction for a Class of Dynamic Nonlinear Statistical Models. J. Am. Stat. Assoc. 1997, 92, 1621–1629. [Google Scholar] [CrossRef]
  24. Hyndman, R.J.; Koehler, A.B.; Ord, J.K.; Snyder, R.D. Prediction Intervals for Exponential Smoothing Using Two New Classes of State Space Models. J. Forecast. 2005, 24, 17–37. [Google Scholar] [CrossRef]
  25. Pegels, C.C. Exponential Forecasting: Some New Variations. Manag. Sci. 1969, 15, 311–315. [Google Scholar]
  26. Gardner, E.S., Jr. Exponential smoothing: The state of the art. J. Forecast. 1985, 4, 1–28. [Google Scholar] [CrossRef]
  27. Taylor, J.W. Exponential smoothing with a damped multiplicative trend. Int. J. Forecast. 2003, 19, 715–725. [Google Scholar] [CrossRef] [Green Version]
  28. Holt, C.C. Forecasting Seasonals and Trends by Exponentially Weighted Averages, O.N.R. Memorandum No. 52; Carnegie Institute of Technology: Pittsburgh, PA, USA, 1957. [Google Scholar]
  29. Box, G.E.P.; Jenkins, G.M. Time Series Analysis: Forecasting and Control; Holden-Day Inc.: San Francisco, CA, USA, 1970. [Google Scholar]
  30. Box, G.E.P.; Jenkins, G.M.; Reinsel, G.C.; Ljyng, G.M. Time Series Analysis: Forecasting and Control, 5th ed.; Wiley: Hoboken, NJ, USA, 2015. [Google Scholar]
  31. Zadeh, L.A. Fuzzy sets. Inf. Control 1965, 8, 338–353. [Google Scholar] [CrossRef] [Green Version]
  32. Abbasov, A.M.; Mamedova, M.H. Application of fuzzy time series to population forecasting. In Proceedings of the 8th Symposion on Information Technology in Urban and Spatial Planning, Vienna University of Technology, Vienna, Austria, 25 February–1 March 2003; pp. 545–552. [Google Scholar]
  33. Vovan, T.; Ledai, N. A New Fuzzy Time Series Model Based on Cluster Analysis Problem. Int. J. Fuzzy Syst. 2019, 21, 852–864. [Google Scholar] [CrossRef]
  34. Han, T.T.N.; Nghi, D.H.; Diem, M.T.H.; My, N.T.D.; Minh, H.V.; Vo, V.T.; Truc, P.M. AnalyzeTS: Analyze Fuzzy Time Series, 2019, R Package Version 2.3. Available online: https://CRAN.R-project.org/package=AnalyzeTS (accessed on 1 June 2021).
  35. Hyndman, R.J.; Koehler, A.B. Another look at measures of forecast accuracy. Int. J. Forecast. 2006, 22, 679–688. [Google Scholar] [CrossRef] [Green Version]
  36. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2020; Available online: https://www.R-project.org/ (accessed on 1 June 2021).
  37. Oancea, B.; Pospisil, R.; Jula, M.; Imbrișcă, C.-I. Data Sets and R Code for Experimentation of Forecasting Methods, 2021. Figshare, Dataset, 2021. Available online: https://figshare.com/articles/dataset/Data_sets_and_R_code_for_experimentation_of_forecasting_methods/14971707/1 (accessed on 13 July 2021). [CrossRef]
  38. Efendi, R.; Ismail, Z.; Deris, M.M. A new linguistic out-sample approach of fuzzy time series for daily forecasting of Malaysian electricity load demand. Appl. Soft Comput. 2015, 28, 422–430. [Google Scholar] [CrossRef]
  39. Hyndman, R.; Athanasopoulos, G.; Bergmeir, C.; Caceres, G.; Chhay, L.; O’Hara-Wild, M.; Petropoulos, F.; Razbash, S.; Wang, E.; Yasmeen, F. Forecast: Forecasting Functions for Time Series and Linear Models. R Package Version 8.13. 2020. Available online: https://pkg.robjhyndman.com/forecast (accessed on 1 June 2021).
  40. Hyndman, R.J.; Khandakar, Y. Automatic time series forecasting: The forecast package for R. J. Stat. Softw. 2008, 26, 1–22. [Google Scholar] [CrossRef] [Green Version]
  41. Petropoulos, F.; Makridakis, S.; Assimakopoulos, V.; Nikolopoulos, K. ‘Horses for Courses’ in demand forecasting. Eur. J. Oper. Res. 2014, 237, 152–163. [Google Scholar] [CrossRef] [Green Version]
  42. Dadabada, P.; Ravi, V. Soft computing hybrids for FOREX rate prediction: A comprehensive review. Comput. Oper. Res. 2018, 99, 262–284. [Google Scholar] [CrossRef]
  43. William, R.; O’Connor, M. Neural Networks for Time-Series Forecasting. In Principles of Forecasting: A Handbook for Researchers and Practitioners; Armstrong Scott, J., Ed.; Kluwer Academic Publishers: Dordrecht, The Netherlands, 2001. [Google Scholar]
Figure 1. The volume index of production in construction.
Figure 1. The volume index of production in construction.
Mathematics 09 02517 g001
Figure 2. The volume index of production in industry for electricity, gas, steam and air conditioning supply.
Figure 2. The volume index of production in industry for electricity, gas, steam and air conditioning supply.
Mathematics 09 02517 g002
Figure 3. The index of deflated turnover in retail trade, except for motor vehicles and motorcycles.
Figure 3. The index of deflated turnover in retail trade, except for motor vehicles and motorcycles.
Mathematics 09 02517 g003
Figure 4. Accuracy measures for the Abbasov–Mamedova method versus the prediction horizon for the volume index of production in construction horizon.
Figure 4. Accuracy measures for the Abbasov–Mamedova method versus the prediction horizon for the volume index of production in construction horizon.
Mathematics 09 02517 g004
Figure 5. Accuracy measures for the NTFS method versus prediction horizon for the volume index of production in construction horizon.
Figure 5. Accuracy measures for the NTFS method versus prediction horizon for the volume index of production in construction horizon.
Mathematics 09 02517 g005
Table 1. Descriptive statistics for the volume index of production in construction.
Table 1. Descriptive statistics for the volume index of production in construction.
CountryMin1sq QUMedianMean3rd QUMaxVARSDSEOrder of Integration
Belgium96.4100.62101.8101.95103107.33.411.840.291
Bulgaria73.789.0289.889.5590.8295.411.683.410.551
Czechia90.499.3106.9104.35109.4112.637.596.130.991
Denmark103.6109.97112.9112.49114.62125.914.473.80.611
Germany101.4109.25109.9110.63112.35119.110.383.220.521
Spain95.7102.32104.7103.73105.61109.843.130.51
France98.4101.25102.7102.65103.55107.14.12.020.322
Croatia98.5106.47112.75113.19118.77131.459.737.721.251
Italy95.5101.8102.7103.13104.97112.19.593.090.50
Luxembourg94.6105.4108.45108.38111.25125.433.245.760.930
Hungary87.8109.82132.5129.62150.3167.1584.7924.183.921
Netherlands112.2118.42124123.18127.55132.632.425.690.921
Austria101110.07116.85116.45122.25127.951.637.181.161
Poland85.1101.5117.65112.87122.65129.6153.1212.3721
Portugal96.699.37101.45101.19103.3106.67.122.660.430
Romania82.988.794.199.12110.82135.7187.4913.692.222
Slovenia78.3109.5126.55122.38134.42144.1240.8915.522.511
Slovakia87.893.1296.3596.2899.2104.919.694.430.712
Finland109.2111.22113.1112.95114.45118.14.372.090.331
Sweden105116.45121.6119.68122.77129.425.355.030.811
Table 2. Descriptive statistics for the volume index of production in industry for electricity, gas, steam and air conditioning supply.
Table 2. Descriptive statistics for the volume index of production in industry for electricity, gas, steam and air conditioning supply.
CountryMin1sq QUMedianMean3rd QUMaxVARSDSEOrder of Integration
Belgium92.6110.35121.65122.2132.4159.9219.5914.812.42
Bulgaria75.282.3285.787.292.67101.450.367.091.152
Czechia93.1102.02106.8106.27111.6115.837.196.090.981
Denmark81.996.12102.55102.68108.6123.367.678.221.330
Germany88.793.5598.3597.24100.8106.320.834.560.741
Estonia64.789.55115.95107.56121.15149573.323.943.882
Ireland104112.85115.5117.23121.87132.947.796.911.120
Greece88.3101.02106.65105.8110.45131.467.528.211.331
Spain88.494.697.3597.4399.17107.520.544.530.731
France94.997.8599.55100.01101.1110.811.683.410.551
Croatia90.1104.52110.55110.34115.92121.954.837.41.21
Italy91.6101.55102.95103.11105.27111.414.43.790.611
Cyprus103.4108.6110.85111.25112.95120.815.93.980.640
Latvia93101.9111.4110.37117.95128.486.679.31.511
Lithuania9398.97102.8102.48105.82116.731.445.60.90
Luxembourg81.490.9296.2597.29104.42109.463.537.971.292
Hungary93.396.499.3599.26101.05116.214.793.840.621
Malta74.2128.52150.7144.28162.67208.71012.4431.815.161
Netherlands102.8105.72110112.86119.6213377.638.811.422
Austria104112.15119.45118.47124.87133.269.58.331.352
Poland91.797.32106.9105.45111.07120.354.897.41.21
Portugal92.2111.12116.55118.68126.2146.4141.0711.871.921
Romania93.9100.97102.95102.62105.32110.515.583.940.641
Slovenia86.595.759998.77101.02113.523.834.880.791
Slovakia100.7106108.7108.56112.0711714.023.740.62
Finland93.296.359999.15100.72110.814.513.80.611
Sweden91.699.8101.2101.56103.65114.514.083.750.61
Norway86.894.1597.9598.44103112.537.456.120.990
Switzerland79.995.32100.8101.37107.0712161.677.851.271
Table 3. Descriptive statistics for the index of deflated turnover in retail trade, except for motor vehicles and motorcycles.
Table 3. Descriptive statistics for the index of deflated turnover in retail trade, except for motor vehicles and motorcycles.
CountryMin1sq QUMedianMean3rd QUMaxVARSDSEOrder of Integration
Belgium98.499.82100.3100.42100.77102.60.820.90.141
Bulgaria109.6112.72115.9116.5120.75128.821.384.620.752
Czechia106.9113.45116.8116.99121.75125.226.745.170.831
Denmark101.3102.5103.55103.48104.6105.31.591.260.21
Germany103.1106.15107.95108.29110.8112.76.912.620.421
Estonia103.8105.92107.5108.53110.75115.810.583.250.521
Ireland107111.85116.35115.83120.27123.822.264.710.760
Greece97.1100.62102.2101.96103.25106.14.122.030.321
Spain102.6105.22105.8106.21107.37109.72.871.690.271
France105.1107.9110.1109.98112.4114.38.122.850.461
Croatia102106.07111.75110.06114.12116.820.764.550.731
Cyprus106.1113.5117.25117.01120.45125.124.284.920.791
Latvia103.6108.2111.4110.61112.95119.713.33.640.591
Lithuania109.2112.4119.8118.98124.15131.241.516.441.041
Luxembourg28.230.330.931.0632.233.21.521.230.21
Hungary107112.87118.75118.75124.5132.250.627.111.151
Malta104.2108.42110.35112.26117123.124.774.970.81
Netherlands102.5105.32107.95107.64109.85112.56.462.540.411
Austria100.7102.22103.1102.93103.67106.21.511.230.191
Poland109.6114.77119.35119.6124.313239.716.31.021
Portugal103.7108.45111.7112.09115.52124.522.654.750.772
Romania117.3128.5132.85133.76139.45152.273.998.61.391
Slovenia110.8112.82116.15117.53121.45129.128.215.310.862
Slovakia105.4109.52110.2110.75112.42115.96.672.580.413
Finland103.2105.27106.5107.06109.2111.95.532.350.382
Sweden102.4104.37105.75105.93107.55110.94.132.030.320
Norway100.2101.9102.4102.42102.9105.20.840.910.140
Switzerland98.3100.8101.5101.39101.87103.41.351.160.181
Table 4. The number of cases the fuzzy time series forecast methods showed a better accuracy than the econometric methods for h = 2, 4, and 6 steps ahead predictions.
Table 4. The number of cases the fuzzy time series forecast methods showed a better accuracy than the econometric methods for h = 2, 4, and 6 steps ahead predictions.
Accuracy IndicatorFuzzy Time Series Forecasting MethodEconometric MethodThe Volume Index of Production in Construction
(20 Time Series)
The Volume Index of Production in Industry for Electricity, Gas, Steam and Air Conditioning Supply
(29 Time Series)
The Index of Deflated Turnover in Retail Trade, Except for Motor Vehicles and Motorcycles
(28 Time Series)
h = 2h = 4h = 6h = 2h = 4h = 6h = 2h = 4h = 6
MAEAbbasov–MamedovaSES811714136101211
Abbasov–MamedovaHolt710915147786
Abbasov–MamedovaARIMA78714157678
NFTSSES81071312610811
NFTSHolt899141396104
NFTSARIMA887141294127
MAPEAbbasov–MamedovaSES810714136101212
Abbasov–MamedovaHolt79915127786
Abbasov–MamedovaARIMA78715137678
NFTSSES910813126101311
NFTSHolt899141396104
NFTSARIMA88714118487
RMSEAbbasov–MamedovaSES81161312691212
Abbasov–MamedovaHolt710914146685
Abbasov–MamedovaARIMA78714136677
NFTSSES91081613781311
NFTSHolt811913148693
NFTSARIMA710716139586
MASEAbbasov–MamedovaSES811714136101211
Abbasov–MamedovaHolt7109151477106
Abbasov–MamedovaARIMA78714157688
NFTSSES91071312681211
NFTSHolt899141396104
NFTSARIMA88714129577
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Oancea, B.; Pospíšil, R.; Jula, M.N.; Imbrișcă, C.-I. Experiments with Fuzzy Methods for Forecasting Time Series as Alternatives to Classical Methods. Mathematics 2021, 9, 2517. https://doi.org/10.3390/math9192517

AMA Style

Oancea B, Pospíšil R, Jula MN, Imbrișcă C-I. Experiments with Fuzzy Methods for Forecasting Time Series as Alternatives to Classical Methods. Mathematics. 2021; 9(19):2517. https://doi.org/10.3390/math9192517

Chicago/Turabian Style

Oancea, Bogdan, Richard Pospíšil, Marius Nicolae Jula, and Cosmin-Ionuț Imbrișcă. 2021. "Experiments with Fuzzy Methods for Forecasting Time Series as Alternatives to Classical Methods" Mathematics 9, no. 19: 2517. https://doi.org/10.3390/math9192517

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop