1. Introduction
Financial markets are complex systems in which fluctuations result from the combined action of many variables. These variables cause frequent market fluctuations whose trends exhibit degrees of ambiguity, inconsistency, and uncertainty. This pattern underlines the importance of time series representations and creates an urgent demand for more detailed analysis of time series data. To some extent, effective time series representation can be approached from two directions: traditional time series prediction approaches [1,2,3,4] and fuzzy time series prediction approaches [5,6]. The former represents the time series with crisp sets, while the latter uses fuzzy sets.
Generally speaking, data are the source of prediction processes and the inputs of prediction systems. The original data, however, are full of noise, incompleteness, and inconsistency, which limits the effectiveness of traditional prediction methods. Therefore, Song and Chissom [7,8,9] developed a fuzzy time series model to predict real-world scenarios such as college enrollment. Fuzzification effectively removes part of the noise in the data and thereby strengthens the prediction performance of the time series. Subsequently, as research advanced, the indeterminacy of information became the main obstacle to prediction accuracy. Some studies proposed novel information representation approaches, such as the type 2 fuzzy time series [5], rough set fuzzy time series [10], and intuitionistic fuzzy time series [11].
Although the above work has achieved considerable results for specific problems, certain shortcomings remain that limit the accuracy and applicability of predictions. More specifically, the complex scenarios and variables found in practice make it unrealistic to explicitly define and classify the membership and non-membership of elements.
Neutrosophic sets (NSs), first proposed by Smarandache [12], are suitable for expressing incomplete, indeterminate, and inconsistent information. A neutrosophic set consists of truth-, indeterminacy-, and falsity-memberships. From the perspective of information representation, scholars have proposed two specific concepts based on the neutrosophic set: single-valued NSs [13] and interval-valued NSs [14]. These concepts seek a more detailed information representation, thereby enabling NSs to quantify uncertain information more accurately. Entropy, in turn, is an important measure of the degree of complexity and inconsistency. In a nutshell, entropy focuses on representing and measuring inconsistency, while NSs tend to describe uncertainty. Zadeh [15] first proposed the entropy of fuzzy events, which measures the uncertainty of fuzzy events by probability. Subsequently, De Luca and Termini [16] proposed the concept of entropy for fuzzy sets (FSs) based on Shannon’s information entropy theory and further proposed a method of fuzzy entropy measurement. Since information entropy is an effective measure of the degree of order in a system, it has been gaining popularity in different applications, such as climate variability [17], uncertainty analysis [18,19], financial analysis [20], image encryption [21], and detection [22]. Specifically, He et al. [23] proposed a collapse hazard forecasting method and applied the information entropy measurement to reduce the influence of collapse activity indices. Bariviera [24] proposed a prediction method based on the maximum entropy principle to predict the market and further monitor market anomalies. In Liang’s research [25], information entropy was introduced to analyze trends for capacity assessment of sustainable hydropower development. Zhang et al. [26] proposed a signal recognition theory and algorithm based on information entropy and ensemble learning, which applied various types of information entropy, including energy entropy and Rényi entropy.
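For reference, the two entropy notions mentioned above are commonly written as follows. The notation is illustrative and is not taken from the cited works: $p_i$ denotes the probability of the $i$-th outcome, $\mu_A(x_i)$ the membership degree of element $x_i$ in the fuzzy set $A$, and $K$ a positive normalizing constant.

```latex
% Shannon information entropy of a discrete distribution (p_1, ..., p_n)
H(p) = -\sum_{i=1}^{n} p_i \ln p_i

% De Luca--Termini entropy of a fuzzy set A on {x_1, ..., x_n}
H(A) = -K \sum_{i=1}^{n} \Bigl[ \mu_A(x_i) \ln \mu_A(x_i)
        + \bigl(1 - \mu_A(x_i)\bigr) \ln \bigl(1 - \mu_A(x_i)\bigr) \Bigr]
```

Both expressions vanish when there is no uncertainty (a single certain outcome, or memberships that are all 0 or 1) and are largest when the outcomes are equally likely or the memberships all equal 0.5.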
In order to describe the indeterminacy of fluctuations and to measure the inconsistency and uncertainty of dynamic fluctuation trends, we propose a neutrosophic forecasting model based on NSs and the information entropy of high-order fuzzy fluctuation time series (NFM-IE). The biggest difference from the original models is that NFM-IE represents both fluctuation trend information and fluctuation consistency information. First, a time series is converted to a fluctuation time series by comparing each value with its predecessor in the time series. Then, the upward trend of each fluctuation is mapped to the truth-membership of a neutrosophic set and the downward trend to the falsity-membership. The information entropy of the high-order fluctuation time series is introduced to describe the inconsistency of historical fluctuations and is mapped to the indeterminacy-membership of the neutrosophic set. Finally, an existing similarity measurement method for neutrosophic sets is used to find similar states during the forecasting stage, and the weighted arithmetic averaging (WAA) aggregation operator is employed to obtain the forecasting result according to the corresponding similarity. The main contributions of the proposed model are as follows: (1) Introducing information entropy to quantify the inconsistency of fluctuations in related periods and mapping it to the indeterminacy-membership of neutrosophic sets allows NFM-IE to extend traditional forecasting models. (2) Employing a similarity measurement method and an aggregation operator allows NFM-IE to integrate more possible rules. In order to test its performance, we used the proposed model to forecast several real-world time series, such as the Taiwan Stock Exchange Capitalization Weighted Stock Index (TAIEX), the Shanghai Stock Exchange Composite Index (SHSECI), and the Hang Seng Index (HSI).
The experimental results show that the model has a stable prediction ability across different datasets. In addition, comparing its prediction error with that of other approaches indicates that the model offers good prediction accuracy and generality.
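The pipeline outlined above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the NFM-IE implementation: the three-level fuzzification with a fixed threshold, the use of window frequencies as truth- and falsity-memberships, the normalized Shannon entropy as indeterminacy-membership, and the distance-based similarity are all stand-ins chosen for brevity. All function and parameter names are hypothetical.

```python
import math
from collections import Counter

def fluctuations(series):
    """Fluctuation series: each value minus its predecessor."""
    return [cur - prev for prev, cur in zip(series, series[1:])]

def fuzzify(delta, threshold):
    """Assumed 3-level fuzzification: 1 = up, 0 = roughly equal, -1 = down."""
    if delta > threshold:
        return 1
    if delta < -threshold:
        return -1
    return 0

def normalized_entropy(labels):
    """Shannon entropy of the label frequencies, divided by ln(3) to lie in [0, 1]."""
    n = len(labels)
    probs = [count / n for count in Counter(labels).values()]
    return sum(-p * math.log(p) for p in probs) / math.log(3)

def neutrosophic_state(labels):
    """(truth, indeterminacy, falsity) for one window of fuzzified fluctuations."""
    n = len(labels)
    truth = labels.count(1) / n      # share of upward moves (assumption)
    falsity = labels.count(-1) / n   # share of downward moves (assumption)
    return (truth, normalized_entropy(labels), falsity)

def similarity(a, b):
    """Stand-in similarity: 1 minus the Euclidean distance scaled by sqrt(3)."""
    dist = math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return 1.0 - dist / math.sqrt(3)

def forecast_next(series, order, threshold):
    """Similarity-weighted average of the fluctuations that followed past states."""
    deltas = fluctuations(series)
    if len(deltas) <= order:
        return series[-1]  # not enough history: fall back to a no-change forecast
    labels = [fuzzify(d, threshold) for d in deltas]
    current = neutrosophic_state(labels[-order:])
    weighted_sum = 0.0
    weight_total = 0.0
    for t in range(order, len(deltas)):
        past = neutrosophic_state(labels[t - order:t])
        w = similarity(current, past)
        weighted_sum += w * deltas[t]
        weight_total += w
    if weight_total == 0.0:
        return series[-1]
    return series[-1] + weighted_sum / weight_total
```

In this sketch, a window in which all fluctuations point the same way has zero indeterminacy, while a window with equal shares of upward, flat, and downward moves has indeterminacy 1, which mirrors the idea of using entropy to express how inconsistent recent fluctuations were.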
The rest of this paper is organized as follows: Section 2 introduces the basic concepts of fluctuation time series and information entropy and then defines the concepts proposed in this paper, such as the neutrosophic fluctuation time series (NFTS) and the neutrosophic fluctuation logical relationship. Section 3 presents the specific modules of the proposed model. Section 4 details the prediction steps and validates the model using TAIEX as the dataset. Section 5 further analyzes the prediction accuracy and generality of the model based on SHSECI and HSI. Finally, the conclusions and prospects are presented in Section 6.
5. Results Analysis
5.1. Taiwan Stock Exchange Capitalization Weighted Stock Index
TAIEX is a widely used dataset in stock market forecasting. In order to facilitate comparison with other forecasting models, this paper also uses it as the main dataset to verify the model. Using non-stationary data can lead to spurious regressions, so we first performed a stationarity test based on the unit root test using EViews (EViews 10.0 Enterprise Edition, IHS Markit, Irvine, CA, USA). The test showed that the first-order difference of TAIEX for 1997–2005 was stationary, which indicates that the fluctuation data used in this study were stationary. The other datasets in this study were likewise stationary.
The model in this paper is a high-order model, so the choice of order may affect the accuracy of the prediction. The experimental analysis showed that the model was most stable when the order of the fuzzy fluctuation information entropy was 9–11. Table 3 shows the experimental errors for different years under different orders.
Accurate prediction of fluctuation trends is important, so the performance of different methods must be compared and evaluated in order to identify the strengths and weaknesses of the model. This section therefore compares the experimental results of the proposed model with those of other models. Comparing the errors across models showed that the proposed model had certain advantages in prediction accuracy. Table 4 shows the prediction errors for the different methods between 1997 and 2005. The NFM-IE hybrid model achieved better prediction accuracy than the traditional regression model, autoregressive model, neural network model, and fuzzy model (Table 4). In addition, NFM-IE exhibited better predictive power in some years than other hybrid models based on fuzzy theory.
5.2. Forecasting Shanghai Stock Exchange Composite Index
SHSECI is one of the most representative stock indices in China. We selected it as an experimental dataset to verify the model’s applicability.
Recently, scholars have proposed more comprehensive models based on traditional prediction methods. For example, Guan et al. [39] proposed a two-factor autoregressive moving average model based on fuzzy logical relationships (ARMA-FR). Guan et al. [40] proposed a model based on a back propagation neural network and high-order fuzzy-fluctuation trends (BPNN-HFT). This section compares the proposed model with several such typical prediction methods. The results indicate that the model can also effectively predict this stock index. Table 5 and Figure 3 show a comparison of the different prediction methods.
The comparison shows that NFM-IE outperformed the other methods in predicting SHSECI from 2007 to 2015.
Comparing the average SHSECI prediction errors showed that NFM-IE had better prediction accuracy and stability than the neural network-based BPNN-HFT model and the statistics-based ARMA-FR model.
5.3. Forecasting Hong Kong-Hang Seng Index
Finally, the Hong Kong-Hang Seng Index (HSI) was selected as an experimental dataset. By comparing the proposed model with several established prediction methods, we can verify its generality in other stock markets. Table 6 and Figure 4 show a comparison of the different prediction methods from 1998 to 2012.
To further evaluate the validity of the proposed model and the significance of its predictions relative to other prediction methods, we performed a significance analysis following the study of Demšar [44], using Friedman’s test followed by a post-hoc test. For reference, Friedman’s test is a non-parametric statistical test proposed by Milton Friedman [45,46]. In the Friedman test phase, SPSS was used for statistical testing, while the post-hoc test phase was based on manual calculations.
In the first stage, Friedman’s test requires comparison of the average ranking of the different algorithms, $R_j = \frac{1}{N}\sum_{i=1}^{N} r_i^j$, where $r_i^j$ is the rank of the $j$-th of $k$ algorithms on the $i$-th of $N$ datasets. The ranking of each method was based on the analysis of the HSI forecast results, as shown in Table 7.
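The average-rank computation and the resulting Friedman statistic can be sketched as follows. This is an illustrative sketch only: the function names are hypothetical, the statistic is written in the form given by Demšar, $\chi_F^2 = \frac{12N}{k(k+1)}\left[\sum_j R_j^2 - \frac{k(k+1)^2}{4}\right]$, and the error matrix in the usage note is made-up toy data, not the values behind Table 7.

```python
def average_ranks(errors):
    """errors[i][j] is the error of method j on dataset i (lower is better).

    Returns the average rank R_j of each method; tied errors share the
    average of the ranks they span.
    """
    n = len(errors)
    k = len(errors[0])
    totals = [0.0] * k
    for row in errors:
        order = sorted(range(k), key=lambda j: row[j])
        pos = 0
        while pos < k:
            end = pos
            # Extend the group while the next method has the same error.
            while end + 1 < k and row[order[end + 1]] == row[order[pos]]:
                end += 1
            shared_rank = (pos + end) / 2 + 1  # ranks are 1-based
            for j in order[pos:end + 1]:
                totals[j] += shared_rank
            pos = end + 1
    return [total / n for total in totals]

def friedman_chi2(avg_ranks, n):
    """Friedman chi-square statistic for k methods ranked on n datasets."""
    k = len(avg_ranks)
    sum_sq = sum(r * r for r in avg_ranks)
    return 12.0 * n / (k * (k + 1)) * (sum_sq - k * (k + 1) ** 2 / 4.0)
```

For example, with the toy matrix `[[1.0, 2.0, 3.0], [1.0, 3.0, 2.0], [2.0, 1.0, 3.0]]` (three datasets, three methods), `average_ranks` gives average ranks of 4/3, 2, and 8/3, and `friedman_chi2` on those ranks with `n=3` gives roughly 2.67, which would then be compared against the chi-square distribution with $k-1$ degrees of freedom.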
The software analysis showed that the proposed method had the best overall ranking. In addition, according to the chi-square statistic, there were significant differences among the methods.
In the second stage, in order to compare the different methods pairwise, we used the Nemenyi test [47]. According to Equation (31), with α = 0.05, the critical difference was CD = 1.575. Upon further comparison, we found that the method proposed in this study had significant advantages over Yu (2005) [41], Wan (2017) [42], Ren (2016) [43], etc. Although the difference from Cheng’s method (2018) [10] was not significant, NFM-IE still had certain advantages in terms of mean error and average ranking.
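As a check on the reported critical difference, the sketch below evaluates the usual Nemenyi formula from Demšar's formulation, $CD = q_\alpha \sqrt{k(k+1)/(6N)}$. Two points are assumptions rather than statements from the text: that Equation (31) has this form, and that the comparison involves $k = 5$ methods on $N = 15$ yearly HSI datasets (1998 to 2012). The critical values $q_{0.05}$ are those tabulated by Demšar, and the function names are hypothetical.

```python
import math

# Critical values q_alpha for the two-tailed Nemenyi test at alpha = 0.05,
# indexed by the number of compared methods k (as tabulated by Demsar).
Q_ALPHA_005 = {2: 1.960, 3: 2.343, 4: 2.569, 5: 2.728, 6: 2.850}

def nemenyi_cd(k, n, q_table=Q_ALPHA_005):
    """Critical difference for k methods compared on n datasets."""
    return q_table[k] * math.sqrt(k * (k + 1) / (6.0 * n))

def significantly_different(rank_a, rank_b, cd):
    """Two methods differ significantly if their average ranks differ by more than CD."""
    return abs(rank_a - rank_b) > cd

# Assumed setting: k = 5 methods, N = 15 datasets.
cd = nemenyi_cd(5, 15)
```

Under these assumptions `cd` evaluates to about 1.575, which matches the value reported above; two methods are then judged significantly different when their average ranks differ by more than this amount.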
5.4. Discussion
This research focused on two main questions. The first was whether the uncertainty of stock market volatility can serve as a key forecasting feature in a complex environment. The second was whether a prediction method that considers both uncertainty and trend is effective. We first used the inconsistency of historical fluctuations as a stock forecasting feature and then characterized and quantified it. Next, we used the neutrosophic set to represent this information and established a neutrosophic logical relationship based on fluctuation inconsistency. In the experimental analysis, the proposed model achieved robustness and stability with relatively few parameters. The results also suggest that predictions that consider inconsistency are meaningful and effective. The advantages are as follows: First, NFM-IE did not need complex assumptions, unlike traditional regression-based prediction models. Second, the NFM-IE prediction process was more interpretable than that of a neural network. Finally, compared with fuzzy prediction methods, NFM-IE effectively used data inconsistency as key information. Overall, the model showed satisfactory performance. However, it also has certain limitations: First, the model used data from a single stock market as the system input and did not fully consider multiple factors. Second, the use of information entropy as a key tool for uncertainty measurement requires further optimization in how it characterizes the data.