Next Article in Journal
A Finite Element–Finite Volume Code Coupling for Optimal Control Problems in Fluid Heat Transfer for Incompressible Navier–Stokes Equations
Next Article in Special Issue
A New Class of Probability Distributions via Half-Elliptical Functions
Previous Article in Journal
Optimizing Route Planning via the Weighted Sum Method and Multi-Criteria Decision-Making
Previous Article in Special Issue
Variable Selection for Generalized Single-Index Varying-Coefficient Models with Applications to Synergistic G × E Interactions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Mixed-Order Fuzzy Time Series Forecast

1
School of Economics, Guangxi University, Nanning 530004, China
2
College of Finance and Statistics, Hunan University, Changsha 410006, China
3
Charlotte Center for TaiMing AI, University of North Carolina, Charlotte, NC 28223, USA
4
Department of Mathematics and Statistics, School of Data Science, University of North Carolina, Charlotte, NC 28223, USA
*
Author to whom correspondence should be addressed.
Mathematics 2025, 13(11), 1705; https://doi.org/10.3390/math13111705
Submission received: 26 April 2025 / Revised: 19 May 2025 / Accepted: 20 May 2025 / Published: 22 May 2025
(This article belongs to the Special Issue Statistics: Theories and Applications)

Abstract

Fuzzy time series forecasting has gained significant attention for its accuracy, robustness, and interpretability, making it widely applicable in practical prediction tasks. In classical fuzzy time series models, the lag order plays a crucial role, with variations in order often leading to markedly different forecasting results. To obtain the best performance, we propose a mixed-order fuzzy time series model, which incorporates fuzzy logical relationships (FLRs) of different orders into its rule system. This approach mitigates the uncertainty in fuzzy forecasting caused by empty FLRs and FLR groups while fully exploiting the fitting advantages of different-order FLRs. Theoretical analysis is provided to establish the mathematical foundation of the mixed-order model, and its superiority over fixed-order models is demonstrated. Simulation studies reveal that the proposed model outperforms several classical time series models in specific scenarios. Furthermore, applications to real-world datasets, including a COVID-19 case study and a TAIEX financial dataset, validate the effectiveness and applicability of the proposed methodology.
MSC:
62A86; 62G35; 03E72; 62M10

1. Introduction

It is well known that fuzzy sets are somewhat like sets whose elements have degrees of membership. Since Lotfi A. Zadeh [1] introduced the concept of a fuzzy set as an extension of the classical notion of a set, it has been established as an important and practical modeling tool. For time series analysis, Song and Chissom [2,3,4] first transformed the observed time series to a fuzzy time series (FTS) and built FLRs for forecasting. Now, FTS forecasting is not only an extension of the time series approach applied to fuzzy sets but also a representative combination of intelligent algorithms and econometric methods [5,6,7,8,9,10,11]. Although FTS forecasting is a multidisciplinary approach, its core primarily involves the identification of FLRs and forecasting [12,13,14]. The process consists of three fundamental steps: fuzzification, identification of FLRs, and defuzzification.
In the fuzzification step, how to determine the length of the interval is a key issue. Relevant research in this direction can be divided into three categories. The first one is to take the length as a statistic parameter, including the data range [2,3,4], the data average [15], and other distribution-based parameters [16]. The second category is to construct an optimization problem for obtaining the length of the fuzzy intervals. The minimization of mean squared errors (MSE) is the goal of this kind of research. The quadratic optimization algorithms [17,18], intelligent algorithms [19,20], and information entropy methods [21], among others, are often used to optimize interval segmentation. The third category is to use clustering methods to realize the division of the variable universe, which includes Gath–Geva clustering [22], automatic clustering [23], affinity propagation clustering [24], etc. For illustration, in this paper we employ fuzzy c-means (FCM) clustering [25] to produce the interval segmentation, even though others are applicable. This method can effectively describe the memberships of data and form interpretable fuzzy intervals.
In the step of the identification of FLRs, a fixed-order [15,16,17] or high-order FLR [26,27] is employed. Typically, one chooses an optimal fixed-order FLR model to minimize the MSE [21,28,29]. There are also other attempts; for example, Ye et al. [30] used the weighted values of predictions from first-, second-, and third-order models. However, there still exist unsatisfactorily fitting segments in the application of different-order models. Naturally, one asks whether or not different-order FLRs can be combined for a better forecast. This motivates us to propose a mixed-order FLR model, which integrates different-order FLRs. The model seeks to find a unique best-matching FLR in the rule base, which avoids the confusion caused by FLR groups (FLRGs) and empty FLRs. To demonstrate the advantages of our method, we not only prove some of its properties but also compare it with classic ones in the literature [31,32,33,34,35,36]. Theoretically and practically, we show that the performance of the mixed-order FLRs is better than the fixed-order ones.
In the defuzzification step, one needs to transform the predicted fuzzy sets into the most representative real number. Usually, the center of the observations is used as a representative of the predicted fuzzy set [6,7,37]. Despite the intention of mixed-order FLRs to mitigate confusion stemming from FLRGs and empty FLRs, situations may still arise where empty FLRs and FLRGs unavoidably occur. For the completeness of the defuzzification rules, the classical methods, including the master voting scheme [20] and the mean method, are used to defuzzify the fuzzy set from empty FLRs and FLRGs.
Applying the above three steps, we obtain an FTS forecast procedure built upon a mixed-order FLR model. In practice, for a given sample, one needs to specify the order of the model as in classic time series analysis. We develop the well-known K-fold cross-validation criterion [38,39,40] to select the best mixed-order models for the sample. This furnishes us a data-driven FTS forecast approach.
To theoretically demonstrate the superiority of the mixed-order model over fixed-order models, we define the completeness and redundancy to characterize the set of empty FLRs and FLR groups, respectively. To explore the predictive relationships among different-order models, we introduce the sub-FLR and extended FLR to describe the relationships between FLRs with different orders that share the same components. Several favorable properties of the mixed-order model are identified: the first-order model has the greatest probability of generating a complete FLR set among all orders, and high-order redundant sets can always be derived from low-order redundant sets. To further validate the effectiveness of the mixed-order model, we conduct comparisons with other models using both synthetic and real-world datasets, providing additional insights into the applicability of the mixed-order model.
The remainder of this paper is organized as follows. In Section 2, we give a brief review of FTS, introduce the mixed-order FLRs, and develop the related mathematical theory. In Section 3, we introduce our FTS forecast method and the corresponding algorithm. In Section 4, we conduct numerical experiments to compare different methods. Finally, we conclude this paper. All proofs of theorems are postponed to the Appendix A.

2. FTS and Mixed-Order FLR Models

To motivate our mixed-order FLR models, let us first review the FTS.

2.1. Review of Fuzzy Time Series

For convenience of exposition, let us first introduce some definitions.
Definition 1
(Fuzzy Set [1]). Let the universe of discourse be U = u 1 , u 2 , , u n , and a fuzzy set A of U is characterized by a membership function μ A : U [ 0 , 1 ] , expressed as
A = μ A ( u 1 ) / u 1 + μ A ( u 2 ) / u 2 + + μ A ( u n ) / u n
where μ A ( u i ) [ 0 , 1 ] represents the grade of membership of u i in A, the symbol “/” separates the membership grades from the elements in U, and the symbol “+” means “union” rather than the algebraic symbol of summation.
Definition 2
(FTS [3]). Let X ( t ) , t = , 0 , 1 , 2 , , a subset of R , be the universe of discourse in which fuzzy sets f i ( t ) , i = 1 , 2 , , are defined, and F ( t ) is the collection of f i ( t ) , i = 1 , 2 , . Then, F ( t ) is called a fuzzy time series on X ( t ) , t = , 0 , 1 , 2 , .
Here, F ( t ) can be viewed as a linguistic variable, and f i ( t ) can be regarded as possible linguistic values of F ( t ) . If the maximum degree of membership of F ( t i ) belongs to fuzzy set A t i , i = 0 , , m , then the m-order FLR between F ( t m ) , , F ( t 1 ) and F ( t ) can be represented as
A t m , , A t 1 A t 0 .
FLRs with the same left-hand side can be grouped together and are called an FLR group.
On one hand, it is worth noting that using the collection of all FLRs based on the fuzzy sample is not enough for fuzzy forecasting in all possible situations, since there may not exist a forecast. In general, if there is no fuzzy forecast based on a fuzzy set series “ A i m , , A i 1 ”, then we use
A i m , , A i 1 #
to denote such an FLR. We call it an empty m-order FLR. That is, the one-step fuzzy forecast based on A i m , , A i 1 is not available.
On the other hand, if there is an m-order FLRG, then there are multiple fuzzy forecasts, and thus it is confusing to determine which to choose for the forecast. This motivates us to introduce the mixed-order FLRs to solve these problems.

2.2. Mixed-Order FLRs

To develop the mixed-order FLRs, we first define complete FLR sets and redundant FLR sets. The ideas of completeness and redundancy are similar to those of the rule-based system [41].
Definition 3
(Completeness). A set of m-order FLRs is complete if there is no empty m-order FLR in the set; otherwise, it is incomplete.
Definition 4
(Redundancy). A set of m-order FLRs is non-redundant if it contains no m-order FLRG; otherwise, it is redundant.
Let A be an FTS, F m ( A ) be the set consisting of all m-order FLRs derived from A, E m ( A ) be the incomplete FLR set composed of all empty m-order FLRs, and R m ( A ) be the redundant complete FLR set composed of all m-order FLRGs. Let C m ( A ) = F m ( A ) ( E m ( A ) R m ( A ) ) , that is, C m ( A ) is a complete and non-redundant FLR set composed of m-order FLRs which are not empty and not in any FLRGs. Then, for a given m, we have
F m ( A ) = E m ( A ) C m ( A ) R m ( A )
Due to the randomness of A, the FLR set F m ( A ) is random, and thus E m ( A ) , C m ( A ) and R m ( A ) are, too. F m ( A ) , E m ( A ) , R m ( A ) and C m ( A ) are abbreviated as F m , E m , R m and C m . If E m and R m are empty sets, then F m = C m , which is an ideal case and does not hold in general.
Our idea for the mixed-order FLR model is to construct as close to a one-to-one relationship between the left-hand fuzzy series and the right-hand fuzzy set as possible. Ideally, if the m-order model ( F m ) is used for prediction, we hope it is complete and non-redundant, because a fuzzy forecast cannot be obtained from the empty FLR in E m and is confronted with multiple predictions from the FLRGs in R m . Our mixed-order model attempts to achieve this goal. To this end, we begin with a simple set which has the greatest probability of being a complete FLR set, and then shrink the redundancy step by step. The following theorem suggests that F 1 is the desired set with which we should start.
Theorem 1.
For every FTS, F 1 has the greatest probability of being a complete FLR set among all F m , m = 1 , 2 , .
According to Theorem 1, we begin with F 1 = E 1 C 1 R 1 for constructing the mixed-order FLRs. Since E 1 (as well as E k , k > 1 ) cannot provide any information for forecasting the next state, if R 1 is empty, then F 1 is a desired model for the fuzzy forecast. Otherwise, we need to deal with R 1 , which is complete but redundant. To this end, we introduce the concepts of “sub-FLR” and “extended-FLR” in the following.
Definition 5
(Sub-FLR/Extended FLR). Given a j-order FLR
A k j A k j 1 A k i + 1 A k i A k 1 A k 0 ,
for i j , the i-order FLR { A k i A k 1 A k 0 } is called the sub-FLR of the FLR (5), and, conversely, the FLR (5) is called the extended FLR of the i-order FLR.
Obviously, a sub-FLR shares the same one-step fuzzy forecast with the original FLR. For any i-order FLR set B i F i and ( i + 1 ) -order FLR set D i + 1 F i + 1 , we define
D i + 1 ( B i ) = { T i + 1 : T i + 1 D i + 1 and there exists a T i B i such that T i + 1 is an extended FLR of T i } .
The redundancy of F 1 (or R 1 ) can be shrunk by replacing all first-order FLRGs in R 1 with their extended FLRs. Based on the Occam’s razor principle, it is natural to extend them to the second-order FLRs, which are simplest among all higher-order FLRs. Hence, we replace R 1 with F 2 ( R 1 ) . By Equation (4), we have
F 2 ( R 1 ) = E 2 ( R 1 ) C 2 ( R 1 ) R 2 ( R 1 ) .
It is obvious that R 2 ( R 1 ) = R 2 . More generally, we have the following result.
Theorem 2.
R m + 1 ( R m ) = R m + 1 .
Because each element in E 2 is an empty FLR, and each element in R 1 is a nonempty FLR, E 2 ( R 1 ) = ϕ . By Theorem 2, Equation (7) becomes
F 2 ( R 1 ) = C 2 ( R 1 ) R 2 .
In general, similarly to Equation (8),
F k + 1 ( R k ) = C k + 1 ( R k ) R k + 1 , for k 1 ,
where C k + 1 ( R k ) is a non-redundant and complete subset of C k + 1 . Then, the first-order FLR set F 1 can be replaced by a mixed-order FLR set
M 2 = E 1 C 1 C 2 ( R 1 ) R 2 .
If R 2 is empty, then model M 2 is desired. Otherwise, we replace R 2 with F 3 ( R 2 ) and obtain the model
M 3 = E 1 C 1 [ k = 2 3 C k ( R k 1 ) ] R 3 .
After repeating the process of replacing R k with F k + 1 ( R k ) up to k = m m a x 1 ( m m a x is the maximum lag order allowed by the method), we obtain a sequence of FLR models:
M m = E 1 C 1 [ k = 2 m C k ( R k 1 ) ] R m , m = 2 , , m m a x ,
and M 1 is defined as F 1 . Since each M m consists of different-order FLRs, we call it the mth mixed-order FLR model. As m gets larger, the model becomes more complicated. The parameter m can be regarded as a tuning parameter. Given a sample path, it can be chosen to minimize the mean squared errors in the forecast by K-fold cross-validation [39]. See Section 3.2 for detail.
Each of the mixed-order FLR models in Equation (12) consists of three parts: an incomplete subset E 1 , complete non-redundant subset C 1 [ k = 2 m C k ( R k 1 ) ] , and redundant subset R m . For the fixed-order model F m in Equation (4), the corresponding three parts are E m , C m , and R m , respectively. Now, we compare F m with M m . First, both of them have the same common part R m . Given a sample path, as m gets large enough, R m will eventually be an empty set. (In an extreme case, for an FTS with m + 1 observations, only one m-order FLR can be extracted, and R m = ϕ ). Second, by Theorem 1,
P ( E 1 = ϕ ) P ( E m = ϕ ) , for any given m > 1 ,
that is, the probability of an empty FLR appearing in M m , P ( E 1 = ϕ ) is less than the probability of an empty FLR appearing in F m , P ( E m = ϕ ) . Third, for any FLR in C m , there is a sub-FLR of C m in C 1 ( k = 2 m C k ( R k 1 ) ) , as demonstrated in the following theorem.
Theorem 3.
For every m-order FLR in C m , there exists a corresponding sub-FLR in C 1 [ k = 2 m C k ( R k 1 ) ] .
By Theorem 3, the corresponding fuzzy forecasts (right-hand side of FLR) in C m and C 1 ( k = 2 m C k ( R k 1 ) ) are the same because any sub-FLR shares the same one-step fuzzy forecast with the original FLR. Further, the FLR models in C 1 ( k = 2 m C k ( R k 1 ) ) are simpler than in C m due to the lower orders. Therefore, compared with the m-order FLR model F m , the proposed model M m has the following advantages:
(i)
If F m has a nonempty prediction, M m has the same prediction, but not vice versa;
(ii)
F m has more empty predictions than M m because of Equation (13);
(iii)
M m is simpler than F m because of the lower FLR orders.

3. The Proposed FTS Forecast

3.1. FTS Forecast Based on an mth Mixed-Order FLR

A traditional FTS forecasting method consists of fuzzification, FLR modeling, and defuzzification. In the following, we illustrate the proposed method in detail.
Step 1: Fuzzification. The aim of fuzzification is to assign each observation to a fuzzy set. It is similar to the clustering process. Given a training set X = { x 1 , x 2 , , x n } , suppose we assign it into c fuzzy sets, { A i , i = 1 , 2 , , c } . Let u i j be the degree of membership of x j assigned to fuzzy set A i . The value of u i j is to be determined. Use the weighted average of training samples, y i = j = 1 n ( u i j ) k x j / j = 1 n ( u i j ) k , to estimate the center of A i , where k is the degree of fuzziness. (In this paper, the degree of fuzziness is set to 2, which is the same as [42]). Let d ( x j , y i ) be the distance between x j and y i . Denote with U a c × n fuzzy partition matrix with the ( i , j ) component being u i j . For clustering observations into c fuzzy sets, we employ the FCM algorithm [25], which produces an optimal c partition by minimizing the weighted within-group sum of squared error objective function
g ( U ) = i = 1 c j = 1 n ( u i j ) k d 2 ( x j , y i )
s . t . y i = j = 1 n ( u i j ) k x j j = 1 n ( u i j ) k , i = 1 , 2 , , c ,
i = 1 c u i j = 1 , j = 1 , 2 , , n ,
u i j 0 , i = 1 , 2 , , c , j = 1 , 2 , , n .
The above problem can be solved via iterations. When { y 1 , , y c } are given, using the Lagrange multiplier method, one obtains the minimizer of g ( U ) [25]:
u i j = r = 1 c d ( x j , y i ) d ( x j , y r ) 2 k 1 1 , 1 i c , 1 j n .
Then, update the values of y i in Equation (15). Repeat this procedure until convergence. Then, one gets the optimal solution of U and assigns x j to fuzzy set A i if u i j = max u l j , l = 1 , 2 , , c . The fuzzy sets A i can be set as, for example,
A i = ( b i 1 , b i ] ,
where b 0 = m i n j { x j | x j X } Δ , b i is the maximum value of x j ’s which are assigned to A i , and Δ can be any small number as long as b 0 < m i n { x j | x j X } [43].
Step 2: Identification of mixed-order FLRs
According to Section 2.2, a key process of the mth mixed-order FLRs in Equation (12) is to replace R k with C k + 1 ( R k ) R k + 1 in Equation (9). The algorithm for constructing the mth mixed-order FLRs is shown in Algorithm 1.
Algorithm 1: Generation of the mth mixed-order FLRs
Mathematics 13 01705 i001
Step 3: Defuzzification
Defuzzification acts as a translator of the fuzzy system. It consists of two parts: selecting the FLR and defuzzifying the fuzzy forecasting result.
Suppose there is an FLR with “ A ? ” being the fuzzy forecast from F m . By Theorem 3, there exists a sub-FLR of “ A t m , , A t 1 A ? ” in C 1 [ k = 2 m C k ( R k 1 ) ] M m . The sub-FLR can be found by checking the FLR in M m from the first to the mth order, as illustrated in Algorithm 2. According to the property of sub-FLRs, a sub-FLR shares the same one-step fuzzy forecast with the original FLR, thus “ A ? ” can be determined.
Algorithm 2: The FLR selection
Mathematics 13 01705 i002
Per the number of items in the one-step fuzzy forecast set, we define the following three defuzzified rules to select the representative of the predicted fuzzy set. The representative of the fuzzy set is defined as the center of the observations. Notice that, even if M m aims to establish one-to-one FLRs, empty FLRs and FLRGs may appear in the forecasting. Thus, the weighted average of the centers is used to represent the predicted fuzzy set from empty FLRs and FLRGs.
Case I.
If the one-step fuzzy forecast is a single fuzzy set, like A i m A i 1 A j , the defuzzified value z j of fuzzy set A j is defined as
z j = arg min z n j 1 j = 1 n j ( z x j ( i m i i 1 ) ) 2 ,
where x j ( i m i i 1 ) represents the observations corresponding to the next state A j , which has a fuzzy logical relationship: A i m A i 1 A j , n j is the total number of observations whose next state is A j . Hence, the solution to Equation (19) is z j = n j 1 j = 1 n j x j ( i m i i 1 ) .
Case II.
If the one-step fuzzy forecast is an empty set, like A i m A i 1 # , the defuzzified value z # is calculated by the master voting scheme [20]:
z # = m c i 1 + c i 2 + + c i m 2 m 1 .
where c i m is the center value of the observations in fuzzy set A i m , c i m = n i m 1 j = 1 n i m x j i m . Since there is no nonempty FLR as the basis in this case, the left-hand side of the empty FLR is the only useful information in the prediction. The weighted average of the centers is employed, and the greatest weight m is given to the center of A i 1 because A i 1 is the most recent fuzzy set for the prediction.
Case III.
If the one-step fuzzy forecast is multiple fuzzy sets, like A i m A i 1 { A j 1 A j l } , the defuzzified value is the average of the forecasting fuzzy set.
z ( j 1 j l ) = c j 1 + + c j l l .

3.2. Choice of the Mixed-Order m

In Section 3.1, we need a predetermined value of m. The larger m is, the more complicated the model becomes. Hence, we regard it as a tuning parameter. Suppose there is a fuzzy time series sample of size n, { A t } t = 1 n . We extend the K-fold cross-validation [39] to the current situation for the choice of m. The basic idea of this method is to choose m to minimize the prediction’s mean squared errors, and the details are given in Algorithm 3.
Algorithm 3: Selection of mixed-order m
Mathematics 13 01705 i003

4. Experimental Results

In this section, we demonstrate the effectiveness of the mixed-order model through numerical experiments on both synthetic and real-world datasets. First, we compare the mixed-order model with fixed-order models on eight different types of non-stationary time series to analyze the scenarios where the mixed-order model is more applicable. Subsequently, we apply both mixed-order and fixed-order models to real-world datasets to verify the superiority of the mixed-order model. Finally, we compare the mixed-order model with models proposed in other studies. All methods are implemented in MATLAB R2023b and executed on a laptop with an Intel Core i5.

4.1. Performance Evaluation on Time Series Data with Varying Characteristics

To investigate the applicability of the mixed-order model, we conducted a comprehensive evaluation of its predictive performance on various synthetic time series with distinct characteristics. These time series include the following: stationary time series, stationary time series with blips, sudden changes in the mean and variance, periodic changes in the mean and variance, and incremental changes in the mean and variance. The dataset is visualized in Figure 1. To measure the forecast accuracy, we employ the square root of mean squared error (RMSE), which is defined as
R M S E = { n 1 t = 1 n [ x ^ ( t ) x ( t ) ] 2 } 1 / 2 .
The comparative results of the RMSE among the mixed-order and fixed-order models are summarized in Table 1.
The results clearly indicate that the mixed-order model consistently outperforms fixed-order models across all types of time series. Notably, the mixed-order model excels in handling non-stationary time series with periodic changes in variance, delivering superior performance even compared with its predictions on stationary time series. However, its weakest performance is observed on time series with incremental changes in variance, likely due to the growing number of unavoidable empty FLRs caused by the escalating variance in the data.

4.2. The Effectiveness of the Mixed-Order FLRs

Intuitively, the mixed-order model is an integration of multiple fixed-order models, which can easily produce a wide variety of FLRs. In order to demonstrate the effectiveness of the proposed mixed-order model, we compare it with different fixed-order models using the real-world dataset on the weekly percentage changes in newly confirmed COVID-19 cases published in the WHO Weekly Epidemiological Reports, as shown in Figure 2. (The sample used in this subsection consists of data from the first 100 COVID-19 Weekly Epidemiological Updates. The data are available on the official website of the WHO: https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports/ accessed on 19 May 2025).
Based on the K-fold cross-validation in Algorithm 3 ( K = 5 ), we obtain M 5 as the mixed-order FLRs for fitting the data. The smaller the RMSEs are, the better the method becomes. Table 2 reports the values of these measures, reflecting the nice performance of the mixed-order model. In fact, among the fixed-order models, the second-order model is the best, which indicates that the prediction accuracy is not always improved by merely increasing the order. As a whole, the mixed-order model has the smallest values of RMSE, which indicates that the mixed-order model is effective.

4.3. Comparison with Other Methods

For convenience of comparison, we use the TAIEX datasets from 2002 to 2004 (the data are available in Yahoo Finance: http://tradingeconomics.com/taiwan/stock-market/ accessed on 19 May 2025), which have been extensively analyzed in the the existing literature. The datasets are shown in Figure 3. In the experiment, the historical data of each year are divided into two parts: the historical data from January to October of each year are used as the training set, and the data from November to December of each year are the testing set. Table 3 shows a comparison of RMSEs and the average RMSEs.
The fuzzy intervals of Methods 1 and 7 have the same length. The fuzzification of Method 7 is to divide the [ min { X } , max { X } ] into n equal-length parts. The parameters n and M m are selected by K-fold cross-validation ( K = 5 ). It is easy to see that the forecasting performance of the proposed method with equal intervals is better than the classical equal-interval fuzzy forecasting method [31] in terms of the RMSE and average RMSE.
Methods 1 and 13 only consider the closing price of TAIEX as variable. Obviously, the forecasting performance of the proposed method is better than others in [32,33,35] and even better than some methods that use multiple variables, such as [31,34,36]. Methods 14 to 27 all employ auxiliary variables to predict the closing price. Methods 28 and 29 utilize Long Short-Term Memory (LSTM) and Neural Network (NN) methods, respectively, to handle FTSs. Method 30 is the same as the proposed mixed-order method except that the opening prices of TAIEX are employed to fuzzify the data. The mixed-order method with auxiliary variables is the best in terms of the average RMSE in Table 3. The comparison between Methods 13 and 30 shows that introducing auxiliary variables is conducive to reducing the RMSE. It is interesting to further incorporate auxiliary variables into our model, which will be explored in future work.

5. Conclusions

In this paper, we have proposed the mixed-order FLR model and defined several concepts, including sub-FLR, extended FLR, completeness for an FLR set, and redundancy for an FLR set. Building on these definitions, we reveal some favorable theoretical properties of the mixed-order FLR model compared with fixed-order FLR models. The strength of this approach lies in its novel framework, which effectively mitigates the uncertainty caused by empty FLRs and FLR groups in fuzzy forecasting. The K-fold cross-validation has been used to determine the order of mixed-order models. This provides us with an appealing data-driven FTS forecast approach. Numerical experiments demonstrate that the mixed-order FLR model significantly improves prediction accuracy. It is particularly well-suited to non-stationary time series with periodic changes. However, its performance is less effective for time series with increasing variance, although it still outperforms fixed-order FLR models in such cases. Future work will focus on further exploring the theoretical properties of the proposed mixed-order FLR model, integrating additional auxiliary variables into FTS forecasting, and introducing error learning mechanisms to enhance the methodology.

Author Contributions

Conceptualization, H.W.; Methodology, H.W. and J.J.; writing—original draft preparation, H.W., H.L. and J.J.; writing—review and editing, H.W., H.L. and J.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Key Research Base of Humanities and Social Sciences in Guangxi Universities, Guangxi Development Strategy Institute Fund, with the grant number: 2024GDSIYB18.

Data Availability Statement

The data can be accessed through the link mentioned in the paper.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Appendix A. Proofs of Theorems

Proof of Theorem 1. 
Given a fuzzy sample A = { A ( 1 ) , , A ( T ) } , one can derive ( T m ) m-order fuzzy set series. Let c be the size of t = 1 T A ( t ) . Then, there are c different fuzzy sets corresponding to A. By the permutation principle, the number of m-order FLRs with different left-hand sides derived from A is c m .
(i)
Suppose c m < T m . Note that P m c , T is the probability that F m is complete. It follows that
P m c , T = T m c m × 1 × c m 1 c m × c m 2 c m × × 1 c m > T m c m × 1 × c m 1 c m + 1 × c m 2 c m + 1 × × 1 c m + 1 > T m c m × ( c m + 1 1 c m + 1 × × c m + 1 c m c m + 1 ) × c m 1 c m + 1 × × 1 c m + 1 > T ( m + 1 ) c m + 1 × ( c m + 1 1 c m + 1 × × c m + 1 c m c m + 1 ) × c m 1 c m + 1 × × 1 c m + 1 = P m + 1 c , T ,
where T m c m is the combination number.
(ii)
Suppose c m T m . Then, c m + 1 > T ( m + 1 ) , since c m + 1 > c m T m > T ( m + 1 ) . Even if all ( T m )  m-order FLRs are different, there still exist c m ( T m ) empty m-order FLRs. Hence, F m is incomplete, and P m c , T = 0 . Similarly, all P m + k c , T = 0 for k = 1 , 2 , .
In summary, P m c , T P m + 1 c , T . □
Proof of Theorem 2. 
According to the definition, we have
R i + 1 ( R i ) = T i R i { T i + 1 | T i + 1 R i + 1 and T i + 1 is an extended FLR of T i } .
Therefore, R m + 1 ( R m ) R m + 1 . For every A i m + 1 A i 1 A i 0 R m + 1 , there exists A i m + 1 A i 1 A i 0 R m + 1 with A i 0 A i 0 . Further, A i m A i 1 { A i 0 , A i 0 } R m . Then, A i m + 1 A i 1 A i 0 R m + 1 ( R m ) . Therefore, R m + 1 R m + 1 ( R m ) . □
Proof of Theorem 3. 
For every A i m A i j A i 1 A i 0 C m , it holds that, for j m , A i j A i 1 A i 0 C j R j , since a sub-FLR shares the same one-step fuzzy forecast with the original FLR.
(i)
If A i l A i 1 A i 0 C l and A i l 1 A i 1 A i 0 R l 1 with l m , then A i l A i 1 A i 0 C l ( R l 1 ) C 1 ( k = 2 m 1 S k ) . Thus, A i l A i 1 A i 0 is the corresponding sub-FLR.
(ii)
If A i l 1 A i 1 A i 0 R l 1 for every l m , then A i 1 A i 0 C 1 C 1 ( k = 2 m 1 S k ) . Thus, A i 1 A i 0 is the corresponding sub-FLR.

References

  1. Zadeh, L.A. Fuzzy sets. Inf. Control 1965, 8, 338–353. [Google Scholar] [CrossRef]
  2. Song, Q.; Chissom, B.S. Forecasting enrollments with fuzzy time series-part I. Fuzzy Sets Syst. 1993, 54, 1–9. [Google Scholar] [CrossRef]
  3. Song, Q.; Chissom, B.S. Fuzzy time series and its model. Fuzzy Sets Syst. 1993, 54, 269–277. [Google Scholar] [CrossRef]
  4. Song, Q.; Chissom, B.S. Forecasting enrollments with fuzzy time series-part II. Fuzzy Sets Syst. 1994, 62, 1–8. [Google Scholar] [CrossRef]
  5. Wu, H.; Long, H.; Jiang, J. Handling forecasting problems based on fuzzy time series model and model error learning. Appl. Soft Comput. 2019, 78, 109–118. [Google Scholar] [CrossRef]
  6. Yolcu, O.C.; Yolcu, U. A novel intuitionistic fuzzy time series prediction model with cascaded structure for financial time series. Expert Syst. Appl. 2023, 215, 119336. [Google Scholar] [CrossRef]
  7. Dixit, A.; Jain, S. Intuitionistic fuzzy time series forecasting method for non-stationary time series data with suitable number of clusters and different window size for fuzzy rule generation. Inf. Sci. 2023, 623, 132–145. [Google Scholar] [CrossRef]
  8. Wang, B.; Liu, X.; Chi, M.; Li, Y. Bayesian network based probabilistic weighted high-order fuzzy time series forecasting. Expert Syst. Appl. 2024, 237, 121430. [Google Scholar] [CrossRef]
  9. Shi, X.; Wang, J.; Zhang, B. A fuzzy time series forecasting model with both accuracy and interpretability is used to forecast wind power. Appl. Energy 2024, 353, 122015. [Google Scholar] [CrossRef]
  10. Wu, B.; Wang, L. Two-stage decomposition and temporal fusion transformers for interpretable wind speed forecasting. Energy 2024, 288, 129728. [Google Scholar] [CrossRef]
  11. Didugu, G.; Gandhudi, M.; Alphonse, P.J.A.; Gangadharan, G.R. VWFTS-PSO: A novel method for time series forecasting using variational weighted fuzzy time series and particle swarm optimization. Int. J. Gen. Syst. 2025, 54, 540–559. [Google Scholar] [CrossRef]
  12. Bose, M.; Mali, K. Designing fuzzy time series forecasting models: A survey. Int. J. Approx. Reason. 2019, 111, 78–99. [Google Scholar] [CrossRef]
  13. Panigrahi, S.; Behera, H.S. Fuzzy time series forecasting: A survey. In Computational Intelligence in Data Mining: Proceedings of the International Conference on ICCIDM; Springer: Berlin/Heidelberg, Germany, 2020; pp. 641–651. [Google Scholar] [CrossRef]
  14. Orang, O.; de Lima e Silva, P.C.; Guimarães, F.G. Time series forecasting using fuzzy cognitive maps: A survey. Artif. Intell. Rev. 2023, 56, 7733–7794. [Google Scholar] [CrossRef]
  15. Huarng, K. Heuristic models of fuzzy time series for forecasting. Fuzzy Sets Syst. 2001, 123, 369–386. [Google Scholar] [CrossRef]
  16. Huarng, K. Effective length of intervals to improve forecasting in fuzzy time-series. Fuzzy Sets Syst. 2001, 123, 387–394. [Google Scholar] [CrossRef]
  17. Egrioglu, E.; Aladag, C.H.; Basaran, M.A.; Yolcu, U.; Uslu, V.R. A new approach based on the optimization of the length of intervals in fuzzy time series. J. Intell. Fuzzy Syst. Appl. Eng. Technol. 2011, 22, 15–19. [Google Scholar] [CrossRef]
  18. Egrioglu, E.; Aladag, C.H.; Yolcu, U.; Uslu, V.R.; Basaran, M.A. Finding an optimal interval length in high order fuzzy time series. Expert Syst. Appl. 2010, 37, 5052–5055. [Google Scholar] [CrossRef]
  19. Chen, S.M.; Chung, N.Y. Forecasting enrollments using high-order fuzzy time series and genetic algorithms. Int. J. Intell. Syst. 2006, 21, 485–501. [Google Scholar] [CrossRef]
  20. Kuo, I.H.; Horng, S.J.; Kao, T.W.; Lin, T.L.; Lee, C.L.; Pan, Y. An improved method for forecasting enrollments based on fuzzy time series and particle swarm optimization. Expert Syst. Appl. 2009, 36, 6108–6117. [Google Scholar] [CrossRef]
  21. Chen, M.Y. A high-order fuzzy time series forecasting model for internet stock trading. Future Gener. Comput. Syst. 2014, 37, 461–467. [Google Scholar] [CrossRef]
  22. Wang, L.; Liu, X.; Pedrycz, W.; Shao, Y. Determination of temporal information granules to improve forecasting in fuzzy time series. Expert Syst. Appl. 2014, 41, 3134–3142. [Google Scholar] [CrossRef]
  23. Wang, W.; Liu, X. Fuzzy forecasting based on automatic clustering and axiomatic fuzzy set classification. Inf. Sci. 2015, 294, 78–94. [Google Scholar] [CrossRef]
  24. Wu, H.; Long, H.; Wang, Y.; Wang, Y. Stock index forecasting: A new fuzzy time series forecasting method. J. Forecast. 2021, 40, 653–666. [Google Scholar] [CrossRef]
  25. Bezdek, J.C. Pattern Recognition with Fuzzy Objective Function Algorithms; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2013. [Google Scholar] [CrossRef]
  26. Panigrahi, S.; Behera, H.S. A study on leading machine learning techniques for high order fuzzy time series forecasting. Eng. Appl. Artif. Intell. 2020, 87, 103245. [Google Scholar] [CrossRef]
  27. Pattanayak, R.M.; Behera, H.S.; Panigrahi, S. A novel high order hesitant fuzzy time series forecasting by using mean aggregated membership value with support vector machine. Inf. Sci. 2023, 626, 494–523. [Google Scholar] [CrossRef]
  28. Li, S.T.; Cheng, Y.C.; Lin, S.Y. A FCM based deterministic forecasting model for fuzzy time series. Comput. Math. Appl. 2008, 56, 3052–3063. [Google Scholar] [CrossRef]
  29. Kuo, I.H.; Horng, S.J.; Chen, Y.H.; Run, R.S.; Kao, T.W.; Chen, R.J.; Lai, J.L.; Lin, T.L. Forecasting TAIEX based on fuzzy time series and particle swarm optimization. Expert Syst. Appl. 2010, 37, 1494–1502. [Google Scholar] [CrossRef]
  30. Ye, F.; Zhang, L.; Zhang, D.; Fujita, H.; Gong, Z. A novel forecasting method based on multi-order fuzzy time series and technical analysis. Inf. Sci. 2016, 367–368, 41–47. [Google Scholar] [CrossRef]
  31. Huarng, K.; Yu, T.H.K.; Hsu, Y.W. A multivariate heuristic model for fuzzy time-series forecasting. IEEE Trans. Syst. Man Cybern.-Part B Cybern. 2007, 37, 836–846. [Google Scholar] [CrossRef]
  32. Chen, S.M. Forecasting enrollments based on fuzzy time series. Fuzzy Sets Syst. 1996, 81, 311–319. [Google Scholar] [CrossRef]
  33. Yu, T.H.K.; Huarng, K.H. A bivariate fuzzy time series model to forecast the TAIEX. Expert Syst. Appl. 2008, 34, 2945–2952. [Google Scholar] [CrossRef]
  34. Chen, S.M.; Chang, Y.C. Multi-variable fuzzy forecasting based on fuzzy clustering and fuzzy interpolation techniques. Inf. Sci. 2010, 180, 4772–4783. [Google Scholar] [CrossRef]
  35. Sullivan, J.; Woodall, W.H. A comparison of fuzzy forecasting and Markov modeling. Fuzzy Sets Syst. 1994, 64, 279–293. [Google Scholar] [CrossRef]
  36. Chen, S.M.; Chen, C.D. TAIEX forecasting based on fuzzy time series and fuzzy variation groups. IEEE Trans. Fuzzy Syst. 2010, 19, 1–12. [Google Scholar] [CrossRef]
  37. Bas, E.; Egrioglu, E.; Kolemen, E. A novel intuitionistic fuzzy time series method based on bootstrapped combined pi-sigma artificial neural network. Eng. Appl. Artif. Intell. 2022, 114, 105030. [Google Scholar] [CrossRef]
  38. Rodriguez, J.D.; Pérez, A.; Lozano, J.A. Sensitivity analysis of k-fold cross validation in prediction error estimation. IEEE Trans. Pattern Anal. Mach. Intell. 2009, 32, 569–575. [Google Scholar] [CrossRef]
  39. Jiang, J. Multivariate functional-coefficient regression models for nonlinear vector time series data. Biometrika 2014, 101, 689–702. [Google Scholar] [CrossRef]
  40. Liu, Y.; Liao, S.; Jiang, S.; Ding, L.; Lin, H.; Wang, W. Fast cross-validation for kernel-based algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 42, 1083–1096. [Google Scholar] [CrossRef]
  41. Shrestha, B.P.; Duckstein, L.; Stakhiv, E.Z. Fuzzy rule-based modeling of reservoir operation. J. Water Resour. Plan. Manag. 1996, 122, 262–269. [Google Scholar] [CrossRef]
  42. Ozkan, I.; Turksen, I.B. Upper and lower values for the level of fuzziness in FCM. In Fuzzy Logic: A Spectrum of Theoretical & Practical Issues; Springer: Berlin/Heidelberg, Germany, 2007; pp. 99–112. [Google Scholar] [CrossRef]
  43. Cheng, S.H.; Chen, S.M.; Jian, W.S. Fuzzy time series forecasting based on fuzzy logical relationships and similarity measures. Inf. Sci. 2016, 327, 272–287. [Google Scholar] [CrossRef]
Figure 1. Synthetic time series with distinct characteristics.
Figure 1. Synthetic time series with distinct characteristics.
Mathematics 13 01705 g001
Figure 2. The weekly percentage changes in newly confirmed COVID-19 cases.
Figure 2. The weekly percentage changes in newly confirmed COVID-19 cases.
Mathematics 13 01705 g002
Figure 3. TAIEX datasets from 2002 to 2004.
Figure 3. TAIEX datasets from 2002 to 2004.
Mathematics 13 01705 g003
Table 1. Comparison of mixed-order and fixed-order models for synthetic time series.
Table 1. Comparison of mixed-order and fixed-order models for synthetic time series.
ModelMixed-Order1-Order2-Order3-Order4-Order5-Order
stationary0.82350.89551.12701.02580.94620.9435
with blip0.82110.89861.08791.03460.94650.9461
sud mean0.99543.88362.81862.61882.60302.5867
sud var1.65641.65751.97901.80391.79231.7938
per mean0.89231.01371.12111.02620.91500.9150
per var0.64030.65060.86640.69560.67690.6771
inc mean0.99412.69942.73212.63952.51892.4948
inc var2.89462.80453.08983.02723.00903.0166
Table 2. Comparison between the mixed-order model and fixed-order models.
Table 2. Comparison between the mixed-order model and fixed-order models.
ModelMixed-Order1-Order2-Order3-Order4-Order5-Order
RMSE0.10300.17890.11020.12150.13510.1393
Table 3. The comparison of the RMSEs for different methods.
Table 3. The comparison of the RMSEs for different methods.
No.Methods200220032004Average RMSE
1Huarng et al.’s method (Use NASDAQ) [31]95.1565.5173.5778.08
2Huarng et al.’s method (Use Dow Jones) [31]93.7372.9573.4980.06
3Huarng et al.’s method (Use M1b) [31]97.1075.2382.0184.78
4Huarng et al.’s method (Use NASDAQ & Dow Jones) [31]93.4865.5173.4977.49
5Huarng et al.’s method (Use NASDAQ & M1b) [31]97.1570.7673.4880.46
6Huarng et al.’s method (Use NASDAQ & Dow Jones & M1b) [31]95.7370.7672.3579.61
7The mixed-order method (with equal intervals and single variable)80.6855.6776.5470.96
8AR (1) model [35]97.0991.6779.9489.57
9AR (2) model [35]89.8066.5860.3372.24
10Chen’s fuzzy time series model [32]101.0074.0084.0086.33
11Univariate conventional regression model [33]116.00329.00146.00 197.00
12Univariate neural network-based fuzzy time series model [33]84.0056.00116.0085.33
13The mixed-order method (univariate)74.0460.5761.4065.34
14Bivariate conventional regression model [33]77.0054.0085.0072.00
15Bivariate neural network-based fuzzy time series model [33]85.0058.0067.0070.00
16Bivariate neural network-based fuzzy time series model use subsitutes [33]80.0058.0067.0068.33
17Chen & Chang’s method (Use NASDAQ) [34]73.0666.3660.4866.63
18Chen & Chang’s method (Use Dow Jones) [34]79.8164.0882.3275.40
19Chen & Chang’s method (Use M1b) [34]96.0690.27100.1095.48
20Chen & Chang’s method (Use NASDAQ & Dow Jones) [34]72.3360.2968.0766.90
21Chen & Chang’s method (Use NASDAQ & M1b) [34]76.4853.5169.2966.43
22Chen & Chen’s method (Use Dow Jones) [36]74.6566.0258.8966.52
23Chen & Chen’s method (Use NASDAQ) [36]71.0165.1461.9466.03
24Chen & Chen’s method (Use M1b) [36]85.8563.1067.2972.08
25Chen & Chen’s method (Use M1b & Dow Jones) [36]77.9660.3265.8668.05
26Chen & Chen’s method (Use M1b & NASDAQ) [36]74.0567.8365.0968.99
27Chen & Chen’s method (Use NASDAQ & Dow Jones & M1b) [36]          77.3860.6565.0967.71
28LSTM-FTS method89.0092.0070.0084.00
29NN-FTS method80.0058.0067.0068.00
30The mixed-order method (use opening price)73.1457.9356.3862.48
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wu, H.; Long, H.; Jiang, J. Mixed-Order Fuzzy Time Series Forecast. Mathematics 2025, 13, 1705. https://doi.org/10.3390/math13111705

AMA Style

Wu H, Long H, Jiang J. Mixed-Order Fuzzy Time Series Forecast. Mathematics. 2025; 13(11):1705. https://doi.org/10.3390/math13111705

Chicago/Turabian Style

Wu, Hao, Haiming Long, and Jiancheng Jiang. 2025. "Mixed-Order Fuzzy Time Series Forecast" Mathematics 13, no. 11: 1705. https://doi.org/10.3390/math13111705

APA Style

Wu, H., Long, H., & Jiang, J. (2025). Mixed-Order Fuzzy Time Series Forecast. Mathematics, 13(11), 1705. https://doi.org/10.3390/math13111705

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop