A Fuzzy Set-Valued Autoregressive Moving Average Model and Its Applications

Wang, Dabuxilatu; Zhang, Liang

doi:10.3390/sym10080324

Open AccessArticle

A Fuzzy Set-Valued Autoregressive Moving Average Model and Its Applications

by

Dabuxilatu Wang

^1,* and

Liang Zhang

²

¹

Department of Statistics, Guangzhou University, No. 230 Waihuanxi Road, Higher Education Mega Center, Guangzhou 510006, China

²

School of Applied Mathematics, Guangdong University of Technology, No. 161 Yinglong Road, Tianhe District, Guangzhou 510520, China

^*

Author to whom correspondence should be addressed.

Symmetry 2018, 10(8), 324; https://doi.org/10.3390/sym10080324

Submission received: 23 June 2018 / Revised: 19 July 2018 / Accepted: 2 August 2018 / Published: 7 August 2018

Download

Browse Figure

Versions Notes

Abstract

Autoregressive moving average (ARMA) models are important in many fields and applications, although they are most widely applied in time series analysis. Expanding the ARMA models to the case of various complex data is arguably one of the more challenging problems in time series analysis and mathematical statistics. In this study, we extended the ARMA model to the case of linguistic data that can be modeled by some symmetric fuzzy sets, and where the relations between the linguistic data of the time series can be considered as the ordinary stochastic correlation rather than fuzzy logical relations. Therefore, the concepts of set-valued or interval-valued random variables can be employed, and the notions of Aumann expectation, Fréchet variance, and covariance, as well as standardized process, were used to construct the ARMA model. We firstly determined that the estimators from the least square estimation of the ARMA (1,1) model under some

L_{2}

distance between two sets are weakly consistent. Moreover, the justified linguistic data-valued ARMA model was applied to forecast the linguistic monthly Hang Seng Index (HSI) as an empirical analysis. The obtained results from the empirical analysis indicate that the accuracy of the prediction produced from the proposed model is better than that produced from the classical one-order, two-order, three-order autoregressive (AR(1), AR(2), AR(3)) models, as well as the (1,1)-order autoregressive moving average (ARMA(1,1)) model.

Keywords:

stochastic process; fuzzy sets; autoregressive model; forecasting

1. Introduction

A time series is a set of observations, each one being recorded at a specified time. Time series analysis has been an important branch of both the stochastic process and mathematical statistics. Various time series can be found in the fields of engineering, science, sociology, and economics. The theory and methods of time series analysis have been extensively developed and achieved great success in the modeling and prediction of time series [1].

There are several famous time series models, such as autoregressive (AR), autoregressive moving average (ARMA), autoregressive integrated moving average (ARIMA), and autoregressive conditional heteroskedasticity (ARCH), which have been proposed for the purpose of future prediction [1]. There is extensive literature on the prediction of the future for some system using these models. For example, Metghalchi et al. proposed testing moving average technical trading rules for the NASDAQ (National Association of Securities Dealers Automated Quatations) composite index. They showed that moving average rules indeed have predictive power and could discern a recurring-price pattern for profitable trading [2]. Li et al. presented an intelligent prediction approach for degradation prognostics of rotating machinery based on an asymmetric penalty sparse decomposition algorithm combined with an autoregressive moving average-recursive least square algorithm (ARMA-RLS) and wavelet neural network [3].

Note that all of the data concerned with the models mentioned above are represented by real numbers or vectors. However, in this big-data era, various complex data have arisen in many fields of sciences and technologies. Among them, the interval-valued data, or more general, the set-valued data, have received great attention in recent years, since they are, in some sense, the extension of incomplete, missing, or censored data. Examples include the interval representing the salary range for a person, the interval representing the range of blood pressure for a person, the range of the weather temperature for a special day in some city, and some data represented by a complex medical image, symmetric color picture, etc. In the system decision-making area, we also face human perception mixed data, such as linguistic data, whose values are not numeric but are words or sentences of some language, some of which can be represented by nearly symmetric fuzzy numbers. We refer to such data as fuzzy data.

Accordingly, in recent years, the stochastic processes with set-valued members have received attention in the literature. Li et al. [4] considered fuzzy set-valued Gaussian processes and Brownian motions, in which the classical Gaussian stochastic process was extended to a case where the process elements are allowed to take values of fuzzy sets, and a new fuzzy Brownian motion was firstly introduced. Bongiorno [5] presented a note on the former Brownian motion, where it was pointed out that the former fuzzy set-valued Brownian motion can be handled by an n-dimensional vector-valued Wiener process, since the expectation of the fuzzy set-valued element is a constant. Furthermore, Wang et al. [6] firstly proposed an interval-valued stationary time series modeling approach, in which an interval-valued p-order autoregressive (AR(p)) model was proposed. Note that, here, they did not considered the stochastic process or time series with linguistic data. These works raise the possibility that some extension of time series modeling [1] to linguistic data (perception mixed data) could be realized under the consideration of ordinary stochastic correlation between the elements of the time series process.

We are aware that interval-valued or linguistic-valued data benefit from having a higher volume of information compared to real number-valued data. For instance, finance and economics are far from being free from imprecision or uncertainty. In the process of reducing some economy-related quantities and magnitudes to numbers and mathematical concepts, we have to deal with a wealth of vague terms (confidence, fear, instability, risk, etc.) which are meaningful for us. For example, a set of stocks with small volatility or countries with high unemployment rates are not crisp descriptions, since the words “small” and “high” are vague in meaning, reflecting a judgment of the observers for the observed objects based on their own perception. Also, the investor’s expected values of the future returns for investments are often given in a linguistic form such as “very optimum”, “around the values of last year’s return”, “may at least cover the cost”, etc. One typical feature of the linguistic data is that the data are characterized with fuzziness, therefore, it is often recommended to employ the fuzzy sets to model the linguistic data. Using a fuzzy set to model linguistic data is meaningful: the fuzzy set is not only easier to apply than words in mathematical modeling, but it also embraces more information with respect to the empirical judgment, as well as the emotional reaction of the human, than that of real numbers.

It has been demonstrated that the extension of time series models to the case of linguistic data (fuzzy data) was developed along two lines—parametric methods and nonparametric methods—in the literature.

When the parametric method is applied, the form of the original time series models is not changed; instead of the original real number-valued data, the linguistic data and their arithmetic operations are used. Such work can be found in Wang [7], in which the authors primarily proposed a special conceptualized p-order autoregressive model AR(p) (where p is a positive integer and

p \geq 1

) with n-dimensional fuzzy data [8] in the way of the set-valued stochastic process, wherein the semi-linear structure of the space of all fuzzy sets, the expectation, variance, and covariance of fuzzy random variables ([9]) are considered for the construction of the model. However, there was no work on the model’s estimation. Wang [10] further noted that former autoregressive models contain some deficiencies, so the model was complemented with an ARMA model and its primary application in financial market forecasting was proposed. Jung et al. [11] also considered a unified approach to asymptotic behavior for parameter estimation for an AR(1) model of a fuzzy number-valued time series, where a brief outline on the modeling of time series with fuzzy number inputs and fuzzy number outputs was given. An illustrative example of the AR(1) model with fuzzy numbers is that of the Dow Jones Industrial Average (DJI) index time series [11]. A significant advantage of the parametric methods is that the original natural relationships between the elements of the time series are maintained and investigated during the modeling.

When the nonparametric method is applied, we not only change the form of the original time series models, but also replace the original data with linguistic data (fuzzy data). There are a number of studies on this topic, which is called a fuzzy time series. For instance, in [12,13], the fuzzy time series were firstly proposed as a series with elements taking the values of linguistic or vaguely described data, and the elements can be linked with each other using fuzzy logical relationships that need to be given subjectively by a human. Various improvements and developments on the above fuzzy logical relationship-based fuzzy time series were given by [14,15,16,17], and others, where more effective forecasting models, such as two-factor high-order fuzzy time series forecasting, deterministic vector long-term forecasting, etc., were proposed. The fuzzy logical relationship-based fuzzy time series modeling methods are largely based on intellectual computing, such as the fuzzy relational equations and approximate reasoning. It should be pointed out that such soft computing methods may optimally capture the fuzzy information involved in the elements of the time series, however, the natural stochastic relationships between the elements of the time series are completely ignored, which may lead not only to a biased prediction for the future when we apply the fuzzy time series models for forecasting, but also to a disdain for investigating the mathematical statistical properties of the time series.

Our main interests are in the parametric methods for modeling the time series with linguistic data mentioned above, where the obtained previous results are reviewed. We are aware that there are several fundamental problems, such as parameter estimation (model estimation), asymptotic properties of the estimators, etc., which remain to be investigated further. For instance, parameter estimation has been carried out only for the AR(1) and ARMA(1,1) models with fuzzy data [10,11], and the asymptotic properties (consistency properties) of the estimators have been obtained only for the AR(1) model with fuzzy data [11]. In this study, based on previous works [7,10,11], we firstly investigated the asymptotic properties of the estimators for a (1,1)-order autoregressive moving average model ARMA(1,1) based on linguistic data (fuzzy data), then used the justified ARMA(1,1) model to forecast the future of the HSI with a simulation analysis.

This article proceeds as follows. In Section 1, the related previous work and some existing problems are discussed. Section 2 introduces the basic concepts of fuzzy sets, arithmetic operations for fuzzy sets, correlation, and independence, as well as expectation and Fréchet variance, and covariance under the

L_{2}

metric

δ_{2}

(proposed by Näther [9]) for fuzzy random variables. In Section 3, the asymptotic properties for a special ARMA model for fuzzy data-valued time series with standardized terms is described, and some extension of the classical results on causality for the ARMA models is presented. In Section 4, an empirical analysis of the proposed models in the linguistic monthly HSI time series modeling and prediction is detailed. In Section 5, we present a conclusion for this article.

2. Preliminaries

2.1. Fuzzy Set on $R^{n}$

The development of the concept of fuzzy sets was motivated by the need to efficiently process ambiguous information, human natural language, as well as human decision problems. A fuzzy set

\tilde{u}

of

R^{n}

is equivalent to its membership function

\tilde{u} : R^{n} \to [0, 1]

, where the number

\tilde{u} (x)

represents the degree of membership at which x belongs to

\tilde{u}

. By

F (R^{n})

, we denote the collection of all normal, convex, and compact fuzzy sets on

R^{n}

, i.e., for

\tilde{u} \in F (R^{n})

,

(1): There exists $x_{0} \in R^{n}$ such that $\tilde{u} (x_{0}) = 1$ ;
(2): The $α$ -cut of $\tilde{u}$ , ${\tilde{u}}_{α} : = {x \in R^{n} : \tilde{u} (x) \geq α}$ , $α \in (0, 1]$ , is a convex and compact set of $R^{n}$ ;
(3): ${\tilde{u}}_{0} : = c l {x \in R^{n} : \tilde{u} (x) > 0}$ , the support of $\tilde{u}$ , is compact.

If

n = 1

, then the fuzzy set of

R

is said to be a fuzzy number.

Zadeh’s extension principle [18,19] allows us to apply addition and scalar multiplication on

F (R^{n})

:

(\tilde{u} + \tilde{v}) (x) = \sup_{s + t = x} \min (\tilde{u} (s), \tilde{v} (t)), x \in R^{n} .

(1)

(a \tilde{u}) (x) = \{\begin{matrix} \tilde{u} (\frac{x}{a}), a \neq 0 \\ 0, a = 0 \end{matrix} a \in R,

(2)

and for any

a, b \in R

, the following holds:

(a b) \tilde{u} = a (b \tilde{u}), a (\tilde{u} + \tilde{v}) = (a \tilde{u}) + (a \tilde{v}) .

(3)

However, it holds only for

a b \geq 0, a, b \in R

(a + b) \tilde{u} = (a \tilde{u}) + (b \tilde{u}) .

(4)

It indicates that

(F (R^{n}), +, \cdot)

is not a linear space. With Minkowski’s set operation, it holds that

{(\tilde{u} + \tilde{v})}_{α} = {\tilde{u}}_{α} + {\tilde{v}}_{α}, α \in (0, 1] .

(5)

{(a \tilde{u})}_{α} = a {\tilde{u}}_{α}, α \in (0, 1] .

(6)

A support function of

\tilde{u} \in F (R^{n})

is defined as

S_{{\tilde{u}}_{α}} (x) = \{\begin{matrix} \sup_{t \in {\tilde{u}}_{α}} {x \cdot t}, & α \in (0, 1], \\ 0, & α = 0 . \end{matrix} x \in S^{n - 1} = {x : ∥ x ∥ = 1},

(7)

where · denotes the inner product in the Euclidean space

R^{n}

. It holds that for

\tilde{u}, \tilde{v} \in F (R^{n})

and

a \in R

,

S_{{(\tilde{u} + \tilde{v})}_{α}} = S_{{\tilde{u}}_{α}} + S_{{\tilde{v}}_{α}} .

(8)

S_{{(a \tilde{u})}_{α}} (x) = a S_{{\tilde{u}}_{α}} (x), a > 0; S_{{(a \tilde{u})}_{α}} (x) = - a S_{{\tilde{u}}_{α}} (- x), a < 0 .

(9)

It holds that

S_{{((a \tilde{u}) + (b \tilde{v}))}_{α}} (x) = \{\begin{matrix} (a S_{{\tilde{u}}_{α}} + b S_{{\tilde{v}}_{α}}) (x), & a, b > 0 \\ - (a S_{{\tilde{u}}_{α}} + b S_{{\tilde{v}}_{α}}) (- x), & a, b < 0 . \end{matrix}

(10)

where

α \in [0, 1]

. Thus, the semi-linear map

S : F (R^{n}) \to L^{2} (S^{n - 1} \times [0, 1])

,

\tilde{u} \mapsto S_{{\tilde{u}}_{α}} (x)

makes us view the fuzzy set

\tilde{u}

as a support function equivalently, i.e., the map S embeds

F (R^{n})

into a cone of functional Hilbert space [20].

Remark 1.

For modeling the fuzzy set-valued time series, the distance between two fuzzy sets needs to be clarified. It is well known that the distance between two real numbers or vectors is an important notion that measures the differences of the two numbers or vectors. Because the fuzzy set is a set, the distance between two fuzzy sets can be a distance between two sets [18]. There are many definitions of distances proposed for fuzzy sets, such as

d_{H}

,

d_{p}

,

d_{\infty}

,

ρ_{p}

, ρ, etc., defined on

F (R^{n})

[9,11,18], i.e.,

d_{H} (A, B) = \max {\sup_{a \in A} \inf_{b \in B} ∥ a - b ∥, \sup_{b \in B} \inf_{a \in A} ∥ a - b ∥},

(11)

where

A, B

are nonempty subsets of

R^{n}

,

∥ a - b ∥ = \sqrt{\sum_{i = 1}^{n} {(a_{i} - b_{i})}^{2}}, (a_{1}, \dots, a_{n}), (b_{1}, \dots, b_{n}) \in R^{n} .

(12)

For

\tilde{u}, \tilde{v} \in F (R^{n})

,

d_{p} (\tilde{u}, \tilde{v}) = {(\int_{0}^{1} {(d_{H} ({\tilde{u}}_{α}, {\tilde{v}}_{α}))}^{p} d α)}^{1 / p}, 1 \leq p < \infty .

(13)

d_{\infty} (\tilde{u}, \tilde{v}) = \sup_{α \in (0, 1]} {d_{H} ({\tilde{u}}_{α}, {\tilde{v}}_{α})},

(14)

ρ_{p} (\tilde{u}, \tilde{v}) = [\int_{0}^{1} (\int_{S^{n - 1}} | S_{{\tilde{u}}_{α}} (x) - S_{{\tilde{v}}_{α}} (x) |^{p} μ (d x) {) d α]}^{1 / p}, 1 \leq p < \infty .

(15)

ρ^{2} (\tilde{u}, \tilde{v}) = \int_{Z} (S_{{\tilde{u}}_{α}} (x) - S_{{\tilde{v}}_{α}} (x)) (S_{{\tilde{u}}_{β}} (y) - S_{{\tilde{v}}_{β}} (y)) d K (x, α, y, β),

(16)

where

Z = S^{n - 1} \times [0, 1] \times S^{n - 1} \times [0, 1]

and K is a symmetric positive definite kernel.

Some of sets are much too complicated in regard to the computation of distances [18]. In a practical application, for example, in system decision making, the human’s linguistic judgment or perception of the concerned items can be represented by a fuzzy number, and the distances for such fuzzy numbers should be chosen while considering the ease of computation. In the modeling of time series with fuzzy data, using different metrics, we may obtain different results from the models applied to the problems of interest.

In this work, we used a special distance between

\tilde{u}, \tilde{v} \in F (R^{n})

, defined by the

L_{2}

metric

δ_{2}

, which is a standard distance with ease of computation, and is widespread in applications using fuzzy data modeling.

δ_{2} (\tilde{u}, \tilde{v}) : = {(n \int_{0}^{1} \int_{S^{n - 1}} {| S_{{\tilde{u}}_{α}} (x) - S_{{\tilde{v}}_{α}} (x) |}^{2} μ (d x) d α)}^{1 / 2},

(17)

and let

〈 \tilde{u}, \tilde{v} 〉 : = n \int_{0}^{1} \int_{S^{n - 1}} S_{{\tilde{u}}_{α}} (x) S_{{\tilde{v}}_{α}} (x) μ (d x) d α .

(18)

where μ is a normalized Lebesgue measure.

(F (R^{n}, δ_{2})

is a complete and separable metric space [9,18].

The Hukuhara difference

-_{h}

between two fuzzy sets [21] is defined as follows. Let

\tilde{u}, \tilde{v} \in F (R^{n})

. If there exists a

\tilde{s} \in F (R^{n})

with

\tilde{u} = \tilde{v} + \tilde{s}

, then

\tilde{s}

is said to be the Hukuhara difference between

\tilde{u}, \tilde{v}

, and it is denoted by

\tilde{s} : = \tilde{u} -_{h} \tilde{v}

. The Hukuhara difference possesses good properties for the operation of the difference between sets. For

\tilde{u}, \tilde{v} \in F (R^{n})

, it holds that

(1): $\tilde{u} -_{h} \tilde{u} = {0};$
(2): $(\tilde{u} + \tilde{v}) -_{h} \tilde{v} = \tilde{u};$
(3): $\tilde{u} = \tilde{v}$ if and only if $\tilde{u} -_{h} \tilde{v} = \tilde{v} -_{h} \tilde{u} = {0}$ ;
(4): $S_{{\tilde{u}}_{α} -_{h} {\tilde{v}}_{α}} = S_{{\tilde{u}}_{α}} - S_{{\tilde{v}}_{α}}$ .

For more properties of the Hukuhara difference, the readers are refereed to Stifanini [22].

Note that

\tilde{u} - \tilde{v}

for

\tilde{u}, \tilde{v} \in F (R^{n})

is a fuzzy arithmetic and is based on Zadeh’s extension principle, which is different from the Hukuhara difference.

2.2. Fuzzy Random Variables (FRVs)

The concept of FRVs was inspired by the attempt to model the randomness and fuzziness that exist in real-life phenomena simultaneously. Typically, there are two kinds of FRVs: the FRVs of Kwakernaak—Kruse and Meyer [23,24] and of Puri-Ralescu [8]. The former is devoted to modeling the human vague perception of a random variable (the original) [24], and the latter is for modeling the completely fuzzy random phenomena of real life [9,23]. Both FRVs are mathematically equivalent to each other when they are valued in

F (R)

, and there are no appropriate distributional models for FRVs [9,23].

Remark 2.

Ref. [8] Let

(Ω, A, P)

be a complete probability space. The mapping

\tilde{X} : Ω \to F (R^{n})

is said to be a fuzzy random variable (FRV) if

\tilde{X}

is

A - B

measurable, i.e., for any measurable subset

B \subset R^{n}

,

{ω | {\tilde{X}}_{α} (ω) \cap B \neq \emptyset} \in A

, where

B

is a σ-algebra on

R^{n}

induced by

\tilde{X}

associated with the concerned metric, and

{\tilde{X}}_{α} (ω) : = {x \in R^{n} | \tilde{X} (ω) \geq α} = {[\tilde{X} (ω)]}_{α}, α \in [0, 1], ω \in Ω .

In the following, we assume that the FRV

\tilde{X}

is second order under the metric

δ_{2}

, i.e.,

E (∥ \tilde{X} ∥^{2}) : = E (δ_{2}^{2} (\tilde{X}, {0})) < + \infty .

(19)

This condition can ensure the existence of second moments for FRVs [9].

Remark 3.

([8] Aumann expectation) The expectation

E \tilde{X}

of an FRV

\tilde{X}

is a normal compact fuzzy set of

R^{n}

with the property that

{(E \tilde{X})}_{α} = E {\tilde{X}}_{α}, α \in [0, 1]

, where

E {\tilde{X}}_{α}

is the Aumann expectation of the random set

{\tilde{X}}_{α}

, i.e.,

E {\tilde{X}}_{α} = {E η | η (ω) \in {\tilde{X}}_{α} (ω), a . e ., η \in L (Ω, R)},

(20)

where

L (Ω, R)

is the set of all real random variables with the existing expectation defined on Ω, a.e. means almost everywhere.

Let

\tilde{X}

be an FRV, then

S_{{\tilde{X}}_{α}}

is a random element and

E (S_{{\tilde{X}}_{α}}) = S_{E ({\tilde{X}}_{α})}

[9,21] if the Aumann expectation

E ({\tilde{X}}_{α})

exists,

α \in [0, 1]

([8,23]).

The concepts of variance and covariance take an important role in stochastic analysis and statistical modeling, and they have been extended to FRVs in several different ways. Based on the extension principle, Kruse and Meyer [24] proposed a kind of fuzzy variance and covariance for FRVs, which seems to be weak from the aspect of keeping the original essence of the variance and covariance. In recent years, it was advocated to propose definitions in which the essence of variance and covariance is kept for FRVs, which means that the variance of an FRV is an accurate measurement of the spread or dispersion of the FRV with its mean, and the covariance or the correlation coefficient of two FRVs must measure their linear interdependence, so they should have no fuzziness [9,25].

Remark 4.

The Fréchet variance of FRV

\tilde{X}

w.r.t the distance

δ_{2}

is defined by

V a r (\tilde{X}) : = E (δ_{2}^{2} (\tilde{X}, E (\tilde{X}))) = n \int_{0}^{1} \int_{S^{n - 1}} V a r (S_{{\tilde{X}}_{α}} (x)) μ (d x) d α .

(21)

Remark 5.

The Fréchet covariance of two FRVs

\tilde{X}, \tilde{Y}

w.r.t. the distance

δ_{2}

is defined by

C o v (\tilde{X}, \tilde{Y}) : = n \int_{0}^{1} \int_{S^{n - 1}} C o v (S_{{\tilde{X}}_{α}} (x), S_{{\tilde{Y}}_{α}} (x)) μ (d x) d α .

(22)

Then, the usual classical form

C o v (\tilde{X}, \tilde{Y}) = E 〈 \tilde{X}, \tilde{Y} 〉 - 〈 E \tilde{X}, E \tilde{Y} 〉,

(23)

V a r (\tilde{X}) = E 〈 \tilde{X}, \tilde{X} 〉 - 〈 E \tilde{X}, E \tilde{X} 〉,

(24)

hold. In the case of

n = 1

, since the normalized Lebesgue measure

μ (S^{0}) = 1

, and

S^{0} = {- 1, 1}

is symmetric, then

μ (- 1) = μ (1) = \frac{1}{2}

, and we have

V a r (\tilde{X}) = \frac{1}{2} \int_{0}^{1} (V a r (\inf {\tilde{X}}_{α}) + V a r (\sup {\tilde{X}}_{α})) d α .

(25)

C o v (\tilde{X}, \tilde{Y}) = \frac{1}{2} \int_{0}^{1} (C o v (\inf {\tilde{X}}_{α}, \inf {\tilde{Y}}_{α}) + C o v (\sup {\tilde{X}}_{α}, \sup {\tilde{Y}}_{α})) d α .

(26)

This just coincides with the definitions of variance and covariance for a one-dimensional FRV, proposed by Feng et al. [26], which indicates that Feng’s definitions are a special case of Remark 4 and Remark 5 above.

Lemma 1.

([25]) Let

\tilde{X}

and

\tilde{Y}

be two FRVs with second order under the metric

δ_{2}

, then

(1): $V a r (\tilde{u}) = 0;$
(2): $V a r (a \tilde{X} + b \tilde{Y}) = a^{2} V a r (\tilde{X}) + b^{2} V a r (\tilde{Y}) + 2 a b C o v (\tilde{X}, \tilde{Y}), a b \geq 0, a, b \in R;$
(3): $V a r (a ξ) = {∥ a ∥}^{2} V a r ξ, a \in R^{n}, r . v . ξ \geq 0;$
(4): $C o v ((a \tilde{X}) + (b \tilde{Y}), c \tilde{Z}) = a c C o v (\tilde{X}, \tilde{Z}) + b c C o v (\tilde{Y}, \tilde{Z}), a c \geq 0, b c \geq 0, a, b, c \in R;$
(5): $C o v ((a \tilde{X}) + \tilde{u}, b \tilde{Y} + \tilde{v}) = a b C o v (\tilde{X}, \tilde{Y}), a b \geq 0, a, b \in R, \tilde{u}, \tilde{v} \in F (R^{n}) .$

Definition 1.

The fraction

R (\tilde{X}, \tilde{Y}) = C o v (\tilde{X}, \tilde{Y}) / \sqrt{V a r (\tilde{X}) V a r (\tilde{Y})},

(27)

a normalized Fréchet covariance, is said to be the Fréchet correlation coefficient of two FRVs.

\tilde{X}, \tilde{Y}

, where

V a r (\tilde{X}) > 0

,

V a r (\tilde{Y}) > 0

.

The independence of FRVs can follow from the independence of the random elements, which is already defined by [9].

Lemma 2.

([25]) Let

\tilde{X}

and

\tilde{Y}

be two FRVs with second order under the metric

δ_{2}

, then

(1): if $\tilde{X}$ and $\tilde{Y}$ are independent, then $C o v (\tilde{X}, \tilde{Y}) = 0$ ;
(2): $| R (\tilde{X}, \tilde{Y}) | \leq 1$ ;
(3): $R (\tilde{X}, \tilde{Y}) = 1$ if and only if $\tilde{Y} + (λ E \tilde{X}) = E \tilde{Y} + (λ \tilde{X})$ , a.e., $R (\tilde{X}, \tilde{Y}) = - 1$ if and only if $\tilde{Y} + (λ \tilde{X}) = E \tilde{Y} + (λ E \tilde{X})$ , a.e., where $λ = \sqrt{V a r \tilde{Y} / V a r \tilde{X}}, V a r (\tilde{X}) > 0, V a r (\tilde{Y}) > 0 .$

FRVs

\tilde{X}

and

\tilde{Y}

are said to be uncorrelated if

R (\tilde{X}, \tilde{Y}) = 0

. If

0 < | R (\tilde{X}, \tilde{Y}) | < 1

, then there may exist some weak linear dependent relations between

\tilde{X}

and

\tilde{Y}

.

Now we consider convergence properties of a sequence of FRVs with second order under the metric

δ_{2}

. Note that Feng [26] has already considered some convergence problems under the metric

d_{p}

for a sequence of one-dimensional FRVs, and under the metric

d_{\infty}

for a sequence of n-dimensional FRVs. Let

{{\tilde{X}}_{n}}

be a sequence of FRVs with second order, and it is thus

\tilde{X}

under the metric

δ_{2}

,

δ_{2} ({\tilde{X}}_{n}, \tilde{X})

is a random variable. Define

D_{2} ({\tilde{X}}_{n}, \tilde{X}) : = {[E (δ_{2}^{2} ({\tilde{X}}_{n}, \tilde{X}))]}^{1 / 2}

. If

D_{2} ({\tilde{X}}_{n}, \tilde{X}) \to 0 (n \to \infty)

, then it is said that the sequence

{{\tilde{X}}_{n}}

of FRVs converges to FRV

\tilde{X}

in mean square under metric

δ_{2}

. If

δ_{2}^{2} ({\tilde{X}}_{n}, \tilde{X})) \to 0 (n \to \infty)

in probability, then it is said that the sequence

{{\tilde{X}}_{n}}

of FRVs converges to FRV

\tilde{X}

in probability as

n \to \infty

. Referring to [26], we the have following theorem.

Theorem 1.

Let

{{\tilde{X}}_{n}}

be a sequence of FRVs with second order, and it is thus

\tilde{X}

under the metric

δ_{2}

, then the following conditions are equivalent:

(1): $D_{2} ({\tilde{X}}_{n}, \tilde{X}) \to 0 (n \to \infty)$ ;
(2): ${{\tilde{X}}_{n}}$ is a Cauchy sequence, i.e., $\lim_{m, l \to \infty} D_{2} ({\tilde{X}}_{m}, {\tilde{X}}_{l}) = 0$ ;
(3): The series ${∥ {\tilde{X}}_{n} ∥^{2}, n \geq 1}$ of random variables is uniformly integrable and $δ_{2} ({\tilde{X}}_{n}, \tilde{X})) \to 0 (n \to \infty)$ in probability.

Proof.

(1)⇒ (2): Since

D_{2} ({\tilde{X}}_{n}, \tilde{X}) \to 0 (n \to \infty)

,

D_{2} ({\tilde{X}}_{m}, \tilde{X}) \to 0 (m \to \infty)

, then from the triangle inequality of

D_{2}

, we have

D_{2} ({\tilde{X}}_{n}, {\tilde{X}}_{m}) \leq D_{2} ({\tilde{X}}_{n}, \tilde{X}) + D_{2} (\tilde{X}, {\tilde{X}}_{m}) \to 0 (n, m \to \infty) .

(28)

(2) ⇒ (3): Since

D_{2} ({\tilde{X}}_{n}, {\tilde{X}}_{m}) \to 0 (m, n \to \infty)

, which is equivalent to

D_{2}^{2} ({\tilde{X}}_{n}, {\tilde{X}}_{m}) \to 0 (m, n \to \infty)

. Using Markov inequality, we have

P (δ_{2}^{2} ({\tilde{X}}_{n}, {\tilde{X}}_{m}) \geq ϵ) \leq \frac{D_{2}^{2} ({\tilde{X}}_{n}, {\tilde{X}}_{m})}{ϵ} \to 0 (m, n \to \infty),

(29)

for any

ϵ > 0

, which means

δ_{2}^{2} ({\tilde{X}}_{n}, {\tilde{X}}_{m}) \to^{P} 0

, as

n, m \to \infty

, i.e.,

{{\tilde{X}}_{n}}

is a Cauchy sequence of FRVs under the metric

δ_{2}

in probability. From the completeness of the space

(F (R^{n}), δ_{2})

, the sequence

{{\tilde{X}}_{n}}

has a limit valued in

F (R^{n})

under the metric

δ_{2}

in probability, thus the limit is an FRV, we denote it by

\tilde{X}

, and we have

δ_{2} ({\tilde{X}}_{n}, \tilde{X}) \to^{P} 0

as

n \to \infty

. Since each

{\tilde{X}}_{n}

is of second order, it is obvious that

∥ {\tilde{X}}_{n} ∥^{2}

is uniformly integrable.

(3) ⇒ (1): Since

δ_{2} ({\tilde{X}}_{n}, \tilde{X}) \to^{P} 0

as

n \to \infty

, and we have

P (| δ_{2} ({\tilde{X}}_{n}, {0}) - δ_{2} ({0}, \tilde{X}) | \leq δ_{2} ({\tilde{X}}_{n}, \tilde{X}) < ϵ) \to 1 (n \to \infty),

(30)

i.e.,

∥ {\tilde{X}}_{n} ∥ \to^{P} ∥ \tilde{X} ∥

as

n \to \infty

. By Fatou’s lemma,

E ∥ \tilde{X} ∥^{2} \leq \lim_{n \to \infty} \inf E ∥ {\tilde{X}}_{n} ∥^{2} \leq \sup_{n} E {∥ {\tilde{X}}_{n} ∥}^{2} < \infty .

(31)

The uniform integrability of

{∥ \tilde{X_{n}} ∥^{2}} (n \geq 1)

, and the inequality

δ_{2}^{2} ({\tilde{X}}_{n}, \tilde{X}) \leq 2 (∥ {\tilde{X}}_{n} ∥^{2} + ∥ \tilde{X} ∥^{2})

imply that

{δ_{2}^{2} ({\tilde{X}}_{n}, \tilde{X})} (n \geq 1)

is uniformly integrable. Using the dominated convergence theorem, we have

\lim_{n \to \infty} D_{2}^{2} ({\tilde{X}}_{n}, \tilde{X}) = \lim_{n \to \infty} E (δ_{2}^{2} ({\tilde{X}}_{n}, \tilde{X})) = 0 .

(32)

☐

Theorem 2.

Let

{{\tilde{X}}_{n}}

be a sequence of FRVs with second order under the metric

δ_{2}

and

\sup_{n} E {∥ {\tilde{X}}_{n} ∥}^{2} < \infty

,

{b_{j}}

is a non-negative number series satisfying

\sum_{j = 0}^{\infty} b_{j}^{2} < \infty

, then the infinite semi-linear sum of FRVs

\sum_{j = 0}^{\infty} b_{j} {\tilde{X}}_{j}

converges in probability under the metric

δ_{2}

.

Proof.

(1) We prove that

\sum_{j = 0}^{\infty} b_{j} {\tilde{X}}_{j}

is integrable bounded.

\begin{matrix} E ∥ \sum_{j = 0}^{\infty} b_{j} {\tilde{X}}_{j} ∥ = \lim_{r \to \infty} E ∥ \sum_{j = 0}^{r} b_{j} {\tilde{X}}_{j} ∥ \\ = \lim_{r \to \infty} E [δ_{2}^{2} (\sum_{j = 0}^{r} b_{j} {\tilde{X}}_{j}, {0})] \\ = \lim_{r \to \infty} E [n \int_{0}^{1} \int_{S^{n - 1}} {(S_{{(\sum_{j = 0}^{r} (b_{j} {\tilde{X}}_{j}))}_{α}} (x) - S_{{0}} (x))}^{2} μ (d x) d α] \\ = \lim_{r \to \infty} E [n \int_{0}^{1} \int_{S^{n - 1}} {(b_{0} S_{{\tilde{X}}_{0 α}} + b_{1} S_{{\tilde{X}}_{1 α}} + \dots + b_{r} S_{{\tilde{X}}_{r α}})}^{2} μ (d x) d α] \\ = \lim_{r \to \infty} \sum_{j = 0}^{r} b_{j}^{2} E ∥ {\tilde{X}}_{j} ∥ + \lim_{r \to \infty} \sum_{i, l = 0, i \neq l}^{r} b_{i} b_{l} E 〈 {\tilde{X}}_{i}, {\tilde{X}}_{l} 〉 \\ \leq (\lim_{r \to \infty} \sum_{j = 0}^{r} b_{j}^{2}) \sup_{j} E ∥ {\tilde{X}}_{j} ∥^{2} + \lim_{r \to \infty} (\sum_{i, l = 0, i \neq l}^{r} \frac{b_{i}^{2} + b_{l}^{2}}{2}) \sup_{j} E {∥ {\tilde{X}}_{j} ∥}^{2} < \infty . \end{matrix}

(2) We prove that

{W_{n} = \sum_{j = 0}^{n} b_{j} {\tilde{X}}_{j}}

is a Cauchy sequence under the metric

D_{2}

.

\begin{matrix} \lim_{r, l \to \infty} D_{2}^{2} (W_{r}, W_{l}) = \lim_{r, l \to \infty} E (δ_{2}^{2} (W_{r}, W_{l})) \\ = \lim_{r, l \to \infty} E [n \int_{0}^{1} \int_{S^{n - 1}} {(S_{{(\sum_{j = 0}^{r} b_{j} {\tilde{X}}_{j})}_{α}} (x) - S_{{(\sum_{j = 0}^{l} b_{j} {\tilde{X}}_{j})}_{α}} (x))}^{2} μ (d x) d α] \\ = \lim_{r, l \to \infty} E [n \int_{0}^{1} \int_{S^{n - 1}} {(\sum_{j = l + 1}^{r} b_{j} S_{{\tilde{X}}_{r α}})}^{2} μ (d x) d α] \\ \leq \lim_{r, l \to \infty} [\sum_{j = l + 1}^{r} b_{j}^{2} + \sum_{i, s = l + 1, i \neq s}^{r} \frac{b_{i}^{2} + b_{s}^{2}}{2}] \sup_{j} E {∥ {\tilde{X}}_{j} ∥}^{2} \\ = 0 \sup_{j} E {∥ {\tilde{X}}_{j} ∥}^{2} = 0, \end{matrix}

since

\lim_{j \to \infty} b_{j}^{2} = 0 .

From Theorem 1, we determine that

{W_{n}}

converges to some FRV W in probability under the metric

δ_{2}

, i.e.,

\sum_{j = 0}^{\infty} b_{j} {\tilde{X}}_{j}

converges in probability under the metric

δ_{2}

. ☐

Remark 6.

In this paper, the expectation, variance, and covariance, as well as the correlation values of FRVs and the convergence of a sequence of FRVs, are defined under the metric

δ_{2}

only. This is obviously a special case from a general FRV point of view. Thus, our concerned autoregressive models for fuzzy data-valued time series are special ones. For obtaining more general models, we may further consider a bit more general metric on

F (R^{n})

.

3. A Fuzzy Set Valued ARMA Model Based on a Standardized Process

Based on the concepts of the Fréchet covariance and Fréchet linear correlation for the FRVs defined in the former section, we consider some autoregressive models for fuzzy data-valued time series. In a real-world situation, one may perceive such a process as a sequence of investment approximate returns by time. Even the observers timely evaluations on some stock prices may also form such a time series. Note that an example of autoregressive sequence of one-dimensional FRVs and the related correlation function had already been proposed by Feng et al. [26].

Definition 2.

Let

{{\tilde{X}}_{t}} (t \in Z)

be a process of FRVs valued in

F (R^{n})

with second order under the metric

δ_{2}

. If t denotes the time points, then

{{\tilde{X}}_{t}}

is said to be a fuzzy data valued time series. The Fréchet covariance function

C (l, s)

of the process

{{\tilde{X}}_{t}} (t \in Z)

is defined by

C (l, s) = C o v ({\tilde{X}}_{l}, {\tilde{X}}_{s})

,

l, s \in Z

. The process

{{\tilde{X}}_{t}} (t \in Z)

is said to be wide-sense (weakly) stationary if it holds that

C (l, s) = C o v ({\tilde{X}}_{t + l}, {\tilde{X}}_{t + s}), E ({\tilde{X}}_{t}), t, l, s \in Z,

are independent of t, where

Z

is the set of all integers.

Note that for a wide-sense stationary fuzzy data-valued time series, the Fréchet covariance function can be simply denoted by

C (h) : = C (h, 0)

, since

C (l, s) = C (l - s, 0)

.

Example 1.

For a process

{{\tilde{ξ}}_{t}, t \in Z}

of Gaussian FRVs ([27]):

{\tilde{ξ}}_{t} = E ({\tilde{ξ}}_{t}) + ξ_{t}

, where random vector

ξ_{t} \sim N_{n} (0, Σ)

, an n-dimensional Gaussian distribution with zero mean vector, the Fréchet covariance function can be carried out as

C (t, s) = C o v (ξ_{t}, ξ_{s}) = n \int_{S^{n - 1}} (\sum_{i = 1}^{n} \sum_{j = 1}^{n} x_{i} x_{j} c o v (ξ_{t_{i}}, ξ_{s_{j}})) μ (d x),

(33)

where

x = (x_{1}, \dots, x_{n}) \in S^{n - 1}, ξ_{t} = (ξ_{t_{1}}, \dots, ξ_{t_{n}}), ξ_{s} = (ξ_{s_{1}}, \dots, ξ_{s_{n}})

are real-valued n-dimensional random vectors with multivariate Gaussian distribution

N_{n} (0, Σ)

, and

c o v (ξ_{t_{i}}, ξ_{s_{j}})

is the classical covariance of random variables

ξ_{t_{i}}, ξ_{s_{j}} .

It is obvious that a process

{{\tilde{ξ}}_{t}, t \in Z}

of Gaussian FRVs is mutually uncorrelated in the sense of the Fréchet correlation if and only if the process

{ξ_{t}}

of the Gaussian random vectors is mutually uncorrelated in the sense of the Fréchet correlation. Note that the Fréchet correlation between two random vectors is different from the conventional concept of correlation of two random vectors in multivariate statistics; the former depends on the Fréchet covariance, whereas the latter depends on the ordinary covariance matrix. Also, in this example, we can determine that the wide-sense stationarity of the process of Gaussian FRVs is equivalent to the wide-sense stationarity of the process of Gaussian random vectors.

In the following, we consider a special error term process, which may help us to propose an applicable ARMA model with fuzzy data in the area of financial data analysis.

Definition 3.

([10]) Let

{{\tilde{w}}_{t}} (t \in Z)

be a process of fuzzy random sets valued in

F (R^{n})

with second order under the metric

δ_{2}

.

{{\tilde{w}}_{t}} (t \in Z)

is said to be a standardized process of FRVs if it holds that

C o v ({\tilde{w}}_{t + h}, {\tilde{w}}_{t}) = \{\begin{matrix} 1, & h = 0, \\ 0, & h \neq 0, \end{matrix}

(34)

where

t, h \in Z .

Obviously, a standardized process

{{\tilde{w}}_{t}} (t \in Z)

of FRVs is wide-sense stationary.

Sometimes, a standardized process of fuzzy random sets

{{\tilde{w}}_{t}} (t \in Z)

can be viewed in the sense of a white noise process, i.e., a fuzzy observation on a conventional white noise process, which means that if

ε_{t}

is a term of a white noise process

{ε_{t}}, t \in Z

, then

{\tilde{w}}_{t}

can be viewed as some fuzzy observation on

ε_{t}

satisfying the membership value

{\tilde{w}}_{t} (ε_{t}) = 1

. Note that, in general,

{\tilde{w}}_{t}

is not unique, as it depends on the observers’ opinions, and different observers may set different membership functions

{\tilde{w}}_{t}

.

In the one-dimensional case, we present a standardized process of FRVs based on a real-valued white noise process. However, it is difficult to give a standardized process of FRVs in an n-dimensional case (

n \geq 2

).

Example 2.

Let

{ε_{t}}

be a white noise process, i.e.,

E (ε_{t}) = 0, V a r (ε_{t}) = 1, c o v (ε_{i}, ε_{j}) = 0, i \neq j, t, i, j \in Z

. We define a process

{{\tilde{w}}_{t}}

of FRVs as follows,

{\tilde{w}}_{t} (x) = \{\begin{matrix} x - ε_{t} + 1, & ε_{t} - 1 \leq x \leq ε_{t}, \\ - x + ε_{t} + 1, & ε_{t} \leq x \leq ε_{t} + 1 . \end{matrix}

(35)

It is easy to know that

{{\tilde{w}}_{t}}

is a standardized process of FRVs.

In the following, we always assume that the standardized process of FRVs can be used for modeling the error term process of a time series model with fuzzy data.

Definition 4.

([10]) A process of FRVs

{{\tilde{X}}_{t}}

with second order under the metric

δ_{2}

is said to be a fuzzy set-valued p-order autoregressive (briefly, AR(p) with fuzzy data) process if

{{\tilde{X}}_{t}}

is wide-sense stationary and, for any

t \in Z

, it holds that

{\tilde{X}}_{t} = θ_{1} {\tilde{X}}_{t - 1} + θ_{2} {\tilde{X}}_{t - 2} + \dots + θ_{p} {\tilde{X}}_{t - p} + {\tilde{w}}_{t},

(36)

where

θ_{i}

is a real number-valued parameter,

{{\tilde{w}}_{t}}

is a standardized process of FRVs, and p is a natural number.

Definition 5.

([10]) A process of FRVs

{{\tilde{X}}_{t}}

with second order under the metric

δ_{2}

is said to be a fuzzy set-valued

(p, q)

-order autoregressive moving average (briefly, ARMA(

p, q

) with fuzzy data) process if

{{\tilde{X}}_{t}}

is wide-sense stationary and, for any

t \in Z

, it holds that

{\tilde{X}}_{t} = θ_{1} {\tilde{X}}_{t - 1} + θ_{2} {\tilde{X}}_{t - 2} + \dots + θ_{p} {\tilde{X}}_{t - p} + ϕ_{q} {\tilde{w}}_{t - q} + \dots + ϕ_{1} {\tilde{w}}_{t - 1} + {\tilde{w}}_{t},

(37)

where

θ_{i}, ϕ_{i}

are real number-valued parameters,

{{\tilde{w}}_{t}}

is a standardized process of FRVs, and

p, q

are natural numbers.

An ARMA(

p, q

) process

{{\tilde{X}}_{t}}

of FRVs is said to be a causal ARMA(

p, q

) process under the metric

δ_{2}

if it has a wide-sense stationary solution almost everywhere, i.e., there exists a positive (or negative) number series

{b_{j}}

such that

\sum_{j = 0}^{\infty} b_{j} {\tilde{w}}_{t - j}

converges in probability under the metric

δ_{2}

and

{\tilde{X}}_{t} = \sum_{j = 0}^{\infty} b_{j} {\tilde{w}}_{t - j}

, a.e., where

{{\tilde{w}}_{t}}

is a standardized process of FRVs.

Example 3.

Let

{{\tilde{Y}}_{t}}

be a wide-sense stationary process of fuzzy random sets with second order, set

{\tilde{X}}_{t} = {\tilde{Y}}_{t} - E ({\tilde{Y}}_{t})

, then, by (5) of Lemma 1, we have

C o v ({\tilde{X}}_{t + h}, {\tilde{X}}_{t}) = C o v ({\tilde{Y}}_{t + h}, {\tilde{Y}}_{t})

, thus,

{{\tilde{X}}_{t}}

is wide-sense stationary and with fuzzy zero expectations

{\tilde{0}}

, where

\tilde{0}

is independent of t and not unique.

Lemma 3.

([10]) Let

{{\tilde{X}}_{t}} (t \in Z)

be an AR(1) with fuzzy data:

{\tilde{X}}_{t} = θ {\tilde{X}}_{t - 1} + {\tilde{w}}_{t}

, where

{{\tilde{w}}_{t}}

is a standardized process of FRVs. Then,

{{\tilde{X}}_{t}}

possesses a wide-sense stationary solution

{\tilde{X}}_{t} = \sum_{j = 0}^{\infty} θ^{j} {\tilde{w}}_{t - j}

almost everywhere if

0 < θ < 1

, and

\sup_{t} {∥ {\tilde{w}}_{t} ∥}^{2} < \infty

.

For the estimation of an AR(1) with fuzzy data based on sample

{\tilde{x}}_{1}, {\tilde{x}}_{2}, \dots, {\tilde{x}}_{m}

from the process

{{\tilde{X}}_{t}}

of FRVs with second order, we can determine that

(1): If the AR(1) model is causal, then an estimator of the parameter $θ$ can be $\hat{θ} = \frac{\hat{C} (1)}{\hat{C} (0)}$ , where

$\hat{C} (1) : = \frac{1}{m} \sum_{t = 1}^{m - 1} n \int_{0}^{1} \int_{S^{n - 1}} (S_{{({\tilde{x}}_{t + 1})}_{α}} (x) - S_{{\bar{\tilde{x}}}_{α}} (x)) (S_{{({\tilde{x}}_{t})}_{α}} (x) - S_{{\bar{\tilde{x}}}_{α}} (x)) μ (d x) d α,$

(38)

$\hat{C} (0) : = \frac{1}{m} \sum_{t = 1}^{m} n \int_{0}^{1} \int_{S^{n - 1}} {(S_{{({\tilde{x}}_{t})}_{α}} (x) - S_{{\bar{\tilde{x}}}_{α}} (x))}^{2} μ (d x) d α,$

(39)

are the sample-based estimators of the Fréchet covariance $C (1), C (0)$ , respectively, and $\bar{\tilde{x}}$ = $\frac{1}{m} \sum_{t = 1}^{m} {\tilde{x}}_{t}$ .
(2): If the AR(1) with fuzzy data is not causal, then we may employ the least square method proposed by [20] to estimate the parameter $θ$ .

Now, we consider applying the least square estimation method proposed by [20] under the concerned metric

δ_{2}

to estimate an ARMA(1,1) model

{\tilde{X}}_{t} = θ_{1} {\tilde{X}}_{t - 1} + ϕ_{1} {\tilde{w}}_{t - 1} + {\tilde{w}}_{t}

, (

t \in Z

). Assume that we have the observations

{\tilde{x}}_{i},

i = 0, \dots, d,

on the process, and we generate some terms

{\tilde{w}}_{i}, i = 0, \dots, d

of a standardized process, where it is assumed that

{\tilde{x}}_{0} = {\tilde{w}}_{0} = 0

.

The estimation of the model can be carried out by minimizing the function

L (θ_{1}, ϕ_{1}) = \sum_{i = 1}^{d} δ_{2}^{2} ({\tilde{x}}_{i}, θ_{1} {\tilde{x}}_{i - 1} + ϕ_{1} {\tilde{w}}_{i - 1}),

(40)

on the set

A : = {θ_{1}, ϕ_{1} | {\tilde{x}}_{i} -_{h} (θ_{1} {\tilde{x}}_{i - 1} + ϕ_{1} {\tilde{w}}_{i - 1}) exists for i = 1, \dots, d} .

(41)

We obtain the least square estimates of the parameters

θ_{1} > 0, ϕ_{1} > 0

as follows,

\begin{matrix} {\hat{θ}}_{1} & = & \frac{\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i}, {\tilde{x}}_{i - 1} 〉}{\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{x}}_{i - 1} 〉} \\ - & \frac{[\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i}, {\tilde{x}}_{i - 1} 〉 - \sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{x}}_{i - 1} 〉] {[\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉]}^{2}}{(\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{x}}_{i - 1} 〉) [{(\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉)}^{2} - (\sum_{i = 1}^{d} 〈 {\tilde{w}}_{i - 1}, {\tilde{w}}_{i - 1} 〉) (\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{x}}_{i - 1} 〉)]}, \end{matrix}

(42)

\begin{matrix} {\hat{ϕ}}_{1} & = & \frac{(\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i}, {\tilde{x}}_{i - 1} 〉) (\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉) - (\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{x}}_{i - 1} 〉) (\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i}, {\tilde{w}}_{i - 1} 〉)}{{(\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉)}^{2} - (\sum_{i = 1}^{d} 〈 {\tilde{w}}_{i - 1}, {\tilde{w}}_{i - 1} 〉) (\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{x}}_{i - 1} 〉)}, \end{matrix}

(43)

and

{\hat{θ}}_{1} > 0, {\hat{ϕ}}_{1} > 0

, otherwise, the estimators

{\hat{θ}}_{1}, {\hat{ϕ}}_{1}

are not a suitable solution.

If the parameters

θ_{1} < 0, ϕ_{1} < 0

, then their least square estimators can be carried out by replacing

{\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1}

with

- {\tilde{x}}_{i - 1}, - {\tilde{w}}_{i - 1}

, respectively, in the above formula of

{\hat{θ}}_{1}, {\hat{ϕ}}_{1}

.

The asymptotic properties of the least square estimators for ARMA(

1, 1

) with fuzzy data can be given as follows.

Theorem 3.

Let

{{\tilde{X}}_{t}}

be an ARMA(1,1) process with fuzzy data

{\tilde{X}}_{t} = θ_{1} {\tilde{X}}_{t - 1} + ϕ_{1} {\tilde{w}}_{t - 1} + {\tilde{w}}_{t}

,

(θ_{1} > 0, ϕ_{1} > 0)

and the least square estimators

{\hat{θ}}_{1}, {\hat{ϕ}}_{1}

shown in (42),(43) exist on

A : = {θ_{1}, ϕ_{1} | {\tilde{x}}_{i} -_{h} (θ_{1} {\tilde{x}}_{i - 1} + ϕ_{1} {\tilde{w}}_{i - 1}) exists for i = 1, \dots, d}

under the selected distance

δ_{2}

based on a sample

{{\tilde{x}}_{i}, i = 0, \dots, d}

. If

E ({\tilde{w}}_{i}) \approx 0, i = 0, \dots, d

, and

〈 {\tilde{w}}_{i}, {\tilde{w}}_{j} 〉

and

〈 {\tilde{w}}_{s}, {\tilde{w}}_{l} 〉

are uncorrelated for

(i, j) \neq (s, l)

,

i, j, s, l \in {1, 2, \dots, d}

, and

V a r (〈 {\tilde{w}}_{i}, {\tilde{w}}_{j} 〉) = 0, 0 < θ_{1} < 1, ϕ_{1} > 0

, then the least square estimators

{\hat{θ}}_{1}, {\hat{ϕ}}_{1}

are weakly consistent. In a special case of

ϕ_{1} = 1

,

{\hat{θ}}_{1}, {\hat{ϕ}}_{1}

are consistent.

Proof.

From (42), the definition of

〈 \cdot, \cdot 〉

, and the equality

{\tilde{x}}_{t} = θ_{1} {\tilde{x}}_{t - 1} + ϕ_{1} {\tilde{w}}_{t - 1} + {\tilde{w}}_{t}

, we have

\begin{matrix} {\hat{θ}}_{1} - θ_{1} = \\ \frac{(1 - θ_{1}) {(\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉)}^{2} - (\sum_{i = 1}^{d} 〈 {\tilde{w}}_{i - 1}, {\tilde{w}}_{i - 1} 〉) (ϕ_{1} \sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉 + \sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i} 〉)}{{(\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉)}^{2} - (\sum_{i = 1}^{d} 〈 {\tilde{w}}_{i - 1}, {\tilde{w}}_{i - 1} 〉) (\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{x}}_{i - 1} 〉)} . \end{matrix}

(44)

Replacing

{\tilde{x}}_{t - 1}

with

{\tilde{x}}_{t - 1} = θ_{1} {\tilde{x}}_{t - 2} + ϕ_{1} {\tilde{w}}_{t - 2} + {\tilde{w}}_{t - 1}

, then

{\tilde{x}}_{t} = θ_{1}^{2} {\tilde{x}}_{t - 2} + θ_{1} ϕ_{1} {\tilde{w}}_{t - 2} + (θ_{1} + ϕ_{1}) {\tilde{w}}_{t - 1} + {\tilde{w}}_{t}

. Iterating the above equality, it holds that:

\begin{matrix} {\tilde{x}}_{t} = θ_{1}^{t} {\tilde{x}}_{0} + θ_{1}^{t - 1} ϕ_{1} {\tilde{w}}_{0} + (θ_{1}^{t - 1} + θ_{1}^{t - 2} ϕ_{1}) {\tilde{w}}_{1} + (θ_{1}^{t - 2} + θ_{1}^{t - 3} ϕ_{1}) {\tilde{w}}_{2} \\ + (θ_{1}^{t - 3} + θ_{1}^{t - 4} ϕ_{1}) {\tilde{w}}_{3} + \dots + (θ_{1}^{t - k} + θ_{1}^{t - k - 1} ϕ_{1}) {\tilde{w}}_{k} + \dots + (θ_{1} + ϕ_{1}) {\tilde{w}}_{t - 1} + {\tilde{w}}_{t} . \end{matrix}

(45)

By the assumption and Definition 3 and (34), we have

E 〈 {\tilde{w}}_{i}, {\tilde{w}}_{j} 〉 = 〈 E {\tilde{w}}_{i}, E {\tilde{w}}_{j} 〉 = 0

,

(i \neq j)

,

E 〈 {\tilde{w}}_{i}, {\tilde{w}}_{i} 〉 = 1

, and

E 〈 {\tilde{w}}_{i}, {\tilde{x}}_{j} 〉 = \{\begin{matrix} 0, i > j, \\ 1, i = j, \\ θ_{1}^{j - i} + θ_{1}^{j - i - 1} ϕ_{1}, i < j . \end{matrix}

(46)

\begin{matrix} E 〈 {\tilde{x}}_{i}, {\tilde{x}}_{i} 〉 & = 1 + {(θ_{1} + ϕ_{1})}^{2} + {(θ_{1}^{2} + θ_{1} ϕ_{1})}^{2} + {(θ_{1}^{3} + θ_{1}^{2} ϕ_{1})}^{2} + \dots + {(θ_{1}^{i - 1} + θ_{1}^{i - 2} ϕ_{1})}^{2} \\ = \frac{1 - θ_{1}^{2 i} + 2 ϕ_{1} θ_{1} (1 - θ_{1}^{2 (i - 1)}) + ϕ_{1}^{2} (1 - θ_{1}^{2 (i - 1)})}{1 - θ_{1}^{2}} . \end{matrix}

(47)

Also, we have

E (〈 {\tilde{w}}_{i}, {\tilde{w}}_{j} 〉 〈 {\tilde{w}}_{s}, {\tilde{w}}_{l} 〉) = \{\begin{matrix} 1, i = j and s = l, \\ 0, otherwise . \end{matrix}

(48)

E ({〈 {\tilde{w}}_{i}, {\tilde{w}}_{j} 〉}^{2}) = \{\begin{matrix} 1, i = j, \\ 0, i \neq j . \end{matrix}

(49)

Set the numerator and the denominator of (44) as follows,

\begin{matrix} S_{1} : = (1 - θ_{1}) {(\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉)}^{2} - (\sum_{i = 1}^{d} 〈 {\tilde{w}}_{i - 1}, {\tilde{w}}_{i - 1} 〉) (ϕ_{1} \sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉 + \sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i} 〉) . \end{matrix}

(50)

\begin{matrix} D_{1} : = {(\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉)}^{2} - (\sum_{i = 1}^{d} 〈 {\tilde{w}}_{i - 1}, {\tilde{w}}_{i - 1} 〉) (\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{x}}_{i - 1} 〉) . \end{matrix}

(51)

From (45), we have

E ({〈 {\tilde{x}}_{i}, {\tilde{w}}_{i} 〉}^{2}) = E ({〈 {\tilde{w}}_{i}, {\tilde{w}}_{i} 〉}^{2}) = 1

, and

E {(\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉)}^{2} = {(d - 1)}^{2}

,

E (\sum_{i = 1}^{d} 〈 {\tilde{w}}_{i - 1}, {\tilde{w}}_{i - 1} 〉) (ϕ_{1} \sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉 + \sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i} 〉) = ϕ_{1} {(d - 1)}^{2} .

(52)

Thus, we have

E (S_{1}) = (1 - ϕ_{1}) {(d - 1)}^{2} .

It can also be determined that

E (S_{1}^{2})

is bounded, since

E ({〈 {\tilde{x}}_{i}, {\tilde{w}}_{j} 〉}^{2}) \leq E (∥ {\tilde{x}}_{i} ∥^{2} ∥ {\tilde{w}}_{j} ∥^{2})

,

E (〈 {\tilde{x}}_{i}, {\tilde{w}}_{j} 〉 〈 {\tilde{x}}_{s}, {\tilde{w}}_{l} 〉) \leq E (∥ {\tilde{x}}_{i} ∥ ∥ {\tilde{w}}_{j} ∥ ∥ {\tilde{x}}_{s} ∥ ∥ {\tilde{w}}_{l} ∥)

. By the assumption and Chebyshev’s inequality, it holds that

\frac{S_{1}}{{(d - 1)}^{2}} \overset{p}{\to} 1 - ϕ_{1}

. After computation, it can also be determined that

\begin{matrix} E (D_{1}) = & - \frac{d - 1}{{(1 - θ_{1}^{2})}^{2}} [(d - 2) (θ_{1}^{2} - θ_{1}^{4}) - θ_{1}^{4} (1 - θ_{1}^{2 (d - 1)}) \\ + 2 (d - 2) θ_{1} ϕ_{1} (1 - θ_{1}^{2}) - 2 θ_{1}^{3} ϕ_{1} (1 - θ_{1}^{2 (d - 2)}) \\ + (d - 2) ϕ_{1}^{2} (1 - θ_{1}^{2}) - ϕ_{1}^{2} θ_{1}^{2} (1 - θ_{1}^{2 (d - 2)})], \end{matrix}

(53)

and

E (D_{1}^{2})

is bounded, by the assumption and Chebyshev’s inequality, it holds that

\frac{D_{1}}{{(d - 1)}^{2}} \overset{p}{\to} \frac{1}{{(1 - θ_{1}^{2})}^{2}} [θ_{1}^{4} - θ_{1}^{2} + 2 θ_{1} ϕ_{1} (θ_{1}^{2} - 1) + ϕ_{1}^{2} (θ_{1}^{2} - 1)] .

(54)

Thus,

\frac{S_{1}}{D_{1}} = \frac{S_{1} / {(d - 1)}^{2}}{D_{1} / {(d - 1)}^{2}} \overset{p}{\to} \frac{1 - ϕ_{1}}{\frac{1}{{(1 - θ_{1}^{2})}^{2}} [θ_{1}^{4} - θ_{1}^{2} + 2 θ_{1} ϕ_{1} (θ_{1}^{2} - 1) + ϕ_{1}^{2} (θ_{1}^{2} - 1)]},

(55)

i.e.,

\hat{θ_{1}} - θ_{1} \overset{p}{\to} \frac{1 - ϕ_{1}}{\frac{1}{{(1 - θ_{1}^{2})}^{2}} [θ_{1}^{4} - θ_{1}^{2} + 2 θ_{1} ϕ_{1} (θ_{1}^{2} - 1) + ϕ_{1}^{2} (θ_{1}^{2} - 1)]}

.

Obviously,

\hat{θ_{1}} - θ_{1} \overset{p}{\to} 0

when

ϕ_{1} = 1

, i.e.,

{\hat{θ}}_{1}

is consistent.

Set the numerator and the denominator of (43) as follows,

S_{2} : = (\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i}, {\tilde{x}}_{i - 1} 〉) (\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉) - (\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{x}}_{i - 1} 〉) (\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i}, {\tilde{w}}_{i - 1} 〉),

(56)

D_{2} : = {(\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{w}}_{i - 1} 〉)}^{2} - (\sum_{i = 1}^{d} 〈 {\tilde{w}}_{i - 1}, {\tilde{w}}_{i - 1} 〉) (\sum_{i = 1}^{d} 〈 {\tilde{x}}_{i - 1}, {\tilde{x}}_{i - 1} 〉),

(57)

Then, we have

E (D_{2}) = E (D_{1})

, and

E (D_{2}^{2})

is bounded, by the assumption and Chebyshev’s inequality, it holds that

\frac{D_{2}}{{(d - 1)}^{2}} \overset{p}{\to} \frac{1}{{(1 - θ_{1}^{2})}^{2}} [θ_{1}^{4} - θ_{1}^{2} + 2 θ_{1} ϕ_{1} (θ_{1}^{2} - 1) + ϕ_{1}^{2} (θ_{1}^{2} - 1)] .

(58)

Also, we have

E (S_{2}) = {(d - 1)}^{2} (1 - ϕ_{1}) - \frac{ϕ_{1} (d - 1)}{{(1 - θ_{1}^{2})}^{2}} [(d - 2) (θ_{1}^{2} - θ_{1}^{4}) - θ_{1}^{4} (1 - θ_{1}^{2 (d - 1)}) + 2 (d - 2) θ_{1} ϕ_{1} (1 - θ_{1}^{2}) - 2 θ_{1}^{3} ϕ_{1} (1 - θ_{1}^{2 (d - 2)}) + (d - 2) ϕ_{1}^{2} (1 - θ_{1}^{2}) - ϕ_{1}^{2} θ_{1}^{2} (1 - θ_{1}^{2 (d - 2)})]

and

E (S_{2}^{2})

is bounded, by the assumption and Chebyshev’s inequality, it holds that

\frac{S_{2}}{{(d - 1)}^{2}} \overset{p}{\to} 1 - ϕ_{1} + \frac{ϕ_{1}}{{(1 - θ_{1}^{2})}^{2}} [θ_{1}^{4} - θ_{1}^{2} + 2 θ_{1} ϕ_{1} (θ_{1}^{2} - 1) + ϕ_{1}^{2} (θ_{1}^{2} - 1)],

(59)

Thus,

\frac{S_{2}}{D_{2}} = \frac{S_{2} / {(d - 1)}^{2}}{D_{2} / {(d - 1)}^{2}} \overset{p}{\to} \frac{1 - ϕ_{1}}{\frac{1}{{(1 - θ_{1}^{2})}^{2}} [θ_{1}^{4} - θ_{1}^{2} + 2 θ_{1} ϕ_{1} (θ_{1}^{2} - 1) + ϕ_{1}^{2} (θ_{1}^{2} - 1)]} + ϕ_{1},

(60)

i.e.,

\hat{ϕ_{1}} - ϕ_{1} \overset{p}{\to} \frac{1 - ϕ_{1}}{\frac{1}{{(1 - θ_{1}^{2})}^{2}} [θ_{1}^{4} - θ_{1}^{2} + 2 θ_{1} ϕ_{1} (θ_{1}^{2} - 1) + ϕ_{1}^{2} (θ_{1}^{2} - 1)]}

. ☐

Remark 7.

(1) The proposed AR, ARMA model for the processes of FRVs is an extension of the autoregressive sequence model proposed by Feng et al. [26].

(2) In the proposed models, the so-called standardized process of FRVs plays an important role, as the causality of the AR(p) and ARMA(

p, q

) with fuzzy data are defined, and we only present an example of the standardized process in the one-dimensional case. This standardized process is a special error term process only.

(3) In the general case, without the restriction of the second order for the FRVs, the processes of the FRVs may not be posed for the standardized processes, and, at most, we may set an AR(p) with fuzzy data as

{\tilde{X}}_{t} = θ_{1} {\tilde{X}}_{t - 1} + θ_{2} {\tilde{X}}_{t - 2} + \dots + θ_{p} {\tilde{X}}_{t - p} + {\tilde{B}}_{t},

(61)

where

{{\tilde{B}}_{t}}

is only an unexplained remainder process of the plus operation among the successive

p + 1

elements in process

{{\tilde{X}}_{t}}

, and it may be no longer standardized,

t \in Z

. This general case is a hard open problem.

(4) The considered metric

δ_{2}

can also be extended to a general metric, like ρ, given in the literature [9,25].

4. An Empirical Analysis of the ARMA( $p, q$ ) Models with Fuzzy Data

In this section, we consider an empirical analysis for the proposed ARMA model with fuzzy data so as to demonstrate the goodness of the model. To this end, we use the following procedure: Step (1) investigate and collect the data from a practical time series related to the concerned problem; Step (2) generate the perception mixed fuzzy data based on the real data; Step (3) select and estimate the model based on the obtained fuzzy data; Step (4) give the results of prediction using the estimated model; Step (5) compare the model with other available models.

It is well known that the financial market is a complex, non-stationary, noisy, chaotic, and dynamic system. The main reason is the fact that a huge amount of information is reflected in the financial market. The main factors include the economic condition, political situation, traders’ expectations and emotions, catastrophes, and other unexpected events. Stock market data have to be considered in the framework of uncertainties. Therefore, predictions of stock market prices and their directions with high accuracy are quite difficult.

We consider the problem of predicting the trends of monthly HSI by means of the ARMA models for linguistic data, and here the linguistic data are the perception mixed HSI data.

Step 1

Consider the observations in three time series of close value, low value, and high value of the monthly HSI in the time period from January 2009 to December 2013, as shown in Figure 1, where, for simplicity, the employed data are the original data divided by 1000. Generally speaking, the observations can be simply expressed as a finite number series. For instance, since there are a total of 60 months in the time period from January 2009 to December 2013, we may assume that the three finite series

{m_{i}}, {a_{i}}, {b_{i}}, i = 1, \dots, 60

denote the observations in the three time series for close value, low value, and high value in the time period from January 2009 to December 2013, respectively; here, i is a serial number.

Step 2

Note that each monthly data implies very complex information about the random variation of the market, the psychological responses, and judgment-based behaviors of the market participators in one month-long period. In order to gain more informative predictions of the HSI trends, it is suggested to use the three data—the close value, low value, and high value—simultaneously in an appropriate way, in which the evaluator’s perception ought to be mixed, and the perception has to be vague, since the background information hidden behind the three data is so complicated that there is no way to make the perception clear. Though some predictions can be made through the ordinary time series models using a single close value or average value during the time period, the predicted judgment could be much more biased, as the data used here lack completeness of information. Therefore, we view the three values (close value, low value, high value) of each monthly data integrally as linguistic data, i.e., perception mixed financial data, and model it with a simple triangular (or symmetric ) fuzzy number (

L R

-fuzzy number [9]) defined on the interval [low value, high value] of the fluctuation. As mentioned above, by

m_{i}, a_{i}, b_{i}

we denote the close value, low value, and high value of the ith observation of the monthly HSI, respectively, and, according to the expression of an

L R

-fuzzy number [9], the three data form a simple

L R

-fuzzy number

{(m_{i}, m_{i} - a_{i}, b_{i} - m_{i})}_{L R},

where

m_{i}, m_{i} - a_{i}, b_{i} - m_{i}

denote the core, the left spread, and the right spread of the

L R

fuzzy number

{(m_{i}, m_{i} - a_{i}, b_{i} - m_{i})}_{L R}

, respectively, (

i = 1, \dots, 60

), and

L, R

denote the shape functions of the

L R

-fuzzy number. For simplicity, the shape functions are often taken as

L (x) = R (x) = \max {0, 1 - x}

. According to this procedure, the linguistic monthly data of HSI from January 2009 to December 2013 can be determined, and they are shown in Table 1. (Note that the serial numbers

i = 61, 62, \dots

represent Jan. 2014, Feb. 2014, ⋯, respectively.)

Step 3

For

L R

-fuzzy data

\tilde{u} = {(m, l, r)}_{L R}

, whose

α

-cut is

{\tilde{u}}_{α} = [m - l L^{(- 1)} (α), m + r R^{(- 1)} (α)], α \in [0, 1]

, where

L^{(- 1)} (α) = R^{(- 1)} (α) = 1 - α

for the above

L (x), R (x)

, we have the support function of

{\tilde{u}}_{α}

as

S_{{\tilde{u}}_{α}} (x) = \{\begin{matrix} m + (1 - α) r, x = 1, \\ m - (1 - α) l, x = - 1 . \end{matrix}

(62)

and the sample-based Fréchet covariance for linguistic monthly HSI in Table 1 can be computed using

\begin{matrix} C o v ({\tilde{u}}_{j + h}, {\tilde{u}}_{j}) & = \frac{1}{60} \sum_{j = 1}^{60 - h} [\int_{0}^{1} (S_{{({\tilde{u}}_{j + h})}_{α}} (1) - S_{{\bar{\tilde{u}}}_{α}} (1)) (S_{{({\tilde{u}}_{j})}_{α}} (1) - S_{{\bar{\tilde{u}}}_{α}} (1)) d α \\ + \int_{0}^{1} (S_{{({\tilde{u}}_{j + h})}_{α}} (- 1) - S_{{\bar{\tilde{u}}}_{α}} (- 1)) (S_{{({\tilde{u}}_{j})}_{α}} (- 1) - S_{{\bar{\tilde{u}}}_{α}} (- 1)) d α] \\ = \frac{1}{60} \sum_{j = 1}^{60 - h} [\int_{0}^{1} (m_{j + h} + (1 - α) r_{j + h} - \bar{m} - (1 - α) \bar{r}) (m_{j} + (1 - α) r_{j} - \bar{m} \\ - (1 - α) \bar{r}) d α + \int_{0}^{1} ((1 - α) l_{j + h} - m_{j + h} - (1 - α) \bar{l} + \bar{m}) \\ ((1 - α) l_{j} - m_{j} - (1 - α) \bar{l} + \bar{m}) d α] . \end{matrix}

(63)

The wide-sense stationarity of the considered linguistic monthly HSI time series may be obtained approximately from the stationarity of both series

\{\int_{0}^{1} S_{{({\tilde{u}}_{j})}_{α}} (1) d α\} = \{m_{j} + \frac{r_{j}}{2}\} and \{\int_{0}^{1} S_{{({\tilde{u}}_{j})}_{α}} (- 1) d α\} = \{\frac{l_{j}}{2} - m_{j}\}, j = 1, \dots, 60 .

(64)

The magnitude of the sample autocorrelation functions of the latter two series decay geometrically to zero, and the sample partial autocorrelation functions are negligible for lags greater than 1. Thus, we may fit an ARMA(1,1) with fuzzy data for the linguistic monthly HSI time series, because usually an ARMA is better than an AR, though the AR(1) with fuzzy data can also be employed here [11]. For estimating the model, according to Definition 3, a standardized process of FRVs

{{\tilde{w}}_{t}}

is generated, as shown in Table 2, based on a generated white noise process

{ε_{t}}

.

For the estimation of the parameters, here we assume that this standardized process

{{\tilde{w}}_{t}}

basically satisfies the condition of Theorem 3. Applying Equations (42) and (43) of the least square estimators for the ARMA(1,1) model with fuzzy data in Section 3 to the data from Table 1 and Table 2 (the case of

d = 60, n = 1

), we obtain the estimated ARMA(1,1) of the concerned linguistic monthly HSI with Matlab as

{\tilde{X}}_{i} = 0.992 {\tilde{X}}_{i - 1} + 0.104 {\tilde{w}}_{i - 1} + {\tilde{w}}_{i} .

(65)

Step 4

For the simplicity of computation and comparison, we only consider the prediction for the former 10 months in 2014. A predicted linguistic monthly HSI for the 10 months from January 2014 to October 2014 (the serial numbers

i = 61, 62, 63, 64, 65, 66, 67, 68, 69, 70 .

) are obtained using the prediction formula

{\hat{\tilde{X}}}_{i} = 0.992 {\tilde{X}}_{i - 1} + 0.104 {\tilde{w}}_{i - 1}

; both the real linguistic monthly HSI and the obtained predicted linguistic monthly HSI for the 10 months are shown in Table 3.

Table 3, in fact, also gives a direct comparison between the real and the predicted linguistic monthly HSI. The comparison indicates that the obtained forecasting model is quite reasonable in capturing the complex uncertain and imprecise information, since the linguistic forecasted data provide more information than the crisp data, so the decision makers could consider the best and worst possible situations. On the other hand, the accuracy of forecasting using this model could be improved by adjusting the terms of the standardized process.

Step 5

Note that the predicted linguistic monthly HSI in Table 3, in fact, gives the predictions of the close value series, low value series, and high value series of the monthly HSI simultaneously. Thus, the comparisons of the real close values with the predicted close values, the real low values with the predicted low values, and the real high values with the predicted high values can be done. For instance, the comparison of the close values shown in Table 4 indicates that the predictions for values numbered 62, 65, 66, 67, 68, 70 in the list are with absolute errors less than 0.632, relative errors less than 2.74%, and the predictions for the remainder values have absolute errors within the interval

(0.632, 1.239)

, and relative errors within the interval

(2.74 %, 5.62 %)

. Similarly, the comparisons regarding the low values and high values, respectively, of the monthly HSI can also be carried out.

Remark 8.

The study of the fuzzy set-valued time series modeling is just in its infancy. There are only two estimated fuzzy set-valued models like AR(1) and ARMA(1,1) [10,11] that can be considered for model comparison under special conditions. However, it is obvious here that the fuzzy set-valued ARMA(1,1) model is better than the fuzzy set-valued AR(1) model for the forecast of the linguistic monthly HSI data. On the other hand, it may not be appropriate to compare the fuzzy set-valued time series models with the classical time series models straightforwardly, since the types of data treated by the two kinds of time series are different.

In a special case, we may compare the predicted close values obtained by the proposed fuzzy set-valued ARMA(1,1) model above with the predicted close values obtained by the ordinary AR(1) or AR(2) or AR(3) or AR(1,1) models through a comparison of their prediction absolute errors and relative errors (note that using time series technology, it can be verified that the ordinary AR(1) or AR(2) or AR(3) or AR(1,1) models can be appropriately applied for the prediction of the concerned time series of close values). The comparison results of AR(1) or AR(2) or AR(3) or AR(1,1) with the real close values are shown in Table 5, Table 6, Table 7 and Table 8, respectively. Finally, a comparison result of the prediction errors from the fuzzy ARMA(1,1) with the prediction errors from AR(1), AR(2), AR(3), and AR(1,1) for the case of close values of the monthly HSI are shown in Table 9, which indicates that, on average, the prediction accuracy of our proposed model is better than that of the other four ordinary time series models, since the average absolute error 0.691 of fuzzy ARMA(1,1) is less than the average absolute errors 1.182, 1.194, 1.487, 1.191 of AR(1), AR(2), AR(3), ARMA(1,1), respectively. Further, the average relative error 3.03% of fuzzy ARMA(1,1) is less than the average relative errors 5.03%, 5.08%, 6.29%, 5.07% of AR(1), AR(2), AR(3), ARMA(1,1), respectively. Also, the error data shown in Table 9 indicate that for the months numbered 61,63,66,67,68,70, both the absolute errors and the relative errors of fuzzy ARMA(1,1) are less than those of AR(1), AR(2), AR(3), ARMA(1,1), thus, the prediction accuracy of our proposed model is better than that of the other four ordinary time series models. For the month numbered 62, the absolute errors and the relative errors of fuzzy ARMA(1,1) are slightly larger than those of AR(1), AR(2), AR(3), ARMA(1,1), but the differences for the absolute errors and relative errors are not more than 0.086 and 0.34%, respectively. Thus, the prediction accuracy of our proposed model is almost the same as that of the other four ordinary time series models. For the month numbered 64, the absolute errors and the relative errors of fuzzy ARMA(1,1) are larger than those of AR(1), AR(2), AR(3), ARMA(1,1), but the differences for the absolute errors and relative errors are not more than 0.454 and 2.059%, respectively, thus, the prediction accuracy of our proposed model is not better than that of the other four ordinary time series models. For the month numbered 65,the absolute errors and the relative errors of fuzzy ARMA(1,1) are slightly larger than those of AR(1), AR(2), ARMA(1,1); the differences for the absolute errors and relative errors are not more than 0.048 and 0.207%, respectively, but the absolute errors and the relative errors of fuzzy ARMA(1,1) are less than those of AR(3), thus, the prediction accuracy of our proposed model is not better than that of the other three ordinary time series models AR(1), AR(2), ARMA(1,1), but it is better than that of AR(3). For the month numbered 69, the absolute errors and the relative errors of fuzzy ARMA(1,1) are slightly larger than those of AR(1), AR(2), ARMA(1,1); the differences for the absolute errors and relative errors are not more than 0.547 and 2.39%, respectively, but the absolute errors and the relative errors of fuzzy ARMA(1,1) are less than those of AR(3), thus, the prediction accuracy of our proposed model is not better than that of the other three ordinary time series models AR(1), AR(2), ARMA(1,1), but it is better than that of AR(3).

Similarly, the same comparison can be done for the high values and low values of the monthly HSI.

5. Conclusions

The ARMA models are important in many fields and applications, although they are most widely applied in time series analysis. In this big-data era, various complex data, such as interval-valued data, linguistic data, etc., have arisen. Theoretically, it is meaningful and valuable to extend the statistical regression models and time series models to such complex data, and such research has recently received much attention. In this paper, we extended the ARMA model to the case of linguistic data that can be modeled by some symmetric fuzzy sets. We firstly determined that the estimators from the least square estimation of the ARMA(1,1) model under some

L_{2}

distance between two sets are weakly consistent. To verify the effectiveness of the proposed linguistic-valued ARMA models, we applied them to forecast the linguistic monthly Hang Seng Index (HSI) with an empirical analysis, and detailed comparisons of the models with other classical AR(1), AR(2), AR(3) models, as well as the ARMA(1,1) model, are given. Furthermore, we present theoretical proofs for some conclusions on the convergence properties of the sequence of the FRVs mentioned in this paper [10].

It should be pointed out that the semi-linear structure of the space of all fuzzy data make us consider all the parameters to be positive or negative, and the estimation of parameters for a high-order (the order is larger than 3) AR and ARMA models with fuzzy data becomes much more complicated. The theory of time series with FRVs (fuzzy set-valued data) needs to be further studied. In relation to the present paper, we expect to further investigate several problems: (1) The asymptotic properties of the least square estimators for the general model ARMA(

p, q

); (2) Improving the accuracy level of the forecasting using the fuzzy set-valued ARMA(1,1), AR(1) models.

Author Contributions

D.W. wrote and revised the paper. L.Z. handled the data, figure and computation with software. All authors read and approved the final version of the manuscript.

Funding

This research was funded by (The National Natural Science Foundation of China) grant number (11271096).

Acknowledgments

This research is supported by the NNSF of China under grant number No. 11271096. The author very appreciate the financial aids. The authors also thank the editors and anonymous referees who commented on this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

Brockwell, P.J.; Davis, R.A. Time Series: Theory and Methods, 2nd ed.; Springer-Verlag: New York, NY, USA, 1991. [Google Scholar]
Metghalchi, M.; Chang, Y.H.; Du, J. Technical trading rules for NASDAQ composite intex. Int. Res. J. Finance Econ. 2011, 73, 109–121. [Google Scholar]
Li, Q.; Liang, S.Y. Intelligent Prognostics of Degradation Trajectories for Rotating Machinery Based on Asymmetric Penalty Sparse Decomposition Model. Symmetry 2018, 10, 214. [Google Scholar] [CrossRef]
Li, S.; Guan, L. Fuzzy set-valued Gaussian processes and Brownian motions. Inf. Sci. 2007, 177, 3251–3259. [Google Scholar] [CrossRef]
Bongiorno, E.G. A note on fuzzy set-valued Brownian motion. Stat. Prob. Lett. 2012, 82, 827–832. [Google Scholar] [CrossRef]
Wang, X.; Zhang, Z.; Li, S. Set-valued and interval-valued stationary time series. J. Multivar. Anal. 2016, 145, 208–223. [Google Scholar] [CrossRef]
Wang, D. An autoregressive model with fuzzy random variables. In Soft Methods for Handling Variability and Imprecision, Advances in Soft Computing 48; Dubois, D., Ed.; Springer-Verlag: Berlin, Germany, 2008; pp. 401–448. [Google Scholar]
Puri, M.; Ralescu, D. Fuzzy Random Variables. J. Math. Anal. Appl. 1986, 114, 409–422. [Google Scholar] [CrossRef]
Näther, W. On random fuzzy variables of second order and their application to linear statistical inference with fuzzy data. Metrika 2000, 51, 201–221. [Google Scholar] [CrossRef]
Wang, D. A note on autoregressive models with fuzzy random variables. J. Stat. Theor. Prac. 2018, 12, 356–369. [Google Scholar] [CrossRef]
Jung, H.Y.; Lee, W.J.; Yoon, J.H. A unified approach to asymptotic behaviors for the autoregressive model with fuzzy data. Inf. Sci. 2014, 257, 127–137. [Google Scholar] [CrossRef]
Song, Q.; Chissom, B.S. Fuzzy time series and its models. Fuzzy Sets Syst. 1993, 54, 269–277. [Google Scholar] [CrossRef]
Song, Q.; Chissom, B.S. Forecasting enrollments with fuzzy time series—Part I. Fuzzy Sets Syst. 1993, 54, 1–9. [Google Scholar] [CrossRef]
Guan, S.; Zhao, A. A Two-Factor Autoregressive Moving Average Model Based on Fuzzy Fluctuation Logical Relationships. Symmetry 2017, 9, 207. [Google Scholar] [CrossRef]
Chen, S.M. Forecasting enrollments based on fuzzy time series. Fuzzy Sets Syst. 1996, 81, 311–319. [Google Scholar] [CrossRef]
Li, S.-T.; Kuo, S.-C.; Cheng, Y.-C.; Chen, C.-C. Deterministic vector long-term forecasting for fuzzy time series. Fuzzy Sets Syst. 2010, 161, 1852–1870. [Google Scholar] [CrossRef]
Lee, L.W.; Wang, L.H.; Chen, S.M.; Leu, Y.H. Handling forecasting problem based on two-factors high-order fuzzy time series. IEEE Trans. Fuzzy Syst. 2006, 14, 468–477. [Google Scholar] [CrossRef]
Diamond, P.; Kloeden, P. Metric Spaces of Fuzzy Sets; World Scientific: London, UK, 1994. [Google Scholar]
Zadeh, L.A. The concept of a linguistic variable and its application to approximate reasoning-I. Inf. Sci. 1975, 8, 199–249. [Google Scholar] [CrossRef]
Wang, D.; Shi, M. Estimation of a simple multivariate linear model for fuzzy random sets. In Strengthening Links Between Data Analysis and Soft Computing, Advances in Intelligent Systems and Computing 315; Grzegorzewski, P., Ed.; Springer: New York, NY, USA, 2015; pp. 201–208. [Google Scholar]
Wünsche, A.; Näther, W. Least-square fuzzy regression with fuzzy random variables. Fuzzy Sets Syst. 2002, 130, 43–50. [Google Scholar] [CrossRef]
Stefanini, L. A generalization of Hukuhara difference and division for interval and fuzzy arithmetic. Fuzzy Sets Syst. 2010, 161, 1564–1584. [Google Scholar] [CrossRef]
Krätschmer, V. Probability theory in fuzzy sample space. Metrika 2004, 60, 67–189. [Google Scholar] [CrossRef]
Kruse, R.; Meyer, K. Statistics with Vague Data; Springer Science & Business Media: New York, NY, USA, 1987. [Google Scholar]
Wang, D.; Yasuda, M. Some asymptotic properties of point estimation with n-dimensional fuzzy data. Statistics 2004, 38, 167–181. [Google Scholar] [CrossRef]
Feng, Y.H.; Hu, L.J.; Shu, H.S. The variance and covariance of fuzzy random variables and their applications. Fuzzy Sets Syst. 2001, 120, 487–497. [Google Scholar] [CrossRef]
Puri, M.D.; Ralescu, D. The concept of normality of fuzzy random variables. Ann. Proba. 1985, 13, 1373–1379. [Google Scholar] [CrossRef]

Figure 1. The curves of the close value, low value, and high value for monthly Hang Seng Index (HSI). (https://www.hsi.com.hk/eng).

Table 1. Linguistic monthly HSI from January 2009 to December 2013 (

(m, l, r) : = {(m, l, r)}_{L R}

).

Table 1. Linguistic monthly HSI from January 2009 to December 2013 (

(m, l, r) : = {(m, l, r)}_{L R}

).

Year	Month	Data	Year	Month	Data
2009	1	(13.278, 0.839, 2.485)	2012	1	(20.39, 2.07, 0.2)
	2	(12.811, 0.177, 1.165)		2	(21.68, 1.411, 0.08)
	3	(13.576, 2.23, 0.681)		3	(20.555, 0.181, 1.086)
	4	(15.52, 2.188, 0.457)		4	(21.094, 1.059, 0.011)
	5	(18.171, 2.316, 0.056)		5	(18.629, 0.251, 2.756)
	6	(18.378, 1.002, 0.784)		6	(19.441, 1.385, 0.138)
	7	(20.573, 3.387, 0.139)		7	(19.796, 1.086, 0.073)
	8	(19.724, 0.132, 1.473)		8	(19.482, 0.032, 0.818)
	9	(20.956, 1.529, 0.975)		9	(20.84, 1.764, 0.055)
	10	(21.752, 1.447, 0.868)		10	(21.641, 0.874, 0.206)
	11	(21.821, 0.819, 1.278)		11	(22.03, 0.932, 0.119)
	12	(21.872, 0.939, 0.722)		12	(22.657, 0.969, 0.061)
2010	1	(20.121, 0.205, 2.551)	2013	1	(23.729, 0.869, 0.187)
	2	(20.268, 1.185, 0.172)		2	(23.02, 0.575, 0.924)
	3	(21.239, 0.664, 0.212)		3	(22.299, 0.323, 0.963)
	4	(21.108, 0.345, 1.281)		4	(22.737, 1.314, 0.125)
	5	(19.765, 0.974, 1.247)		5	(22.392, 0.102, 1.12)
	6	(20.128, 0.917, 0.829)		6	(20.803, 1.377, 1.761)
	7	(21.029, 1.251, 0.09)		7	(21.883, 1.764, 0.187)
	8	(20.536. 0.164, 1.27)		8	(21.731, 0.266, 0.964)
	9	(22.358, 1.828, 0.081)		9	(22.859, 0.911, 0.695)
	10	(23.096, 0.592, 0.77)		10	(23.206, 0.566, 0.328)
	11	(23.007, 0.224, 1.981)		11	(23.881, 1.418, 0.133)
	12	(23.035, 0.653, 0.577)		12	(23.306, 0.593, 0.805)
2011	1	(23.447, 0.39, 0.987)
	2	(23.338, 0.892, 0.644)
	3	(23.527, 1.404, 0.407)
	4	(23.72, 0.252, 0.748)
	5	(23.684, 1.165, 0.24)
	6	(22.398, 0.89, 1.308)
	7	(22.44, 0.829, 0.395)
	8	(20.534, 1.666, 2.274)
	9	(17.592, 0.593, 3.382)
	10	(19.864, 3.694, 0.409)
	11	(17.989, 0.376, 2.184)
	12	(18.434, 0.613, 0.6)

Table 2. The former 60 elements of a standardized process of fuzzy random variables (FRVs).

i	${\tilde{w}}_{i}$	i	${\tilde{w}}_{i}$	i	${\tilde{w}}_{i}$
1	(1.80482, 0.01, 0.01)	21	(1.30572, 0.001, 0.0001)	41	(−1.3595, 0.00008, 0.0001)
2	(−0.07992, 0.007, 0.008)	22	(1.42513, 0.0003, 0.0002)	42	(−2.33134, 0.001, 0.00012)
3	(0.39658, 0.01, 0.002)	23	(−0.4158, 0.0002, 0.0001)	43	(−0.40969, 0.00012, 0.0006)
4	(−1.08332, 0.0015, 0.001)	24	(1.61438, 0.0003, 0.001)	44	(0.6542, 0.0003, 0.0001)
5	(2.23829, 0.01, 0.001)	25	(−1.05773, 0.001, 0.00002)	45	(0.39926, 0.00003, 0.00001)
6	(−0.62423, 0.001, 0.001)	26	(−0.94833, 0.0001, 0.001)	46	(−0.46931, 0.00002, 0.0006 )
7	(0.51366, 0.002, 0.001)	27	(0.95365, 0.0003, 0.001)	47	(0.86633, 0.0003, 0.00001)
8	(−0.08661, 0.0002, 0.0013)	28	(0.39198, 0.0002, 0.0001)	48	(−0.92372, 0.0002, 0.00008)
9	(−0.59418, 0.0002, 0.001)	29	(−0.07614, 0.00102, 0.0001)	49	(1.27746, 0.0001, 0.00002)
10	(0.03189, 0.002, 0.0012)	30	(1.22056, 0.0017, 0.00018)	50	(−1.4526, 0.0001, 0.001)
11	(−0.7378, 0.00021, 0.0013)	31	(−0.63084, 0.00016,0.00018)	51	(0.34892, 0.0002, 0.0001)
12	(−0.25014, 0.01, 0.0003)	32	(−0.63576, 0.001, 0.0001)	52	(−0.05535, 0.00012, 0.0001)
13	(0.685, 0.0013, 0.00011)	33	(−0.34, 0.001, 0.00008)	53	(−1.228, 0.0008, 0.0001)
14	(−0.80416, 0.0013, 0.0003)	34	(0.07628, 0.0001, 0.0002)	54	(0.14502, 0.0001, 0.00006)
15	(−0.74428, 0.0011, 0.0003)	35	(0.95536, 0.000016, 0.00011)	55	(−0.8395, 0.0001, 0.00032)
16	(−0.7955, 0.0002, 0.0001)	36	(−1.2167, 0.0001, 0.00011)	56	(−0.09626, 0.00009, 0.0006)
17	(0.34071, 0.001, 0.0001)	37	(1.18449, 0.0006, 0.0003)	57	(−0.85758, 0.0001, 0.00002)
18	(−0.30051, 0.001, 0.00017)	38	(−0.34369, 0.0002, 0.0003)	58	(0.76497, 0.00002, 0.001)
19	(−1.34985, 0.00031, 0.0005)	39	(1.09024, 0.0001, 0.00006)	59	(0.04501, 0.000016, 0.00001)
20	(0.4327, 0.0001, 0.0002)	40	(−0.13531, 0.0002, 0.0001)	60	(1.92838, 0.00008, 0.0002)

Table 3. The real linguistic monthly HSI and the predicted linguistic monthly HSI.

i	Real Linguistic Monthly HSI	Predicted Linguistic Monthly HSI
61	(22.035, 0.289, 1.434)	(23.274, 0.381, 0.236)
62	(22.836, 1.639, 0.15)	(23.662, 0.313, 0.368)
63	(22.151, 1.014, 0.688)	(23.182, 0.121, 0.502)
64	(22.133, 0.037, 1.091)	(23.311, 0.248, 0.308)
65	(23.081, 1.401, 0.128)	(23.402, 0.313, 0.470)
66	(23.19, 0.388, 0.207)	(23.431, 0.402, 0.487)
67	(24.756, 1.63, 0.156)	(24.217, 0.418, 0.501)
68	(24.742, 0.552, 0.492)	(24.406, 0.419, 0.537)
69	(22.932, 0.077, 2.43)	(24.100, 0.428, 0.558)
70	(23.998, 1.433, 0.048)	(23.579, 0.432, 0.563)

Table 4. A comparison of the predicted close values obtained by the fuzzy set-valued autoregressive moving average (ARMA)(1,1) with the real close values in the monthly HSI.

i	Real Close Values	Predicted Close Values	Absolute Error	Relative Error
61	22.035	23.274	1.239	5.62%
62	22.836	23.462	0.626	2.74%
63	22.151	23.182	1.031	4.654%
64	22.133	23.210	1.077	4.869%
65	23.081	23.302	0.221	0.957%
66	23.190	23.413	0.223	0.95%
67	24.756	24.217	0.539	2.17%
68	24.742	24.406	0.336	1.356%
69	22.932	24.100	1.168	5.09%
70	23.998	23.579	0.419	1.746%

Table 5. A comparison of the predicted close values obtained by the classical AR(1) with the real close values in the monthly HSI.

i	Real Close Values	Predicted Close Values	Absolute Error	Relative Error
61	22.035	23.663	1.628	7.38%
62	22.836	23.458	0.622	2.723%
63	22.151	23.264	1.113	5.02%
64	22.133	23.081	0.948	4.283%
65	23.081	22.908	0.173	0.75%
66	23.19	22.746	0.444	1.91%
67	24.756	22.592	2.164	8.74%
68	24.742	22.448	2.294	9.27%
69	22.932	22.311	0.621	2.7%
70	23.998	22.183	1.815	7.56%

Table 6. A comparison of the predicted close values obtained by the classical AR(2) with the real close value in the monthly HSI.

i	Real Close Values	Predicted Close Values	Absolute Error	Relative Error
61	22.035	23.658	1.623	7.36%
62	22.836	23.447	0.611	2.67%
63	22.151	23.247	1.096	4.94%
64	22.133	23.06	0.927	4.19%
65	23.081	22.883	0.198	0.86%
66	23.19	22.717	0.473	2.04%
67	24.756	22.561	2.195	8.87%
68	24.742	22.414	2.328	9.41%
69	22.932	22.275	0.657	2.86%
70	23.998	22.145	1.835	7.65%

Table 7. A comparison of the predicted close values obtained by the classical AR(3) with the real close value in the monthly HSI.

i	Real Close Values	Predicted Close Values	Absolute Error	Relative Error
61	22.035	23.688	1.653	7.5%
62	22.836	23.382	0.546	2.4%
63	22.151	23.065	0.914	4.1%
64	22.133	22.756	0.623	2.81%
65	23.081	22.473	0.608	2.63%
66	23.19	22.219	0.971	4.19%
67	24.756	21.993	2.763	11.16%
68	24.742	21.794	2.948	11.91%
69	22.932	21.62	1.312	5.72%
70	23.998	21.467	2.531	10.5%

Table 8. A comparison of the predicted close values obtained by the classical ARMA(1,1) with the real close value in the monthly HSI.

i	Real Close Values	Predicted Close Values	Absolute Error	Relative Error
61	22.035	23.66	1.625	7.37%
62	22.836	23.45	0.614	2.69%
63	22.151	23.252	1.101	4.97%
64	22.133	23.066	0.923	4.21%
65	23.081	22.89	0.191	0.83%
66	23.19	22.725	0.465	2%
67	24.756	22.57	2.186	8.83%
68	24.742	22.423	2.319	9.37%
69	22.932	22.286	0.646	2.82%
70	23.998	22.156	1.842	7.68%

Table 9. A comparison of the proposed fuzzy set valued ARMA(1,1) with the classical AR(1), AR(2), AR(3), and ARMA(1,1) in the prediction errors for the close value of the monthly HSI.

i	fuzzy ARMA(1,1)	AR(1)	AR(2)	AR(3)	ARMA(1,1)
i	abs.err., rel.err.	abs.err., rel.err.	abs.err., rel.err.	abs.err., rel.err.	abs.err., rel.err.
61	1.239, 5.62%	1.628, 7.38%	1.623, 7.36%	1.653, 7.5%	1.625, 7.37%
62	0.626, 2.74%	0.622, 2.723%	0.611, 2.67%	0.546, 2.4%	0.614, 2.69%
63	1.031, 4.654%	1.113, 5.02%	1.096, 4.94%	0.914, 4.1%	1.101, 4.97%
64	1.077, 4.869%	0.948, 4.283%	0.927, 4.19%	0.623, 2.81%	0.923, 4.21%
65	0.221, 0.957%	0.173, 0.75%	0.198, 0.86%	0.608, 2.63%	0.191, 0.83%
66	0.223, 0.95%	0.444, 1.91%	0.473, 2.04%	0.971, 4.19%	0.465, 2%
67	0.539, 2.17%	2.164, 8.74%	2.195, 8.87%	2.763, 11.16%	2.186, 8.83%
68	0.336, 1.356%	2.294, 9.27%	2.328, 9.41%	2.948, 11.91%	2.319, 9.37%
69	1.168, 5.09%	0.621, 2.7%	0.657, 2.86%	1.312, 5.72%	0.646, 2.82%
70	0.449, 1.746%	1.815, 7.56%	1.835, 7.65%	2.531, 10.5%	1.842, 7.68%
ave.	0.691, 3.02%	1.182, 5.033%	1.194, 5.085%	1.487, 6.292%	1.191, 5.076%

(abs. = absolute, rel. = relative, err. = error, ave. = average).

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, D.; Zhang, L. A Fuzzy Set-Valued Autoregressive Moving Average Model and Its Applications. Symmetry 2018, 10, 324. https://doi.org/10.3390/sym10080324

AMA Style

Wang D, Zhang L. A Fuzzy Set-Valued Autoregressive Moving Average Model and Its Applications. Symmetry. 2018; 10(8):324. https://doi.org/10.3390/sym10080324

Chicago/Turabian Style

Wang, Dabuxilatu, and Liang Zhang. 2018. "A Fuzzy Set-Valued Autoregressive Moving Average Model and Its Applications" Symmetry 10, no. 8: 324. https://doi.org/10.3390/sym10080324

APA Style

Wang, D., & Zhang, L. (2018). A Fuzzy Set-Valued Autoregressive Moving Average Model and Its Applications. Symmetry, 10(8), 324. https://doi.org/10.3390/sym10080324

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Fuzzy Set-Valued Autoregressive Moving Average Model and Its Applications

Abstract

1. Introduction

2. Preliminaries

2.1. Fuzzy Set on $R^{n}$

2.2. Fuzzy Random Variables (FRVs)

3. A Fuzzy Set Valued ARMA Model Based on a Standardized Process

4. An Empirical Analysis of the ARMA( $p, q$ ) Models with Fuzzy Data

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

A Fuzzy Set-Valued Autoregressive Moving Average Model and Its Applications

Abstract

1. Introduction

2. Preliminaries

2.1. Fuzzy Set on R n

2.2. Fuzzy Random Variables (FRVs)

3. A Fuzzy Set Valued ARMA Model Based on a Standardized Process

4. An Empirical Analysis of the ARMA( p , q ) Models with Fuzzy Data

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2.1. Fuzzy Set on $R^{n}$

4. An Empirical Analysis of the ARMA( $p, q$ ) Models with Fuzzy Data