Next Article in Journal
On Design of IIR Cascaded-Resonator-Based Complex Filter Banks
Previous Article in Journal
Symmetry Alignment–Feature Interaction Network for Human Ear Similarity Detection and Authentication
Previous Article in Special Issue
Parameter Estimation in Multifactor Uncertain Differential Equation with Symmetry Analysis for Stock Prediction
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Parameter Estimation of Uncertain Moving Average Model Based on Least Squares Principle

College of Mathematics and Statistics, Shanxi Datong University, Datong 037009, China
*
Author to whom correspondence should be addressed.
Symmetry 2025, 17(5), 656; https://doi.org/10.3390/sym17050656
Submission received: 22 March 2025 / Revised: 24 April 2025 / Accepted: 25 April 2025 / Published: 26 April 2025
(This article belongs to the Special Issue Symmetry Applications in Uncertain Differential Equations)

Abstract

:
The uncertain moving average model is a powerful tool to study the time series in which the data are affected by the previous disturbance terms under uncertain environments. However, the influence of uncertain disturbance terms is often ignored in the relevant statistical inference studies. In order to solve this problem, this paper constructs a symmetric statistical invariant, normal uncertain variable, by sorting out the uncertain moving average model and combining the uncertain disturbance terms, and then applies the least square principle to the parameter estimation of the symmetric statistical invariant to determine the unknown parameters and uncertain disturbance terms in the uncertain moving average model. In addition, a numerical algorithm is designed to calculate the corresponding estimators, and the problems related to hypothesis testing and forecast are also studied. Finally, a numerical example is given to illustrate the proposed method.

1. Introduction

Time series data are of vital importance across a wide range of essential domains, including economics, finance, environmental science, meteorology, and health care, and its importance cannot be ignored. Time series analysis is also of vital importance as a basic means to deeply analyze the intrinsic evolution of time series data and accurately predict the future trends. Looking back on the history of time series analysis, a key moment with milestone significance should date back to 1927. At that time, Yule, while carefully studying the sunspot data recorded by Wolfer, cleverly introduced the concept of autoregressive models (ARMs) in [1]. This breakthrough ushered in the era of modern time series analysis and research. Shortly thereafter, Walker [2] was inspired by the concept of autoregressive models, and conducted a comprehensive analysis of atmospheric pressure data from the Port of Darwin, India in 1931, which not only greatly broadened the application boundaries of autoregressive models and made them more widely applicable, but also innovatively proposed moving average models (MAMs) and autoregressive moving average models (ARMAMs), which combine the characteristics of ARM and MAM. These contributions have laid a firm theoretical foundation for the development of the field of time series analysis, so that it can move forward more stably. In addition, for researchers looking to gain a more profound comprehension of methods and applications of time series analysis, the works of Box and Jenkins [3] and Shumway and Stoffer [4] are indispensable resources.
It should be particularly pointed out that traditional time series analysis generally assumes that the non-deterministic phenomena inside the data originate from random fluctuations, and it usually adopts classical statistics based on probability theory to handle time series data. Nevertheless, in real-world application scenarios, the sources of non-deterministic phenomena in time series data are diverse and intricate, and there frequently occurs a situation of frequency instability. In view of this situation, numerous scholars have carried out a substantial number of empirical studies with the aim of exploring whether the non-deterministic phenomena in time series data can be reduced to randomness in real situations. For example, Yang and Liu [5] utilized partial differential equations to characterize the size of the Chinese population under different age structures, and discovered that if the non-deterministic phenomena in the population system were treated as randomness, then the resulting outcomes would conflict with the relevant assumptions. Another example is the study of Liu et al. [6], which modeled the order data of an online car-hailing platform in Beijing with the help of the theory of renewal process, and found that if the numbers of passengers were regarded as random variables, then the actual data collected would deviate from the corresponding 99% confidence interval very seriously. Therefore, if the non-deterministic phenomenon in time series data is simply reduced to randomness, it is highly likely to cause bias in the prediction of the constructed time series model and the support it provides for decision making. As a result, the accuracy and credibility of the results are also affected. For the purpose of handling this problem, and more comprehensively describing and effectively dealing with the non-deterministic phenomenon in time series data among practical applications, we are in urgent need of introducing a new axiomatic mathematical system, uncertainty theory, which was established by Liu [7] and further refined by Liu [8]. The time series analysis based on this theory is called uncertain time series analysis. Specifically, it is a collection of technical methods to collect, model and interpret time series data, evolving from uncertainty theory as a technical tool. The pioneering research work in this field commenced with the study by Yang and Liu [9], which defined the uncertain autoregressive model based on uncertainty theory.
Although some scholars have conducted corresponding studies on the least squares estimation of unknown parameters and the least squares estimation of disturbance terms in the uncertain moving average model, the previous studies completely ignored the existence of disturbance terms when estimating unknown parameters, did not comprehensively describe the influence of disturbance terms on the observed data, and separated the estimation of unknown parameters from the estimation of disturbance terms. In this paper, a symmetric statistical invariant containing unknown parameters of the uncertain moving average model is constructed by deforming the uncertain moving average model and combining the uncertainty distribution of the disturbance terms. Then, the statistical inference problem of the uncertain moving average model is transformed into the parameter estimation problem of the symmetric statistical invariant, and the corresponding estimators are determined by the least square principle. Specifically, the main contributions of this paper are as follows:
Based on the uncertain moving average model after deformation and the properties of uncertain disturbance terms, a symmetric statistical invariant containing unknown parameters of the uncertain moving average model is constructed.
The corresponding estimators are determined by combining the least squares principle, and a numerical algorithm is designed to solve the corresponding estimators.
A numerical example is provided to illustrate the corresponding theoretical results.

2. Literature Review of Uncertain Time Series Analysis

Since the concept of uncertain time series analysis was put forward, a large number of scholars have carried out in-depth research on it, and uncertain time series analysis has gradually developed into a relatively large-scale and mature theoretical branch within the field of uncertain statistics.
In the research of uncertain time series analysis, the primary and core content is the exploration of various uncertain time series models. The earliest uncertain time series model is the uncertain autoregressive model proposed by Yang and Liu [9]. In their research work, they made the assumption that the specific values of the time series will be influenced by both the data observed in the past and the disturbance term at the current moment. Moreover, they defined the disturbance term as an uncertain variable. This pioneering research achievement has established an important theoretical foundation for subsequent research on uncertain time series analysis. Following that, Lu et al. [10] pointed out that the future values of time series data may not only be affected by historical observed data, but may even be jointly influenced by past observed data and historical disturbance terms. In order to deal with this special phenomenon, they creatively proposed the uncertain moving average model and the uncertain autoregressive moving average model, enabling more accurate modeling and analysis of the corresponding time series data. In the meantime, Tang [11] keenly perceived the complex characteristics of multivariate time series and then proposed the uncertain vector autoregressive model. This model performs more excellently in capturing the complex relationships between variables and can more accurately reflect the actual situation. On the basic of the above results, Tang [12] then conducted related research on the time series data evolving from nonlinear relations and proposed the uncertain max autoregression model. The main purpose of this model is to effectively describe and characterize time series systems that are heavily influenced by historical extreme observations. Soon afterwards, in further research work, Tang [13] proposed the concept of the uncertain threshold autoregression model for modeling and analogizing the time series data evolving from the system with different change rules at different stages. In addition, when the system generating the time series data is completely unknown, that is, it is impossible to determine the specific form of the uncertain time series model, the research results of Zhang and Gao [14] provide us with a very powerful and effective tool for studying non-parametric time series systems. Overall, this series of rich research achievements has not only greatly enriched and improved the theoretical system framework of uncertain time series analysis, but also provided very solid and reliable technical support and a guarantee for numerous practical application scenarios.
Another important research direction of uncertain time series analysis lies in the statistical inference of uncertain time series models. This aspect plays a vital bridging role in uncertain time series analysis, closely linking theoretical methods with practical applications. To be specific, the statistical inference work for uncertain time series models mainly encompasses two vitally important core tasks. Firstly, it is necessary to accurately estimate the unknown parameters within the model, enabling the model to better fit the time series data. Secondly, it is essential to conduct a reasonable quantitative analysis of the uncertain disturbance term of the model. In terms of estimating unknown parameters, Yang and Liu [9] first proposed the least squares estimation method, whose core idea is to continuously adjust the parameters to minimize the sum of the squares of the deviations between the predicted values of the model and the actual observed values. However, considering that extreme outliers may exist in actual data and these outliers can significantly affect the results of the least squares estimation, Yang et al. [15] further proposed the least absolute deviation estimation method. This method effectively improves the accuracy of parameter estimation by minimizing the sum of the absolute values of the deviations. Beyond that, many scholars are also actively exploring other estimation methods. For example, Chen and Yang [16] successfully applied the idea of uncertain maximum likelihood to the estimation of unknown parameters in uncertain autoregressive models. Other parameter estimation methods can be found in lasso estimation (Zhang et al. [17]), ridge estimation (Chen and Yang [18]), Huber estimation (Liu [19]), moment estimation (Liu and Qin [20]), and so on. These different estimation methods all play important roles in the field of parameter estimation of uncertain time series models and demonstrate their respective values. In the field of estimating the uncertain disturbance terms, the pioneering research conducted by Yang and Liu [9] laid a solid foundation for the subsequent moment estimation method. Later, Chen and Yang [16] introduced the uncertain maximum likelihood estimation method and further promoted the development of the research of this field. However, due to the presence of outliers in actual data, which may lead to biases in the estimation results, Liu and Liu [21] studied an improved uncertain maximum likelihood estimation method. This new method has a stronger adaptability to potential data biases. In addition, Liu and Liu [22] also studied a least squares estimation method. This method estimates the uncertain disturbance terms by minimizing the sum of the deviations between the empirical distribution of the observed values and the population distribution.

3. Deformation of Uncertain Moving Average Model

In the field of uncertain time series analysis, a common assumption is that a series of observations indexed in time are available, called time series data, as follows:
X t , t = 1 , 2 , , n .
In order to model the time series data in the scenario where current data are more likely to be affected by previous disturbance items, Yang and Ni [23] introduced the uncertain moving average model to smooth such time series data, and then explored the long-term trend and periodic changes of time series data. Based on the assumption of a linear relationship between observations and uncertain disturbance terms, the model successfully removes short-term fluctuations and noise components in the data, and has practical applications in many fields such as market forecasting and climate change research.
Consider an uncertain moving average model of order p which is denoted as
X t = a 0 + ϵ t i = 1 p a i ϵ t i .
Here, ϵ t , ϵ t 1 , ⋯, ϵ t p are called the uncertain disturbance terms, and are assumed to be independent and follow the same uncertainty distribution, uncertain normal distribution N ( 0 , σ ) , while a 0 , a 1 , ⋯, a p are a set of unknown parameters, which reflect the weights given to each uncertain disturbance term. With the help of the backshift operator B (satisfying B k ϵ t = ϵ t k ), the p order uncertain moving average model (2) can be reformulated as
X t a 0 = ϵ t i = 1 p a i B i ϵ t = 1 i = 1 p a i B i ϵ t = Θ ( B ) ϵ t ,
where
Θ ( B ) = 1 i = 1 p a i B i .
In order to calculate the corresponding disturbance term based on time series data, Xin et al. [24] proved that Formula (3) is reversible, and can be written as
ϵ t = 1 Θ ( B ) X t a 0 ,
where
1 Θ ( B ) = i = 0 π i B i , π 0 = 1 , π i = j = 1 p i a j π i j , p i = min ( p , i ) .
Thus, we have
ϵ t = i = 0 π i B i X t a 0 = i = 0 π i X t i a 0 ,
Since the minimum index of time series data (1) starts from 1, we set X t = a 0 if t 0 . Then, we have
ϵ t = i = 0 t 1 π i X t i a 0 = X t a 0 + i = 1 t 1 π i X t i a 0 .
Based on the above analysis, we can rewrite the p order uncertain moving average model (2) as
X t = a 0 i = 1 t 1 π i X t i a 0 + ϵ t
for t = 1 , 2 , , n .

4. Parameter Estimation of Uncertain Moving Average Model

In this section, we will discuss the parameter estimation of uncertain moving average model based on time series data and least squares principle.
For a given set of time series data (1), it follows from the deformation expression (4) and the normality assumption of the uncertain disturbance terms that
ϵ t = X t a 0 + i = 1 t 1 π i X t i a 0 N ( 0 , σ ) .
By defining the following n real functions with unknown parameters a 0 , a 1 , ⋯, a p and σ  as
h t ( a 0 , a 1 , , a p , σ ) = X t a 0 + i = 1 t 1 π i X t i a 0 σ , t = 1 , 2 , , n ,
it is not difficult to infer that when the unknown parameters a 0 , a 1 , ⋯, a p and σ in the p order uncertain moving average model (2) can take on true values, these n real functions h 1 , h 2 , , h n can be regarded as n samples of the standard normal uncertainty distribution N ( 0 , 1 ) . That is, we have
h 1 , h 2 , , h n N ( 0 , 1 ) .
Here, Equation (6) is the statistical invariant that we want to construct, and notice that its uncertainty distribution, standard normal uncertainty distribution N ( 0 , 1 ) , is symmetric about the origin, so it is also a symmetric statistical invariant.
Based on the above analysis, we know that the n real functions h 1 , h 2 , , h n defined above can be regarded as a set of samples of standard normal uncertainty distribution N ( 0 , 1 ) when the unknown parameters a 0 , a 1 , ⋯, a p and σ take true values. Therefore, in order to estimate the values of the unknown parameters, we should find the set of parameters that makes the empirical distribution function of h 1 , h 2 , , h n most close to the uncertainty distribution of N ( 0 , 1 ) . It should be noted that the empirical distribution corresponding to h 1 , h 2 , , h n is given by
F ( x ) = 1 n t = 1 n I ( h t x ) .
Here,  I ( h t x )  is the indicator function which takes the value of 1 when h t x and takes the value of 0 when h t > x . Moreover, the expression of the uncertainty distribution of N ( 0 , 1 ) is
Φ ( x ) = 1 + exp π x 3 1 .
According to the idea of the least squares principle, we can choose the parameter values that minimizes the sum of squares of the deviation between the empirical distribution F ( x ) and the uncertainty distribution Φ ( x ) as the estimated parameters, that is, the least squares estimation of p order uncertain moving average model (2) should solve the following minimization problem:
min a 0 , a 1 , , a p , σ > 0 t = 1 n 1 + exp π h t 3 1 1 n j = 1 n I ( h j h t ) 2 .
It is worth noting that the core challenge in solving the above optimization problem is the nonlinear connection between the decision variables and the objective function, because the decision variables are not only contained in the normal uncertainty distribution function, but also in the indicator function. This form makes the process of directly solving the optimization problem very difficult. To address this issue, we carefully designed and effectively implemented an algorithmic approach (Algorithm 1) whose key goal is to effectively process and approximate the least squares solution of the optimization problem, thereby achieving efficient exploration and solution of the problem.
Algorithm 1 Numerical algorithms for least squares estimation
Step 0: Input time series X t , t = 1 , 2 , , n .
Step 1: Determine the feasible regions A of unknown parameters vector ( a 0 , a 1 , , a p , σ ) .
Step 2: For each ( a 0 , a 1 , , a p , σ ) A and σ > 0 , compute π 1 , π 2 ,⋯, π n 1 by
π i = j = 1 p i a j π i j , i = 1 , 2 , , n 1 ,
and h 1 , h 2 , ⋯, h n by
h t = X t a 0 + i = 1 t 1 π i X t i a 0 σ , t = 1 , 2 , , n .
Step 3: Set t = 1 and E a 0 , a 1 , , a p , σ = 0 .
Step 4: Set
E a 0 , a 1 , , a p , σ = E a 0 , a 1 , , a p , σ + 1 + exp π h t 3 1 1 n j = 1 n I ( h j h t ) 2 ,
and t = t + 1 .
Step 5: If t n , then go to Step 4.
Step 6: Find a ^ 0 , a ^ 1 , , a ^ p and σ ^ such that E a 0 , a 1 , , a p , σ reaches its minimum value.
Step 7: Output a ^ 0 , a ^ 1 , , a ^ p and σ ^ .
Remark 1.
The feasible regions A in the above algorithm refer to a set of all possible values of unknown parameters vector ( a 0 , a 1 , , a p , σ ) . In addition, the function E a 0 , a 1 , , a p , σ defined above is essentially a process variable used to calculate the objective function in the optimization problem (8). In particular, when t = n , the obtained E a 0 , a 1 , , a p , σ is just the objective function in the optimization problem (8).

5. Uncertain Hypothesis Testing and Forecasting

Based on the estimated parameters a ^ 0 , a ^ 1 , , a ^ p , σ ^ using the least squares principle, we can obtain an estimated uncertain moving average model. In order to facilitate the subsequent derivation, we still use the deformed uncertain moving average model here, that is,
X t = a ^ 0 i = 1 t 1 π ^ i X t i a ^ 0 + ϵ ^ t ,
where ϵ ^ t follows the uncertain normal distribution N ( 0 , σ ^ ) . Then, the next key question is whether the estimated uncertain moving average model (9) is reasonable, and how to predict the value of time series data at future moments based on the estimated uncertain moving average model (9).

5.1. Uncertain Hypothesis Test

Before evaluating whether the estimated uncertain moving average model (9) is reasonable, we first calculate the corresponding residual series
ε t = X t a ^ 0 + i = 1 t 1 π ^ i X t i a ^ 0 , t = 1 , 2 , , n ,
which can be regarded as the samples of the uncertain disturbance terms ϵ ^ t , t = 1 , 2 , , n , respectively. If the estimated uncertain moving average model (9) is reasonable, that is, it can fit the given time series data (1) well, then according to the normality assumption (5) in the uncertain moving average model (4), these n residual terms ε 1 , ε 2 , , ε n should be a set of samples from the population N ( 0 , σ ^ ) . At this point, testing whether the estimated uncertain moving average model (9) is reasonable is equivalent to testing whether those obtained n residuals ε 1 , ε 2 , , ε n are a set of samples of N ( 0 , σ ^ ) .
According to the uncertain hypothesis test proposed by Ye and Liu [25], for a given significance level α , if the residuals ε 1 , ε 2 , , ε n are indeed a set of samples of population N ( 0 , σ ^ ) , then there should be no more than α points in the residuals ε 1 , ε 2 , , ε n beyond the 1 α confidence interval of population N ( 0 , σ ^ ) , where the α confidence interval of population N ( 0 , σ ^ ) is
Φ 1 α 2 , Φ 1 1 α 2
and
Φ 1 α = 3 σ ^ π ln α 1 α
is the inverse uncertainty distribution of population N ( 0 , σ ^ ) . Based on this idea, we can give the rejection region W to test whether the estimated uncertain moving average model (9) is suitable. That is,
W = { ( ε 1 , ε 2 , , ε n ) : there   are   at   least n × α + 1 of   indexes t s   with   1 t n such   that   ε t < Φ 1 α 2   or   ε t > Φ 1 1 α 2 }
where x represents the maximum integer not exceeding x. When we perform the test, we can do so by determining whether the obtained residual sequence ε 1 , ε 2 , , ε n belongs to the rejection region W. If the residuals sequence falls within the rejection region W, i.e.,
( ε 1 , ε 2 , , ε n ) W ,
In this case, we can only infer that the estimated uncertain moving average model (9) does not provide an optimal fit for time series data (1) and is not reasonable. Conversely, if the residuals sequence is not in the above rejected region, i.e.,
( ε 1 , ε 2 , , ε n ) W ,
then we can reasonably conclude that the estimated uncertain moving average model (9) has a good fit to the time series data (1) and is reasonable.

5.2. Forecast Value and Forecast Interval

If the uncertain hypothesis test shows that the estimated uncertain moving average model (9) is reasonable, then the next step is to forecast the time series data for the future moments.
Assume that we need to forecast the time series data at time n + 1 , then the time series data at time t = 1 , 2 , , n are all known. At this point, we can substitute the known time series data X t , t = 1 , 2 , , n into the estimated uncertain moving average model (9) to obtain the forecast uncertain variable X ^ n + 1 at time n + 1 as
X ^ n + 1 = a ^ 0 i = 1 n π ^ i X n + 1 i a ^ 0 + N ( 0 , σ ^ ) .
However, the forecast information needed in practice is often certain, and the above forecast uncertain variable X ^ n + 1 is still an uncertain variable in essence, which still contains a lot of uncertainty. Therefore, we also need to give the forecast value and forecast interval of X n + 1 based on the above forecast uncertain variable X ^ n + 1 .
Note that the forecast uncertain variable X ^ n + 1 is a normal uncertain variable whose uncertainty distribution is
Φ ^ n + 1 ( x ) = 1 1 + exp π 3 σ ^ a ^ 0 i = 1 n π ^ i X n + 1 i a ^ 0 x ,
and inverse uncertainty distribution is
Φ ^ n + 1 1 ( α ) = a ^ 0 i = 1 n π ^ i X n + 1 i a ^ 0 + σ ^ 3 π ln α 1 α .
It is easy to infer that the expected value of the forecast uncertain variable X ^ n + 1 is
a ^ 0 i = 1 n π ^ i X n + 1 i a ^ 0
and the standard deviation is σ ^ . Then, we can choose the expected value of X ^ n + 1 as its forecast value, i.e.,
μ ^ = a ^ 0 i = 1 n π ^ i X n + 1 i a ^ 0 .
On the other hand, for the given significance level α , we can also choose the 1 α confidence interval as its forecast interval, i.e.,
a ^ 0 i = 1 n π ^ i X n + 1 i a ^ 0 ± σ ^ 3 π ln 1 + α 1 α .

6. Numerical Example

In this section, we will provide a numerical example to illustrate the proposed parameter estimation methods and the model testing and forecast methods.
Suppose we have a set of time series data that contains 15 observations in total, as shown in Table 1. Next, we will describe the time series data in Table 1 based on the 1 order uncertain moving average model, and carry out corresponding research on parameter estimation, model testing and forecast. Specifically, we consider the following 1 order uncertain moving average model,
X t = a 0 + ϵ t a 1 ϵ t 1 .
In this model, a 0 and a 1 are unknown parameters, ε t , t = 1 , 2 , are a set of uncertain disturbance terms and are assumed to obey the normal uncertainty distribution N ( 0 , σ ) .
For the 1 order uncertain moving average model (14), it is easy to infer that
π 0 = 1 , π i = j = 1 1 i a j π i j = a 1 π i 1 = a 1 2 π i 2 = = a 1 i , i = 1 , 2 ,
Thus, it follows from the constructed statistical invariants (6) that we can obtain 15 real functions h 1 , h 2 , , h 15 as
h t ( a 0 , a 1 , σ ) = X t a 0 + i = 1 t 1 a 1 i X t i a 0 σ , t = 1 , 2 , , 15
by substituting the time series data from Table 1. Then, according to minimization problem (8), the least squares estimations of a 0 , a 1 , and σ should solve the following minimization problem:
min a 0 , a 1 , σ > 0 t = 1 15 1 + exp π h t 3 1 1 15 j = 1 15 I ( h j h t ) 2 .
By using Matlab with version of ‘9.13.0.2049777 (R2022b)’to solve the above optimization problem, we obtain
a ^ 0 = 0.9357 , a ^ 1 = 1.2114 , σ ^ = 1.8473 .
Thus, the estimated 1 order uncertain moving average model is obtained as
X t = 0.9357 + ϵ t 1.2114 ϵ t 1 ,
where ε t , t = 1 , 2 , are a set of uncertain disturbance terms and are assumed to obey the normal uncertainty distribution N ( 0 , 1.8473 ) .
Next, we will combine the time series data in Table 1 to test the estimated 1 order uncertain moving average model (15) to evaluate whether it is suitable. By substituting the 15 time series data and the estimated parameters a ^ 0 , a ^ 1 , and σ ^ into (5) and then using
ε t = X t 0.9357 + i = 1 t 1 1 . 2114 i X t i 0.9357 , t = 1 , 2 , , 15 ,
we can obtain data of 15 residuals ε 1 , ε 2 , , ε 15 , as shown in Table 2. Suppose we consider the suitability test of the estimated 1 order uncertain moving average model (15) with a significance level of α = 0.05 . Then, it follows from α × 15 = 0.75 and Equation (10) that the test is
W = { ( ε 1 , ε 2 , , ε 15 ) : there   is   at   least   1   of   index   t   with   1 t 15 such   that ε t < 3.7312   or   ε t > 3.7312 } .
As shown in Table 2, we can verify that all the residuals do not exceed the interval [ 3.7312 , 3.7312 ] . Therefore, we have
( ε 1 , ε 2 , , ε 15 ) W .
That is, according to the relevant results of the previous model test, we can reach the conclusion that the estimated 1 order uncertain moving average model (15) has a good fit to the time series data in Table 1 and is reasonable.
Finally, based on the estimated 1 order uncertain moving average model (15), we make a forecast of the time series data at the next moment, i.e., forecast X 16 . On the basis of Equation (11), the forecast uncertain variable of X 16 can be calculated by
X ^ 16 = 0.9357 i = 1 15 1.2114 i X 16 i 0.9357 + N ( 0 , 1.8473 ) .
Then, it follows from (12) and (13) that the forecast value of X 16 is
μ ^ = 0.9357 i = 1 15 1.2114 i X 16 i 0.9357 = 2.4745 ,
and the 1 α confidence interval is
2.4745 ± 1.8473 3 π ln 1 + α 1 α = [ 1.2567 , 6.2057 ] .
By putting time series data, forecast value and forecast interval in the same graph (please see Figure 1), it can be seen that point forecast and interval forecast based on the estimated 1 order uncertain moving average model (15) can effectively inherit the trend of existing data and produce an effective forecast.
Furthermore, given that only moment estimation (proposed by Liu and Liu [26]) has been studied among the parameter estimation methods of uncertain moving average models based on uncertainty theory, here we apply the moment estimation method based on the same dataset shown in Table 1 to compare with the method proposed in this paper. It follows from the work of Liu and Liu [26] that the moment estimations of a 0 , a 1 , and σ should solve the following system of equations:
1 15 t = 1 15 X t a 0 + i = 1 t 1 a 1 i X t i a 0 σ = 0 1 15 t = 1 15 X t a 0 + i = 1 t 1 a 1 i X t i a 0 σ 2 = 1 1 15 t = 1 15 X t a 0 + i = 1 t 1 a 1 i X t i a 0 σ 3 = 0 .
By using Matlab to solve the above system of equations, we obtain
a ^ 0 = 0.8814 , a ^ 1 = 0.6262 , σ ^ = 4.0660 .
Thus, the estimated 1 order uncertain moving average model by moment estimation method is obtained as
X t = 0.8814 + ϵ t + 0.6262 ϵ t 1 ,
where ε t , t = 1 , 2 , are a set of uncertain disturbance terms and are assumed to obey the normal uncertainty distribution N ( 0 , 4.0660 ) . It can be seen from the estimated models (17) and (15) that the standard deviation of the residuals estimated based on the moment estimation method is 4.0660, which is much larger than the standard deviation of the residuals estimated based on the least squares method, which is 1.8473. This indicates that the residuals estimated based on the moment estimation method still contain a large amount of uncertainties, and its prediction accuracy is much worse than the method proposed in this paper. On the other hand, we can also calculate the mean absolute error (MAE) and mean square error (MSE) of these two estimated models based on the dataset shown in Table 1, as shown in Table 3, respectively. It can also be seen from Table 3 that the prediction error of the least squares method proposed in this paper is much smaller than that of the moment estimation method, which fully demonstrates the effectiveness of the method proposed in this paper.

7. Real Data Example

In this section, we will provide a real data example to illustrate the proposed parameter estimation methods and the model testing and forecast methods, in which the dataset is global daily carbon dioxide emission per day during 1 July 2023 to 31 July 2023 and was collected by Liu and Liu [26], as shown in Table 4. In the work of Liu and Liu [26], the 3 order uncertain moving average model was obtained as
X t = 96.257 + ϵ t + 0.785 ϵ t 1 + 0.026 ϵ t 2 0.392 ϵ t 3 ,
where ε t , t = 1 , 2 , are a set of uncertain disturbance terms and are assumed to obey the normal uncertainty distribution N ( 0 , 3.494 ) .
Next, we will describe the above time series data in Table 4 based on the 3 order uncertain moving average model and the least squares estimation method, and carry out corresponding research on parameter estimation, model testing and forecast. For the 3 order uncertain moving average model, it is easy to infer that
π 0 = 1 , π i = j = 1 3 i a j π i j , i = 1 , 2 ,
Thus, it follows from the constructed statistical invariants (6) that we can obtain 31 real functions h 1 , h 2 , , h 31 as
h t ( a 0 , a 1 , a 2 , a 3 , σ ) = X t a 0 + i = 1 t 1 π i X t i a 0 σ , t = 1 , 2 , , 31
by substituting the time series data from Table 4. Then, according to minimization problem (8), the least squares estimations of a 0 , a 1 , a 2 , a 3 , and σ should solve the following minimization problem:
min a 0 , a 1 , a 2 , a 3 , σ > 0 t = 1 31 1 + exp π h t 3 1 1 31 j = 1 31 I ( h j h t ) 2 .
By using Matlab to solve the above optimization problem, we obtain
a ^ 0 = 95.5417 , a ^ 1 = 0.0701 , a ^ 2 = 0.6170 , a ^ 3 = 0.0880 , σ ^ = 3.0614 .
Thus, the estimated 3 order uncertain moving average model is obtained as
X t = 95.5417 + ϵ t 0.0701 ϵ t 1 + 0.6170 ϵ t 2 0.0880 ϵ t 3 ,
where ε t , t = 1 , 2 , are a set of uncertain disturbance terms and are assumed to obey the normal uncertainty distribution N ( 0 , 3.0614 ) . It can be seen from the estimated models (18) and (19) that the standard deviation of the residuals estimated based on the moment estimation method is 3.494, which is much larger than the standard deviation of the residuals estimated based on the least squares method, which is 3.0614. This indicates that the residuals estimated based on the moment estimation method still contain a large amount of uncertainties, and its prediction accuracy is much worse than the method proposed in this paper.
Next, we will combine the time series data in Table 4 to test the estimated 3 order uncertain moving average model (19) to evaluate whether it is suitable. By substituting the 31 time series data and the estimated parameters a ^ 0 , a ^ 1 , a ^ 2 , a ^ 3 , and σ ^ into (5) and then using
ε t = X t 95.5417 + i = 1 t 1 π i X t i 95.5417 , t = 1 , 2 , , 31 ,
we can obtain 31 residuals data ε 1 , ε 2 , , ε 31 as shown in Table 5. Suppose we consider the suitability test of the estimated 3 order uncertain moving average model (19) with a significance level of α = 0.05 . Then, it follows from α × 31 = 1.55 and Equation (10) that the test is
W = { ( ε 1 , ε 2 , , ε 31 ) : there   are   at   least   2   of   indexes t s     with 1 t 31 such   that ε t < 6.1836   or   ε t > 6.1836 } .
As shown in Table 5, we can verify that only ε 18 [ 6.1836 , 6.1836 ] . Therefore, we have
( ε 1 , ε 2 , , ε 31 ) W .
That is, according to the relevant results of the previous model test, we can reach the conclusion that the estimated 3 order uncertain moving average model (19) has a good fit to the time series data in Table 4 and is reasonable.
Finally, based on the estimated 3 order uncertain moving average model (19), we make a forecast of the time series data at the next moment, i.e., forecast X 32 . On the basic of Equation (11), the forecast uncertain variable of X 32 can be calculated by
X ^ 32 = 95.5417 i = 1 31 π i X 31 i 95.5417 + N ( 0 , 3.0614 ) .
Then, it follows from (12) and (13) that the forecast value of X 31 is
μ ^ = 95.5417 i = 1 31 π i X 31 i 95.5417 = 93.1167 ,
and the 1 α confidence interval is
93.1167 ± 3.0614 3 π ln 1 + α 1 α = [ 86.9331 , 99.3003 ] .
By putting time series data, forecast value and forecast interval in the same graph (please see Figure 2), it can be seen that point forecast and interval forecast based on the estimated 3 order uncertain moving average model (19) can effectively inherit the trend of existing data and produce an effective forecast.

8. Conclusions

In order to solve and enrich the research of statistical inference of the uncertain moving average model, a new method was proposed in this paper by deforming the uncertain moving average model to construct a symmetric statistical invariant, the normal uncertain variable. Based on this, the unknown parameters and uncertain disturbance terms in the uncertain moving average model were also determined by estimating the unknown parameters in the statistical invariants based on the least square principle. In addition, a numerical algorithm was designed to calculate the corresponding estimates. Then, this paper used the uncertain hypothesis test to verify the applicability of the estimated uncertain moving average model, discussed how to make point forecast and interval forecast for the future value of time series based on the estimated uncertain moving average model. Finally, a numerical example study was given to illustrate the proposed method and compared with existing methods to illustrate the advantages of the proposed method.

Author Contributions

Conceptualization, H.W.; methodology, H.W.; software, H.W.; validation, H.W. and H.S.; formal analysis, H.S.; investigation, H.W.; resources, H.W. and H.S.; data curation, H.W.; writing—original draft preparation, H.W.; writing—review and editing, H.S.; supervision, H.W. and H.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Scientific and Technological Innovation Programs of Higher Education Institutions in Shanxi (No. 2022L415) and Shanxi Datong University Project (No. 2022Q15).

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors especially thank the editors and anonymous referees for their kindly review and helpful comments. Any remaining errors are ours.

Conflicts of Interest

We declare that we have no relevant material or financial interests that relate to the research described in this paper. Neither the entire paper nor any part of its content has been published or has been accepted elsewhere. It is also not being submitted to any other journal.

References

  1. Yule, G. On a method of investigating periodicities disturbed series, with special reference to Wolfer’s sunspot numbers. Philos. Trans. R. Soc. Lond. Ser. A 1927, 226, 267–298. [Google Scholar]
  2. Walker, G. On periodicity in series of related terms. Proc. R. Soc. Lond. Ser. A 1931, 131, 518–532. [Google Scholar] [CrossRef]
  3. Box, G.; Jenkins, G. Time Series Analysis: Forecasting and Control; Holden-Day: San Francisco, CA, USA, 1970. [Google Scholar]
  4. Shumway, R.; Stoffer, D. Time Series Analysis and Its Applications; Springer: New York, NY, USA, 2000. [Google Scholar]
  5. Yang, L.; Liu, Y. Solution method and parameter estimation of uncertain partial differential equation with application to China’s population. Fuzzy Optim. Decis. Mak. 2024, 23, 155–177. [Google Scholar] [CrossRef]
  6. Liu, Y.; Qin, Z.; Li, X. Are the queueing systems in practice random or uncertain? Evidence from online car-hailing data in Beijing. Fuzzy Optim. Decis. Mak. 2024, 23, 497–511. [Google Scholar] [CrossRef]
  7. Liu, B. Uncertainty Theory, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 2007. [Google Scholar]
  8. Liu, B. Some research problems in uncertainty theory. J. Uncertain Syst. 2009, 3, 3–10. [Google Scholar]
  9. Yang, X.; Liu, B. Uncertain time series analysis with imprecise observations. Fuzzy Optim. Decis. Mak. 2019, 18, 263–278. [Google Scholar] [CrossRef]
  10. Lu, J.; Peng, J.; Chen, J.; Sugeng, K. Prediction method of autoregressive moving average models for uncertain time series. Int. J. Gen. Syst. 2020, 49, 546–572. [Google Scholar] [CrossRef]
  11. Tang, H. Uncertain vector autoregressive model with imprecise observations. Soft Comput. 2020, 24, 17001–17007. [Google Scholar] [CrossRef]
  12. Tang, H. Uncertain max-autoregressive model with imprecise observations. J. Intell. Fuzzy Syst. 2021, 41, 6915–6922. [Google Scholar] [CrossRef]
  13. Tang, H. Uncertain threshold autoregressive model with imprecise observations. Commun. Stat.—Theory Methods 2022, 51, 8776–8785. [Google Scholar] [CrossRef]
  14. Zhang, Y.; Gao, J. Nonparametric uncertain time series models: Theory and application in brent crude oil spot price analysis. Fuzzy Optim. Decis. Mak. 2024, 23, 239–252. [Google Scholar] [CrossRef]
  15. Yang, X.; Park, G.; Hu, Y. Least absolute deviations estimation for uncertain autoregressive model. Soft Comput. 2020, 24, 18211–18217. [Google Scholar] [CrossRef]
  16. Chen, D.; Yang, X. Maximum likelihood estimation for uncertain autoregressive model with application to carbon dioxide emissions. J. Intell. Fuzzy Syst. 2021, 40, 1391–1399. [Google Scholar] [CrossRef]
  17. Zhang, Z.; Yang, X.; Gao, J. Uncertain autoregressive model via LASSO procedure. Int. J. Uncertain. Fuzziness Knowl.-Based Syst. 2020, 28, 939–956. [Google Scholar] [CrossRef]
  18. Chen, D.; Yang, X. Ridge estimation for uncertain autoregressive model with imprecise observations. Int. J. Uncertain. Fuzziness Knowl.-Based Syst. 2021, 29, 37–55. [Google Scholar] [CrossRef]
  19. Liu, Z. Huber estimation for uncertain autoregressive model. J. Uncertain Syst. 2021, 14, 2150010. [Google Scholar] [CrossRef]
  20. Liu, Y.; Qin, Z. Moment estimation of uncertain autoregressive model and its application in financial market. Commun. Stat.—Simul. Comput. 2024. [Google Scholar] [CrossRef]
  21. Liu, Y.; Liu, B. A modified uncertain maximum likelihood estimation with applications in uncertain statistics. Commun. Stat. Theory Methods 2024, 53, 6649–6670. [Google Scholar] [CrossRef]
  22. Liu, Y.; Liu, B. Estimation of uncertainty distribution function by the principle of least squares. Commun. Stat.—Theory Methods 2024, 53, 7624–7641. [Google Scholar] [CrossRef]
  23. Yang, X.; Ni, Y. Least squares estimation for uncertain moving average model. Commun. Stat.—Theory Methods 2021, 50, 4134–4143. [Google Scholar] [CrossRef]
  24. Xin, Y.; Yang, X.; Gao, J. Least squares estimation for the high-order uncertain moving average model with application to carbon dioxide emissions. Int. J. Gen. Syst. 2021, 50, 724–740. [Google Scholar] [CrossRef]
  25. Ye, T.; Liu, B. Uncertain hypothesis test with application to uncertain regression analysis. Fuzzy Optim. Decis. Mak. 2022, 21, 157–174. [Google Scholar] [CrossRef]
  26. Liu, Z.; Liu, Y. Moment estimation for uncertain moving average model with application to CO2 emission. Commun. Stat. Simul. Comput. 2025. [Google Scholar] [CrossRef]
Figure 1. Forecast value and forecast interval of the estimated 1 order uncertain moving average model (15).
Figure 1. Forecast value and forecast interval of the estimated 1 order uncertain moving average model (15).
Symmetry 17 00656 g001
Figure 2. Forecast value and forecast interval of the estimated 3 order uncertain moving average model (19).
Figure 2. Forecast value and forecast interval of the estimated 3 order uncertain moving average model (19).
Symmetry 17 00656 g002
Table 1. Time series data of numerical example.
Table 1. Time series data of numerical example.
0.17003.2160−2.14450.66853.9696
2.8559−0.9928−3.35836.2429−1.5084
2.4760−0.69511.7182−0.58370.0949
Table 2. Residuals data of estimated 1 order uncertain moving average model (15).
Table 2. Residuals data of estimated 1 order uncertain moving average model (15).
−0.76581.3525−1.4418−2.01390.5943
2.64001.2697−2.75591.9686−0.0594
1.46830.14790.9617−0.3544−1.2702
Table 3. MAEs and MSEs of the estimated uncertain moving average models (15) and (17) using least squares method and moment estimation method.
Table 3. MAEs and MSEs of the estimated uncertain moving average models (15) and (17) using least squares method and moment estimation method.
MAEMSE
least squares method1.27102.2534
moment estimation method3.564016.5320
Table 4. Global daily carbon dioxide emission per day during 1 July 2023 to 31 July 2023.
Table 4. Global daily carbon dioxide emission per day during 1 July 2023 to 31 July 2023.
93.6590.2693.9794.6393.5496.4596.2995.0892.23
94.6596.2597.2698.597.695.9893.3596.26100.01
100.01100.25100.0697.2693.2796.1498.7998.8597.26
96.1194.6190.7293.87
Table 5. Residuals data of estimated 3 order uncertain moving average model (19).
Table 5. Residuals data of estimated 3 order uncertain moving average model (19).
−1.8917−5.4142−0.78382.2074−1.8399−0.65162.0321−0.0793−4.6284
−0.98823.48782.16500.87111.09040.1678−2.77610.51636.2320
4.34211.21282.47271.5254−3.5837−0.37635.56733.6151−1.4965
−1.27710.2203−4.1500−2.2108
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, H.; Shi, H. Parameter Estimation of Uncertain Moving Average Model Based on Least Squares Principle. Symmetry 2025, 17, 656. https://doi.org/10.3390/sym17050656

AMA Style

Wang H, Shi H. Parameter Estimation of Uncertain Moving Average Model Based on Least Squares Principle. Symmetry. 2025; 17(5):656. https://doi.org/10.3390/sym17050656

Chicago/Turabian Style

Wang, Han, and Haiyan Shi. 2025. "Parameter Estimation of Uncertain Moving Average Model Based on Least Squares Principle" Symmetry 17, no. 5: 656. https://doi.org/10.3390/sym17050656

APA Style

Wang, H., & Shi, H. (2025). Parameter Estimation of Uncertain Moving Average Model Based on Least Squares Principle. Symmetry, 17(5), 656. https://doi.org/10.3390/sym17050656

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop