Abstract
The multivariate random coefficient autoregression (RCAR) process is widely used in time series modeling applications. Random autoregressive coefficients are usually assumed to be independent and identically distributed sequences of random variables. This paper investigates the issue of coefficient constancy testing in a class of static multivariate first-order random coefficient autoregressive models. We construct a new test statistic based on the locally most powerful-type test and derive its limiting distribution under the null hypothesis. The simulation compares the empirical sizes and powers of the LMP test and the empirical likelihood test, demonstrating that the LMP test outperforms the EL test in accuracy by 10.2%, 10.1%, and 30.9% under conditions of normal, Beta-distributed, and contaminated errors, respectively. We provide two sets of real data to illustrate the practical effectiveness of the LMP test.
MSC:
62H15
1. Introduction
The autoregressive model proposed by Yule [1] is one of the most classical models in time series analysis. It has been extensively studied for its theoretical properties and applications. See, for example, Box and Jenkins [2] and Maleki and Nematollahi [3]. In recent years, statisticians have shown increased interest in multivariate time series. As a result, multivariate autoregressive models have been widely studied in the fields of traffic prediction (Dmitry [4]) and neuroscience (Gilson et al. [5]). The multivariate first-order autoregressive (AR(1)) model is defined as
where is an matrix with constant elements and is an independent and identically distributed (i.i.d.) random error vector of with E. However, the coefficients of a time series may not be determined in practical problems. To enhance the utility of autoregressive models, Anděl [6] extended the multivariate AR model to include the random coefficients case and proposes a multivariate random coefficients autoregressive model.
The multivariate first-order random coefficient autoregressive (RCAR(1)) model is defined as
where is an i.i.d. random sequence and is a sequence of dependent random errors. and are independent of each other for fixed t and all s < t.
The inference for the univariate RCAR(1) model has been extensively developed. RCAR models are essential in various fields, including predicting, financial, biology, meteorology, and other practical applications. Stationarity, ergodicity, and finiteness of moments for RCAR processes are discussed by Feigin and Tweedie [7]. Aue et al. [8] demonstrated that under optimal assumptions, the quasi maximum likelihood estimator in the RCAR(1) model exhibits asymptotic normality and strong consistency. Aue and Horváth [9] introduced a unified quasi-likelihood approach to estimating the unknown parameters of the RCAR(1) model. This approach works for both stationary and nonstationary processes. Inspired by Dette and Wied [10], Horváth and Trapani [11] and Li Bi et al. [12] proposed tests to determine whether the autoregressive coefficient is truly random or not. Horváth and Trapani [11] introduced a useful test to assess randomness within the RCAR(1) model using the two-stage WLS method, which can be applied even in the case when is non-stationary. Wang et al. [13] derived the asymptotic characteristics of the maximum likelihood estimator for parameters in the RCAR(1) model under the assumption of a unit root.
The increase in unknown parameters has significantly complicated the analysis of multivariate RCAR(1) models. Currently, there is limited research on this subject. Chen et al. [14] tested the randomness of coefficients in the multivariate RCAR(1) model based on the empirical likelihood method. Furthermore, the multivariate RCAR(1) model was researched by Prášková and Vaněček [15], Vaněček [16], and Nicholls and Quinn [17]. In this paper, we employ the locally most powerful-type (LMP) method to test the randomness of coefficients for the multivariate RCAR(1) model. The LMP test has been studied by various researchers. Manik et al. [18], Chikkagoudar and Biradar [19], and Rohatgi et al. [20] performed investigations using the LMP test to examine the stability of parameters. Some advantages of this method include its ability to generate stable sizes when parameters are close to their true values and the relatively simple calculation process. Moreover, our testing method exhibits a higher test performance compared to some traditional testing methods. In this article, we demonstrate the implementation of our proposed method for a finite number of samples, both under contamination errors and normal conditions. Numerical simulations indicate that our test outperforms the empirical likelihood (EL) test proposed by Chen et al. [14] in terms of maintaining both empirical sizes and powers.
The remainder of the paper is organized as follows. In Section 2, under the null hypothesis, we provide the LMP test statistic and its asymptotic distribution. Additionally, in Section 3, we conduct numerous numerical simulation studies and present the results. In Section 4, two practical examples are used to illustrate the superiority and justification of our approach. The proofs are provided in Appendix A.
2. Methodology and Main Results
In this paper, we study the multivariate first-order random coefficient autoregressive model. The definition is as follows:
where are independent and identically distributed (i.i.d.) random sequences with probability density function , and are i.i.d. sequences with joint probability density function where E Var Cov , for which .
In this section, in order to test the randomness of coefficients, we establish a test statistic to determine if matrix is a constant matrix, which is equivalent to testing whether is To this end, we consider the following hypothesis test:
Suppose that the time series data are generated from (3). Denote The unknown parameters are denotes the truth value of the unknown parameter. To construct the LMP test statistic, we make the following assumptions:
Suppose there exists a neighborhood of and a positive real-valued function N such that
The function has continuous partial derivatives up to the third order with respect to parameter , and the derivatives of both and the logarithmic function are bounded by a common upper bound
Since is a Markov chain, the transition probabilities can be expressed as follows:
Therefore, we can derive the likelihood function for the multivariate RCAR(1) model:
Now, the Taylor series expansion of around , which is the mean of , is given as follows:
Thus, the log-likelihood function is
where And we can conclude that if Therefore, the distribution function of random variable is degenerated. Additionally, we obtain
Moreover, since are i.i.d. random sequences and denotes the coefficients of the cross terms, which are independent of and , it follows that
The derivative of with respect to at gives the following equation:
The derivative of with respect to at gives the following equation:
Substituting the maximum likelihood estimate of parameter , denoted as , into (4) yields the LMP test statistic.
where
and
It can be easily derived that is a zero-mean martingale. Using Theorem 1.1 in Billingsley [21] and the Martingale Central Limit Theorem form Hall and Heyde [22], some lemmas are introduced.
Lemma 1.
We assume the conditions:
(A1)
(A2) , as , we have
where
Lemma 2.
Under conditions of and , as , we have
where
Lemma 3.
Under conditions of and , as , we have
where is defined in Lemma 2 . Then, for the asymptotic distribution of , we have the following theorem.
Theorem 1.
Under conditions of –; then, under , as , we have
where and is defined in Lemmas 1 and 2.
Based on Theorem 1, we obtain that the quadratic form of which is a convergent distribution, and the non-degeneracy of is given in the proof of the theorem.
3. Simulation
In this section, we calculate the probability of accepting the null hypothesis when it is true and the probability of rejecting the null hypothesis when it is false at the significance level of and . Compare the LMP test and EL test from the perspectives of empirical sizes and empirical powers. And add the case where the error term obeys a mixed binary normal distribution. Within each study, we set the initial values , and the simulation results are based on 500 repetitions using R software version R-4.4.1.
On the one hand, we consider the simulation results when the null hypothesis is true. We take the bivariate first-order autoregressive (BAR(1)) model
In order to calculate empirical sizes, we pay attention to two types defined as follows:
Model 1. where
Model 2. where Furthermore, follows a Bernoulli distribution with probability mass function indicated by
The following section describes the process used to obtain the simulation results. First of all, when and , data are generated using Model 1. Secondly, when , , and r takes values of 0.9 and 0.8, data are generated using Model 2. For Model 1 and Model 2, a sample size of 100, 300, 500, 700, 1000, 2000, and 5000 is taken. Table 1, Table 2, Table 3 and Table 4 show the empirical sizes for each of the two models. The tables illustrates that the LMP test yields similar outcomes across various error types, including normal error and contamination error. From the results, it can be observed that when the sample size is small, such as N = 100, the outcomes of both tests are suboptimal. In comparison to the EL test, the LMP test produces empirical sizes that are closer to the nominal significance level, particularly with larger sample sizes. And in the presence of contamination errors, the LMP test generally outperforms the EL test. Furthermore, the empirical sizes obtained from the EL test are higher than those from the LMP test at each sample size. Therefore, compared to the LMP test, the EL test is more likely to make incorrect decisions in hypothesis testing. Based on the results of the two models, it can be concluded that the LMP test is more effective than the EL test.
Table 1.
Empirical sizes of the LMP and EL tests at the significance level of 0.1 for Model 1.
Table 2.
Empirical sizes of the LMP and EL tests at the significance level of 0.1 for Model 2.
Table 3.
Empirical sizes of the LMP and EL tests at the significance level of 0.05 for Model 1.
Table 4.
Empirical sizes of the LMP and EL tests at the significance level of 0.05 for Model 2.
On the other hand, we consider the simulation results when the alternative hypothesis is true. We take the bivariate first-order random coefficient autoregressive (BRCAR(1)) model
Similarly, to calculate empirical powers, we focus on three types as defined below:
Model 3. where
Model 4. where
Model 5. where Furthermore, follows a Bernoulli distribution with probability mass function indicated by
Firstly, in order to calculate the empirical powers of the LMP test and EL test, we generate samples from Model 3 where parameters and Secondly, we generate samples from Model 4 where parameters and Third, we generate samples from Model 5 where parameters and For Model 3, Model 4, and Model 5, we take the sample size N=100, 300, 500, 700, 1000, 2000, 5000. Table 5, Table 6, Table 7, Table 8, Table 9 and Table 10 present a summary of the empirical powers for both models. From the results, it can be seen that both the EL test and LMP test perform poorly with small sample sizes. However, as the sample size increases, the empirical powers of both the LMP test and the EL test increase. Particularly, when the sample size reaches 5000, the empirical powers of both tests are close to 1. In summary, through these simulations, we conclude that under conditions of normal, Beta-distributed, and contaminated errors, the accuracy of using the LMP test is higher than the EL test by 10.2%, 10.1%, and 30.9%, respectively. As expected, our results show that the LMP test is an effective tool for detecting changes in the parameters of the Multivariate RCAR(1) model. In addition, the efficacy of the LMP test increases progressively as the parameter variances increase. Thus, it shows that the LMP test is feasible in practical applications. Overall, whether the null hypothesis is true or the alternative hypothesis is true, the LMP test will make a correct judgment with a high probability and is a robust test.
Table 5.
Empirical powers of the LMP and EL tests at the significance level of 0.1 for Model 3.
Table 6.
Empirical powers of the LMP and EL tests at the significance level of 0.1 for Model 4.
Table 7.
Empirical powers of the LMP and EL tests at the significance level of 0.1 for Model 5.
Table 8.
Empirical powers of the LMP and EL tests at the significance level of 0.05 for Model 3.
Table 9.
Empirical powers of the LMP and EL tests at the significance level of 0.05 for Model 4.
Table 10.
Empirical powers of the LMP and EL tests at the significance level of 0.05 for Model 5.
4. Applications
This section demonstrates the application of the LMP method in two practical examples.
4.1. Economic Data
The data represent the chain rate of the growth of building construction and the chain rate of the growth of industrial (excluding construction) production in Germany from May 1988 to April 1993, respectively. These data are available on the website: https://ceidata.cei.cn/new, accessed on 3 April 2024. The data of the chain growth rate of building construction are denoted as , and the data of the chain rate of the growth of industrial (excluding construction) production are denoted as . We observe that Figure 1 and Figure 2 present a sample path diagram and sample centering series for the two data sets.
Figure 1.
(a): The sample path of series . (b): The sample plot of centering series for the real data.
Figure 2.
(a): The sample path of series . (b): The sample plot of centering series for the real data.
After transforming these two data sets, the resulting data with and can be modeled using a BAR model. In addition, the plots of the autocorrelation function (ACF) and partial autocorrelation function (PACF) of sequences and are given in Figure 3 and Figure 4, respectively. From Figure 4, it can be seen that these two sets of data come from a binary first-order autoregressive process. Therefore, it is reasonable to model the data with BAR(1), and it also shows that the proposed test statistic is valid for both sets of actual data. From Figure 1 and Figure 2 , it is unknown whether the parameters are constant or not; therefore, we consider the following test problem with the help of the proposed test statistic
Figure 3.
(a): The sample ACF plot of . (b): The sample PACF plot of .
Figure 4.
(a): The sample ACF plot of . (b): The sample PACF plot of .
The test statistic value in (5) is 33.21573 with a p-value of 0.00104, which is less than 0.05. This result indicates that we are rejecting the null hypothesis at the significance level. Therefore, the BRCAR(1) model is more appropriate than the BAR(1) model in this case.
4.2. Import and Export Data
The data represent the month-on-month growth rates of Japan’s export volume and import volume, respectively, from November 2012 to February 2020 (available at https://ceidata.cei.cn/new, accessed on 3 April 2024). The data of the month-on-month chain growth rate of exports are denoted as , and the month-on-month chain growth rate of imports are denoted as . We observe that Figure 5 and Figure 6 shows a sample path diagram and sample centering series for the two data sets.
Figure 5.
(a): The sample path of series . (b): The sample plot of centering series for the real data.
Figure 6.
(a): The sample path of series . (b): The sample plot of centering series for the real data.
Similarly, the data with and can be modeled using a BAR model. Furthermore, the plots of the and the of sequences and are given in Figure 7 and Figure 8, respectively. From Figure 8, we can see that these two data sets come from a binary first-order autoregressive process. From the time series plot given in Figure 5 and Figure 6, the randomness of the parameter may be questioned; consequently, it is important to test the hypothesis of the randomness of the parameter. Similarly, we consider the following test problem with the help of the proposed test statistic
Figure 7.
(a): The sample ACF plot of . (b): The sample PACF plot of .
Figure 8.
(a): The sample ACF plots of . (b): The sample PACF plot of .
The test statistic value in (5) is 6.905108 with a p-value of 0.03166, which is less than 0.05. Consequently, we are rejecting the null hypothesis at the significance level. Thus, modeling these data with the BRCAR(1) model is preferred over the BAR(1) model.
5. Conclusions
This paper proposes a locally most powerful-type test for testing the constancy of the random coefficient parameters in multivariate autoregressive models. We derive the test statistic and its limiting distribution under the null hypothesis. In practical applications, the coefficients of autoregressive models may not remain constant throughout the entire time period. Therefore, it is necessary to conduct such tests whenever there is a possibility of random variation in the coefficients. Through simulation studies and an analysis of real data, we demonstrate the effectiveness of our testing method and show that its performance surpasses that of competing methods. Finally, we anticipate that our LMP test can be extended and applied to other similar types of time series models.
Author Contributions
Writing—original draft, L.B., D.W., L.C. and D.Q. All authors have read and agreed to the published version of the manuscript.
Funding
This paper is supported by the National Nature Science Foundation of China under Grant (No. 12171054).
Data Availability Statement
The data that support the findings of this study are available from the corresponding author upon reasonable request.
Acknowledgments
The authors would like to acknowledge the contributions of all the reviewers and thank them for their insightful comments on the early drafts of this paper.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
Proof of Lemma 1.
Let be the field generated by the set ; we have
By calculation, we can obtain
So, is a martingale sequence. Since , is a strictly stationary ergodic process. By the ergodic properties, and the application of Zhang et al. [23] in Theorem 1.1 in Billingsley [21], we can obtain
Hence, we employ the Martingale Central Limit Theorem form Hall and Heyde [22]. As , we obtain
Similarly, we can verify that is a martingale sequence and the following equation holds:
For any vector we have
By the - device, we obtain
This completes the proof. □
Proof of Lemma 2.
By Theorem 1.1 in Billingsley [21], we know that
This completes the proof. □
Proof of Lemma 3.
Let be the field generated by the set , and Then, we have
Thus, is a martingale sequence. Since , and is a strictly stationary ergodic process, as , we have
Therefore, we employ the Martingale Central Limit Theorem from Hall and Heyde [22]; as , we obtain
Similarly, let
It is easy to verify that is a martingale. Because by Theorem 1.1 in Billingsley [21], as , we have that
By the Martingale Central Limit theorem (Hall and Heyde [22]), as , we obtain
Similarly,
where Thus, is a martingale. by Theorem 1.1 in Billingsley [21] and the Martingale Central Limit theorem (Hall and Heyde [22]), as , we obtain
In the same way, for any vector let then, is a martingale squence. By the ergodic theorem, as , we can prove that
where
Note that
By the device, Lemma 3 is established. □
Proof of Theorem 1.
Under the null hypothesis, the log-likelihood function is
where
The conditional maximum likelihood estimate of parameter can be obtained by maximizing the log-likelihood function . Applying the Taylor series expansion, we have
where is between and because Therefore, . The expression can be simplified to obtain
Performing a Taylor expansion of to the third term at a truth value of , we can obtain
where Because of we can prove that so the remainder term is Therefore, the second term in (A2) can be written as
Note that under the null hypothesis, it holds that
Thus, Equation (A3) is equivalent to
So, Equation (A2) can be written as
For any vector where and are column vectors of dimensions n and , respectively. By Lemmas 1 and 3 we know that and are martingale. By the ergodic theorem, as , we can deduce that
By the device, we obtain
where is a column vector of dimension . Then, we can make the conclusion that
where and is degenerated.
Furthermore, we can derive that
where is a chi-squared distribution with n degrees of freedom.
References
- George, U. Vii. on a method of investigating periodicities disturbed series, with special reference to wolfer’s sunspot numbers. Philos. Trans. R. Soc. Lond. 1927, 226, 267–298. [Google Scholar] [CrossRef]
- George, E.B.; Gwilym, M.J.; Gregory, C.R.; Greta, M.L. Time Series Analysis: Forecasting and Control; John Wiley & Sons: Washington, DC, USA, 2015. [Google Scholar]
- Mohsen, M.; Nematollahi, A.R. Autoregressive models with mixture of scale mixtures of gaussian innovations. Iran. J. Sci. Technol. Trans. Sci. 2017, 41, 1099–1107. [Google Scholar] [CrossRef]
- Dmitry, P. Short-term traffic forecasting using multivariate autoregressive models. Procedia Eng. 2017, 178, 57–66. [Google Scholar] [CrossRef]
- Matthieu, G.; Adrià Tauste, C.; Xing, C.; Alexander, T.; Gustavo, D. Nonparametric test for connectivity detection in multivariate autoregressive networks and application to multiunit activity data. Netw. Neurosci. 2017, 1, 357–380. [Google Scholar] [CrossRef]
- Jiří, A. Autoregressive series with random parameters. Math. Oper. Und Stat. 1976, 7, 735–741. [Google Scholar] [CrossRef]
- Paul, D.F.; Richard, L.T. Random coefficient autoregressive processes: A markov chain analysis of stationarity and finiteness of moments. J. Time Ser. Anal. 1985, 6, 1–14. [Google Scholar] [CrossRef]
- Alexander, A.; Lajos, H.; Josef, S. Estimation in random coefficient autoregressive models. J. Time Ser. Anal. 2006, 27, 61–76. [Google Scholar] [CrossRef]
- Alexander, A.; Lajos, H. Quasi-likelihood estimation in stationary and nonstationary autoregressive models with random coefficients. Stat. Sin. 2011, 21, 973–999. [Google Scholar] [CrossRef]
- Holger, D.; Dominik, W. Detecting relevant changes in time series models. J. R. Stat. Soc. Ser. Stat. Methodol. 2016, 78, 371–394. [Google Scholar] [CrossRef]
- Lajos, H.; Lorenzo, T. Testing for randomness in a random coefficient autoregression model. J. Econom. 2019, 209, 338–352. [Google Scholar] [CrossRef]
- Li, B.; Feilong, L.; Kai, Y.; Dehui, W. Locally most powerful test for the random coefficient autoregressive model. Math. Probl. Eng. 2019, 1, 1099–1107. [Google Scholar] [CrossRef]
- Dazhe, W.; Sujit, K.G.; Sastry, G.P. Maximum likelihood estimation and unit root test for first order random coefficient autoregressive models. J. Stat. Theory Pract. 2010, 4, 261–278. [Google Scholar] [CrossRef]
- Jin, C.; Dehui, W.; Cong, L.; Jingwen, H. Estimation and testing of multivariate random coefficient autoregressive model based on empirical likelihood. Commun.-Stat.-Simul. Comput. 2023, 52, 291–308. [Google Scholar] [CrossRef]
- Zuzana, P.; Pavel, V. On a class of estimators in a multivariate rca (1) model. Kybernetika 2011, 47, 501–551. [Google Scholar] [CrossRef]
- Pavel, V. Rate of convergence for a class of rca estimators. Kybernetika 2006, 42, 699–709. [Google Scholar] [CrossRef]
- Des, F.N.; Barry, G.Q. Random COEFFICIENT Autoregressive Models: An Introduction; Springer Science & Business Media: New York, NY, USA, 1982. [Google Scholar]
- Awale, M.; Balakrishna, N.; Ramanathan, T.V. Testing the constancy of the thinning parameter in a random coefficient integer autoregressive model. Stat. Pap. 2019, 60, 1515–1539. [Google Scholar] [CrossRef]
- Chikkagoudar, M.S.; Biradar, B.S. Locally most powerful rank tests for comparison of two failure rates based on multiple type-ii censored data. Commun.-Stat.-Theory Methods 2012, 41, 4315–4331. [Google Scholar] [CrossRef]
- Rohatgi, V.K.; Md Ehsanes Sleh, A.K.; Ahluwalia, R.; Ji, P. Locally most powerful tests for the two sample problem when the combined sample is type ii censored. Commun.-Stat.-Theory Methods 1990, 19, 2337–2355. [Google Scholar] [CrossRef]
- Patrick, B. The lindeberg-levy theorem for martingales. Proc. Am. Math. Soc. 1961, 12, 788–792. [Google Scholar] [CrossRef]
- Peter, H.; Christopher, C.H. Martingale Limit Theory and Its Application; Academic Press: Washington, DC, USA, 2014. [Google Scholar]
- Zhang, H.; Wang, D.; Zhu, F. The empirical likelihood for first-order random coeffi cient integer-valued autoregressive processes. Commun. Stat. Theory Methods 2011, 40, 492–509. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).