Abstract
In this paper we propose a test for a set of linear restrictions in a Vector Autoregressive Moving Average (VARMA) model. This test is based on the autoregressive metric, a notion of distance between two univariate ARMA models, and , introduced by Piccolo in 1990. In particular, we show that this set of linear restrictions is equivalent to a null distance between two given ARMA models. This result provides the logical basis for using as a null hypothesis in our test. Some Monte Carlo evidence about the finite sample behavior of our testing procedure is provided and two empirical examples are presented.
JEL classifications:
C1; C3; C4
1. Introduction
In this paper, we investigate the relationship between a set of linear restrictions on the parameters of a Vector Autoregressive Moving Average (VARMA) model (see []) and the autoregressive metric (AR-metric hereafter), a notion of the distance between two univariate ARMA models introduced by Piccolo []. In particular, we show that these linear restrictions are satisfied if and only if the distance d between the two given ARMA models (say and ) is zero. This result provides the logical basis for using as null hypothesis for testing this set of restrictions. Moreover, we show that the set of linear restrictions considered is sufficient for the condition of Granger noncausality ([]), while in the VAR framework it becomes also a necessary condition (see []). This theoretical result allows the implementation of an inferential procedure and a bootstrap algorithm. Our procedure is verified by some Monte Carlo experiments also in a quite small sample. The paper is organized as follows. Section 2 introduces the notion of the distance between ARMA models and specifies the relationship between the AR metric and the set of linear restrictions considered for a VARMA model. Section 3 presents the inferential implication. Section 4 provides some Monte Carlo evidence about the finite sample behavior of our testing procedure. Section 5 contains two empirical illustrations. Section 6 gives some concluding remarks.
2. Linear Restrictions in a VARMA Model and AR-Metric
Let be a zero mean invertible ARMA model defined as
where and are polynomials in the lag operator L, with no common factors, and is a white noise process with constant variance . It is well-known that this process admits the following representation:
where the AR(∞) operator is defined by
with .
Let be the class of ARMA invertible models. If and , following Piccolo [], the AR-metric is defined as the Euclidean distance between the corresponding π-weights sequence, ,
The AR-metric d has been widely used in time series analysis (see, e.g., [,,,,,]). We observe that Equation (1) is a well-defined measure because of the absolute convergence of the π-weights sequences.
Now, we consider the following VARMA model of order , for an vector time series :
where and are two matrices of polynomials in the lag operator L, and is an vector white noise process with positive definite covariance matrix Σ. We assume that det for . This condition allows non-stationarity for the series, in the sense that the characteristic polynomial of the VARMA model described by the equation det may have roots on the unit circle. Condition det for , however, excludes explicitly explosive processes from our consideration. We further assume that the model Equation (2) satisfies the usual identifiability conditions. If , we obtain a pure vector autoregressive (VAR) model of order p. If , we obtain a pure vector moving average (VMA) model of order q. Consider the partition where is a scalar time series and is an vector of time series. Accordingly, the model Equation (2) for the partition of can be rewritten as:
where and are matrix polynomials in the lag operator L, with . In this framework it is well-known (see, for example, []) that does not Granger-cause if and only if
and that a sufficient condition for Equation (4) to hold is
We note that if the condition Equation (5) holds then follows a univariate ARMA model given by:
The main aim of this paper is to establish the implications of the set of linear restrictions Equation (5), using the notion of the distance between ARMA models measured by Equation (1). In particular, we will consider the distance between the ARMA() model Equation (6) (denoted ) and the ARMA model for the subprocess implied by the VARMA() model Equation (2) (denoted ).
Following Lütkepohl [], the implied ARMA model can be obtained as follows. Premultiplying both sides of Equation (2) by the adjoint of , denoted as , we obtain
We note that each component of is a sum of finite order MA processes, thus it is a finite order MA process (see Proposition 11.1 in []). Hence, the subprocess follows an ARMA model given by:
where is univariate white noise and is an invertible polynomial in the lag operator L. More precisely, and are such that
where denotes the first row of the matrix . Finally, we observe that has also the following autoregressive representation of infinite order:
where
2.1. Theoretical Results
We consider the distance according to Equation (1) between the model Equations (6) and (8) and :
where
The following proposition provides a necessary and sufficient condition for the set of linear restrictions Equation (5) in terms of the distance .
Proposition 1. if and only if .
Proof of Proposition 1. (⇒) We have
and the first row the matrix is such that where
and
with
If , then
and
Thus we have that (where this equality between random variables means equality with probability 1) and . It follows that
and hence .
(⇐) We have to show that if , then . We may have two cases: or .
First case: .
If , then
On the other hand, we have
and hence
Using the Schur’s formula, we get
Thus assume the following expression
where .
Since the degree of polynomial is finite
Equation (9) implies that
Since
it follows for Equation (10) that it must be
Since by hypothesis , it follows that and this in turn implies that
and
On the other hand is such that
and hence
where this equality is with probability 1. Since is a white noise, Equation (11) implies that .
Second case: .
By hypothesis , this implies that
and the first row of the matrix is given by where
If , then
and hence
The following equality then occurs with probability 1:
Since is a white noise, this implies that and .
☐
We have also the following corollaries.
Corollary 1. Let be a pure VAR(p) process. y does not Granger-cause x if and only if .
Proof of Corollary 1. If y does not Granger-cause x, then . By hypothesis, . Hence we have . It follows from Proposition 1 that .
If , by Proposition 1, it follows that and this, in a VAR framework, implies that y does not Granger-cause x. ☐
Corollary 2. Let be a pure VMA(q) process. y does not Granger-cause x if and only if .
Proof of Corollary 2. It is similar to the proof of Corollary 1. ☐
3. Inferential Implications
Proposition 1 allows us to test the set of linear restrictions Equation (5) considering the null hypothesis . Further, we observe that if the process follows a VAR model, Corollary 1 establishes that the Granger noncausality from to is equivalent to the condition . Thus, in a VAR framework, we can test for Granger noncausality from to using the null hypothesis without considering the nature of the involved variables. In fact, it is well-known that the use of non-stationary data in causality tests can yield spurious causality results (see, e.g., []). Thus, before testing for Granger causality, it is important to establish the properties of the time series involved because different model strategies must be adopted when: the series are I(0), the series are partly I(0) and partly I(1), the series are determined I(1) but not cointegrated, or the series are cointegrated. Of course, the weakness of this strategy is that incorrect conclusions drawn from preliminary analysis might be carried over into the causality tests. In the VAR framework an alternative method is the so-called lag-augmented Wald test (see [,]), which is a modified Wald test that requires the knowledge of the maximum order of integration of the involved variables. In this way, the proposed test based on the AR-metric can be a valid alternative for a Granger noncausality test (see []), since it does not require the exact knowledge of the series properties or the knowledge of the maximum order of integration.
To conduct inference on the basis of Proposition 1, we need an asymptotic distribution for . In the class of ARMA processes, the asymptotic distribution of the maximum likelihood estimator has been studied, among others, in [,]. In this case, for two independent ARMA() processes X and Y, under the null hypothesis , the maximum likelihood estimator has the following asymptotic distribution:
where are independent -distributions with degrees of freedom, are the eigenvalues of the covariance matrix of and . The evaluation of this distribution can be cumbersome; hence approximations, as well as evaluation algorithms, have been proposed (see []). Anyhow, in our framework, the ARMA models implied by Equation (6) and by the VARMA model Equation (8) under the null hypothesis are equal, so they cannot be considered independent. Then, to conduct the inferential procedures, we suggest the bootstrap algorithm proposed by Di Iorio and Triacca [], which is described in the next section.
3.1. The Bootstrap Test Procedure
For an easy illustration of our bootstrap procedure, let us consider a bivariate VARMA() model simply denoted as where , with covariance matrix Σ and, based on Proposition 1, we want to test the null hypothesis using
- Estimate on the observed data the VARMA() and obtain , , and the residuals ;
- using the estimated parameters from Step 1, obtain the univariate ARMA implied by the estimated VARMA for the subprocess ;
- evaluate the AR(∞) representation truncated at some suitable lag of the ARMA model in Step 2 (model );
- estimate for , using the observed data, an ARMA() model under the null hypothesis and evaluate its AR(∞) representation truncated at some suitable lag (model );
- evaluate the distance between the AR() and the AR() obtained in Steps 3 and 4;
- estimate the VARMA() model under the null hypothesis to obtain the estimates , and ;
- apply bootstrap to the re-centered residuals and obtain the pseudo-residuals ;
- generate the pseudo-data obeying the null hypothesis using with ;
- using the pseudo-data , repeat Steps 1–5 to obtain the bootstrap estimate of the distance ;
- repeat Steps 7–9 for b times;
- evaluate the bootstrap p-value as the proportion of the b estimated bootstrap distance that exceeds the same statistic evaluated on the observed data , that is, .
When this procedure is applied, two remarks concerning the pseudo-data generation and the modeling of the dependency across the subprocess are in order. Firstly, in a well-specified model framework (as well as during a simulation exercise), the estimated residuals do not show any autocorrelation structure, so we do not need any particular resampling scheme for dependent data to obtain pseudo-error terms , and we can then apply a simple resampling procedure. Besides, for empirical studies the pseudo-data can be obtained considering several resampling strategies, as a block bootstrap algorithm (see []). Secondly, in order to reproduce the dependency across the subprocess expressed by Σ in the pseudo-data, we simply have to apply the resampling algorithm to the entire matrix of the estimated residuals .
4. Monte Carlo Experiments
The performance of the proposed inferential strategy can be investigated by means of a set of Monte Carlo experiments. In particular, we consider the test for the set of linear restriction associated to a Granger noncausality test for two different DGP: a stable bivariate VARMA() model and a cointegrated bivariate VAR(2) model. Our test will be compared with the performance of a Wald test for the VARMA() and with the lag-augmented Wald test suggested by Dolado et al. and Toda et al. [,] for the cointegrated VAR model.
4.1. Bivariate VARMA() Model
Consider the following stable VARMA() model:
with covariance matrix
In our study, the tests of the null hypothesis
were carried out using nominal significance levels of 1%, 5% and 10%. To analyze the power of the test, we consider the two cases below to verify how the test reacts when the parameter values move away from zero:
- Power 1. , ,
- Power 2. , .
It is well-known that a maximum likelihood estimation of a VARMA model can be a challenging task (see, e.g., [,]). For this reason we consider sample size and , which are quite large compared with what is usually found in empirical applications. Taking into account the dimension of our exercise, we perform the maximum likelihood estimation using the Kalman filter procedure implemented in (ver. 1.9.14) (see []). Therefore, due to computational time involved by the maximum likelihood estimation of the VARMA model, the experiments are based on 400 Monte Carlo replications and 400 bootstrap redrawings. We compare our results with the usual Wald test using, for a proper comparison, also the bootstrap p-values obtained by the same bootstrap algorithm described above. Finally, we verify by some preliminary experiments that a suitable value for and in Steps 3 and 4 in the bootstrap algorithm is 15. The results are reported in Table 1.
Table 1.
VARMA() AR-metric and Wald test. Size and Power.
| AR–Metric | Wald | Wald | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Boot p-Values | Boot p-Values | Asy p-Values | |||||||
| nom | Size | Power 1 | Power 2 | Size | Power 1 | Power 2 | Size | Power 1 | Power 2 |
| 0.01 | 0.15 | 0.45 | 0.01 | 0.85 | 0.89 | 0.01 | 0.97 | 0.99 | |
| 0.09 | 0.48 | 0.90 | 0.06 | 0.90 | 0.90 | 0.06 | 0.99 | 100 | |
| 0.14 | 0.62 | 0.97 | 0.12 | 0.90 | 0.90 | 0.11 | 0.99 | 100 | |
| 0.03 | 0.49 | 0.98 | 0.01 | 0.97 | 0.98 | 0.01 | 100 | 100 | |
| 0.08 | 0.70 | 1.00 | 0.05 | 0.99 | 0.98 | 0.04 | 100 | 100 | |
| 0.13 | 0.79 | 1.00 | 0.07 | 0.99 | 1.00 | 0.09 | 100 | 100 | |
As we can see from Table 1, the size for the AR-metric test is quite satisfactory, and the power increases with growing sample size and as the true parameter values move away from zero. In any case, as expected, the difficulties of the maximum likelihood estimation for the VARMA model affect the distance more than the Wald test, which shows a better power. In fact, as the bootstrap algorithm underlines, the distance-based test is built on the autocovariances obtained by the estimated values of the parameters. Hence, its performances are heavily dependent on the quality of these estimates.
4.2. Bivariate Cointegrated VAR(2) Model
Most encouraging results are obtained with the second DGP. Consider the following cointegrated bivariate VAR(2) model:
with covariance matrix .
As before, the tests of the null hypothesis
were carried out using nominal significance levels of 1%, 5% and 10%. To analyze the power of the test, we consider again the two cases below:
- Power 1.
- Power 2.
In this case the parameter estimation is easier. To make our Monte Carlo experiment more relevant for actual empirical applications, we consider sample size , a medium size in terms of annual data but small size for a quarterly frequency, and , which is a time span large in terms of annual data but pretty common for quarterly data. Now we compare the performances for our test with the lag-augmented Wald test proposed by Dolado et al. and Toda et al. [,] in this framework. The lag-augmented Wald test has an asymptotic -distribution with p degrees of freedom when a VAR() is estimated, where is the maximal order of integration for the series in the system. However, it is well-known that the lag-augmented Wald test based on asymptotic critical values may suffer from size distortion and low power especially for small samples [,]. Thus, to overcome this problem, we apply the same bootstrap algorithm described above using the Wald test from an augmented VAR(), with augmentation order , and we evaluate the bootstrap p-values.
For this DGP the experiment is based on 1000 Monte Carlo replications and 1000 Bootstrap redrawings, and, as before, in Step 3 we set . The results are collected in Table 2. We note that, for a nominal significance level of 5%, our results are rather similar to those of the second part of Table 3 reported in Shukur and Mantalos []. The comparison of the power estimates for our test and the lag-augmented Wald test of Toda et al. [] shows that our test has relatively high power properties in all situations, while the size is very close to the nominal values for both tests.
Table 2.
VAR(2) AR-metric and lag-augmented Wald test. Size and Power Bootstrap p-values. Asy p-values.
| AR-Metric | Aug-Wald | Aug-Wald | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Boot p-Values | Boot p-Values | Asy p-Values | |||||||
| nom | Size | Power 1 | Power 2 | Size | Power 1 | Power 2 | Size | Power 1 | Power 2 |
| 0.02 | 0.22 | 0.64 | 0.01 | 0.05 | 0.35 | 0.01 | 0.06 | 0.36 | |
| 0.07 | 0.42 | 0.82 | 0.04 | 0.18 | 0.62 | 0.04 | 0.17 | 0.61 | |
| 0.12 | 0.56 | 0.89 | 0.08 | 0.27 | 0.73 | 0.08 | 0.26 | 0.72 | |
| 0.01 | 0.54 | 0.98 | 0.01 | 0.18 | 0.78 | 0.01 | 0.18 | 0.80 | |
| 0.04 | 0.78 | 1.00 | 0.04 | 0.38 | 0.92 | 0.04 | 0.38 | 0.91 | |
| 0.11 | 0.85 | 1.00 | 0.09 | 0.50 | 0.95 | 0.08 | 0.50 | 0.95 | |
5. Empirical Applications
In this section we present two empirical examples to illustrate the application of the test suggested in the paper. First, we consider a VAR model and in particular we examine the causal relationship between the log of real per capita income and the inflation. Then, we consider a VARMA example based on the SCC dataset discussed in [].
To take into account any possible dependence structure in the residuals of the estimated models, we use the Stationary Bootstrap ([]) as resampling algorithm. The Stationary Bootstrap is a block bootstrap scheme where the resampled pseudo-series are stationary; this scheme chains blocks of observations of the original series starting at random locations, and the length of each block is randomly chosen from a geometric distribution. Following Palm et al. [], the mean block length can be computed as a function of the length of the time sample; by some exploratory simulations we verify the robustness of the tests to different block sizes, so we report results for blocks .
To discuss the possible causal relationship between the log of real per capita income (y) and inflation () we re-examined the dataset used by Ericsson et al. []. The dataset refers to United States over the period 1953–1992 and can be downloaded from the Journal of Applied Econometrics Data Archive. The VAR order selection is based on Bayesian Information Criterion and the following model is estimated.
The computed -statistic is equal to 0.35 with a bootstrap p-value 0. This result indicates the presence of Granger causality from output to inflation. This finding is in accordance with the results of Ericsson et al. []. The same result is obtained using the lag-augmented Wald test.
The SCC dataset discussed by Tiao and Box [] considers the quarterly time series of the U.K. Financial Time Ordinary Share Index, the U.K. Car Production and the U.K. Financial Time Commodity Price from the III Quarter 1952 to the IV Quarter 1967. The goal is verify the possibility of predicting the first variable from the lagged values of the last two. According to Tiao and Box [], a VARMA() is the best model for this data, then a null hypothesis following Equation (5) will be the inferential base to test just a sufficient condition on the predictability hypothesis. The VARMA() maximum likelihood parameter estimates using the Kalman filter procedure implemented in Gretl (ver. 1.9.9) are the following (standard errors in bracket):
The estimates are quite similar to the values reported as “full model” in the Table 10 in [], taking into account the difference in the estimation algorithm and software. The computed -statistic is equal to 4.58 with a bootstrap p-value 0.225, evaluated on 500 bootstrap replications, and this finding is in accordance with the results of “final model” in the Table 10 in Tiao and Box []. We perform also a Wald test on the same null hypothesis, the value is 36.684, which asymptotically rejects the null, but with a bootstrap p-value 0.146 that sustains the results of our test.
6. Conclusions
In this paper we characterized a set of linear restrictions in a Vector Autoregressive Moving Average (VARMA) model in terms of the notion of distance between ARMA models and we have derived a new inferential procedure. In particular, this new procedure can be useful for a new Granger noncausality test in a VAR framework. The advantage of this test is that it can be carried out irrespective of whether the variables involved are stationary or not and regardless of the existence of a cointegrating relationship among them. Our inferential procedure has been validated by a set of Monte Carlo experiments. In a VARMA framework this procedure shows encouraging results even if a deeper investigation, made complex by the computational time, is needed. In a cointegrated VAR framework our method for detecting causality has provided better results, as the conducted simulation study has shown that our test exhibits good performance in terms of size and power properties, even in small samples. Finally, we have shown that this test can be usefully applied in practical situations to test causality between economic time series.
Acknowledgments
Comments and suggestions from Giorgio Calzolari and participants to the Conference ICEEE2013 (Genova, Italy) are gratefully acknowledged; a special thanks to Riccardo "Jack" Lucchetti for his helpful comments and support. The authors are also grateful to the two referees for helpful comments and suggestions. The usual disclaimers apply.
Author Contributions
The authors contributed jointly to the paper.
Conflicts of Interest
The authors declare no conflict of interest.
References
- H. Lütkepohl. New Introduction to Multiple Time Series Analysis. New York, NY, USA: Springer, 2005. [Google Scholar]
- D. Piccolo. “A distance measure for classifying ARIMA models.” J. Time Ser. Anal. 11 (1990): 153–164. [Google Scholar] [CrossRef]
- C.W.J. Granger. “Investigating causal relations by econometric methods and cross-spectral methods.” Econometrica 37 (1969): 424–438. [Google Scholar] [CrossRef]
- F. Di Iorio, and U. Triacca. “Testing for Granger non-causality using the autoregressive metric.” Econ. Model. 33 (2013): 120–125. [Google Scholar] [CrossRef]
- M. Corduas, and D. Piccolo. “Time series clustering and classification by the autoregressive metric.” Comput. Stat. Data Anal. 52 (2008): 1860–1862. [Google Scholar] [CrossRef]
- J. Gonzalo, and T.H. Lee. “Relative power of t type tests for stationary and unit root processes.” J. Time Ser. Anal. 17 (1996): 37–47. [Google Scholar] [CrossRef]
- S. Grimaldi. “Linear parametric models applied on daily hydrological series.” J. Hydrol. Eng. 9 (2004): 383–391. [Google Scholar] [CrossRef]
- E.A. Maharaj. “A significance test for classifying ARMA models.” J. Stat. Comput. Simul. 54 (1996): 305–331. [Google Scholar] [CrossRef]
- E. Otranto. “Clustering heteroskedastic time series by model-based procedures.” Comput. Stat. Data Anal. 52 (2008): 4685–4698. [Google Scholar] [CrossRef]
- E. Otranto. “Identifying financial time series with similar dynamic conditional correlation.” Comput. Stat. Data Anal. 54 (2010): 1–15. [Google Scholar] [CrossRef]
- H. Boudjellaba, J.-M. Dufour, and R. Roy. “Testing causality between two vectors in multivariate ARMA models.” J. Am. Stat. Assoc. 87 (1992): 1082–1090. [Google Scholar] [CrossRef]
- C.A. Sims, J.H. Stock, and M.W. Watson. “Inference in linear time series models with some unit roots.” Econometrica 58 (1990): 113–144. [Google Scholar] [CrossRef]
- J. Dolado, and H. Lütkepohl. “Making Wald Tests work for cointegrated VAR systems.” Econ. Rev. 15 (1996): 369–386. [Google Scholar] [CrossRef]
- H.Y. Toda, and T. Yamamoto. “Statistical inferences in vector autoregressions with possibly integrated processes.” J. Econ. 66 (1995): 225–50. [Google Scholar] [CrossRef]
- M. Corduas. “La metrica Autoregressiva tra modelli ARIMA: Una procedura operativa in linguaggio GAUSS.” Quad. Stat. 2 (2000): 1–37. [Google Scholar]
- J. MacKinnon. “Bootstrap inference in econometrics.” Can. J. Econ. 35 (2002): 615–645. [Google Scholar] [CrossRef]
- K. Metaxoglou, and A. Smith. “Estimating VARMA models using EM algorithm.” J. Time Ser. Anal. 28 (2007): 666–685. [Google Scholar] [CrossRef]
- “GRETL Gnu Regression, Econometrics and Time-series Library.” Available online: http://gretl.sourceforge.net/ (accessed on 16 June 2014).
- D.E.A. Giles. “Causality between the measured and underground economies in New Zealand.” Appl. Econ. Lett. 4 (1997): 63–67. [Google Scholar] [CrossRef]
- G. Mavrotas, and R. Kelly. “Old wine in new bottles: Testing causality between savings and growth.” Manch. Sch. 69 (2001): 97–105. [Google Scholar] [CrossRef]
- G. Shukur, and P. Mantalos. “A simple investigation of the Granger causality test in integrated-cointegrated VAR systems.” J. Appl. Stat. 27 (2000): 1021–1031. [Google Scholar] [CrossRef]
- G.C. Tiao, and G.E.P. Box. “Modeling multiple times series with applications.” J. Am. Stat. Assoc. 76 (1981): 802–816. [Google Scholar]
- D.N. Politis, and J.P. Romano. “The stationary bootstrap.” J. Am. Stat. Assoc. 89 (1994): 1303–1313. [Google Scholar] [CrossRef]
- F.C. Palm, S. Smeekes, and J.P. Urbain. “Cross-sectional dependence robust block bootstrap panel unit root tests.” J. Econ. 163 (2011): 85–104. [Google Scholar] [CrossRef]
- N.R. Ericsson, J.S. Irons, and R.W. Tryon. “Output and inflation in the long run.” J. Appl. Econ. 16 (2001): 241–253. [Google Scholar] [CrossRef]
© 2014 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).