1. Introduction
In this paper, we investigate the relationship between a set of linear restrictions on the parameters of a Vector Autoregressive Moving Average (VARMA) model (see [1]) and the autoregressive metric (AR-metric hereafter), a notion of the distance between two univariate ARMA models introduced by Piccolo [2]. In particular, we show that these linear restrictions are satisfied if and only if the distance d between the two given ARMA models (say $M_x$ and $\widetilde{M}_x$) is zero. This result provides the logical basis for using $d(M_x, \widetilde{M}_x) = 0$ as null hypothesis for testing this set of restrictions. Moreover, we show that the set of linear restrictions considered is sufficient for the condition of Granger noncausality ([3]), while in the VAR framework it becomes also a necessary condition (see [4]). This theoretical result allows the implementation of an inferential procedure and a bootstrap algorithm. Our procedure is verified by some Monte Carlo experiments, also in quite small samples. The paper is organized as follows.
Section 2 introduces the notion of the distance between ARMA models and specifies the relationship between the AR-metric and the set of linear restrictions considered for a VARMA model.
Section 3 presents the inferential implication.
Section 4 provides some Monte Carlo evidence about the finite sample behavior of our testing procedure.
Section 5 contains two empirical illustrations.
Section 6 gives some concluding remarks.
2. Linear Restrictions in a VARMA Model and AR-Metric
Let $x_t$ be a zero mean invertible ARMA model defined as
$$\phi(L)x_t = \theta(L)\varepsilon_t,$$
where $\phi(L)$ and $\theta(L)$ are polynomials in the lag operator L, with no common factors, and $\varepsilon_t$ is a white noise process with constant variance $\sigma^2$. It is well-known that this process admits the following representation:
$$\pi(L)x_t = \varepsilon_t,$$
where the AR(∞) operator is defined by
$$\pi(L) = \theta(L)^{-1}\phi(L) = 1 - \sum_{j=1}^{\infty}\pi_j L^j,$$
with $\sum_{j=1}^{\infty}|\pi_j| < \infty$.
Let $\mathcal{M}$ be the class of ARMA invertible models. If $X \in \mathcal{M}$ and $Y \in \mathcal{M}$, following Piccolo [2], the AR-metric is defined as the Euclidean distance between the corresponding π-weights sequences, $\{\pi_{jX}\}$ and $\{\pi_{jY}\}$:
$$d(X, Y) = \Big[\sum_{j=1}^{\infty}(\pi_{jX} - \pi_{jY})^2\Big]^{1/2}. \qquad (1)$$
The AR-metric d has been widely used in time series analysis (see, e.g., [5,6,7,8,9,10]). We observe that Equation (1) is a well-defined measure because of the absolute convergence of the π-weights sequences.
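As a numerical illustration, the π-weights can be obtained by elementary power-series division and Equation (1) evaluated on a truncated sequence. The following Python sketch is ours, not code from the paper; it assumes the conventions $\phi(L) = 1 - \phi_1 L - \cdots - \phi_p L^p$ and $\theta(L) = 1 + \theta_1 L + \cdots + \theta_q L^q$.

```python
def pi_weights(ar, ma, n_terms=50):
    """AR(inf) weights of an invertible ARMA model.

    ar: [phi_1, ..., phi_p] in phi(L) = 1 - phi_1 L - ... - phi_p L^p
    ma: [theta_1, ..., theta_q] in theta(L) = 1 + theta_1 L + ... + theta_q L^q
    Returns [pi_1, ..., pi_n] with pi(L) = 1 - sum_j pi_j L^j = phi(L)/theta(L).
    """
    phi = [1.0] + [-a for a in ar]      # phi(L) as a plain polynomial
    theta = [1.0] + list(ma)            # theta(L) as a plain polynomial
    c = []                              # series coefficients of phi(L)/theta(L)
    for k in range(n_terms + 1):
        acc = phi[k] if k < len(phi) else 0.0
        for i in range(1, min(k, len(theta) - 1) + 1):
            acc -= theta[i] * c[k - i]  # long division against theta(L)
        c.append(acc)
    return [-x for x in c[1:]]          # pi_j = -c_j for j >= 1

def ar_metric(model_x, model_y, n_terms=50):
    """Truncated Piccolo AR-metric of Equation (1); a model is (ar, ma)."""
    px = pi_weights(*model_x, n_terms=n_terms)
    py = pi_weights(*model_y, n_terms=n_terms)
    return sum((a - b) ** 2 for a, b in zip(px, py)) ** 0.5
```

For two AR(1) models all π-weights beyond the first vanish, so the distance between AR(1) models with coefficients 0.5 and 0.3 reduces to |0.5 − 0.3| = 0.2.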
Now, we consider the following VARMA model of order $(p, q)$, for an $n \times 1$ vector time series $z_t$:
$$\Phi(L)z_t = \Theta(L)\varepsilon_t, \qquad (2)$$
where $\Phi(L)$ and $\Theta(L)$ are two $n \times n$ matrices of polynomials in the lag operator L, and $\varepsilon_t$ is an $n \times 1$ vector white noise process with positive definite covariance matrix Σ. We assume that $\det[\Phi(u)] \neq 0$ for $|u| < 1$. This condition allows non-stationarity for the series, in the sense that the characteristic polynomial of the VARMA model, described by the equation $\det[\Phi(u)] = 0$, may have roots on the unit circle. The condition $\det[\Phi(u)] \neq 0$ for $|u| < 1$, however, explicitly excludes explosive processes from our consideration. We further assume that the model Equation (2) satisfies the usual identifiability conditions. If $q = 0$, we obtain a pure vector autoregressive (VAR) model of order p. If $p = 0$, we obtain a pure vector moving average (VMA) model of order q.
q. Consider the partition
where
is a scalar time series and
is an
vector of time series. Accordingly, the model Equation (2) for the partition of
can be rewritten as:
where
and
are matrix polynomials in the lag operator
L, with
. In this framework it is well-known (see, for example, [
11]) that
does not Granger-cause
if and only if
and that a sufficient condition for Equation (4) to hold is
We note that if the condition Equation (5) holds then
follows a univariate ARMA model given by:
The main aim of this paper is to establish the implications of the set of linear restrictions Equation (5), using the notion of the distance between ARMA models measured by Equation (1). In particular, we will consider the distance between the ARMA model Equation (6) (denoted $M_x$) and the ARMA model for the subprocess $x_t$ implied by the VARMA model Equation (2) (denoted $\widetilde{M}_x$).
Following Lütkepohl [1], the implied ARMA model $\widetilde{M}_x$ can be obtained as follows. Premultiplying both sides of Equation (2) by the adjoint of $\Phi(L)$, denoted as $\Phi(L)^{adj}$, we obtain
$$\det[\Phi(L)]z_t = \Phi(L)^{adj}\Theta(L)\varepsilon_t. \qquad (7)$$
We note that each component of $\Phi(L)^{adj}\Theta(L)\varepsilon_t$ is a sum of finite order MA processes, and thus it is a finite order MA process (see Proposition 11.1 in [1]). Hence, the subprocess $x_t$ follows an ARMA model given by:
$$\det[\Phi(L)]x_t = \beta(L)\eta_t, \qquad (8)$$
where $\eta_t$ is a univariate white noise and $\beta(L)$ is an invertible polynomial in the lag operator L. More precisely, $\beta(L)$ and $\eta_t$ are such that
$$\beta(L)\eta_t = \big[\Phi(L)^{adj}\Theta(L)\big]_{1\cdot}\,\varepsilon_t,$$
where $[\Phi(L)^{adj}\Theta(L)]_{1\cdot}$ denotes the first row of the matrix $\Phi(L)^{adj}\Theta(L)$. Finally, we observe that $x_t$ has also the following autoregressive representation of infinite order:
$$\widetilde{\pi}(L)x_t = \eta_t,$$
where $\widetilde{\pi}(L) = \beta(L)^{-1}\det[\Phi(L)]$.
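The algebra behind Equations (7) and (8) is mechanical in the bivariate case: the adjoint of a 2 × 2 polynomial matrix swaps the diagonal entries and negates the off-diagonal ones, and every product is a polynomial convolution. A minimal Python sketch (our own helper names, valid for $n = 2$ only; each polynomial is a coefficient list in powers of L):

```python
def pmul(a, b):
    """Product of two lag polynomials (coefficient convolution)."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

def psub(a, b):
    """Difference of two lag polynomials, padding to a common length."""
    n = max(len(a), len(b))
    a = a + [0.0] * (n - len(a))
    b = b + [0.0] * (n - len(b))
    return [x - y for x, y in zip(a, b)]

def first_row_adj_times(Phi, Theta):
    """First row of Phi(L)^adj Theta(L) for 2x2 polynomial matrices.

    adj(Phi) = [[Phi22, -Phi12], [-Phi21, Phi11]], so the first row of the
    product is (Phi22*Theta11 - Phi12*Theta21, Phi22*Theta12 - Phi12*Theta22).
    """
    (p11, p12), (p21, p22) = Phi
    (t11, t12), (t21, t22) = Theta
    return (psub(pmul(p22, t11), pmul(p12, t21)),
            psub(pmul(p22, t12), pmul(p12, t22)))
```

With $\Phi_{12}(L) = \Theta_{12}(L) = 0$, the second entry of this first row is identically zero, which is exactly the computation used in the proof of Proposition 1.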
2.1. Theoretical Results
We consider the distance according to Equation (1) between the model Equations (6) and (8), $M_x$ and $\widetilde{M}_x$:
$$d(M_x, \widetilde{M}_x) = \Big[\sum_{j=1}^{\infty}(\pi_j - \widetilde{\pi}_j)^2\Big]^{1/2},$$
where $\pi_j$ and $\widetilde{\pi}_j$ are the coefficients of the AR(∞) operators $\pi(L) = \Theta_{11}(L)^{-1}\Phi_{11}(L)$ and $\widetilde{\pi}(L)$, respectively.
The following proposition provides a necessary and sufficient condition for the set of linear restrictions Equation (5) in terms of the distance $d(M_x, \widetilde{M}_x)$.
Proposition 1. $\Phi_{12}(L) = 0$ and $\Theta_{12}(L) = 0$ if and only if $d(M_x, \widetilde{M}_x) = 0$.
Proof of Proposition 1. (⇒) If $\Phi_{12}(L) = 0$ and $\Theta_{12}(L) = 0$, we have
$$\Phi(L) = \begin{pmatrix} \Phi_{11}(L) & 0 \\ \Phi_{21}(L) & \Phi_{22}(L) \end{pmatrix}, \qquad \Theta(L) = \begin{pmatrix} \Theta_{11}(L) & 0 \\ \Theta_{21}(L) & \Theta_{22}(L) \end{pmatrix},$$
so that $\det[\Phi(L)] = \Phi_{11}(L)\det[\Phi_{22}(L)]$, and the first row of the matrix $\Phi(L)^{adj}\Theta(L)$ is such that
$$\big[\Phi(L)^{adj}\Theta(L)\big]_{1\cdot} = \big(\det[\Phi_{22}(L)]\,\Theta_{11}(L),\; 0\big).$$
Thus we have that $\beta(L)\eta_t = \det[\Phi_{22}(L)]\,\Theta_{11}(L)\,\varepsilon_{1t}$, hence $\eta_t = \varepsilon_{1t}$ (where this equality between random variables means equality with probability 1) and $\beta(L) = \det[\Phi_{22}(L)]\,\Theta_{11}(L)$. It follows that
$$\widetilde{\pi}(L) = \beta(L)^{-1}\det[\Phi(L)] = \Theta_{11}(L)^{-1}\Phi_{11}(L) = \pi(L),$$
and hence $d(M_x, \widetilde{M}_x) = 0$.
(⇐) We have to show that if $d(M_x, \widetilde{M}_x) = 0$, then $\Phi_{12}(L) = 0$ and $\Theta_{12}(L) = 0$. We may have two cases: $\Phi_{21}(L) \neq 0$ or $\Phi_{21}(L) = 0$.
First case: $\Phi_{21}(L) \neq 0$.
If $d(M_x, \widetilde{M}_x) = 0$, then
$$\pi(L) = \widetilde{\pi}(L).$$
On the other hand, we have
$$\pi(L) = \Theta_{11}(L)^{-1}\Phi_{11}(L), \qquad \widetilde{\pi}(L) = \beta(L)^{-1}\det[\Phi(L)],$$
and hence
$$\Theta_{11}(L)^{-1}\Phi_{11}(L) = \beta(L)^{-1}\det[\Phi(L)]. \qquad (9)$$
Using Schur's formula, we get
$$\det[\Phi(L)] = \det[\Phi_{22}(L)]\,\big(\Phi_{11}(L) - \Phi_{12}(L)\Phi_{22}(L)^{-1}\Phi_{21}(L)\big).$$
Thus $\widetilde{\pi}(L)$ assumes the following expression:
$$\widetilde{\pi}(L) = \beta(L)^{-1}\big(\Phi_{11}(L)\det[\Phi_{22}(L)] - c(L)\big),$$
where $c(L) = \Phi_{12}(L)\Phi_{22}(L)^{adj}\Phi_{21}(L)$.
Since the degree of the polynomial $c(L)$ is finite, Equation (9) implies that
$$\Theta_{11}(L)\big(\Phi_{11}(L)\det[\Phi_{22}(L)] - c(L)\big) = \beta(L)\,\Phi_{11}(L). \qquad (10)$$
Since $\Phi_{11}(L)$ and $\Theta_{11}(L)$ have no common factors, it follows from Equation (10) that it must be
$$c(L) = \Phi_{12}(L)\Phi_{22}(L)^{adj}\Phi_{21}(L) = 0.$$
Since by hypothesis $\Phi_{21}(L) \neq 0$, it follows that $\Phi_{12}(L) = 0$, and this in turn implies that $\det[\Phi(L)] = \Phi_{11}(L)\det[\Phi_{22}(L)]$ and $\beta(L) = \Theta_{11}(L)\det[\Phi_{22}(L)]$.
On the other hand, $\eta_t$ is such that
$$\beta(L)\eta_t = \det[\Phi_{22}(L)]\big(\Theta_{11}(L)\varepsilon_{1t} + \Theta_{12}(L)\varepsilon_{2t}\big),$$
and hence
$$\Theta_{11}(L)(\eta_t - \varepsilon_{1t}) = \Theta_{12}(L)\varepsilon_{2t}, \qquad (11)$$
where this equality is with probability 1. Since $\eta_t$ is a white noise, Equation (11) implies that $\Theta_{12}(L) = 0$.
Second case: $\Phi_{21}(L) = 0$.
By hypothesis $\Phi_{21}(L) = 0$; this implies that $\det[\Phi(L)] = \Phi_{11}(L)\det[\Phi_{22}(L)]$ and the first row of the matrix $\Phi(L)^{adj}\Theta(L)$ is given by
$$\big(\det[\Phi_{22}(L)]\Theta_{11}(L) - g(L)\Theta_{21}(L),\; \det[\Phi_{22}(L)]\Theta_{12}(L) - g(L)\Theta_{22}(L)\big),$$
where $g(L) = \Phi_{12}(L)\Phi_{22}(L)^{adj}$.
If $d(M_x, \widetilde{M}_x) = 0$, then $\pi(L) = \widetilde{\pi}(L)$ and hence
$$\beta(L) = \Theta_{11}(L)\det[\Phi_{22}(L)].$$
The following equality then occurs with probability 1:
$$\Theta_{11}(L)\det[\Phi_{22}(L)]\,\eta_t = \big(\det[\Phi_{22}(L)]\Theta_{11}(L) - g(L)\Theta_{21}(L)\big)\varepsilon_{1t} + \big(\det[\Phi_{22}(L)]\Theta_{12}(L) - g(L)\Theta_{22}(L)\big)\varepsilon_{2t}.$$
Since $\eta_t$ is a white noise, this implies that $\Phi_{12}(L) = 0$ and $\Theta_{12}(L) = 0$.
☐
We have also the following corollaries.
Corollary 1. Let $z_t$ be a pure VAR(p) process. Then y does not Granger-cause x if and only if $d(M_x, \widetilde{M}_x) = 0$.
Proof of Corollary 1. If y does not Granger-cause x, then $\Phi_{12}(L) = 0$. By hypothesis, $\Theta(L) = I_n$, so that $\Theta_{12}(L) = 0$. Hence we have that the restrictions Equation (5) hold. It follows from Proposition 1 that $d(M_x, \widetilde{M}_x) = 0$.
If $d(M_x, \widetilde{M}_x) = 0$, by Proposition 1, it follows that $\Phi_{12}(L) = 0$ and this, in a VAR framework, implies that y does not Granger-cause x. ☐
Corollary 2. Let $z_t$ be a pure VMA(q) process. Then y does not Granger-cause x if and only if $d(M_x, \widetilde{M}_x) = 0$.
Proof of Corollary 2. It is similar to the proof of Corollary 1. ☐
3. Inferential Implications
Proposition 1 allows us to test the set of linear restrictions Equation (5) considering the null hypothesis
. Further, we observe that if the process
follows a VAR model, Corollary 1 establishes that the Granger noncausality from
to
is equivalent to the condition
. Thus, in a VAR framework, we can test for Granger noncausality from
to
using the null hypothesis
without considering the nature of the involved variables. In fact, it is well-known that the use of non-stationary data in causality tests can yield spurious causality results (see, e.g., [
12]). Thus, before testing for Granger causality, it is important to establish the properties of the time series involved because different model strategies must be adopted when: the series are I(0), the series are partly I(0) and partly I(1), the series are determined I(1) but not cointegrated, or the series are cointegrated. Of course, the weakness of this strategy is that incorrect conclusions drawn from preliminary analysis might be carried over into the causality tests. In the VAR framework an alternative method is the so-called
lag-augmented Wald test (see [
13,
14]), which is a modified Wald test that requires the knowledge of the maximum order of integration of the involved variables. In this way, the proposed test based on the AR-metric can be a valid alternative for a Granger noncausality test (see [
4]), since it does not require the exact knowledge of the series properties or the knowledge of the maximum order of integration.
To conduct inference on the basis of Proposition 1, we need an asymptotic distribution for $\hat{d}$. In the class of ARMA processes, the asymptotic distribution of the maximum likelihood estimator $\hat{d}$ has been studied, among others, in [5,15]. In this case, for two independent ARMA processes X and Y, under the null hypothesis $d(X, Y) = 0$, the maximum likelihood estimator $\hat{d}$ has (suitably normalized) the following asymptotic distribution:
$$\hat{d}^{\,2} \xrightarrow{d} \sum_{j} \lambda_j \chi_j^2,$$
where the $\chi_j^2$ are independent $\chi^2$-distributions with one degree of freedom and the $\lambda_j$ are the eigenvalues of the covariance matrix of the estimated π-weights. The evaluation of this distribution can be cumbersome; hence approximations, as well as evaluation algorithms, have been proposed (see [15]). Anyhow, in our framework, the ARMA models implied by Equation (6) and by the VARMA model Equation (8) under the null hypothesis $d(M_x, \widetilde{M}_x) = 0$ are equal, so they cannot be considered independent. Then, to conduct the inferential procedures, we suggest the bootstrap algorithm proposed by Di Iorio and Triacca [4], which is described in the next section.
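To give an idea of the form of this limiting law, a draw from a weighted sum of independent χ² variables can be simulated with standard normals alone. The sketch below is purely illustrative; the weights are arbitrary values, not eigenvalues from any fitted model.

```python
import random

def weighted_chisq_draw(weights, df=1, rng=random):
    """One draw from sum_j weights[j] * chi2(df), writing each chi-square
    as a sum of df squared standard normals."""
    return sum(w * sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(df))
               for w in weights)
```

Since each χ²(1) has mean one, the mean of such draws converges to the sum of the weights, which is one way to sanity-check a simulation of the mixture.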
3.1. The Bootstrap Test Procedure
For an easy illustration of our bootstrap procedure, let us consider a bivariate VARMA(p,q) model, simply denoted as $\Phi(L)z_t = \Theta(L)\varepsilon_t$, where $z_t = (x_t, y_t)'$ and $\varepsilon_t$ has covariance matrix Σ; based on Proposition 1, we want to test the null hypothesis Equation (5) using $H_0: d(M_x, \widetilde{M}_x) = 0$:
1. Estimate on the observed data the VARMA(p,q) and obtain $\hat{\Phi}(L)$, $\hat{\Theta}(L)$, $\hat{\Sigma}$ and the residuals $\hat{\varepsilon}_t$;
2. using the estimated parameters from Step 1, obtain the univariate ARMA model implied by the estimated VARMA for the subprocess $x_t$;
3. evaluate the AR(∞) representation, truncated at some suitable lag, of the ARMA model in Step 2 (model $\widetilde{M}_x$);
4. estimate for $x_t$, using the observed data, an ARMA model under the null hypothesis and evaluate its AR(∞) representation truncated at some suitable lag (model $M_x$);
5. evaluate the distance $\hat{d}$ between the truncated AR(∞) representations obtained in Steps 3 and 4;
6. estimate the VARMA(p,q) model under the null hypothesis to obtain the estimates $\tilde{\Phi}(L)$, $\tilde{\Theta}(L)$ and the residuals $\tilde{\varepsilon}_t$;
7. apply the bootstrap to the re-centered residuals and obtain the pseudo-residuals $\varepsilon_t^*$;
8. generate the pseudo-data $z_t^*$ obeying the null hypothesis using $\tilde{\Phi}(L)z_t^* = \tilde{\Theta}(L)\varepsilon_t^*$;
9. using the pseudo-data $z_t^*$, repeat Steps 1–5 to obtain the bootstrap estimate of the distance $\hat{d}^*$;
10. repeat Steps 7–9 b times;
11. evaluate the bootstrap p-value as the proportion of the b estimated bootstrap distances that exceed the same statistic evaluated on the observed data $\hat{d}$, that is, $\hat{p} = \frac{1}{b}\sum_{i=1}^{b}\mathbf{1}(\hat{d}_i^* \geq \hat{d})$.
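Two generic ingredients of the algorithm above, the residual resampling of Step 7 and the p-value of Step 11, can be sketched as follows (illustrative helper names of our own, not code from the paper):

```python
import random

def resample_rows(residuals, rng=random):
    """Step 7 (i.i.d. case): draw whole rows of the T x n residual matrix
    with replacement, so the cross-equation covariance Sigma is preserved."""
    T = len(residuals)
    return [residuals[rng.randrange(T)] for _ in range(T)]

def bootstrap_pvalue(d_obs, d_boot):
    """Step 11: proportion of bootstrap distances at least as large as the
    distance computed on the observed data."""
    return sum(d >= d_obs for d in d_boot) / len(d_boot)
```

Resampling rows rather than individual entries is what carries the estimated dependence across equations into the pseudo-data, as discussed in the remarks below.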
When this procedure is applied, two remarks concerning the pseudo-data generation and the modeling of the dependency across the subprocesses are in order. Firstly, in a well-specified model framework (as well as during a simulation exercise), the estimated residuals $\hat{\varepsilon}_t$ do not show any autocorrelation structure, so we do not need any particular resampling scheme for dependent data to obtain the pseudo-error terms $\varepsilon_t^*$, and we can then apply a simple resampling procedure. Besides, for empirical studies the pseudo-data can be obtained considering several resampling strategies, such as a block bootstrap algorithm (see [16]). Secondly, in order to reproduce in the pseudo-data the dependency across the subprocesses expressed by Σ, we simply have to apply the resampling algorithm to the entire $T \times n$ matrix of the estimated residuals $\hat{\varepsilon}_t$.
5. Empirical Applications
In this section we present two empirical examples to illustrate the application of the test suggested in the paper. First, we consider a VAR model and in particular we examine the causal relationship between the log of real per capita income and inflation. Then, we consider a VARMA example based on the SCC dataset discussed in [22].
To take into account any possible dependence structure in the residuals of the estimated models, we use the Stationary Bootstrap ([23]) as the resampling algorithm. The Stationary Bootstrap is a block bootstrap scheme in which the resampled pseudo-series are stationary; this scheme chains blocks of observations of the original series starting at random locations, and the length of each block is randomly chosen from a geometric distribution. Following Palm et al. [24], the mean block length can be computed as a function of the length of the time sample; by some exploratory simulations we verified the robustness of the tests to different block sizes, so we report results for a single choice of block length.
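A minimal sketch of the index generation behind this scheme, assuming geometric block lengths with a given mean and circular wrapping of the sample (our own implementation, not the one used in the paper):

```python
import random

def stationary_bootstrap_indices(T, mean_block, rng=random):
    """Index sequence for a stationary bootstrap resample of length T:
    blocks start at uniform random positions, block lengths are geometric
    with mean `mean_block`, and indexing wraps around the sample."""
    p = 1.0 / mean_block          # geometric 'stop' probability
    idx = []
    while len(idx) < T:
        start = rng.randrange(T)  # random block start
        length = 1
        while rng.random() > p:   # continue the block with prob. 1 - p
            length += 1
        idx.extend((start + k) % T for k in range(length))
    return idx[:T]
```

The resampled series is then the original series evaluated at these indices; the geometric stopping rule is what makes the pseudo-series stationary.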
To discuss the possible causal relationship between the log of real per capita income (y) and inflation, we re-examined the dataset used by Ericsson et al. [25]. The dataset refers to the United States over the period 1953–1992 and can be downloaded from the Journal of Applied Econometrics Data Archive. The VAR order selection is based on the Bayesian Information Criterion, and the following model is estimated.
The computed $\hat{d}$-statistic is equal to 0.35 with a bootstrap p-value of 0. This result indicates the presence of Granger causality from output to inflation. This finding is in accordance with the results of Ericsson et al. [25]. The same result is obtained using the lag-augmented Wald test.
The SCC dataset discussed by Tiao and Box [22] considers the quarterly time series of the U.K. Financial Times Ordinary Share Index, the U.K. Car Production and the U.K. Financial Times Commodity Price from the third quarter of 1952 to the fourth quarter of 1967. The goal is to verify the possibility of predicting the first variable from the lagged values of the last two. According to Tiao and Box [22], a VARMA model is the best model for these data; hence a null hypothesis following Equation (5) is the inferential basis for testing just a sufficient condition on the predictability hypothesis. The VARMA maximum likelihood parameter estimates, using the Kalman filter procedure implemented in Gretl (ver. 1.9.9), are the following (standard errors in brackets):
The estimates are quite similar to the values reported as the “full model” in Table 10 in [22], taking into account the difference in the estimation algorithm and software. The computed $\hat{d}$-statistic is equal to 4.58 with a bootstrap p-value of 0.225, evaluated on 500 bootstrap replications; this finding is in accordance with the results of the “final model” in Table 10 in Tiao and Box [22]. We also performed a Wald test on the same null hypothesis; its value is 36.684, which asymptotically rejects the null, but its bootstrap p-value of 0.146 supports the results of our test.