Abstract
A multivariate CVAR(1) model for some observed variables and some unobserved variables is analysed using its infinite order CVAR representation of the observations. Cointegration and adjustment coefficients in the infinite order CVAR are found as functions of the parameters in the CVAR(1) model. Conditions for weak exogeneity for the cointegrating vectors in the approximating finite order CVAR are derived. The results are illustrated by two simple examples of relevance for modelling causal graphs.
JEL Classification:
C32
1. Introduction
In a conceptual exploration of long-run causal order, Hoover (2018) applies the CVAR(1) model for the processes and , to model a causal graph. The process is a solution to the equations
where the error terms are independent identically distributed (i.i.d.) Gaussian variables with mean 0 and variance and are independent of the errors , which are (i.i.d.) Gaussian with mean 0 and variance .
Thus, the stochastic trends, are nonstationary random walks and conditions will be given below for to be that is, nonstationary, but stationary. This will imply that is stationary, so that and cointegrate.
The entry means that causes which is written , and means that and it is further assumed that Note that the model assumes that there are no causal links from to so that is strongly exogenous.
A simple example for three variables, , , , and a trend T, is the graph
where the matrices are given by
where ∗ indicates a nonzero coefficient.
Provided that has all eigenvalues in the open unit disk, it is seen that
determines a stationary process defined for all . We define a nonstationary solution to (1) for by
Note that the starting values are
It is seen that , and are stationary processes for all and that is a solution to Equation (1). In the following, we assume that is defined by (2) for
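As a concrete illustration of this construction, the following sketch simulates a scalar analogue of the model: a random walk trend and an observation following the recursion x_t = m·x_{t−1} + c·T_t + ε_t. The form of the recursion, the parameters m = 0.5, c = 1, and the unit error variances are illustrative assumptions for the sketch, not the paper's specification. The combination x_t − (c/(1−m))·T_t is then stationary, while the level x_t inherits the stochastic trend:

```python
import random

random.seed(42)

m, c = 0.5, 1.0          # illustrative stable dynamics and trend loading
k = c / (1.0 - m)        # cointegrating coefficient: x_t - k*T_t is stationary

T, x = 0.0, 0.0
levels, combo = [], []
for _ in range(5000):
    T += random.gauss(0.0, 1.0)           # random walk trend
    x = m * x + c * T + random.gauss(0.0, 1.0)
    levels.append(x)
    combo.append(x - k * T)               # candidate stationary combination

def var(s):
    mu = sum(s) / len(s)
    return sum((v - mu) ** 2 for v in s) / len(s)

# The stationary combination has bounded variance; the level does not.
print(var(combo) < var(levels))           # expected: True
```

With a stable m, the combination follows a stationary AR(1) recursion, whereas the sample variance of the level grows with the random walk.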
The paper by Hoover gives a detailed and general discussion of the problems of recovering causal structures from nonstationary observations or subsets of when is unobserved, that is, where the observations are -dimensional and the unobserved processes and are - and m-dimensional respectively, . It is assumed that there are at least as many observations as trends, that is
Model (1) is therefore rewritten as
Note that there is now a causal link from the observed process to the unobserved process if .
It follows from (3) that is and cointegrated with cointegrating vectors , see Theorem 1. Therefore, has an infinite order autoregressive representation, see Johansen and Juselius (2014, Lemma 2), which is written as
where the operator norm is for some . The matrices and are of rank m, and where , . Thus, is not measurable with respect to , but is measurable with respect to . Here, the prediction errors are i.i.d. , where is calculated below. The representation of , similar to (2), is
where and Here, is a matrix of full rank for which , and similarly for . This shows that is a cointegrated process, that is, is nonstationary, while and are stationary.
A statistical analysis, including estimation of , , and can be conducted for the observations using an approximating finite order CVAR, see Saikkonen (1992) and Saikkonen and Lütkepohl (1996).
Hoover (2018) investigates, in particular, whether weak exogeneity for in the approximating finite order CVAR, that is, a zero row in is a useful tool for finding the causal structure in the graph.
The present note solves the problem of finding expressions for the parameters and in the CVAR(∞) model (4) for the observation , as functions of the parameters in model (3), and finds conditions on these for the presence of a zero row in and hence weak exogeneity for in the approximating finite order CVAR.
2. The Assumptions and Main Results
First, some definitions and assumptions are given, then the main results on and are presented and proved in Theorems 1 and 2. These results rely on Theorem A1 on the solution of an algebraic Riccati equation, which is given and proved in Appendix A.
In the following, a matrix is called stable, if all eigenvalues are contained in the open unit disk. If A is a matrix of rank , an orthogonal complement, is defined as a matrix of rank for which . If , Note that is only defined up to multiplication from the right by a matrix of full rank. Throughout, and denote conditional expectation and variance given the sigma-field , generated by the observations.
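For small systems, the stability condition is easy to verify directly. A minimal sketch for the 2×2 case (with illustrative matrices, not the paper's parameters) computes both eigenvalues from the characteristic polynomial and compares the spectral radius to one:

```python
import cmath

def is_stable_2x2(A):
    """True iff both eigenvalues of the 2x2 matrix A lie in the open unit disk."""
    (a, b), (c, d) = A
    tr, det = a + d, a * d - b * c
    disc = cmath.sqrt(tr * tr - 4.0 * det)      # roots of x^2 - tr*x + det = 0
    lam1, lam2 = (tr + disc) / 2.0, (tr - disc) / 2.0
    return max(abs(lam1), abs(lam2)) < 1.0

print(is_stable_2x2([[0.5, 0.2], [0.0, 0.3]]))  # eigenvalues 0.5, 0.3 -> True
print(is_stable_2x2([[1.0, 0.0], [0.0, 0.3]]))  # eigenvalue 1.0 on the circle -> False
```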
Assumption 1.
In Equation (3), it is assumed that
(i) , , and are mutually independent and i.i.d. Gaussian with mean zero and variances , , and where and are diagonal matrices,
(ii) , and are stable,
(iii) has full rank m.
Assumption 1(ii) on and M is taken from Hoover (2018) to ensure that, for instance, the process given by the equations is stationary if the input is stationary, such that the nonstationarity of in model (3) is created by the trends and not by the own dynamics of as given by . It follows from this assumption that M is nonsingular, because is stable, and similarly for and . Moreover, is nonsingular because
The Main Results
The first result on is a simple consequence of model (3).
Theorem 1.
Assumption 1 implies that the cointegrating rank is and that the coefficients β and in the CVAR(∞) representation for , see (4), are given for as
For has rank and there is no cointegration: .
Proof of Theorem 1.
From the model Equation (3), it follows, by eliminating from the first two equations, that
Solving for the nonstationary terms gives
Multiplying by , it is seen that is stationary, if By Assumption 1(i), has rank so that has rank which proves (6). ☐
The result for is more involved and is given in Theorem 2. The proof is a further analysis of (7) and involves, first, the representation in terms of a sum of prediction errors, see (5), and second, a representation of as the (weighted) sum of the prediction errors . The second representation requires a result from control theory on the solution of an algebraic Riccati equation, together with some results based on the Kalman filter for the calculation of the conditional mean and variance of the unobserved processes given the observations , . These are collected as Theorem A1 in Appendix A.
For the discussion of these results, it is useful to reformulate (3) by defining the unobserved variables and errors
and the matrices
Then, (3) becomes
One can then show, see Theorem A1, that, based on properties of the Gaussian distribution, a recursion can be found for the calculation of and and , using the matrices in (8) and (9), by the equations
It then follows from results from control theory, that exists and satisfies the algebraic Riccati equation
Moreover, the prediction errors are independent for and the prediction errors are independent identically distributed for . Finally, has the representation in the prediction errors,
where .
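The structure of this representation can be illustrated in the simplest scalar case. The sketch below uses a hypothetical scalar state-space model with state coefficient a, observation coefficient h, and noise variances q and r (stand-ins chosen for illustration, not the matrices of model (3)); iterating the Kalman prediction-variance recursion from zero converges to the fixed point V of the scalar algebraic Riccati equation:

```python
def riccati_step(P, a=0.9, h=1.0, q=1.0, r=0.5):
    """One step of the scalar Kalman prediction-variance recursion."""
    return a * a * P + q - (a * P * h) ** 2 / (h * h * P + r)

P = 0.0
for _ in range(200):
    P_next = riccati_step(P)
    if abs(P_next - P) < 1e-12:
        break
    P = P_next

# At convergence, P satisfies the scalar algebraic Riccati equation P = f(P).
residual = abs(P - riccati_step(P))
print(residual < 1e-10)   # expected: True
```

The stability of the state coefficient (|a| < 1 here) is what guarantees convergence to a finite positive limit, mirroring the role of Assumption 1(ii) in the paper.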
Comparing the representation (5) for and (14) for gives a more precise relation between the coefficients of the nonstationary terms in (7). The main result of the paper is to show how this leads to expressions for the coefficients and as functions of the parameters in model (3).
Theorem 2.
Assumption 1 implies that the coefficients α and in the CVAR(∞) representation of are given for as
where
Proof of Theorem 2.
The left hand side of (7) has two nonstationary terms. The observation is represented in (5) in terms of a random walk in the prediction errors plus a stationary term, and is a random walk in . Calculating the conditional expectation given the sigma-field , is replaced by , which in (14) is represented as a weighted sum of . Thus, the conditional expectation of (7) gives
where the right hand side is bounded in mean:
Setting and dividing by it follows from (5) that
where is the Brownian motion generated by the i.i.d. prediction errors
From (14), it can be proved that
This follows by replacing by because for it holds that
Next we can replace by as follows: For the sum
is measurable with respect to both and such that
Finally, setting and normalizing (17) by it follows that in the limit
This relation shows that the coefficient to is zero, so that can be chosen as
and therefore which proves (15). ☐
3. Two Examples of Simplifying Assumptions
It follows from Theorem 2 that, in order to investigate a zero row in , the matrix V is needed. This is easy to calculate from the recursion (11) for a given value of the parameters, but the properties of V are more difficult to evaluate. In general, does not contain a zero row, but if , the expressions for and simplify, so that simple conditions on and imply a zero row in and hence give weak exogeneity in the statistical analysis of the approximating finite order CVAR. This extra condition implies that
and
such that simplifies to
Thus, a condition for a zero row in is
because . This is simple to check by inspecting the matrices and in model (3). Next, two cases are given where such a simple solution is available.
Case 1
(M12 = 0). If the unobserved process does not cause the observation then Therefore, and from (20) it follows that
Thus, α has a zero row if has a zero row.
An example of is the chain where is observed and and hence and Then, because
Thus, the first row of is a zero row, such that is weakly exogenous.
To formulate the next case, a definition of strong orthogonality of two matrices is introduced.
Definition 1.
Let A be a matrix and B a matrix. Then, A and B are called strongly orthogonal if for all diagonal matrices D, or equivalently if for all .
Thus, if we assume that row j of B is zero, and if row j of A is zero. A simple example is
Thus, the definition means that if two matrices are strongly orthogonal, it is due to the positions of the zeros and not to linear combinations of nonzero numbers being zero.
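This rowwise characterization makes strong orthogonality easy to test: a row index may carry nonzero entries in at most one of the two matrices. A small sketch with illustrative matrices:

```python
def strongly_orthogonal(A, B):
    """True iff, for every row index j, row j of A or row j of B is entirely
    zero; then A'DB = 0 for every diagonal matrix D."""
    return all(all(x == 0 for x in ra) or all(x == 0 for x in rb)
               for ra, rb in zip(A, B))

A = [[1, 0], [0, 0], [2, 3]]
B = [[0, 0], [5, 1], [0, 0]]
print(strongly_orthogonal(A, B))                          # True: zero rows never overlap
print(strongly_orthogonal(A, [[1, 0], [0, 0], [0, 0]]))   # False: first row nonzero in both
```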
Thus, in particular if and are strongly orthogonal, and if T causes a variable in then does not cause that variable. The expression for V simplifies in the following case.
Lemma 1.
If and then and such that
Proof of Lemma 1.
We first prove that is blockdiagonal for . From (2), it follows that
Thus, if denotes the variance of then
and hence blockdiagonal. Assume, therefore, that blockdiag( and consider the expression for see (11). In this expression, is block diagonal (because and and are block diagonal, and the same holds for Thus, it is enough to show that
is block diagonal. To simplify the notation, define the normalized matrices
Then, by assumption,
so that, using
A direct calculation shows that
and that
such that is block diagonal.
Then, and hence are block diagonal. Taking the limit for it is seen that also V is block diagonal. ☐
Case 2
(C2 = 0, and M12 and C1 are strongly orthogonal). Because and Lemma 1 shows that so that the condition and (20) hold. Moreover, strong orthogonality also implies that such that for some Hence
and therefore, a zero row in gives a zero row in α.
Consider again the chain but assume now that is not observed. Thus, and Here, T causes and causes so that
Note that for all diagonal D because T and cause disjoint subsets of . This, together with , implies that V is block diagonal and that (21) holds. Thus, is weakly exogenous, , if
4. Conclusions
This paper investigates the problem of finding adjustment and cointegrating coefficients for the infinite order CVAR representation of a partially observed simple CVAR(1) model. The main tools are some classical results for the solution of the algebraic Riccati equation, and the results are exemplified by an analysis of CVAR(1) models for causal graphs in two cases where simple conditions for weak exogeneity are derived in terms of the parameters of the CVAR(1) model.
Funding
This research received no external funding.
Acknowledgments
The author would like to thank Kevin Hoover for long discussions on the problem and its solution, and Massimo Franchi for reading a first version of the paper and for pointing out the excellent book by Lancaster and Rodman, and two anonymous referees who helped clarify some of the proofs.
Conflicts of Interest
The author declares no conflict of interest.
Appendix A.
The next theorem shows how the Kalman filter can be used to calculate and , using the same technique as for the common trends model, and proves the existence of the limit of . The last result follows from the theory of the algebraic Riccati equation, see Lancaster and Rodman (1995), in the following referred to as LR(1995).
Theorem A1.
Let and be given by model (10) and let Assumption 1 be satisfied. Then, and are given recursively, using the starting values and by
where
and the prediction errors
are independent .
The sequence starting with converges to a finite positive limit V, which satisfies the algebraic Riccati equation,
Furthermore,
is stable, and satisfies the equation
Proof of Theorem A1.
The variance can be calculated recursively, using the properties of the Gaussian distribution, as
Then, (A8)–(A11) give the recursion for in (A1). Similarly, for the conditional mean, it is seen that
which implies (A2) with prediction error .
Note that (A1) is the usual recursion from the Kalman filter equations for the state space model obtained from (10) for , see Durbin and Koopman (2012). Note also, however, that (A2) is not the usual recursion from the common trends model, because of the first term containing . It is seen from (A1) that if converges to then V has to satisfy the algebraic Riccati equation (A5) and is given as indicated.
The result that converges to a finite positive limit follows from LR (1995, Theorem 17.5.3), where the assumptions, in the present notation, are
is controllable,
is stabilizable,
is detectable.
Before giving the proof, some definitions from control theory are given, which are needed for checking the conditions of the results in LR(1995).
Let A be a matrix and B be a matrix.
The pair is called controllable if the controllability matrix [B, AB, A^2B, ...] (with powers of A up to one less than the state dimension) has full row rank, see LR(1995, (4.1.3)).
The pair is stabilizable if there is a matrix such that is stable LR(1995, page 90, line 5-).
Finally is detectable means that is stabilizable, LR(1995, page 91 line 6-).
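In the single-input two-dimensional case, the controllability condition reduces to nonsingularity of the 2×2 controllability matrix [b, Ab]; a minimal sketch with illustrative matrices (not the paper's system):

```python
def is_controllable_2x2(A, b):
    """For 2x2 A and column b, the pair (A, b) is controllable iff the
    controllability matrix [b, Ab] is nonsingular."""
    Ab = [A[0][0] * b[0] + A[0][1] * b[1],
          A[1][0] * b[0] + A[1][1] * b[1]]
    return b[0] * Ab[1] - b[1] * Ab[0] != 0   # determinant of [b, Ab]

print(is_controllable_2x2([[1.0, 1.0], [0.0, 1.0]], [0.0, 1.0]))  # True
print(is_controllable_2x2([[1.0, 0.0], [0.0, 1.0]], [1.0, 0.0]))  # False: Ab = b
```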
The first assumption is easy to check: that the pair is controllable, see , means that
The second assumption, follows because controllability implies stabilizability, see LR (1995, Theorem 4.4.2).
Finally, shows that detectable means stabilizable, and LR(1995, Theorem 4.5.6 (b)), see also Hautus (1969), shows that is stabilizable if and only if
For , using and Assumption 1, it follows that
For , using Assumption 1(ii), it is seen that
because is not an eigenvalue of the stable matrix when
Thus, is stabilizable, and assumptions , , hold, such that and LR (1995, Theorem 17.5.3) applies. This proves that the limit exists and (A6) holds.
References
- Durbin, James, and Siem Jan Koopman. 2012. Time Series Analysis by State Space Methods, 2nd ed. Oxford: Oxford University Press.
- Hautus, Malo L. J. 1969. Controllability and observability conditions of linear autonomous systems. Koninklijke Nederlandse Akademie van Wetenschappen. Indagationes Mathematicae 12: 443–48.
- Hoover, Kevin D. 2018. Long-Run Causal Order: A Preliminary Investigation. Durham: Department of Economics and Department of Philosophy, Duke University.
- Johansen, Søren, and Katarina Juselius. 2014. An asymptotic invariance property of the common trends under linear transformations of the data. Journal of Econometrics 178: 310–15.
- Lancaster, Peter, and Leiba Rodman. 1995. Algebraic Riccati Equations. Oxford: Clarendon Press.
- Saikkonen, Pentti. 1992. Estimation and testing of cointegrated systems by an autoregressive approximation. Econometric Theory 8: 1–27.
- Saikkonen, Pentti, and Helmut Lütkepohl. 1996. Infinite order cointegrated vector autoregressive processes: Estimation and inference. Econometric Theory 12: 814–44.
© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).