Article

Designing Bivariate Auto-Regressive Timeseries with Controlled Granger Causality

Japan Advanced Institute of Science and Technology, 1-1 Asahidai, Nomi 923-1292, Ishikawa, Japan
* Author to whom correspondence should be addressed.
Entropy 2021, 23(6), 742; https://doi.org/10.3390/e23060742
Submission received: 12 May 2021 / Revised: 1 June 2021 / Accepted: 8 June 2021 / Published: 12 June 2021
(This article belongs to the Special Issue Measures of Information)

Abstract:
In this manuscript, we analyze a bivariate vector auto-regressive (VAR) model in order to derive the design principles for timeseries with a controlled statistical inter-relationship. We show how to generate bivariate timeseries with a given covariance and Granger causality (or, equivalently, transfer entropy), and we show the trade-off relationship between these two types of statistical interaction. In principle, covariance and Granger causality are independently controllable, but the feasible ranges of their values which allow the VAR to be proper and to have a stationary distribution constrain each other. Thus, our analysis identifies an essential tri-lemma structure among the stability and properness of the VAR, the controllability of covariance, and that of Granger causality.

1. Introduction

1.1. Background and Motivation

In the field of cognitive psychology, the human perception of the life-likeness (called animacy perception) of one or multiple moving geometric patterns has been studied for decades [1,2,3,4,5]. There are multiple findings on the effect of “synchrony” or “temporal contingency” between multiple moving points on animacy perception. Findings from one line of research [2] have suggested that a higher degree of “temporal contingency” of the moving objects is related to a higher likelihood of animacy perception. Findings from the other line of research [6] have suggested that the highest “temporal contingency”, presented in the form of perfect synchronization, would decrease the likelihood of animacy perception.
These two lines of research together suggest the existence of multiple types of “temporal contingency”. Nevertheless, this past research does not appear to clarify what these types are. Further, confusion between these two distinct types of effects has led to two lines of apparently conflicting findings on the effect of “temporal contingency”.
With this potential conflict in the literature on animacy perception in mind, we explore a theoretical framework which can generate timeseries of multiple random variables with multiple distinct types of statistical dependency. One such system, which is sufficiently simple and readily manipulable, is vector auto-regression (VAR). Vector auto-regression is a random process for generating multivariate timeseries for a given set of parameters. In this manuscript, we specifically consider only bivariate VAR, which is a minimal system with interaction between two moving points.

1.2. Vector Auto-Regressive Model, Granger Causality, and Transfer Entropy

Importantly, bivariate VAR, a series of paired random variables $(x_t, y_t)$ for $t = 0, 1, \ldots$, has two types of statistical dependency: the correlation and the Granger causality of a timeseries [7], the latter of which has been identified with the transfer entropy [8] of a timeseries generated by a Gaussian process by [9]. The correlation between the univariate series $x$ and $y$ is the statistical dependency between $x_t$ and $y_t$ in the limit $t \to \infty$ (if it exists), while the Granger causality from $y$ to $x$ is that between $x_t$ and $y_{t-1}$ given $x_{t-1}$ in the limit $t \to \infty$ (if it exists). Conceptually, correlation captures a type of similarity between two timeseries, whereas Granger causality captures the “reactiveness” of one timeseries to another.
Thus, given these theoretical differences, our goal is to propose a method to generate a bivariate random timeseries with a desired correlation and desired Granger causality of both types. The Granger causality of VAR has also been considered in other fields. In econometrics, the field in which it was originally proposed [7], Granger causality has been used as a measure of interaction between a pair of economic timeseries [10,11,12]. It has also been used in the behavioral sciences in general [13,14], and particularly in computational neuroscience [15,16,17,18]. Such an in-principle data generation technique would be vital for testing any hypothesis on the empirical nature of timeseries (e.g., animacy perception) in the empirical sciences using VAR and Granger causality, as mentioned above. To our knowledge, however, there has been no mathematical analysis of the theoretical limitations of such a data generation technique for given statistics.
Thus, we first need to explore the mathematical relationship between the parameters of a VAR and the correlation and Granger causality of a timeseries generated from it. In this paper, we therefore explore the theoretical structure of bivariate VAR from the designer’s perspective, and analyze the mathematical limit to the extent to which we can simultaneously control the correlation and Granger causality of a bivariate timeseries.
This paper is structured as follows. In Section 2, the VAR model is defined, and a set of basic statistical properties of the VAR model, such as Granger causality (Section 3), is derived from its parameters. In Section 4, the existence of the stationary distribution of the VAR is analyzed. This is the foundation which sets the limit of the controllable set of parameters. In Section 5, we give a method to derive the parameters of a VAR for a given set of statistics of a bivariate timeseries. In Section 6, the mathematical analysis provided in this paper is summarized and a remark on the design principle of bivariate timeseries generated by VAR is added.

2. Vector Auto-Regression (VAR)

In theory, Granger causality (GC) is the transfer entropy of the random variables in a bivariate vector auto-regression (VAR) model up to a constant factor of 2, if the VAR model has a stationary distribution [9]. Thus, it is straightforward to start with the bivariate VAR and derive its transfer entropy. In this way, we can derive a rich mathematical relationship between GC and the properties of the VAR, rather than just a statistic of the bivariate timeseries.
Definition 1.
For some real vector $\mu \in \mathbb{R}^2$ and some positive-definite matrix $\Sigma \in \mathbb{R}^{2 \times 2}$, suppose the random variable $\epsilon_t$ for every integer $t = 0, 1, \ldots$ is drawn from the bivariate normal distribution $\mathcal{N}(\epsilon_t \mid \mu, \Sigma)$ with mean $\mu$ and variance $\Sigma$. Define the initial vector by $v_0 = (x_0, y_0)^\top$, and for $t \ge 0$ and a given coefficient matrix with real entries $A := \begin{pmatrix} a_{0,0} & a_{0,1} \\ a_{1,0} & a_{1,1} \end{pmatrix} \in \mathbb{R}^{2 \times 2}$, define the random variable $v_t$ by
$$v_{t+1} := A v_t + \epsilon_t. \tag{1}$$
Then, bivariate vector auto-regression is defined by the semi-infinite series of random variables $V = (v_0, v_1, v_2, \ldots)$.
In general, one can generate a timeseries $v_0, v_1, \ldots$ by fixing the set of VAR parameters, the coefficient matrix $A$ and the base covariance matrix $\Sigma$, where the base mean vector $\mu$ is omitted as its effect is lost in the limit $t \to \infty$ when the VAR is stationary. The stationary correlation (covariance) $\hat{\Sigma}$ and the Granger causalities $G_0$ and $G_1$ defined later are the statistics of the timeseries generated by a VAR model (Figure 1). In what follows, we first explore the forward relationship of how the statistics $\hat{\Sigma}$ and $G_0, G_1$ are given by the VAR parameters $(A, \Sigma)$. We then consider the backward relationship in which the VAR parameters $(A, \Sigma)$ suffice to generate a timeseries with given desired timeseries statistics $(G_0, G_1, \hat{\Sigma})$.
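To make the forward direction concrete, the recursion in Definition 1 can be simulated directly. The following sketch (with hypothetical parameter values, not taken from the paper; NumPy is assumed, and the base mean is set to zero as above) draws $\epsilon_t \sim \mathcal{N}(0, \Sigma)$ and iterates Equation (1):

```python
import numpy as np

# Hypothetical VAR parameters (illustrative values, not from the paper)
A = np.array([[0.5, 0.2],
              [0.1, 0.4]])       # coefficient matrix
Sigma = np.array([[1.0, 0.3],
                  [0.3, 1.0]])   # base covariance matrix (positive definite)

def simulate_var(A, Sigma, v0, T, seed=0):
    """Iterate v_{t+1} = A v_t + eps_t with eps_t ~ N(0, Sigma)."""
    rng = np.random.default_rng(seed)
    eps = rng.multivariate_normal(np.zeros(2), Sigma, size=T)
    v = np.empty((T + 1, 2))
    v[0] = v0
    for t in range(T):
        v[t + 1] = A @ v[t] + eps[t]
    return v

v = simulate_var(A, Sigma, v0=np.zeros(2), T=10_000)
```

Since both eigenvalues of this $A$ lie inside the unit circle, the sample covariance of the generated series approaches the stationary covariance $\hat{\Sigma}$ discussed below.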

2.1. Marginal Distribution of the VAR at Each Step

Lemma 1
(Marginal distribution of the VAR random variables at each step). The VAR model with the initial vector $v_0 \in \mathbb{R}^2$ and the coefficient matrix $A \in \mathbb{R}^{2 \times 2}$ has the bivariate normal distribution
$$\mathcal{N}(v_t \mid \mu_t, \Sigma_t)$$
as the marginal distribution of the random variable $v_t$ at each step $t = 0, 1, \ldots$, where
$$\mu_t := A^t v_0, \qquad \Sigma_t = \sum_{s=0}^{t} A^s \Sigma (A^s)^\top.$$
Proof. 
By Definition 1, Lemma 1 holds for $t = 0$. For $t + 1 > 0$, we prove Lemma 1 by assuming that it holds up to $t \ge 0$. By this assumption held for $t$, the distribution of $v_t \in \mathbb{R}^2$ is the bivariate normal distribution
$$\mathcal{N}(v_t \mid \mu_t, \Sigma_t) = (2\pi)^{-1} |\Sigma_t|^{-\frac{1}{2}} e^{-\frac{1}{2} (v_t - \mu_t)^\top \Sigma_t^{-1} (v_t - \mu_t)}$$
with mean $\mu_t$ and covariance matrix $\Sigma_t$. Then, the random variable $A v_t$ is distributed by the normal distribution
$$\mathcal{N}(A v_t \mid A \mu_t, A \Sigma_t A^\top) = (2\pi)^{-1} |A \Sigma_t A^\top|^{-\frac{1}{2}} e^{-\frac{1}{2} (A(v_t - \mu_t))^\top (A \Sigma_t A^\top)^{-1} (A(v_t - \mu_t))}$$
with mean $A \mu_t$ and covariance matrix $A \Sigma_t A^\top$. The random variable $\epsilon_t$ is distributed by the following normal distribution:
$$\mathcal{N}(\epsilon_t \mid 0, \Sigma) = (2\pi)^{-1} |\Sigma|^{-\frac{1}{2}} e^{-\frac{1}{2} \epsilon_t^\top \Sigma^{-1} \epsilon_t}.$$
Thus, by the VAR recursion, Equation (1), the random variable
$$v_{t+1} := A v_t + \epsilon_t$$
has a distribution calculated by the following integral:
$$P(v_{t+1}) = \int_{\epsilon_t \in \mathbb{R}^2} \mathcal{N}(v_{t+1} - \epsilon_t \mid A \mu_t, A \Sigma_t A^\top)\, \mathcal{N}(\epsilon_t \mid 0, \Sigma)\, d\epsilon_t.$$
Calculating this convolution, we have
$$P(v_{t+1}) = \mathcal{N}\!\left(v_{t+1} \mid A \mu_t, A \Sigma_t A^\top + \Sigma\right).$$
Thus, defining $\mu_{t+1} := A \mu_t$ and $\Sigma_{t+1} := A \Sigma_t A^\top + \Sigma$, Lemma 1 holds for $t + 1$. By induction, Lemma 1 holds for any integer $t \ge 0$. □

2.2. Stability of VAR: Lyapunov Equation

By Lemma 1, the mean and covariance matrix of the random variable at the $t$-th step are
$$\mu_t = A^t v_0 \quad \text{and} \quad \Sigma_t = \sum_{s=0}^{t} A^s \Sigma (A^s)^\top.$$
From this, we have the stationary distribution
$$\lim_{t \to \infty} \mathcal{N}(v_t \mid \mu_t, \Sigma_t),$$
if and only if the absolute values of both eigenvalues $\lambda_0, \lambda_1 \in \mathbb{C}$ of the coefficient matrix $A$ are less than 1. If there is such a stationary distribution, we call the VAR stable, and its stationary distribution is the bivariate normal distribution
$$\mathcal{N}(\hat{v} \mid 0_2, \hat{\Sigma}),$$
where the stationary mean vector $\hat{v} \in \mathbb{R}^2$ and the stationary covariance matrix $\hat{\Sigma} \in \mathbb{R}^{2 \times 2}$ are defined as follows. If the VAR is stable, we have the following Lyapunov equation for the stationary covariance matrix $\hat{\Sigma} \in \mathbb{R}^{2 \times 2}$:
$$\hat{\Sigma} = \Sigma + A \hat{\Sigma} A^\top. \tag{4}$$
The Lyapunov equation is solved analytically by
$$\operatorname{vec} \hat{\Sigma} = \left( I_4 - A \otimes A \right)^{-1} \operatorname{vec} \Sigma, \tag{5}$$
where $I_d \in \mathbb{R}^{d \times d}$ is the $d$-th order identity matrix, $\otimes$ denotes the Kronecker product, and $\operatorname{vec} X$ for any matrix $X = (x_{i,j})_{i = 1, \ldots, n,\; j = 1, \ldots, m}$ is the vectorization operator $\operatorname{vec}(\cdot) : \mathbb{R}^{n \times m} \to \mathbb{R}^{nm \times 1}$ defined by
$$\operatorname{vec} X := (x_{1,1}, x_{2,1}, \ldots, x_{n,1}, \ldots, x_{1,m}, x_{2,m}, \ldots, x_{n,m})^\top.$$
The Lyapunov Equation (4) has a solution for $\hat{\Sigma}$ if the VAR is stable, but not vice versa. This is shown by Theorem 1 below, using Lemma 4.
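The vectorized solution (5) is straightforward to evaluate numerically. The sketch below (illustrative parameter values; NumPy assumed) solves for $\operatorname{vec} \hat{\Sigma}$ and confirms the fixed-point property (4); note that column-major (Fortran) ordering matches the column-stacking $\operatorname{vec}$ operator:

```python
import numpy as np

# Hypothetical stable coefficient matrix and PD base covariance (illustrative)
A = np.array([[0.5, 0.2],
              [0.1, 0.4]])
Sigma = np.array([[1.0, 0.3],
                  [0.3, 1.0]])

def stationary_covariance(A, Sigma):
    """Solve the Lyapunov equation S = Sigma + A S A^T via
    vec(S) = (I_4 - A (x) A)^{-1} vec(Sigma)."""
    n = A.shape[0]
    vec_S = np.linalg.solve(np.eye(n * n) - np.kron(A, A),
                            Sigma.reshape(n * n, order="F"))
    return vec_S.reshape((n, n), order="F")

S_hat = stationary_covariance(A, Sigma)
```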

3. Transfer Entropy and Granger Causality

In [9], the transfer entropy of an appropriate triplet of variables in the VAR model is shown to be equivalent to Granger causality up to the constant factor 2. Following this result, we define this quantity as the Granger causality of the VAR model, below.
Although this relationship has been known in a more general form [9], we re-derive it for bivariate VAR in order to later analyze the structure of VAR and GCs in depth—for example, its upper and lower bounds (Lemma 3), stability (Section 4), and design principle (Section 5).
Definition 2.
If a VAR with its random variables $v_t = (x_t, y_t)^\top \in \mathbb{R}^2$ for $t = 0, 1, \ldots$ is stable, the transfer entropy from $y$ to $x$ is defined by
$$T_{y \to x} := \lim_{t \to \infty} \left( H(x_{t+1} \mid x_t) - H(x_{t+1} \mid x_t, y_t) \right),$$
and the transfer entropy from $x$ to $y$ is defined by
$$T_{x \to y} := \lim_{t \to \infty} \left( H(y_{t+1} \mid y_t) - H(y_{t+1} \mid y_t, x_t) \right),$$
where the differential entropy of a random variable $x$ with probability density function $P$ is
$$H(x) := -\int_{x \in \Omega} P(x) \log P(x)\, dx,$$
and the conditional entropy is
$$H(x \mid y) := H(x, y) - H(y).$$
In particular, we call two times the transfer entropy the Granger causality, denoted by
$$G_0 = 2 T_{y \to x} \quad \text{and} \quad G_1 = 2 T_{x \to y}.$$
The GCs are written explicitly in terms of the VAR parameters in the following lemma.
Lemma 2
(Granger causality). If a stable VAR has its base covariance matrix, coefficient matrix, and stationary covariance matrix
$$\Sigma = \begin{pmatrix} \sigma_{0,0} & \sigma_{0,1} \\ \sigma_{1,0} & \sigma_{1,1} \end{pmatrix}, \quad A = \begin{pmatrix} a_{0,0} & a_{0,1} \\ a_{1,0} & a_{1,1} \end{pmatrix}, \quad \hat{\Sigma} = \begin{pmatrix} \hat{\sigma}_{0,0} & \hat{\sigma}_{0,1} \\ \hat{\sigma}_{1,0} & \hat{\sigma}_{1,1} \end{pmatrix},$$
each Granger causality of this VAR for $i = 0, 1$ is
$$G_i = \log \left( 1 + \frac{a_{i,1-i}^2 \det \hat{\Sigma}}{\hat{\sigma}_{i,i}\, \sigma_{i,i}} \right).$$
Proof. 
In general, the differential entropy of the multivariate normal distribution $\mathcal{N}(v \mid \mu, \Sigma)$ is
$$H(v) = \frac{1}{2} \log |2 \pi e \Sigma|,$$
where $e \approx 2.71$ is Napier’s constant. For the joint conditional probability distribution of $v_{t+1} = (x_{t+1}, y_{t+1})^\top$
$$P(v_{t+1} \mid v_t) = \mathcal{N}(v_{t+1} \mid A v_t, \Sigma),$$
the two marginal conditional distributions of $x_{t+1}, y_{t+1}$ are
$$P(x_{t+1} \mid v_t) = \mathcal{N}(x_{t+1} \mid (1, 0) A v_t, \sigma_{0,0}) \quad \text{and} \quad P(y_{t+1} \mid v_t) = \mathcal{N}(y_{t+1} \mid (0, 1) A v_t, \sigma_{1,1}).$$
Thus, the conditional entropies of $x_{t+1}$ and $y_{t+1}$ given $v_t = (x_t, y_t)^\top$ are
$$H(x_{t+1} \mid x_t, y_t) = \frac{1}{2} \log |2 \pi e\, \sigma_{0,0}| \quad \text{and} \quad H(y_{t+1} \mid x_t, y_t) = \frac{1}{2} \log |2 \pi e\, \sigma_{1,1}|.$$
With the conditional probability distribution and the marginal probability distribution
$$P(v_{t+1} \mid v_t) = \mathcal{N}(v_{t+1} \mid A v_t, \Sigma) \quad \text{and} \quad P(v_t) = \mathcal{N}(v_t \mid A^t v_0, \Sigma_t),$$
the joint probability distribution of $v_t$ and $v_{t+1}$ is their product
$$P(v_{t+1}, v_t) = \mathcal{N}(v_{t+1} \mid A v_t, \Sigma)\, \mathcal{N}(v_t \mid A^t v_0, \Sigma_t).$$
Specifically, this quad-variate normal distribution is
$$P(v_{t+1}, v_t) = \frac{e^{-\frac{1}{2} (v_{t+1} - A v_t)^\top \Sigma^{-1} (v_{t+1} - A v_t) - \frac{1}{2} (v_t - A^t v_0)^\top \Sigma_t^{-1} (v_t - A^t v_0)}}{(2\pi)^2\, |\Sigma|^{\frac{1}{2}}\, |\Sigma_t|^{\frac{1}{2}}}.$$
Applying the identities
$$v_{t+1} - A v_t = (v_{t+1} - A^{t+1} v_0) - A (v_t - A^t v_0),$$
$$\Sigma'_t := \begin{pmatrix} \Sigma + A \Sigma_t A^\top & A \Sigma_t \\ \Sigma_t A^\top & \Sigma_t \end{pmatrix}, \qquad (\Sigma'_t)^{-1} = \begin{pmatrix} \Sigma^{-1} & -\Sigma^{-1} A \\ -A^\top \Sigma^{-1} & \Sigma_t^{-1} + A^\top \Sigma^{-1} A \end{pmatrix},$$
and $|\Sigma| |\Sigma_t| = |\Sigma'_t|$ to $P(v_{t+1}, v_t)$, we have
$$P(v_{t+1}, v_t) = \mathcal{N}(v'_t \mid \mu'_t, \Sigma'_t),$$
where
$$v'_t := \begin{pmatrix} v_{t+1} \\ v_t \end{pmatrix}, \qquad \mu'_t := \begin{pmatrix} A^{t+1} v_0 \\ A^t v_0 \end{pmatrix}, \qquad \Sigma'_t := \begin{pmatrix} \Sigma + A \Sigma_t A^\top & A \Sigma_t \\ \Sigma_t A^\top & \Sigma_t \end{pmatrix}.$$
From this joint probability distribution $P(v_{t+1}, v_t)$, we derive the marginal distributions
$$P(x_{t+1}, x_t) = \mathcal{N}(x_{t+1}, x_t \mid \mu'_{t,0}, \Sigma'_{t,0}), \qquad P(y_{t+1}, y_t) = \mathcal{N}(y_{t+1}, y_t \mid \mu'_{t,1}, \Sigma'_{t,1}),$$
where for $i = 0, 1$ the mean vectors and covariance matrices are defined as follows:
$$\mu'_{t,i} := \begin{pmatrix} e_i^\top A^{t+1} v_0 \\ e_i^\top A^t v_0 \end{pmatrix} = (I_2 \otimes e_i^\top)\, \mu'_t, \qquad \Sigma'_{t,i} := \begin{pmatrix} e_i^\top (\Sigma + A \Sigma_t A^\top) e_i & e_i^\top A \Sigma_t e_i \\ e_i^\top \Sigma_t A^\top e_i & e_i^\top \Sigma_t e_i \end{pmatrix} = (I_2 \otimes e_i^\top)\, \Sigma'_t\, (I_2 \otimes e_i),$$
with the unit vectors $e_0 := (1, 0)^\top$, $e_1 := (0, 1)^\top$.
Thus, we have the joint entropy of $x_t$ and $x_{t+1}$
$$H(x_{t+1}, x_t) = \frac{1}{2} \log |2 \pi e\, \Sigma'_{t,0}| = \frac{1}{2} \log \left( (2 \pi e)^2 \left| e_0^\top (\Sigma + A \Sigma_t A^\top) e_0 \cdot e_0^\top \Sigma_t e_0 - (e_0^\top A \Sigma_t e_0)^2 \right| \right),$$
and the marginal entropy of $x_t$
$$H(x_t) = \frac{1}{2} \log \left( 2 \pi e\, |e_0^\top \Sigma_t e_0| \right).$$
Using these, we have the conditional entropy
$$H(x_{t+1} \mid x_t) = \frac{1}{2} \log \left( 2 \pi e\, \frac{e_0^\top (\Sigma + A \Sigma_t A^\top) e_0 \cdot e_0^\top \Sigma_t e_0 - (e_0^\top A \Sigma_t e_0)^2}{|e_0^\top \Sigma_t e_0|} \right).$$
By the stability of the VAR, the Lyapunov Equation (4) holds, and this conditional entropy in the limit $t \to \infty$ is
$$\lim_{t \to \infty} H(x_{t+1} \mid x_t) = \frac{1}{2} \log \left( 2 \pi e\, \frac{(e_0^\top \hat{\Sigma} e_0)^2 - (e_0^\top A \hat{\Sigma} e_0)^2}{|e_0^\top \hat{\Sigma} e_0|} \right).$$
Applying Definition 2 and denoting the entries of the stationary covariance matrix by $\hat{\Sigma} = \begin{pmatrix} \hat{\sigma}_{0,0} & \hat{\sigma}_{0,1} \\ \hat{\sigma}_{1,0} & \hat{\sigma}_{1,1} \end{pmatrix}$, we have
$$G_0 = 2 T_{y \to x} = \log \frac{(\hat{\sigma}_{0,0})^2 - (a_{0,0} \hat{\sigma}_{0,0} + a_{0,1} \hat{\sigma}_{1,0})^2}{\hat{\sigma}_{0,0}\, \sigma_{0,0}}. \tag{13}$$
Similarly, we have
$$G_1 = 2 T_{x \to y} = \log \frac{(\hat{\sigma}_{1,1})^2 - (a_{1,0} \hat{\sigma}_{0,1} + a_{1,1} \hat{\sigma}_{1,1})^2}{\hat{\sigma}_{1,1}\, \sigma_{1,1}}. \tag{14}$$
Let us define for $i = 0, 1$
$$\delta_i := (\hat{\sigma}_{i,i})^2 - (a_{i,i} \hat{\sigma}_{i,i} + a_{i,1-i} \hat{\sigma}_{1-i,i})^2 \quad \text{and} \quad \delta'_i := \hat{\sigma}_{i,i}\, \sigma_{i,i}.$$
By the Lyapunov equation, $\sigma_{i,i} = \hat{\sigma}_{i,i} - e_i^\top A \hat{\Sigma} A^\top e_i$. Noting that $a_{i,i} \hat{\sigma}_{i,i} + a_{i,1-i} \hat{\sigma}_{1-i,i} = e_i^\top A \hat{\Sigma} e_i$, we can write both quantities as quadratic forms in the rows of $A$:
$$\delta_i = \hat{\sigma}_{i,i}^2 - e_i^\top A \left( \hat{\Sigma} e_i e_i^\top \hat{\Sigma} \right) A^\top e_i,$$
$$\delta'_i = \hat{\sigma}_{i,i}^2 - e_i^\top A \left( \hat{\sigma}_{i,i}\, \hat{\Sigma} \right) A^\top e_i.$$
As $\hat{\sigma}_{i,i} \hat{\Sigma} - \hat{\Sigma} e_i e_i^\top \hat{\Sigma} = (\det \hat{\Sigma})\, e_{1-i} e_{1-i}^\top$, we have $\delta_i - \delta'_i = a_{i,1-i}^2 \det \hat{\Sigma}$, and with $G_i = \log \left( 1 + \frac{\delta_i - \delta'_i}{\delta'_i} \right)$ we obtain
$$G_i = \log \left( 1 + \frac{a_{i,1-i}^2 \det \hat{\Sigma}}{\hat{\sigma}_{i,i}\, \sigma_{i,i}} \right).$$
 □
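The closed form in Lemma 2 is easy to evaluate once $\hat{\Sigma}$ is available, and the two expressions derived in the proof can be cross-checked numerically. The sketch below (illustrative parameter values; NumPy assumed) computes $G_0$ and $G_1$ from $(A, \Sigma)$:

```python
import numpy as np

# Hypothetical stable VAR parameters (illustrative values)
A = np.array([[0.5, 0.2],
              [0.1, 0.4]])
Sigma = np.array([[1.0, 0.3],
                  [0.3, 1.0]])

def stationary_covariance(A, Sigma):
    """Stationary covariance via the vectorized Lyapunov equation (5)."""
    n = A.shape[0]
    vec_S = np.linalg.solve(np.eye(n * n) - np.kron(A, A),
                            Sigma.reshape(n * n, order="F"))
    return vec_S.reshape((n, n), order="F")

def granger_causality(A, Sigma):
    """Lemma 2: G_i = log(1 + a_{i,1-i}^2 det(S_hat) / (S_hat_ii * Sigma_ii))."""
    S = stationary_covariance(A, Sigma)
    dS = np.linalg.det(S)
    G = [np.log(1 + A[i, 1 - i] ** 2 * dS / (S[i, i] * Sigma[i, i]))
         for i in (0, 1)]
    return G, S

(G0, G1), S_hat = granger_causality(A, Sigma)
```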
The Granger causality has lower and upper bounds in theory. Although these bounds may be further narrowed by considering the stability of the VAR, the following are the theoretical bounds regardless of the stability of the VAR.
Lemma 3
(The upper and lower bounds for Granger causality). For each $i = 0, 1$, the Granger causality $G_i$ has the following bounds:
$$0 \le G_i \le \log \gamma_i,$$
where $\gamma_i := \frac{\hat{\sigma}_{i,i}}{\sigma_{i,i}} \ge 1$ due to the Lyapunov Equation (4). The lower bound $G_i = 0$ is attained only if
$$a_{i,1-i}^2 \det \hat{\Sigma} = 0.$$
The upper bound $G_i = \log \gamma_i$ is attained only if
$$a_{i,i} \hat{\sigma}_{i,i} + a_{i,1-i} \hat{\sigma}_{1-i,i} = 0.$$
Proof. 
As the stationary covariance matrix is positive (semi-)definite, $\det \hat{\Sigma} \ge 0$. Thus, the lower bound of Granger causality is $G_i \ge \log(1) = 0$, and this bound is reachable only when $a_{i,1-i}^2 \det \hat{\Sigma} = 0$.
Modifying (13) and (14), for $i = 0, 1$ we have
$$(a_{i,i} \hat{\sigma}_{i,i} + a_{i,1-i} \hat{\sigma}_{1-i,i})^2 = \hat{\sigma}_{i,i} \left( \hat{\sigma}_{i,i} - \sigma_{i,i}\, e^{G_i} \right).$$
As $(a_{i,i} \hat{\sigma}_{i,i} + a_{i,1-i} \hat{\sigma}_{1-i,i})^2 \ge 0$ and $\hat{\sigma}_{i,i} > 0$,
$$G_i \le \log \gamma_i.$$
This upper bound is attained only if $a_{i,i} \hat{\sigma}_{i,i} + a_{i,1-i} \hat{\sigma}_{1-i,i} = 0$. □
The upper bound in Lemma 3 can also be obtained from the following information-theoretic identity:
$$\lim_{t \to \infty} \left( I(x_{t-1}; x_t) + I(x_t; y_{t-1} \mid x_{t-1}) \right) = \lim_{t \to \infty} I(x_t; x_{t-1}, y_{t-1}),$$
where $\lim_{t \to \infty} I(x_t; y_{t-1} \mid x_{t-1}) = \frac{1}{2} G_0$ is the transfer entropy, $\lim_{t \to \infty} I(x_t; x_{t-1}, y_{t-1}) = \frac{1}{2} \log \frac{\hat{\sigma}_{0,0}}{\sigma_{0,0}}$, and
$$\lim_{t \to \infty} I(x_{t-1}; x_t) = \frac{1}{2} \log \frac{\hat{\sigma}_{0,0}^2}{\hat{\sigma}_{0,0}^2 - (e_0^\top A \hat{\Sigma} e_0)^2} = \frac{1}{2} \left( \log \gamma_0 - G_0 \right).$$
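The bounds of Lemma 3 can be spot-checked numerically over randomly drawn stable VARs. In the sketch below (NumPy assumed), every drawn model should satisfy $0 \le G_i \le \log \gamma_i$ up to floating-point tolerance:

```python
import numpy as np

rng = np.random.default_rng(1)

def stationary_covariance(A, Sigma):
    vec_S = np.linalg.solve(np.eye(4) - np.kron(A, A),
                            Sigma.reshape(4, order="F"))
    return vec_S.reshape((2, 2), order="F")

results = []  # pairs (G_i, log gamma_i) over random stable VARs
while len(results) < 400:
    A = rng.uniform(-1, 1, (2, 2))
    if np.max(np.abs(np.linalg.eigvals(A))) >= 0.99:
        continue  # keep only clearly stable coefficient matrices
    L = rng.uniform(-1, 1, (2, 2))
    Sigma = L @ L.T + 0.1 * np.eye(2)  # random positive-definite base covariance
    S = stationary_covariance(A, Sigma)
    for i in (0, 1):
        G = np.log(1 + A[i, 1 - i] ** 2 * np.linalg.det(S)
                   / (S[i, i] * Sigma[i, i]))
        results.append((G, np.log(S[i, i] / Sigma[i, i])))
```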

4. Stability and Constraints of VAR

In this study, we primarily consider the class of stable VAR models with a proper set of parameters. In this class, the statistical nature of any VAR is characterized by the base covariance matrix $\Sigma \in \mathbb{R}^{2 \times 2}$, the coefficient matrix $A \in \mathbb{R}^{2 \times 2}$, and the stationary covariance matrix $\hat{\Sigma} \in \mathbb{R}^{2 \times 2}$. Let us denote the set of (strictly) positive-definite matrices by
$$\mathbb{R}^{2 \times 2}_{+} := \left\{ M \in \mathbb{R}^{2 \times 2} \mid \det M > 0 \text{ and } \operatorname{tr} M > 0 \right\},$$
and the set of coefficient matrices of stable VAR models by
$$\mathbb{R}^{2 \times 2}_{*} := \left\{ M \in \mathbb{R}^{2 \times 2} \mid |\operatorname{tr} M| - 1 < \det M < 1 \right\}.$$
We will briefly show that the stable set $\mathbb{R}^{2 \times 2}_{*}$ includes all and only the coefficient matrices of stable bivariate VAR models.
With this notation, the two conditions that any proper VAR model needs to satisfy are as follows.
Stability 
Any stable VAR model has both eigenvalues $\lambda_0, \lambda_1$ of its coefficient matrix $A$ meeting $|\lambda_0|, |\lambda_1| < 1$.
Properness 
To have a proper (non-degenerate) bivariate normal distribution in a VAR model, its base covariance matrix $\Sigma$ and stationary covariance matrix $\hat{\Sigma}$ need to satisfy $\Sigma, \hat{\Sigma} \in \mathbb{R}^{2 \times 2}_{+}$. The set of positive-definite matrices is equivalently written in terms of the entries of a matrix as
$$\mathbb{R}^{2 \times 2}_{+} = \left\{ C \in \mathbb{R}^{2 \times 2} \mid C_{0,0} > 0,\; C_{1,1} > 0,\; \text{and } C_{0,0} C_{1,1} - C_{0,1} C_{1,0} > 0 \right\}.$$

4.1. Stability of VAR

As stated previously in Section 2.2, the stability of a VAR is primarily characterized by the eigenvalues of the coefficient matrix $A$. However, this condition is equivalent to $A \in \mathbb{R}^{2 \times 2}_{*}$, as shown by the following lemma.
Lemma 4.
A given bivariate VAR model with its coefficient matrix $A \in \mathbb{R}^{2 \times 2}$ is stable if and only if
$$|\operatorname{tr} A| - 1 < \det A < 1. \tag{23}$$
Proof. 
Let $\lambda$ be an eigenvalue of the coefficient matrix $A$. Such an eigenvalue satisfies
$$f(\lambda) = |A - \lambda I_2| = \lambda^2 - \operatorname{tr} A\, \lambda + \det A = 0. \tag{24}$$
If a VAR is stable, every such eigenvalue needs to satisfy $|\lambda| < 1$. As (24) is rewritten as
$$f(\lambda) = \left( \lambda - \frac{\operatorname{tr} A}{2} \right)^2 - \frac{1}{4} \left( \operatorname{tr}^2 A - 4 \det A \right), \tag{25}$$
we analyze this condition on (24) for the following two cases, with $\lambda$ being real or non-real:
  • If $\lambda$ is real, the stability condition is equivalent to
$$\operatorname{tr}^2 A \ge 4 \det A, \quad f(1) > 0, \quad f(-1) > 0, \quad |\operatorname{tr} A| < 2. \tag{26}$$
  • If $\lambda$ is not real, the stability condition is equivalent to
$$\operatorname{tr}^2 A < 4 \det A, \quad |\lambda|^2 < 1. \tag{27}$$
If $\lambda$ of (24) is non-real (the second case), $\lambda$ (and its conjugate) is
$$\lambda = \frac{1}{2} \operatorname{tr} A \pm \frac{j}{2} \sqrt{\left| \operatorname{tr}^2 A - 4 \det A \right|}, \tag{28}$$
with the imaginary unit denoted by $j$.
With the inequality (27), the stability condition in this case is
$$\left( \frac{\operatorname{tr} A}{2} \right)^2 < |\lambda|^2 = \det A < 1. \tag{29}$$
If $\lambda$ of (24) is real, $\operatorname{tr}^2 A - 4 \det A \ge 0$ and
$$f(1) = 1 - \operatorname{tr} A + \det A > 0, \tag{30}$$
$$f(-1) = 1 + \operatorname{tr} A + \det A > 0, \tag{31}$$
$$|\operatorname{tr} A| < 2. \tag{32}$$
Combining (30) and (31), we have $|\operatorname{tr} A| - 1 < \det A$. This inequality, together with (26), gives
$$C_0 < \det A \le \left( \frac{\operatorname{tr} A}{2} \right)^2 < C_1, \tag{33}$$
where $C_0 := |\operatorname{tr} A| - 1$ and $C_1 := \min \left\{ 1, \left( \frac{1 + \det A}{2} \right)^2 \right\}$. Note that for an arbitrary $A \in \mathbb{R}^{2 \times 2}$ we have the following two inequalities:
$$|\operatorname{tr} A| - 1 \le \left( \frac{\operatorname{tr} A}{2} \right)^2 \tag{34}$$
and
$$\det A \le \left( \frac{1 + \det A}{2} \right)^2. \tag{35}$$
The inequality (34) holds with equality if and only if $|\operatorname{tr} A| = 2$, and the inequality (35) holds with equality if and only if $\det A = 1$. As neither of these equality conditions holds under (29), (29) is equivalent to
$$C_0 < \left( \frac{\operatorname{tr} A}{2} \right)^2 < \det A < C_1. \tag{36}$$
Integrating the two inequalities, (33) for real $\lambda$ and (36) for non-real $\lambda$, the VAR with the coefficient matrix $A$ is stable if and only if
$$C_0 < \det A < C_1 \tag{37}$$
and
$$C_0 < \left( \frac{\operatorname{tr} A}{2} \right)^2 < C_1. \tag{38}$$
As the inequality (37) implies $0 < \frac{1 + \det A}{2} < 1$ and $|\operatorname{tr} A| < 2$, (38) is equivalent to
$$\left( \frac{\operatorname{tr} A}{2} \right)^2 < \left( \frac{1 + \det A}{2} \right)^2. \tag{39}$$
As the upper bound for $\det A$ in (37) reduces to $\det A < 1$, the pair is equivalent to
$$|\operatorname{tr} A| - 1 < \det A < 1. \tag{40}$$
Thus, the pair of inequalities (37) and (38) for $A$ is equivalent to the single inequality (40) for $A$. □
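Lemma 4 gives a purely algebraic stability test that avoids computing eigenvalues. The sketch below (NumPy assumed) compares the trace-determinant criterion with a direct eigenvalue check on random matrices:

```python
import numpy as np

rng = np.random.default_rng(2)

def stable_by_eigenvalues(A):
    """Both eigenvalues strictly inside the unit circle."""
    return bool(np.all(np.abs(np.linalg.eigvals(A)) < 1))

def stable_by_lemma4(A):
    """Lemma 4: |tr A| - 1 < det A < 1."""
    t, d = np.trace(A), np.linalg.det(A)
    return abs(t) - 1 < d < 1

# The two criteria should agree on (almost surely) every random draw
agree = all(stable_by_eigenvalues(M) == stable_by_lemma4(M)
            for M in rng.uniform(-2, 2, (5000, 2, 2)))
```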

4.2. Stability and Existence of the Solution for the Lyapunov Equation

Intuitively, one would expect that there is a stationary covariance matrix $\hat{\Sigma} \in \mathbb{R}^{2 \times 2}_{+}$ satisfying the Lyapunov Equation (4) whenever the coefficient matrix is $A \in \mathbb{R}^{2 \times 2}_{*}$. This is true, but not trivial, and the opposite does not always hold: the existence of $\hat{\Sigma} \in \mathbb{R}^{2 \times 2}_{+}$ does not imply $A \in \mathbb{R}^{2 \times 2}_{*}$. This relationship between $A$ and $\hat{\Sigma}$ is stated by the following Theorem 1.
Theorem 1.
There is a stationary covariance matrix $\hat{\Sigma} \in \mathbb{R}^{2 \times 2}_{+}$ satisfying the Lyapunov Equation (4) if the coefficient matrix is $A \in \mathbb{R}^{2 \times 2}_{*}$. However, the existence of $\hat{\Sigma} \in \mathbb{R}^{2 \times 2}_{+}$ does not imply $A \in \mathbb{R}^{2 \times 2}_{*}$.
Proof. 
Note the identity
$$\det \left( I_4 - A \otimes A \right) = \det \left( (I_2 - a_{0,0} A)(I_2 - a_{1,1} A) - a_{0,1} a_{1,0} A^2 \right) = \det \left( I_2 - \operatorname{tr}(A)\, A + \det(A)\, A^2 \right) = \left( 1 - \det(A) \right)^2 \left( (1 + \det(A))^2 - \operatorname{tr}(A)^2 \right). \tag{41}$$
By Lemma 4 and (41), we have $\det(I_4 - A \otimes A) > 0$ for any stable VAR. Thus, the Lyapunov Equation (5) has a solution for $\hat{\Sigma}$, as the matrix $I_4 - A \otimes A$ is invertible. The converse of this theorem does not hold, as one can construct a counter-example of a coefficient matrix $A$ with $\det(I_4 - A \otimes A) < 0$, for which there is a $\hat{\Sigma} \in \mathbb{R}^{2 \times 2}_{+}$, but such a VAR is not stable. □

5. Design of Bivariate Timeseries Given GCs

The goal of this study is to derive a design principle for bivariate timeseries generated by a VAR model with a desired correlation and two types of Granger causality. In this section, we explore the inter-dependent relationships among the variables in the VAR. This analysis reveals a trade-off limitation in designing these variables of a timeseries. Specifically, a timeseries with certain ranges of desired Granger causality cannot be realized by any stable VAR, and without stability no stationary covariance is defined in theory.
The set of parameters in any stable VAR model includes
  • The coefficient matrix $A$;
  • The base covariance matrix $\Sigma$;
  • The stationary covariance matrix $\hat{\Sigma}$; and
  • The two types of Granger causality $G_0, G_1$.
There are equality constraints on these variables:
  • The variables $A, \Sigma, \hat{\Sigma}$ need to satisfy the Lyapunov Equation (4).
  • The Granger causality $G_i$ ($i = 0, 1$) is a function of $a_{i,1-i}$, $\sigma_{i,i}$, and $\hat{\Sigma}$ (Lemma 2).
Besides, it is important to know the feasibility of a set of parameters in a VAR, which constrains the range of these variables:
  • Stability: $A \in \mathbb{R}^{2 \times 2}_{*}$ (Section 4);
  • Properness: $\Sigma, \hat{\Sigma} \in \mathbb{R}^{2 \times 2}_{+}$ (Section 4) and $\sigma_{i,i} \le \hat{\sigma}_{i,i}$ due to the existence of a solution for the Lyapunov equation; and
  • The bounds for each Granger causality: $G_i \in [0, \log \gamma_i]$ (Lemma 3).
The Lyapunov Equation (4) on the matrices can be decomposed into three equations on scalar variables as follows. For a coefficient matrix $A = \begin{pmatrix} a_{0,0} & a_{0,1} \\ a_{1,0} & a_{1,1} \end{pmatrix}$, let us define two vectors by
$$a_0 := \begin{pmatrix} a_{0,0} \\ a_{0,1} \end{pmatrix}, \qquad a_1 := \begin{pmatrix} a_{1,0} \\ a_{1,1} \end{pmatrix}.$$
The Lyapunov equation is then equivalently written with these vectors $a_0, a_1$ as the set of three equations
$$\hat{\sigma}_{0,0} - \sigma_{0,0} = a_0^\top \hat{\Sigma} a_0, \tag{42}$$
$$\hat{\sigma}_{1,1} - \sigma_{1,1} = a_1^\top \hat{\Sigma} a_1, \tag{43}$$
$$\hat{\sigma}_{0,1} - \sigma_{0,1} = a_0^\top \hat{\Sigma} a_1. \tag{44}$$
Equations (42) and (43) imply that each of the vectors $a_0$ and $a_1$ lies on an ellipse in its own plane. This gives the bound $\sigma_{i,i} \le \hat{\sigma}_{i,i}$ (i.e., one condition of the properness above), as $x^\top \hat{\Sigma} x \ge 0$ for any $x \in \mathbb{R}^2$ with a positive-definite matrix $\hat{\Sigma}$.
Fixing $G_0$ and $G_1$ places each of the two vectors $a_0$ and $a_1$ on one of two parallel lines given by
$$(a_i^\top \hat{\sigma}_i)^2 = \tau_i^2, \tag{45}$$
where
$$\hat{\sigma}_i := (\hat{\sigma}_{i,0}, \hat{\sigma}_{i,1})^\top, \qquad \tau_i^2 := \hat{\sigma}_{i,i}^2 \left( 1 - \gamma_i^{-1} e^{G_i} \right).$$
Thus, the solutions for $a_i$ which satisfy the Lyapunov equation and the fixed Granger causality are the four intersections of the ellipse and the two parallel lines (Figure 2). This ellipse is obtained by a scaling and shearing transformation of the standard circle $a_{0,0}^2 + a_{0,1}^2 = 1$. This observation gives the angular parametrization of the solution vector $(a_{0,0}, a_{0,1})$, which is explicitly stated by Lemma 5 in the next section.

5.1. Solution A of the Lyapunov Equality Given Σ ^ , G 0 , and G 1

In what follows, we start with the derivation of the coefficient matrix $A$ as a root of the equality constraints given by the Lyapunov Equation (4) and the Granger causality, for fixed proper $\hat{\Sigma}$, $\sigma_{i,i}$, and $G_i$ for each $i = 0, 1$. The following Lemma 5 gives a necessary condition for the coefficient matrix $A \in \mathbb{R}^{2 \times 2}$ to satisfy the equality conditions above. Note, however, that such a solution $A$ does not guarantee the stability of the corresponding VAR (i.e., $A \in \mathbb{R}^{2 \times 2}_{*}$). This sufficiency is explored in Section 5.2.
Lemma 5.
For a given set of parameters, a positive-definite matrix $\hat{\Sigma} = \begin{pmatrix} \hat{\sigma}_{0,0} & \hat{\sigma}_{0,1} \\ \hat{\sigma}_{1,0} & \hat{\sigma}_{1,1} \end{pmatrix} \in \mathbb{R}^{2 \times 2}_{+}$, $\sigma_{i,i} \in (0, \hat{\sigma}_{i,i})$, and $G_i \in [0, \log \gamma_i]$ for each $i = 0, 1$, suppose that a coefficient matrix $A = \begin{pmatrix} a_{0,0} & a_{0,1} \\ a_{1,0} & a_{1,1} \end{pmatrix} \in \mathbb{R}^{2 \times 2}$ satisfies the set of equations
$$a_0^\top \hat{\Sigma} a_0 = \hat{\sigma}_{0,0} - \sigma_{0,0}, \quad a_1^\top \hat{\Sigma} a_1 = \hat{\sigma}_{1,1} - \sigma_{1,1}, \quad (\hat{\sigma}_0^\top a_0)^2 = \tau_0^2, \quad (\hat{\sigma}_1^\top a_1)^2 = \tau_1^2, \tag{46}$$
where for $i = 0, 1$
$$\hat{\sigma}_i := (\hat{\sigma}_{i,0}, \hat{\sigma}_{i,1})^\top, \qquad \tau_i^2 := \hat{\sigma}_{i,i}^2 \left( 1 - \gamma_i^{-1} e^{G_i} \right).$$
Any coefficient matrix $A$ that is a root of Equation (46) is of the form
$$A = \begin{pmatrix} \left( S_0 (\cos \theta_0, \sin \theta_0)^\top \right)^\top \\ \left( P_2 S_1 (\cos \theta_1, \sin \theta_1)^\top \right)^\top \end{pmatrix}, \tag{47}$$
where each pair of angles $\theta_0 \in [0, 2\pi)$ and $\theta_1 \in [0, 2\pi)$ takes one of the two or four pairs satisfying, for each $i = 0, 1$,
$$\sin^2 \theta_i = \frac{e^{G_i} - 1}{\gamma_i - 1} \tag{48}$$
and
$$P_2 := \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad S_i := \sqrt{1 - \gamma_i^{-1}} \begin{pmatrix} 1 & -\frac{\hat{\sigma}_{i,1-i}}{\hat{\sigma}_{i,i}} \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 1 & 0 \\ 0 & \frac{\hat{\sigma}_{i,i}}{\sqrt{\det \hat{\Sigma}}} \end{pmatrix}.$$
Proof. 
Note that the following pair of equations in (46) is symmetric under the exchange of $i = 0, 1$:
$$a_i^\top \hat{\Sigma} a_i = \hat{\sigma}_{i,i} - \sigma_{i,i}, \qquad (\hat{\sigma}_i^\top a_i)^2 = \tau_i^2. \tag{49}$$
Thus, we solve this for $i = 0$ below, and the case $i = 1$ follows by symmetry.
Solving the second equation of (49) for $a_{0,0}$, we have
$$a_{0,0} = \frac{\pm \tau_0 - a_{0,1} \hat{\sigma}_{0,1}}{\hat{\sigma}_{0,0}}. \tag{50}$$
Inserting this into the first equation of (49), we have
$$a_{0,1}^2 = \frac{\hat{\sigma}_{0,0}^2 (1 - \gamma_0^{-1}) - \tau_0^2}{\det \hat{\Sigma}}. \tag{51}$$
Inserting $\tau_0^2 = \hat{\sigma}_{0,0}^2 \left( 1 - \gamma_0^{-1} e^{G_0} \right)$, we have
$$a_{0,1} = \pm \sqrt{\frac{\hat{\sigma}_{0,0}\, \sigma_{0,0} \left( e^{G_0} - 1 \right)}{\det \hat{\Sigma}}}. \tag{52}$$
Inserting this into (50), we have at most four vectors $a_0 = (a_{0,0}, a_{0,1})^\top$ as the solutions of (49) for $i = 0$:
$$a_0 \in \left\{ \begin{pmatrix} c_0 - \frac{\hat{\sigma}_{0,1}}{\hat{\sigma}_{0,0}} d_0 \\ d_0 \end{pmatrix}, \begin{pmatrix} c_0 + \frac{\hat{\sigma}_{0,1}}{\hat{\sigma}_{0,0}} d_0 \\ -d_0 \end{pmatrix}, \begin{pmatrix} -c_0 - \frac{\hat{\sigma}_{0,1}}{\hat{\sigma}_{0,0}} d_0 \\ d_0 \end{pmatrix}, \begin{pmatrix} -c_0 + \frac{\hat{\sigma}_{0,1}}{\hat{\sigma}_{0,0}} d_0 \\ -d_0 \end{pmatrix} \right\}, \tag{53}$$
where for $i = 0, 1$
$$c_i := \sqrt{1 - \gamma_i^{-1} e^{G_i}}, \qquad d_i := \sqrt{\frac{\hat{\sigma}_{i,i}\, \sigma_{i,i} \left( e^{G_i} - 1 \right)}{\det \hat{\Sigma}}}.$$
By the symmetry between $i = 0$ and $i = 1$, there are at most four vectors as the solutions of (49) for $i = 1$:
$$P_2 a_1 = \begin{pmatrix} a_{1,1} \\ a_{1,0} \end{pmatrix} \in \left\{ \begin{pmatrix} c_1 - \frac{\hat{\sigma}_{1,0}}{\hat{\sigma}_{1,1}} d_1 \\ d_1 \end{pmatrix}, \begin{pmatrix} c_1 + \frac{\hat{\sigma}_{1,0}}{\hat{\sigma}_{1,1}} d_1 \\ -d_1 \end{pmatrix}, \begin{pmatrix} -c_1 - \frac{\hat{\sigma}_{1,0}}{\hat{\sigma}_{1,1}} d_1 \\ d_1 \end{pmatrix}, \begin{pmatrix} -c_1 + \frac{\hat{\sigma}_{1,0}}{\hat{\sigma}_{1,1}} d_1 \\ -d_1 \end{pmatrix} \right\}.$$
Note that these four solution vectors, parameterized by
$$a_0 = S_0 \begin{pmatrix} \cos \theta_0 \\ \sin \theta_0 \end{pmatrix} \quad \text{and} \quad a_1 = P_2 S_1 \begin{pmatrix} \cos \theta_1 \\ \sin \theta_1 \end{pmatrix},$$
satisfy (49) if $(\theta_0, \theta_1)$ satisfies (48), using the trigonometric identity $\cos^2 \theta_i + \sin^2 \theta_i = 1$ and
$$S_0^\top \hat{\Sigma} S_0 = (\hat{\sigma}_{0,0} - \sigma_{0,0})\, I_2 \quad \text{and} \quad S_1^\top P_2^\top \hat{\Sigma} P_2 S_1 = (\hat{\sigma}_{1,1} - \sigma_{1,1})\, I_2. \; \square$$
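The key property behind this parametrization is that $S_0$ simultaneously whitens $\hat{\Sigma}$ and scales it to $(\hat{\sigma}_{0,0} - \sigma_{0,0}) I_2$, so that any unit vector $(\cos \theta_0, \sin \theta_0)^\top$ mapped through $S_0$ satisfies the first equation of (49). This can be checked numerically with hypothetical values (NumPy assumed):

```python
import numpy as np

# Hypothetical stationary covariance and base variance (illustrative values)
S_hat = np.array([[1.5, 0.4],
                  [0.4, 1.2]])
sigma00 = 0.9                      # base variance, with sigma00 < S_hat[0, 0]
gamma0 = S_hat[0, 0] / sigma00
d = np.linalg.det(S_hat)

# S_0 = sqrt(1 - gamma0^{-1}) * shear * scaling, as in Lemma 5
shear = np.array([[1.0, -S_hat[0, 1] / S_hat[0, 0]],
                  [0.0, 1.0]])
scaling = np.array([[1.0, 0.0],
                    [0.0, S_hat[0, 0] / np.sqrt(d)]])
S0 = np.sqrt(1 - 1 / gamma0) * shear @ scaling

M = S0.T @ S_hat @ S0   # should equal (S_hat[0,0] - sigma00) * I_2
```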

5.2. Sufficiency of the Solution

For a given set of parameters, Lemma 5 in the previous section gives a set of solutions for the coefficient matrix $A$ of the Lyapunov equation. Note that not all of these solutions $A$ are feasible, in the sense that they satisfy all the constraints, such as the stability $A \in \mathbb{R}^{2 \times 2}_{*}$ and the properness $\Sigma \in \mathbb{R}^{2 \times 2}_{+}$, where $\Sigma$ is derived from the Lyapunov Equation (4) given $A$ and $\hat{\Sigma}$. The following lemmas provide the sufficient conditions for a solution $A$ by checking the properness of $\Sigma$ and the stability of $A$.
Lemma 6.
Suppose $A$ is a solution of Equation (46) in Lemma 5, represented by a pair of angles $(\theta_0, \theta_1)$. In this case, $\Sigma \in \mathbb{R}^{2 \times 2}_{+}$ if and only if
$$\cos \hat{\eta} - (\gamma_0 \gamma_1)^{-\frac{1}{2}} \le \hat{\gamma}_0 \hat{\gamma}_1 \cos(\hat{\eta} - \theta_0 - \theta_1) \le \cos \hat{\eta} + (\gamma_0 \gamma_1)^{-\frac{1}{2}}, \tag{56}$$
where $\hat{\eta} \in [0, 2\pi]$ is the angular parametrization of the correlation coefficient, defined by $\cos \hat{\eta} := \frac{\hat{\sigma}_{0,1}}{\sqrt{\hat{\sigma}_{0,0}\, \hat{\sigma}_{1,1}}}$, and $\hat{\gamma}_i := \sqrt{1 - \gamma_i^{-1}}$.
Proof. 
Applying the polar representation of $a_0, a_1$ in (47) in Lemma 5 to the third Equation (44) of the Lyapunov equation, we have
$$\hat{\sigma}_{0,1} - \sigma_{0,1} = \sqrt{(\hat{\sigma}_{0,0} - \sigma_{0,0})(\hat{\sigma}_{1,1} - \sigma_{1,1})}\, \cos(\hat{\eta} - \theta_0 - \theta_1). \tag{57}$$
By the positive definiteness of $\Sigma$, $\sigma_{0,1}^2 \le \sigma_{0,0}\, \sigma_{1,1}$. This inequality applied to (57) gives the lemma. □
If we have
$$\hat{\gamma}_0 \hat{\gamma}_1 \le \cos \hat{\eta} + (\gamma_0 \gamma_1)^{-\frac{1}{2}} \quad \text{and} \quad \cos \hat{\eta} - (\gamma_0 \gamma_1)^{-\frac{1}{2}} \le -\hat{\gamma}_0 \hat{\gamma}_1,$$
Equation (56) holds for any pair of angles $(\theta_0, \theta_1)$. This condition is
$$|\cos \hat{\eta}| \le (\gamma_0 \gamma_1)^{-\frac{1}{2}} - \hat{\gamma}_0 \hat{\gamma}_1, \tag{58}$$
or, equivalently,
$$|\hat{\sigma}_{0,1}| \le \sqrt{\sigma_{0,0}\, \sigma_{1,1}} - \sqrt{(\hat{\sigma}_{0,0} - \sigma_{0,0})(\hat{\sigma}_{1,1} - \sigma_{1,1})}.$$
On the other hand, (56) holds for any $0 < \hat{\eta} < \pi$ (equivalently, $-1 < \cos \hat{\eta} < 1$) if
$$\cos(\theta_0 + \theta_1) \ge \frac{(\gamma_0 \gamma_1 - 1) + (\gamma_0 - 1)(\gamma_1 - 1)}{2 \sqrt{\gamma_0 \gamma_1 (\gamma_0 - 1)(\gamma_1 - 1)}}. \tag{59}$$
These bounds, (58) and (59), mean that the range of feasible GCs (through $\theta_0, \theta_1$) and the range of the feasible correlation $\cos \hat{\eta}$ are in a trade-off relationship in general.
Lemma 7.
The VAR with the correlation $|\cos \hat{\eta}| < 1$ and the Granger causalities $\theta_0, \theta_1$ in the angular form is stable if and only if
$$-\sin \hat{\eta} + \left| \hat{\gamma}_0 \sin(\hat{\eta} - \theta_0) + \hat{\gamma}_1 \sin(\hat{\eta} - \theta_1) \right| < \hat{\gamma}_0 \hat{\gamma}_1 \sin(\hat{\eta} - \theta_0 - \theta_1) < \sin \hat{\eta}. \tag{60}$$
Proof. 
Using the angular notation of the solution $A$ with $\theta_0, \theta_1$, we have
$$\det A = \frac{\hat{\gamma}_0 \hat{\gamma}_1 \sin(\hat{\eta} - \theta_0 - \theta_1)}{\sin \hat{\eta}} \quad \text{and} \quad \operatorname{tr} A = \frac{\hat{\gamma}_0 \sin(\hat{\eta} - \theta_0) + \hat{\gamma}_1 \sin(\hat{\eta} - \theta_1)}{\sin \hat{\eta}}.$$
Inserting these into the stability condition (23), we have the inequality (60). □
Like Lemma 6, Lemma 7 leads to a trade-off relationship between $\hat{\Sigma}$, through the angle $\hat{\eta}$, and $A$, through the angles $\theta_0, \theta_1$. In general, this bound further narrows the upper and lower bounds given by Lemma 3. The correlation is limited to values close to zero if one wishes for higher Granger causality. Conversely, the two types of Granger causality are limited to values close to zero if one wishes for a correlation of higher absolute value.

6. Concluding Remarks

6.1. Summary and Potential Usage of the Algorithm

In this paper, we explored the relationship between the VAR parameters and the timeseries statistics (Figure 1), and identified the trade-off limitation between the stationary covariance $\hat{\Sigma}$ and the Granger causalities $G_0, G_1$ (Lemma 6). This suggests that the following Algorithm 1 will generate a timeseries with the desired statistics.
Algorithm 1: Compute a VAR parameter set for the desired statistics
Data: Desired timeseries statistics $(G_0, G_1, \hat{\Sigma}, \sigma_{0,0}, \sigma_{1,1})$ in the feasible range, satisfying both inequalities (56) in Lemma 6 and (60) in Lemma 7.
1 Derive the four sets of VAR parameters $(A, \Sigma)$ for the given timeseries statistics $(G_0, G_1, \hat{\Sigma}, \sigma_{0,0}, \sigma_{1,1})$ by Lemma 5.
2 Choose one of the four sets of VAR parameters $(A, \Sigma)$.
Result: The VAR parameters $(A, \Sigma)$.
This data-generation algorithm can be used to generate surrogate data [19], which can be used to test whether an empirical timeseries is a sample from a VAR with a given correlation and Granger causality. This algorithm is also useful in analyzing to what extent a class of VAR timeseries varies under the same statistics.
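As a sketch of how a parameter set $(A, \Sigma)$ returned by the algorithm might be checked and used for data generation (the derivation of Lemma 5 is not reproduced here, and the particular values of $A$ and $\Sigma$ below are hypothetical examples), one can solve the discrete Lyapunov equation $\hat\Sigma = A\hat\Sigma A^{\top} + \Sigma$ for the stationary covariance and then simulate the timeseries:

```python
import numpy as np
from scipy.linalg import solve_discrete_lyapunov

def simulate_var(A, Sigma, T, seed=0):
    """Simulate v_t = A v_{t-1} + w_t with w_t ~ N(0, Sigma), t = 1..T."""
    rng = np.random.default_rng(seed)
    L = np.linalg.cholesky(Sigma)
    v = np.zeros((T + 1, 2))
    for t in range(1, T + 1):
        v[t] = A @ v[t - 1] + L @ rng.standard_normal(2)
    return v

# Hypothetical stable VAR parameters (illustrative; not output of Lemma 5).
A = np.array([[0.5, 0.2], [0.1, 0.4]])
Sigma = np.array([[1.0, 0.3], [0.3, 1.0]])

# Stationary covariance: the unique solution of Sigma_hat = A Sigma_hat A^T + Sigma.
Sigma_hat = solve_discrete_lyapunov(A, Sigma)
v = simulate_var(A, Sigma, T=10000)
```

For a long enough simulated series, the sample covariance of `v` approaches `Sigma_hat`, which is one way to sanity-check a chosen parameter set before using it as surrogate data.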

6.2. Validity of Granger Causality Estimated on Empirical Timeseries

Our analysis also warns that not every estimated Granger causality (or transfer entropy) is “valid”: the underlying VAR model may not be stable, in which case the Granger causality is undefined in theory. In practice, one can still compute some value of Granger causality from a finite empirical timeseries even when it was generated by an unstable VAR model that has no stationary statistics. The statistics of such a timeseries diverge in the long run, but this may be difficult to detect from a finite sample. This asymmetry, namely that a numerical value of Granger causality can always be calculated but does not guarantee the stability of the underlying VAR, is explicitly demonstrated by Theorem 1. For a given empirical timeseries $v = (v_0, v_1, \dots, v_T) \in \mathbb{R}^{2 \times (T+1)}$, one should therefore compute not just the Granger causality but also check its validity, by verifying (1) $A \in \mathbb{R}_*^{2 \times 2}$, (2) $\Sigma \in \mathbb{R}_+^{2 \times 2}$, and (3) the condition relating $\Sigma_{0,1}$ and $\hat\Sigma_{0,1}$, using the maximum likelihood estimator $(A(v), \Sigma(v))$, such as
$$A(v) := V_{1,0} V_{0,0}^{-1} \quad\text{and}\quad \Sigma(v) := V_{1,1} - V_{1,0} V_{0,0}^{-1} V_{0,1},$$
where
$$V_{i,j} := T^{-1} \sum_{t=1}^{T} v_{t-1+i}\, v_{t-1+j}^{\top}.$$
In fact, $\Sigma(v)$ is always positive semi-definite, as it is the Schur complement of the positive semi-definite matrix $\begin{pmatrix} V_{0,0} & V_{0,1} \\ V_{1,0} & V_{1,1} \end{pmatrix}$. Thus, the maximum likelihood estimator readily satisfies condition (2).
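The estimator above is straightforward to implement. A minimal sketch (the function name `var_mle` is ours) that also illustrates the positive semi-definiteness of $\Sigma(v)$:

```python
import numpy as np

def var_mle(v: np.ndarray):
    """Maximum-likelihood VAR(1) estimates from a (T+1) x 2 timeseries:
    A(v) = V_{1,0} V_{0,0}^{-1},
    Sigma(v) = V_{1,1} - V_{1,0} V_{0,0}^{-1} V_{0,1},
    with lag covariances V_{i,j} = T^{-1} sum_t v_{t-1+i} v_{t-1+j}^T."""
    T = v.shape[0] - 1
    V = lambda i, j: v[i:T + i].T @ v[j:T + j] / T
    A_hat = V(1, 0) @ np.linalg.inv(V(0, 0))
    # The Schur complement V_{1,1} - V_{1,0} V_{0,0}^{-1} V_{0,1}:
    Sigma_hat = V(1, 1) - A_hat @ V(0, 1)
    return A_hat, Sigma_hat
```

Because `Sigma_hat` is a Schur complement of a positive semi-definite matrix, its eigenvalues are non-negative up to numerical error, so condition (2) holds automatically for this estimator.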

6.3. Future Work

In this paper, we limited the VAR model to the bivariate case for simplicity of analysis. We expect that the current results can be generalized to higher-dimensional VAR models, although characterizing the feasible boundaries for stable VAR models in such a generalization may require further effort.

Author Contributions

Conceptualization, S.H. and T.T.; Methodology, S.H. and T.T.; Software, S.H.; Validation, S.H. and T.T.; Formal Analysis, S.H.; Investigation, S.H.; Writing—Original Draft Preparation, S.H.; Writing—Review and Editing, S.H. and T.T.; Visualization, S.H.; Supervision, S.H.; Project Administration, S.H.; Funding Acquisition, S.H. Both authors have read and agreed to the published version of the manuscript.

Funding

Japan Society for the Promotion of Science: JP20H04994, Japan Science and Technology Agency: JPMJPR20C9.

Institutional Review Board Statement

No human participants were involved in this study.

Informed Consent Statement

No human participants were involved in this study.

Data Availability Statement

Data sharing not applicable.

Acknowledgments

This work was supported by JSPS KAKENHI Grant Number JP20H04994 and JST PRESTO Grant Number JPMJPR20C9, Japan.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Heider, F.; Simmel, M. An experimental study of apparent behavior. Am. J. Psychol. 1944, 57, 243–259.
  2. Bassili, J.N. Temporal and spatial contingencies in the perception of social events. J. Personal. Soc. Psychol. 1976, 33, 680–685.
  3. Dittrich, W.H.; Lea, S.E. Visual perception of intentional motion. Perception 1994, 23, 253–268.
  4. Tremoulet, P.D.; Feldman, J. Perception of animacy from the motion of a single object. Perception 2000, 29, 943–951.
  5. Scholl, B.J.; Tremoulet, P.D. Perceptual causality and animacy. Trends Cogn. Sci. 2000, 4, 299–308.
  6. Takahashi, K.; Watanabe, K. Synchronous motion modulates animacy perception. J. Vis. 2015, 15, 1–17.
  7. Granger, C.W. Investigating causal relations by econometric models and cross-spectral methods. Econometrica 1969, 37, 424–438.
  8. Schreiber, T. Measuring information transfer. Phys. Rev. Lett. 2000, 85, 461–464.
  9. Barnett, L.; Barrett, A.B.; Seth, A.K. Granger causality and transfer entropy are equivalent for Gaussian variables. Phys. Rev. Lett. 2009, 103, 238701.
  10. Eichler, M. Granger causality and path diagrams for multivariate time series. J. Econom. 2007, 137, 334–353.
  11. Freeman, J.R. Granger causality and the times series analysis of political relationships. Am. J. Political Sci. 1983, 27, 327–358.
  12. Joerding, W. Economic growth and defense spending: Granger causality. J. Dev. Econ. 1986, 21, 35–40.
  13. Seth, A.K. Measuring autonomy and emergence via Granger causality. Artif. Life 2010, 16, 179–196.
  14. Okazaki, S.; Hirotani, M.; Koike, T.; Bosch-Bayard, J.; Takahashi, H.K.; Hashiguchi, M.; Sadato, N. Unintentional interpersonal synchronization represented as a reciprocal visuo-postural feedback system: A multivariate autoregressive modeling approach. PLoS ONE 2015, 10, e0137126.
  15. Bressler, S.L.; Seth, A.K. Wiener–Granger causality: A well established methodology. Neuroimage 2011, 58, 323–329.
  16. Ding, M.; Chen, Y.; Bressler, S.L. Granger causality: Basic theory and application to neuroscience. In Handbook of Time Series Analysis: Recent Theoretical Developments and Applications; Wiley: Hoboken, NJ, USA, 2006; p. 437.
  17. Porta, A.; Faes, L. Wiener–Granger causality in network physiology with applications to cardiovascular control and neuroscience. Proc. IEEE 2015, 104, 282–309.
  18. Seth, A.K.; Barrett, A.B.; Barnett, L. Granger causality analysis in neuroscience and neuroimaging. J. Neurosci. 2015, 35, 3293–3297.
  19. Prichard, D.; Theiler, J. Generating surrogate data for time series with several simultaneously measured variables. Phys. Rev. Lett. 1994, 73, 951.
Figure 1. Schematic diagram of the organization of this paper.
Figure 2. The ellipse (42) and two parallel lines (45) (forming the parallelogram touching the ellipse) on the plane ( a 0 , 0 , a 0 , 1 ) R 2 . The solutions ( a 0 , a 1 ) are the four intersections of these two (depicted by the colored points). Granger causality takes its maximum with the largest | a 0 , 1 | on the ellipse and its minimum with | a 0 , 1 | = 0 .