1. Introduction
Most panel tests for the null hypothesis of (no) cointegration rely on single-equation methods, notable exceptions being Larsson et al. [1], Groen and Kleibergen [2], Breitung [3] and Karaman Örsal and Droge [4], who proposed panel system approaches. In particular, the more recent paper by Miller [5], building on nonlinear instrumental variable likelihood-based rank tests, allows for cross-correlation between the units. Similarly, recent single-equation tests by Chang and Nguyen [6] or Demetrescu et al. [7] also rely on nonlinear instrumental variable estimation, while the vast majority of such panel tests builds on ordinary, fully modified, or dynamic least squares (LS). Here, we study exactly this class of LS-based single-equation panel tests for the null of either cointegration or no cointegration.
We focus on the situation where the test statistics are computed from regressions with an intercept only, and with at least one of the integrated regressors displaying a linear time trend on top of the stochastic trend. Such a constellation is often met in practical applications, see for instance Coe and Helpman [8] and Westerlund [9] on R&D spillovers (total factor productivity and capital stock), Larsson et al. [1] on log real consumption and income (per capita), or Hanck [10] on prices and exchange rate series testing the weak purchasing power parity (PPP). The relevance of a linear trend in panel data has been addressed in Hansen and King [11] when commenting on the link between health care expenditure and GDP, see McCoskey and Selden [12]; consequently, Blomqvist and Carter [13], Gerdtham and Löthgren [14] or Westerlund [15] worked (partly) with detrended series, i.e., they included time as an explanatory variable in their panel regressions. Hansen ([16], p. 103), however, argues that “it seems reasonable that excess detrending will reduce the test’s power”. Therefore, we study the empirically relevant case where test statistics are computed from regressions with intercept only (i.e., without detrending) when at least one of the I(1) regressors displays a linear time trend.
Before becoming more technical, we want to outline our findings as a rule for empirical applications. Let
denote a generic panel cointegration statistic computed from a regression with intercept only involving
I(1) variables. The least squares regression may be static in levels,
where
is assumed to be I(1) in the case of no cointegration, or I(0) under the null hypothesis of cointegration, see Remarks 1 and 3 below, respectively. Alternatively,
may be from the error-correction regression
1,
where contemporaneous differences
or additional lags of differences may be required as additional regressors to render
free of serial correlation, see Remark 2 below. The test statistic may be constructed from pooling the data or from averaging individual statistics, see e.g., Pedroni [
18,
19] or Westerlund [
15]. Much of the nonstationary panel literature relies on sequential limit theory where
is followed by
, such that limiting normality can be established under the assumption that none of the I(1) regressors follows a deterministic time trend:
The constants
and
required for appropriate normalization are typically tabulated for a selected number of values of
m, see again Remarks 1 through 3. A different set of such moments
and
is also typically given for detrended regressions, where the test statistic
stems from regressions of the type (
)
or
We call such regressions “detrended” because, in a single-equation framework, the resulting parameter estimators are equivalent to what one obtains from a two-step procedure: first, regress all variables on a linear time trend, and, second, regress the individually detrended residuals on each other. This equivalence is sometimes called the Frisch-Waugh-Lovell Theorem, see e.g., Greene ([20], Theorem 3.2). For generic
from, e.g., the tests mentioned in Remarks 1 through 3, it holds, irrespective of a possible linear trend in the data, that
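This detrending equivalence is easy to verify numerically. The following sketch (our own illustration with numpy; the variable names and parameter values are hypothetical, not taken from the paper) compares the one-step and the two-step estimator on simulated data:

```python
import numpy as np

rng = np.random.default_rng(0)
T = 200
trend = np.arange(1.0, T + 1)

# an I(1) regressor with drift and a dependent variable built from it
x = 0.5 * trend + np.cumsum(rng.standard_normal(T))
y = 1.0 + 2.0 * x + rng.standard_normal(T)

# one-step: regress y on intercept, trend and x jointly
X = np.column_stack([np.ones(T), trend, x])
b_onestep = np.linalg.lstsq(X, y, rcond=None)[0][2]

# two-step: detrend y and x individually, then regress the residuals
D = np.column_stack([np.ones(T), trend])
detrend = lambda v: v - D @ np.linalg.lstsq(D, v, rcond=None)[0]
b_twostep = float(np.linalg.lstsq(detrend(x)[:, None], detrend(y), rcond=None)[0][0])

assert abs(b_onestep - b_twostep) < 1e-8  # identical by Frisch-Waugh-Lovell
```

Both estimators agree up to floating-point error, which is exactly the Frisch-Waugh-Lovell statement.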
Our main contribution is twofold for the case that at least one of the I(1) regressors has a linear time trend and the regressions are run with intercept only (without detrending). First, it is shown that the normalization with
and
and the resulting critical values for
from the regression “with intercept only” are not correct in the presence of linear time trends in the data. It is analytically (Proposition 1) and numerically demonstrated that their usage results in size distortions growing with the panel size N. Second, we characterize the appropriate limiting distributions by showing that normalization of
with
and
results in a standard normal limit, such that the size of the tests can be controlled (Theorem 1). Put differently, Theorem 1 means in non-technical terms: the limiting distribution arising from a regression on k I(1) variables with drift and an intercept amounts to the limiting distribution in the case of a regression on I(1) variables and an intercept plus a linear time trend. Such a rule is known in a pure time series context for the special case of the residual-based Phillips-Ouliaris test for no cointegration from Hansen ([16], p. 103): “[...] deterministic trends in the data affect the limiting distribution of the test statistics whether or not we detrend the data”; see also the expositions in Hamilton ([21], pp. 596–597) and Hassler ([22], Proposition 16.6). It is even more relevant in our panel framework since we illustrate numerically and analytically that the size distortions of an inappropriate normalization grow with the panel size N (either to zero or one, depending on the specific test). Moreover, we compare our proposal to account for linear trends in the data with the more traditional method of detrending the regression. By simulation, we show that the power gains of our new strategy according to Theorem 1 over detrending may be considerable. We hence recommend this strategy as superior to detrending.
The rest of the paper is organized as follows. The next section sets some notation and assumptions.
Section 3 establishes and discusses our asymptotic results and illustrates them with numerical evidence. It also compares our suggestion to account for linear trends with the conventional method of detrending. The last section discusses consequences for applied work. Mathematical proofs are relegated to the
Appendix A.
2. Notation and Assumptions
Restricting our attention to the single-equation framework, we partition the m-vector of observables into a scalar and a k-element vector , , . As usual, the index i stands for the cross-section, , while t denotes time, . Each sequence , , is assumed to be integrated of order 1, I(1), where we allow for a non-zero drift, and assume for simplicity a negligible starting value, . While may be cointegrated or not, depending on the respective null hypothesis, we rule out cointegration among . Technically, these assumptions translate as follows, where denotes an m-dimensional standard Wiener process, stands for the integer part of a number x, and ⇒ is the symbol for weak convergence.
Assumption 1. With obvious partitioning according to , we assume ()
The stochastic zero-mean process is integrated of order 0 in that it satisfies
with
where and is positive definite.
Now, we turn to assumptions with respect to the tests. Let
and
stand again for generic test statistics computed from individual single-equation least squares regressions with “intercept only” and “intercept plus linear trend”, respectively. The superscript
stands for the dimension of the I(1) vector entering the equations. One route to panel testing relies on so-called group statistics averaging individual statistics. We denote them as follows:
Similarly, panel statistics rely on pooling the data across the dimension within, i.e., summing over terms showing up in the numerator and denominator separately,
A typical example for the mapping g is
in the case of t-type statistics. Here, it is assumed that the generic
and
or
and
are computed from individually demeaned or detrended regressions, respectively. We allow for group and panel statistics by introducing the generic notation
and
, and maintain for the panel the joint null hypothesis
A distinction between the individual null hypotheses of cointegration or absence of cointegration is not required, and both cases are treated in the generic assumption as follows.
Assumption 2. Consider linear single-equation least squares regressions (, )
or
where contemporaneous differences or lags of , , may be required as additional regressors in (3) to ensure residuals free of serial correlation. Let and stand for group statistics and or for panel statistics and computed from regressions with “intercept only” and “intercept plus linear trend”, respectively. We assume under the null hypothesis (1) that
as followed by .
Tests, e.g., by Kao [23], Pedroni [18,19], Westerlund [9,24] or Westerlund [15] meet Assumption 2 under different sets of restrictions, and they will be considered in the next section, see Remarks 1 through 3. In particular, these authors tabulate values of
and
,
. Our assumption of a single equation approach is motivated by the fact that much of applied work relies on this. However, such an assumption comes at a price: In (
2), we have to assume that the regressors
alone are not cointegrated (
is positive definite according to Assumption 1), and, in (
3), we have to assume under the alternative of cointegration that
adjust to deviations from the long-run equilibrium, and not
.
Much of the earlier panel cointegration literature assumed independent units, invoking a central limit theorem to establish Assumption 2; see, e.g., Pedroni [18,19] and Kao [23]. Cross-sectional independence, however, is not maintained in our Assumption 2. Westerlund [15,24], e.g., allows for cross-correlation driven by a common factor. To account for this, he suggests replacing
and
by the cross-sectionally demeaned time series,
This way, he establishes that the limiting results maintained under Assumption 2 are met under cross-sectional correlation (subject to some restrictions).
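In array terms, this cross-sectional demeaning is a one-liner. The sketch below (illustrative only, with a single common random-walk factor of our own choosing) shows that the common component is removed exactly, since it is identical across units:

```python
import numpy as np

rng = np.random.default_rng(1)
N, T = 10, 100

# panel of random walks sharing one common stochastic factor,
# which induces cross-sectional correlation
factor = np.cumsum(rng.standard_normal(T))
y = factor + np.cumsum(rng.standard_normal((N, T)), axis=1)  # shape (N, T)

# cross-sectional demeaning: subtract the cross-section average at each t;
# the common factor drops out because it is common to all units
y_demeaned = y - y.mean(axis=0)

assert np.allclose(y_demeaned.mean(axis=0), 0.0)
```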
3. Results
3.1. Asymptotic Theory
The first paper allowing for linear time trends in a panel cointegration context was by Kao [23]. He considers a residual-based unit root test for the null hypothesis of no cointegration in the tradition of Phillips and Ouliaris [25]. His test builds on pooling the data while allowing for an individual-specific intercept. Kao [23] does not consider regressions containing a linear time trend as additional regressors, but allows for a linear drift in the data when performing a regression with a fixed-effect intercept. In the case of the regressor (i.e., ), Kao ([23], Equation (15)) observed that the linear time trend dominates the I(1) component; hence, the limiting distribution amounts to that of the panel unit root test by Levin et al. [26] upon detrending. To be precise: let and denote the normalizing constants provided by Levin et al. [26] for detrended panel unit root tests; then, one should use them for the pooled residual-based panel cointegration statistic in a bivariate regression if the regressor is I(1) with drift, see Kao ([23], Theorem 4):
In Theorem 1, we extend Kao’s result for any panel or group statistics from static or dynamic regressions with computed from regressions with intercept only in the presence of linear time trends.
Theorem 1. Let the data satisfy Assumption 1, and let the generic test statistic meet Assumption 2 for . Furthermore, assume that for all . Under the null hypothesis (1), it then holds true that
as , where are from Assumption 2.
Note that Assumption 2 does not impose any restriction on . As is shown in the proof, Theorem 1 holds irrespective of whether displays a linear trend or not ( or ).
Two research strategies can be employed in the presence of linear time trends when dealing with statistics resulting from regressions with intercept only. The first one simply ignores the linear time trends in the data and standardizes with and . The second strategy accounts for the drift in the data according to Theorem 1; in other words, it applies upon standardizing with and . We summarize as follows:
Strategy SI:
When is computed from panel regressions without detrending, then compare with quantiles from the standard normal distribution, i.e., ignore the presence of linear trends in the data.
Strategy SA:
When is computed from panel regressions without detrending, then compare with quantiles from the standard normal distribution, i.e., account for the presence of linear trends in the data.
For the rest of the paper, we assume that an applied econometrician is able to distinguish between the two cases, whether a linear time trend underlies the variables (e.g., log income or log prices) or not (e.g., interest or inflation rates). Hence, we maintain the assumption behind Theorem 1: the researcher knows that at least one regressor is I(1) with drift (). We assume that strategy is only employed when linear trends are truly present and thus refrain from the discussion of misspecification: what happens if there are no linear time trends in the data, but one erroneously accounts for trends.
The situation analyzed in Theorem 1 has not been considered in the previous panel cointegration literature, with the notable exception of Kao [23]. Consequently, all applied papers that we are aware of standardize , with and ignoring the effect of deterministic trends in the series, which amounts to strategy . The effect of strategy under linear time trends is discussed for growing N in the following proposition. The resulting size distortions depend on whether the test is right-tailed or left-tailed (the null hypothesis is rejected for too large or too small values, respectively).
Proposition 1. Let the assumptions from Theorem 1 hold true. Furthermore, assume
Under the null hypothesis, one has the following results for strategy :
- (a) For a left-tailed test, the probability to reject according to strategy increases with growing N to 1;
- (b) for a right-tailed test, the probability to reject according to strategy decreases with growing N to 0.
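The intuition behind this divergence can be mimicked with a stylized simulation in which the individual statistics are replaced by plain normal draws; the moment values below are invented for illustration and do not belong to any actual cointegration test. When the true mean lies below the mean used for normalization, the left-tailed rejection rate grows with N:

```python
import numpy as np

# Stylized illustration of Proposition 1(a): individual statistics are drawn
# with a (hypothetical) true mean mu_star below the tabulated mean mu used
# for normalization, so the normalized statistic drifts to minus infinity.
rng = np.random.default_rng(2)
mu, sigma = 0.0, 1.0             # moments used for (incorrect) normalization
mu_star, sigma_star = -0.3, 1.0  # true moments under linear trends (invented)
crit = -1.645                    # 5% left-tailed critical value of N(0, 1)

reps = 5000
for N in (10, 50, 200):
    # cross-sectional average of N individual statistics, reps times
    S_bar = mu_star + sigma_star * rng.standard_normal((reps, N)).mean(axis=1)
    Z = np.sqrt(N) * (S_bar - mu) / sigma
    print(N, (Z < crit).mean())  # rejection rate increases toward 1 with N
```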
We now discuss a couple of panel tests satisfying Assumption 2 and (5), such that Theorem 1 and Proposition 1 apply.
Remark 1. The residual-based unit root tests for the null hypothesis of no cointegration proposed by Pedroni [18,19] build on static regressions as in (2). The null hypothesis (1) is rejected for too negative values of the test statistic (of in our generic notation). The expected values and standard deviations and showing up in Assumption 2 are available from Pedroni ([18], Table 2) for and from Pedroni ([19], Corollary 1) for . In order to apply Theorem 1 (strategy ) for , one requires and . These values stem from the detrended Dickey-Fuller distribution in the case of group statistics and have been tabulated by Nabeya ([27], Table 4): and . Throughout, we observe . Hence, Proposition 1(a) applies. If strategy is employed under linear trends, then the probability to reject the true null hypothesis converges to one with growing panel size N. Alternatively, Westerlund [24] suggested group and panel variance ratio type tests along the lines of Breitung [28]. The null hypothesis of no cointegration is rejected again for too small values of the variance ratio statistic, and and showing up in Assumption 2 are given in Westerlund ([24], Table 1) for . To apply Theorem 1 with , we need and . For the detrended Breitung distribution, we obtain by simulation and , which are the values corresponding to the case of group statistics. Again, we observe , so that (5) holds. Consequently, Proposition 1(a) applies, and the probability to reject the true null hypothesis under strategy grows with N as long as there is a linear trend in the data. To sum up: in the case of residual-based tests for no cointegration, strategy results in massive size distortions; numerical evidence for finite N is given in Table 1 below.

Remark 2. The error-correction test by Westerlund [15] relies on regressions of type (3). It is again a left-tailed test: the null hypothesis of no cointegration is rejected for too negative t-values associated with γ.
Values of and are tabulated in Westerlund ([15], Table 1) for . In the case of (i.e., no on the right-hand side), the limiting distributions are of the usual Dickey-Fuller type. Hence, and for group statistics are again from detrended Dickey-Fuller-type distributions and given in Nabeya ([27], Table 4) (see above). Comparing with , we find meeting (5) again. Consequently, strategy is increasingly liberal in the presence of linear time trends, and the probability to reject the true null hypothesis approaches 1 in the limit as long as the series display a linear time trend. For numerical evidence, see Table 2 below.

Remark 3. We now flip the null and the alternative hypotheses. Westerlund [9] suggested testing the null hypothesis of cointegration. He proposed a CUSUM group test statistic for this null hypothesis to be applied with tabulated values and , . To apply Theorem 1 for , we provide as moments of the univariate, detrended distribution by simulation: and 2. This test is right-tailed and in accordance with Westerlund ([9], Table 1) . Thus, this time Proposition 1(b) comes into play. Under strategy in the presence of linear trends, the test is increasingly undersized with growing N. Such a conservative behaviour implies low power under the alternative hypothesis.

3.2. Numerical Evidence
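Several of the moments quoted in Remarks 1 through 3 are obtained by simulating detrended distributions. A minimal sketch of such a Monte Carlo exercise for the detrended Dickey-Fuller t-statistic is given below; the sample length and replication count are illustrative choices of our own, not the settings behind the tabulated values:

```python
import numpy as np

rng = np.random.default_rng(3)
T, reps = 500, 2000                  # illustrative, not the paper's settings
trend = np.arange(1.0, T + 1)
D = np.column_stack([np.ones(T), trend])
stats = np.empty(reps)

for r in range(reps):
    y = np.cumsum(rng.standard_normal(T))               # driftless random walk
    y_d = y - D @ np.linalg.lstsq(D, y, rcond=None)[0]  # detrend the series
    dy, lag = np.diff(y_d), y_d[:-1]
    rho = (lag @ dy) / (lag @ lag)                      # Dickey-Fuller regression
    resid = dy - rho * lag
    se = np.sqrt(resid @ resid / (len(dy) - 1) / (lag @ lag))
    stats[r] = rho / se

# sample mean and standard deviation of the detrended DF t-statistic
print(stats.mean(), stats.std())
```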
The statements obtained from Proposition 1 may be quantified more precisely by means of Equations (A2) and (A3) given in Appendix A. These rejection probabilities apply approximately (for large N) under the null hypothesis at nominal significance level α. We report results for the group t-tests by Pedroni [18,19] and by Westerlund [15] in Table 1 and Table 2.
Generally, the size distortions in Table 1 and Table 2 grow with N, while decreasing with at the same time. The fact that is too liberal is characteristic of these tests where we reject for too negative values (of in our generic notation). Overrejection is not the general case, however, as we see when reversing the null and alternative hypotheses. To quantify distortions for the CUSUM test discussed in Remark 3, we use Equation (A3) from Appendix A. When evaluating under , we observe rejection probabilities equal to zero up to three digits for ; this strongly supports the limiting result from Proposition 1(b).
3.3. Regressions with a Linear Time Trend
For regressions with intercept only, strategy
has been used in the literature and applied with the tests mentioned in the remarks above. We have illustrated its failure to control size under the null hypothesis in the presence of a linear time trend. In practice, one may use two strategies to account for linear time trends. The first one is the new
according to Theorem 1 from regressions without detrending. The second one consists of detrending the series, or equivalently running detrended regressions, i.e., including
and
in (2) and (3), respectively. The empirical strategy then becomes the following:
Strategy SD:
Compute from detrended panel regressions and compare the normalization with quantiles from the standard normal distribution.
By Assumption 2, this strategy will provide asymptotically correct size. However, tests from detrended regressions will be prone to power losses relative to strategy , which is more parsimonious. For this reason, we next investigate the price of strategy relative to in terms of power.
In Monte Carlo experiments, we study in particular the error-correction test (group t-statistic) by Westerlund [15]. Before turning to a power analysis, we make sure that size is under control. For the data-generating process (DGP), we hence consider the null hypothesis of no cointegration under linear time trends:
where
are normal iid sequences,
, independent of each other. Finally,
is an independent random walk entering (6). The DGP under the alternative of cointegration becomes
where
and
are generated as before. Using the regression
we computed the group t-statistic proposed by Westerlund [15]. Strategy
is employed with
All reported rejection frequencies rely on 10,000 replications.
The leading case consists of the following parameterization, where only the first component of the regressors
is driven by a linear time trend:
This mimics with
or
a typical macro panel with monthly data and e.g., income, interest rates and inflation rates as regressors.
Table 3 reports the frequencies of rejection for different values of
from (6), and rejection is based on strategy
according to Theorem 1. It illustrates how well the rule of Theorem 1 works: the experimental sizes are close to the nominal ones. This is particularly true for
, while the test is mildly conservative for
, and a bit more conservative for
, in particular for
N large relative to
. Next, we consider strategy
with the same data. The rejection frequencies are given in
Table 4. We observe that the experimental size from detrended regressions is close to the nominal one under the null hypothesis of no cointegration, irrespective of
.
Since strategies
and
both hold the nominal size, the question of which one is more powerful naturally arises. The results contained in Table 5 are very clear: first, the power increases with
; second, strategy
always outperforms
considerably, and has, e.g., rejection frequencies more than twice as large for
or
. In particular, detrending becomes all the more costly; relative to strategy
, the larger
N is, which is intuitively clear: including a linear time trend in a regression requires the estimation of an additional parameter; in a panel of
N units, detrending thus involves the estimation of
N additional parameters compared to strategy
. At the same time, these estimated trends can be spuriously correlated with the stochastic trends in the data, and, therefore, incorrectly lead to support for cointegration, in particular when the time dimension is relatively short.
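The spurious-trend point can be illustrated in isolation (a sketch of our own, separate from the reported experiments): fitting a linear trend to a driftless random walk and judging it by the usual iid-error t-ratio rejects the no-trend hypothesis far more often than the nominal 5% level suggests.

```python
import numpy as np

rng = np.random.default_rng(5)
T, reps = 100, 2000
trend = np.arange(1.0, T + 1)
D = np.column_stack([np.ones(T), trend])
DtD_inv = np.linalg.inv(D.T @ D)

rejections = 0
for _ in range(reps):
    y = np.cumsum(rng.standard_normal(T))  # driftless random walk: no trend
    beta = np.linalg.lstsq(D, y, rcond=None)[0]
    resid = y - D @ beta
    s2 = resid @ resid / (T - 2)
    t_stat = beta[1] / np.sqrt(s2 * DtD_inv[1, 1])
    rejections += abs(t_stat) > 1.96       # naive 5% two-sided test

print(rejections / reps)  # far above 0.05: the fitted trend is spuriously significant
```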
We have varied the leading case with the parameterization from (11). First, we allowed for more and stronger trends in the regressors,
with all other parameters fixed. This corrects the mild undersizedness of strategy
reported in
Table 3, yielding empirical sizes very close to the nominal one. At the same time, power relative to
Table 5 is increased, with strategy
still dominating
. Second, we have increased the magnitude of the random walks, namely
, while the other parameters are from (11) and
(see Table 6). Here, the linear trends are less pronounced, such that
results in slightly more conservative tests (compared to the first panel in Table 3), and similarly, power is reduced (compared to the first panel in Table 5). Still,
clearly dominates
in Table 6. Third, we simulated shorter panels,
. This makes both strategies,
and
, conservative under
, which is accompanied by a loss of power.
4. Conclusions
In time series econometrics, it has been known for a long time that “the deterministic trends in the data affect the limiting distributions of the test statistics whether or not we detrend the data” (Hansen [16], p. 103). This has been shown for the residual-based Phillips-Ouliaris (or Engle-Granger) cointegration test by Hansen [16], see also the exposition in Hamilton ([21], pp. 596–597). Analogous results have been given for other cointegration tests by Hassler [30,31], see also the summary by Hassler ([
22], Proposition 16.6). In this paper, these findings are carried over to the panel framework, and they are shown to continue to hold for single-equation tests relying on least squares, no matter whether the null hypothesis is absence or presence of cointegration. In a regression involving
variables, much of the panel cointegration theory relies on normalization with suitable constants
and
and letting the panel dimension N go to infinity to obtain a standard normal distribution. The numbers
and
are tabulated for the case of regressions with intercept only. Different figures
and
are tabulated for regressions with intercept and linear time trend. We show the following: when statistics are computed from regressions with m integrated variables with intercept only, but one of the integrated regressors is dominated by a linear time trend, then normalization with
and
is required to achieve asymptotically valid inference under the null hypothesis (Theorem 1). Normalization with
and
, however, which has been the conventional strategy so far, results in a loss of size control under the null hypothesis. In fact, employing
and
in the presence of linear time trends gives rejection probabilities converging with N to 1 or 0, depending on whether the null hypothesis is no cointegration or cointegration, respectively (see Proposition 1). To avoid such size distortions, one may employ the strategy following Theorem 1, or one may work with detrended regressions. Detrending, however, comes at a price: a regression with intercept only will provide more powerful tests (see, e.g., Hamilton [21], p. 598); according to our simulations, power gains of our new strategy over detrending may be considerable and growing with N, and this also holds true if there is a linear trend superimposing the level relation. Our Monte Carlo evidence, however, is limited to the case of testing the null hypothesis of no cointegration.
Hence, we propose the following empirical strategy if at least one of the integrated regressors is driven by a linear time trend when testing for no cointegration. First, test the null hypothesis of no cointegration with our new strategy from Theorem 1, since it is more powerful than tests relying on detrending. If the null hypothesis of no cointegration is rejected according to Theorem 1, then one may test, in a second step, whether a linear time trend is present, superimposing the level relation between and . If strategy does not reject the null hypothesis of no cointegration, then one may, of course, try a test building on detrending, although it will tend to be less powerful, since it requires the estimation of N additional parameters.