Common Correlated Effects Estimation for Dynamic Heterogeneous Panels with Non-Stationary Multi-Factor Error Structures

Cao, Shiyun; Zhou, Qiankun

doi:10.3390/econometrics10030029

Open AccessArticle

Common Correlated Effects Estimation for Dynamic Heterogeneous Panels with Non-Stationary Multi-Factor Error Structures

by

Shiyun Cao

¹ and

Qiankun Zhou

^2,*

¹

School of Science, Guangxi University of Science and Technology, Liuzhou 545006, China

²

Department of Economics, Louisiana State University, Baton Rouge, LA 70803, USA

^*

Author to whom correspondence should be addressed.

Econometrics 2022, 10(3), 29; https://doi.org/10.3390/econometrics10030029

Submission received: 28 January 2022 / Revised: 1 August 2022 / Accepted: 4 August 2022 / Published: 11 August 2022

(This article belongs to the Special Issue Special Issue on Time Series Econometrics)

Download

Browse Figures

Versions Notes

Abstract

In this paper, we consider the estimation of a dynamic panel data model with non-stationary multi-factor error structures. We adopted the common correlated effect (CCE) estimation and established the asymptotic properties of the CCE and common correlated effects mean group (CCEMG) estimators, as N and T tend to infinity. The results show that both the CCE and CCEMG estimators are consistent and the CCEMG estimator is asymptotically normally distributed. The theoretical findings were supported for small samples by an extensive simulation study, showing that the CCE estimators are robust to a wide variety of data generation processes. Empirical findings suggest that the CCE estimation is widely applicable to models with non-stationary factors. The proposed procedure is also illustrated by an empirical application to analyze the U.S. cigar dataset.

Keywords:

dynamic panel models; cross-sectional dependence; non-stationary; common factors; common correlated effects

JEL Classification:

C01; C13; C23

1. Introduction

Recently, there has been increased interest in the analysis of panel data models with cross-sectionally dependent errors (also known as unobserved common factors or multi-factor error structures), which are motivated by empirical applications in economics, such as common shocks and the global financial crisis, see Omay and Kan (2010); Bussiere et al. (2013); Eberhardt et al. (2013) and Chudik et al. (2017), etc. The dependencies across the units violate the traditional assumption of independent and identically distributed errors; conventional panel estimation methods (such as fixed effects estimation) could have serious consequences and lead to inconsistent estimations and misleading inferences. Therefore, in econometrics literature, much effort has been devoted to the estimations for panels with cross-sectional dependence, for example, Pesaran (2006); Bai (2009); Zaffaroni (2009); Greenaway-McGrevy et al. (2012); Kao et al. (2012); Chudik and Pesaran (2013); Moon and Weidner (2015, 2017), among others. See also Chudik and Pesaran (2015b) for a survey of recent developments in large panel models with cross-sectional dependence.

Among these studies, a predominant approach of dealing with cross-sectionally dependent errors in panel models is the so-called common correlated effect (CCE) method proposed by Pesaran (2006)1. The basic idea of CCE estimation is to proxy the unobserved common factors using the cross-sectional averages of the observables in the regression. Comparatively, it has several advantages. For instance, it can be computed by least squares to auxiliary regression, and it does not require the knowledge of the number of unobserved factors. The CCE method has been further developed and applies to different types of panel models. To name a few, Chudik and Pesaran (2015a) suggested the CCE approach to analyze dynamic heterogeneous panels with stationary unobserved common factors. Kapetanios et al. (2011) extended the CCE method to static panel data models with non-stationary multi-factor error structures. Westerlund et al. (2019) considered the CCE for short panels, and Zhou and Zhang (2016) extended the CCE for unbalanced panels.

Among the aforementioned works, there is a gap in the CCE estimation for dynamic panels with non-stationary unobservable factors. To fill this gap, in this paper we consider a linear dynamic heterogeneous panel data model with non-stationary unobserved common factors when both the cross-sectional and time dimensions of the dataset grow to infinity. Under these settings, we find that the CCE estimator of the individual coefficient is consistent, and the CCE mean group (CCEMG) estimator is consistent and has a normal limit distribution. The practical implication of this finding is that for inferential purposes of the CCE estimation, one does not necessarily need to test the stationarity of the unobserved common factors in the model. The finite sample properties are examined through Monte Carlo simulations and the simulation results confirm our theoretical findings in the paper. Moreover, the proposed procedure is illustrated by an empirical application, which analyzes the U.S. cigar dataset.

The rest of the paper is organized as follows. Section 2 sets up the basic model and introduces the CCE estimation of the dynamic heterogeneous panel data model with common factors. The asymptotics of the CCE estimation with non-stationary unobserved common factors is provided in Section 3. Monte Carlo simulation results and an empirical application are reported in Section 4 and Section 5, respectively. The concluding remarks are made in Section 6. Proof of the main results is provided in Appendix A.

Notation: The letter K stands for a finite positive constant. All vectors are column vectors represented by bold lower case letters, and matrices are represented by bold capital letters. Let

∥A∥ = \sqrt{t r ({AA}^{'})}

denote the Frobenius norm.

{∥A∥}_{1} = {max}_{1 \leq j \leq n} Σ_{i = 1}^{n} |a_{i j}|

, and

{∥A∥}_{\infty} = {max}_{1 \leq i \leq n} Σ_{j = 1}^{n} |a_{i j}|

denote the maximum absolute column and row sum matrix norms, respectively.

A^{+}

denotes the Moore–Penrose inverse of

A

,

r a n k (A)

and

ϱ (A)

denotes the rank and the spectral radius of

A

, respectively.

2. Dynamic Panel Data Model with Non-Stationary Unobserved Common Factors

2.1. The Model

We assume the scalar dependent variable

y_{i t}

and regressors

x_{i t}

are generated as follows2

y_{i t} = c_{y i} + ϕ_{i} y_{i, t - 1} + β_{i}^{'} x_{i t} + γ_{i}^{'} f_{t} + ε_{i t},

(1)

and

x_{i t} = c_{x i} + α_{i} y_{i, t - 1} + Γ_{i}^{'} f_{t} + u_{i t},

(2)

for

i = 1, 2, \dots, N

and

t = 1, 2, \dots, T

, where

c_{y i}

and

c_{x i}

are individual fixed effects for unit i,

x_{i t}

is a

k \times 1

vector of the regressors specific to cross-sectional unit i at time t,

ε_{i t}

are the individual-specific (idiosyncratic) errors and

u_{i t}

are the individual-specific components of

x_{i t}

,

γ_{i}

and

Γ_{i}

are

m \times 1

and

m \times k

factor loading matrices, and the

m \times 1

vector

f_{t}

represents unobserved common factors. In what follows, we maintain the restriction that model (1) is stationary, such that

0 <

|ϕ_{i}| < 1

for

i = 1, 2, \dots, N .

Models (1)–(2) have been widely studied in the literature; see, for instance, Pesaran (2006), Chudik and Pesaran (2015a), Westerlund et al. (2019), and the references therein. We follow these studies to consider the CCE estimation for

ϕ_{i}

and

β_{i},

and reexamine the validity of the CCE estimation when

f_{t}

is non-stationary.

2.2. CCE Estimation

Following Chudik and Pesaran (2015a), let

z_{i t} = {(y_{i t}, x_{i t}^{'})}^{'}

, then (1) and (2) can be compactly written as

z_{i t} = c_{z i} + A_{i} z_{i, t - 1} + A_{0 i}^{- 1} C_{i} f_{t} + e_{z i t},

(3)

where

c_{z i} = A_{0 i}^{- 1} c_{i}

,

A_{i} = A_{0 i}^{- 1} A_{1 i}

,

C_{i} = {(γ_{i}, Γ_{i})}^{'}

, and

e_{z i t} = A_{0 i}^{- 1} e_{i t}

, with

c_{i} = {(c_{y i}, c_{x i}^{'})}^{'}

,

e_{i t} = {(ε_{i t}, u_{i t}^{'})}^{'}

, and

A_{0 i} = (\begin{matrix} 1 & - β_{i}^{'} \\ 0 & I_{k} \end{matrix}), A_{1 i} = (\begin{matrix} ϕ_{i} & 0 \\ α_{i} & 0 \end{matrix}) .

(4)

If the support of

ϱ (A_{i})

lies strictly inside the unit circle, then (3) can be rewritten as the following distributed lag form

z_{i t} = \sum_{l = 0}^{\infty} A_{i}^{l} (c_{z i} + A_{0 i}^{- 1} C_{i} f_{t - l} + e_{z i, t - l}),

(5)

for

i = 1, 2, \dots, N .

Taking the cross-sectional average of (5) yields

{\bar{z}}_{t} = {\bar{c}}_{z} + Λ (L) {Cf}_{t} + O_{p} (N^{- 1 / 2}),

where

{\bar{z}}_{t} = \frac{1}{N} \sum_{i = 1}^{N} z_{i t}

is a

k + 1

dimensional vector of the cross-section average,

{\bar{c}}_{z} = \frac{1}{N} \sum_{i = 1}^{N} {(I_{k + 1} - A_{i})}^{- 1} c_{z i}

, and

C = E (C_{i}) = {(γ, Γ)}^{'}

,

Λ (L) = \sum_{l = 0}^{\infty} Λ_{l} L^{l}

with

Λ_{l} = E (A_{i}^{l} A_{0 i}^{- 1})

, and L being the lag operator. Furthermore, if

Λ (L)

is invertible (see Assumption 4 below), then we have

{Cf}_{t} = Λ^{- 1} (L) ({\bar{z}}_{t} - {\bar{c}}_{z}) + O_{p} (N^{- 1 / 2}) .

When the

(k + 1) \times m

matrix

C

has the full column rank, i.e., the rank condition

r a n k (C) = m \leq k + 1,

(6)

holds, we have

f_{t} = B (L) ({\bar{z}}_{t} - {\bar{c}}_{z}) + O_{p} (N^{- 1 / 2}),

(7)

where

B (L) = {(C^{'} C)}^{- 1} C^{'} Λ^{- 1} (L)

. This suggests that the contemporary and lagged value of

{\bar{z}}_{t} = {({\bar{y}}_{t}, {\bar{x}}_{t}^{'})}^{'}

can be used as observable proxies for the unobserved common factors

f_{t}

.

Substituting the observed proxies of the unobserved common factors (7) into (1) yields the following augmented regression

y_{i t} = c_{y i}^{*} + ϕ_{i} y_{i, t - 1} + β_{i}^{'} x_{i t} + δ_{i}^{'} (L) {\bar{z}}_{t} + ε_{i t} + O_{p} (N^{- 1 / 2}),

or

y_{i t} = c_{y i}^{*} + ϕ_{i} y_{i, t - 1} + β_{i}^{'} x_{i t} + \sum_{l = 0}^{p_{T}} δ_{i l}^{'} {\bar{z}}_{t - l} + w_{i t},

(8)

for

t = p_{T} + 1, p_{T} + 2, \dots, T

, where

c_{y i}^{*} = c_{y i} - δ_{i}^{'} (1) {\bar{c}}_{z}

,

δ_{i} (L) = B^{'} (L) γ_{i} = \sum_{l = 0}^{\infty} δ_{i l} L^{l}

,

p_{T}

is the number of lags used to truncate the infinite polynomial distributed lag function

δ_{i} (L)

,3 and the composite error

w_{i t}

has the form of

w_{i t} = ε_{i t} + \sum_{l = p_{T} + 1}^{\infty} δ_{i l}^{'} {\bar{z}}_{t - l} + O_{p} (N^{- 1 / 2}) .

For notational simplicity, let

y_{i} = {(y_{i, p_{T} + 1}, y_{i, p_{T} + 2}, \dots, y_{i T})}^{'}

,

Ξ_{i} = (y_{i, - 1}, X_{i})

, with

y_{i, - 1} = {(y_{i, p_{T}}, y_{i, p_{T} + 1}, \dots, y_{i, T - 1})}^{'}

and

X_{i} = {(x_{i, p_{T} + 1}, x_{i, p_{T} + 2}, \dots, x_{i T})}^{'}

, and

w_{i} = {(w_{i, p_{T} + 1}, w_{i, p_{T} + 2}, \dots, w_{i T})}^{'}

, then the augmented regression (8) can be expressed in vector form as

y_{i} = Ξ_{i} π_{i} + \bar{Q} d_{i} + w_{i},

(9)

where

π_{i} = {(ϕ_{i}, β_{i}^{'})}^{'}

are the parameters of interest,

d_{i} = {(c_{y i}^{*}, δ_{i 0}^{'}, δ_{i 1}^{'}, \dots, δ_{i p}^{'})}^{'}

are nuisance parameters, and4

\bar{Q} = (\begin{matrix} 1 & {\bar{z}}_{p_{T} + 1}^{'} & {\bar{z}}_{p_{T}}^{'} & \dots & {\bar{z}}_{1}^{'} \\ 1 & {\bar{z}}_{p_{T} + 2}^{'} & {\bar{z}}_{p_{T} + 1}^{'} & \dots & {\bar{z}}_{2}^{'} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & {\bar{z}}_{T}^{'} & {\bar{z}}_{T - 1}^{'} & \dots & {\bar{z}}_{T - p_{T}}^{'} \end{matrix}) .

(10)

Based on the cross-sectionally augmented regression model (9) and by the formula for partitioned regression, the CCE estimator of the individual coefficients

π_{i}

is given by

{\hat{π}}_{i} = {(Ξ_{i}^{'} M_{q} Ξ_{i})}^{- 1} Ξ_{i}^{'} M_{q} y_{i},

(11)

which is an ordinary least squares estimate, where

M_{q} = I_{T - p_{T}} - \bar{Q} {({\bar{Q}}^{'} \bar{Q})}^{+} {\bar{Q}}^{'}

is an orthogonal projection matrix, with

I_{T - p_{T}}

a

(T - p_{T})

-dimensional identity matrix. In panel models with N large, the primary parameters of interest are the means of the individual–specific coefficients,

E (π_{i}) = π

, which can be estimated by the common correlated effects mean group (CCEMG) estimator

{\hat{π}}_{M G} = \frac{1}{N} \sum_{i = 1}^{N} {\hat{π}}_{i} .

(12)

3. Asymptotics of CCE Estimators with Non-Stationary Factors

3.1. Assumptions

When the unobserved common factors

f_{t}

are stationary processes, Chudik and Pesaran (2015a) showed that the CCE estimator (11) of the individual coefficient is consistent, and the CCEMG estimator (12) is consistent and asymptotically normal. However, in practice, the common factors

f_{t}

may follow a non-stationary process (see Bai and Ng 2004, 2010; Pesaran 2007; Pesaran et al. 2013, among others). In this scenario, the validity of CCE estimators and their asymptotic properties need to be re-examined.

Following Kapetanios et al. (2011), we assume the unobserved common factors follow the multivariate unit root process

f_{t} = f_{t - 1} + ς_{t} .

(13)

To derive the asymptotic properties of the CCE type estimators (11) and (12) when

f_{t}

follows (13), we make the following assumptions.

Assumption 1.

(Individual-specific errors). (i) The individual–specific errors

ε_{i t}

follow a linear stationary process with uniformly-bounded positive variance,

{sup}_{i} σ_{i}^{2} < K

, for some constant K, and uniformly-bounded fourth-order cumulants.

u_{i t}

follows a linear stationary process with absolute summable auto-covariances (uniformly in i), with covariance matrices,

Σ_{u_{i}}

, which are non-singular and satisfy

{sup}_{i} ∥Σ_{u_{i}}∥ < K

, and have uniformly-bounded fourth-order cumulants. (ii)

ε_{i t}

are independently distributed of

u_{j t^{'}}

for all

i, j, t

, and

t^{'}

. For each i,

e_{i t} = {(ε_{i t}, u_{i t}^{'})}^{'}

is an

(k + 1) \times 1

vector of

L_{2 + δ}, δ > 0

, stationary near epoch dependent processes of size

2 δ / (2 δ + 4)

on the

α

-mixing process of size

- (2 + δ) / δ

, and for

i = 1, 2, \dots, N

,

V a r (e_{i t}) = Σ_{e_{i}}

, which is a non-singular matrix and satisfies

{sup}_{i} ∥Σ_{e_{i}}∥ < K

.

Assumption 2.

(Factor loadings). The factor loadings

γ_{i}

and

Γ_{i}

are independently and identically distributed (

I I D

) across i, and of the common factors

f_{t}

, for all i and t, with means

γ

and

Γ

, respectively, and the bounded second moments. In particular,

γ_{i} = γ + η_{γ i}, with η_{γ i} \sim I I D (0, Σ_{γ}), for i = 1, 2, \dots, N,

and

v e c (Γ_{i}) = v e c (Γ) + η_{Γ i}, with η_{Γ i} \sim I I D (0, Σ_{Γ}), for i = 1, 2, \dots, N,

where

Σ_{γ}

and

Σ_{Γ}

are

m \times m

and

m k \times m k

symmetric nonnegative definite matrices,

∥γ∥ < K, ∥Σ_{γ}∥ < K, ∥Γ∥ < K

and

∥Σ_{Γ}∥ < K

, for some constant

K .

Assumption 3.

(Heterogeneous coefficients). The slope coefficients

π_{i} = {(ϕ_{i}, β_{i}^{'})}^{'}

follow the random coefficient model

π_{i} = π + υ_{π i}, υ_{π i} \sim I I D (0, Σ_{π}), for i = 1, 2, \dots, N,

where

π = E (π_{i}) = {(ϕ, β^{'})}^{'}

,

∥π∥ < K, ∥Σ_{π}∥ < K

,

Σ_{π}

is the

(k + 1) \times (k + 1)

symmetric nonnegative definite matrix and the random deviations

υ_{π i}

are distributed independently of

γ_{j}, Γ_{j}, ε_{j t}, u_{j t}

and

ς_{t}

for

i, j

, and t. Furthermore, the support of

ϕ_{i}

lies strictly inside the unit circle, and

E ∥c_{i}∥ < K

,

E ∥α_{i}∥ < K

for all i, where

c_{i} = {(c_{y i}, c_{x i}^{'})}^{'}

.

Assumption 4.

(Exogenous regressors). Regressors

x_{i t}

are either strictly exogenous and generated according to the canonical factor model (2) with

α_{i} = 0

, or weakly exogenous and generated according to (2) with

α_{i}

, for

i = 1, 2, \dots, N

,

I I D

across i, and independently distributed of

υ_{π j}

,

γ_{j}, Γ_{j}, ε_{j t}, u_{j t}

and

f_{t}

for all

i, j

, and t. In the case where the regressors are weakly exogenous, we also assume:

(i) The support of

ϱ (A_{i})

lies strictly inside the unit circle, for

i = 1, 2, \dots, N

, where

A_{i} = A_{0 i}^{- 1} A_{1 i}

with

A_{0 i}

and

A_{1 i}

are defined in (4).

(ii) The inverse of polynomial

Λ (L) = \sum_{l = 0}^{\infty} Λ_{l} L_{l}

exists and has exponentially decaying coefficients, where

Λ_{l} = E (A_{i}^{l} A_{0 i}^{- 1})

.

Assumption 5.

(Rank condition). The

(k + 1) \times m

matrix

C

has a full column rank, such that

r a n k (C) = m \leq k + 1,

where

C = E (C_{i}) = E {(γ_{i}, Γ_{i})}^{'} .

Assumption 6.

(i) As

(N, T) \to \infty,

the

(k + 1) \times (k + 1)

matrices

Ψ_{i T}^{- 1} = {(Ξ_{i}^{'} M_{q} Ξ_{i} / T)}^{- 1},

Ψ_{i h}^{- 1} = {(Ξ_{i}^{'} M_{h} Ξ_{i} / T)}^{- 1}

and

Ψ_{i g}^{- 1} = {(Ξ_{i}^{'} M_{g} Ξ_{i} / T)}^{- 1}

exist for all i, and

Ψ_{i T}^{- 1},

Ψ_{i h}^{- 1}

and

Ψ_{i g}^{- 1}

have finite second-order moments for all

i,

where

M_{h} = I_{T - p_{T}} - H {(H^{'} H)}^{+} H^{'}

and

M_{g} = I_{T - p_{T}} - G {(G^{'} G)}^{+} G^{'}

are projection matrices, where

A^{+}

denotes the Moore–Penrose generalized inverse of

A,

H

, and

G

, defined as

H = (\begin{matrix} 1 & h_{p_{T} + 1}^{'} & \dots & h_{1}^{'} \\ 1 & h_{p_{T} + 2}^{'} & \dots & h_{2}^{'} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & h_{T}^{'} & \dots & h_{T - p_{T}}^{'} \end{matrix}), G = (τ, \tilde{F}) = (\begin{matrix} 1 & f_{p_{_{T}} + 1}^{'} & \dots & f_{1}^{'} \\ 1 & f_{p_{_{T}} + 2}^{'} & \dots & f_{2}^{'} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & f_{T}^{'} & \dots & f_{T - P_{T}}^{'} \end{matrix}),

where

h_{t} = Ψ (L) f_{t} + {\bar{c}}_{z}

with

Ψ (L) = \frac{1}{N} \sum_{i = 1}^{N} {(I_{k + 1} - A_{i} L)}^{- 1} A_{0 i}^{- 1} C_{i}

and

{\bar{c}}_{z} = \frac{1}{N} \sum_{i = 1}^{N} {(I_{k + 1} - A_{i})}^{- 1} c_{z i}

,

τ = {(1, 1, \dots 1)}^{'}

is a

(T - p_{T}) \times 1

vector of ones.

(ii) The matrix

Ψ^{*} = {lim}_{N \to \infty} \frac{1}{N} \sum_{i = 1}^{N} Σ_{Ω_{i}}

is non-singular, where

Σ_{Ω i} = Ψ_{ξ i} (\begin{matrix} σ_{i}^{2} & 0 \\ 0 & Σ_{u_{i}} \end{matrix}) Ψ_{ξ i}^{'}

with

Ψ_{ξ i} = (\begin{matrix} S_{y}^{'} {(I_{k + 1} - A_{i} L)}^{- 1} L \\ S_{x}^{'} {(I_{k + 1} - A_{i} L)}^{- 1} \end{matrix}) A_{0 i}^{- 1}

a

(k + 1) \times (k + 1)

matrix,

S_{y}^{'} = {(\begin{matrix} 1 & 0 \end{matrix})}_{1 \times (k + 1)}

, and

S_{x}^{'} = {(\begin{matrix} 0 & I_{k} \end{matrix})}_{k \times (k + 1)}

.

Assumption 7.

ς_{t}

in (13) is an

m \times 1

vector of

L_{2 + δ}, δ > 0,

stationary near epoch-dependent processes of size

1 / 2

, on an

α

-mixing process of size

- (2 + δ) / δ

, and is distributed independently of the idiosyncratic errors

ε_{i t}

and

u_{i t}

for all i and t.

Several remarks can be made for these assumptions. Assumptions 1–3 are quite standard in the literature for (dynamic) panel models with cross-sectional dependence, for example, see Pesaran (2006) and Kapetanios et al. (2011) and the references therein. Assumption 4 is also made on Chudik and Pesaran (2015a) for exogenous regressors and stationarity conditions for dynamic panels. Assumption 5 is a common condition for the implementation of the CCE estimation (e.g., Pesaran (2006) and Chudik and Pesaran (2015a), etc.), which implies that there are more included regressors than the unobserved factors in the model. See Juodis et al. (2021) for a detailed discussion of the validity of the rank condition and the resulting asymptotics for the CCE estimation. Assumption 6 is a common assumption for the CCE estimation and it is imposed for the partition regression in augmented regression for the dynamic panels (e.g., Chudik and Pesaran 2015a). Assumption 7 requires that the error structures in the unit root process

f_{t}

are stationary.

3.2. Asymptotics

Under these assumptions, we can establish the asymptotic properties of CCE estimators (11) and (12) when

f_{t}

is non-stationary. To begin with, we note that for the original model (1), it can be rewritten as in the vector form

y_{i} = c_{y i} τ + ϕ_{i} y_{i, - 1} + X_{i} β_{i} + F γ_{i} + ε_{i},

(14)

or more compactly as

y_{i} = c_{y i} τ + Ξ_{i} π_{i} + F γ_{i} + ε_{i},

(15)

for

i = 1, 2, \dots, N

, where

y_{i} = {(y_{i, p_{T} + 1}, y_{i, p_{T} + 2}, \dots, y_{i T})}^{'}

,

X_{i} = {(x_{i, p_{T} + 1}, \dots, x_{i T})}^{'}

,

y_{i, - 1} = {(y_{i, p_{T}}, y_{i, p_{T} + 1}, \dots, y_{i, T - 1})}^{'}

,

Ξ_{i} = (y_{i, - 1}, X_{i})

,

F = {(f_{p_{T} + 1}, \dots, f_{T})}^{'}

, and

ε_{i} = {(ε_{i, p_{T} + 1}, \dots, ε_{i T})}^{'}

.

Using the CCE estimator (11) into (15), we have

{\hat{π}}_{i} - π_{i} = {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{Ξ_{i}^{'} M_{q} F γ_{i}}{T} + {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{Ξ_{i}^{'} M_{q} ε_{i}}{T},

(16)

which shows that the asymptotics of

{\hat{π}}_{i}

depends on the unobserved factors through

T^{- 1} Ξ_{i}^{'} M_{q} F

.

Using the results in Lemma A2, A5, and A6 in Appendix A, we obtain

\frac{Ξ_{i}^{'} M_{q} F}{T} = O_{p} (\frac{1}{N}) + O_{p} (\frac{1}{\sqrt{N T}}), uniformly over i,

(17)

and, thus,

{\hat{π}}_{i} - π_{i} = {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{Ξ_{i}^{'} M_{q} ε_{i}}{T} + O_{p} (\frac{1}{N}) + O_{p} (\frac{1}{\sqrt{N T}}),

(18)

when the rank condition (6) is satisfied. The above results are summarized in the following theorem, establishing the consistency of the CCE estimator of individual coefficients of interest.

Theorem 1.

Consider the panel models (1) and (2), suppose Assumption 1–7 hold, then, as

(N, T, p_{T}) \overset{j}{\to} \infty

, such that

p_{T}^{3} / T \to λ,

0 < λ < \infty

, we have

{\hat{π}}_{i} - π_{i} \overset{p}{\to} 0 .

See the Appendix A for the proof.

Remark 1.

The above theorem suggests that the CCE estimator of the individual slope coefficient is consistent even if the common factors are non-stationary. When the rank condition (6) is not satisfied, the CCE estimator of the individual slope coefficients would be inconsistent due to the correlation of

x_{i t}

and

f_{t}

. See Juodis et al. (2021) for more discussions on the validity of the CCE estimator when the rank condition does not hold.

Next, we establish the asymptotic properties of the CCEMG estimator of the mean group coefficients,

π = E (π_{i})

. We have

\begin{matrix} \sqrt{N} ({\hat{π}}_{M G} - π) & = & \frac{1}{\sqrt{N}} \sum_{i = 1}^{N} υ_{π i} + \frac{1}{\sqrt{N}} \sum_{i = 1}^{N} {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{Ξ_{i}^{'} M_{q} F}{T} γ_{i} \\ + \frac{1}{\sqrt{N}} \sum_{i = 1}^{N} {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{Ξ_{i}^{'} M_{q} ε_{i}}{T} . \end{matrix}

When the rank condition is satisfied, by (17), we have

\frac{\sqrt{N} Ξ_{i}^{'} M_{q} F}{T} = O_{p} (\frac{1}{\sqrt{N}}) + O_{p} (\frac{1}{\sqrt{T}}),

hence, we can obtain

\sqrt{N} ({\hat{π}}_{M G} - π) = \frac{1}{\sqrt{N}} \sum_{i = 1}^{N} υ_{π i} + O_{p} (\frac{1}{\sqrt{N}}) + O_{p} (\frac{1}{\sqrt{T}}) \overset{d}{\sim} \frac{1}{\sqrt{N}} \sum_{i = 1}^{N} υ_{π i},

and, thus,

Theorem 2.

Consider the panel models (1) and (2), suppose Assumptions 1–7 hold, as

(N, T, p_{T}) \overset{j}{\to} \infty

, such that

p_{T}^{3} / T \to λ,

0 < λ < \infty

, then we have

{\hat{π}}_{M G} - π \overset{p}{\to} 0 .

If it is further assumed that

N / T \to φ, 0 < φ < \infty

, then

\sqrt{N} ({\hat{π}}_{M G} - π) \overset{d}{\to} N (0, Σ_{M G}) .

The asymptotic variance of

{\hat{π}}_{M G}

can be consistently estimated nonparametrically by

{\hat{Σ}}_{M G} = \frac{1}{N - 1} \sum_{i = 1}^{N} ({\hat{π}}_{i} - {\hat{π}}_{M G}) {({\hat{π}}_{i} - {\hat{π}}_{M G})}^{'} .

For the results in both Theorems 1 and 2, we find that, for models with non-stationary common factors, although the intermediate results needed for deriving the asymptotic properties of the common correlated effects estimators significantly differ from the stationary case, as in Chudik and Pesaran (2015a), the final results are surprisingly similar. This is in direct contrast to the usual phenomenon where distributional results of

I (1)

processes are radically different from those of

I (0)

processes.

Remark 2.

For the consistency of

{\hat{π}}_{i}

and

{\hat{π}}_{M G}

, no restrictions on the relative expansion rates of N and T to infinity are required. However, they require

N / T \to φ, 0 < φ < \infty

for the derivation of the asymptotic distribution of

{\hat{π}}_{M G}

due to the time series bias, which arises from the presence of lagged values of the dependent variable; therefore, it is unsuitable for panels with T being small relative to N.

Including a lagged dependent variable as the regressor in the model could induce the estimators with time series bias of order

O (T^{- 1})

. When T is not large, the bias is non-negligible; hence, a certain bias correction approach should be considered. In the simulations below, we consider the Jackknife bias-corrected method for bias reduction (e.g., see (21) below), which is used extensively in the relevant literature (e.g., Hahn and Newey 2004).

4. Monte Carlo Simulation

In this section, we investigate the finite sample properties of the CCEMG estimation for dynamic heterogeneous panels with non-stationary common factors. We consider the following data-generating processes5

y_{i t} = c_{y i} + ϕ_{i} y_{i, t - 1} + β_{0 i} x_{i t} + β_{1 i} x_{i, t - 1} + γ_{i}^{'} f_{t} + ε_{i t},

(19)

and

x_{i t} = c_{x i} + α_{x i} y_{i, t - 1} + γ_{x i}^{'} f_{t} + u_{i t} .

(20)

for

i = 1, 2, \dots, N,

and

t = - 99, \dots, 0, \dots, T .

Let

ϕ_{i} \sim I I D U (0, 0.8)

,

β_{0 i} \sim I I D U (0.5, 1)

,

β_{1 i} = - 0.5

and

α_{x i} \sim I I D U (0, 0.35)

,

c_{y i} \sim I I D N (1, 1)

,

c_{x i} = c_{y i} + ϵ_{c_{x i}}

with

ϵ_{c_{x i}} \sim I I D N (0, 1)

. The main purpose of this paper is to illustrate the validity of the CCEMG estimator in the case of non-stationary unobserved common factors; hence, for the unobserved common factors

f_{t}

, we consider the following three different non-stationary DGPs:

DGP 1. Two non-stationary unobserved common factors

(m = 2)

,

f_{l t} = f_{l, t - 1} + ς_{f l t},

ς_{f l t} \sim I I D N (0, σ_{f l}^{2}),

where

σ_{f l} = 0.2

, for

l = 1, 2

, and

t = - 99, \dots, 0, \dots, T

.

DGP 2. One non-stationary unobserved common factor and a stationary common factor

(m = 2)

,

f_{1 t} = f_{1, t - 1} + ς_{f l t},

f_{2 t} = 0.6 f_{2, t - 1} + ς_{f l t},

ς_{f l t} \sim I I D N (0, σ_{f l}^{2}),

where

σ_{f l} = 0.5

, for

l = 1, 2

, and

t = - 99, \dots, 0, \dots, T

.

DGP 3. Cointegrated unobserved common factors

(m = 2)

,

f_{1 t} = f_{1, t - 1} + ς_{f 1 t},

f_{2 t} = 0.5 f_{1 t} + ς_{f 2 t},

ς_{f l t} \sim I I D N (0, σ_{f l}^{2}),

where

σ_{f l} = 1

, for

l = 1, 2

, and

t = - 99, \dots, 0, \dots, T

.

For the above DGPs, the starting values are

f_{l, - 100} = 0

, for

l = 1, 2

; the first 100 observations are discarded.

Correspondingly, the factor loadings are generated independently across replications as

γ_{i l} = γ_{l} + η_{i, γ l}, η_{i, γ l} \sim I I D N (0, σ_{γ l}^{2}),

and

γ_{x i l} = γ_{x l} + η_{i, γ x l}, η_{i, γ x l} \sim I I D N (0, σ_{γ x l}^{2}),

for

i = 1, 2, \dots, N

and

l = 1, 2

, where

σ_{γ l}^{2} = {0.2}^{2}

,

σ_{γ x l}^{2} = {0.3}^{2}

, and

γ_{l} = \sqrt{b_{γ l}}

,

γ_{x l} = \sqrt{l b_{x l}}

for

l = 1, 2

, where

b_{γ l} = 1 / 2 - σ_{γ l}^{2}

and

b_{x l} = 1 / 2 - σ_{γ x l}^{2}

for

l = 1, 2

.

For the idiosyncratic errors,

ε_{i t} \sim I I D N (0, 1)

for all i and t, and the unit-specific components

u_{i t}

are generated as independent stationary AR(1) processes:

u_{i t} = ρ_{x i} u_{i, t - 1} + ϵ_{x i t}, ρ_{x i} \sim I I D U (0, 0.95), ϵ_{x i t} \sim I I D N (0, 1),

for

i = 1, 2, \dots, N

and

t = - 99, \dots, 0, \dots, T

with the starting values

u_{i, - 100} = 0

. The first 100 observations are discarded.

We consider the combination of

N = 50, 100, 200

, and

T = 50, 100, 150, 200

. The number of replications is set at 2000 times. In what follows, we focus on the lagged coefficient

ϕ

(the cross-section mean of

ϕ_{i}

), as well as

β_{0}

(the cross-section mean of

β_{0 i}

). To save space, we only report the results of

β_{0}

since the results for

β_{1}

are very similar to that of

β_{0}

and they are available upon request.

Two estimators are considered in the simulation. The first is the main result of the CCEMG estimator

{\hat{π}}_{M G}

given in (12), in which, the lag order

p_{T}

is selected to satisfy

p_{T}^{3} / T \to λ,

as

T \to \infty

, for some

0 < λ < \infty

; that is,

p_{T} = [T^{1 / 3}]

, which works well in our Monte Carlo design6. The second is the Jackknife bias-corrected CCEMG estimator, which is constructed as

{\hat{π}}_{J a c k - M G} = 2 {\hat{π}}_{M G} - \frac{1}{2} ({\hat{π}}_{M G}^{a} + {\hat{π}}_{M G}^{b}),

(21)

where

{\hat{π}}_{M G}^{a}

is the CCEMG estimator calculated using the first two-thirds of the available time period, namely over the period

t = 1, 2, \dots, [2 T / 3]

, and

{\hat{π}}_{M G}^{b}

denotes the CCEMG estimator computed using the observations over the period

t = [T / 3], [T / 3] + 1, \dots, T

, where

[T / 3]

denotes the integer part of

T / 3

. Note that a new strategy is applied to improve the performance of the Jackknife estimator, i.e., the whole time period is divided into three parts, the first two-thirds of the available period is applied to calculate the first estimator and another one is computed from the last two-thirds of the period. We find that, in our settings, this division strategy performs better than the half-panel Jackknife method discussed in Chudik and Pesaran (2015a).

We used the statistical software MATLAB to conduct the Monte Carlo experiments; the simulation results are summarized in Table 1, Table 2 and Table 3 for DGPs 1–3, respectively.

From Table 1, we note that for the estimation of

ϕ

, the CCEMG performs well in terms of bias and RMSE, with the bias diminishing as T is increased, and the associated RMSEs fall steadily when T increases, which implies that the CCEMG estimator is consistent. However, it still suffers from the time series bias when T is small. While the Jackknife bias-corrected CCEMG estimator is quite effective at reducing the time series bias of the CCEMG estimator, the bias has been significantly reduced compared with the original CCEMG estimator when T was not large, and the RMSE also decreased with the increase of either N or T. Similar findings can be observed for

β_{0} .

In order to evaluate the robustness of various estimators, we considered additional results in Table 2 and Table 3 for DGPs with both stationary and non-stationary factors or cointegrated factors. Similar to the case with non-stationary factors in DGP1, we find that the CCEMG estimator still performs well regardless of the number of common factors and the non-stationary type, and it can be improved by the Jackknife bias-corrected for the estimation of the autoregressive coefficient

ϕ

, the CCEMG estimator of the slope coefficient

β_{0}

performs very well in almost all cases.

Overall, the findings of our Monte Carlo simulations show that, if the parameter of interest is the mean coefficient of the regressors,

β_{0}

, the CCEMG estimator performs well even if N and T are not large. For the mean coefficient of the lagged dependent,

ϕ

, the CCEMG estimator is still consistent, but it suffers from the time series bias unless T is sufficiently large and, thus, the Jackknife bias-corrected CCEMG estimator is proposed, it helps to mitigate the time series bias.

5. Empirical Study

In this section, we illustrate our method by considering the U.S. Cigar dataset, which is frequently used in the literature on panel models (e.g., Baltagi and Li 2004; Bada and Liebl 2014). The panel contains the per capita cigarette consumption of

N = 46

American states from 1963 to 1992 (

T = 30

) as well as data on the income per capita and cigarette prices; the dataset can be obtained from the R package phtt.

To test the cross-sectional dependence in the panel data, following Pesaran (2015) and Bailey et al. (2016), we compute the

C D

statistic and the

α

statistic for the variables of interest in Table 4. As can be seen from the table, the

C D

statistics turn out to be

101.519

,

166.27

, and

154.142

for consumption, income, and price, respectively; these are highly significant and reject the null hypothesis of weak cross-sectional dependence for all three variables. Additionally, the estimates of

α

together with their

95 %

confidence bands further confirm the above results. As a result, we can conclude that there is an obvious cross-sectional dependence for these three variables.

To investigate the relationship between the per capita cigarette consumption and the income per capita as well as cigarette prices, following Baltagi and Li (2004), we consider the panel model

y_{i t} = c_{i} + ϕ_{i} y_{i, t - 1} + β_{1 i} x_{1 i t} + β_{2 i} x_{2 i t} + e_{i t},

(22)

where

y_{i t},

x_{1 i t}

, and

x_{2 i t}

denote the per capita cigarette consumption, the income per capita, and cigarette price for the ith state at time t, respectively, and the idiosyncratic error has the multi-factor structure

e_{i t} = γ_{i}^{'} f_{t} + ε_{i t} .

(23)

The proposed dynamic CCE approach is applied to estimate the coefficients in model (22), and the augmented equation to be estimated can be written as

y_{i t} = c_{i} + ϕ_{i} y_{i, t - 1} + β_{1 i} x_{1 i t} + β_{2 i} x_{2 i t} + \sum_{l = 0}^{p_{T}} δ_{i l}^{'} {\bar{z}}_{t - l} + w_{i t},

(24)

where the number of lags

p_{T} = [\sqrt[3]{T}] = 3

, and

{\bar{z}}_{t} = {({\bar{y}}_{t - 1}, {\bar{x}}_{1 t}, {\bar{x}}_{2 t})}^{'}

. We focus on the CCEMG estimators and the results are presented in Table 5.

The following conclusions can be drawn from Table 5. On the one hand, the income per capita has a positive effect on the per capita cigarette consumption, while the increase in cigarette price will restrain cigarette consumption to a certain extent, and both are significant. These results are consistent with the conclusions of Bada and Liebl (2014). On the other hand, the lagged explained variable is highly significant, indicating that it is appropriate to use dynamic models for the per capita cigarette consumption.

To illustrate the heterogeneous slopes across states, we display both the CCE and the CCEMG estimators in Figure 1, which clearly show that the estimates of coefficients vary from state to state, reflecting the heterogeneity among states. Moreover, to illustrate the potential non-stationarity of unobservable common factors in (23), we consider the method proposed by Bada and Kneip (2014) to select the number of unobservable common factors and estimate the selected common factors. The results are given in Figure 2, where the top panel shows the estimated common factors and the bottom panel shows the estimated time-varying individual effects of

N = 46

states. As can be seen from the figure, five common factors have been selected, among which the first and second common factors have obvious tendencies and violate the stationarity condition.

6. Conclusions

In this paper, we re-examined the CCE type estimator for dynamic heterogeneous panel regression models with non-stationary common factors. Asymptotic properties of CCE estimators are established when both N and T are large. It is shown that, under certain conditions, the main results of Pesaran (2006) and Chudik and Pesaran (2015a) hold for a dynamic panel with non-stationary factors. Monte Carlo simulations were conducted to investigate the finite sample properties of the CCE estimation for the panel with non-stationary factors. An empirical application to the U.S. cigarette consumption dataset shows that the real data may have cross-sectional dependence as well as dynamic and non-stationary common factors (at the same time). Based on the findings of this paper, together with the results by Pesaran (2006); Kapetanios et al. (2011), and Chudik and Pesaran (2015a), we can conclude that the CCE method can be widely used to deal with panel models with error cross-sectional dependence, regardless of whether the model is static or dynamic, and whether the unobservable common factors are stationary.

Author Contributions

These authors contributed equally to this work. All authors have read and agreed to the published version of the manuscript.

Funding

Cao acknowledges the financial support from the National Natural Science Foundation of China (no. 11861014) and Guangxi Natural Science Foundation (no. 2020JJA110007 and no. 2020JJA110013.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data used in the empirical application can be obtained from the R package phtt.

Acknowledgments

We are grateful for the constructive comments from the guest editor as well as the two anonymous referees.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Useful Lemmas and Theoretical Derivations of Theorems

The appendix includes proofs of the theorems and lemmas used in the derivations of the main results in the paper.

Recall that

H = (\begin{matrix} 1 & h_{p_{T} + 1}^{'} & \dots & h_{1}^{'} \\ 1 & h_{p_{T} + 2}^{'} & \dots & h_{2}^{'} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & h_{T}^{'} & \dots & h_{T - p_{T}}^{'} \end{matrix}), G = (τ, \tilde{F}) = (\begin{matrix} 1 & f_{p_{_{T}} + 1}^{'} & \dots & f_{1}^{'} \\ 1 & f_{p_{_{T}} + 2}^{'} & \dots & f_{2}^{'} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & f_{T}^{'} & \dots & f_{T - P_{T}}^{'} \end{matrix}),

(A1)

\bar{P} = (\begin{matrix} 1 & {\bar{c}}_{z}^{'} & \dots & {\bar{c}}_{z}^{'} \\ 0 & Ψ^{'} (L) & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & Ψ^{'} (L) \end{matrix}), {\bar{V}}^{*} = (0, \tilde{V}) = (\begin{matrix} 0 & {\bar{v}}_{p_{T} + 1}^{'} & \dots & {\bar{v}}_{1}^{'} \\ 0 & {\bar{v}}_{p_{T} + 2}^{'} & \dots & {\bar{v}}_{2}^{'} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & {\bar{v}}_{T}^{'} & \dots & {\bar{v}}_{T - p_{T}}^{'} \end{matrix}),

(A2)

where

h_{t} = Ψ (L) f_{t} + {\bar{c}}_{z}

,

Ψ (L) = \frac{1}{N} \sum_{i = 1}^{N} {(I_{k + 1} - A_{i} L)}^{- 1} A_{0 i}^{- 1} C_{i}

and

{\bar{v}}_{t} = \frac{1}{N} \sum_{i = 1}^{N} {(I_{k + 1} - A_{i} L)}^{- 1} A_{0 i}^{- 1} e_{i t}

. Then, the following equalities hold,

\bar{Q} = G \bar{P} + {\bar{V}}^{*} and H = G \bar{P},

(A3)

where matrix

\bar{Q}

is given by (10).

Appendix A.1. Useful Lemmas

Now, let us turn to the lemmas, which are needed for the derivation of the results in the main paper.

Lemma A1.

(a) If

A \in R_{r}^{m \times n}

,

r > 0

, has a full-rank factorization

A = BC,

where

B \in R_{r}^{m \times r}

,

C \in R_{r}^{r \times n}

, then

A^{+} = C^{+} B^{+} .

(b) If

A \in R_{m}^{m \times n}

, i.e.,

A

is full row rank, then

A^{+} = A^{'} {({AA}^{'})}^{- 1}

.

Using the properties of Moore–Penrose inverse, Lemma A.1 can be easily established by the MacDuffe Theorem of Ben-Israe and Greville (2003).

Lemma A2.

If the rank condition (6) in the main text is satisfied, then

M_{h} = M_{g}

(A4)

where

M_{h} = I_{T - p_{T}} - H {(H^{'} H)}^{+} H^{'}

,

M_{g} = I_{T - p_{T}} - G {(G^{'} G)}^{+} G^{'}

and

H = G \bar{P}

.

Lemma A3.

Ξ_{i} = (y_{i, - 1}, X_{i})

can be written as

Ξ_{i} = G_{1} Π_{i 1} + Ω_{i},

(A5)

or more concisely as

Ξ_{i} = G_{2} Π_{i 2} + Ω_{i},

(A6)

where

G_{1} = G = (τ, \tilde{F})

given by (A1),

Π_{i 1} = {(c_{ξ i}, Ψ_{ξ i} C_{i}, 0, \dots, 0)}^{'},

G_{2} = (τ, F)

,

Π_{i 2} = {(c_{ξ i}, Ψ_{ξ i} C_{i})}^{'}

and

Ω_{i} = e_{i} Ψ_{ξ i}^{'}

, with

e_{i} = (ε_{i}, U_{i})

,

c_{ξ i} = {(S_{y}, S_{x})}^{'} {(I_{k + 1} - A_{i})}^{- 1} c_{z i}

,

Ψ_{ξ i} = (\begin{matrix} S_{y}^{'} {(I_{k + 1} - A_{i} L)}^{- 1} L \\ S_{x}^{'} {(I_{k + 1} - A_{i} L)}^{- 1} \end{matrix}) A_{0 i}^{- 1}

is

(k + 1) \times (k + 1)

matrix, and

S_{y}^{'} = (\begin{matrix} 1 & \underset{1 \times k}{0} \end{matrix})

,

S_{x}^{'} = (\begin{matrix} \underset{k \times 1}{0} & I_{k} \end{matrix})

.

Lemma A4.

Under Assumption 1–7 as well as restriction

(N, T, p_{T}) \overset{j}{\to} \infty

, such that

p_{T}^{2} / T \to 0

and

T / N \to φ \neq 0 < \infty

, then the following holds,

{∥\frac{{\bar{V}}^{*^{'}} {\bar{V}}^{*}}{T}∥}_{\infty} = O_{p} (\frac{p_{T}}{N})

(A7)

{∥\frac{ε_{i}^{'} {\bar{V}}^{*}}{T}∥}_{\infty} = O_{p} (\frac{p_{T}}{N}) + O_{p} (\frac{p_{T}}{\sqrt{N T}}), {∥\frac{U_{i}^{'} {\bar{V}}^{*}}{T}∥}_{\infty} = O_{p} (\frac{p_{T}}{N}) + O_{p} (\frac{p_{T}}{\sqrt{N T}})

(A8)

{∥\frac{G^{'} {\bar{V}}^{*}}{T}∥}_{\infty} = O_{p} (\frac{p_{T}}{\sqrt{N}}), {∥\frac{H^{'} {\bar{V}}^{*}}{T}∥}_{\infty} = O_{p} (\frac{p_{T}}{\sqrt{N}}), {∥\frac{Ξ_{i}^{'} {\bar{V}}^{*}}{T}∥}_{\infty} = O_{p} (\frac{p_{T}}{\sqrt{N}})

(A9)

{∥\frac{G^{'} ε_{i}}{T}∥}_{\infty} = O_{p} (1), {∥\frac{G^{'} U_{i}}{T}∥}_{\infty} = O_{p} (1), {∥\frac{{\bar{Q}}^{'} ε_{i}}{T}∥}_{\infty} = O_{p} (1)

(A10)

{∥\frac{G^{'} G}{T^{2}}∥}_{\infty} = O_{p} (p_{T}), {∥\frac{H^{'} H}{T^{2}}∥}_{\infty} = O_{p} (p_{T}), {∥\frac{{\bar{Q}}^{'} \bar{Q}}{T^{2}}∥}_{\infty} = O_{p} (p_{T})

(A11)

{∥\frac{{\bar{Q}}^{'} G}{T^{2}}∥}_{\infty} = O_{p} (p_{T}), {∥\frac{H^{'} Ξ_{i}}{T^{2}}∥}_{\infty} = O_{p} (p_{T}), {∥\frac{{\bar{Q}}^{'} Ξ_{i}}{T^{2}}∥}_{\infty} = O_{p} (p_{T})

(A12)

Lemma A5.

Under Assumption 1–7 and

(N, T, p_{T}) \overset{j}{\to} \infty

, such that

p_{T}^{3} / T \to λ,

0 < λ < \infty

. Then,

\frac{Ξ_{i}^{'} M_{g} Ξ_{i}}{T} \overset{p}{\to} Σ_{Ω i},

(A13)

where

Σ_{Ω i} = Ψ_{ξ i} (\begin{matrix} σ_{i}^{2} & 0 \\ 0 & Σ_{u_{i}} \end{matrix}) Ψ_{ξ i}^{'}

is a positive definite matrix. Additionally, if the rank condition (6) is satisfied, then

\frac{Ξ_{i}^{'} M_{q} F}{T} = O_{p} (\frac{1}{N}) + O_{p} (\frac{1}{\sqrt{N T}}), uniformly over i .

(A14)

Lemma A6.

If the rank condition (6) is satisfied, and

(N, T, p_{T}) \overset{j}{\to} \infty

, such that

N / T \to φ

,

0 < φ < \infty

and

p_{T}^{2} / T \to 0

, it follows that

\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T} - \frac{Ξ_{i}^{'} M_{h} Ξ_{i}}{T} = O_{p} (\frac{p_{T}}{\sqrt{N}}), uniformly over i,

(A15)

\frac{Ξ_{i}^{'} M_{q} F}{T} - \frac{Ξ_{i}^{'} M_{h} F}{T} = O_{p} (\frac{p_{T}}{\sqrt{N}}), uniformly over i,

(A16)

\frac{Ξ_{i}^{'} M_{q} ε_{i}}{T} - \frac{Ξ_{i}^{'} M_{h} ε_{i}}{T} = O_{p} (\frac{p_{T}}{\sqrt{N T}}) + O_{p} (\frac{p_{T}}{N}), uniformly over i .

(A17)

Appendix A.2. Theoretical Derivation of the Asymptotics of the CCE Estimators

Proof of Theorem 1.

Since

{\hat{π}}_{i} - π_{i} = {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{Ξ_{i}^{'} M_{q} F γ_{i}}{T} + {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{Ξ_{i}^{'} M_{g} ε_{i}}{T},

and using the results of Lemmas A5 and A6, we have

{\hat{π}}_{i} - π_{i} = {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{Ξ_{i}^{'} M_{g} ε_{i}}{T} + O_{p} (\frac{1}{N}) + O_{p} (\frac{1}{\sqrt{N T}}) .

Noting that under our assumptions,

T^{- 1} Ξ_{i}^{'} M_{q} Ξ_{i}

tends to a fixed positive definite matrix. Since

Ξ_{i} = G Π_{i} + Ω_{i}

, then we have

\frac{Ξ_{i}^{'} M_{g} ε_{i}}{T} = - {({\hat{Π}}_{i} - Π_{i})}^{'} \frac{G^{'} ε_{i}}{T} + \frac{Ω_{i}^{'} ε_{i}}{T},

where

{\hat{Π}}_{i}

is the OLS estimator of

Ξ_{i}

on

G

. Since

({\hat{Π}}_{i} - Π_{i}) = O_{p} (T^{- 1})

, the first part of (A10) in Lemma A4 implies that the first term is

O_{p} (T^{- 1})

. Next, we establish

T^{- 1} Ω_{i}^{'} ε_{i} \overset{p}{\to} 0

. Note that

Ω_{i} = e_{i} Ψ_{ξ i}^{'}

with

e_{i} = (ε_{i}, U_{i})

, i.e.,

Ω_{i}

contains the lags of

ε_{i t}

, as well as the contemporary and lags of

u_{i t}

, by Assumption 1,

ε_{i t}

is the series uncorrelated and independent of

u_{i t}

, then we have

T^{- 1} Ω_{i}^{'} ε_{i} \overset{p}{\to} 0

; consequently,

\frac{Ξ_{i}^{'} M_{g} ε_{i}}{T} \overset{p}{\to} 0, uniformly over i .

(A18)

as

(N, T, p_{T}) \overset{j}{\to} \infty

and

p_{T}^{2} / T \to 0

. Then it is followed by the consistency of

{\hat{π}}_{i}

. □

Proof of Theorem 2.

Using the consistency of

{\hat{π}}_{i}

, and the definition of the mean group estimator

{\hat{π}}_{M G}

, we obtain

{\hat{π}}_{M G} - \frac{1}{N} \sum_{i = 1}^{N} π_{i} \overset{p}{\to} 0,

(A19)

By the assumption of the random coefficient model,

π_{i} = π + υ_{π i}

, it follows that

\frac{1}{N} \sum_{i = 1}^{N} π_{i} = π + \frac{1}{N} \sum_{i = 1}^{N} υ_{π i} .

(A20)

Combining (A19) and (A20), we have

{\hat{π}}_{M G} \overset{p}{\to} π + \frac{1}{N} \sum_{i = 1}^{N} υ_{π i},

(A21)

so we only need to show that

\frac{1}{N} \sum_{i = 1}^{N} υ_{π i} \overset{p}{\to} 0

. Since

υ_{π i} \sim I I D (0, Σ_{π})

by Assumption 3, we have

E (\frac{1}{N} \sum_{i = 1}^{N} υ_{π i}) = 0

and

V a r (\frac{1}{N} \sum_{i = 1}^{N} υ_{π i}) = \frac{1}{N^{2}} \sum_{i = 1}^{N} V a r (υ_{π i}) = O (\frac{1}{N})

, which implies

\frac{1}{N} \sum_{i = 1}^{N} υ_{π i} \overset{p}{\to} 0 .

(A22)

Using (A21) and (A22), we obtain

{\hat{π}}_{M G} \overset{p}{\to} π

as desired.

Next, we establish the asymptotic distribution of

{\hat{π}}_{M G}

. We have

\begin{matrix} \sqrt{N} ({\hat{π}}_{M G} - π) & = & \frac{1}{\sqrt{N}} \sum_{i = 1}^{N} υ_{π i} + \frac{1}{N} \sum_{i = 1}^{N} {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{\sqrt{N} Ξ_{i}^{'} M_{q} F}{T} γ_{i} \\ + \frac{1}{N} \sum_{i = 1}^{N} {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{\sqrt{N} Ξ_{i}^{'} M_{q} ε_{i}}{T} . \end{matrix}

(A23)

Using the result (A14) in Lemma A5, when the rank condition is satisfied, we have

\frac{\sqrt{N} Ξ_{i}^{'} M_{q} F}{T} = O_{p} (\frac{1}{\sqrt{N}}) + O_{p} (\frac{1}{\sqrt{T}}),

which, together with the assumption of

γ_{i}

to be bounded, and the results of Lemma A2, A5 and Lemma A6, we obtain

{(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{\sqrt{N} Ξ_{i}^{'} M_{q} F}{T} γ_{i} = O_{p} (\frac{1}{\sqrt{N}}) + O_{p} (\frac{1}{\sqrt{T}}),

uniformly over i, it follows that

\frac{1}{N} \sum_{i = 1}^{N} {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{\sqrt{N} Ξ_{i}^{'} M_{q} F}{T} γ_{i} = O_{p} (\frac{1}{\sqrt{N}}) + O_{p} (\frac{1}{\sqrt{T}}) .

(A24)

For the third term, by Lemmas A2 and A6, we have

\begin{matrix} \frac{1}{N} \sum_{i = 1}^{N} {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{\sqrt{N} Ξ_{i}^{'} M_{q} ε_{i}}{T} \\ = & \frac{1}{\sqrt{N}} \sum_{i = 1}^{N} {(\frac{Ξ_{i}^{'} M_{g} Ξ_{i}}{T})}^{- 1} \frac{Ξ_{i}^{'} M_{g} ε_{i}}{T} + O_{p} (\frac{1}{\sqrt{N}}) + O_{p} (\frac{1}{\sqrt{T}}) . \end{matrix}

Moreover, the result (A13) of Lemma A5 implies

T^{- 1} Ξ_{i}^{'} M_{g} Ξ_{i} = O_{p} (1)

, and

T^{- 1} Ξ_{i}^{'} M_{g} ε_{i} = O_{p} (T^{- 1 / 2})

by (A18); hence, we have

\frac{1}{N} \sum_{i = 1}^{N} {(\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T})}^{- 1} \frac{\sqrt{N} Ξ_{i}^{'} M_{q} ε_{i}}{T} = O_{p} (\frac{1}{\sqrt{N}}) + O_{p} (\frac{1}{\sqrt{T}}) .

(A25)

Using (A24) and (A25) in (A23), we can obtain

\sqrt{N} ({\hat{π}}_{M G} - π) \overset{d}{\sim} \frac{1}{\sqrt{N}} \sum_{i = 1}^{N} υ_{π i} .

By the random coefficient assumption, it now follows that

\sqrt{N} ({\hat{π}}_{M G} - π) \overset{d}{\to} N (0, Σ_{M G}),

and

Σ_{M G}

can be consistently estimated nonparametrically by

{\hat{Σ}}_{M G} = \frac{1}{N - 1} \sum_{i = 1}^{N} ({\hat{π}}_{i} - {\hat{π}}_{M G}) {({\hat{π}}_{i} - {\hat{π}}_{M G})}^{'} .

□

Appendix A.3. Proofs of Lemmas

Notation: All vectors are column vectors represented by bold lower case letters, and matrices are represented by bold capital letters. Let

∥A∥ = \sqrt{t r ({AA}^{'})}

denote the Frobenius norm.

{∥A∥}_{1} = {max}_{1 \leq j \leq n} Σ_{i = 1}^{n} |a_{i j}|

and

{∥A∥}_{\infty} = {max}_{1 \leq i \leq n} Σ_{j = 1}^{n} |a_{i j}|

denote the maximum absolute column and row sum matrix norms, respectively.

λ_{min} (A)

denotes the minimum eigenvalue of

A,

and

λ_{max} (A)

denotes the maximum eigenvalue of

A .

A^{+}

denotes the Moore–Penrose inverse of

A,

and

r k (A)

denotes the rank of

A .

We also let K denote a generic finite constant, which does not depend on N or T, and whose value may vary case by case.

Proof of Lemma A2.

Since

H = G \bar{P}

, where

\bar{P} = (\begin{matrix} 1 & {\bar{c}}_{z}^{'} & {\bar{c}}_{z}^{'} & \dots & {\bar{c}}_{z}^{'} \\ 0 & Ψ^{'} (L) & 0 & \dots & 0 \\ 0 & 0 & Ψ^{'} (L) & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & 0 & \dots & Ψ^{'} (L) \end{matrix}),

with

Ψ (L) = Λ (L) C + O_{p} (N^{- 1 / 2})

,

Λ (L)

is invertible. If the rank condition (6) is satisfied, i.e.,

C

is a full column rank matrix, then

Λ (L) C

has a full column rank; hence,

Ψ^{'} (L)

has full row rank asymptotically, which implies

{lim}_{N \to \infty} \bar{P}

is a full row rank matrix. Moreover, noting that when the rank condition holds, matrix

G^{'} G

is full rank, so we have

\begin{matrix} M_{h} & = & I - H {(H^{'} H)}^{+} H^{'} \\ = & I - G \bar{P} {({\bar{P}}^{'} G^{'} G \bar{P})}^{+} {\bar{P}}^{'} G^{'} \\ = & I - G \bar{P} {\bar{P}}^{+} {(G^{'} G)}^{+} {\bar{P}}^{' +} {\bar{P}}^{'} G^{'} \\ = & I - G \bar{P} {\bar{P}}^{'} {(\bar{P} {\bar{P}}^{'})}^{- 1} {(G^{'} G)}^{+} {(\bar{P} {\bar{P}}^{'})}^{- 1} \bar{P} {\bar{P}}^{'} G^{'} \\ = & I - G {(G^{'} G)}^{+} G^{'} = M_{g} . \end{matrix}

where the third equality follows from Lemma A1(a) since

\bar{P}

has the full row rank asymptotically, and the fourth equality is based on the result of Lemma A1(b). □

Proof of Lemma A3.

Denote

Ξ_{i} = {(ξ_{i, p_{T} + 1}, ξ_{i, p_{T} + 2}, \dots, ξ_{i T})}^{'}

, where

ξ_{i t} = {(y_{i, t - 1}, x_{i, t}^{'})}^{'}

. Note that

z_{i t} = {(y_{i, t}, x_{i, t}^{'})}^{'}

, so we can write

y_{i, t - 1} = S_{y}^{'} z_{i, t - 1}

and

x_{i, t} = S_{x}^{'} z_{i t}

, where

S_{y}^{'} = (\begin{matrix} 1 & \underset{1 \times k}{0} \end{matrix})

,

S_{x}^{'} = (\begin{matrix} \underset{k \times 1}{0} & I_{k} \end{matrix})

. Hence, we have

ξ_{i t} = (\begin{matrix} y_{i, t - 1} \\ x_{i, t} \end{matrix}) = (\begin{matrix} S_{y}^{'} z_{i, t - 1} \\ S_{x}^{'} z_{i t} \end{matrix}) .

(A26)

We also note that

\begin{matrix} z_{i t} & = & \sum_{l = 0}^{\infty} A_{i}^{l} (c_{z i} + A_{0 i}^{- 1} C_{i} f_{t - l} + e_{z i, t - l}) \\ = & {(I_{k + 1} - A_{i})}^{- 1} c_{z i} + {(I_{k + 1} - A_{i} L)}^{- 1} A_{0 i}^{- 1} (C_{i} f_{t} + e_{i t}), \end{matrix}

(A27)

using (A27) into (A26), we have

\begin{matrix} ξ_{i t} & = & (\begin{matrix} S_{y}^{'} {(I_{k + 1} - A_{i})}^{- 1} \\ S_{x}^{'} {(I_{k + 1} - A_{i})}^{- 1} \end{matrix}) c_{z i} + (\begin{matrix} S_{y}^{'} {(I_{k + 1} - A_{i} L)}^{- 1} L \\ S_{x}^{'} {(I_{k + 1} - A_{i} L)}^{- 1} \end{matrix}) A_{0 i}^{- 1} (C_{i} f_{t} + e_{i t}) \\ = & {(S_{y}, S_{x})}^{'} {(I_{k + 1} - A_{i})}^{- 1} c_{z i} + (\begin{matrix} S_{y}^{'} {(I_{k + 1} - A_{i} L)}^{- 1} L \\ S_{x}^{'} {(I_{k + 1} - A_{i} L)}^{- 1} \end{matrix}) A_{0 i}^{- 1} (C_{i} f_{t} + e_{i t}) \\ \equiv & c_{ξ i} + Ψ_{ξ i} (C_{i} f_{t} + e_{i t}) . \end{matrix}

Consequently, we have

\begin{matrix} Ξ_{i} & = & (\begin{matrix} ξ_{i, p_{T} + 1}^{'} \\ ξ_{i, p_{T} + 2}^{'} \\ ⋮ \\ ξ_{i T}^{'} \end{matrix}) = (\begin{matrix} c_{ξ i}^{'} + f_{p_{T} + 1}^{'} C_{i}^{'} Ψ_{ξ i}^{'} \\ c_{ξ i}^{'} + f_{p_{T} + 2}^{'} C_{i}^{'} Ψ_{ξ i}^{'} \\ ⋮ \\ c_{ξ i}^{'} + f_{T}^{'} C_{i}^{'} Ψ_{ξ i}^{'} \end{matrix}) + (\begin{matrix} e_{i, p_{T} + 1}^{'} \\ e_{i, p_{T} + 2}^{'} \\ ⋮ \\ e_{i T}^{'} \end{matrix}) Ψ_{ξ i}^{'} \\ = & (\begin{matrix} 1 & f_{p_{_{T}} + 1}^{'} & f_{p_{_{T}}}^{'} & \dots & f_{1}^{'} \\ 1 & f_{p_{_{T}} + 2}^{'} & f_{p_{_{T}} + 1}^{'} & \dots & f_{2}^{'} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & f_{T}^{'} & f_{T - 1}^{'} & \dots & f_{T - P_{T}}^{'} \end{matrix}) (\begin{matrix} c_{ξ i}^{'} \\ C_{i}^{'} Ψ_{ξ i}^{'} \\ 0 \\ ⋮ \\ 0 \end{matrix}) + e_{i} Ψ_{ξ i}^{'} \\ = & G_{1} Π_{i 1} + Ω_{i}, \end{matrix}

or more concisely as

Ξ_{i} = (\begin{matrix} 1 & f_{p_{T} + 1}^{'} \\ 1 & f_{p_{T} + 2}^{'} \\ ⋮ & ⋮ \\ 1 & f_{T}^{'} \end{matrix}) (\begin{matrix} c_{ξ i}^{'} \\ C_{i}^{'} Ψ_{ξ i}^{'} \end{matrix}) + e_{i} Ψ_{ξ i}^{'} = G_{2} Π_{i 2} + Ω_{i} .

□

Proof of Lemma A4.

We consider (A7) firstly. Note that

{\bar{V}}^{*} = (0, \tilde{V}) = [\begin{matrix} 0 & {\bar{v}}_{p_{T} + 1}^{'} & {\bar{v}}_{p_{T}}^{'} & \dots & {\bar{v}}_{1}^{'} \\ 0 & {\bar{v}}_{p_{T} + 2}^{'} & {\bar{v}}_{p_{T} + 1}^{'} & \dots & {\bar{v}}_{2}^{'} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & {\bar{v}}_{T}^{'} & {\bar{v}}_{T - 1}^{'} & \dots & {\bar{v}}_{T - p_{T}}^{'} \end{matrix}],

where

{\bar{v}}_{t} = \frac{1}{N} \sum_{i = 1}^{N} {(I_{k + 1} - A_{i} L)}^{- 1} A_{0 i}^{- 1} e_{i t} = \frac{1}{N} \sum_{i = 1}^{N} A_{i}^{l} e_{z i, t - l} .

So we only need to consider

T^{- 1} ({\tilde{V}}^{'} \tilde{V})

, which is a

(k + 1) (p_{T} + 1) \times (k + 1) (p_{T} + 1)

matrix. Since the elements of

e_{z i t}

are weakly cross-sectionally dependent, together with the random coefficient assumptions, we have

E ∥{\bar{v}}_{t}∥ = O (N^{- \frac{1}{2}})

and

E {∥{\bar{v}}_{t}∥}^{2} = O (N^{- 1})

. Consider the

(s, r) t h

block element of

T^{- 1} ({\tilde{V}}^{'} \tilde{V})

, which can be written as

T^{- 1} (\sum_{t = p_{T} + 1}^{T} {\bar{v}}_{t - s} {\bar{v}}_{t - r}^{'})

, for

s, r \in {0, 1, \dots, p_{T}}

. where the cross-product terms with finite means and variances. Hence,

E ∥\frac{1}{T} \sum_{t = p_{T} + 1}^{T} {\bar{v}}_{t - s} {\bar{v}}_{t - r}^{'}∥ \leq \frac{1}{T} \sum_{t = p_{T} + 1}^{T} E {∥{\bar{v}}_{t}∥}^{2} = O (\frac{1}{N}),

(A28)

then we have

E {∥\frac{{\tilde{V}}^{'} \tilde{V}}{T}∥}_{\infty} \leq O (\frac{p_{T}}{N}),

which establishes (A7).

Now, we establish (A8), as before, we consider

T^{- 1} ε_{i}^{'} \tilde{V}

here, and note that the lth column block of

T^{- 1} ε_{i}^{'} \tilde{V}

is

T^{- 1} (\sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{v}}_{t - l}^{'})

, for

l = 0, 1, \dots, p_{T}

, which can be partitioned as

\frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{v}}_{t - l}^{'} = (\begin{matrix} \frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{ε}}_{t - l}, & \frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{u}}_{t - l}^{'} \end{matrix}) .

(A29)

We consider the first term and note that

\begin{matrix} {\bar{ε}}_{t} & = & \frac{1}{N} \sum_{j = 1}^{N} {(I_{k + 1} - A_{j} L)}^{- 1} A_{0 j}^{- 1} ε_{j t} = \frac{1}{N} \sum_{j = 1}^{N} \sum_{l = 0}^{\infty} A_{j}^{l} A_{0 j}^{- 1} ε_{j, t - l} \\ = & \frac{1}{N} \sum_{j = 1}^{N} (A_{0 j}^{- 1} ε_{j t} + A_{j} A_{0 j}^{- 1} ε_{j, t - 1} + A_{j}^{2} A_{0 j}^{- 1} ε_{j, t - 2} + \dots), \end{matrix}

which implies that

\frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{ε}}_{t - l} = \frac{1}{N T} \sum_{t = p_{T} + 1}^{T} \sum_{j = 1}^{N} ε_{i t} (A_{0 j}^{- 1} ε_{j, t - l} + A_{j} A_{0 j}^{- 1} ε_{j, t - l - 1} + \dots) .

Under the assumption of the individual–specific error, we have

c o v (ε_{i t}, ε_{j s}) = 0, i \neq j

; hence,

\frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{ε}}_{t - l} = \frac{1}{N T} \sum_{t = p_{T} + 1}^{T} ε_{i t} (A_{0 i}^{- 1} ε_{i, t - l} + A_{i} A_{0 i}^{- 1} ε_{i, t - l - 1} + \dots),

when

l = 0

,

\frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{ε}}_{t - l} = \frac{A_{0 i}^{- 1}}{N T} \sum_{t = p_{T} + 1}^{T} ε_{i t}^{2}

, since by Assumption 1,

E (ε_{i t}^{2}) = O (1)

, and

∥A_{0 i}^{- 1}∥ \leq K

, then it easily follows that

\frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{ε}}_{t - l} = O_{p} (\frac{1}{N})

. When

l = 1, 2, \dots, p_{T}

, we have the result

\frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{ε}}_{t - l} = O_{p} (\frac{1}{N})

since

ε_{i t}

is a serial uncorrelated covariance stationary process under Assumption 2. Combining these results yields

\frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{ε}}_{t - l} = O_{p} (\frac{1}{N}), uniformly over i .

(A30)

Now, we consider the second term

\frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{u}}_{t - l}^{'}

, noting that

ε_{i t}

and

u_{i t}

are independently distributed stationary processes with zero means, it follows that

sup_{i} V a r (\frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{u}}_{t - l}^{'}) = O (\frac{1}{N}) [\frac{1}{T^{2}} \sum_{t = p_{T} + 1}^{T} \sum_{t^{'} = p_{T} + 1}^{T} E (ε_{i t} ε_{i t^{'}}) = O (\frac{1}{N T})],

which follows that

\frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{u}}_{t - l}^{'} = O_{p} (\frac{1}{\sqrt{N T}}), uniformly over i .

(A31)

Using (A30) and (A31) in (A29), we have

\frac{1}{T} \sum_{t = p_{T} + 1}^{T} ε_{i t} {\bar{v}}_{t - l}^{'} = O_{p} (\frac{1}{N}) + O_{p} (\frac{1}{\sqrt{N T}}), uniformly over i .

Consequently, we have

{∥\frac{ε_{i}^{'} \tilde{V}}{T}∥}_{\infty} = O_{p} (\frac{p_{T}}{N}) + O_{p} (\frac{p_{T}}{\sqrt{N T}}), uniformly over i,

hence, the first part of (A8) is established. Similarly, the result for

T^{- 1} U_{i}^{'} {\bar{V}}^{*}

of the second part of (A8) is established.

For the first part of (A9), since

G = (τ, \tilde{F})

and

{\bar{V}}^{*} = (0, \tilde{V})

, we consider

\begin{matrix} \frac{{\tilde{F}}^{'} \tilde{V}}{T} & = & \frac{1}{T} (\begin{matrix} f_{p_{_{T}} + 1} & f_{p_{_{T}} + 2} & \dots & f_{T} \\ f_{p_{_{T}}} & f_{p_{_{T}} + 1} & \dots & f_{T - 1} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ f_{1} & f_{2} & \dots & f_{T - P_{T}} \end{matrix}) (\begin{matrix} {\bar{v}}_{p_{T} + 1}^{'} & {\bar{v}}_{p_{T}}^{'} & \dots & {\bar{v}}_{1}^{'} \\ {\bar{v}}_{p_{T} + 2}^{'} & {\bar{v}}_{p_{T} + 1}^{'} & \dots & {\bar{v}}_{2}^{'} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\bar{v}}_{T}^{'} & {\bar{v}}_{T - 1}^{'} & \dots & {\bar{v}}_{T - p_{T}}^{'} \end{matrix}) \\ = & \frac{1}{T} (\begin{matrix} \sum_{t = p_{T} + 1}^{T} f_{t} {\bar{v}}_{t}^{'} & \sum_{t = p_{T} + 1}^{T} f_{t} {\bar{v}}_{t - 1}^{'} & \dots & \sum_{t = p_{T} + 1}^{T} f_{t} {\bar{v}}_{t - p_{T}}^{'} \\ \sum_{t = p_{T} + 1}^{T} f_{t - 1} {\bar{v}}_{t}^{'} & \sum_{t = p_{T} + 1}^{T} f_{t - 1} {\bar{v}}_{t - 1}^{'} & \dots & \sum_{t = p_{T} + 1}^{T} f_{t - 1} {\bar{v}}_{t - p_{T}}^{'} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ \sum_{t = p_{T} + 1}^{T} f_{t - p_{T}} {\bar{v}}_{t}^{'} & \sum_{t = p_{T} + 1}^{T} f_{t - p_{T}} {\bar{v}}_{t - 1}^{'} & \dots & \sum_{t = p_{T} + 1}^{T} f_{t - p_{T}} {\bar{v}}_{t - p_{T}}^{'} \end{matrix}), \end{matrix}

which is a

m (p_{T} + 1) \times (k + 1) (p_{T} + 1)

matrix. Without loss of generality, we consider the first block element,

T^{- 1} \sum_{t = p_{T} + 1}^{T} f_{t} {\bar{v}}_{t}^{'}

, and note that the lth row of that can be written as

T^{- 1} \sum_{t = p_{T} + 1}^{T} f_{t l} {\bar{v}}_{t}^{'},

l = 1, 2, \dots, m

. According to the assumption of

f_{t l}

and

{\bar{v}}_{t}

(independently distributed processes), it easily follows that

∥E \frac{1}{T} \sum_{t = p_{T} + 1}^{T} f_{t l} {\bar{v}}_{t}^{'}∥ = 0,

and

V a r (\frac{1}{T} \sum_{t = p_{T} + 1}^{T} f_{t l} {\bar{v}}_{t}^{'}) = O (\frac{1}{N}) [\frac{_{\sum_{t} \sum_{t^{'}} E (f_{t l} f_{t^{'} l})}}{T^{2}}] = O (\frac{1}{N}) .

by the standard unit root asymptotic analysis result

T^{- 2} \sum_{t = 1}^{T} \sum_{t^{'} = 1}^{T} E (f_{t l} f_{t^{'} l}) = O (1)

, which establishes that

T^{- 1} \sum_{t = p_{T} + 1}^{T} f_{t l} {\bar{v}}_{t}^{'}

converges to its limit at the desired rate of

O_{p} (\frac{1}{\sqrt{N}})

. Consequently, we have

{∥\frac{{\tilde{F}}^{'} \tilde{V}}{T}∥}_{\infty} = O_{p} (\frac{p_{T}}{\sqrt{N}}),

then the first part of (A9) is proven.

To establish the second part of (A9), recalling that

H = G \bar{P}

, we have

{∥\frac{H^{'} {\bar{V}}^{*}}{T}∥}_{\infty} \leq {∥{\bar{P}}^{'}∥}_{\infty} {∥\frac{G^{'} {\bar{V}}^{*}}{T}∥}_{\infty} = O_{p} (\frac{p_{T}}{\sqrt{N}})

, since the norm of

\bar{P}

is assumed to be bounded.

To establish the third part of (A9), noting that

Ξ_{i} = G Π_{i} + Ω_{i}

, and using triangle inequality and the submultiplicative property of matrix norm

{∥\cdot∥}_{\infty}

, we have

\begin{matrix} {∥\frac{Ξ_{i}^{'} {\bar{V}}^{*}}{T}∥}_{\infty} & = & {∥Π_{i}^{'} \frac{G^{'} {\bar{V}}^{*}}{T} + \frac{Ω_{i}^{'} {\bar{V}}^{*}}{T}∥}_{\infty} \\ \leq & {∥Π_{i}^{'} \frac{G^{'} {\bar{V}}^{*}}{T}∥}_{\infty} + {∥Ψ_{ξ i} (L) \frac{{(ε_{i}, U_{i})}^{'} {\bar{V}}^{*}}{T}∥}_{\infty} \\ \leq & {∥Π_{i}^{'}∥}_{\infty} {∥\frac{G^{'} {\bar{V}}^{*}}{T}∥}_{\infty} + {∥Ψ_{ξ i} (L)∥}_{\infty} {∥\frac{{(ε_{i}, U_{i})}^{'} {\bar{V}}^{*}}{T}∥}_{\infty} \\ = & O_{p} (\frac{p_{T}}{\sqrt{N}}), \end{matrix}

by (A8), the first and second parts of (A9), as well as the norm of

Π_{i}

and

Ψ_{ξ i} (L)

are assumed to be bounded in probability uniformly over i.

To establish the first part of (A10), recalling

G = (τ, \tilde{F})

, consider the

m (p_{T} + 1) \times 1

vector

T^{- 1} {\tilde{F}}^{'} ε_{i}

, the element of

T^{- 1} {\tilde{F}}^{'} ε_{i}

can be written as

T^{- 1} \sum_{t = p_{T} + 1}^{T} f_{t - l, s} ε_{i t}

,

l = 0, 1, \dots, p_{T},

s = 1, 2, \dots, m

. Since by the assumption of

f_{t - l, s}

and

ε_{i t}

(independently distributed processes), it easily follows that

∥E \frac{1}{T} \sum_{t = p_{T} + 1}^{T} f_{t - l, s} ε_{i t}∥ = 0,

and

sup_{i} V a r (\frac{1}{T} \sum_{t = p_{T} + 1}^{T} f_{t - l, s} ε_{i t}) = O (1) [\frac{_{\sum_{t} \sum_{t^{'}} E (f_{t - l, s} f_{t^{'} - l, s})}}{T^{2}}] = O (1) .

by the standard unit root asymptotic analysis result

T^{- 2} \sum_{t} \sum_{t^{'}} E (f_{t - l, s} f_{t^{'} - l, s}) = O (1)

, which establishes that

\frac{1}{T} \sum_{t = p_{T} + 1}^{T} f_{t - l, s} ε_{i t}

converges to its limit at the desired rate of

O_{p} (1)

. It follows that

{∥T^{- 1} {\tilde{F}}^{'} ε_{i}∥}_{\infty} = O_{p} (1)

; hence, the first part of (A10) is established. Moreover, the second part of (A10) can be proven similarly.

Recalling that

\bar{Q} = G \bar{P} + {\bar{V}}^{*}

, the third part of (A10) is established because

\begin{matrix} {∥\frac{{\bar{Q}}^{'} ε_{i}}{T}∥}_{\infty} & = & {∥{\bar{P}}^{'} \frac{G^{'} ε_{i}}{T} + \frac{{\bar{V}}^{*'} ε_{i}}{T}∥}_{\infty} \\ \leq & {∥{\bar{P}}^{'}∥}_{\infty} {∥\frac{G^{'} ε_{i}}{T}∥}_{\infty} + {∥\frac{{\bar{V}}^{*'} ε_{i}}{T}∥}_{\infty} = O_{p} (1), \end{matrix}

by (A8) and the first part of (A10).

For the first part of (A11), note that

G = (τ, \tilde{F}),

we only need to consider the

T^{- 2} {\tilde{F}}^{'} \tilde{F}

, a

m (p_{T} + 1) \times m (p_{T} + 1)

matrix. We consider the

(s, r) t h

block element of

T^{- 2} {\tilde{F}}^{'} \tilde{F}

, which can be written as

T^{- 2} (\sum_{t = p_{T} + 1}^{T} f_{t - s} f_{t - r}^{'})

, for

s, r \in {0, 1, \dots, p_{T}}

. Without loss of generality, we consider the first block,

T^{- 2} \sum_{t = p_{T} + 1}^{T} f_{t} f_{t}^{'}

, and the element of

T^{- 2} \sum_{t = p_{T} + 1}^{T} f_{t} f_{t}^{'}

can be written as

T^{- 2} \sum_{t = p_{T} + 1}^{T} f_{t l} f_{t l^{'}},

l, l^{'} \in {1, 2, \dots, m}

. By the standard unit root asymptotic analysis, we have

\frac{1}{T^{2}} \sum_{t = p_{T} + 1}^{T} E (f_{t l} f_{t l^{'}}) = O (1),

which implies that

T^{- 2} \sum_{t = p_{T} + 1}^{T} f_{t} f_{t}^{'} = O_{p} (1)

, then we have

{∥T^{- 2} {\tilde{F}}^{'} \tilde{F}∥}_{\infty} = O_{p} (p_{T})

, which establishes the first part of (A11). The second part of (A11) is established by

\begin{matrix} {∥\frac{H^{'} H}{T^{2}}∥}_{\infty} & = & {∥{\bar{P}}^{'} \frac{G^{'} G}{T^{2}} \bar{P}∥}_{\infty} \\ \leq & {∥{\bar{P}}^{'}∥}_{\infty} {∥\frac{G^{'} G}{T^{2}}∥}_{\infty} {∥\bar{P}∥}_{\infty} = O_{p} (p_{T}), \end{matrix}

since the norm of

\bar{P}

is assumed bounded (and the above result).

To prove the third part of (A11), note that

\bar{Q} = H + {\bar{V}}^{*}

, by (A7), the second part of (A9), and the previous result in (A11), we have

\begin{matrix} {∥\frac{{\bar{Q}}^{'} \bar{Q}}{T^{2}}∥}_{\infty} & = & \frac{{(H + {\bar{V}}^{*})}^{'} (H + {\bar{V}}^{*})}{T^{2}} \\ \leq & {∥\frac{H^{'} H}{T^{2}}∥}_{\infty} + {∥\frac{H^{'} {\bar{V}}^{*}}{T^{2}}∥}_{\infty} + {∥\frac{{\bar{V}}^{*'} H}{T^{2}}∥}_{\infty} + {∥\frac{{\bar{V}}^{*'} {\bar{V}}^{*}}{T^{2}}∥}_{\infty} \\ = & O_{p} (p_{T}) . \end{matrix}

Noting that

\bar{Q} = G \bar{P} + {\bar{V}}^{*}

, by the first part of (A9) and (A11), the first part of (A12) can be established.

To establish the second part of (A12), note that

H = G \bar{P}

and

Ξ_{i} = G Π_{i} + Ω_{i}

, and recalling that

Ω_{i} = e_{i} Ψ_{ξ i}^{'} (L)

with

e_{i} = (ε_{i}, U_{i})

, we have

\begin{matrix} {∥\frac{H^{'} Ξ_{i}}{T^{2}}∥}_{\infty} & = & {∥\frac{{\bar{P}}^{'} G^{'} (G Π_{i} + Ω_{i})}{T^{2}}∥}_{\infty} \leq {∥{\bar{P}}^{'} \frac{G^{'} G}{T^{2}} Π_{i}∥}_{\infty} + {∥{\bar{P}}^{'} \frac{G^{'} Ω_{i}}{T^{2}}∥}_{\infty} \\ \leq & {∥{\bar{P}}^{'}∥}_{\infty} {∥\frac{G^{'} G}{T^{2}}∥}_{\infty} {∥Π_{i}∥}_{\infty} + {∥{\bar{P}}^{'}∥}_{\infty} {∥\frac{G^{'} Ω_{i}}{T^{2}}∥}_{\infty} = O_{p} (p_{T}) . \end{matrix}

by (A10) and the first part of (A11), as well as the assumption that the norm of

\bar{P}

,

Π_{i}

, and

Ψ_{ξ i} (L)

is assumed bounded in probability uniformly over i. The third part of (A12) is proven straightforwardly since

\bar{Q} = H + {\bar{V}}^{*}

, using (A8) and the second part of (A12). □

Proof of Lemma A5.

To proof (A13), we note that

Ξ_{i} = G Π_{i} + Ω_{i},

(A32)

where

G = (τ, F, F_{- 1}, \dots, F_{- p_{T}})

is a matrix of

I (1)

factors,

Ω_{i} = e_{i} Ψ_{ξ i}^{'}

with

e_{i} = (ε_{i}, U_{i})

. Denote the OLS estimator of the multiple regression (A32) as

{\hat{Π}}_{i} = {(G^{'} G)}^{- 1} G^{'} Ξ_{i} .

Since that

Ξ_{i}^{'} M_{g} Ξ_{i} = Ω_{i}^{'} M_{g} Ω_{i} = {\hat{Ω}}_{i}^{'} {\hat{Ω}}_{i}

, where

{\hat{Ω}}_{i}

is the OLS residuals, i.e.,

{\hat{Ω}}_{i} = Ξ_{i} - G {\hat{Π}}_{i}

, and in the light of Assumption,

T^{- 1} (Ω_{i}^{'} Ω_{i}) \to Σ_{Ω_{i}}

, we only need to show that

T^{- 1} ({\hat{Ω}}_{i}^{'} {\hat{Ω}}_{i}) - T^{- 1} (Ω_{i}^{'} Ω_{i}) \to 0

. In fact, we can write

\begin{matrix} T^{- 1} ({\hat{Ω}}_{i}^{'} {\hat{Ω}}_{i}) - T^{- 1} (Ω_{i}^{'} Ω_{i}) & = & T^{- 1} {\hat{Ω}}_{i}^{'} ({\hat{Ω}}_{i} - Ω_{i}) + T^{- 1} ({\hat{Ω}}_{i} - Ω_{i}) Ω_{i} \\ = & - T^{- 1} Ξ_{i}^{'} M_{g} G ({\hat{Π}}_{i} - Π_{i}) - T^{- 1} ({\hat{Π}}_{i} - Π_{i}) G^{'} Ω_{i} \\ = & - ({\hat{Π}}_{i} - Π_{i}) (T^{- 1} G^{'} Ω_{i}), \end{matrix}

because

M_{g} G = 0 .

However, since

{∥T^{- 1} G^{'} Ω_{i}∥}_{\infty} = O_{p} (1)

by (A10) of Lemma A4,

{\hat{Π}}_{i} - Π_{i} = O_{p} (T^{- 1})

, it follows that

T^{- 1} ({\hat{Ω}}_{i}^{'} {\hat{Ω}}_{i}) - T^{- 1} (Ω_{i}^{'} Ω_{i}) = O_{p} (T^{- 1}) .

Hence,

\frac{Ξ_{i}^{'} M_{g} Ξ_{i}}{T} \overset{p}{\to} Σ_{Ω_{i}}

, (A13) is established.

To prove (A14), we follow the same spirit of Lemma A.4 in Kapetanios et al. (2011), but need more attention because of the lags. Specifically, note that

M_{q} \bar{Q} = M_{q} (G \bar{P} + {\bar{V}}^{*}),

since

\bar{Q} = G \bar{P} +

{\bar{V}}^{*}

, where

\bar{Q} =

(τ, \bar{Z}, {\bar{Z}}_{- 1}, \dots, {\bar{Z}}_{- p_{T}})

,

G = (τ, \tilde{F}) = (τ, F, F_{- 1}, \dots, F_{- p_{T}})

and

{\bar{V}}^{*} = (0, \bar{V}, {\bar{V}}_{- 1}, \dots, {\bar{V}}_{- p_{T}})

. However,

M_{q} \bar{Q} = 0

and

M_{q} τ = 0

since

τ \in \bar{Q}

. Then

(0, M_{q} \tilde{F}) (\begin{matrix} 1 & {\tilde{c}}_{z} \\ 0 & \tilde{Ψ} \end{matrix}) + (0, M_{q} {\bar{V}}^{*}) = 0,

or

M_{q} \tilde{F} \tilde{Ψ} = - M_{q} {\bar{V}}^{*}

.

For the second column block of the above equation, we have

M_{q} F Ψ^{'} (L) = - M_{q} \bar{V}

or

M_{q} {FC}^{'} Λ^{'} (L)

=

- M_{q} \bar{V}

as

N \overset{P}{\to} \infty

, since

\tilde{Ψ} = d i a g (Ψ^{'} (L))

and

Ψ (L) =

Λ (L) C + O_{p} (N^{- 1 / 2})

, Hence

{\bar{V}}^{'} M_{q} F C^{'} Λ^{'} (L) = - {\bar{V}}^{'} M_{q} \bar{V},

(A33)

and

Ξ_{i}^{'} M_{q} F C^{'} Λ^{'} (L) = - Ξ_{i}^{'} M_{q} \bar{V} .

(A34)

Since

Λ (L)

is invertible under the assumption, then (A34) can be rewritten as

Ξ_{i}^{'} M_{q} F C^{'} = - Ξ_{i}^{'} M_{q} \bar{V} Λ^{- 1} (L) .

When the rank condition is satisfied, we have

Ξ_{i}^{'} M_{q} F = - Ξ_{i}^{'} M_{q} \bar{V} Λ^{- 1} (L) C {(C^{'} C)}^{- 1} .

(A35)

Note that

Ξ_{i}

can be written as

Ξ_{i} = G_{2 i} Π_{2 i} + Ω_{i}

, where

G_{2 i} = (τ, F)

and

Π_{2 i} = {(c_{ξ i}, Ψ_{ξ i} C_{i})}^{'}

, then

\begin{matrix} Ξ_{i}^{'} M_{q} \bar{V} & = & {(G_{2 i} Π_{2 i} + Ω_{i})}^{'} M_{q} \bar{V} = Π_{2 i}^{'} G_{2 i}^{'} M_{q} \bar{V} + Ω_{i}^{'} M_{q} \bar{V} \\ = & (c_{ξ i}^{'}, C_{i}^{'} Ψ_{ξ i}^{'}) (\begin{matrix} τ^{'} \\ F^{'} \end{matrix}) M_{q} \bar{V} + Ω_{i}^{'} M_{q} \bar{V} \\ = & (c_{ξ i}^{'}, C_{i}^{'} Ψ_{ξ i}^{'}) (\begin{matrix} 0 \\ F^{'} M_{q} \bar{V} \end{matrix}) + Ω_{i}^{'} M_{q} \bar{V} \\ = & C_{i}^{'} Ψ_{ξ i}^{'} F^{'} M_{q} \bar{V} + Ψ_{ξ i} e_{i}^{'} M_{q} \bar{V} . \end{matrix}

(A36)

Substituting (A36) into (A35), we obtain

Ξ_{i}^{'} M_{q} F = - C_{i}^{'} Ψ_{ξ i}^{'} F^{'} M_{q} \bar{V} Λ^{- 1} (L) C {(C^{'} C)}^{- 1} - Ψ_{ξ i} e_{i}^{'} M_{q} \bar{V} Λ^{- 1} (L) C {(C^{'} C)}^{- 1} .

(A37)

Moreover, from (A33),

Λ (L) C F^{'} M_{q} \bar{V} = - {\bar{V}}^{'} M_{q} \bar{V}

, which directly follows

F^{'} M_{q} \bar{V} = - {(C^{'} C)}^{- 1} C^{'} Λ^{- 1} (L) {\bar{V}}^{'} M_{q} \bar{V},

(A38)

under the assumption of

Λ (L)

is invertible and the rank condition is satisfied. Then, using this result in (A37), we have

\begin{matrix} ∥\frac{Ξ_{i}^{'} M_{q} F}{T}∥ & = & ∥C_{i}^{'}∥ ∥Ψ_{ξ i}^{'}∥ {∥{(C^{'} C)}^{- 1} C^{'}∥}^{2} {∥Λ^{- 1} (L)∥}^{2} ∥\frac{{\bar{V}}^{'} M_{q} \bar{V}}{T}∥ \\ + ∥Ψ_{ξ i}∥ ∥\frac{e_{i}^{'} M_{q} \bar{V}}{T}∥ ∥Λ^{- 1} (L)∥ ∥C {(C^{'} C)}^{- 1}∥ . \end{matrix}

(A39)

Since the norms of

C_{i}

,

Λ^{- 1} (L)

and

Ψ_{ξ i}

are assumed to be bounded, we need to establish the probability orders of

∥{\bar{V}}^{'} M_{q} \bar{V} / T∥

and

∥e_{i}^{'} M_{q} \bar{V} / T∥

. For

{\bar{V}}^{'} M_{q} \bar{V} / T

, since

\bar{V}

is a

(T - p_{T}) \times (k + 1)

submatrix of

{\bar{V}}^{*}

, (A7) and (A9) imply

{\bar{V}}^{'} \bar{V} / T = O_{p} (N^{- 1})

and

{∥{\bar{Q}}^{'} \bar{V} / T∥}_{\infty} = O_{p} (N^{- 1 / 2})

, which together with (A7), we obtain

\frac{{\bar{V}}^{'} M_{q} \bar{V}}{T} = O_{p} (\frac{1}{N}) .

Similarly, by (A7)–(A9),

\frac{e_{i}^{'} M_{q} \bar{V}}{T} = O_{p} (\frac{1}{N}) + O_{p} (\frac{1}{\sqrt{N T}}), uniformly over i,

Substituting the above two results into (A39) establishes the result. □

Proof of Lemma A6.

To prove (A15), we need to determine the order of probability of

{∥\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T} - \frac{Ξ_{i}^{'} M_{h} Ξ_{i}}{T}∥}_{\infty}

, by the triangle inequality of the matrix norm

{∥\cdot∥}_{\infty}

, which equals

\begin{matrix} {∥\frac{Ξ_{i}^{'} \bar{Q} {({\bar{Q}}^{'} \bar{Q})}^{- 1} {\bar{Q}}^{'} Ξ_{i}}{T} - \frac{Ξ_{i}^{'} H {(H^{'} H)}^{- 1} H^{'} Ξ_{i}}{T}∥}_{\infty} \\ \leq & {∥\frac{1}{T} (Ξ_{i}^{'} \bar{Q} - Ξ_{i}^{'} H) {({\bar{Q}}^{'} \bar{Q})}^{- 1} {\bar{Q}}^{'} Ξ_{i}∥}_{\infty} \\ + {∥\frac{1}{T} Ξ_{i}^{'} H ({({\bar{Q}}^{'} \bar{Q})}^{- 1} - {(H^{'} H)}^{- 1}) {\bar{Q}}^{'} Ξ_{i}∥}_{\infty} \\ + {∥\frac{1}{T} Ξ_{i}^{'} H {(H^{'} H)}^{- 1} ({\bar{Q}}^{'} Ξ_{i} - H^{'} Ξ_{i})∥}_{\infty} . \end{matrix}

(A40)

Using the results of Lemma A.3 and the submultiplicative property of the matrix norm, and noting that

\bar{Q} = H + {\bar{V}}^{*}

, we focus on the individual elements on the right side of (A40).

For the first term, we have

\begin{matrix} {∥\frac{1}{T} (Ξ_{i}^{'} \bar{Q} - Ξ_{i}^{'} H) {({\bar{Q}}^{'} \bar{Q})}^{- 1} {\bar{Q}}^{'} Ξ_{i}∥}_{\infty} & \leq & {∥\frac{Ξ_{i}^{'} {\bar{V}}^{*}}{T}∥}_{\infty} {∥{(\frac{{\bar{Q}}^{'} \bar{Q}}{T^{2}})}^{- 1} \frac{{\bar{Q}}^{'} Ξ_{i}}{T^{2}}∥}_{\infty} \\ = & O_{p} (\frac{p_{T}}{\sqrt{N}}), uniformly over i, \end{matrix}

(A41)

by the third parts of (A9), (A11), and (A12).

For the second term, we also have

\begin{matrix} {∥\frac{1}{T} Ξ_{i}^{'} H ({({\bar{Q}}^{'} \bar{Q})}^{- 1} - {(H^{'} H)}^{- 1}) {\bar{Q}}^{'} Ξ_{i}∥}_{\infty} \\ \leq & {∥\frac{{\bar{V}}^{*^{'}} {\bar{V}}^{*}}{T} + \frac{H^{'} {\bar{V}}^{*}}{T} + \frac{{\bar{V}}^{*^{'}} H}{T}∥}_{\infty} {∥\frac{Ξ_{i}^{'} H}{T^{2}} {(\frac{{\bar{Q}}^{'} \bar{Q}}{T^{2}})}^{- 1}∥}_{\infty} {∥{(\frac{H^{'} H}{T^{2}})}^{- 1} \frac{{\bar{Q}}^{'} Ξ_{i}}{T^{2}}∥}_{\infty} \\ = & O_{p} (\frac{p_{T}}{\sqrt{N}}), uniformly over i, \end{matrix}

(A42)

by (A7), as well as some results in (A9), (A11), and (A12).

Finally, we have

\begin{matrix} {∥\frac{1}{T} Ξ_{i}^{'} H {(H^{'} H)}^{- 1} ({\bar{Q}}^{'} Ξ_{i} - H^{'} Ξ_{i})∥}_{\infty} & \leq & {∥\frac{Ξ_{i}^{'} H}{T^{2}} {(\frac{H^{'} H}{T^{2}})}^{- 1}∥}_{\infty} {∥\frac{Ξ_{i}^{'} {\bar{V}}^{*}}{T}∥}_{\infty} \\ = & O_{p} (\frac{p_{T}}{\sqrt{N}}), uniformly over i, \end{matrix}

(A43)

by (A9), (A11), and (A12). Substituting (A41)–(A43) into (A40), we have

{∥\frac{Ξ_{i}^{'} M_{q} Ξ_{i}}{T} - \frac{Ξ_{i}^{'} M_{h} Ξ_{i}}{T}∥}_{\infty} = O_{p} (\frac{p_{T}}{\sqrt{N}}), uniformly over i,

as required.

To establish result (A16), similar to the proof of (A15), we have

\begin{matrix} {∥\frac{Ξ_{i}^{'} \bar{Q} {({\bar{Q}}^{'} \bar{Q})}^{- 1} {\bar{Q}}^{'} F}{T} - \frac{Ξ_{i}^{'} H {(H^{'} H)}^{- 1} H^{'} F}{T}∥}_{\infty} \\ \leq & {∥\frac{1}{T} (Ξ_{i}^{'} \bar{Q} - Ξ_{i}^{'} H) {({\bar{Q}}^{'} \bar{Q})}^{- 1} {\bar{Q}}^{'} F∥}_{\infty} \\ + {∥\frac{1}{T} Ξ_{i}^{'} H ({({\bar{Q}}^{'} \bar{Q})}^{- 1} - {(H^{'} H)}^{- 1}) {\bar{Q}}^{'} F∥}_{\infty} \\ + {∥\frac{1}{T} Ξ_{i}^{'} H {(H^{'} H)}^{- 1} ({\bar{Q}}^{'} F - H^{'} F)∥}_{\infty}, \end{matrix}

(A44)

then, examine each term of (A44), and note that

F \in G

.

For the first term, the third part of (A9) and (A12) imply

\begin{matrix} {∥\frac{1}{T} (Ξ_{i}^{'} \bar{Q} - Ξ_{i}^{'} H) {({\bar{Q}}^{'} \bar{Q})}^{- 1} {\bar{Q}}^{'} F∥}_{\infty} & \leq & {∥\frac{Ξ_{i}^{'} {\bar{V}}^{*}}{T}∥}_{\infty} {∥{(\frac{{\bar{Q}}^{'} \bar{Q}}{T^{2}})}^{- 1} \frac{{\bar{Q}}^{'} F}{T^{2}}∥}_{\infty} \\ = & O_{p} (\frac{p_{T}}{\sqrt{N}}), uniformly over i . \end{matrix}

(A45)

Using some results in Lemma A4, we have

\begin{matrix} {∥\frac{1}{T} Ξ_{i}^{'} H ({({\bar{Q}}^{'} \bar{Q})}^{- 1} - {(H^{'} H)}^{- 1}) {\bar{Q}}^{'} F∥}_{\infty} \\ \leq & {∥\frac{{\bar{V}}^{*^{'}} {\bar{V}}^{*}}{T} + \frac{H^{'} {\bar{V}}^{*}}{T} + \frac{{\bar{V}}^{*^{'}} H}{T}∥}_{\infty} {∥\frac{Ξ_{i}^{'} H}{T^{2}} {(\frac{{\bar{Q}}^{'} \bar{Q}}{T^{2}})}^{- 1}∥}_{\infty} {∥{(\frac{H^{'} H}{T^{2}})}^{- 1} \frac{{\bar{Q}}^{'} F}{T^{2}}∥}_{\infty} \\ = & O_{p} (\frac{p_{T}}{\sqrt{N}}), uniformly over i . \end{matrix}

(A46)

Finally, by the first part of (A9), the second part of (A11) and (A12), we have

\begin{matrix} {∥\frac{1}{T} Ξ_{i}^{'} H {(H^{'} H)}^{- 1} ({\bar{Q}}^{'} F - H^{'} F)∥}_{\infty} & \leq & {∥\frac{Ξ_{i}^{'} H}{T^{2}} {(\frac{H^{'} H}{T^{2}})}^{- 1}∥}_{\infty} {∥\frac{{\bar{V}}^{*^{'}} F}{T}∥}_{\infty} \\ = & O_{p} (\frac{p_{T}}{\sqrt{N}}), uniformly over i . \end{matrix}

(A47)

Substituting (A45)–(A47) into (A44), we have

{∥\frac{Ξ_{i}^{'} {\bar{M}}_{q} F}{T} - \frac{Ξ_{i}^{'} M_{h} F}{T}∥}_{\infty} = O_{p} (\frac{p_{T}}{\sqrt{N}}), uniformly over i,

which completes the proof of (A16).

Result (A17) can also be established in a similar way, we have

\begin{matrix} {∥\frac{Ξ_{i}^{'} {\bar{M}}_{q} ε_{i}}{T} - \frac{Ξ_{i}^{'} M_{h} ε_{i}}{T}∥}_{\infty} & = & {∥\frac{Ξ_{i}^{'} \bar{Q} {({\bar{Q}}^{'} \bar{Q})}^{- 1} {\bar{Q}}^{'} ε_{i}}{T} - \frac{Ξ_{i}^{'} H {(H^{'} H)}^{- 1} H^{'} ε_{i}}{T}∥}_{\infty} \\ \leq & {∥\frac{1}{T} (Ξ_{i}^{'} \bar{Q} - Ξ_{i}^{'} H) {({\bar{Q}}^{'} \bar{Q})}^{- 1} {\bar{Q}}^{'} ε_{i}∥}_{\infty} \\ + {∥\frac{1}{T} Ξ_{i}^{'} H ({({\bar{Q}}^{'} \bar{Q})}^{- 1} - {(H^{'} H)}^{- 1}) {\bar{Q}}^{'} ε_{i}∥}_{\infty} \\ + {∥\frac{1}{T} Ξ_{i}^{'} H {(H^{'} H)}^{- 1} ({\bar{Q}}^{'} ε_{i} - H^{'} ε_{i})∥}_{\infty}, \end{matrix}

(A48)

then we examine each of the above terms. The first term equals

\begin{matrix} {∥\frac{1}{T} (Ξ_{i}^{'} \bar{Q} - Ξ_{i}^{'} H) {({\bar{Q}}^{'} \bar{Q})}^{- 1} {\bar{Q}}^{'} ε_{i}∥}_{\infty} & \leq & \frac{1}{T} {∥\frac{Ξ_{i}^{'} {\bar{V}}^{*}}{T}∥}_{\infty} {∥{(\frac{{\bar{Q}}^{'} \bar{Q}}{T^{2}})}^{- 1} \frac{{\bar{Q}}^{'} ε_{i}}{T}∥}_{\infty} \\ = & O_{p} (\frac{1}{\sqrt{N} T}), uniformly over i, \end{matrix}

(A49)

by the third part of (A9)–(A11). Next, we have

\begin{matrix} {∥\frac{1}{T} Ξ_{i}^{'} H ({({\bar{Q}}^{'} \bar{Q})}^{- 1} - {(H^{'} H)}^{- 1}) {\bar{Q}}^{'} ε_{i}∥}_{\infty} \\ \leq & \frac{1}{T} {∥\frac{{\bar{V}}^{*^{'}} {\bar{V}}^{*}}{T} + \frac{H^{'} {\bar{V}}^{*}}{T} + \frac{{\bar{V}}^{*^{'}} H}{T}∥}_{\infty} {∥\frac{Ξ_{i}^{'} H}{T^{2}} {(\frac{{\bar{Q}}^{'} \bar{Q}}{T^{2}})}^{- 1}∥}_{\infty} {∥{(\frac{H^{'} H}{T^{2}})}^{- 1} \frac{{\bar{Q}}^{'} ε_{i}}{T}∥}_{\infty} \\ = & O_{p} (\frac{1}{\sqrt{N} T}), uniformly over i, \end{matrix}

(A50)

by (A7), and the results in (A9)–(A12). Finally,

\begin{matrix} {∥\frac{1}{T} Ξ_{i}^{'} H {(H^{'} H)}^{- 1} ({\bar{Q}}^{'} ε_{i} - H^{'} ε_{i})∥}_{\infty} \\ \leq & {∥\frac{Ξ_{i}^{'} H}{T^{2}} {(\frac{H^{'} H}{T^{2}})}^{- 1}∥}_{\infty} {∥\frac{{\bar{V}}^{*'} ε_{i}}{T}∥}_{\infty} = O_{p} (\frac{p_{T}}{\sqrt{N T}}) + O_{p} (\frac{p_{T}}{N}), \end{matrix}

(A51)

by (A8), the second part of (A11) and (A12). Using (A49)–(A51) into (A48), (A17) is proven. □

Notes

1	An alternative approach to deal with cross-sectional dependence is the principle component analysis proposed by Bai (2009).
2	As in Pesaran (2006) and Kapetanios et al. (2011), observed factors, such as time effects, can also be included in model (1). For notational simplicity and illustration purpose, we do not include such factors in the model (1).
3	As Chudik and Pesaran (2015a) point out, the number of lags $p_{T}$ needs to be restricted. Letting $p_{T}^{3} / T \to λ, 0 < λ < \infty$ can ensures that, on the one hand, the number of lags is not too large, so that there are sufficient degrees of freedom for the consistent estimator, and on the other hand, the number of lags is not too small, so that the bias due to the truncation of infinite lag polynomials is sufficiently small
4	We note that $\bar{Q}$ can be denoted as $\bar{Q} \overset{▵}{=} (τ, \tilde{Z}),$ , where $τ = {(1, 1, \dots 1)}^{'}$ is a $(T - p_{T}) \times 1$ vector of ones, $\tilde{Z} is the (T - p_{T}) \times (k + 1) p_{T}$ matrices of observations on ${\bar{z}}_{t}$ for $t = p_{T} + 1, p_{T} + 2, \dots, T .$
5	To illustrate the validity and robustness of the CCE estimator in the case of non-stationary common factors, the data-generating process and parameter settings are similar to the settings in Chudik and Pesaran (2015a), except for unobserved common factors.
6	We also conducted additional Monte Carlo simulations for other settings, such as $p_{T} = [0.75 T^{1 / 3}]$ and $p_{T} = [1.25 T^{1 / 3}]$ ; the corresponding results are slightly worse than that of $p_{T} = [T^{1 / 3}]$ , these results are not reported to save space.

References

Bada, Oualid, and Alois Kneip. 2014. Parameter cascading for panel models with unknown number of unobserved factors: An application to the credit spread puzzle. Computational Statistics and Data Analysis 76: 95–115. [Google Scholar] [CrossRef]
Bada, Oualid, and Dominik Liebl. 2014. The R package phtt: Panel data analysis with heterogeneous time trends. Journal of Statistical Software 59: 1–34. [Google Scholar] [CrossRef]
Bai, Jushan. 2009. Panel data models with interactive fixed effects. Econometrica 77: 1229–79. [Google Scholar]
Bai, Jushan, and Serena Ng. 2004. A panic on unit root tests and cointegration. Econometrica 72: 1127–77. [Google Scholar] [CrossRef]
Bai, Jushan, and Serena Ng. 2010. Panel unit root tests with cross section dependence: A further investigation. Econometric Theory 26: 1088–114. [Google Scholar] [CrossRef]
Bailey, Natalia, George Kapetanios, and M. Hashem Pesaran. 2016. Exponent of cross-sectional dependence: Estimation and inference. Journal of Applied Econometrics 31: 929–1196. [Google Scholar] [CrossRef]
Baltagi, Badi H., and Dong Li. 2004. Prediction in the panel data model with spatial correlation. In Advances in Spatial Econometrics: Methodology, Tools and Applications. Edited by Luc Anselin, Raymond J. G. M. Florax and Sergio J. Rey. Berlin/Heidelberg: Springer, pp. 283–295. [Google Scholar]
Ben-Israel, Adi, and Thomas N. E. Greville. 2003. Generalized Inverses: Theory and Applications, 2nd ed. New York: Springer. [Google Scholar]
Bussiere, Matthieu, Alexander Chudik, and Arnaud Mehl. 2013. How have global shocks impacted the real effective exchange rates of individual euro area countries since the euro’s creation? The B.E. Journal of Macroeconomics 13: 1–48. [Google Scholar] [CrossRef][Green Version]
Chudik, Alexander, and M. Hashem Pesaran. 2013. Econometric analysis of high dimensional VARs featuring a dominant unit. Econometric Reviews 32: 592–649. [Google Scholar] [CrossRef]
Chudik, Alexander, and M. Hashem Pesaran. 2015a. Common correlated effects estimation of heterogeneous dynamic panel data models with weakly exogenous regressors. Journal of Econometrics 188: 393–420. [Google Scholar] [CrossRef]
Chudik, Alexander, and M. Hashem Pesaran. 2015b. Large panel data models with cross-sectional dependence: A survey. In The Oxford Handbook of Panel Data. Edited by Badi H. Baltagi. Oxford: Oxford University Press, pp. 2–45. [Google Scholar]
Chudik, Alexander, Kamiar Mohaddes, M. Hashem Pesaran, and Mehdi Raissi. 2017. Is there a debt-threshold effect on output growth? Review of Economics and Statistics 99: 135–50. [Google Scholar] [CrossRef]
Eberhardt, Markus, Christian Helmers, and Hubert Strauss. 2013. Do spillovers matter when estimating private returns to R&D? Review of Economics and Statistics 95: 436–48. [Google Scholar]
Greenaway-McGrevy, Ryan, Chirok Han, and Donggyu Sul. 2012. Asymptotic distribution of factor augmented estimators for panel regression. Journal of Econometrics 169: 48–53. [Google Scholar] [CrossRef]
Hahn, Jinyong, and Whitney Newey. 2004. Jackknife and analytical bias reduction for nonlinear panel models. Econometrica 72: 1295–319. [Google Scholar] [CrossRef]
Juodis, Artūras, Hande Karabiyik, and Joakim Westerlund. 2021. On the robustness of the pooled CCE estimator. Journal of Econometrics 220: 325–48. [Google Scholar] [CrossRef]
Kao, Chihwa, Lorenzo Trapani, and Giovanni Urga. 2012. Asymptotics for panel models with common shocks. Econometric Reviews 31: 390–439. [Google Scholar] [CrossRef]
Kapetanios, George, M. Hashem Pesaran, and Takashi Yamagata. 2011. Panels with non-stationary multifactor error structures. Journal of Econometrics 160: 326–48. [Google Scholar] [CrossRef]
Moon, Hyungsik Roger, and Martin Weidner. 2015. Linear regression for panel with unknown number of factors as interactive fixed effects. Econometrica 83: 1543–79. [Google Scholar] [CrossRef]
Moon, Hyungsik Roger, and Martin Weidner. 2017. Dynamic linear panel regression models with interactive fixed effects. Econometric Theory 33: 158–95. [Google Scholar] [CrossRef]
Omay, Tolga, and Elif Oznur Kan. 2010. Re-examing the threshold effects in the inflation-growth nexus with cross-sectionally dependent non-linear panel: Evidence from six industrialized economies. Economic Modelling 27: 996–1005. [Google Scholar] [CrossRef]
Pesaran, M. Hashem. 2006. Estimation and inference in large heterogeneous panels with a multifactor error structure. Econometrica 74: 967–1012. [Google Scholar] [CrossRef]
Pesaran, M. Hashem. 2007. A simple panel unit root test in the presence of cross section dependence. Journal of Applied Econometrics 22: 265–312. [Google Scholar] [CrossRef]
Pesaran, M. Hashem. 2015. Testing weak cross-sectional dependence in large panels. Econometric Reviews 34: 1089–117. [Google Scholar] [CrossRef]
Pesaran, M. Hashem, Ron Smith, and Takashi Yamagata. 2013. Panel unit root tests in the presence of multifactor error structure. Journal of Econometrics 175: 94–115. [Google Scholar] [CrossRef]
Westerlund, Joakim, Petrova Yana, and Norkute Milda. 2019. CCE in fixed-T panels. Journal of Applied Econometrics 34: 746–761. [Google Scholar] [CrossRef]
Zaffaroni, Paolo. 2009. Generalized least estimation of panel with common shocks. Unpublished Manuscript. [Google Scholar]
Zhou, Qiankun, and Yonghui Zhang. 2016. Common correlated effects estimation of unbalanced panel data models with cross-sectional dependence. Journal of Economic Theory and Econometrics 27: 25–45. [Google Scholar]

Figure 1. CCE and CCEMG estimations for income (Left) and price (Right), respectively (CCE estimates of individual coefficients are indicated by a cross, CCEMG estimates by the red line, and the

95 %

confidence interval by the upper and lower range and dashed red line).

Figure 1. CCE and CCEMG estimations for income (Left) and price (Right), respectively (CCE estimates of individual coefficients are indicated by a cross, CCEMG estimates by the red line, and the

95 %

confidence interval by the upper and lower range and dashed red line).

Figure 2. Estimated factors (Top) and the factor structure (Bottom).

Table 1. Estimation results for DGP 1.

	Bias					RMSE
Parameter	$(N, T)$	50	100	150	200	50	100	150	200
$ϕ$	CCEMG estimation
	50	−0.1065	−0.0393	−0.0163	−0.0004	0.1131	0.0530	0.0392	0.0371
	100	−0.1085	−0.0392	−0.0156	−0.0004	0.1120	0.0476	0.0311	0.0284
	200	−0.1105	−0.0402	−0.0163	−0.0012	0.1116	0.0450	0.0265	0.0220
	Jackknife bias-corrected CCEMG estimation
	50	−0.0508	−0.0126	−0.0071	−0.0043	0.0747	0.0417	0.0415	0.0440
	100	−0.0411	−0.0124	−0.0036	0.0034	0.0689	0.0324	0.0309	0.0365
	200	−0.0417	−0.0127	−0.0042	−0.0039	0.0664	0.0273	0.0253	0.0309
$β_{0}$	CCEMG estimation
	50	0.0136	0.0071	0.0032	0.0012	0.0461	0.0332	0.0282	0.0275
	100	0.0129	0.0058	0.0029	0.0008	0.0341	0.0232	0.0200	0.0192
	200	0.0119	0.0049	0.0024	0.0003	0.0252	0.0169	0.0150	0.0139
	Jackknife bias-corrected CCEMG estimation
	50	0.0112	0.0043	0.0011	−0.0007	0.0550	0.0361	0.0307	0.0289
	100	0.0098	0.0030	0.0003	−0.0015	0.0397	0.0251	0.0215	0.0206
	200	0.0091	0.0020	0.0000	0.0017	0.0281	0.0180	0.0160	0.0150

Table 2. Estimation results for DGP 2.

	Bias					RMSE
Parameter	$(N, T)$	50	100	150	200	50	100	150	200
$ϕ$	CCEMG estimation
	50	−0.0983	−0.0384	−0.0188	−0.0093	0.1053	0.0513	0.0386	0.0348
	100	−0.1004	−0.0389	−0.0200	−0.0104	0.1040	0.0461	0.0312	0.0259
	200	−0.1015	−0.0395	−0.0193	−0.0102	0.1036	0.0434	0.0271	0.0203
	Jackknife bias-corrected CCEMG estimation
	50	−0.0506	−0.0179	−0.0069	−0.0014	0.0840	0.0410	0.0360	0.0346
	100	−0.0491	−0.0172	−0.0072	0.0015	0.0768	0.0316	0.0262	0.0245
	200	−0.0457	−0.0167	−0.0071	−0.0013	0.0704	0.0257	0.0194	0.0177
$β_{0}$	CCEMG estimation
	50	0.0124	0.0077	0.0048	0.0036	0.0451	0.0334	0.0285	0.0275
	100	0.0122	0.0063	0.0042	0.0033	0.0335	0.0235	0.0202	0.0193
	200	0.0112	0.0056	0.0039	0.0028	0.0248	0.0171	0.0151	0.0138
	Jackknife bias-corrected CCEMG estimation
	50	0.0109	0.0061	0.0037	0.0027	0.0535	0.0362	0.0306	0.0284
	100	0.0104	0.0045	0.0026	0.0020	0.0396	0.0252	0.0212	0.0200
	200	0.0090	0.0036	0.0024	0.0017	0.0279	0.0179	0.0156	0.0145

Table 3. Estimation results for DGP 3.

	Bias					RMSE
Parameter	$(N, T)$	50	100	150	200	50	100	150	200
$ϕ$	CCEMG estimation
	50	−0.0649	−0.0330	−0.0154	−0.0076	0.0733	0.0475	0.0363	0.0342
	100	−0.0760	−0.0370	−0.0159	−0.0119	0.0801	0.0440	0.0313	0.0259
	200	−0.0789	−0.0378	−0.0179	−0.0127	0.0814	0.0434	0.0284	0.0222
	Jackknife bias−corrected CCEMG estimation
	50	−0.0195	−0.0073	0.0041	0.0010	0.0425	0.0368	0.0340	0.0335
	100	−0.0246	−0.0098	−0.0030	0.0001	0.0410	0.0272	0.0252	0.0235
	200	−0.0309	−0.0091	−0.0059	−0.0026	0.0395	0.0224	0.0182	0.0171
$β_{0}$	CCEMG estimation
	50	0.0094	0.0063	0.0043	0.0045	0.0412	0.0329	0.0299	0.0276
	100	0.0092	0.0060	0.0045	0.0039	0.0312	0.0237	0.0205	0.0198
	200	0.0092	0.0062	0.0043	0.0038	0.0224	0.0174	0.0148	0.0141
	Jackknife bias-corrected CCEMG estimation
	50	0.0069	0.0045	0.0032	0.0039	0.0441	0.0341	0.0307	0.0284
	100	0.0068	0.0041	0.0035	0.0027	0.0330	0.0243	0.0212	0.0202
	200	0.0058	0.0040	0.0029	0.0021	0.0230	0.0175	0.0148	0.0143

Table 4. Exponent of the cross-sectional dependence of variables.

Variable	$CD$	$α$	$α_{0.025}$	$α_{0.975}$
consumption	101.519	0.975	0.887	1.064
income	166.270	1.004	−0.635	2.644
price	154.142	1.004	0.620	1.389

Table 5. Estimation results (Jackknife bias-corrected CCEMG).

Variable	coef.	Std.Err.	p-Value	${CI}_{0.025}$	${CI}_{0.975}$
$L .$ consumption	0.368	0.091	0.000	0.190	0.545
income	0.936	0.387	0.016	0.177	1.695
price	−0.629	0.115	0.000	−0.854	−0.404

Note: L. consumption denotes the first lag of cigarette consumption.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Cao, S.; Zhou, Q. Common Correlated Effects Estimation for Dynamic Heterogeneous Panels with Non-Stationary Multi-Factor Error Structures. Econometrics 2022, 10, 29. https://doi.org/10.3390/econometrics10030029

AMA Style

Cao S, Zhou Q. Common Correlated Effects Estimation for Dynamic Heterogeneous Panels with Non-Stationary Multi-Factor Error Structures. Econometrics. 2022; 10(3):29. https://doi.org/10.3390/econometrics10030029

Chicago/Turabian Style

Cao, Shiyun, and Qiankun Zhou. 2022. "Common Correlated Effects Estimation for Dynamic Heterogeneous Panels with Non-Stationary Multi-Factor Error Structures" Econometrics 10, no. 3: 29. https://doi.org/10.3390/econometrics10030029

APA Style

Cao, S., & Zhou, Q. (2022). Common Correlated Effects Estimation for Dynamic Heterogeneous Panels with Non-Stationary Multi-Factor Error Structures. Econometrics, 10(3), 29. https://doi.org/10.3390/econometrics10030029

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Common Correlated Effects Estimation for Dynamic Heterogeneous Panels with Non-Stationary Multi-Factor Error Structures

Abstract

1. Introduction

2. Dynamic Panel Data Model with Non-Stationary Unobserved Common Factors

2.1. The Model

2.2. CCE Estimation

3. Asymptotics of CCE Estimators with Non-Stationary Factors

3.1. Assumptions

3.2. Asymptotics

4. Monte Carlo Simulation

5. Empirical Study

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Useful Lemmas and Theoretical Derivations of Theorems

Appendix A.1. Useful Lemmas

Appendix A.2. Theoretical Derivation of the Asymptotics of the CCE Estimators

Appendix A.3. Proofs of Lemmas

Notes

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI