Testing Cross-Sectional Correlation in Large Panel Data Models with Serial Correlation

Baltagi, Badi H.; Kao, Chihwa; Peng, Bin

doi:10.3390/econometrics4040044


by Badi H. Baltagi ¹, Chihwa Kao ² and Bin Peng ³,*

¹ Department of Economics & Center for Policy Research, 426 Eggers Hall, Syracuse University, Syracuse, NY 13244-1020, USA

² Department of Economics, 365 Fairfield Way, U-1063, University of Connecticut, Storrs, CT 06269-1063, USA

³ Department of Finance, 523 School of Economics, Huazhong University of Science and Technology, Wuhan 430074, China

* Author to whom correspondence should be addressed.

Econometrics 2016, 4(4), 44; https://doi.org/10.3390/econometrics4040044

Submission received: 23 July 2016 / Revised: 12 October 2016 / Accepted: 19 October 2016 / Published: 4 November 2016

(This article belongs to the Special Issue Recent Developments in Panel Data Methods)


Abstract

This paper considers the problem of testing cross-sectional correlation in large panel data models with serially-correlated errors. It finds that existing tests for cross-sectional correlation encounter size distortions with serial correlation in the errors. To control the size, this paper proposes a modification of Pesaran’s Cross-sectional Dependence (CD) test to account for serial correlation of an unknown form in the error term. We derive the limiting distribution of this test as

(N, T) \to \infty

. The test is distribution free and allows for unknown forms of serial correlation in the errors. Monte Carlo simulations show that the test has good size and power for large panels when serial correlation in the errors is present.

Keywords:

cross-sectional correlation test; serial correlation; large panel data model

JEL Classification:

C13; C33

1. Introduction

This paper studies testing for cross-sectional correlation in panel data when serial correlation is also present in the disturbances. It does so for the case of strictly exogenous regressors^1. Cross-sectional correlation could be due to unknown common shocks, spatial effects or interactions within social networks. Ignoring cross-sectional correlation in panels can have serious consequences. In time series with serial correlation, existing cross-sectional correlation leads to efficiency loss for least squares and invalidates inference. In some cases, it results in inconsistent estimation; see Lee [1] and Andrews [2]. Testing the cross-sectional correlation of panel residuals is therefore important.

One could test for a specific form of correlation in the error like spatial correlation; see Anselin and Bera [3] for cross-sectional data and Baltagi et al. [4] for panel data, to mention a few. Alternatively, one could test for correlation without imposing any structure on the form of correlation among the disturbances. The null hypothesis, in that case, is testing the diagonality of the covariance or correlation matrix of the N-dimensional disturbance vector

u_{t} = {(u_{1 t}, \dots, u_{N t})}^{'},

which is usually assumed to be independent over time, for

t = 1, \dots, T

. When N is fixed and T is large, the traditional multivariate statistics techniques, including log-likelihood ratio and Lagrange multiplier tests, are applicable; see, for example, Breusch and Pagan [5], who propose a Lagrange Multiplier (LM) test, which is based on the average of the squared pair-wise correlation coefficients of the least squares residuals.

However, as N becomes large because of the growing availability of comprehensive databases in macro and finance, this so-called “high dimensional” phenomenon brings challenges to classical statistical inference. As shown in the Random Matrix Theory (RMT) literature, the sample covariance and correlation matrices are ill-conditioned since their eigenvectors are not consistent with their population counterparts; see Johnstone [6] and Jiang [7]. New approaches have been considered in the statistics literature for testing the diagonality of the sample covariance or correlation matrices; see Ledoit and Wolf [8], Schott [9] and Chen et al. [10], to mention a few.

The above tests for raw data cannot be used directly to test cross-sectional correlation in panel data regressions since the disturbances are not observable. Noise caused by substituting residuals for the actual disturbances may accumulate due to large dimensions, and this in turn may lead to biased inference. The bias for cross-sectional correlation tests in large panels depends on the model specification, the estimation method and the sample sizes N and T, among other things. For example, Pesaran et al. [11] consider an LM test and correct its bias in a large heterogeneous panel data model; Baltagi et al. [12] extend Schott’s test [9] to a fixed effects panel data model and correct the bias caused by estimating the disturbances with fixed effects residuals in a homogeneous panel data model. Following Ledoit and Wolf [8], Baltagi et al. [13] propose a bias-adjusted test for testing the null of sphericity in the fixed effects homogeneous panel data model. However, this method does not test cross-sectional correlation directly. Rejection of the null could be due to cross-sectional correlation or heteroscedasticity or both. A general test for cross-sectional correlation was proposed by Pesaran [14]. His test statistic is based on the average of pair-wise correlation coefficients, defined as CD

_{P}

(CD, Cross-sectional Dependence). The test is exactly centered at zero under the null and does not need bias correction. Pesaran [15] extends his test statistic to test the null of weak cross-sectional correlation and derives its asymptotic distribution using joint limits. This test is robust to many model specifications and has many applications. Recent surveys for cross-sectional correlation or dependence tests in large panels are provided by Moscone and Tosetti [16], Sarafidis and Wansbeek [17] and Chudik and Pesaran [18].

The asymptotics and bias-correction of existing tests for cross-sectional correlation in large panels are carried out under some, albeit restrictive, assumptions. For instance, the errors are normally distributed;

N / T \to c \in (0, \infty)

as

(N, T) \to \infty

, and so on. One fundamental restriction is that the errors are independent over time. In fact, the presence of serial correlation in panel data applications is likely to be the rule rather than the exception, especially for macro applications and when T is large. Ignoring serial correlation does not affect the consistency of estimates, but it leads to incorrect inference. In RMT, when

u_{1}, u_{2}, \dots, u_{T}

are independent across

t = 1, \dots, T,

and N is large, the Limiting Spectral Distribution (LSD) of the corresponding sample covariance matrix is the Marchenko-Pastur (M-P) law; see Bai and Silverstein [19]. Existing correlation among these disturbances may cause a deviation of the LSD from the M-P law. Indeed, Bai and Zhou [20] show that the LSD of the sample covariance matrix with correlations in columns is different from the M-P law. Gao et al. [21] show similar results for the sample correlation matrix. Therefore, the cross-sectional correlation tests, which heavily depend on the assumption of independence over time, could lead to misleading inference if there is a serial correlation in the disturbances.

To better understand the effects of potential serial correlation on the existing tests of cross-sectional correlation, let us assume that the

T \times 1

independent random vectors

u_{i} = {(u_{i 1}, \dots, u_{i T})}^{'},

for

i = 1, \dots, N

are observable. The correlation coefficients

ρ_{i j}

of any

u_{i}

and

u_{j}

(i \neq j)

are defined by

u_{i}^{'} u_{j} / (∥u_{i}∥ \cdot ∥u_{j}∥)

. Their means are zero vectors. If all of the elements of each

u_{i}

are independent and identically spherically distributed, Muirhead [22] shows that

E (ρ_{i j}^{2}) = 1 / T .

When N is fixed, the summation of all distinct

N (N - 1) / 2

terms of

ρ_{i j}^{2}

will be small, as

T \to \infty .

In Section 3, we show that if all of the elements of each

u_{i}

follow a multiple Moving Average model of order one (MA(1)) with parameter

θ,

then

E (ρ_{i j}^{2}) = [1 / T + θ^{2} / (T + T θ^{2})] .

As

N \to \infty

, the extra term

θ^{2} / (T + T θ^{2})

can accumulate and lead to extra bias for the existing LM type tests in panels. Although CD

_{P}

is centered at zero, it may still encounter size distortions because serial correlation is ignored.

This paper proposes a modification of Pesaran’s CD test of cross-sectional correlation when the error terms are serially correlated in large panel data models. First, using results from RMT, we study the first two moments of the test statistic and propose an unbiased and consistent estimate of the variance with unknown serial correlation under the null. Second, we derive the limiting distribution of the test under the asymptotic framework with

(N, T) \to \infty

simultaneously in any order without any distribution assumption. We also discuss its local power properties under a multi-factor alternative. Monte Carlo simulations are conducted to study the performance of our test statistic in finite samples. The results confirm our theoretical findings.

The plan for the paper is as follows. The next section introduces the model and notation, existing LM type tests and the Cross-sectional Dependence (CD) test. It then presents our assumptions and the proposed modified Pesaran’s CD test statistic. Section 3 derives the asymptotics of this test statistic. Section 4 reports the results of the Monte Carlo experiments. Section 5 provides some concluding remarks. All of the mathematical proofs are provided in the Appendix.

Throughout the paper, we adopt the following notation. For a square matrix B,

tr (B)

is the trace of B;

∥B∥ = {(tr (B^{'} B))}^{1 / 2}

denotes the Frobenius norm of a matrix or the Euclidean norm of a vector B;

\overset{d}{⟶}

denotes convergence in distribution; and

\overset{p}{⟶}

denotes convergence in probability. We use

(N, T) \to \infty

to denote the joint convergence of N and T when N and T pass to infinity simultaneously. K is a generic positive number that depends on neither N nor T.

2. Model and Tests

Consider the following heterogeneous panel data model

y_{i t} = β_{i}^{'} x_{i t} + u_{i t}, for i = 1, \dots, N; t = 1, \dots, T,

(1)

where i and t index the cross-section dimension and time dimension, respectively;

y_{i t}

is the dependent variable, and

x_{i t}

is a

k \times 1

vector of exogenous regressors. The individual coefficients

β_{i}

are defined on a compact set and allowed to vary across

i .

The null hypothesis of no cross-sectional correlation is

H_{0} : cov (u_{i t}, u_{j t}) = 0, for all t, i \neq j,

or equivalently

H_{0} : ρ_{i j} = 0, for i \neq j,

(2)

where

ρ_{i j}

is the pair-wise correlation coefficient of the disturbances defined by

ρ_{i j} = \frac{\sum_{t = 1}^{T} u_{i t} u_{j t}}{{(\sum_{t = 1}^{T} u_{i t}^{2})}^{1 / 2} {(\sum_{t = 1}^{T} u_{j t}^{2})}^{1 / 2}} .

Under the alternative, there exists at least one

ρ_{i j} \neq 0,

for some

i \neq j .

For the panel regression model (1), the disturbances are unobservable. In this case, the test statistic is based on the residual-based correlation coefficients

{\hat{ρ}}_{i j} .

Specifically,

{\hat{ρ}}_{i j} = \frac{\sum_{t = 1}^{T} e_{i t} e_{j t}}{{(\sum_{t = 1}^{T} e_{i t}^{2})}^{1 / 2} {(\sum_{t = 1}^{T} e_{j t}^{2})}^{1 / 2}},

(3)

where

e_{i t}

denotes the Ordinary Least Squares (OLS) residuals obtained using the T observations for each

i = 1, \dots, N

. These OLS residuals are given by

e_{i t} = y_{i t} - x_{i t}^{'} {\hat{β}}_{i},

(4)

with

{\hat{β}}_{i}

being the OLS estimates of

β_{i}

from (1) for

i = 1, \dots, N .

Let

M_{i} = I_{T} - P_{X_{i}},

where

P_{X_{i}} = X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'},

and

X_{i}

is a

T \times k

matrix of regressors with the t-th row being the

1 \times k

vector

x_{i t}^{'} .

We also define

u_{i} = {(u_{i 1}, \dots, u_{i T})}^{'}

,

e_{i} = {(e_{i 1}, \dots, e_{i T})}^{'}

and

v_{i} = e_{i} / ∥e_{i}∥,

for

i = 1, \dots, N .

The OLS residuals can be rewritten in vector form as

e_{i} = M_{i} u_{i},

and the residual-based pair-wise correlation coefficients can be rewritten as

{\hat{ρ}}_{i j} = v_{i}^{'} v_{j}

, for any

1 \leq i \neq j \leq N

.
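The construction above can be sketched numerically as follows. This is a minimal illustration, assuming the data are stored as a T x N array `y` and an N x T x k array `X`; the function names `unit_residuals` and `residual_correlations` are hypothetical and are not part of the paper.

```python
import numpy as np

def unit_residuals(y, X):
    """Return the T x N matrix whose i-th column is v_i = e_i / ||e_i||.

    y : array of shape (T, N), one column per cross-section unit.
    X : array of shape (N, T, k); X[i] holds the regressors of unit i.
    (This storage layout is an assumption of the sketch.)
    """
    T, N = y.shape
    V = np.empty((T, N))
    for i in range(N):
        # Unit-by-unit OLS, so e_i = M_i y_i = M_i u_i as in the text.
        beta_hat, *_ = np.linalg.lstsq(X[i], y[:, i], rcond=None)
        e = y[:, i] - X[i] @ beta_hat
        V[:, i] = e / np.linalg.norm(e)
    return V

def residual_correlations(V):
    """N x N matrix of rho_hat_ij = v_i' v_j."""
    return V.T @ V
```

Working with the normalized vectors keeps every later statistic a function of inner products of the columns of `V`.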

2.1. LM and CD Tests

For N fixed and

T \to \infty,

Breusch and Pagan [5] propose an LM test to test the null of no cross-sectional correlation in (2) without imposing any structure on this correlation. It is given by

L M_{B P} = T \sum_{i = 1}^{N - 1} \sum_{j = i + 1}^{N} {\hat{ρ}}_{i j}^{2} .

(5)

LM

_{B P}

is asymptotically distributed as a Chi-squared distribution with

N (N - 1) / 2

degrees of freedom under the null. However, for a typical micro-panel dataset, N is larger than T, and the Breusch-Pagan LM test statistic is not valid under this “large N, small T” setup. For this setting, Pesaran [14] proposes a scaled version of this LM test as follows

L M_{P} = \sqrt{\frac{1}{N (N - 1)}} \sum_{i = 1}^{N - 1} \sum_{j = i + 1}^{N} (T {\hat{ρ}}_{i j}^{2} - 1) .

(6)

Pesaran [14] shows that

L M_{P}

is distributed as

N (0, 1)

with

T \to \infty

first, then

N \to \infty

under the null. However,

E (T {\hat{ρ}}_{i j}^{2} - 1)

is not correctly centered at zero with fixed T and large N. Hence, Pesaran et al. [11] propose a bias-adjusted version of this LM test, denoted by LM

_{P U Y} .

They show that the exact mean and variance of

(T - k) {\hat{ρ}}_{i j}^{2}

are given by

μ_{T i j} = E [(T - k) {\hat{ρ}}_{i j}^{2}] = \frac{1}{T - k} tr [E (M_{i} M_{j})],

(7)

and

ν_{T i j}^{2} = var [(T - k) {\hat{ρ}}_{i j}^{2}] = \{{tr}^{2} [E (M_{i} M_{j})]\} a_{1 T} + 2 tr \{{[E (M_{i} M_{j})]}^{2}\} a_{2 T},

(8)

where

a_{1 T} = a_{2 T} - \frac{1}{{(T - k)}^{2}},

and

a_{2 T} = 3 {[\frac{(T - k - 8) (T - k + 2) + 24}{(T - k + 2) (T - k - 2) (T - k - 4)}]}^{2} .

L M_{P U Y}

is given by

L M_{P U Y} = \sqrt{\frac{2}{N (N - 1)}} \sum_{i = 1}^{N - 1} \sum_{j = i + 1}^{N} \frac{(T - k) {\hat{ρ}}_{i j}^{2} - μ_{T i j}}{ν_{T i j}} .

(9)

Pesaran et al. [11] show that

L M_{P U Y}

is asymptotically distributed as

N (0, 1)

under the null (2) and the normality assumption of the disturbances as

T \to \infty

followed by

N \to \infty .

Alternatively, Pesaran [14] proposes a test based on the average of pair-wise correlation coefficients rather than their squares. The test statistic is given by

C D_{P} = \sqrt{\frac{2 T}{N (N - 1)}} \sum_{i = 1}^{N - 1} \sum_{j = i + 1}^{N} {\hat{ρ}}_{i j} .

(10)

Pesaran [15] shows that this test is asymptotically distributed as

N (0, 1)

with

(N, T) \to \infty

. He also extends this to test the null of weak cross-sectional correlation.
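As a rough illustration of Equations (5), (6) and (10), the sketch below computes the three statistics from a matrix of residual correlations. The function name `lm_and_cd_statistics` and its arguments are assumptions made for this example only.

```python
import numpy as np

def lm_and_cd_statistics(R, T):
    """Compute LM_BP (5), LM_P (6) and CD_P (10).

    R : N x N matrix of residual correlations rho_hat_ij.
    T : number of time periods used to compute the correlations.
    """
    N = R.shape[0]
    iu = np.triu_indices(N, k=1)        # the N(N-1)/2 distinct pairs i < j
    rho = R[iu]
    lm_bp = T * np.sum(rho**2)
    lm_p = np.sqrt(1.0 / (N * (N - 1))) * np.sum(T * rho**2 - 1.0)
    cd_p = np.sqrt(2.0 * T / (N * (N - 1))) * np.sum(rho)
    return lm_bp, lm_p, cd_p
```

The LM statistics use squared correlations, so positive and negative correlations both add up, whereas in CD_P correlations of opposite sign can offset each other.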

2.2. Assumptions and the Modified CD Test Statistic

So far, all of the methods surveyed above for testing cross-sectional correlation in panel data models assume that the disturbances are independent over time. Ignoring serial correlation usually results in efficiency loss and biased inference. In fact, we show in Section 3 that the existence of serial correlation leads to extra bias in the LM-type tests. For the CD

_{P}

test in (10), it is still centered at zero with serial correlation, but its variance is affected by serial correlation. As a result, we also expect size distortions in CD

_{P}

. To correct for this, we consider a modification of this test statistic that accounts for an unknown form of serial correlation in the disturbances. First, we introduce the assumptions needed:

Assumption 1.

Define

ξ_{i} = {(ξ_{i 0}, ξ_{i 1}, \dots, ξ_{i T})}^{'}

and

ε_{i} = {(ε_{i 0}, ε_{i 1}, \dots, ε_{i T})}^{'} .

We also assume that

ξ_{i} = σ_{i} ε_{i}

, for

i = 1, \dots, N,

where

ε_{i}

is a random vector with mean vector zero and covariance matrix

I_{T} .

Let

ε_{i t}

denote the t-th entry of

ε_{i},

for any

i = 1, \dots, N .

ε_{i t}

has a uniformly bounded fourth moment, and there exists a finite constant Δ, such that

E (ε_{i t}^{4}) = 3 + Δ .

Following Bai and Zhou [20], the disturbances

u_{t} = {(u_{1 t}, u_{2 t}, \dots, u_{N t})}^{'}

are generated by

u_{t} = \sum_{s = 0}^{\infty} d_{s} ξ_{t - s}, for t = 1, \dots, T,

(11)

where

ξ_{t} = {(ξ_{1 t}, ξ_{2 t}, \dots, ξ_{N t})}^{'},

for

t = 0, 1, \dots, T,

are IID random vectors across time, and

{\{d_{s}\}}_{s = 0}^{\infty}

is a sequence of numbers satisfying

\sum_{s = 0}^{\infty} |d_{s}| < K < \infty .

Assumption 1 allows the error term

u_{i t}

to be correlated over time. The condition

\sum_{s = 0}^{\infty} |d_{s}| < K < \infty

excludes long memory-type strong dependence. We need bounded moment conditions to ensure large

(N, T)

asymptotics for panel data models with serial correlation. The conditions in Assumption 1 are fairly weak; they are satisfied by many parametric weak dependence processes, such as stationary and invertible finite-order Auto-Regressive and Moving Average (ARMA) models. Under Assumption 1, the covariance matrix of each

u_{i}

is

Σ_{i} = σ_{i}^{2} Σ,

where Σ is a

T \times T

symmetric positive definite matrix. The random vector

u_{i}

can be written as

u_{i} = σ_{i} Γ ε_{i},

where

Γ Γ^{'} = Σ .

The generic covariance matrix

Σ_{i}

of each

u_{i}

captures the serial correlation. Bai and Zhou [20] use this representation and show that

T^{- 1} tr (Σ^{κ})

is bounded for any fixed positive integer κ. More specifically, consider a multiple Moving Average model of order one (MA(1))

u_{t} = ξ_{t} + θ ξ_{t - 1}, t = 1, \dots, T,

(12)

where

|θ| < 1

and

u_{t}

,

ξ_{t}, u_{i}

and

ξ_{i}

are defined in Assumption 1. For this case,

Σ^{MA} = {(δ_{l r})}_{T \times T}

, where

δ_{l r} = \begin{cases} 1 + θ^{2}, & l = r; \\ θ, & |l - r| = 1; \\ 0, & |l - r| > 1 . \end{cases}

(13)

One can also verify that for (11), we have the following generic representation,

Σ = {(ϖ_{l r})}_{T \times T}, where ϖ_{l r} = \sum_{s = 0}^{\infty} d_{s} d_{(| l - r | + s)} .

(14)

We use this representation throughout the paper for convenience.
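To make representation (14) concrete, the following sketch builds Σ from a finite list of weights. It assumes the infinite sum is truncated at the supplied weights, which is exact for MA processes and only approximate for AR-type processes; the helper name `sigma_from_weights` is hypothetical.

```python
import numpy as np

def sigma_from_weights(d, T):
    """T x T matrix with entries sum_s d_s d_{|l-r|+s}, as in (14).

    d : finite (or truncated) sequence of weights d_0, d_1, ...
    """
    d = np.asarray(d, dtype=float)
    # acov[h] = sum_s d_s d_{s+h}; it is zero once h reaches len(d).
    acov = np.zeros(T)
    for h in range(min(T, d.size)):
        acov[h] = np.dot(d[: d.size - h], d[h:])
    lags = np.abs(np.subtract.outer(np.arange(T), np.arange(T)))
    return acov[lags]

# MA(1) with theta = 0.8: weights (1, theta) give the tridiagonal matrix in (13).
print(sigma_from_weights([1.0, 0.8], 4))
```

For an AR(1) process with coefficient ρ, passing `rho ** np.arange(m)` for a moderately large `m` gives a close approximation, since the weights decay geometrically.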

Assumption 2.

The regressors,

x_{i t}

, are strictly exogenous, such that

E (u_{i t} | X_{i}) = 0, for all i = 1, \dots, N and t = 1, \dots, T,

(15)

and

X_{i}^{'} X_{i}

is a positive definite matrix.

Assumption 3.

T > k

and the OLS residuals,

e_{i t},

defined by (4), are not all zeros with probability approaching one.

Assumptions 2 and 3 are standard for model (1); see Pesaran [14] and Pesaran et al. [11]. We impose the assumption that the regressors are strictly exogenous. We do not impose any restrictions on the distribution of the errors or the relative convergence speed of

(N, T) .

This framework is fairly general, whereas LM-type tests usually impose the normality assumption and restrictions on the relative speed of N and T, namely

N / T \to c \in (0, \infty) .

Under these assumptions, the OLS estimates for model (1) are consistent, but inefficient. We focus on the term used in Pesaran’s CD test [14]

T_{n} = {(\frac{2}{N (N - 1)})}^{1 / 2} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} {\hat{ρ}}_{i j} .

(16)

In the next section, we derive the first two moments of this test statistic and later derive its limiting distribution under this general unknown form of serial correlation over time.

3. Asymptotics

3.1. Asymptotic Distribution under the Null

In this section, we study the asymptotics of the test statistic

T_{n}

defined in (16). To derive its limiting distribution, we first consider its first two moments.

Theorem 1.

Under Assumptions 1–3 and the null given in (2),

E (T_{n}) = 0

(17)

and

γ^{2} = var (T_{n}) = \frac{2}{N (N - 1)} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} E ({\hat{ρ}}_{i j}^{2}) = \frac{2}{N (N - 1)} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{tr (M_{j} Σ M_{j} M_{i} Σ M_{i})}{tr (M_{i} Σ) tr (M_{j} Σ)},

(18)

where

M_{i} = I_{T} - X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'},

and Σ is defined by (14).

Theorem 1 shows that the mean of the test statistic is zero. Its variance depends on

Σ,

which is a generic form containing serial correlation.

In fact, as shown in the proof of Theorem 1 (see Appendix B),

E ({\hat{ρ}}_{i j}^{2}) = tr (M_{j} Σ M_{j} M_{i} Σ M_{i}) / [tr (M_{i} Σ) tr (M_{j} Σ)]

. In the special case where the error terms are independent over time,

Σ = I_{T},

and

E ({\hat{ρ}}_{i j}^{2})

reduces to

tr (M_{j} M_{i}) / {(T - k)}^{2},

which yields the results given in Equation (7) for the LM

_{P U Y}

test statistic with no serial correlation. However, with serial correlation in the errors, an extra bias term is introduced in LM

_{P U Y}

since

\frac{tr (M_{j} Σ M_{j} M_{i} Σ M_{i})}{tr (M_{i} Σ) tr (M_{j} Σ)} - \frac{tr (M_{j} M_{i})}{{(T - k)}^{2}} \neq 0, if Σ \neq I_{T} .

More specifically, let us assume that

u_{i},

i = 1, \dots, N

, are observable; then

E (ρ_{i j}^{2}) = tr (Σ^{2}) / {tr}^{2} (Σ) .

For the MA(1) process defined by (12),

tr (Σ^{2}) / {tr}^{2} (Σ) = 1 / T + θ^{2} / (T + T θ^{2}),

which reduces to

tr (Σ^{2}) / {tr}^{2} (Σ) = 1 / T

for

θ = 0 .

The extra term

θ^{2} / (T + T θ^{2})

accumulates in the LM test statistic and leads to extra bias as

N \to \infty

. As discussed above, we expect

LM_{P U Y}

to have serious size distortions when serial correlation is present in the disturbances.
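The effect of serial correlation on the second moment of the correlation coefficient can be checked numerically. The sketch below is only an illustration under the assumption that the disturbances are observable: it builds the MA(1) matrix in (13) and evaluates the trace ratio directly from that matrix rather than from a closed-form expression. The function names are hypothetical.

```python
import numpy as np

def ma1_sigma(theta, T):
    """Sigma^MA from (13): 1 + theta^2 on the diagonal, theta next to it."""
    S = (1.0 + theta**2) * np.eye(T)
    idx = np.arange(T - 1)
    S[idx, idx + 1] = theta
    S[idx + 1, idx] = theta
    return S

def expected_rho_squared(S):
    """tr(Sigma^2) / tr^2(Sigma), i.e. E(rho_ij^2) for observable u_i."""
    return np.trace(S @ S) / np.trace(S) ** 2

T = 50
for theta in (0.0, 0.4, 0.8):
    ratio = expected_rho_squared(ma1_sigma(theta, T))
    # T * E(rho^2) equals one without serial correlation and exceeds one otherwise.
    print(f"theta={theta:.1f}  T*E(rho^2)={T * ratio:.4f}")
```

Because an LM-type statistic centers each term at the value implied by θ = 0, any excess of the scaled ratio over one is added up across all N(N - 1)/2 pairs.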

Unlike LM-type tests, the test statistic

T_{n}

is centered at zero; it does not need bias adjustment. Note that if

u_{i t}

are independent over time, our model reduces to that of Pesaran [14]. Let

γ_{0}^{2}

be the variance of

T_{n}

without serial correlation; it can be written as

γ_{0}^{2} = \frac{2}{N (N - 1)} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} [\frac{T - 2 k}{{(T - k)}^{2}} + \frac{tr (P_{X_{i}} P_{X_{j}})}{{(T - k)}^{2}}],

(19)

where

P_{X_{i}} = X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'}

and

P_{X_{j}} = X_{j} {(X_{j}^{'} X_{j})}^{- 1} X_{j}^{'}

. The above result is the exact variance for

T_{n}

without serial correlation; it is derived by Pesaran [15]. A modified version of CD

_{P}

is also given by Pesaran [15] using this exact variance. From Theorem 1,

γ^{2}

is different from

γ_{0}^{2}

if

Σ \neq I_{T} .

Hence, we also expect CD

_{P}

to have size distortions when serial correlation is present in the disturbances. Next, we consider the limiting distribution of the proposed test. The result is given in the following theorem.

Theorem 2.

Under Assumptions 1–3 and the null in (2), as

(N, T) \to \infty,

we have

γ^{- 1} T_{n} \overset{d}{⟶} N (0, 1) .

(20)

Theorem 2 shows that appropriately standardized

γ^{- 1} T_{n}

is asymptotically distributed as a standard normal. It is valid for N and T tending to infinity jointly in any order. However, we do not observe Σ in a panel data regression model; and an estimate of the variance

γ^{2}

is needed for practical applications. Following Chen and Qin [23], an unbiased and consistent estimator of

γ^{2}

under the null is obtained using the cross-validation approach proposed in the following theorem:

Theorem 3.

Let

{\hat{γ}}^{2} = \frac{2}{N (N - 1)} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} v_{i}^{'} (v_{j} - {\bar{v}}_{(i, j)}) v_{j}^{'} (v_{i} - {\bar{v}}_{(i, j)}),

where

{\bar{v}}_{(i, j)} = \frac{1}{N - 2} \sum_{1 \leq τ \neq i, j \leq N} v_{τ}

. Under Assumptions 1–3 and the null in (2),

E ({\hat{γ}}^{2}) = γ^{2} .

As

(N, T) \to \infty,

{\hat{γ}}^{2} \overset{p}{⟶} γ^{2} .

(21)

Define

C D_{R} = {\hat{γ}}^{- 1} T_{n} .

As

(N, T) \to \infty,

C D_{R} \overset{d}{⟶} N (0, 1) .

(22)

Theorem 3 shows that

{\hat{γ}}^{2}

is a good approximation for the variance, and we do not need to specify the structure of Σ. In other words, the test statistic allows the error terms of model (1) to be dependent over time. Furthermore, CD

_{R}

is a modified version of CD

_{P}

, so they are likely to perform very similarly with respect to many model specifications (see Pesaran [14]).
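A minimal sketch of the computation behind Theorem 3 is given below. It assumes the unit-length residual vectors are stored as the columns of a T x N array, and it normalizes the variance estimate by 2/(N(N - 1)) so that it matches the variance expression in (18). The function name `cd_r` is hypothetical, and the double loop is written for readability rather than speed.

```python
import numpy as np

def cd_r(V):
    """Modified CD statistic from unit-length residual vectors.

    V : array of shape (T, N); column i is v_i = e_i / ||e_i||.
    Returns (T_n, gamma2_hat, CD_R).
    """
    T, N = V.shape
    R = V.T @ V                          # rho_hat_ij = v_i' v_j
    iu = np.triu_indices(N, k=1)
    scale = 2.0 / (N * (N - 1))
    t_n = np.sqrt(scale) * R[iu].sum()   # statistic in (16)

    total = V.sum(axis=1)                # sum of all v_tau
    acc = 0.0
    for i in range(1, N):
        for j in range(i):
            # Leave-two-out mean of the remaining N - 2 unit vectors.
            v_bar = (total - V[:, i] - V[:, j]) / (N - 2)
            acc += (V[:, i] @ (V[:, j] - v_bar)) * (V[:, j] @ (V[:, i] - v_bar))
    gamma2_hat = scale * acc
    return t_n, gamma2_hat, t_n / np.sqrt(gamma2_hat)
```

In very small samples the estimate `gamma2_hat` need not be positive, in which case the square root is undefined; a practical implementation would need to guard against that case.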

3.2. Local Power Properties

We now consider the power analysis of the test. Naturally, the power properties depend on the specifications of the alternatives. One general alternative specification that allows for global cross-sectional correlation in panels is the unobserved multi-factor model. Under this alternative, the new error terms are defined by

u_{i}^{'} = u_{i} + σ_{i} F λ_{i} = σ_{i} (Γ ε_{i} + F λ_{i}),

(23)

where

F = {(f_{1}, f_{2}, \dots, f_{T})}^{'}

denotes the

T \times r

common factor matrix and

λ_{i}

is the r \times 1 factor loading vector. Under the null hypothesis,

λ_{i} = 0

, for all i. We now consider the following Pitman-type local alternative^2

H_{a} : λ_{i} = \frac{1}{T^{1 / 4} N^{1 / 2}} δ_{i}, for some i,

(24)

where

δ_{i}

is a non-random and non-zero

r \times 1

vector, which does not depend on N or T. To simplify the analysis, we add the following assumption:

Assumption 4.

(1)

f_{t} \sim I I D (0, I_{r})

; (2)

f_{t}

are independent of

ε_{i t}

,

x_{i t}

, for all i and t; (3) for each i,

T^{- 1 / 2} \sum_{t = 1}^{T} u_{i t} f_{t} = O_{p} (1)

;

T^{- 1 / 2} \sum_{t = 1}^{T} x_{i t} f_{t}^{'} = O_{p} (1)

and

T^{- 1} \sum_{t = 1}^{T} f_{t} f_{t}^{'} = I_{r} + O_{p} (T^{- 1 / 2})

; (4)

T^{- 1 / 2} X_{i}^{'} X_{j} = O_{p} (1)

and

T^{- 1 / 2} X_{i}^{'} u_{i} = O_{p} (1)

, for all i and j.

The following theorem gives the power properties under the local alternative (24).

Theorem 4.

Under Assumptions 1–4 and local alternative (24), as

(N, T) \to \infty

,

γ^{- 1} T_{n} \overset{d}{⟶} N (ψ, 1),

(25)

where

ψ = plim_{(N, T) \to \infty} γ^{- 1} {(\frac{2}{N (N - 1)})}^{1 / 2} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} (\frac{T^{1 / 2} N^{- 1} δ_{i}^{'} δ_{j}}{{tr}^{1 / 2} (M_{i} Σ) {tr}^{1 / 2} (M_{j} Σ)}) \neq 0 .

From Theorem 4, the test has nontrivial power against the local alternative that contracts to the null at the rate of

T^{- 1 / 4} N^{- 1 / 2}

. Hence, Theorem 4 establishes the consistency of the proposed test at the rate of

N \sqrt{T}

under the alternative, as long as

ψ \neq 0

.
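The limit in (25) translates into an approximate local power function. The expression below is a sketch under two assumptions that are not stated in the theorem: the test is two-sided at level α with critical value z_{1 - α/2}, and replacing γ by its estimate does not change the limit under the local alternative. Φ denotes the standard normal distribution function.

```latex
% Assumes rejection when |CD_R| > z_{1-\alpha/2} and that the studentized
% statistic inherits the N(\psi, 1) limit in (25).
\lim_{(N,T)\to\infty} P\left( |CD_R| > z_{1-\alpha/2} \right)
  = \Phi\left( \psi - z_{1-\alpha/2} \right) + \Phi\left( -\psi - z_{1-\alpha/2} \right)
```

At ψ = 0 the right-hand side equals α, and it increases toward one as |ψ| grows.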

4. Monte Carlo Simulations

This section conducts Monte Carlo simulations to examine the empirical size and power of the proposed test (CD

_{R}

) defined in (22) in heterogeneous panel data regression models. We also look at the performance of LM

_{P U Y}

and CD

_{P}

defined by (9) and (10), respectively, for comparison purposes. We consider four scenarios: (1) the errors are independent over time, with no serial correlation; (2) the errors follow a moving average model of order one

(MA (1))

over time; (3) the errors follow an Auto-Regressive model of order one (AR

(1)

) over time; (4) the errors follow an Auto-Regressive and Moving Average of order (1,1) (ARMA

(1, 1)

) over time. Finally, we provide small sample evidence on the power performance of the modified CD

_{R}

test against a factor and spatial auto-regressive model of order one alternatives, which are popular in economics for modeling cross-sectional correlation.

4.1. Experimental Design

Following Pesaran et al. [11], our experiments use the following data-generating process

\begin{matrix} y_{i t} & = α_{i} + β_{i} x_{i t} + u_{i t}, i = 1, \dots, N; t = 1, \dots, T, \end{matrix}

(26)

\begin{matrix} x_{i t} & = η x_{i t - 1} + υ_{i t}, \end{matrix}

(27)

where

α_{i} \sim

IID

N (1, 1);

β_{i} \sim

IID

N (1, 0.04) .

x_{i t}

is a strictly exogenous regressor, and we set

η = 0.6

and

υ_{i t} \sim

IID

N (0, ϕ_{i}^{2} / (1 - {0.6}^{2}))

with

ϕ_{i} \sim

IID

χ^{2} (6) / 6

, for

i = 1, \dots, N .

The error terms of (26) are generated using the following four data generating processes

\begin{matrix} (1) IID : u_{i t} = ξ_{i t}; \end{matrix}

(28)

\begin{matrix} (2) MA (1) : u_{i t} = ξ_{i t} + θ ξ_{i t - 1}; \end{matrix}

(29)

\begin{matrix} (3) AR (1) : u_{i t} = ρ u_{i t - 1} + ξ_{i t}; \end{matrix}

(30)

\begin{matrix} (4) ARMA (1, 1) : u_{i t} = ρ u_{i t - 1} + ξ_{i t} + θ ξ_{i t - 1}, \end{matrix}

(31)

where

ξ_{i t} = σ_{i} ε_{i t}

;

σ_{i}^{2} \sim

IID

χ^{2} (2) / 2

and

ε_{i t}

∼IID

(0, 1) .

We further set

θ = 0.8

and

ρ = 0.6

. To check the robustness of the tests to non-normal distributions,

ε_{i t}

are generated from a Normal

(0, 1)

and a Chi-squared distribution

(χ^{2} (2) / 2 - 1) .

To examine the empirical power of the tests, we consider two different cross-sectional correlation alternatives: factor and spatial models. The factor model is generated by

u_{i t}^{*} = λ_{i} f_{t} + u_{i t}, for i = 1, \dots, N; t = 1, \dots, T,

(32)

where

f_{t} \sim

IID

N (0, 1)

and

λ_{i} \sim

IID

U [0.1, 0.3] .

In this case,

u_{i t}^{*}

replaces

u_{i t}

in (26) for the power studies.

u_{i t}

is generated by the four scenarios defined by (28)–(31), respectively. For the spatial model, we consider a first-order spatial auto-correlation model (SAR(1)),

u_{i t}^{*} = δ (0.5 u_{i - 1, t}^{*} + 0.5 u_{i + 1, t}^{*}) + u_{i t},

(33)

where

δ = 0.4

and

u_{i t}

are defined by (28)–(31), respectively.
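The four error designs (28)–(31) can be sketched as follows. This is an illustrative generator only: it covers the normal case for ε, and the burn-in length, the seed handling and the function name `simulate_errors` are assumptions of the example rather than details taken from the experiments.

```python
import numpy as np

def simulate_errors(N, T, kind="iid", theta=0.8, rho=0.6, burn=50, seed=0):
    """Draw a T x N matrix of errors u_it under designs (28)-(31).

    kind : "iid", "ma1", "ar1" or "arma11".
    burn : number of discarded start-up periods (an assumption of this sketch).
    """
    if kind not in ("iid", "ma1", "ar1", "arma11"):
        raise ValueError(f"unknown design: {kind}")
    rng = np.random.default_rng(seed)
    sigma = np.sqrt(rng.chisquare(2, size=N) / 2.0)       # sigma_i
    xi = sigma * rng.standard_normal((T + burn + 1, N))   # xi_it = sigma_i * eps_it
    ma = theta if kind in ("ma1", "arma11") else 0.0
    ar = rho if kind in ("ar1", "arma11") else 0.0
    shock = xi[1:] + ma * xi[:-1]                          # xi_t + theta * xi_{t-1}
    u = np.zeros_like(shock)
    u[0] = shock[0]
    for t in range(1, shock.shape[0]):
        u[t] = ar * u[t - 1] + shock[t]                    # AR(1) recursion
    return u[burn:]
```

Setting both coefficients to zero recovers the IID design, so one code path serves all four cases.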

The experiments are conducted for

N = 10, 20, 30, 50, 100, 200

and

T = 10, 20, 30, 50, 100 .

For each pair of

(N, T),

we run 2000 replications. To obtain the empirical size, we conduct the proposed test (CD

_{R})

and CD

_{P}

at the two-sided 5% nominal significance level and LM

_{P U Y}

at the positive one-sided 5% nominal significance level.

4.2. Simulation Results

Table 1 reports the empirical size of CD

_{P}

, LM

_{P U Y}

and CD

_{R}

for normal and chi-squared distributed errors. The error terms are assumed to be independent over time. The results show that all of the tests have the correct size with different

(N, T)

combinations under both normal and chi-squared scenarios. Those are consistent with the theoretical findings. The only exceptions are for small N or T equal to 10, especially for LM

_{P U Y} .

Table 2 reports the empirical size of the three tests with MA(1) error terms defined by (29). The results show that CD

_{R}

has the correct size for all

(N, T)

, but CD

_{P}

has size distortions for different

(N, T)

combinations because the disturbances are MA(1) over time. For example, under the normality scenario, the size of CD

_{P}

is

9.35 %

for

N = 10

and

T = 20

; it becomes

11.1 %

when T grows to

100 .

LM

_{P U Y}

suffers serious size distortions, because of the extra bias caused by ignoring serial correlation. From Table 2, the empirical size of LM

_{P U Y}

is

100 %

as N or T becomes larger than 30. Table 3 and Table 4 report the empirical size of the tests with AR(1) and ARMA(1,1) errors under the two distributions: normal and chi-squared scenarios. Note that CD

_{R}

is over-sized in Table 4 for the chi-squared case when

T = 10 .

However, it has the correct size as T gets larger than

20 .

In contrast, LM

_{P U Y}

has serious size issues, rejecting

100 %

of the time, and CD

_{P}

is over-sized by as much as

25 %

. Overall, in comparison with CD

_{P}

and LM

_{P U Y}

, the proposed test CD

_{R}

controls for size distortions when serial correlation in the disturbances is present and is not much affected when serial correlation is not present.

Table 5 summarizes the size-adjusted power of CD

_{R}

with MA(1), AR(1) and ARMA(1,1) errors under the factor model alternative. Results show that CD

_{R}

performs reasonably well under the two distribution scenarios especially for N and

T > 10

. Table 6 confirms the power properties of CD

_{R}

for MA(1), AR(1) and ARMA(1,1) errors under the SAR(1) alternative, especially for large N and T.

5. Conclusions

In this paper, we find that in the large heterogeneous panel data model, LM

_{P U Y}

exhibits serious size bias when there is serial correlation in the disturbances. While CD

_{P}

is centered at zero, it still encounters size distortions caused by ignoring serial correlation. We modify Pesaran’s CD

_{P}

test to account for serial correlation of an unknown form in the error term and call it CD

_{R}

. This paper has several novel aspects: first, an unbiased and consistent estimate of the variance under the assumptions and the null of no cross-section correlation is proposed without knowing the form of serial correlation over time. Second, the limiting distribution of the test is derived as

(N, T) \to \infty

in any order. Third, it is distribution free. Simulations show that the proposed test CD

_{R}

successfully controls for size distortions with serial correlation in the error term. It also has reasonable power under the alternatives of a factor model and a spatial auto-correlation SAR(1) model for different serial correlation specifications.

Author Contributions

All authors contributed equally to the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix

This Appendix includes proofs of the main results in the text. The Appendix includes two parts: Part A includes some useful lemmas, which are frequently used in the proofs of the theorems; Part B gives the proofs of all of the theorems included in the paper.

Let us introduce some notation before proceeding: For two matrices

B = (b_{i j})

and

C = (c_{i j})

, we define

B \circ C = (b_{i j} c_{i j})

. ∑ denotes summation over mutually-different indices, e.g.,

\sum_{(i_{1}, i_{2}, j_{1}, j_{2})}

means summation over

\{(i_{1}, i_{2}, j_{1}, j_{2}) : i_{1}, i_{2}, j_{1}, j_{2} \text{ are mutually different}\} .

Appendix A. Some Useful Lemmas

Lemma A1.

Let F and G be non-stochastic

N \times N

symmetric and positive definite matrices. Define

r = \frac{u_{i}^{'} F u_{i}}{u_{i}^{'} G u_{i}} .

Under Assumption 1, we have

(a): E $(r^{k}) = \frac{E [{(ε_{i}^{'} F ε_{i})}^{k}]}{{[E (ε_{i}^{'} G ε_{i})]}^{k}}$ ;
(b): E $(ε_{i}^{'} F ε_{i}) =$ tr $(F);$
(c): E ${(ε_{i}^{'} F ε_{i})}^{2} = 2$ tr $(F^{2}) +$ tr $^{2} (F) + Δ$ tr $(F \circ F);$
(d): tr $(F \circ F) \leq$ tr $(F^{2}) .$

The proof of part (a) is given by Lieberman [24], and the proofs of parts (b)–(d) follow from Proposition 1 of Chen et al. [10]; hence, we omit them here.
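As a sanity check on the moment results of Lemma A1, the following sketch verifies parts (b)–(d) exactly for one small example, with (c) taken in the form used in the proofs below, 2 tr(F²) + tr²(F) + Δ tr(F ∘ F). The assumptions are ours: the entries of ε are i.i.d. Rademacher draws (±1 with equal probability), so that E(ε) = 0, E(ε²) = 1 and Δ = E(ε⁴) − 3 = −2, and F is an arbitrary symmetric 3 × 3 matrix chosen for illustration. Because the distribution is discrete, the expectations are computed by full enumeration rather than simulation.

```python
from itertools import product


def trace(A):
    return sum(A[i][i] for i in range(len(A)))


def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def hadamard(A, B):
    """Element-wise (Hadamard) product A o B."""
    n = len(A)
    return [[A[i][j] * B[i][j] for j in range(n)] for i in range(n)]


def quad_form(F, e):
    """Quadratic form e' F e."""
    n = len(e)
    return sum(e[i] * F[i][j] * e[j] for i in range(n) for j in range(n))


F = [[2, 1, 0],
     [1, 3, 1],
     [0, 1, 4]]          # illustrative symmetric matrix
DELTA = 1 - 3            # E(eps^4) - 3 for Rademacher draws

# All 2^3 equally likely sign vectors.
draws = list(product((-1, 1), repeat=len(F)))
exact_first = sum(quad_form(F, e) for e in draws) / len(draws)
exact_second = sum(quad_form(F, e) ** 2 for e in draws) / len(draws)

formula_second = (2 * trace(matmul(F, F)) + trace(F) ** 2
                  + DELTA * trace(hadamard(F, F)))

print(exact_first, trace(F))          # part (b)
print(exact_second, formula_second)   # part (c)
print(trace(hadamard(F, F)), trace(matmul(F, F)))  # part (d)
```

The enumeration and the closed forms agree for this example, which is a check of the identities for one case rather than a proof.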

Lemma A2.

Define

B_{j} = M_{j} Σ M_{j},

for any

j .

Under Assumptions 1–3 and the null in (2), we have

(a): E $({\hat{ρ}}_{i j}^{2}) = \frac{t r (B_{i} B_{j})}{t r (B_{i}) t r (B_{j})};$
(b): E $({\hat{ρ}}_{i j}^{4}) \leq (3 + Δ) \frac{(2 + Δ) t r {(B_{i} B_{j})}^{2} + {t r}^{2} (B_{i} B_{j})}{{t r}^{2} (B_{i}) {t r}^{2} (B_{j})};$
(c): For any $j_{1} \neq j_{2},$ E $({\hat{ρ}}_{i j_{1}}^{2} {\hat{ρ}}_{i j_{2}}^{2}) \leq \frac{{((2 + Δ) t r {(B_{i} B_{j_{1}})}^{2} + {t r}^{2} (B_{i} B_{j_{1}}))}^{1 / 2} {((2 + Δ) t r {(B_{i} B_{j_{2}})}^{2} + {t r}^{2} (B_{i} B_{j_{2}}))}^{1 / 2}}{t r (B_{j_{1}}) t r (B_{j_{2}}) {t r}^{2} (B_{i})} .$

Proof.

Recall that the pair-wise correlation coefficient is defined as

{\hat{ρ}}_{i j} = v_{i}^{'} v_{j} = \sum_{t = 1}^{T} v_{i t} v_{j t},

where

v_{i}

are the scaled residual vectors defined by

v_{i} = \frac{e_{i}}{{(e_{i}^{'} e_{i})}^{1 / 2}} .

e_{i}

is the OLS residual vector from the individual-specific least squares regression, and it is given by

e_{i} = M_{i} u_{i} = M_{i} σ_{i} Γ ε_{i}, with M_{i} = I_{T} - P_{X_{i}} = I_{T} - X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'},

where

M_{i}

is idempotent. Consider part (a),

E ({\hat{ρ}}_{i j}^{2}) = E {(v_{i}^{'} v_{j})}^{2} = E {(\frac{e_{i}^{'} e_{j}}{{(e_{i}^{'} e_{i})}^{1 / 2} {(e_{j}^{'} e_{j})}^{1 / 2}})}^{2} = E (\frac{e_{i}^{'} A_{j} e_{i}}{e_{i}^{'} e_{i}}),

where

A_{j} = \frac{e_{j} e_{j}^{'}}{e_{j}^{'} e_{j}} .

Then

E ({\hat{ρ}}_{i j}^{2}) = E [E ({\hat{ρ}}_{i j}^{2} | ε_{j})] = E [E (\frac{e_{i}^{'} A_{j} e_{i}}{e_{i}^{'} e_{i}} | ε_{j})] .

Since

e_{i} = M_{i} σ_{i} Γ ε_{i},

and using parts (a) and (b) of Lemma A1, we have E

(\frac{e_{i}^{'} A_{j} e_{i}}{e_{i}^{'} e_{i}} | ε_{j}) = \frac{tr (Γ^{'} M_{i} A_{j} M_{i} Γ)}{tr (Γ^{'} M_{i} Γ)} .

Moreover,

\begin{matrix} E [tr (Γ^{'} M_{i} A_{j} M_{i} Γ)] & = E (\frac{ε_{j}^{'} Γ^{'} M_{j} M_{i} Γ Γ^{'} M_{i} M_{j} Γ ε_{j}}{ε_{j}^{'} Γ^{'} M_{j} Γ ε_{j}}) \\ = \frac{tr (Γ^{'} M_{j} M_{i} Γ Γ^{'} M_{i} M_{j} Γ)}{tr (Γ^{'} M_{j} Γ)} \\ = \frac{tr (M_{j} Σ M_{j} M_{i} Σ M_{i})}{tr (M_{j} Σ)} . \end{matrix}

Together with the above results, we have:

E ({\hat{ρ}}_{i j}^{2}) = \frac{tr (M_{j} Σ M_{j} M_{i} Σ M_{i})}{tr (M_{i} Σ) tr (M_{j} Σ)} = \frac{tr (B_{i} B_{j})}{tr (B_{i}) tr (B_{j})} .

Consider part (b),

\begin{matrix} E (ρ_{i j}^{4}) & = E [E (ρ_{i j}^{4} ∣ v_{j})] = E (E [({(\frac{e_{i}^{'} A_{j} e_{i}}{e_{i}^{'} e_{i}})}^{2} ∣ v_{j})]) = E [\frac{E {(ε_{i}^{'} Γ^{'} M_{i} A_{j} M_{i} Γ ε_{i})}^{2}}{{tr}^{2} (Γ^{'} M_{i} Γ)} ∣ v_{j}] \\ = E [\frac{2 tr {(Γ^{'} M_{i} A_{j} M_{i} Γ)}^{2} + {tr}^{2} (Γ^{'} M_{i} A_{j} M_{i} Γ) + Δ tr (Γ^{'} M_{i} A_{j} M_{i} Γ \circ Γ^{'} M_{i} A_{j} M_{i} Γ)}{{tr}^{2} (B_{i})}] . \end{matrix}

Using part (a) of Lemma A1, we have

E [{tr}^{2} (Γ^{'} M_{i} A_{j} M_{i} Γ)] = E {(\frac{ε_{j}^{'} Γ^{'} M_{j} M_{i} Γ Γ^{'} M_{i} M_{j} Γ ε_{j}}{ε_{j}^{'} Γ^{'} M_{j} Γ ε_{j}})}^{2} = \frac{E {(ε_{j}^{'} Γ^{'} M_{j} M_{i} Γ Γ^{'} M_{i} M_{j} Γ ε_{j})}^{2}}{{[E (ε_{j}^{'} Γ^{'} M_{j} Γ ε_{j})]}^{2}} .

Using part (c) of Lemma A1, we also have

\begin{matrix} E {(ε_{j}^{'} Γ^{'} M_{j} M_{i} Γ Γ^{'} M_{i} M_{j} Γ ε_{j})}^{2} = & 2 tr {(Γ^{'} M_{j} M_{i} Γ Γ^{'} M_{i} M_{j} Γ)}^{2} + {tr}^{2} (Γ^{'} M_{j} M_{i} Γ Γ^{'} M_{i} M_{j} Γ) \\ + Δ tr (Γ^{'} M_{j} M_{i} Γ Γ^{'} M_{i} M_{j} Γ \circ Γ^{'} M_{j} M_{i} Γ Γ^{'} M_{i} M_{j} Γ) \\ = & 2 tr {(B_{i} B_{j})}^{2} + {tr}^{2} (B_{i} B_{j}) + Δ tr (Γ^{'} M_{j} M_{i} Γ Γ^{'} M_{i} M_{j} Γ \circ Γ^{'} M_{j} M_{i} Γ Γ^{'} M_{i} M_{j} Γ) \\ \leq & (2 + Δ) tr {(B_{i} B_{j})}^{2} + {tr}^{2} (B_{i} B_{j}) . \end{matrix}

With the fact that E

(ε_{j}^{'} Γ^{'} M_{j} Γ ε_{j}) =

tr

(B_{j}),

we obtain

E [{tr}^{2} (Γ^{'} M_{i} A_{j} M_{i} Γ)] \leq \frac{(2 + Δ) tr {(B_{i} B_{j})}^{2} + {tr}^{2} (B_{i} B_{j})}{{tr}^{2} (B_{j})} .

Next, we consider E

[tr {(Γ^{'} M_{i} A_{j} M_{i} Γ)}^{2}] .

\begin{matrix} E [tr {(Γ^{'} M_{i} A_{j} M_{i} Γ)}^{2}] & = E [{(\frac{ε_{j}^{'} Γ^{'} M_{j} M_{i} Γ Γ^{'} M_{i} M_{j} Γ ε_{j}}{ε_{j}^{'} Γ^{'} M_{j} Γ ε_{j}})}^{2}] \\ \leq \frac{(2 + Δ) tr {(B_{i} B_{j})}^{2} + {tr}^{2} (B_{i} B_{j})}{{tr}^{2} (B_{j})} . \end{matrix}

Hence,

E ({\hat{ρ}}_{i j}^{4}) \leq (3 + Δ) \frac{(2 + Δ) tr {(B_{i} B_{j})}^{2} + {tr}^{2} (B_{i} B_{j})}{{tr}^{2} (B_{i}) {tr}^{2} (B_{j})} .

Consider part (c). We have

\begin{matrix} E ({\hat{ρ}}_{i j_{1}}^{2} {\hat{ρ}}_{i j_{2}}^{2}) & = EE ({\hat{ρ}}_{i j_{1}}^{2} {\hat{ρ}}_{i j_{2}}^{2} | v_{i}) = E (E ({\hat{ρ}}_{i j_{1}}^{2} | v_{i}) E ({\hat{ρ}}_{i j_{2}}^{2} | v_{i})) \\ = \frac{E (v_{i}^{'} B_{j_{1}} v_{i} v_{i}^{'} B_{j_{2}} v_{i})}{tr (B_{j_{1}}) tr (B_{j_{2}})} . \end{matrix}

Note that

|E (v_{i}^{'} B_{j_{1}} v_{i} v_{i}^{'} B_{j_{2}} v_{i})| \leq {[E {(v_{i}^{'} B_{j_{1}} v_{i})}^{2}]}^{1 / 2} {[E {(v_{i}^{'} B_{j_{2}} v_{i})}^{2}]}^{1 / 2}

by using the Cauchy–Schwarz inequality and

E {(v_{i}^{'} B_{j_{1}} v_{i})}^{2} = E {(\frac{ε_{i}^{'} Γ^{'} M_{i} M_{j_{1}} Γ Γ^{'} M_{j_{1}} M_{i} Γ ε_{i}}{ε_{i}^{'} Γ^{'} M_{i} Γ ε_{i}})}^{2} \leq \frac{(2 + Δ) tr {(B_{i} B_{j_{1}})}^{2} + {tr}^{2} (B_{i} B_{j_{1}})}{{tr}^{2} (B_{i})} .

Hence:

E ({\hat{ρ}}_{i j_{1}}^{2} {\hat{ρ}}_{i j_{2}}^{2}) \leq \frac{{((2 + Δ) tr {(B_{i} B_{j_{1}})}^{2} + {tr}^{2} (B_{i} B_{j_{1}}))}^{1 / 2} {((2 + Δ) tr {(B_{i} B_{j_{2}})}^{2} + {tr}^{2} (B_{i} B_{j_{2}}))}^{1 / 2}}{tr (B_{j_{1}}) tr (B_{j_{2}}) {tr}^{2} (B_{i})} .

☐

Lemma A3.

Under Assumptions 1–3 and the null in (2), for any fixed positive number k, we have

(a): $\frac{1}{T}$ tr $(Σ^{k}) = O (1);$
(b): $\frac{1}{T}$ tr $(B_{i}^{k}) = O (1);$
(c): $\frac{1}{T}$ tr $(B_{i_{1}} B_{i_{2}} \dots B_{i_{k}}) = O (1),$ for $i_{1} \neq i_{2} \neq \dots \neq i_{k} .$

Proof.

Part (a) is directly from Bai and Zhou [20]; hence, we omit it here. Next, we consider part (b). Since

I_{T} - P_{X_{i}}

is idempotent for any

i = 1, \dots, N

, we have tr

(B_{i}^{k}) =

tr

{[(I_{T} - P_{X_{i}}) Σ (I_{T} - P_{X_{i}})]}^{k} =

tr

({[(I_{T} - P_{X_{i}}) Σ]}^{k}) .

By using the inequality that for any positive definite matrices A and B (see Bushell and Trustrum [25])

tr {(A B)}^{k} \leq tr (A^{k} B^{k}),

we have

tr (B_{i}^{k}) \leq tr ((I_{T} - P_{X_{i}}) Σ^{k}) \leq tr (Σ^{k}) .

Using part (a), then

\frac{1}{T} tr (B_{i}^{k}) \leq \frac{1}{T} tr (Σ^{k}) = O (1) .

For part (c), each

B_{i_{l}}, l = 1, \dots, k,

is positive semi-definite. We also have

B_{i_{l}} \leq

Σ,

l = 1, \dots, k .

By using the fact that for any matrices

A, B,

with

A \leq B

and C positive definite, tr

(A C) \leq

tr

(B C),

we conclude that

\frac{1}{T} tr (B_{i_{1}} B_{i_{2}} \dots B_{i_{k}}) \leq \frac{1}{T} tr (Σ^{k}) = O (1) .

Part (c) holds. ☐
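Part (b) of Lemma A3 can be checked numerically for k = 2 in a simple special case. The sketch below makes assumptions that are not part of the lemma: Σ is the covariance matrix of an MA(1) process with a hypothetical coefficient θ = 0.8, and the only regressor is an intercept, so that the residual-maker matrix is the demeaning matrix. It computes tr(B²)/T and tr(Σ²)/T for several T and confirms that the first never exceeds the second, and that both stay bounded as T grows.

```python
def ma1_covariance(T, theta):
    """T x T covariance matrix of u_t = e_t + theta * e_{t-1}, Var(e_t) = 1."""
    sigma = [[0.0] * T for _ in range(T)]
    for t in range(T):
        sigma[t][t] = 1.0 + theta ** 2
        if t + 1 < T:
            sigma[t][t + 1] = sigma[t + 1][t] = theta
    return sigma


def demeaning_matrix(T):
    """M = I - 11'/T, the residual maker for an intercept-only regression."""
    return [[(1.0 if s == t else 0.0) - 1.0 / T for t in range(T)]
            for s in range(T)]


def matmul(A, B):
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]


def trace_of_square(A):
    """tr(A A) computed without forming the product."""
    n = len(A)
    return sum(A[i][k] * A[k][i] for i in range(n) for k in range(n))


def scaled_traces(T, theta):
    """Return (tr(B^2) / T, tr(Sigma^2) / T) with B = M Sigma M."""
    sigma = ma1_covariance(T, theta)
    M = demeaning_matrix(T)
    B = matmul(matmul(M, sigma), M)
    return trace_of_square(B) / T, trace_of_square(sigma) / T


for T in (10, 20, 40):
    print(T, scaled_traces(T, 0.8))
```

For this design, tr(Σ²)/T equals (1 + θ²)² + 2(T − 1)θ²/T, which is bounded by (1 + θ²)² + 2θ² for every T, in line with part (a).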

Appendix B. Proof of the Theorems

Appendix B.1. Proof of Theorem 1

Proof.

Since E

(e_{i} | X_{i}) = 0

and

ε_{i}, i = 1, \dots, N,

are independent, it is easy to show that

E ({\hat{ρ}}_{i j}) = 0,

which further implies E

(T_{n}) = 0 .

Next, we consider the variance of

T_{n} .

var (\sum_{i = 1}^{N} \sum_{j = 1}^{i - 1} {\hat{ρ}}_{i j}) = E {(\sum_{i = 1}^{N} \sum_{j = 1}^{i - 1} {\hat{ρ}}_{i j})}^{2} = E (\sum_{i_{1} = 1}^{N} \sum_{j_{1} = 1}^{i_{1} - 1} \sum_{i_{2} = 1}^{N} \sum_{j_{2} = 1}^{i_{2} - 1} {\hat{ρ}}_{i_{1} j_{1}} {\hat{ρ}}_{i_{2} j_{2}}) .

To calculate the above term, we have three cases to discuss:

(1): $i_{1}, i_{2}, j_{1}, j_{2}$ are mutually different. E $({\hat{ρ}}_{i_{1} j_{1}} {\hat{ρ}}_{i_{2} j_{2}}) = 0 .$
(2): $i_{1} = i_{2},$ $j_{1} = j_{2}$ . By using Lemma A2, we have E $({\hat{ρ}}_{i j}^{2}) = \frac{tr (B_{i} B_{j})}{tr (B_{i}) tr (B_{j})} .$
(3): $i_{1} = i_{2},$ $i_{1} \neq j_{1} \neq j_{2} .$ Since $v_{i_{1}}, v_{j_{1}}$ and $v_{j_{2}}$ are independent, we have $E ({\hat{ρ}}_{i_{1} j_{1}} {\hat{ρ}}_{i_{1} j_{2}}) = E (v_{i_{1}}^{'} v_{j_{1}} v_{i_{1}}^{'} v_{j_{2}}) = 0 .$

Hence, the above results give us the variance of

T_{n},

which is

\begin{matrix} γ^{2} & = var (T_{n}) = \frac{1}{N (N - 1)} \sum_{i = 1}^{N} \sum_{j = 1, j \neq i}^{N} \frac{tr (M_{j} Σ M_{j} M_{i} Σ M_{i})}{tr (M_{i} Σ) tr (M_{j} Σ)} \\ = \frac{2}{N (N - 1)} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{tr (B_{i} B_{j})}{tr (B_{i}) tr (B_{j})}, \end{matrix}

and Theorem 1 is proven. ☐
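The variance expression can be evaluated directly when Σ and the regressors are known. The following sketch is an illustration under simplifying assumptions of ours, not the estimator used in the paper: each unit has a single regressor column x_i, so that M_i = I − x_i x_i'/(x_i'x_i) needs no matrix inversion, and Σ is supplied by the user. The helper names are hypothetical.

```python
def matmul(A, B):
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]


def trace(A):
    return sum(A[i][i] for i in range(len(A)))


def trace_of_product(A, B):
    """tr(A B) computed without forming the product."""
    n = len(A)
    return sum(A[i][k] * B[k][i] for i in range(n) for k in range(n))


def annihilator(x):
    """M = I - x x' / (x'x) for a single regressor column x."""
    T = len(x)
    xx = sum(v * v for v in x)
    return [[(1.0 if s == t else 0.0) - x[s] * x[t] / xx for t in range(T)]
            for s in range(T)]


def gamma_squared(regressors, sigma):
    """2 / (N (N - 1)) * sum_{i > j} tr(B_i B_j) / (tr(B_i) tr(B_j))."""
    B = []
    for x in regressors:
        M = annihilator(x)
        B.append(matmul(matmul(M, sigma), M))   # B_i = M_i Sigma M_i
    N = len(B)
    total = 0.0
    for i in range(1, N):
        for j in range(i):
            total += trace_of_product(B[i], B[j]) / (trace(B[i]) * trace(B[j]))
    return 2.0 * total / (N * (N - 1))


# Toy example: T = 4, Sigma = I and three mutually orthogonal regressors.
identity = [[1.0 if s == t else 0.0 for t in range(4)] for s in range(4)]
xs = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
print(gamma_squared(xs, identity))
```

In the toy example each B_i is a projection with trace 3 and tr(B_i B_j) = 2 for i ≠ j, so every term in the sum equals 2/9 and the function returns 2/9.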

Appendix B.2. Proof of Theorem 2

Proof.

To prove this theorem, we need to employ the Martingale central limit theorem (Billingsley [26]). For that purpose, we define

F_{0} = \{ϕ, Ω\},

F_{N i}

as the σ-field generated by

\{ε_{1}, ε_{2}, \dots, ε_{i}\}

for

1 \leq i \leq N .

Let

E_{N r} (\cdot)

denote the conditional expectation given filtration

F_{N r} [E_{0} (\cdot) = E (\cdot)] .

Write

T_{n} = \sum_{i = 1}^{N} D_{N, i}

with

D_{N, 1} = 0 .

More specifically,

D_{N, i} = {(\frac{2}{N (N - 1)})}^{1 / 2} \sum_{j = 1}^{i - 1} v_{i}^{'} v_{j} .

For every

N,

we can further show that

E (D_{N, i} ∣ F_{N, i - 1}) = 0 .

Hence,

D_{N, i}

(1 \leq i \leq N)

is a martingale difference sequence with respect to

F_{N, i}

(1 \leq i \leq N) .

Let

δ_{N i}^{2} =

E

[{(D_{N i})}^{2} ∣ F_{N, i - 1}] .

By applying the Martingale central limit theorem, it is sufficient to show that, as

(N, T) ⟶ \infty

,

\frac{\sum_{i = 1}^{N} δ_{N i}^{2}}{var (T_{n})} \overset{p}{⟶} 1 and \frac{\sum_{i = 1}^{N} E (D_{N, i}^{4})}{{var}^{2} (T_{n})} ⟶ 0 .

Lemmas B1 and B2 prove the above conditions. Hence, we can apply the Martingale central limit theorem, and as

(N, T) ⟶ \infty

, we have

γ^{- 1} T_{n} \overset{d}{⟶} N (0, 1) .

☐

Lemma B1.

Under Assumptions 1–3 and the null (2), as

(N, T) \to \infty,

\frac{\sum_{i = 1}^{N} δ_{N i}^{2}}{v a r (T_{n})} \overset{p}{⟶} 1,

where

δ_{N i}^{2} =

E

[{(D_{N i})}^{2} ∣ F_{N, i - 1}] .

Proof.

To prove Lemma B1, we first show that E

(\sum_{i = 1}^{N} δ_{N i}^{2}) =

var

(T_{n}) .

Then, we will show that as

(N, T) \to \infty,

var

(\sum_{i = 1}^{N} δ_{N i}^{2}) /

var

^{2} (T_{n}) \to 0 .

It is easy to show that

E (\sum_{i = 1}^{N} δ_{N i}^{2}) = \sum_{i = 1}^{N} E \{E [{(D_{N i})}^{2} ∣ F_{N, i - 1}]\} = var (T_{n}) .

Next, we only need to show that the second condition is satisfied. We first consider the magnitude of var(

T_{n}

). From Lemma A3, we know that

\frac{tr (B_{j} B_{i})}{tr (B_{i}) tr (B_{j})} = O (T^{- 1}),

which implies var

^{2} (T_{n}) = O (T^{- 2}) .

Now, consider var

(\sum_{i = 1}^{N} δ_{N i}^{2})

. Let

Q_{j} = \sum_{j = 1}^{i - 1} v_{j},

then:

\begin{matrix} δ_{N i}^{2} & = E [{(D_{N i})}^{2} ∣ F_{N, i - 1}] \\ = \frac{2}{N (N - 1)} E (v_{i}^{'} Q_{j} Q_{j}^{'} v_{i} ∣ F_{N, i - 1}) \\ = \frac{2}{N (N - 1)} E (\frac{ε_{i}^{'} Γ^{'} M_{i} Q_{j} Q_{j}^{'} M_{i} Γ ε_{i}}{(ε_{i}^{'} Γ^{'} M_{i} Γ ε_{i})} ∣ F_{N, i - 1}) \\ = \frac{2}{N (N - 1)} \frac{(Q_{j}^{'} M_{i} Γ Γ^{'} M_{i} Q_{j})}{tr (B_{i})} . \end{matrix}

Therefore, we need to show the magnitude of var

(\sum_{i = 1}^{N} Q_{j}^{'} M_{i} Γ Γ^{'} M_{i} Q_{j}) .

Rewrite

Q_{j}^{'} M_{i} Γ Γ^{'} M_{i} Q_{j} = \sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} = 1}^{i - 1} v_{j_{1}}^{'} B_{i} v_{j_{2}}

and:

E (\sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} = 1}^{i - 1} v_{j_{1}}^{'} M_{i} Γ Γ^{'} M_{i} v_{j_{2}}) = E (\sum_{j = 1}^{i - 1} v_{j}^{'} B_{i} v_{j}) = \sum_{j = 1}^{i - 1} E [\frac{ε_{j}^{'} Γ^{'} M_{j} B_{i} M_{j} Γ ε_{j}}{(ε_{j}^{'} Γ^{'} M_{j} Γ ε_{j})}] = \sum_{j = 1}^{i - 1} \frac{tr (B_{j} B_{i})}{tr (B_{j})} .

Next, we consider E

{(\sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} = 1}^{i - 1} v_{j_{1}}^{'} B_{i} v_{j_{2}})}^{2}

.

E {(\sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} = 1}^{i - 1} v_{j_{1}}^{'} B_{i} v_{j_{2}})}^{2} = E \sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} = 1}^{i - 1} \sum_{j_{3} = 1}^{i - 1} \sum_{j_{4} = 1}^{i - 1} (v_{j_{1}}^{'} B_{i} v_{j_{2}} v_{j_{3}}^{'} B_{i} v_{j_{4}}) .

To calculate the order of magnitude of the above term, we have three cases to discuss:

(1): $j_{1} = j_{2} = j_{3} = j_{4} = j$ .

$\begin{matrix} E {(v_{j}^{'} B_{i} v_{j})}^{2} = & E \frac{{(ε_{j}^{'} Γ^{'} M_{j} B_{i} M_{j} Γ ε_{j})}^{2}}{{(ε_{j}^{'} Γ^{'} M_{j} Γ ε_{j})}^{2}} = \frac{E {(ε_{j}^{'} Γ^{'} M_{j} B_{i} M_{j} Γ ε_{j})}^{2}}{{[E (ε_{j}^{'} Γ^{'} M_{j} Γ ε_{j})]}^{2}} \\ = & \frac{{tr}^{2} (B_{j} B_{i}) + 2 tr {(B_{j} B_{i})}^{2} + Δ tr (B_{j} B_{i} \circ B_{j} B_{i})}{{tr}^{2} (B_{j})} \leq \frac{(3 + Δ) {tr}^{2} (B_{j} B_{i})}{{tr}^{2} (B_{j})} . \end{matrix}$
(2): $j_{1} = j_{2} \neq j_{3} = j_{4} .$

$E (v_{j_{1}}^{'} B_{i} v_{j_{1}}) (v_{j_{3}}^{'} B_{i} v_{j_{3}}) = E (v_{j_{1}}^{'} B_{i} v_{j_{1}}) E (v_{j_{3}}^{'} B_{i} v_{j_{3}}) = \frac{tr (B_{j_{1}} B_{i})}{tr (B_{j_{1}})} \frac{tr (B_{j_{3}} B_{i})}{tr (B_{j_{3}})} .$
(3): $j_{1} = j_{3} \neq j_{2} = j_{4} .$

$\begin{matrix} E (v_{j_{1}}^{'} B_{i} v_{j_{2}}) (v_{j_{1}}^{'} B_{i} v_{j_{2}}) = & E [E (v_{j_{1}}^{'} B_{i} v_{j_{2}} v_{j_{2}}^{'} B_{i} v_{j_{1}} ∣ v_{j_{2}})] \\ = & E [\frac{tr (Γ^{'} M_{j_{1}} B_{i} M_{j_{2}} Γ ε_{j_{2}} ε_{j_{2}}^{'} Γ^{'} M_{j_{2}} B_{i} M_{j_{1}} Γ)}{tr (M_{j_{1}} Σ) ε_{j_{2}}^{'} Γ^{'} M_{j_{2}} Γ ε_{j_{2}}}] \\ = & \frac{tr (B_{j_{2}} B_{i} B_{j_{1}} B_{i})}{tr (B_{j_{1}}) tr (B_{j_{2}})} . \end{matrix}$

Hence,

\begin{matrix} var (Q_{j}^{'} M_{i} Γ Γ^{'} M_{i} Q_{j}) & = E {(Q_{j}^{'} M_{i} Γ Γ^{'} M_{i} Q_{j})}^{2} - {[E (Q_{j}^{'} M_{i} Γ Γ^{'} M_{i} Q_{j})]}^{2} \\ \leq \sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} = 1, j_{2} \neq j_{1}}^{i - 1} \frac{tr (B_{j_{1}} B_{i}) tr (B_{j_{2}} B_{i})}{tr (B_{j_{1}}) tr (B_{j_{2}})} + (3 + Δ) \sum_{j = 1}^{i - 1} \frac{{tr}^{2} (B_{j} B_{i})}{{tr}^{2} (B_{j})} \\ + 2 \sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} = 1, j_{2} \neq j_{1}}^{i - 1} \frac{tr (B_{j_{2}} B_{i} B_{j_{1}} B_{i})}{tr (B_{j_{1}}) tr (B_{j_{2}})} - {(\sum_{j = 1}^{i - 1} \frac{tr (B_{j} B_{i})}{tr (B_{j})})}^{2} \\ = 2 \sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} = 1, j_{2} \neq j_{1}}^{i - 1} \frac{tr (B_{j_{2}} B_{i} B_{j_{1}} B_{i})}{tr (B_{j_{1}}) tr (B_{j_{2}})} + (2 + Δ) \sum_{j = 1}^{i - 1} \frac{{tr}^{2} (B_{j} B_{i})}{{tr}^{2} (B_{j})} . \end{matrix}

It further leads to

\begin{matrix} var (\sum_{i = 1}^{N} δ_{N i}^{2}) & \leq \frac{4}{N^{2} {(N - 1)}^{2}} N \sum_{i = 1}^{N} var (δ_{N i}^{2}) \\ \leq \frac{8}{N {(N - 1)}^{2}} \sum_{i = 1}^{N} \sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} = 1, j_{2} \neq j_{1}}^{i - 1} \frac{tr (B_{j_{2}} B_{i} B_{j_{1}} B_{i})}{{tr}^{2} (B_{i}) tr (B_{j_{1}}) tr (B_{j_{2}})} \\ + \frac{4 (2 + Δ)}{N {(N - 1)}^{2}} \sum_{i = 1}^{N} \sum_{j = 1}^{i - 1} \frac{{tr}^{2} (B_{j} B_{i})}{{tr}^{2} (B_{i}) {tr}^{2} (B_{j})} . \end{matrix}

By using Lemma A3, we have

var (\sum_{i = 1}^{N} δ_{N i}^{2}) \leq K [O (\frac{1}{T^{3}}) + O (\frac{1}{N T^{2}})] .

As

(N, T) \to \infty,

var

(\sum_{i = 1}^{N} δ_{N i}^{2}) /

var

^{2} (T_{n}) \to 0 .

Lemma B1 is proven. ☐

Lemma B2.

Under Assumptions 1–3 and the null (2), as

(N, T) \to \infty,

\frac{\sum_{i = 1}^{N} E (D_{N, i}^{4})}{{v a r}^{2} (T_{n})} ⟶ 0 .

Proof.

Rewrite

\begin{matrix} E (D_{N, i}^{4}) & = E [E (D_{N, i}^{4} | F_{N, i - 1})] = E \{E [{(v_{i}^{'} Q_{j} Q_{j}^{'} v_{i})}^{2} ∣ F_{N, i - 1}]\} \\ = E [\frac{{tr}^{2} (Γ^{'} M_{i} Q_{j} Q_{j}^{'} M_{i} Γ) + 2 tr {(Γ^{'} M_{i} Q_{j} Q_{j}^{'} M_{i} Γ)}^{2} + Δ tr (Γ^{'} M_{i} Q_{j} Q_{j}^{'} M_{i} Γ \circ Γ^{'} M_{i} Q_{j} Q_{j}^{'} M_{i} Γ)}{{tr}^{2} (B_{i})}] . \end{matrix}

By using the results from Lemma B1, we have

\begin{matrix} E [{tr}^{2} (Γ^{'} M_{i} Q_{j} Q_{j}^{'} M_{i} Γ)] & = E {(Q_{j}^{'} B_{i} Q_{j})}^{2} \\ \leq \sum_{j_{1} = 1}^{i - 1} \sum_{j_{3} = 1, j_{3} \neq j_{1}}^{i - 1} \frac{tr (B_{j_{1}} B_{i}) tr (B_{j_{3}} B_{i})}{tr (B_{j_{1}}) tr (B_{j_{3}})} + (3 + Δ) \sum_{j = 1}^{i - 1} \frac{{tr}^{2} (B_{j} B_{i})}{{tr}^{2} (B_{j})} \\ + 2 \sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} = 1, j_{2} \neq j_{1}}^{i - 1} \frac{tr (B_{j_{2}} B_{i} B_{j_{1}} B_{i})}{tr (B_{j_{1}}) tr (B_{j_{2}})} . \end{matrix}

Since

tr {(Γ^{'} M_{i} Q_{j} Q_{j}^{'} M_{i} Γ)}^{2} \leq {tr}^{2} (Γ^{'} M_{i} Q_{j} Q_{j}^{'} M_{i} Γ)

and

tr (Γ^{'} M_{i} Q_{j} Q_{j}^{'} M_{i} Γ \circ Γ^{'} M_{i} Q_{j} Q_{j}^{'} M_{i} Γ) \leq {tr}^{2} (Γ^{'} M_{i} Q_{j} Q_{j}^{'} M_{i} Γ),

thus

\begin{matrix} \sum_{i = 1}^{N} E (D_{N, i}^{4}) & \leq \frac{K}{N^{2} {(N - 1)}^{2}} \sum_{i = 1}^{N} \sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} = 1, j_{2} \neq j_{1}}^{i - 1} \frac{tr (B_{j_{1}} B_{i}) tr (B_{j_{2}} B_{i})}{{tr}^{2} (B_{i}) tr (B_{j_{1}}) tr (B_{j_{2}})} \\ + \frac{K}{N^{2} {(N - 1)}^{2}} \sum_{i = 1}^{N} \sum_{j = 1}^{i - 1} \frac{{tr}^{2} (B_{j} B_{i})}{{tr}^{2} (B_{i}) {tr}^{2} (B_{j})} \\ + \frac{K}{N^{2} {(N - 1)}^{2}} \sum_{i = 1}^{N} \sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} = 1, j_{2} \neq j_{1}}^{i - 1} \frac{tr (B_{j_{2}} B_{i} B_{j_{1}} B_{i})}{{tr}^{2} (B_{i}) tr (B_{j_{1}}) tr (B_{j_{2}})} \\ \leq \frac{K^{2}}{N T^{2}} = O (\frac{1}{N T^{2}}) . \end{matrix}

Hence,

\frac{\sum_{i = 1}^{N} E (D_{N, i}^{4})}{{var}^{2} (T_{n})} ⟶ 0,

as

(N, T) \to \infty .

Lemma B2 is proven. ☐

Appendix B.3. Proof of Theorem 3

Proof.

We want to show

E ({\hat{γ}}^{2}) = γ^{2} and {\hat{γ}}^{2} - γ^{2} = o_{p} (1) .

Note that

\begin{matrix} {\hat{γ}}^{2} & = \frac{1}{2 N (N - 1)} \sum_{(i, j)}^{N} v_{i}^{'} (v_{j} - {\bar{v}}_{(i, j)}) v_{j}^{'} (v_{i} - {\bar{v}}_{(i, j)}) \\ = \frac{1}{2 N (N - 1)} [\sum_{(i, j)}^{N} {(v_{i}^{'} v_{j})}^{2} - v_{i}^{'} v_{j} v_{j}^{'} {\bar{v}}_{(i, j)} - v_{i}^{'} {\bar{v}}_{(i, j)} v_{j}^{'} v_{i} + v_{i}^{'} {\bar{v}}_{(i, j)} v_{j}^{'} {\bar{v}}_{(i, j)}] \\ = a_{1} + a_{2} + a_{3} + a_{4}, say . \end{matrix}

It is easy to show that the first term E

(a_{1}) = γ^{2}

, and E

(a_{i}) = 0, i = 2, 3, 4 .

Therefore, we prove the first part. By using Lemma A3 and Theorem 1, we have

γ^{2} = O (T^{- 1}) .

Hence, to prove

{\hat{γ}}^{2} - γ^{2} = o_{p} (1)

, we only need to show that var(

a_{1}

)

= o_{p} (T^{- 2})

and

a_{i} = o_{p} (γ^{2}),

for

i = 2, 3, 4 .

Let us consider var(

a_{1}

).

\begin{matrix} var (a_{1}) & = E (a_{1}^{2}) - γ^{4} \\ = \frac{4}{N^{2} {(N - 1)}^{2}} E {(\sum_{i = 1}^{N} \sum_{j = 1}^{i - 1} {\hat{ρ}}_{i j}^{2})}^{2} - \frac{4}{N^{2} {(N - 1)}^{2}} {(\sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{tr (B_{j} B_{i})}{tr (B_{i}) tr (B_{j})})}^{2} \\ = \frac{4}{N^{2} {(N - 1)}^{2}} E (\sum_{i_{1} = 2}^{N} \sum_{j_{1} = 1}^{i_{1} - 1} \sum_{i_{2} = 2}^{N} \sum_{j_{2} = 1}^{i_{2} - 1} ρ_{i_{1} j_{1}}^{2} ρ_{i_{2} j_{2}}^{2}) - \frac{4}{N^{2} {(N - 1)}^{2}} {(\sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{tr (B_{j} B_{i})}{tr (B_{i}) tr (B_{j})})}^{2} . \end{matrix}

Now, we only consider the term E

(\sum_{i_{1} = 2}^{N} \sum_{j_{1} = 1}^{i_{1} - 1} \sum_{i_{2} = 2}^{N} \sum_{j_{2} = 1}^{i_{2} - 1} ρ_{i_{1} j_{1}}^{2} ρ_{i_{2} j_{2}}^{2}) .

There are three cases for this term, and Lemma A2 is used frequently:

(1): $i_{1}, i_{2}, j_{1}$ and $j_{2}$ are mutually different.

$E (ρ_{i_{1} j_{1}}^{2} ρ_{i_{2} j_{2}}^{2}) = \frac{tr (B_{i_{1}} B_{j_{1}}) tr (B_{i_{2}} B_{j_{2}})}{tr (B_{i_{1}}) tr (B_{j_{1}}) tr (B_{i_{2}}) tr (B_{j_{2}})} = O_{p} (\frac{1}{T^{2}}) .$
(2): $i_{1} = i_{2},$ $j_{1} =$ $j_{2}$ and $i_{1} \neq j_{1} .$

$E (ρ_{i j}^{4}) \leq (3 + Δ) \frac{(2 + Δ) tr {(B_{i} B_{j})}^{2} + {tr}^{2} (B_{i} B_{j})}{{tr}^{2} (B_{i}) {tr}^{2} (B_{j})} = O_{p} (\frac{1}{T^{2}}) .$
(3): $i_{1} = i_{2},$ $i_{1} \neq$ $j_{1} \neq$ $j_{2} .$

$\begin{matrix} E (ρ_{i j_{1}}^{2} ρ_{i j_{2}}^{2}) & \leq \frac{{((2 + Δ) tr {(B_{i} B_{j_{1}})}^{2} + {tr}^{2} (B_{i} B_{j_{1}}))}^{1 / 2} {((2 + Δ) tr {(B_{i} B_{j_{2}})}^{2} + {tr}^{2} (B_{i} B_{j_{2}}))}^{1 / 2}}{tr (B_{j_{1}}) tr (B_{j_{2}}) {tr}^{2} (B_{i})} \\ = O_{p} (\frac{1}{T^{2}}) . \end{matrix}$

From the above results, we have

var (a_{1}) = O_{p} (\frac{1}{N^{2} T^{2}}) .

Hence

a_{1} \overset{p}{\to} γ^{2} .

Consider the second term

a_{2}

, which is equal to

\frac{1}{2 N (N - 1) (N - 2)} \sum_{(i, j, τ)}^{N} v_{i}^{'} v_{j} v_{j}^{'} v_{τ} .

The first term of E

{(\sum_{(i, j, τ)}^{N} v_{i}^{'} v_{j} v_{j}^{'} v_{τ})}^{2}

is

\begin{matrix} \sum_{(i, j_{1}, j_{2}, τ)}^{N} E (v_{i}^{'} v_{j_{1}} v_{j_{1}}^{'} v_{τ} v_{i}^{'} v_{j_{2}} v_{j_{2}}^{'} v_{τ}) \\ = & \sum_{(i, j_{1}, j_{2}, τ)}^{N} \frac{t r (M_{j_{2}} M_{τ} Σ M_{τ} M_{i} Σ M_{i} M_{j_{1}} Σ M_{j_{1}} M_{j_{2}} Σ)}{t r (B_{τ}) t r (B_{j_{2}}) t r (B_{j_{1}}) t r (B_{i})} \\ = & O (N^{4} T^{- 3}), \end{matrix}

by using Lemmas A2 and A3. By using part (c) of Lemma A3, the second term of E

{(\sum_{(i, j, τ)}^{N} v_{i}^{'} v_{j} v_{j}^{'} v_{τ})}^{2}

is

E [\sum_{(i, j, τ)}^{N} {(v_{i}^{'} v_{j} v_{j}^{'} v_{τ})}^{2}] = O_{p} (N^{3} T^{- 2}) .

Hence,

a_{2} = O_{p} (N^{- 1} T^{- 3 / 2}) + O_{p} (N^{- 3 / 2} T^{- 1}),

which further implies

a_{2} = o_{p} (γ^{2}) .

Since

a_{2} = a_{3},

a_{3} = o_{p} (γ^{2}) .

Consider

a_{4}

; it can be divided into two terms

\frac{1}{2 N (N - 1) {(N - 2)}^{2}} \sum_{(i, j, τ)}^{N} (v_{i}^{'} v_{τ} v_{j}^{'} v_{τ}) and \frac{1}{2 N (N - 1) {(N - 2)}^{2}} \sum_{(i, j, τ_{1}, τ_{2})}^{N} (v_{i}^{'} v_{τ_{1}} v_{j}^{'} v_{τ_{2}}) .

It is easy to show that the former term is

O_{p} (N^{- 1} a_{2}),

then it is

o_{p} (γ^{2}) .

We only need to consider the latter term E

{(\sum_{(i, j, τ_{1}, τ_{2})}^{N} (v_{i}^{'} v_{τ_{1}} v_{j}^{'} v_{τ_{2}}))}^{2}

.

E [\sum_{(i, j, τ_{1}, τ_{2})}^{N} {(v_{i}^{'} v_{τ_{1}} v_{j}^{'} v_{τ_{2}})}^{2}] = \sum_{(i, j, τ_{1}, τ_{2})}^{N} E [{(v_{i}^{'} v_{τ_{1}})}^{2} {(v_{j}^{'} v_{τ_{2}})}^{2}] = O (N^{4} T^{- 2}),

by using Lemmas A2 and A3. Hence, the latter term is

O_{p} (N^{- 2} T^{- 1}) .

The above results together lead to

a_{4} = o_{p} (γ^{2}) .

The first part of Theorem 3 holds; the second part of Theorem 3 is directly derived by using Theorem 2 and the first part of Theorem 3. ☐

Appendix B.4. Proof of Theorem 4

Proof.

The OLS residuals under the local alternative are defined by

e_{i} = M_{i} u_{i} = σ_{i} (M_{i} Γ ε_{i} + M_{i} F λ_{i})

, thus

\begin{matrix} T_{n} = & {(\frac{2}{N (N - 1)})}^{1 / 2} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{{(M_{i} Γ ε_{i} + M_{i} F λ_{i})}^{'} (M_{j} Γ ε_{j} + M_{j} F λ_{j})}{| | M_{i} Γ ε_{i} + M_{i} F λ_{i} | | | | M_{j} Γ ε_{j} + M_{j} F λ_{j} | |} \\ = & {(\frac{2}{N (N - 1)})}^{1 / 2} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{{(M_{i} Γ ε_{i} + M_{i} F λ_{i})}^{'} (M_{j} Γ ε_{j} + M_{j} F λ_{j})}{{(ε_{i}^{'} Γ^{'} M_{i} Γ ε_{i} + 2 ε_{i}^{'} Γ^{'} M_{i} F λ_{i} + λ_{i}^{'} F^{'} M_{i} F λ_{i})}^{1 / 2} {(ε_{j}^{'} Γ^{'} M_{j} Γ ε_{j} + 2 ε_{j}^{'} Γ^{'} M_{j} F λ_{j} + λ_{j}^{'} F^{'} M_{j} F λ_{j})}^{1 / 2}} . \end{matrix}

Consider the denominator. Note that

E {(ε_{i}^{'} Γ^{'} M_{i} Γ ε_{i})}^{2} = 2 tr {(Σ M_{i})}^{2} + {[tr (Σ M_{i})]}^{2} + Δ tr (Σ M_{i} \circ Σ M_{i}) = O (T^{2})

, which leads to

ε_{i}^{'} Γ^{'} M_{i} Γ ε_{i} = O_{p} (T)

. Consider the term

ε_{i}^{'} Γ^{'} M_{i} F λ_{i}

. Note that

ε_{i}^{'} Γ^{'} M_{i} F λ_{i} = ε_{i}^{'} Γ^{'} F λ_{i} - ε_{i}^{'} Γ^{'} X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'} F λ_{i} .

From Assumption 4, we have

X_{i}^{'} F = O_{p} (T^{1 / 2})

,

ε_{i}^{'} Γ^{'} F = O_{p} (T^{1 / 2})

and

ε_{i}^{'} Γ^{'} X_{i} = O_{p} (T^{1 / 2})

, which lead to

| | ε_{i}^{'} Γ^{'} F λ_{i} | | = O_{p} (T^{1 / 4} N^{- 1 / 2})

and

| | ε_{i}^{'} Γ^{'} X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'} F λ_{i} | | = O_{p} (T^{- 1 / 4} N^{- 1 / 2}) .

Hence,

ε_{i}^{'} Γ^{'} M_{i} F λ_{i} = o_{p} (ε_{i}^{'} Γ^{'} M_{i} Γ ε_{i})

. Similarly, by using Assumption 4, we also have

λ_{i}^{'} F^{'} M_{i} F λ_{i} = o_{p} (ε_{i}^{'} Γ^{'} M_{i} Γ ε_{i})

. From the above results, we further have

ε_{i}^{'} Γ^{'} M_{i} Γ ε_{i} + 2 ε_{i}^{'} Γ^{'} M_{i} F λ_{i} + λ_{i}^{'} F^{'} M_{i} F λ_{i} = (1 + o_{p} (1)) ε_{i}^{'} Γ^{'} M_{i} Γ ε_{i} .

It results in

\begin{matrix} T_{n} = & {(\frac{2}{N (N - 1)})}^{1 / 2} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{ε_{i}^{'} Γ^{'} M_{i} M_{j} Γ ε_{j} + ε_{i}^{'} Γ^{'} M_{i} M_{j} F λ_{j} + λ_{i}^{'} F^{'} M_{i} M_{j} Γ ε_{j} + λ_{i}^{'} F^{'} M_{i} M_{j} F λ_{j}}{{((1 + o_{p} (1)) ε_{i}^{'} Γ^{'} M_{i} Γ ε_{i})}^{1 / 2} {((1 + o_{p} (1)) ε_{j}^{'} Γ^{'} M_{j} Γ ε_{j})}^{1 / 2}} \\ = & T_{n 1} + T_{n 2} + T_{n 3} + T_{n 4}, \end{matrix}

where

T_{n 1} = {(\frac{2}{N (N - 1)})}^{1 / 2} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{ε_{i}^{'} Γ^{'} M_{i} M_{j} Γ ε_{j}}{D_{i j}}

;

T_{n 2} = {(\frac{2}{N (N - 1)})}^{1 / 2} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{ε_{i}^{'} Γ^{'} M_{i} M_{j} F λ_{j}}{D_{i j}}

;

T_{n 3} = {(\frac{2}{N (N - 1)})}^{1 / 2} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{λ_{i}^{'} F^{'} M_{i} M_{j} Γ ε_{j}}{D_{i j}}

and

T_{n 4} = {(\frac{2}{N (N - 1)})}^{1 / 2} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{λ_{i}^{'} F^{'} M_{i} M_{j} F λ_{j}}{D_{i j}}

with

D_{i j} = {((1 + o_{p} (1)) ε_{i}^{'} Γ^{'} M_{i} Γ ε_{i})}^{1 / 2} {((1 + o_{p} (1)) ε_{j}^{'} Γ^{'} M_{j} Γ ε_{j})}^{1 / 2}

. From Theorem 2,

γ^{- 1} T_{n 1} \overset{d}{⟶} N (0, 1)

.

From Theorem 1,

T_{n 1} = O_{p} (T^{- 1 / 2})

. Consider

T_{n 2}

. We observe that

E (T_{n 2}) = 0

and

\begin{matrix} E {(T_{n 2})}^{2} = & \frac{2}{N (N - 1)} E {(\sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{ε_{i}^{'} Γ^{'} M_{i} M_{j} F λ_{j}}{D_{i j}})}^{2} \\ = & \frac{2}{N (N - 1)} [\sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} E {(\frac{ε_{i}^{'} Γ^{'} M_{i} M_{j} F λ_{j}}{D_{i j}})}^{2} + \sum_{i = 2}^{N} \sum_{j_{1} = 1}^{i - 1} \sum_{j_{2} \neq j_{1}}^{i - 1} E \frac{ε_{i}^{'} Γ^{'} M_{i} M_{j_{1}} F λ_{j_{1}} λ_{j_{2}}^{'} F^{'} M_{j_{2}} M_{i} Γ ε_{i}}{D_{i j_{1}} D_{i j_{2}}}] . \end{matrix}

Consider the term

ε_{i}^{'} Γ^{'} M_{i} M_{j} F λ_{j}

.

\begin{matrix} ε_{i}^{'} Γ^{'} M_{i} M_{j} F λ_{j} \\ = & ε_{i}^{'} Γ^{'} F λ_{j} - ε_{i}^{'} Γ^{'} X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'} F λ_{j} - ε_{i}^{'} Γ^{'} X_{j} {(X_{j}^{'} X_{j})}^{- 1} X_{j}^{'} F λ_{j} + ε_{i}^{'} Γ^{'} X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'} X_{j} {(X_{j}^{'} X_{j})}^{- 1} X_{j}^{'} F λ_{j} . \end{matrix}

Using Assumption 4 and under the local alternative, we first have

| | ε_{i}^{'} Γ^{'} F λ_{j} | | = O_{p} (T^{1 / 4} N^{- 1 / 2})

; we then have

| | ε_{i}^{'} Γ^{'} X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'} F λ_{j} | | = O_{p} (T^{- 1 / 4} N^{- 1 / 2})

since

ε_{i}^{'} Γ^{'} X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'} F = (\frac{ε_{i}^{'} Γ^{'} X_{i}}{\sqrt{T}}) {(\frac{X_{i}^{'} X_{i}}{T})}^{- 1} (\frac{X_{i}^{'} F}{\sqrt{T}}) = O_{p} (1);

we last have

| | ε_{i}^{'} Γ^{'} X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'} X_{j} {(X_{j}^{'} X_{j})}^{- 1} X_{j}^{'} F λ_{j} | | = O_{p} (T^{- 1 / 4} N^{- 1 / 2})

. Hence,

| | ε_{i}^{'} Γ^{'} M_{i} M_{j} F λ_{j} | | = O_{p} (T^{1 / 4} N^{- 1 / 2})

. Together with the fact that

| | D_{i j} | | = O_{p} (T)

, the first term of

E {(T_{n 2})}^{2}

is of order

O_{p} (T^{- 3 / 2} N^{- 1})

. Similarly to the argument above,

| | ε_{i}^{'} Γ^{'} M_{i} M_{j_{1}} F λ_{j_{1}} λ_{j_{2}}^{'} F^{'} M_{j_{2}} M_{i} Γ ε_{i} | | = O_{p} (T^{1 / 2} N^{- 1})

; with the facts that

| | D_{i j_{1}} | | = O_{p} (T)

and

| | D_{i j_{2}} | | = O_{p} (T)

; the second term of

E {(T_{n 2})}^{2}

is of order

O_{p} (T^{- 3 / 2}) .

Thus,

T_{n 2} = O_{p} (T^{- 3 / 4}) = o_{p} (T_{n 1})

. Similarly,

T_{n 3} = o_{p} (T_{n 1})

.

Consider

T_{n 4}

. Note that

\begin{matrix} λ_{i}^{'} F^{'} M_{i} M_{j} F λ_{j} \\ = & λ_{i}^{'} F^{'} F λ_{j} - λ_{i}^{'} F^{'} X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'} F λ_{j} - λ_{i}^{'} F^{'} X_{j} {(X_{j}^{'} X_{j})}^{- 1} X_{j}^{'} F λ_{j} + λ_{i}^{'} F^{'} X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'} X_{j} {(X_{j}^{'} X_{j})}^{- 1} X_{j}^{'} F λ_{j} . \end{matrix}

From Assumption 4, we know that

λ_{i}^{'} F^{'} F λ_{j} = λ_{i}^{'} T (I_{r} + O_{p} (T^{- 1 / 2})) λ_{j} \overset{p}{⟶} T λ_{i}^{'} λ_{j}

. Since

F^{'} X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'} F = (\frac{F^{'} X_{i}}{\sqrt{T}}) {(\frac{X_{i}^{'} X_{i}}{T})}^{- 1} (\frac{X_{i}^{'} F}{\sqrt{T}}) = O_{p} (1),

λ_{i}^{'} F^{'} X_{i} {(X_{i}^{'} X_{i})}^{- 1} X_{i}^{'} F λ_{j} = o_{p} (λ_{i}^{'} F^{'} F λ_{j})

. Similarly, we can also show that the third and the fourth terms are of smaller order than the first term. Hence,

λ_{i}^{'} F^{'} M_{i} M_{j} F λ_{j} = (1 + o_{p} (1)) λ_{i}^{'} F^{'} F λ_{j}

.

Note that

E (λ_{i}^{'} F^{'} F λ_{j}) = T λ_{i}^{'} λ_{j} \neq 0

, and

{(\frac{2}{N (N - 1)})}^{1 / 2} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} \frac{λ_{i}^{'} F^{'} F λ_{j}}{D_{i j}} = O_{p} (T^{- 1 / 2})

; hence,

γ^{- 1} T_{n 4} = O_{p} (1)

. One can also show that

D_{i j} \overset{p}{⟶} {tr}^{1 / 2} (M_{i} Σ) {tr}^{1 / 2} (M_{j} Σ)

. Let

ψ = p l i m_{(N, T) \to \infty} γ^{- 1} {(\frac{2}{N (N - 1)})}^{1 / 2} \sum_{i = 2}^{N} \sum_{j = 1}^{i - 1} (\frac{T^{1 / 2} N^{- 1} δ_{i}^{'} δ_{j}}{{tr}^{1 / 2} (M_{i} Σ) {tr}^{1 / 2} (M_{j} Σ)})

; from all of the above results, as

(N, T) \to \infty

,

γ^{- 1} T_{n} - ψ \overset{d}{⟶} N (0, 1) .

☐
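The limit result can be read as saying that, under the local alternative, the standardized statistic is approximately N(ψ, 1). The sketch below turns that reading into an approximate local power curve. It assumes a two-sided test at the 5% level with critical value 1.96, which is our choice for illustration, and the values of ψ are hypothetical inputs rather than quantities computed from any model in the paper.

```python
import math


def normal_cdf(x):
    """Standard normal distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def asymptotic_power(psi, critical_value=1.96):
    """P(|Z + psi| > c) for Z ~ N(0, 1): rejection rate under a shift psi."""
    return (normal_cdf(-critical_value - psi)
            + 1.0 - normal_cdf(critical_value - psi))


for psi in (0.0, 0.5, 1.0, 2.0, 3.0):
    print(psi, round(asymptotic_power(psi), 4))
```

At ψ = 0 the function returns the nominal size, and the power increases with |ψ| symmetrically in its sign.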

References

L. Lee. “Consistency and Efficiency of Least Squares Estimation for Mixed Regressive, Spatial Autoregressive Models.” Econom. Theory 18 (2002): 252–277.
D.W.K. Andrews. “Cross-Section Regression with Common Shocks.” Econometrica 73 (2005): 1551–1585.
L. Anselin, and A.K. Bera. “Spatial Dependence in Linear Regression Models with an Introduction to Spatial Econometrics.” In Handbook of Applied Economic Statistics. Edited by A. Ullah and D.E. Giles. New York, NY, USA: Marcel Dekker, 1998, pp. 237–289.
B.H. Baltagi, S.H. Song, and W. Koh. “Testing Panel Data Regression Models with Spatial Error Correlation.” J. Econom. 117 (2003): 123–150.
T.S. Breusch, and A.R. Pagan. “The Lagrange Multiplier Test and Its Application to Model Specifications in Econometrics.” Rev. Econ. Stud. 47 (1980): 239–253.
I.M. Johnstone. “On the Distribution of the Largest Eigenvalue in Principal Components Analysis.” Ann. Stat. 29 (2001): 295–327.
T.F. Jiang. “The Limiting Distributions of Eigenvalues of Sample Correlation Matrices.” Sankhyā 66 (2004): 35–48.
O. Ledoit, and M. Wolf. “Some Hypothesis Tests for the Covariance Matrix When the Dimension is Large Compared to the Sample Size.” Ann. Stat. 30 (2002): 1081–1102.
J.R. Schott. “Testing for Complete Independence in High Dimensions.” Biometrika 92 (2005): 951–956. [Google Scholar] [CrossRef]
S.X. Chen, L.X. Zhang, and P.S. Zhong. “Tests for High Dimensional Covariance Matrices.” J. Am. Stat. Assoc. 105 (2010): 810–819. [Google Scholar] [CrossRef]
M.H. Pesaran, A. Ullah, and T. Yamagata. “A Bias-Adjusted LM Test of Error Cross-Section Independence.” Econom. J. 11 (2008): 105–127. [Google Scholar] [CrossRef]
B.H. Baltagi, Q. Feng, and C. Kao. “A Lagrange Multiplier Test for Cross-Sectional Dependence in a Fixed Effects Panel Data Model.” J. Econom. 170 (2012): 164–177. [Google Scholar] [CrossRef]
B.H. Baltagi, Q. Feng, and C. Kao. “Testing for Sphericity in a Fixed Effects Panel Data Model.” Econom. J. 14 (2011): 25–47. [Google Scholar] [CrossRef]
M.H. Pesaran. “General Diagnostic Test for Cross Section Dependence in Panels.” CESifo Working Paper Series No. 1229, IZA Discussion Paper No. 1240. Available online: http://ssrn.com/abstract=572504 (accessed on 2 May 2015).
M.H. Pesaran. “Testing Weak Cross-Sectional Dependence in Large Panels.” Econom. Rev. 34 (2015): 1089–1117. [Google Scholar] [CrossRef]
F. Moscone, and E. Tosetti. “A Review and Comparisons of Tests of Cross-Section Dependence in Panels.” J. Econ. Surv. 23 (2009): 528–561. [Google Scholar] [CrossRef]
V. Sarafidis, and T. Wansbeek. “Cross-Sectional Dependence in Panel Data Analysis.” Econom. Rev. 31 (2012): 483–531. [Google Scholar] [CrossRef] [Green Version]
A. Chudik, and M.H. Pesaran. “Large Panel Data Models with Cross-Sectional Dependence: A Survey.” In The Oxford Handbook on Panel Data. Edited by B.H. Baltagi. Oxford, UK: Oxford University Press, 2015, Chapter 1; pp. 3–45. [Google Scholar]
Z.D. Bai, and J.W. Silverstein. “CLT for Linear Spectral Statistics of Large-Dimensional Sample Covariance Matrices.” Ann. Probab. 32 (2004): 553–605. [Google Scholar]
Z.D. Bai, and W. Zhou. “Large Sample Covariance Matrices without Independence Structures in Columns.” Stat. Sin. 18 (2008): 425–442. [Google Scholar]
J.T. Gao, X. Han, G.M. Pan, and Y.R. Yang. “High Dimensional Correlation Matrices: CLT and Its Applications.” J. R. Stat. Soc. Ser. B Stat. Methodol., 2016. [Google Scholar] [CrossRef]
R.J. Muirhead. Aspects of Multivariate Statistical Theory. Hoboken, NJ, USA: John Wiley & Sons, 1982. [Google Scholar]
S.X. Chen, and Y.L. Qin. “A Two-Sample Test for High Dimensional Data with Application to Gene-Set Testing.” Ann. Stat. 38 (2010): 808–835. [Google Scholar] [CrossRef]
O. Lieberman. “A Laplace Approximation to the Moments of a Ratio of Quadratic Forms.” Biometrika 81 (1994): 681–690. [Google Scholar] [CrossRef]
P.J. Bushell, and G.B. Trustrum. “Trace Inequality for Positive Definite Matrix Power Products.” Linear Algebra Appl. 132 (1990): 173–178. [Google Scholar] [CrossRef]
P. Billingsley. Probability and Measure, 3rd ed. New York, NY, USA: Wiley, 1995. [Google Scholar]

^1. The inclusion of predetermined variables, which is the weakly-exogenous case, alters the results.
^2. We consider only the case in which the number of non-zero factor loading vectors is N or of order N, so that the model has strong cross-sectional correlation in the errors. For the case of weak cross-sectional correlation in the errors, we conjecture that the results are similar to those in Pesaran [15].

Table 1. Size of tests with IID errors over time.

Tests	(N,T)	Normal					Chi-Squared
Tests	(N,T)	10	20	30	50	100	10	20	30	50	100
${CD}_{R}$	10	5.75	5.90	5.50	4.75	6.45	5.90	4.80	5.55	5.15	6.45
	20	3.85	4.55	5.05	4.70	5.15	4.60	4.50	4.50	5.85	5.40
	30	4.45	4.10	4.70	5.10	4.60	4.40	4.80	4.45	4.50	6.25
	50	4.45	4.75	5.40	5.25	4.50	4.10	3.65	4.75	4.05	4.60
	100	4.65	4.85	4.20	5.65	5.30	4.35	4.80	4.70	4.35	4.95
	200	4.05	4.65	3.90	4.60	5.00	5.65	5.05	4.85	4.65	5.40
${CD}_{P}$	10	5.60	5.50	5.25	4.10	6.00	5.60	4.70	5.05	4.70	5.65
	20	4.05	4.75	5.05	4.90	5.30	4.90	4.70	4.65	5.85	5.30
	30	4.90	4.45	4.85	5.20	5.00	5.20	5.20	4.55	5.00	6.05
	50	4.95	5.20	5.60	5.55	4.45	5.00	4.15	5.00	4.55	4.70
	100	5.65	5.15	4.50	5.95	5.45	5.15	5.65	5.05	4.50	5.05
	200	5.00	5.00	4.45	4.85	5.15	6.35	5.75	5.15	4.70	5.55
${LM}_{PUY}$	10	6.75	6.05	6.10	6.00	5.60	6.60	6.85	7.65	7.95	6.60
	20	6.20	5.45	6.75	7.00	5.50	7.05	6.40	6.40	7.15	5.60
	30	6.20	6.25	5.40	6.35	5.95	7.65	5.95	6.35	5.85	7.00
	50	6.55	4.95	5.25	5.60	5.40	7.00	6.85	7.20	5.40	5.85
	100	8.10	5.45	5.40	4.60	4.55	7.00	5.85	6.10	5.85	5.90
	200	8.60	5.75	6.50	5.90	5.35	8.00	7.20	6.30	6.40	6.70

Notes: This table reports the size of ${CD}_{P}$, ${LM}_{PUY}$ and ${CD}_{R}$ with $u_{it} = ξ_{it}$, where $ξ_{it} = σ_{i} ε_{it}$ and $σ_{i}^{2} \sim IID\ χ^{2}(2)/2$. The $ε_{it} \sim IID(0, 1)$ are generated from normal and Chi-squared distributions. The tests are conducted at the 5% nominal significance level.
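The design in these notes can be sketched in a few lines of Python. The block below draws heteroskedastic IID errors $u_{it} = σ_{i} ε_{it}$ and estimates, by Monte Carlo, the empirical size of Pesaran's CD statistic at the 5% level. It is a simplified illustration rather than the authors' code: the statistic is applied to demeaned raw errors (which amounts to a regression on an intercept only) instead of residuals from the full panel regression, the Chi-squared innovations are assumed to be $χ^{2}(1)$ draws standardized to mean zero and unit variance, and the function names `draw_errors`, `cd_pesaran` and `empirical_size` are hypothetical.

```python
import numpy as np

def draw_errors(N, T, dist="normal", rng=None):
    """Draw an N x T array u_it = sigma_i * eps_it with sigma_i^2 ~ chi2(2)/2."""
    rng = np.random.default_rng() if rng is None else rng
    sigma = np.sqrt(rng.chisquare(2, size=N) / 2.0)
    if dist == "normal":
        eps = rng.standard_normal((N, T))
    elif dist == "chi2":
        # Assumption: chi2(1) draws standardized to mean 0 and variance 1.
        eps = (rng.chisquare(1, size=(N, T)) - 1.0) / np.sqrt(2.0)
    else:
        raise ValueError("dist must be 'normal' or 'chi2'")
    return sigma[:, None] * eps

def cd_pesaran(u):
    """Pesaran's CD statistic from the pairwise correlations of the rows of u."""
    N, T = u.shape
    rho = np.corrcoef(u)                  # N x N sample correlation matrix
    upper = rho[np.triu_indices(N, k=1)]  # correlations with i < j
    return np.sqrt(2.0 * T / (N * (N - 1))) * upper.sum()

def empirical_size(N, T, reps=2000, dist="normal", seed=0):
    """Share of replications in which |CD| exceeds the two-sided 5% critical value."""
    rng = np.random.default_rng(seed)
    stats = np.array([cd_pesaran(draw_errors(N, T, dist, rng)) for _ in range(reps)])
    return np.mean(np.abs(stats) > 1.96)

if __name__ == "__main__":
    print(empirical_size(N=20, T=30))
```

With errors that are independent over time, the rejection frequency should be close to the nominal 5%, up to simulation noise; the figures in the table come from the authors' own design and are not expected to be reproduced exactly by this sketch.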

Table 2. Size of tests with MA(1) errors.

Tests	(N,T)	Normal					Chi-Squared
Tests	(N,T)	10	20	30	50	100	10	20	30	50	100
${CD}_{R}$	10	6.10	6.25	4.45	5.35	6.25	6.30	5.40	5.90	5.85	6.50
	20	5.15	4.80	5.05	4.60	5.30	5.20	5.35	4.70	6.15	4.75
	30	4.50	4.35	4.20	5.35	4.95	5.55	4.75	4.90	5.30	6.15
	50	5.25	4.50	5.30	5.70	4.30	5.00	4.65	4.60	4.35	4.85
	100	4.75	5.35	4.50	5.45	5.60	5.80	4.15	5.45	4.35	4.90
	200	4.35	4.95	3.50	4.50	4.90	6.20	6.30	4.30	4.30	5.50
${CD}_{P}$	10	7.60	9.35	8.40	10.05	11.10	7.80	7.75	10.30	10.25	10.95
	20	6.60	8.30	9.95	9.10	10.90	7.00	8.95	9.30	10.70	10.50
	30	6.45	8.35	8.30	10.50	10.60	7.90	9.65	9.50	10.80	10.60
	50	7.45	7.95	10.75	11.30	9.65	7.55	7.90	9.20	9.70	9.15
	100	6.50	9.35	9.00	10.85	11.55	7.85	8.35	10.60	9.30	10.20
	200	6.65	8.45	8.45	9.70	10.95	9.90	9.50	9.35	9.65	11.20
${LM}_{PUY}$	10	37.95	54.40	57.10	59.55	60.70	39.15	53.00	56.50	60.75	61.55
	20	81.55	96.00	96.80	98.25	97.90	83.25	95.45	97.05	97.70	98.20
	30	98.30	100.00	100.00	100.00	100.00	98.45	100.00	100.00	100.00	100.00
	50	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00
	100	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00
	200	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00

Notes: This table reports the size of ${CD}_{P}$, ${LM}_{PUY}$ and ${CD}_{R}$ with $u_{it} = ξ_{it} + θ ξ_{i,t-1}$, where $ξ_{it} = σ_{i} ε_{it}$ and $σ_{i}^{2} \sim IID\ χ^{2}(2)/2$. The $ε_{it} \sim IID(0, 1)$ are generated from normal and Chi-squared distributions. The tests are conducted at the 5% nominal significance level.

Table 3. Size of tests with AR(1) errors.

Tests	(N,T)	Normal					Chi-Squared
Tests	(N,T)	10	20	30	50	100	10	20	30	50	100
${CD}_{R}$	10	6.10	6.25	4.90	6.15	6.75	6.05	4.80	6.10	6.00	5.65
	20	4.75	5.65	4.65	4.70	5.00	4.85	5.60	4.50	5.55	4.80
	30	4.15	4.85	4.00	4.55	4.65	5.50	4.25	5.75	5.10	6.65
	50	4.15	4.50	5.20	5.45	4.40	5.25	5.35	4.60	4.40	4.35
	100	4.35	4.80	4.80	5.45	4.80	5.75	4.15	5.30	4.05	5.10
	200	4.85	4.60	4.05	4.55	5.05	7.80	5.35	4.95	4.20	4.55
${CD}_{P}$	10	6.80	9.65	10.20	14.55	16.80	6.55	8.25	12.25	13.90	16.30
	20	5.75	9.50	11.35	13.25	16.85	5.90	9.60	11.50	15.05	15.45
	30	5.65	9.80	10.00	13.30	14.05	7.35	9.65	12.00	15.20	17.15
	50	5.90	8.45	11.95	14.80	14.10	7.10	9.55	9.70	12.40	15.80
	100	6.05	10.00	10.40	14.70	16.55	7.25	8.70	12.25	13.85	15.00
	200	6.65	9.00	10.25	13.30	16.70	9.40	10.30	10.85	13.70	16.10
${LM}_{PUY}$	10	37.95	54.40	57.10	59.55	60.70	27.60	66.30	82.45	90.80	95.35
	20	55.50	97.90	99.85	100.00	100.00	59.95	98.40	99.85	100.00	100.00
	30	98.30	99.95	100.00	100.00	100.00	82.75	100.00	100.00	100.00	100.00
	50	97.80	100.00	100.00	100.00	100.00	98.60	100.00	100.00	100.00	100.00
	100	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00
	200	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00

Notes: This table reports the size of ${CD}_{P}$, ${LM}_{PUY}$ and ${CD}_{R}$ with $u_{it} = ρ u_{i,t-1} + ξ_{it}$, where $ξ_{it} = σ_{i} ε_{it}$ and $σ_{i}^{2} \sim IID\ χ^{2}(2)/2$. The $ε_{it} \sim IID(0, 1)$ are generated from normal and Chi-squared distributions. The tests are conducted at the 5% nominal significance level.

Table 4. Size of tests with ARMA(1,1) errors.

Tests	(N,T)	Normal					Chi-Squared
Tests	(N,T)	10	20	30	50	100	10	20	30	50	100
${CD}_{R}$	10	6.95	6.45	4.90	6.20	5.85	7.20	5.25	6.40	5.40	5.45
	20	5.40	5.55	4.95	4.75	4.95	6.40	5.70	4.95	5.55	4.70
	30	4.65	4.75	4.05	4.80	4.65	7.45	4.60	5.95	5.10	6.50
	50	4.95	4.95	5.25	5.30	4.50	7.50	5.70	4.80	4.35	4.80
	100	5.05	5.15	4.60	5.10	4.90	10.25	5.10	4.65	4.00	4.80
	200	5.75	4.65	4.45	4.85	5.20	17.45	6.60	5.75	4.50	4.25
${CD}_{P}$	10	9.10	15.95	16.35	22.50	24.30	10.95	13.80	19.20	21.70	25.15
	20	8.30	14.40	17.80	20.15	25.05	10.10	14.80	18.90	22.85	23.15
	30	8.30	15.40	17.70	21.55	22.55	10.95	15.25	19.25	23.55	24.25
	50	8.70	14.85	18.80	22.70	23.40	11.75	15.40	17.30	19.15	23.95
	100	9.35	15.90	17.50	22.15	24.20	17.20	14.45	17.95	22.05	22.70
	200	9.50	14.05	18.35	20.00	24.95	25.45	17.00	18.55	21.35	24.65
${LM}_{PUY}$	10	83.65	98.45	99.45	99.75	99.80	83.65	98.40	99.70	99.90	100.00
	20	99.85	100.00	100.00	100.00	100.00	99.85	100.00	100.00	100.00	100.00
	30	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00
	50	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00
	100	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00
	200	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00	100.00

Notes: This table reports the size of ${CD}_{P}$, ${LM}_{PUY}$ and ${CD}_{R}$ with $u_{it} = ρ u_{i,t-1} + ξ_{it} + θ ξ_{i,t-1}$, where $ξ_{it} = σ_{i} ε_{it}$ and $σ_{i}^{2} \sim IID\ χ^{2}(2)/2$. The $ε_{it} \sim IID(0, 1)$ are generated from normal and Chi-squared distributions. The tests are conducted at the 5% nominal significance level.
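The serially-correlated error processes used in Tables 2–4 are all special cases of the ARMA(1,1) recursion in the notes above, so a single generator can cover them. The following Python sketch is one possible way to simulate them under stated assumptions: only normal innovations are drawn, a burn-in of 100 periods is discarded to reduce the influence of the starting value, and the function name `serial_errors` and the parameter values in the usage lines are hypothetical (the values of $ρ$ and $θ$ used in the paper are set in its experimental design, not in these notes).

```python
import numpy as np

def serial_errors(N, T, rho=0.0, theta=0.0, burn=100, rng=None):
    """Generate u_it = rho*u_{i,t-1} + xi_it + theta*xi_{i,t-1}, xi_it = sigma_i*eps_it."""
    rng = np.random.default_rng() if rng is None else rng
    sigma = np.sqrt(rng.chisquare(2, size=N) / 2.0)        # sigma_i^2 ~ chi2(2)/2
    xi = sigma[:, None] * rng.standard_normal((N, T + burn + 1))
    ma = xi[:, 1:] + theta * xi[:, :-1]                    # MA part, N x (T + burn)
    u = np.zeros_like(ma)
    u[:, 0] = ma[:, 0]
    for t in range(1, T + burn):
        u[:, t] = rho * u[:, t - 1] + ma[:, t]             # AR recursion
    return u[:, burn:]                                     # drop the burn-in periods

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    u_ma = serial_errors(20, 30, theta=0.5, rng=rng)             # MA(1), illustrative theta
    u_ar = serial_errors(20, 30, rho=0.5, rng=rng)               # AR(1), illustrative rho
    u_arma = serial_errors(20, 30, rho=0.5, theta=0.5, rng=rng)  # ARMA(1,1)
    print(u_ma.shape, u_ar.shape, u_arma.shape)
```

Setting `rho=0` and `theta=0` recovers the IID case of Table 1, which makes it easy to check a test's size with and without serial correlation on otherwise identical draws.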

Table 5. Size-adjusted power of ${CD}_{R}$: factor model.

DGP	(N,T)	Normal					Chi-Squared
DGP	(N,T)	10	20	30	50	100	10	20	30	50	100
MA(1)	10	14.55	23.95	30.30	45.40	63.05	21.95	30.75	33.65	46.00	66.10
	20	35.70	56.65	68.95	84.05	95.95	47.30	63.25	75.80	86.00	97.40
	30	59.65	81.70	91.75	97.65	99.95	69.75	87.50	92.60	98.00	99.95
	50	83.65	96.60	99.30	100.00	100.00	88.75	98.00	99.55	100.00	100.00
	100	96.75	99.95	100.00	100.00	100.00	98.90	99.90	100.00	100.00	100.00
	200	99.70	100.00	100.00	100.00	100.00	99.70	100.00	100.00	100.00	100.00
AR(1)	10	18.95	23.95	32.40	38.10	56.75	26.95	35.00	28.90	37.15	61.25
	20	45.60	62.10	69.95	81.45	94.20	55.10	67.45	74.85	85.65	96.60
	30	68.80	83.50	92.30	97.60	99.75	78.15	90.85	92.70	97.40	99.85
	50	88.55	97.45	99.40	100.00	100.00	92.90	98.50	99.65	100.00	100.00
	100	98.80	100.00	100.00	100.00	100.00	99.60	99.95	100.00	100.00	100.00
	200	99.90	100.00	100.00	100.00	100.00	99.85	100.00	100.00	100.00	100.00
ARMA(1,1)	10	7.70	7.70	10.00	10.80	14.80	9.65	10.35	8.80	9.60	19.60
	20	22.05	18.85	24.25	27.80	39.50	24.85	22.35	23.40	30.60	46.20
	30	37.75	37.45	46.15	48.90	75.00	41.75	47.35	44.15	53.15	71.25
	50	66.50	66.75	71.60	83.10	96.20	66.25	72.35	82.45	88.20	98.00
	100	91.15	96.60	98.75	99.90	100.00	90.45	98.55	99.40	99.95	100.00
	200	98.95	100.00	100.00	100.00	100.00	98.45	99.95	100.00	100.00	100.00

Notes: This table reports the size-adjusted power of ${CD}_{R}$ under a factor model that allows for cross-sectional correlation in the errors: $u_{it}^{*} = λ_{i} f_{t} + u_{it}$. The $u_{it}$ are generated by the MA(1), AR(1) and ARMA(1,1) processes defined in (29)–(31), with $ξ_{it} = σ_{i} ε_{it}$ and $σ_{i}^{2} \sim IID\ χ^{2}(2)/2$. The $ε_{it} \sim IID(0, 1)$ are generated from normal and Chi-squared distributions.
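To make the notion of size-adjusted power concrete, the Python sketch below replaces the asymptotic critical value with the empirical 95% quantile of the absolute statistic simulated under the null, and then counts rejections under the factor alternative $u_{it}^{*} = λ_{i} f_{t} + u_{it}$. This is one common way to compute size-adjusted power and is not necessarily the authors' exact procedure. Several ingredients are assumptions made for illustration: the loadings are drawn from a uniform distribution on (0.1, 0.3), the factor is standard normal, the idiosyncratic errors are IID normal for brevity, and a CD-type statistic based on raw pairwise correlations stands in for ${CD}_{R}$. All function names are hypothetical.

```python
import numpy as np

def add_factor(u, loadings, factor):
    """Return u*_it = lambda_i * f_t + u_it for an N x T error array u."""
    return loadings[:, None] * factor[None, :] + u

def size_adjusted_power(null_stats, alt_stats, level=0.05):
    """Rejection rate under the alternative using the empirical null critical value."""
    crit = np.quantile(np.abs(null_stats), 1.0 - level)
    return np.mean(np.abs(alt_stats) > crit)

def cd_stat(u):
    """Scaled sum of pairwise correlations (Pesaran's CD form), used as a stand-in."""
    N, T = u.shape
    rho = np.corrcoef(u)
    return np.sqrt(2.0 * T / (N * (N - 1))) * rho[np.triu_indices(N, k=1)].sum()

def demo(N=20, T=30, reps=500, seed=0):
    """Simulate the statistic under the null and the factor alternative."""
    rng = np.random.default_rng(seed)
    null_stats, alt_stats = [], []
    for _ in range(reps):
        u = rng.standard_normal((N, T))      # idiosyncratic errors (IID for brevity)
        lam = rng.uniform(0.1, 0.3, size=N)  # hypothetical factor loadings
        f = rng.standard_normal(T)           # hypothetical common factor
        null_stats.append(cd_stat(u))
        alt_stats.append(cd_stat(add_factor(u, lam, f)))
    return size_adjusted_power(np.array(null_stats), np.array(alt_stats))

if __name__ == "__main__":
    print(demo())
```

Because the critical value is taken from the simulated null distribution, a test that over-rejects under the null gains no artificial advantage, which is why size-adjusted power is the natural basis for comparing tests with different finite-sample sizes.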

Table 6. Size-adjusted power of ${CD}_{R}$: SAR(1) model.

DGP	(N,T)	Normal					Chi-Squared
DGP	(N,T)	10	20	30	50	100	10	20	30	50	100
MA(1)	10	38.85	60.55	72.20	88.25	97.30	43.05	67.15	72.55	88.45	97.70
	20	37.45	61.70	76.00	92.15	99.05	39.25	61.25	76.80	89.55	99.10
	30	39.60	64.55	78.60	92.00	99.60	40.30	65.65	78.80	91.90	99.35
	50	40.05	66.45	79.15	92.70	99.75	39.95	66.55	78.65	94.65	99.70
	100	33.60	62.70	80.55	92.55	99.65	37.85	64.65	79.20	94.40	99.90
	200	40.65	64.50	80.65	94.70	99.80	37.75	62.50	81.25	95.65	99.80
AR(1)	10	37.20	53.95	68.20	79.20	92.10	42.85	63.20	61.15	78.00	94.80
	20	38.25	56.50	69.30	82.90	95.85	38.55	55.50	68.65	83.70	97.20
	30	37.90	56.90	71.80	84.65	98.10	38.70	62.00	66.25	85.70	96.90
	50	38.80	59.80	71.40	86.60	98.60	39.70	59.15	71.25	89.00	99.00
	100	38.85	57.85	70.90	86.60	98.75	35.25	59.85	72.55	88.95	98.60
	200	40.75	55.95	74.40	87.75	98.80	33.80	56.00	70.85	90.40	99.10
ARMA(1,1)	10	29.00	43.40	58.05	70.20	85.90	32.75	49.75	51.30	67.40	88.20
	20	31.05	43.55	56.65	72.10	89.10	28.35	43.45	54.80	71.35	91.35
	30	30.00	45.70	59.35	71.35	94.20	28.10	48.10	54.00	73.05	91.90
	50	33.05	45.30	54.40	71.70	93.30	27.30	43.90	58.00	75.75	94.45
	100	30.60	45.15	55.50	75.40	94.95	21.80	45.45	57.85	77.35	94.75
	200	30.30	42.05	58.15	75.75	95.15	21.05	38.80	55.70	77.50	95.80

Notes: This table reports the size-adjusted power of ${CD}_{R}$ under a SAR(1) model that allows for cross-sectional correlation in the errors: $u_{it}^{*} = δ (0.5 u_{i-1,t}^{*} + 0.5 u_{i+1,t}^{*}) + u_{it}$ with $δ = 0.4$. The $u_{it}$ are generated by the MA(1), AR(1) and ARMA(1,1) processes defined in (29)–(31), with $ξ_{it} = σ_{i} ε_{it}$ and $σ_{i}^{2} \sim IID\ χ^{2}(2)/2$. The $ε_{it} \sim IID(0, 1)$ are generated from normal and Chi-squared distributions.
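The SAR(1) equation defines $u_{it}^{*}$ only implicitly, since each unit depends on its two neighbours. Stacking the $N$ units at time $t$ gives $u_{t}^{*} = δ W u_{t}^{*} + u_{t}$, so that $u_{t}^{*} = (I - δ W)^{-1} u_{t}$, where $W$ has 0.5 on the first sub- and super-diagonals. The Python sketch below solves this system directly. The treatment of the boundary units is an assumption: here units 1 and $N$ simply have one neighbour with weight 0.5 and there is no wrap-around. The function name `sar1_errors` is hypothetical.

```python
import numpy as np

def sar1_errors(u, delta=0.4):
    """Solve u* = delta * W u* + u for every period, i.e. u* = (I - delta*W)^{-1} u."""
    N = u.shape[0]
    W = np.zeros((N, N))
    idx = np.arange(N - 1)
    W[idx, idx + 1] = 0.5   # weight on unit i + 1
    W[idx + 1, idx] = 0.5   # weight on unit i - 1
    # Assumption: units 1 and N have a single neighbour (no wrap-around).
    return np.linalg.solve(np.eye(N) - delta * W, u)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    u = rng.standard_normal((6, 4))   # N = 6 units, T = 4 periods
    u_star = sar1_errors(u)
    # Interior units satisfy the SAR(1) equation from the notes.
    print(np.allclose(u_star[2], 0.4 * (0.5 * u_star[1] + 0.5 * u_star[3]) + u[2]))
```

Since every row of $W$ sums to at most one and $δ = 0.4$, the matrix $I - δ W$ is strictly diagonally dominant and therefore invertible, so the linear solve is well defined for any $N$.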

© 2016 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license ( http://creativecommons.org/licenses/by/4.0/).
