1. Introduction
Macroeconomic, technological, institutional, political, environmental, health, and sociological shocks are plausible in empirical research in many situations. For example, a researcher may be using a panel of countries to investigate a composite measure of health care attainment in terms of per capita health expenditure and educational attainment. Both the explained and the explanatory variables may be affected by the introduction of new medical technologies, medicines, hospital procedures, the occurrence of a flu epidemic, or of a particular cold winter or hot summer in a large region, or, as in recent years, the financial crisis. Such shocks are not observed. They affect several countries and their impact depends on the characteristics of the countries themselves. Similar problems arise in micro-econometric studies. For example, in analyzing a panel of executive compensation in terms of returns on assets, stock returns, level of responsibility and gender (among other explanatory variables), unobservable financial, political, environmental as well as industry specific shocks may occur and may affect both the dependent variables and the regressors simultaneously.
Andrews (
2005) has shown that common shocks—modelled by unobservable common factors in both the disturbance and the regressors—strongly affect the properties of the ordinary least squares (OLS) estimator in a linear regression model. Precisely,
Andrews (
2005) shows that, in order for the OLS estimator to be consistent, the factor loadings for the disturbance and the regressors must be uncorrelated conditional on the unobservable factors. These results have profound implications for applied researchers since reliable inferences based on OLS in the presence of common shock require very strong assumptions. In panel data with both large time series and cross-sectional dimensions, the assumption that the factor loadings in the disturbances and the regressors are conditionally uncorrelated can be relaxed (e.g.,
Pesaran 2006;
Bai 2009). However, this situation is certainly not the norm in microeconometric applications, where the time dimension tends to be limited.
Since the common shocks affect both the errors and the regressors, they induce correlation between some of the regressors and the disturbance term (we will refer to this as
factors endogeneity). Econometric models often contain explanatory variables that are endogenous due to simultaneity so that the dependent variable and some of the explanatory variables are co-determined (we will refer to this as
classical endogeneity). In the presence of instrumental variables, two standard approaches to endogeneity in panel data are the panel two-stage least squares (TSLS) (e.g., among others (
Wooldridge 2005;
Arellano 2016) and the panel limited information maximum likelihood (LIML) estimators (e.g.,
Wansbeek and Meijer 2000;
Alonso-Borrego and Arellano 1999; and
Wansbeek and Prak 2017). This paper investigates how the panel TSLS and LIML estimators are affected by common shocks for which, surprisingly, no results seem available in the literature.
The literature on the effects of common shocks in models affected by classical endogeneity is very small.
Ahn et al. (
2001,
2013) generalize a fixed effects model in which the unobserved individual effects vary over time, and they propose a generalized method of moments (GMM) estimator that generalizes the fixed effects estimator through quasi-differencing.
Robertson and Sarafidis (
2015) consider linear panel data models with classical endogeneity in which the common factors affect the errors and the factor loading may be correlated with the exogenous variables. Following
Ahn et al. (
2001,
2013),
Robertson and Sarafidis (
2015) regard the common factors as unknown parameters, investigate the identification conditions and suggest a GMM estimator. Notice that this literature on the effects of common shocks differs from the one initiated by
Andrews (
2005) in two fundamental ways: (1) common factors are regarded as parameters not as random variables; (2) the explanatory variables are correlated to the error factor loadings while
Andrews (
2005) assumes that both the explanatory variables and the error term depend on the same factors.
Harding and Lamarche (
2011,
2014) extend the model of
Pesaran (
2006) to allow for classical endogeneity. Precisely,
Harding and Lamarche (
2011) show that the estimators suggested by
Pesaran (
2006) also account for classical endogeneity when both the time series and the cross sectional dimensions are large (see also
Harding and Lamarche (
2014) for an approach based on quantiles). Thus, they investigate how the estimators of
Pesaran (
2006) are affected by classical endogeneity but are uninformative about how classical estimators are affected by factor endogeneity. Notice also that, by assuming that
tends to infinity, one allows the information about the shocks to accumulate over time. This is not usually a reasonable assumption in micro-econometric studies where the time dimension tends to be small.
This paper investigates the effects of factor endogeneity on standard estimators used in the presence of classical endogeneity by studying the asymptotic properties of the panel TSLS and LIML estimators. Our results, which are in line with those of
Andrews (
2005), show that as the cross-sectional dimension tends to infinity (for a fixed time dimension):
the panel TSLS and LIML estimators have a non-degenerate non-standard asymptotic distribution if the factor loadings in the explanatory variables and the instruments are correlated to the reduced form errors conditional on the common factors; and
they are consistent but have mixed-normal asymptotic distributions when the factor loadings in the explanatory variables and the instruments are uncorrelated to the reduced form errors conditional on the common factors. In this case, tests on the structural coefficients can be constructed in the usual way and have standard distributions under the null hypothesis.
Therefore, the presence of common shocks may have a significant impact on the statistical properties of the TSLS and the LIML estimators depending on the properties of the errors and regressors conditional on the common shocks. These estimators are consistent if the reduced form errors and regressors (including the instruments) are conditionally independent given the common shocks, but they are inconsistent otherwise. In other words, consistency of the TSLS and LIML estimators holds if the the model satisfies classical conditions of the validity of instrumental variables estimators conditional on the shocks (i.e., the ‘exogenous’ variables must be uncorrelated with the errors given the shocks and the instruments need to be correlated with the right-hand-side endogenous variables but be uncorrelated with the errors conditional on the shocks). As far as we know, there are no tests for conditional independence of the factor loadings affecting the endogenous and the exogenous variables directly. Thus, this work draws the attention of researchers on the possible inferential problems affecting classical estimators when unobservable shocks are plausible.
The rest of the paper is organized as follows.
Section 2 presents the model, the estimators considered, and the technical assumptions underlying the model.
Section 3 contains the main results and
Section 4 concludes. A discussion of some technical results—including stable convergence, conditional strong law of large numbers and conditional central limit theorem—and proofs of the main results are in an online
supplementary file.
2. The Model
We consider a panel data structural equation model with a fixed number of time periods
. The observations for unit
i concerning an endogenous variable observed over
T periods are collected in the
vector
. This depends linearly on the endogenous variables in
, the exogenous variables in
, and an unobservable structural error
:
where
,
and
are the structural parameters. The dimensions of vectors and matrices are reported in brackets the first time they are used, unless they are obvious from the context. The reduced form for
is
where
denotes the observations on the exogenous variables excluded from the structural equation, usually referred to as the instruments,
is unobservable reduced form errors, and
,
and
are the reduced form parameters. Notice that, for
, Equations (
1) and (
2) form a classical cross-sectional structural equations model. We will now discuss how a factor structure can be introduced in both the errors and some of the regressors in Equation (
1).
To allow for a general model, we define the two matrices and , where , . The matrices and are fixed selection matrices of full rank equal to, respectively, and . The matrices and represent observations on variables that are not affected by the common shocks such as gender, age and nationality; and contain observations on variables that are affected by common shocks.
We model common shocks by using unobservable factors structures. Precisely, we assume that
where
,
for
is a
unobservable vector of factors capturing the common shocks at time
t,
is a random matrix of factor loadings, and
is a random matrix representing the values of the exogenous variables
that one would observe if there were no common shocks. Notice that, although the shocks are common, the way they affect each unit
i is determined by
, which varies randomly from an individual to another. The number of factors,
m, affecting the regressors is finite but unknown.
We also assume that the common shocks affect the error terms. Without loss of generality, we impose a factor structure on the reduced form errors and see how this implies a factor structure in the error of Equation (
1). Let
then, the reduced form errors satisfy
where
is a matrix of factor loadings and
are proper idiosyncratic errors. Once again, different units are affected by the common shocks in different ways because the factor loadings vary randomly among units. The model considered has classical endogeneity due to the presence of the endogenous variables on the right-hand side of the structural equation. It also has factors’ endogeneity induced by the common shocks in the reduced form. To see this, we replace the reduced form into the structural equation, and obtain the compatibility restrictions
and
It follows from Equation (
6) that the structural parameter
is identified if and only if
In order to identify
and
one also needs
and
to be identified. Moreover, rewriting Equation (
7) using Equations (
4) and (
5), we obtain
Thus, the structural error also has a factor structure with factor loadings .
The model considered allows for the case in which the common factors affect the errors (both in the reduced form and the structural equation) as well as some of the instruments. It therefore entails a complex interaction between factors and classical endogeneity. Notice that the model could be generalized by imposing a more complex structure on the error term to allow for unobserved individual effects as in a random or fixed effects model.
We now briefly discuss the main differences between the model considered here and the models of
Robertson and Sarafidis (
2015) and
Harding and Lamarche (
2011).
Robertson and Sarafidis (
2015) regard the factors as unknown parameters. As a consequence, they require some extra identification conditions and standardizations, which include, for example, bounds on the number of factors that depend on the time series dimension of the panel (e.g., to have
factors the the number of waves must be
). A second difference is related to the specification of our Equation (
3), which allows the common shocks to affect the exogenous variables as additive shocks that may induce correlation between exogenous variables and the error terms through the correlation between
and
. On the other hand,
Robertson and Sarafidis (
2015) allow for a correlation between
and
but assume the factors to be constant parameters.
Harding and Lamarche (
2011) allow the reduced form errors to have a factor structure (so that both
and
have a factor structure) with unobserved but random factors but the instruments
are assumed not to be affected by shocks. They also focus on the case where
T and
N are large and study the properties of the CCEP and CCEMG of
Pesaran (
2006) but not those of standard estimators.
2.1. TSLS and LIML Estimators
Let
and
. The reduced form can then be written as
and the OLS estimators of
and
are respectively
and
where
, and
.
Let
where
, and
,
, and
are defined similarly. Then, the panel TSLS estimator of
is
and the panel LIML estimator is
The estimators for
and
are respectively
where
can be either the panel TSLS or LIML estimator. Notice that the panel TSLS and LIML estimators of the structural coefficients reduce to the classical TSLS and LIML estimators when
.
2.2. Model Assumptions
Following
Andrews (
2005) and
Kuersteiner and Prucha (
2013), it is natural to state the assumptions for our model conditional on the factors. In order to do this, we will regard all variables as defined on a probability space
. The sigma-algebra generated by the random vector
is denoted by
, where
is the Borel sigma algebra in
. In addition, in the rest of this paper,
denotes
, where the operator
is the trace of a square matrix.
The assumptions of the model are formulated conditional on
to allow for the application of a conditional version of the strong law of large numbers (e.g.,
Majerek et al. 2005;
Rao 2009; and
Cabrera et al. 2012) and the central limit theorem (e.g.,
Dedecker and Merlevede 2002;
Grzenda and Ziȩba 2008; and
Yuan et al. 2014) in the hope to make the paper accessible to practitioners. The results could be obtained under more general conditions along the lines of the work of
Kuersteiner and Prucha (
2013), which allows for sequential exogeneity. Notice, however, that the main condition for consistency of the TSLS and LIML estimators that we identify is the lack of correlation between the factor loadings in the exogenous variables and those in the errors conditional on the unobservable factors. Even if all other assumptions are weakened, this condition cannot be relaxed.
For notational simplicity, we define the following
-measurable matrices:
where
. We make the following two sets of assumptions.
Assumption 1. Let Δ be arbitrary -measurable, and a.s. and let .
- i
The random matrices for form a sequence of -independent random matrices, and have mean 0 and a.s. over i for some . Moreover, , where is -measurable.
- ii
The random matrices for form a sequence of -independent random matrices, a.s. over i for some . Moreover, and , where and are -measurable.
- iii
The random matrices for form a sequence of -independent random matrices with a.s. over i for some . Moreover, and , where and are -measurable.
- iv
The factor loadings for form a sequence of -independent random matrices with mean and a.s. over i for some , where is -measurable.
- v
The factor loadings for form a sequence of -independent random matrices with and a.s. over i for some .
- vi
The random matrices , , , and are uncorrelated conditional on . a.s.
- vii
The random matrices , , and are uncorrelated conditional on . a.s.
Reviews and discussions of the concept of conditional independence are given by
Phillips (
1988),
Majerek et al. (
2005),
Rao (
2009) and
Roussas (
2008). Notice that conditional independence does not imply unconditional independence (see
Phillips 1988 for conditions under which this is the case). Since
,
,
,
and
are heterogeneous conditional on
, they are not only unconditionally dependent but also non-identically distributed. This is an extension of
Andrews (
2005) whose assumptions imply that such quantities are exchangeable and hence identically distributed. Appendix A in the online
Supplementary Material contains a brief review of the various technical concepts used and gives further references.
Assumption 1 implies that the reduced form error
and the exogenous variable
are uncorrelated conditional on
. Notice however that our model allows
,
,
,
and
to be (unconditionally) dependent among themselves and/or over units (e.g.,
Phillips 1988). This assumption could have been formulated in term of the structural equation and the reduced form for
in an equivalent way given the one-to-one connection between structural and reduced form.
The conditional heterogeneity allowed by Assumption 1 implies that the shocks affect each individual differently through the terms and as well as through the conditional means and and the error covariance matrix .
In a standard set-up, where there are no common shocks, it is possible to establish the identification of the structural parameters in terms of the reduced form parameters since the latter are identified and can be consistently estimated. It will be shown in
Section 3 that if Assumption 1 i–vi holds,
is a consistent estimator of
as
. Hence, in this case, one can state the identification conditions for the structural parameters
and
as follows.
Assumption 2. - i
and is satisfied.
- ii
is non-singular and is positive definite uniformly.
Together with Assumption 1 i–vi, Assumption 2 provides identification restrictions, which are analogous to those for structural equations in the classical set-up (e.g., Assumptions 2.2 and 2.3 of
Hausman (
1983, p. 398), and
Schmidt (
1976, chp. 4)). Notice that Assumption 2 on its own does not identify
or
because
may itself be unidentified. This is the case, for example, if the factor loadings
and
are not uncorrelated conditional on
: the reduced form errors and (some of) the reduced form regressors are correlated even conditional on the factors.
3. Consistency and Asymptotic Distribution
We now investigate the effects of common shocks on the panel TSLS and LIML estimators of the structural parameters. Notice that the presence of the factors in the reduced form errors and some of the regressors implies a correlation between the reduced form errors and regressors, so that the OLS estimator of the reduced form parameter, , may or may not be consistent. As a consequence, the panel TSLS and LIML estimators of the structural parameters may or may not be consistent.
We show that the panel LIML and TSLS estimators are consistent estimators if the factor loadings are uncorrelated conditional on
. t. Let
and
. Partition the selection matrices
and
conformably to
and
, respectively, and define
Notice that , and .
Lemma 1. Under Assumptions 1 i–vi, as , a.s. conditional on . Moreover, conditional on whereand Lemma 1 implies that
is consistent and
converges
-stably to a random vector having a covariance matrix mixed-normal distribution (for the notion of stable convergence see Appendix A in the online
Supplementary Material and references therein).
It follows from Lemma 1 that
so that
a.s. if and only if
a.s., which requires
a.s. The inconsistency of the OLS estimator of the constant has been noted by
Andrews (
2005) for the case of i.i.d. observations conditional on
.
If the factor loadings in the reduced form explanatory variables and the reduced form errors are not independent conditional on , then the reduced form parameters cannot be estimated consistently using OLS. This is the case because conditioning on the shocks does not remove the correlation between the reduced form errors and explanatory variables.
Lemma 2. Under Assumptions 1 i–v and vii, then, as ,where , is defined in Equation (17) and Notice that if the factor loadings in the reduced form explanatory variables and the reduced form errors are correlated conditional on
, the OLS estimator of the reduced form parameters has a non-degenerate asymptotic distribution as the cross-sectional dimension tends to infinity (cf. Theorem 1 of
Andrews 2005).
All results to follow depend on Lemmas 1 and 2. In this paper, these have been derived under Assumption 1. However, Equations (
16) and (
23) may hold under weaker conditions. Precisely, writing
we see that for Lemma 1 to hold one needs:
and
-stably. These conditions can hold under weaker assumptions than the one used in this paper (cf.
Kuersteiner and Prucha 2013), but would be less accessible to practitioners.
We focus on the coefficients of the endogenous variables.
Theorem 1. Under Assumptions 1 i–vi and 2, conditional on , as N tends to infinity,
- 1.
a.s. and ,
- 2.
a.s. and , where is
When the factor loadings in the reduced form explanatory variables and the reduced form errors are uncorrelated conditional on , the panel TSLS and LIML estimators are consistent and asymptotically equivalent in the sense that, once normalized, they converge -stably to the same covariance matrix mixed normal distribution. We will see in the next section that this allows the construction of standard tests on these coefficients. However, if and are correlated conditional on , consistency does not hold.
Theorem 2. Suppose Assumption 1 i–v and vii hold. Conditional on , as ,
- 1.
The panel TSLS estimator has a non-degenerate asymptotic distribution - 2.
The panel LIML estimator has also a non-degenerate asymptotic distributionwhere is partitioned conformably to Π
as , and
Therefore, if the factor loadings in the reduced form explanatory variables and the reduced form errors are not uncorrelated conditional on
, the panel TSLS and LIML estimators have the same asymptotic non-degenerate distribution. The fact that they have degenerate distribution should be expected from the result of
Phillips (
1988) in the presence of total lack of identification. Notice, however, that, in our case, TSLS and LIML have the same asymptotic distribution because the failure of identification of the structural parameters is due to failure of identification of the reduced form parameters not to failure of the rank condition.
We now focus on the coefficients of the exogenous variables in the structural equation.
Theorem 3. Under Assumptions 1 i–vi and 2, conditional on , as N tends to infinity
- 1.
a.s. and ,
- 2.
a.s. and ,whereMoreover, if we let , then - 3.
a.s.,
- 4.
a.s.
If the factor loadings in the reduced form explanatory variables and the reduced form errors are uncorrelated conditional on , the panel TSLS and LIML estimators of the coefficients of the exogenous variables in the structural equation are consistent and have a covariance matrix mixed normal distribution. The estimators of the constant, however, have a non-degenerate distribution unless a.e.
If the factor loadings in the reduced form explanatory variables and the reduced form errors are not uncorrelated conditional on , the panel TSLS and LIML estimators for have the same non-degenerated distribution as the following result shows.
Theorem 4. Under Assumptions 1 i–v, vii and 2, conditional on , as N tends to infinity
- 1.
a.s.,
- 2.
a.s.
4. Tests of Hypothesis
In order to perform tests on the slope parameters
, we need to be able to estimate
consistently. We focus on the case where the factor loadings in the reduced form explanatory variables and the reduced form errors are uncorrelated conditional on
. In the development of the above asymptotic results, we have shown that conditional on
,
a.s.,
a.s.,
a.s.,
a.s., and
Lemma 3. Let , where . Given Assumption 1 i–vi, conditional on , a.s.
Therefore, let
be either the panel TSLS or LIML estimator. Then, conditional on
,
a.s., and
a.s.
Then, it is straightforward to obtain the following result.
Theorem 5. If Assumptions 1 i–vi and 2 hold and R is a known matrix of rank and r is a known and fixed vector, such that , then,
- 1.
,
- 2.
.
The above theorem allows us to construct t and F tests on having standard asymptotic distributions under the null hypothesis. Test on the coefficients of the exogenous variables in the structural equation can be constructed using standard procedures.
Theorem 6. Assumptions 1 i–vi and 2 hold and is a known matrix of rank and is a known and fixed vector, such that then,
- 1.
,
- 2.
.