On The Interpretation of Instrumental Variables in the Presence of Specification Errors: A Causal Comment

Burkhard Raunig

doi:10.3390/econometrics5030031

Economic Studies Division, Central Bank of Austria, Otto-Wagner-Platz 3, Vienna 1090, Austria

† The views expressed in this paper do not necessarily reflect those of the Central Bank of Austria or the Eurosystem.

Econometrics 2017, 5(3), 31; https://doi.org/10.3390/econometrics5030031


Abstract

Swamy et al. (2015) argue that valid instruments cannot exist when a structural model is misspecified. This note shows that this is not true in general. In simple examples valid instruments can exist and can help to estimate parameters of interest.

Keywords: causal inference; instrumental variable; omitted variable; structural model

JEL Classification: C18; C26

1. Introduction

Does an efficient financial system enhance growth? Does uncertainty depress private investment? Do private schools provide better education than public schools? In answering these and many other empirical questions instrumental variables play a central role.

In a recent paper Swamy et al. (2015) argue that valid instruments cannot exist in the presence of any model misspecification. Such misspecifications include wrong functional form, omission of relevant explanatory variables, and the presence of measurement error in explanatory variables. As a consequence, instrumental variables (IV) and generalized method of moments (GMM) estimation would not work.

No doubt, instruments may be hard to find or difficult to justify in empirical applications. The claim that instruments cannot exist appears to be too strong, however. This note discusses three simple examples where valid instruments can exist and where IV methods will work.

This note proceeds as follows. The next section reproduces the derivations in Swamy et al. (2015) for a simple linear model. This example helps to explain the logic of their nonexistence result for instrumental variables.

Section 3 briefly discusses the nature of structural models. The aim is to clarify some misconceptions that bedevil the interpretation of structural models.

Section 4 reexamines the basic arguments of Swamy et al. (2015) in the context of three extremely simple structural models. All three regressions for estimating the structural parameter of interest are misspecified in these cases. In the first two cases one of the two explanatory variables has been omitted. In the third case the single explanatory variable contains measurement error.

The following will be shown. In the first case the existence of a valid instrument depends on the purpose of the analysis. In the second case instruments can exist and will produce consistent estimates of the structural parameter of interest. In the third example instruments can solve the measurement error problem.

2. Non-Existence of Instruments

The central argument in Swamy et al. (2015) is that the explanatory variables that must be instrumented in an empirical model to obtain consistent estimates are at the same time also part of the error term of the model. All potential instruments are therefore necessarily correlated with the error term. The requirement that instruments must be uncorrelated with the error term is always violated. Hence, valid instruments cannot exist.

Swamy et al. (2015) start with a very general theoretical relationship

y_{t}^{*} = f_{t} (x_{1 t}^{*}, ..., x_{L (t) t}^{*})

(1)

with unknown functional form, where a possibly time-dependent number L (t) of variables x_{i t}^{*} determine y_{t}^{*}. This theoretical relationship is assumed to be exact. Thus there is no need for an error term.

The derivations in Swamy et al. (2015) are now repeated for a simple linear model where y_{t}^{*} is exactly determined by two variables x_{1 t}^{*} and x_{2 t}^{*}. This model can be written as

y_{t}^{*} = α_{0} + α_{1} x_{1 t}^{*} + α_{2} x_{2 t}^{*} .

(2)

Equation (2) is the true model.

Let us assume that we can only observe the measurements y_{t} = y_{t}^{*} + ν_{0 t} and x_{1 t} = x_{1 t}^{*} + ν_{1 t}, where ν_{0 t} and ν_{1 t} are measurement errors. Thus, the model that we can estimate has the correct functional form, but suffers from measurement error and an omitted variable.

Swamy et al. (2015) argue that the relationship between the unobserved and the observed determinant can always be written as

x_{2 t}^{*} = λ_{0 t} + λ_{1 t} x_{1 t}^{*}

(3)

where λ_{0 t} is the portion of x_{2 t}^{*} that remains after the effect of x_{1 t}^{*} is removed. Substituting (3) into (2) yields

y_{t}^{*} = α_{0} + α_{2} λ_{0 t} + (α_{1} + α_{2} λ_{1 t}) x_{1 t}^{*} .

(4)

Accounting for the measurement errors in (4) gives

y_{t} = α_{0} + α_{2} λ_{0 t} + ν_{0 t} + (α_{1} + α_{2} λ_{1 t}) (1 - \frac{ν_{1 t}}{x_{1 t}}) x_{1 t}

(5)

This equation includes the bias arising from omitting x_{2 t}^{*}, the bias from measurement error in x_{1 t}, and the measurement error in y_{t}. It is a simple version of the fully general expression (8) in Swamy et al. (2015). This model can be written as

y_{t} = γ_{0 t} + γ_{1 t} x_{1 t}

where the parameters γ_{0 t} = α_{0} + α_{2} λ_{0 t} + ν_{0 t} and γ_{1 t} = (α_{1} + α_{2} λ_{1 t}) (1 - \frac{ν_{1 t}}{x_{1 t}}) are time-varying.

Let us now consider two of the cases discussed in Swamy et al. (2015). In the first case the model is linear. Adding and subtracting the constant parameter model β_{0} + β_{1} x_{1 t} to (5) yields

y_{t} = β_{0} + β_{1} x_{1 t} + (α_{0} + α_{2} λ_{0 t} + ν_{0 t} - β_{0}) + ((α_{1} + α_{2} λ_{1 t}) (1 - \frac{ν_{1 t}}{x_{1 t}}) - β_{1}) x_{1 t}

(6)

where the last two terms become the error term in the model to be estimated. A valid instrument must be correlated with x_{1 t} and uncorrelated with the error term. But x_{1 t} is also in the error term of (6). Thus, every instrument for x_{1 t} must be correlated with the error. No valid instrument exists in model (6).
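The rewriting in (3) to (5) is an accounting identity rather than a behavioral assumption, which a small numerical check can make concrete. The following is a minimal sketch, not part of the original derivation: the function name, the parameter values, the distributions, and the choice of a fixed λ_{1 t} are all illustrative assumptions. Given any fixed λ_{1 t}, the term λ_{0 t} is simply defined as the remainder in (3), and the time-varying coefficients then reproduce y_{t} exactly.

```python
import random


def tvc_identity_gap(n=1000, alpha0=0.5, alpha1=1.0, alpha2=2.0, lam1=0.3, seed=0):
    """Return the largest absolute gap between y_t and gamma_0t + gamma_1t * x_1t.

    All parameter values and distributions are illustrative assumptions.
    """
    rng = random.Random(seed)
    max_gap = 0.0
    for _ in range(n):
        # True determinants; x1_star is kept away from zero so nu1 / x1 is defined.
        x1_star = rng.uniform(1.0, 2.0)
        x2_star = rng.gauss(0.0, 1.0)
        # Measurement errors in y and x1.
        nu0 = rng.gauss(0.0, 0.1)
        nu1 = rng.uniform(-0.1, 0.1)

        y_star = alpha0 + alpha1 * x1_star + alpha2 * x2_star  # Equation (2)
        y = y_star + nu0
        x1 = x1_star + nu1

        # Equation (3) solved for lambda_0t, given the assumed fixed lambda_1t.
        lam0 = x2_star - lam1 * x1_star

        # Time-varying coefficients as defined below Equation (5).
        gamma0 = alpha0 + alpha2 * lam0 + nu0
        gamma1 = (alpha1 + alpha2 * lam1) * (1.0 - nu1 / x1)

        max_gap = max(max_gap, abs(y - (gamma0 + gamma1 * x1)))
    return max_gap


if __name__ == "__main__":
    print("largest gap:", tvc_identity_gap())
```

Because the identity holds for any choice of λ_{1 t}, the check says nothing about whether x_{1 t}^{*} actually causes x_{2 t}^{*}; that question is taken up in Section 4.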

The other case is a simple example of a model with a measurement error in the explanatory variable. This model is

y_{t}^{*} = β x_{t}^{*}

(7)

where the measured value of the explanatory variable is x_{t} = x_{t}^{*} + ν_{t}. The model that can be estimated is

y_{t}^{*} = β x_{t} - β ν_{t}

(8)

where - β ν_{t} is the error term. Written as a time-varying coefficient model, (8) becomes

y_{t}^{*} = β_{t} x_{t}

(9)

where β_{t} = β (1 - \frac{ν_{t}}{x_{t}}). Adding and subtracting a fixed coefficient model where β^{* *} \neq β yields

y_{t}^{*} = β^{* *} x_{t} + (β_{t} - β^{* *}) x_{t}

(10)

and x_{t} is again also in the error term of Equation (10). No instrument for x_{t} would work.

3. Structural Models

Before we reexamine the arguments in Section 2 we need to clarify what structural models are and what kind of assumptions they encode. This note cannot give a full exposition, of course. Pearl (2009a) provides a comprehensive treatment of structural models and causal inference in general. In particular, Chapter 5 in Pearl (2009a) discusses the interpretation of structural parameters and the error term in structural models in great detail. This section draws heavily on Pearl (2009b) which gives an excellent overview of the foundations of causal modeling with structural equations.

Let us consider the simple linear structural equation

y = β x + ν_{0} .

(11)

where y depends on x and ν_{0}, a variable that stands for all other factors that affect y when x is held constant. Particular values of x and ν_{0} assign a particular value y = β x + ν_{0}.

Equation (11) encodes the causal assumption that changing or manipulating x causes y to vary. The strength of this effect is β. Note that the interpretation of β does not depend on ν_{0}. The equation says that the effect of a unit change in x on y is β, regardless of the values taken by the other variables in the model. Whether or not x is correlated with ν_{0} plays no role. Equation (11) describes a causal mechanism, not statistical associations. The correlation between x and ν_{0} becomes important, however, when one attempts to estimate the causal parameter β from observational data.

Furthermore, the relation between x and y is asymmetric. The equality sign is therefore somewhat misleading. Rewriting (11) as

x = (y - ν_{0}) / β

(12)

would lead to the misinterpretation that y causes x. For example, when y is a symptom of a disease x, then (12) would imply that the symptom causes the disease. This of course makes no sense.

The graph in Figure 1 makes the causal relationship between x and y explicit. The arrow that points from x to y shows the direction of causality between these variables. The solid nodes indicate that x and y are observable variables. The hollow node indicates that the variable ν_{0} is unobserved. The absence of a link between x and ν_{0} in this graph indicates that these two variables are assumed to be independent.

Figure 1. Graph of the structural relationship (11).

Another important assumption in Equation (11) is the invariance of the target parameter β. This is an identifying assumption. The assumption implies that the causal link between x and y is stable. One could of course assume that β changes over time in some way. But this would imply a very different structural model.

Until now nothing was said about estimating Equation (11). The question is whether the causal effect β can be estimated from observational data. When x is uncorrelated with ν_{0} then β is identified and consistently estimable from data on y and x. When x and ν_{0} are dependent, however, then we need additional information to consistently estimate β. This information may come from an instrumental variable.
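The role of the correlation between x and ν_{0} in estimation can be illustrated with a small simulation. This is a minimal sketch under assumed values (β = 2, standard normal shocks, and a hypothetical common cause u that makes x and ν_{0} dependent); none of these choices come from the original text. The structural equation (11) is the same in both scenarios, only the dependence between x and ν_{0} changes.

```python
import random


def cov(a, b):
    """Sample covariance of two equally long sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    return sum((ai - mean_a) * (bi - mean_b) for ai, bi in zip(a, b)) / (n - 1)


def ols_slope(y, x):
    """Slope from a simple regression of y on x."""
    return cov(y, x) / cov(x, x)


def simulate_ols(beta=2.0, n=20000, dependent=False, seed=1):
    """Estimate beta in y = beta * x + nu0 by OLS.

    If dependent is True, a hypothetical common cause u enters both x and nu0,
    so x and nu0 are correlated; otherwise they are independent.
    """
    rng = random.Random(seed)
    xs, ys = [], []
    for _ in range(n):
        u = rng.gauss(0.0, 1.0) if dependent else 0.0
        x = u + rng.gauss(0.0, 1.0)
        nu0 = u + rng.gauss(0.0, 1.0)
        xs.append(x)
        ys.append(beta * x + nu0)  # structural Equation (11)
    return ols_slope(ys, xs)


if __name__ == "__main__":
    print("OLS slope, x independent of nu0:", simulate_ols(dependent=False))
    print("OLS slope, x correlated with nu0:", simulate_ols(dependent=True))
```

Under these assumptions the OLS slope converges to β + Cov(x, ν_{0}) / Var(x), which equals β only in the independent scenario. The causal parameter itself is 2 in both.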

4. Non-Existence Revisited: Instruments Do Exist

Let us now turn again to our simple model given by Equation (2) and let us assume that we can only observe y^{*} and x_{1}^{*}. We cannot observe x_{2}^{*}. For simplicity the intercept α_{0} is set to zero. The time subscript t is superfluous and therefore dropped.

Figure 2 shows two possible structural models. In model (a) x_{1}^{*} affects y^{*} directly and indirectly via its effect on x_{2}^{*}. In model (b) the variable x_{2}^{*} is a confounding variable that jointly affects y^{*} and x_{1}^{*}. In both graphs z is an instrumental variable.

Figure 2. Two possible structural relationships.

The regression model that is actually estimated is

y^{*} = β x_{1}^{*} + ϵ

(13)

where the error term ϵ = α_{2} x_{2}^{*}. Thus, the model is misspecified because x_{2}^{*} is omitted. In both cases x_{1}^{*} is correlated with the error. Let us now compute the ordinary least squares (OLS) and IV estimates for both cases.

Case (a): Easy computations show that OLS yields

β_{(a)}^{OLS} = \frac{Cov (y^{*}, x_{1}^{*})}{Var (x_{1}^{*})} = \frac{(α_{1} + α_{2} λ_{12}) Var (x_{1}^{*})}{Var (x_{1}^{*})} = α_{1} + α_{2} λ_{12}

(14)

This is the total (i.e., direct + indirect) effect of x_{1}^{*} on y^{*}. The IV estimate for β is

β_{(a)}^{IV} = \frac{Cov (y^{*}, z)}{Cov (x_{1}^{*}, z)} = \frac{(α_{1} + α_{2} λ_{12}) δ Var (z)}{δ Var (z)} = α_{1} + α_{2} λ_{12}

(15)

and therefore the same as the OLS estimate. When the goal is to estimate only the direct effect of x_{1}^{*} on y^{*}, no instrument for x_{1}^{*} works, as argued in Swamy et al. (2015). If one is interested in the total effect, no instrument is needed anyway.
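Case (a) can be checked numerically. The sketch below assumes the linear mechanisms suggested by the symbols in (14) and (15): x_{1}^{*} = δ z + e with an extra independent shock e, and x_{2}^{*} = λ_{12} x_{1}^{*}. The shock e, the parameter values, and the function names are illustrative assumptions, not taken from the original text.

```python
import random


def cov(a, b):
    """Sample covariance of two equally long sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    return sum((ai - mean_a) * (bi - mean_b) for ai, bi in zip(a, b)) / (n - 1)


def simulate_case_a(alpha1=1.0, alpha2=2.0, lam12=0.5, delta=1.0, n=5000, seed=2):
    """Return (OLS, IV) slope estimates for the mediator model in Figure 2a."""
    rng = random.Random(seed)
    z, x1, y = [], [], []
    for _ in range(n):
        zi = rng.gauss(0.0, 1.0)
        x1i = delta * zi + rng.gauss(0.0, 1.0)  # z causes x1 (assumed linear)
        x2i = lam12 * x1i                       # x1 causes the omitted x2
        yi = alpha1 * x1i + alpha2 * x2i        # Equation (2) with alpha0 = 0
        z.append(zi)
        x1.append(x1i)
        y.append(yi)
    ols = cov(y, x1) / cov(x1, x1)  # Equation (14)
    iv = cov(y, z) / cov(x1, z)     # Equation (15)
    return ols, iv


if __name__ == "__main__":
    ols, iv = simulate_case_a()
    print("OLS:", ols, "IV:", iv, "total effect:", 1.0 + 2.0 * 0.5)
```

With these assumed values both estimators return the total effect α_{1} + α_{2} λ_{12}, because the omitted variable is an exact function of the included one. Neither isolates the direct effect α_{1}.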

Case (b): Now x_{2}^{*} is a confounding variable. As already mentioned, this model is misspecified and Swamy et al. (2015) would conclude that a valid instrument cannot exist. But in fact a valid instrument can exist. OLS won't work but IV estimation will.

OLS yields

β_{(b)}^{OLS} = \frac{α_{1} λ_{21}^{2} Var (x_{2}^{*}) + α_{2} λ_{21} Var (x_{2}^{*})}{λ_{21}^{2} Var (x_{2}^{*})} = α_{1} + α_{2} (1 / λ_{21})

(16)

where the second term is the well-known omitted variable bias. The term (1 / λ_{21}) is the coefficient from a regression of x_{2}^{*} on x_{1}^{*}. Note that this auxiliary regression has no causal content. The regression just measures the statistical association between x_{2}^{*} and x_{1}^{*}. Some simple algebra shows that IV estimation now yields

β_{(b)}^{IV} = \frac{α_{1} δ Var (z)}{δ Var (z)} = α_{1}

(17)

which is the structural parameter that we wanted to estimate.
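A simulation of case (b) illustrates the contrast between OLS and IV. The sketch assumes x_{1}^{*} = λ_{21} x_{2}^{*} + δ z with x_{2}^{*} and z independent standard normal variables; these distributions, the parameter values, and the function names are illustrative assumptions. The exact size of the OLS bias depends on the assumed variances, so the sketch only reports the sample estimates next to the structural parameter α_{1}.

```python
import random


def cov(a, b):
    """Sample covariance of two equally long sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    return sum((ai - mean_a) * (bi - mean_b) for ai, bi in zip(a, b)) / (n - 1)


def simulate_case_b(alpha1=1.0, alpha2=2.0, lam21=0.5, delta=1.0, n=20000, seed=3):
    """Return (OLS, IV) slope estimates for the confounder model in Figure 2b."""
    rng = random.Random(seed)
    z, x1, y = [], [], []
    for _ in range(n):
        x2i = rng.gauss(0.0, 1.0)         # unobserved confounder
        zi = rng.gauss(0.0, 1.0)          # instrument, independent of x2
        x1i = lam21 * x2i + delta * zi    # x2 and z both cause x1 (assumed linear)
        yi = alpha1 * x1i + alpha2 * x2i  # Equation (2) with alpha0 = 0
        z.append(zi)
        x1.append(x1i)
        y.append(yi)
    ols = cov(y, x1) / cov(x1, x1)  # regression (13) with x2 omitted
    iv = cov(y, z) / cov(x1, z)     # Equation (17)
    return ols, iv


if __name__ == "__main__":
    ols, iv = simulate_case_b()
    print("OLS:", ols, "IV:", iv, "alpha1:", 1.0)
```

Under these assumptions the IV estimate settles near α_{1} while the OLS estimate stays well above it, since z shifts x_{1}^{*} without moving the omitted confounder.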

Why can an instrument work in case (b) but not in case (a)? In case (a) the error term ϵ = α_{2} x_{2}^{*} = α_{2} λ_{12} x_{1}^{*} is caused by the included variable. The error term is thus indeed a function of x_{1}^{*}. No instrument for x_{1}^{*} will therefore work if one wants to estimate the direct effect of x_{1}^{*} on y^{*}.

Case (b) is quite different. In Figure 2b the error is not a function of the included explanatory variable. Equation (3) in Section 2 is thus not consistent with the underlying structural model. Varying x_{1}^{*} does not affect x_{2}^{*}. Instead the omitted variable x_{2}^{*} in the error term is a cause of the explanatory variable x_{1}^{*}. The instrument z is another cause of x_{1}^{*} that is independent of x_{2}^{*}. Thus the error and the instrument are uncorrelated. Moreover, the instrument affects the dependent variable y^{*} only via x_{1}^{*}. IV estimation works with a proper instrument.

Let us now turn to the second example in Section 2 where y_{t}^{*} = β x_{t}^{*} and the explanatory variable x_{t} = x_{t}^{*} + ν_{t} is measured with error. If y_{t}^{*} = β x_{t}^{*} is the true structural model then the causal effect of x_{t}^{*} on y_{t}^{*} is stable. The constant parameter β reflects this. Hence, we cannot simply transform this model into a time-varying coefficient model. Such a model states that the causal effect is unstable and changes over time. The transformed model is therefore inconsistent with the true structural model that we want to estimate. The transformed model has very different implications for the causal link between x_{t}^{*} and y_{t}^{*}. It is a different structural model.

To be consistent with the original structural model one should estimate the model y_{t}^{*} = β x_{t} - β ν_{t}. Here the error is not a function of x_{t} and instruments can in principle be found. For example, any cause of x_{t}^{*} that is unrelated to the measurement error ν_{t} and does not directly cause y_{t}^{*} would be a valid instrument for x_{t}.
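The measurement error case can be sketched in the same way. The simulation below introduces a hypothetical variable w that causes x_{t}^{*}, is independent of ν_{t}, and has no direct effect on y_{t}^{*}. The name w, the linear first stage with coefficient pi_w, and all parameter values are illustrative assumptions rather than part of the original model.

```python
import random


def cov(a, b):
    """Sample covariance of two equally long sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    return sum((ai - mean_a) * (bi - mean_b) for ai, bi in zip(a, b)) / (n - 1)


def simulate_measurement_error(beta=2.0, pi_w=1.0, n=20000, seed=4):
    """Return (OLS, IV) slope estimates when x is observed with error."""
    rng = random.Random(seed)
    w, x, y = [], [], []
    for _ in range(n):
        wi = rng.gauss(0.0, 1.0)                  # hypothetical instrument
        x_star = pi_w * wi + rng.gauss(0.0, 1.0)  # w is one cause of the true x
        nu = rng.gauss(0.0, 1.0)                  # measurement error, independent of w
        w.append(wi)
        x.append(x_star + nu)                     # observed regressor
        y.append(beta * x_star)                   # Equation (7)
    ols = cov(y, x) / cov(x, x)  # attenuated towards zero by the measurement error
    iv = cov(y, w) / cov(x, w)   # uses only the variation in x induced by w
    return ols, iv


if __name__ == "__main__":
    ols, iv = simulate_measurement_error()
    print("OLS:", ols, "IV:", iv, "beta:", 2.0)
```

In this setup the regression error - β ν_{t} is uncorrelated with w by construction, so the IV ratio recovers the constant β, whereas OLS converges to β Var(x_{t}^{*}) / (Var(x_{t}^{*}) + Var(ν_{t})).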

5. Conclusions

The examples presented in this note demonstrate that instruments can exist. The arguments in Swamy et al. (2015) hold when the structural error is indeed a function of the included explanatory variables. But this is rarely the case in a structural model. For instance, omitted confounding variables are not functions of included variables. The three variable model (b) in Figure 2 provides a simple example. The model is misspecified, but a valid instrument exists since the omitted confounding variable is not a function of the included variable.

Furthermore, Swamy et al. (2015) assume that any model can be expressed as a time varying coefficient (TVC) model. The true model would then be a special case of this general model.

Of course, a constant parameter model is a special case of a TVC model. A true structural model with constant parameters cannot be turned into an equivalent TVC model, however. A structural model with constant parameters cannot at the same time be expressed as a model where the parameters vary over time. The latter model has very different causal implications.

Conflicts of Interest

The author declares no conflict of interest.

References

Swamy, Paravastu A.V.B., George S. Tavlas, and Stephen G. Hall. 2015. On the Interpretation of Instrumental Variables in the Presence of Specification Errors. Econometrics 3: 55–64.
Pearl, Judea. 2009a. Causality: Models, Reasoning and Inference, 2nd ed. Cambridge: Cambridge University Press.
Pearl, Judea. 2009b. Causal inference in statistics: An overview. Statistics Surveys 3: 96–146.

© 2017 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
