Next Article in Journal
Quantitative Recurrence Properties in Some Irregular Sets for Beta Dynamical Systems
Previous Article in Journal
Towards Efficient HPC: Exploring Overlap Strategies Using MPI Non-Blocking Communication
Previous Article in Special Issue
Time Series Analysis of the Dynamics of Merger and Acquisition Cycles in the Global Water Sector
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification and Estimation in Linear Models with Endogeneity Through Time-Varying Volatility

Department of Economics, California State Polytechnic University, 3801 W. Temple Ave., Pomona, CA 91768, USA
Mathematics 2025, 13(11), 1849; https://doi.org/10.3390/math13111849
Submission received: 1 May 2025 / Revised: 27 May 2025 / Accepted: 31 May 2025 / Published: 2 June 2025

Abstract

This paper proposes a novel control function approach to identify and estimate linear models with endogenous variables in the absence of valid instrumental variables. The identification strategy exploits time-varying volatility to address the multicollinearity problem that arises in conventional control function methods when instruments are weak. We establish the identification conditions and show that the proposed method is T -consistent and asymptotically normal. We apply the proposed approach to estimate the elasticity of intertemporal substitution, a key parameter in macroeconomics. Using quarterly data on aggregate stock returns across eleven countries, we find that the data exhibit substantial time variation in volatility, supporting the identifying assumptions. The proposed method yields confidence intervals that are broadly consistent with the general findings in the literature and are substantially narrower than those obtained using weak-instrument robust methods.

1. Introduction

In the linear regression framework, the instrumental variable (IV) is one of the most widely applied methods to tackle endogeneity in empirical research. A valid IV must satisfy three conditions: (1) it must be excluded from the structural equation; (2) it must be uncorrelated with the structural error term; (3) it must be strongly correlated with the endogenous regressors. If any of these conditions fail, the conventional IV estimator will produce biased and inconsistent results. In many empirical applications, it is often debatable whether the exclusion and the orthogonality conditions hold. Furthermore, even when these two conditions are satisfied, a vast literature has studied the consequences of weak instruments, as instruments are often only weakly correlated with the endogenous variables.
It is well known that in linear models, the IV method is equivalent to ordinary least squares (OLS) regression that includes the reduced form residuals as additional regressors to control for endogeneity, a method known as the control function (CF) approach (see, e.g., [1,2]). While the two approaches obtain numerically identical estimators under strong instruments, the CF method offers a distinct conceptual advantage by allowing researchers to model the source of endogeneity directly in the structural equation. This flexibility makes the CF approach particularly appealing in settings with conditional heteroskedasticity, where variation in the second moments can be exploited to aid identification.
However, when instruments are weak, the residuals from the reduced form equation become nearly collinear with the endogenous regressors, leading to identification failure in the standard CF framework. In this paper, we propose a feasible CF approach that exploits time-varying volatility to address the identification issue in the absence of strong instruments. The idea is that heteroskedasticity generates variation in the scale of the reduced-form errors, which weakens their asymptotic collinearity with the endogenous variables. Earlier studies have applied similar logic in various scenarios. Ref. [3] formulates a model and shows that the parameters can be identified when there are multiple known regimes of heteroskedasticity under certain conditions. Ref. [4] explores the Markov-switching heteroskedascitiy to identify the correlation between trend inflation and inflation gap in an unobserved components model. In structural vector autoregressive framework, refs. [5,6] discuss how Markov-switching heteroskedasticity and stochastic volatility can aid identification.
The proposed method is developed under a general framework that allows for nonparametric time variation in volatility and does not require knowledge of the heteroskedasticity structure. We provide conditions under which the parameters of the structural model are identified and show that the proposed estimator is T -consistent and asymptotically normal. Unlike existing methods that rely on known heteroskedastic regimes [3] or parametric specifications for the volatility process [4,5,6], our approach allows for a nonparametric volatility process subject only to mild smoothness conditions. We derive the asymptotic distribution of the estimator and discuss inference procedures that account for generated regressors and time-varying second moments. In contrast to the standard generalized method of moments (GMM), our framework allows for consistent estimation and testing under weak instruments. A test for endogeneity is also developed, which is applicable even without valid instruments.
We apply the proposed method to estimate the elasticity of intertemporal substitution (EIS), a central parameter in macroeconomics and asset pricing. The EIS governs the trade-off between current and future consumption and plays a critical role in determining asset prices. The empirical literature has found mixed results for the value of EIS. While [7] finds small values for the EIS, high values for EIS are required by models such as the long-run risk model of [8], which has been successful in explaining the equity premium and other asset pricing puzzles. As pointed out by [9,10], weak instruments may explain the divergence in the empirical literature. Motivated by the well-documented time-varying volatility in financial returns (e.g., [11]), we apply our proposed method to estimate the EIS using quarterly aggregate stock return data from eleven developed countries. We find that the confidence intervals obtained under our approach are much tighter than those derived from weak-instrument robust methods and broadly consistent with the literature.
This paper contributes to the literature on identification through heteroskedasticity by showing that smooth nonparametric volatility dynamics can serve as a source of identification when conventional instruments fail. It also contributes to the empirical macro-finance literature by providing new estimates of the EIS based on a control function approach that does not rely on instrument strength. The rest of the paper is organized as follows. Section 2 presents the model specification and the identification conditions. Section 3 discusses the estimation and inference under the proposed framework. Section 4 applies the method to the estimation of EIS. Section 5 concludes.

2. Model Specification and Identification

We consider the following structural model with endogenous regressors,
y t = x t β 0 + u t , E ( x t u t ) 0 ,
x t = ( I p z t 1 ) δ 0 + v t ,
where y t is the scalar dependent variable, observed over t = 1 , , T . x t R p are the endogenous regressors, and z t 1 R q are instruments observed at lag t 1 . This specification allows for other strictly exogenous variables to be partialled out, following the approach in [12]. Our primary goal is to identify β 0 B , with B compact.
We define the σ -field F t generated by the history of observables { y s , x s , z s } s t , and introduce the following definitions:
σ u t 2 = V a r ( u t | F t 1 ) , u t * = u t σ u t , Ω v t = V a r ( v t | F t 1 ) , v t * = Ω v t 1 2 v t
Assumption 1.
(Mixing and Moment Conditions):
1. 
The process u t * and v i t * for i = 1 , , p are strong mixing (α-mixing) martingale difference sequences with respect to F t , with unit conditional variance.
2. 
There exist α 1 > 1 , α 2 i > 1 , M 1 > 0 and M 2 i > 0 , such that sup t E u t * 4 α 1 < M 1 < and sup t E v i t * 4 α 2 i < M 2 i < , for i = 1 , , p .
Assumption 1 assumes that both u t * and v t * have finite fourth moments. By this assumption and Lyapunov’s inequality, E | u t * | 4 ξ and E | v i t * | 4 ξ exist for all ξ min ( α 1 , α 21 , , α 2 p ) , as do all expectations involving up to any four combinations of u t * and v i t * , for i = 1 , , p . Note that by construction, V a r ( u t * | F t 1 ) = V a r ( v i t * | F t 1 ) = 1 .
To correct for endogeneity, we follow the control function approach. Conditional on F t 1 , we project the structural error term u t on the reduced-form error term,
u t = E ( u t | v t ) + w t = v t E v t v t | F t 1 1 E u t v t | F t 1 + w t = v t * E u t v t | F t 1 + w t
Substituting (3) into (1), we obtain the following transformed model:
y t = x t β 0 + v t * E u t v t | F t 1 + w t
The transformed error term w t plays a central role. The following lemma characterizes its orthogonality properties:
Lemma 1.
Under Assumption 1, we can show that
( 1 ) E ( w t | F t 1 ) = 0 ( 2 ) E ( x t w t | F t 1 ) = 0 ( 3 ) E ( v t w t | F t 1 ) = 0 ( 4 ) E ( v t * w t | F t 1 ) = 0
The proof of Lemma 1 is in Appendix A. Lemma 1 shows that w t is a martingale difference sequence and orthogonal to x t , v t , and v t * . Intuitively, w t is orthogonal to v t * by the projection in (3). Since v t * is a transformation of v t , and x t is from the instruments and v t , the orthogonality naturally extends to x t and v t . This orthogonality allows (4) to be used for consistent estimation, provided the model is identified.
However, when instruments are weak, the model in (4) is not always identifiable. We now formalize the identification strategy in this case.

Identification Strategy

When the instruments in (2) are weak, δ 0 0 as T , and the regressors x t v t , causing the control function term v t * and x t to become collinear. This undermines identification in (4). We formalize this situation with the following assumption:
Assumption 2.
1. 
Orthogonality: 1 T t = 2 T z t 1 u t p E z t 1 u t = 0 q , 1 T t = 2 T z t 1 v t p E z t 1 v t = 0 p × q .
2. 
Finite second moments: sup t E z t z t < .
3. 
Local-to-zero reduced-form coefficients: δ 0 = C T ν , where ν 1 2 , C R p q .
Assumption 2(1) assumes that the instruments satisfy the conventional orthogonality condition. Assumption 2(2) states that the instruments have a finite second moment for all t. Both are standard in the literature. Assumption 2(3) captures a local-to-zero asymptotic framework, which is standard in the weak-instrument literature [13]. It indicates that the instruments are increasingly uninformative as the sample size grows. In practice, this means that the predictive power of z t 1 for x t is low, and x t approximates the reduced-form error v t . Applied researchers can assess this by checking first-stage F-statistics, small eigenvalues, or instability in 2SLS estimates. Modeling this weakness explicitly allows us to show how time-varying volatility can restore identification.
Under Assumption 2, x t v t . When volatility is constant over time, v t * = Ω v 1 / 2 v t , both regressors in (4) become asymptotically collinear. In other words, the moment conditions derived from (4) collapse into a system of only p linearly independent equations for 2 p parameters. Identification fails due to collinearity.
To overcome this, we allow volatility to vary smoothly with the following assumption:
Assumption A3.
1. 
Ω v t Ω t T = h 11 t T h 12 t T h 1 p t T h 21 t T h 22 t T h 2 p t T h p 1 t T h p 2 t T h p p t T ,
where h i j ( · ) = h j i ( · ) , h i j ( · ) are non-stochastic, measurable, uniformly bounded on the interval ( 0 , 1 ] , and satisfy Lipschitz condition, for all i , j = 1 , , p . inf t h i i t T > 0 , for all i = 1 , , p .
2. 
Ω t T is not constant and nonsingular over t.
3. 
E ( u t v i t * | F t 1 ) = γ i 0 where γ i 0 are constant for all t, i = 1 , , p .
When volatility varies smoothly over time, the standardization v t * = Ω v t 1 / 2 v t introduces time-varying heterogeneity that breaks the collinearity between x t and v t * . This restores full rank in the moment conditions implied by (4), allowing point identification of β 0 , γ 0 . The time-varying volatility structure assumed in Assumption 3 is consistent with a wide range of economic environments where conditional second moments evolve smoothly over time. In macro-finance, stochastic volatility is a well-documented feature of asset returns (e.g., [14,15]). For instance, persistent changes in macroeconomic uncertainty or shifting investor sentiment can induce gradual fluctuations in return volatility. Our assumption accommodates such dynamics in a nonparametric form, allowing identification to be driven by economically plausible second-moment variation without requiring a specific structural model of volatility.

3. Estimation and Inference

Under Assumptions 1–3, the structural model in (4) takes the form
y t = x t β 0 + v t * γ 0 + w t ,
where w t is a martingale difference sequence orthogonal to x t and v t * , as established in Lemma 1.
Let s t = x t , v t * R 2 p , the population moment conditions are
E s t w t = 0
Because the number of moment conditions equals the number of parameters, the system is just-identified. The optimal GMM estimator reduces to OLS, and no weighting matrix is required. In this case, the moment conditions do not benefit from reweighting, and the weighting matrix is effectively the identity.
If the values of δ 0 and Ω v t were known, the infeasible estimator could be written as
β ^ γ ^ = ( t = 1 T s t s t ) 1 t = 1 T s t y t
Theorem 1.
Under Assumptions 1–3, and assuming δ 0 and Ω v t are known, the estimator in (6) satisfies
T β ^ β 0 γ ^ γ 0 d N ( 0 , Γ 1 1 Λ 1 Γ 1 1 ) ,
where
Γ 1 = lim T 1 T t = 1 T E s t s t , Λ 1 = lim T 1 T t = 1 T E w t 2 s t s t
The proof of Theorem 1 is given in Appendix A. Theorem 1 shows that the infeasible estimators are T consistent and asymptotically normally distributed when the true value of δ 0 and Ω v t are known. It also demonstrates the problem of weak instruments in the absence of time-varying variance. In particular, when Ω v t = Ω ¯ , Γ 1 in Theorem 1 becomes
Γ 1 = Ω ¯ Ω ¯ 1 2 Ω ¯ 1 2 1 ,
which is a singular matrix. Thus, the asymptotic variance–covariance matrix cannot be computed.
The estimators in Theorem 1 are infeasible in practice, as the true value of δ 0 and Ω v t are unknown. We propose a feasible estimator in the following subsection.

3.1. Feasible Estimation Procedure

In practice, we construct a feasible estimator by estimating δ 0 and Ω v t . Let v ^ t = x t ( I p z t ) δ ^ be the first stage OLS residuals from estimating reduced-form Equation (2). We propose to use the generalized kernel proposed by [16] to estimate the time-varying volatility.
Let b i j b i j ( T ) 0 denote a bandwidth and let K ( · ) b i j denote a univariate generalized kernel function with the properties K b i j ( u , t ) = 0 if u > t or u < 1 t ; for all t [ 0 , 1 ] ,
h ( j + 1 ) t 1 t u j K b i j ( u , t ) d u = 1 , if j = 0 0 , if 1 j r 1
We call K b i j ( · , · ) a univariate generalized kernel function of order r.
The following example is taken from [16]. Define
M θ , r ( [ a 1 , a 2 ] ) = g L i p ( [ a 1 , a 2 ] ) , a 1 a 2 x j g ( x ) d x = 1 , if j = 0 0 , if 1 j r 1 ,
where L i p ( [ a 1 , a 2 ] ) denotes the space of Lipschitz continuous functions on [ a 1 , a 2 ] . Define K + ( · , · ) and K ( · , · ) as follows:
(i) 
The support of K + ( x , q ) is [ 1 , q ] × [ 0 , 1 ] and the support of K ( · , · ) is [ q , 1 ] × [ 0 , 1 ] .
(ii)
K + ( · , q ) M θ , r ( [ 1 , q ] ) and K ( · , q ) M θ , r ( [ q , 1 ] ) . We note that K + ( · , 1 ) = K ( · , 1 ) = K ( · ) M θ , r ( [ 1 , 1 ] ) . Now let
K h ( u , t ) = K + ( u , 1 ) , if h t 1 h K + ( u h , t h ) , if 0 t h K ( u h , 1 t h ) , if 1 h t 1
We can show that K h ( · , · ) is a generalized kernel function of order r.
Assume the bandwidth parameter b i j satisfies O ( T 1 2 ) < b i j < O ( T 1 5 ) i , j = 1 , 2 , , p . The estimator v ^ t * can be expressed as
v ^ t * = Ω ^ v t 1 2 v ^ t
where
Ω ^ v t = h ^ 11 t T h ^ 12 t T h ^ 1 p t T h ^ 21 t T h ^ 22 t T h ^ 2 p t T h ^ p 1 t T h ^ p 2 t T h ^ p p t T
h ^ i j t T = τ = 1 T w t τ i j v ^ i τ v ^ j τ , w t τ i j = τ = 1 T K b i j t τ T 1 K b i j t τ T
Following [16,17], we can have the following convergence results under the above generalized kernel estimator,
( 1 ) v ^ t v t = O p T 1 2 ( 2 ) h ^ i j r h i j * r = O p ( b i j 2 ) + O p T b i j 1 2 , uniformly in r [ 0 , 1 ] ( 3 ) v ^ t * v t * = o p T 1 4
A feasible estimator ( β ˜ , γ ˜ ) can be obtained by the following three-step approach.
Step 1: Estimate (1) by OLS, obtain a consistent estimator of v t , v ^ t .
Step 2: Apply the generalized kernel estimator to estimate the time-varying variance–covariance matrix Ω v t , and calculate v ^ t * as Ω ^ v t v ^ t . To estimate the time-varying variance–covariance matrix, we use a fourth-order Epanechnikov kernel for improved boundary performance and reduced bias, as recommended in [16]. The bandwidths b i j T were selected using leave-one-out cross-validation.
Step 3: Replace v t * in (5) with v ^ t * and estimate the new model by OLS.
Follow Lemma 5.1 in [18], the feasible approach can consistently estimate β and γ . To make correct inference on β , however, we need to account for the generated regressors problem due to the estimation of δ and Ω v t . The following Corollary 1 provides the asymptotic distribution of the feasible estimators.
Corollary 1.
The feasible estimators ( β ˜ , γ ˜ ) have the following asymptotic distribution
T β ˜ β 0 γ ˜ γ 0 d N ( 0 , Γ 1 1 Λ 1 * Γ 1 1 ) ,
where
Λ 1 * = lim T 1 T t = 2 T E w t s t G δ M 1 z t 1 v t + i = 1 p i j p D i , j , t v i t v j t h i j t T G δ M 1 z t 1 v t × w t s t G δ M 1 z t 1 v t + i = 1 p i j p D i , j , t v i t v j t h i j t T G δ M 1 z t 1 v t G δ = lim T 1 T t = 2 T E ( y t x t β 0 γ 0 v t * ) s t δ M = lim T 1 T t = 2 T E ( z t 1 z t 1 ) D i , j , t = E y t x t β 0 γ 0 v t * s t h i j t T
Following [18,19], the exact form of Λ 1 * includes adjustment terms involving the Jacobians of s t with respect to δ 0 and h i j ( r ) , i , j = 1 , , p .

3.2. Testing Endogeneity Without Instruments

Theorem 1 also provides a useful result for testing endogeneity. Given the specification in (5), γ i = ρ i σ u ; thus, when there exists no endogeneity, ρ i = γ i = 0 . We can propose an extended version of the Hausman–Wu test [20,21] without valid instrument variables.
Corollary 2.
Under the same assumptions in Theorem 1, a test for endogeneity can be established as
H 0 : γ 0 = 0 ,
where the test statistics T γ ^ ( Γ 1 1 Λ 1 Γ 1 1 ) * 1 γ ^ a χ p 2 , ( Γ 1 1 Λ 1 Γ 1 1 ) * is the p × p bottom right block of the Γ 1 1 Λ 1 Γ 1 1 in Theorem 1.
It is worth noting that the proposed test in Corollary 2 does not suffer from the generated regressors problem. This is because when ρ = γ = 0 , (1) can be directly estimated by OLS and thus no extra uncertainty is introduced.

3.3. Inference and Robustness

The feasible estimator is T -consistent and asymptotically normal. Since the model is just-identified, no GMM weighting matrix is required, and the estimator is semiparametrically efficient under the maintained assumptions. The use of generalized kernels ensures uniform convergence of the volatility estimates, even near boundaries. As discussed in [22,23,24], smooth time-varying volatility satisfies the regularity conditions necessary for applying central limit theorems in heteroskedastic environments. This justifies the use of Wald-type inference.
Unlike conventional IV-GMM estimators, which are sensitive to weak instruments, our approach remains robust due to identification through time-varying volatility. The asymptotic distribution of our estimator is derived directly from moment conditions that remain informative even when the instruments are weak. As such, Wald-type inference remains valid under our framework, as long as the volatility process satisfies smoothness and regularity conditions. This distinction is critical in applications where traditional identification strategies are compromised.

4. Elasticity of Intertemporal Substitution: Consumption CAPM

The EIS is a central parameter in macroeconomics and asset pricing. It governs the intertemporal allocation of consumption and plays a crucial role in determining equilibrium asset returns. In recursive utility models, such as Epstein–Zin [25,26], the EIS appears as a structural determinant of households’ willingness to substitute consumption over time in response to changes in the real return rate.
Empirically, the estimation of the EIS has produced mixed results. Using asset return data, studies such as [7] find small values of the EIS, while the long-run risk model in [8] requires a high EIS to match observed risk premia. This controversy has been partly attributed to the weakness of conventional instruments used to address endogeneity in return-predictability regressions (see [9,10]).
We apply the proposed control function approach to estimate the EIS using quarterly aggregate stock return data without valid instruments. This empirical setting is particularly well suited to the method for two reasons. First, the estimation of EIS involves a canonical predictive regression that is widely known to suffer from weak predictive power. Second, stock returns exhibit well-documented time-varying volatility, which is precisely the source of identification exploited by our framework. Thus, the EIS application offers both policy relevance and methodological transparency.
Following standard consumption-based asset pricing theory, we start from the log-linearized Euler equation implied by the Epstein–Zin utility specification:
E t Δ c t + 1 = τ i + ψ E t r i , t + 1 ,
where Δ c t + 1 is log consumption growth, r i , t + 1 is the log return on asset i, ψ is the EIS, and τ i is a term that depends on the conditional variances and covariances of returns and consumption growth. We assume ψ to be constant over time to compare our results with [10]. Empirically, EIS could be time-varying; we leave this for future research. In empirical work, expectations are replaced by realized values, and we get the following regression:
Δ c t + 1 = τ i + ψ r i , t + 1 + u i , t + 1
The variable r i , t + 1 represents the return on asset i. It may be correlated with the error term u i , t + 1 , making it endogenous in the regression. This correlation can arise for several reasons. First, both consumption growth and asset returns are influenced by common shocks to economic fundamentals. Second, forward-looking behavior by investors can cause returns to respond to information that also affects consumption decisions. As a result, estimating (10) using ordinary least squares would lead to biased and inconsistent estimates of the EIS.
To address this issue, researchers typically use instrumental variable methods. These instruments are constructed from lagged economic variables that are assumed to be correlated with returns but uncorrelated with the error term. Standard choices include past observations of interest rates, inflation, consumption growth, and the dividend–price ratio. However, as emphasized by [7,27], returns are difficult to predict in practice, and these instruments often have little explanatory power. This weak relationship between instruments and returns leads to unreliable estimates and very wide confidence intervals, which limit the usefulness of the results.
In line with the literature on weak instruments (e.g., [13]), we examined the first-stage relationship by estimating the reduced-form return prediction:
r i , t + 1 = z t δ + v i , t + 1 ,
where z t is a set of instruments that consists of lagged values of nominal interest rates, inflation, consumption growth, and the log dividend–price ratio, each lagged twice to mitigate time aggregation bias, following [28]. The data are quarterly and span various periods from 1947 to 1999, depending on the country. Stock returns are constructed from major stock market indices for eleven developed countries, and log consumption growth is calculated from real per capita consumption expenditures (for a full description of the data set, see [7,29]). The third and fourth columns in Table 1 report corresponding F-statistics and partial R 2 values for each country. These statistics confirm the weakness of the instruments, with F-statistics well below conventional thresholds and low partial R 2 across the board. This supports the motivation for using a volatility-based identification strategy, as standard IV inference would likely suffer from low power and misleading coverage in such settings.
We estimate (10) using the proposed control function approach. The reduced-form residuals v ^ i , t + 1 are obtained from (11), and standardized using an estimated conditional variance matrix Ω ^ v t constructed via the kernel estimator described in Section 3. The standardized residuals v ^ i , t + 1 * = Ω ^ v t 1 / 2 v ^ t are then used as control functions in the second-stage regression,
Δ c t + 1 = τ i + ψ r i , t + 1 + γ v ^ i , t + 1 * + w i , t + 1
To estimate the time-varying volatility of returns, we apply the generalized kernel estimator described in Section 3.1. We use a fourth-order Epanechnikov kernel for improved boundary performance and reduced bias, as recommended in [16]. While the generalized kernel is theoretically more appealing, we found that the empirical results are robust to alternative kernel choices, including the quartic and Gaussian kernels. The bandwidths b i j T were selected using leave-one-out cross-validation using each country-specific sample. The volatilities of returns exhibit significant time variation across countries, as shown in Figure 1 and Figure 2. This provides empirical support for the identifying assumption of time-varying volatility. Furthermore, we calculate the correlation between between r t and v t * to check whether the multicollinearity is avoided. The last column in Table 1 reports the corresponding correlations between r t and v t * for each country. The correlations are all substantially below unity, consistent with the theoretical argument that time-varying volatility restores identification.
Table 2 reports confidence intervals for the EIS estimated using our control function method and compares them to intervals derived from three commonly used weak-instrument robust tests: the Anderson–Rubin [30], Kleibergen-LM [31], and Conditional Likelihood Ratio [32,33], as reported in [10]. In most cases, the confidence intervals obtained by three weak-instruments robust tests are extremely wide or unbounded, reflecting the limited power of conventional weak-IV methods. In contrast, our approach obtains narrower, informative intervals. The estimated EIS values are uniformly modest across countries, roughly in the −0.05 to 0.3 range, indicating a limited willingness of households to shift consumption over time. In economic terms, an EIS of this magnitude implies a strong preference for consumption smoothing. These findings are aligned with prior empirical findings that the EIS is typically low (e.g., [7,28]), standing in contrast to the much larger values (EIS well above 1) required by certain asset-pricing frameworks like long-run risk models [8]. Moreover, our approach delivers considerably tighter confidence intervals than standard weak-IV methods, reflecting a substantial gain in identification power. The improved precision suggests that by exploiting time-varying volatility, we can estimate the EIS with much greater confidence. These results demonstrate the empirical relevance of the theoretical framework developed in earlier sections.

5. Concluding Remarks

This paper develops a feasible control function approach for identifying and estimating linear models with endogenous regressors without valid instruments. The method exploits time-varying volatility in the reduced-form errors as a source of identification, offering an alternative when conventional instrumental variable techniques are infeasible or unreliable.
The main contribution lies in demonstrating that conditional heteroskedasticity can restore identification even when valid instrumental variables are unavailable. By projecting the structural error term onto the reduced-form error terms and standardizing by the time-varying variance, the proposed estimator isolates exogenous variation that would otherwise be obscured by collinearity. Unlike approaches that rely on strong instruments, known heteroskedastic regimes [3], or parametric volatility models [4,5,6], our framework allows for smooth, nonparametric variation in second moments and does not require external exclusion restrictions.
We formally establish that the estimator is T -consistent and asymptotically normal. Importantly, we show that the estimator remains valid under weak identification and heteroskedasticity, provided the volatility process satisfies mild regularity conditions. We also derive a test for endogeneity that does not require valid instruments, extending the logic of the Hausman–Wu test to settings where standard IV methods fail.
In an empirical application, we estimate the EIS using quarterly stock return and consumption data from eleven developed countries. The data exhibit substantial time variation in return volatility, consistent with the identifying assumptions. The proposed method produces confidence intervals that are narrower and more stable than those derived from conventional weak-instrument robust procedures, while remaining consistent with existing findings in the literature.
The results demonstrate both the empirical relevance and practical applicability of using heteroskedasticity as a source of identification. The approach provides a flexible alternative to instrument-based methods in settings where external instruments are weak, contested, or unavailable. More broadly, this framework suggests that structured time variation in second moments can offer credible identification strategies in a range of economic applications.

Funding

This research received no external funding.

Data Availability Statement

Data are available in the public domain: https://drive.google.com/file/d/0BzR-ojpYuaFMaUJxd0RNSk84a00/view?resourcekey=0-XRMv1V1xJjM5JrOj-TB51Q (accessed on 20 May 2025).

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A

Proof of Lemma 1.
By construction, w t u t v t * γ t = u t v t * ρ σ u t .
For (1), it follows immediately from Assumption 1.
For (2),
E ( x t w t | F t 1 ) = E ( I p z t 1 ) δ 0 + v t u t v t * ρ σ u t | F t 1 = E v t u t v t * ρ σ u t | F t 1 = E v t u t | F t 1 E v t v t * | F t 1 ρ σ u t = Ω v t 1 2 Ω v t 1 2 E v t * v t * | F t 1 ρ σ u t = 0
where the second equation follows by Assumption 1, and the last follows E v t * v t * | F t 1 = I p by construction.
Proofs for (3) and (4) follow the same logic as (2) and thus are omitted. □
Proof of Theorem 1.
The proof of Theorem 1 proceeds via a sequence of technical lemmas, presented below.
Lemma A1.
1 T t = 1 T s t s t p Ω Ω 1 2 Ω 1 2 1
Proof. 
To prove convergence of the sample second moment matrix, it suffices to establish the following:
(a) 1 T t = 1 T x t x t p Ω .
(b) 1 T t = 1 T x t v t * p Ω 1 2 .
For (a),
1 T t = 1 T x t x t = 1 T t = 1 T ( I p z t 1 ) δ δ ( I p z t 1 ) + 1 T t = 1 T ( I p z t 1 ) δ ( Ω 1 2 v t * ) + 1 T t = 1 T ( Ω 1 2 v t * ) δ ( I p z t 1 ) + 1 T t = 1 T ( Ω 1 2 v t * ) ( Ω 1 2 v t * )
For the first term, IVs (2) and (3) ensure the following convergence result,
1 T t = 1 T ( I p z t 1 ) δ δ ( I p z t 1 ) = 1 T 2 t = 1 T ( I p z t 1 ) C C ( I p z t 1 ) p 0
For the second and third terms, we apply the orthogonality conditions implied by Assumption 2(1),
1 T t = 1 T ( I p z t 1 ) δ ( Ω 1 2 v t * ) p 0
1 T t = 1 T ( Ω 1 2 v t * ) δ ( I p z t 1 ) p 0
For the fourth term, by the law of large numbers for martingale difference sequence and TV,
1 T t = 1 T ( Ω 1 2 v t * ) ( Ω 1 2 v t * ) p lim T 1 T t = 1 T E ( Ω 1 2 v t * ) ( Ω 1 2 v t * ) = Ω
Thus, the desired result can be obtained by combining (A1) to (A4).
For (b),
1 T t = 1 T x t v t * = 1 T t = 1 T I p z t 1 v t * + 1 T t = 1 T Ω 1 2 v t * v t *
The first term converges to 0 following the same argument in (a). For the second term,
1 T t = 1 T Ω 1 2 v t * v t * p lim T 1 T t = 1 T E Ω 1 2 v t * v t * = Ω 1 2
Lemma A2.
(a) 
1 T t = 1 T v t w t 2 v t E v t w t 2 v t p 0
(b) 
1 T t = 1 T v t w t 2 v t * E v t w t 2 v t * p 0
(c) 
1 T t = 1 T v t * w t 2 v t * E v t * w t 2 v t * p 0
Proof. 
For (a),
v t w t 2 v t = v 1 t 2 w t 2 v 1 t v 2 t w t 2 v 1 t v p t w t 2 v 2 t v 1 t w t 2 v 2 t 2 w t 2 v 2 t v p t w t 2 v p t v 1 t w t 2 v p t v 2 t w t 2 v p t 2 w t 2
Since v i t and w t are α -mixing by Theorem 14.1 in [34], thus v i t v j t and w t 2 are also α -mixing and near-epoch dependent in L 2 norm on the α -mixing sequence v t * , u t * . So by Theorem 17.9 in [34], v i t v j t w t 2 is near-epoch dependent in L 1 norm on v t * , u t * . Let α = min α 1 , α 21 , , α 2 p , it follows that
E v i t v j t w t 2 α E v i t 4 α E v j t 4 α E w t 4 α <
by Assumption 1 and the Cauchy–Schwarz inequality. Thus, by the law of large numbers for L 1 -mixingales [35], we have the desired results. Proofs for (b) and (c) follow the same logic and thus are omitted. □
Lemma A3.
1 T t = 1 T s t w t d N ( 0 , Ω 2 )
Proof. 
Following the Cramer–Wold theorem, it suffices to show that for any fixed vector λ R 2 p ,
1 T t = 1 T λ s t w t d N ( 0 , λ Ω 2 λ )
for every fixed 2 p × 1 vector λ = ( λ 1 , λ 2 ) , where λ 1 = ( λ 11 , λ 12 , , λ 1 p ) , λ 2 = ( λ 21 , λ 22 , , λ 2 p ) .
First,
1 T t = 1 T λ s t w t = 1 T t = 1 T λ 1 x t w t + λ 2 v t * w t = 1 T t = 1 T λ 1 ( I p z t 1 ) δ w t + λ 1 v t w t + λ 2 v t * w t = 1 T t = 1 T λ 1 v t w t + λ 2 v t * w t + o p ( 1 ) = 1 T t = 1 T λ s t * w t + o p ( 1 )
where s t * ( v t , v t * ) , and the third equation follows from Assumption 2(1) and 2(3).
Second, note that λ s t * w t , F t is a martingale difference sequence by Lemma 1. Then the following convergence results follow Lemma A2
1 T t = 1 T λ w t 2 s t * s t * λ p λ Λ λ
Moreover, we have
E ( λ s t * w t ) 2 α = λ 1 v t w t + λ 2 v t * w t 2 α 2 α ( i = 1 p λ 1 i v t i w t 2 α + i = 1 p λ 2 i v t i * w t 2 α ) 2 α
where the inequality follows from the Minkowski inequality. □
Each term in the bound (A7) is finite due to the assumed moment conditions and mixing properties,
λ 1 i v t i w t 2 α = | λ 1 i | ( E ( v i t w t ) 2 α ) 1 2 α | λ 1 i | ( E v i t 4 α E w t 4 α ) 1 4 α <
λ 2 i v t i * w t 2 α = | λ 2 i | ( E ( v i t * w t ) 2 α ) 1 2 α | λ 2 i | ( E v i t * 4 α E w t 4 α ) 1 4 α <
where the inequality follows from the Cauchy–Schwarz inequality.
So, we can conclude that (A7) is finite, by the central limit theorem for the martingale differences, (A6) holds, and then we have the desired results.
The proof of Theorem 1 follows Lemma A1–A3.

References

  1. Galvao, A.F.; Montes-Rojas, G. On the Equivalence of Instrumental Variables Estimators for Linear Models. Econ. Lett. 2015, 134, 13–15. [Google Scholar] [CrossRef]
  2. Kim, C.-J.S. Dealing with Endogeneity in Regression Models with Dynamic Coefficients. Found. Trends Econ. 2008, 3, 165–266. [Google Scholar] [CrossRef]
  3. Rigobon, R. Identification Through Heteroskedasticity. Rev. Econ. Stat. 2003, 85, 777–792. [Google Scholar] [CrossRef]
  4. Hwu, S.-T.; Kim, C.-J. Estimating Trend Inflation Based on Unobserved Components Models: Is It Correlated with the Inflation Gap? J. Money Credit Bank. 2019, 51, 2305–2319. [Google Scholar] [CrossRef]
  5. Bertsche, D.; Braun, R. Identification of Structural Vector Autoregressions by Stochastic Volatility. J. Bus. Econ. Stat. 2022, 40, 328–341. [Google Scholar] [CrossRef]
  6. Hertwartz, H.; Lütkepohl, H. Structural vector autoregressions with Markov switching: Combining conventional with statistical identification of shocks. J. Econ. 2014, 183, 104–116. [Google Scholar]
  7. Campbell, J.Y. Consumption-based asset pricing. Handb. Econ. Financ. 2003, 1, 803–887. [Google Scholar] [CrossRef]
  8. Bansal, R.; Yaron, A. Risks for the long run: A potential resolution of asset pricing puzzles. J Financ. 2004, 59, 1481–1509. [Google Scholar] [CrossRef]
  9. Neely, C.J.; Roy, A.; Whiteman, C.H. Risk aversion versus intertemporal substitution: A case study of identification failure in the intertemporal consumption capital asset pricing model. J. Bus. Econ. Stat. 2001, 19, 395–403. [Google Scholar] [CrossRef]
  10. Yogo, M. Estimating the elasticity of intertemporal substitution when instruments are weak. Rev. Econ. Stat. 2004, 86, 797–810. [Google Scholar] [CrossRef]
  11. Schwert, G.W. Why Does Stock Market Volatility Change Over Time? J. Financ. 1989, 44, 1115–1153. [Google Scholar] [CrossRef]
  12. Hahn, J.; Hausman, J.A. A New Specification Test for the Validity of Instrumental Variables. Econometrica 2002, 70, 163–189. [Google Scholar] [CrossRef]
  13. Staiger, D.; Stock, J.H. Instrumental variables regression with weak instruments. Econometrica 1997, 65, 557–586. [Google Scholar] [CrossRef]
  14. Hull, J.; White, A. The Pricing of Options on Assets with Stochastic Volatilities. J. Financ. 1987, 42, 281–300. [Google Scholar] [CrossRef]
  15. Harvey, A.C.; Ruiz, E.; Shephard, N. Multivariate Stochastic Variance Models. Rev. Econ. Stud. 1994, 61, 247–264. [Google Scholar] [CrossRef]
  16. Müller, H.-G. Smooth optimum kernel estimators near endpoints. Biometrika 1991, 78, 521–530. [Google Scholar] [CrossRef]
  17. Feng, Y. Simultaneously Modeling Conditional Heteroskedasticity and Scale Change. Econ. Theory 2004, 20, 563–596. [Google Scholar] [CrossRef]
  18. Newey, W.K. The Asymptotic Variance of Semiparametric Estimators. Econometrica 1994, 62, 1349–1382. [Google Scholar] [CrossRef]
  19. Pagan, A. Econometric issues in the analysis of regressions with generated regressors. Int. Econ. Rev. 1984, 65, 557–586. [Google Scholar] [CrossRef]
  20. Hausman, J. Specification tests in econometrics. Econometrica 1978, 46, 1251–1271. [Google Scholar] [CrossRef]
  21. Wu, D. Alternative tests of independence between stochastic regressors and disturbances. Econometrica 1973, 41, 733–750. [Google Scholar] [CrossRef]
  22. Cavaliere, G. Unit root tests under time-varying variance shifts. Econ. Rev. 2004, 23, 259–292. [Google Scholar] [CrossRef]
  23. Phillips, P.C.B.; Xu, K.-L. Inference in Autoregression under Heteroskedasticity. J. Time Ser. Anal. 2006, 27, 289–308. [Google Scholar] [CrossRef]
  24. Xu, K.-L.; Phillips, P.C.B. Adaptive Estimation of Autoregressive Models with Time-Varying Variances. J. Econ. 2008, 142, 265–280. [Google Scholar] [CrossRef]
  25. Epstein, L.G.; Zin, S.E. Substitution, risk aversion, and the temporal behavior of consumption and asset returns: A theoretical framework. Econometrica 1989, 57, 937–969. [Google Scholar] [CrossRef]
  26. Epstein, L.G.; Zin, S.E. Substitution, risk aversion, and the temporal behavior of consumption and asset returns: An Empirical Analysis. J. Political Econ. 1991, 99, 263–286. [Google Scholar] [CrossRef]
  27. Cochrane, J.H. The Dog That Did Not Bark: A Defense of Return Predictability. Rev. Financ. Stud. 2008, 21, 1533–1575. [Google Scholar] [CrossRef]
  28. Hall, R.E. Intertemporal Substitution in Consumption. J. Political Econ. 1991, 96, 339–357. [Google Scholar] [CrossRef]
  29. Campbell, J. Data Appendix for ‘Asset Prices, Consumption, and the Business Cycle’. Unpublished Manuscript. 1999. Available online: https://campbell.scholars.harvard.edu//publications/data-appendix-asset-prices-consumption-and-business-cycle (accessed on 25 May 2025).
  30. Anderson, T.W.; Rubin, H. Estimation of the parameters of a single equation in a complete system of stochastic equations. Ann. Math. Stat. 1949, 20, 46–63. [Google Scholar] [CrossRef]
  31. Kleibergen, F. Pivotal statistics for testing structural parameters in instrumental variables regression. Econometrica 2002, 70, 1781–1803. [Google Scholar] [CrossRef]
  32. Moreira, M.J. A conditional Likelihood ratio test for structure models. Econometrica 2002, 71, 1027–1048. [Google Scholar] [CrossRef]
  33. Moreira, M.J. Test with Correct size when instruments can be arbitrarily weak. J Econ. 2007, 152, 131–140. [Google Scholar] [CrossRef]
  34. Davidson, J. Stochastic Limit Theory; Oxford University Press: Oxford, UK, 1994. [Google Scholar]
  35. Andrew, D.W.K. Law of Large Numbers for Dependent Non-Identical Distributed Random Variables. Econ. Theory 1988, 4, 458–467. [Google Scholar] [CrossRef]
Figure 1. Aggregate stock return and estimated volatility.
Figure 1. Aggregate stock return and estimated volatility.
Mathematics 13 01849 g001
Figure 2. Aggregate stock return and estimated volatility.
Figure 2. Aggregate stock return and estimated volatility.
Mathematics 13 01849 g002
Table 1. Supporting evidence for the proposed identification strategy.
Table 1. Supporting evidence for the proposed identification strategy.
CountrySample PeriodF-StatisticsPartial R 2 Corr ( r t , v t * )
AUL1970.3–1998.41.820.060.81
CAN1970.3–1999.12.510.080.83
FR1970.3–1998.33.10.100.81
GER1979.1–1998.30.690.040.85
ITA1971.4–1998.11.100.040.89
JAP1970.3–1998.43.490.110.85
NTH1977.3–1998.40.730.030.84
SWD1970.3–1999.22.240.070.84
SWT1976.2–1998.40.110.010.83
UK1970.3–1999.12.620.090.80
USA1947.3–1998.42.880.050.85
Note: This table reports the F-statistics and partial R 2 of the first-stage regressions and the correlations between r t and v t * .
Table 2. Confidence interval based on the control function approach and the weak-instrument robust tests.
Table 2. Confidence interval based on the control function approach and the weak-instrument robust tests.
CountrySample PeriodCFARLMCLR
AUL1970.3–1998.4[−0.11, 0.14][ , ∞][ , ∞][ , ∞]
CAN1970.3–1999.1[0.03, 0.28][0.02, 4.03][0.05, 0.35][0.04, 0.41]
FR1970.3–1998.3[−0.03, 0.15][−0.28, 0.20][ , ∞][−0.16, 0.11]
GER1979.1–1998.3[−0.01, 0.19][ , ∞][ , ∞][ , ∞]
ITA1971.4–1998.1[−0.03, 0.17][ , ∞][ , ∞][ , ∞]
JAP1970.3–1998.4[−0.05, 0.19][−0.05, 0.32][−1.01, 0.20][−0.02, 0.21]
NTH1977.3–1998.4[0.03, 0.19][ , ∞][ , ∞][ , ∞]
SWD1970.3–1999.2[−0.01, 0.07][ , ∞][ , ∞][ , ∞]
SWT1976.2–1998.4[−0.04, 0.21][ , ∞][ , ∞][ , ∞]
UK1970.3–1999.1[0.02, 0.29][−0.51, −0.02][ , ∞][ , ∞]
USA1947.3–1998.4[0.11, 0.27][−0.21, −0.02][ , ∞][ , ∞]
Note: This table reports the confidence interval of EIS estimated by the proposed approach and the weak-instrument robust tests. CF refers to the proposed estimator. AR refers to the Anderson–Rubin test. LM refers to the Kleibergen-LM test. CLR refers to the Conditional Likelihood Ratio test.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hwu, S.-T. Identification and Estimation in Linear Models with Endogeneity Through Time-Varying Volatility. Mathematics 2025, 13, 1849. https://doi.org/10.3390/math13111849

AMA Style

Hwu S-T. Identification and Estimation in Linear Models with Endogeneity Through Time-Varying Volatility. Mathematics. 2025; 13(11):1849. https://doi.org/10.3390/math13111849

Chicago/Turabian Style

Hwu, Shih-Tang. 2025. "Identification and Estimation in Linear Models with Endogeneity Through Time-Varying Volatility" Mathematics 13, no. 11: 1849. https://doi.org/10.3390/math13111849

APA Style

Hwu, S.-T. (2025). Identification and Estimation in Linear Models with Endogeneity Through Time-Varying Volatility. Mathematics, 13(11), 1849. https://doi.org/10.3390/math13111849

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop