Next Article in Journal
Examining Strategies to Manage Climate Risks of PPP Infrastructure Projects
Previous Article in Journal
Cryptocurrencies as a Tool for Money Laundering: Risk Assessment and Perception of Threats Based on Empirical Research
Previous Article in Special Issue
Historical Perspectives in Volatility Forecasting Methods with Machine Learning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Using Daily Stock Returns to Estimate the Unconditional and Conditional Variances of Lower-Frequency Stock Returns

Department of Finance, University of North Carolina at Charlotte, 9201 University City Blvd., Charlotte, NC 28223, USA
Risks 2025, 13(10), 190; https://doi.org/10.3390/risks13100190
Submission received: 3 September 2025 / Revised: 29 September 2025 / Accepted: 30 September 2025 / Published: 3 October 2025
(This article belongs to the Special Issue Volatility Modeling in Financial Market)

Abstract

If intraday price data are unavailable, then using daily returns to construct realized measures of the variances of lower-frequency returns is a natural substitute for using high-frequency returns in this context. Notably, a suitable application of this approach yields realized measures that are unbiased estimators of the unconditional and conditional variances of holding period returns for any investment horizon. I use a long sample of daily S&P 500 index returns to investigate the merits of constructing realized measures in this fashion. First, I conduct a Monte Carlo study using a data generating process that reproduces the key dynamic properties of index returns. The results of the study suggest that using realized measures constructed from daily returns to estimate the conditional and unconditional variances of lower-frequency returns should lead to substantial increases in efficiency. Next, I fit a multiplicative error model to the realized measures for weekly and monthly index returns to obtain out-of-sample forecasts of their conditional variances. Using the forecasts produced by a generalized autoregressive conditional heteroskedasticity model as a benchmark, I find that the forecasts produced by the multiplicative error model always generate lower mean absolute errors. Furthermore, the improvements in forecasting performance are statistically significant in most cases.

1. Introduction

Realized variances are ex-post measures of return variation that are typically constructed by summing the squared values of high-frequency log returns (see, e.g., Andersen and Bollerslev 1998; Andersen et al. 2003; Barndorff-Nielsen et al. 2008). The use of realized variances has revolutionized methods of modeling and forecasting volatility over the past two decades. Indeed, realized variances are now employed in a wide variety of intriguing applications. Some recent examples include forecasting volatility for stocks included the S&P 500 index via machine learning methods (Zhu et al. 2023), forecasting volatility for international real estate investment trusts (Bonato et al. 2022), modeling time-varying conditional skewness in equity markets (Kirby 2024), pricing options on the Chicago Board Options Exchange volatility index (Tong and Huang 2021), developing dynamic tail-risk models to aid in measuring and managing financial risk (Chen et al. 2023), and studying volatility spillovers across cryptocurrency markets (Ben Ameur et al. 2024).
Many of the econometric studies that employ realized variances are conducted using either daily log returns or daily simple returns (see, e.g., Gorgi et al. 2019; Hansen et al. 2012, 2024; Noureldin 2022; Noureldin et al. 2012). Researchers seldom feel the need to differentiate between simple returns and log returns in such studies because doing so is unnecessary from an empirical perspective. If the holding period for a stock or stock index is a single day, then the difference between the variance of a simple return and the variance of the corresponding log return will typically be negligible. However, the differences between the statistical properties of simple returns and those of log returns become more pronounced as the holding period increases. Thus, they are unlikely to be negligible for research that addresses asset pricing, portfolio optimization, and related topics, which is usually conducted using simple returns for weekly, monthly, or quarterly holding periods (see, e.g., Avramov et al. 2006; Kirby and Ostdiek 2012; Yogo 2006).
For instance, the covariance matrix of simple returns plays a central role in Markowitz (1952) portfolio selection. Although it would be straightforward to use realized variances computed from log returns to construct an estimator of the covariance matrix of simple returns, the estimator would typically be biased given that simple returns are nonlinearly related to log returns. Consider the case in which log returns are normally distributed. Because simple returns have a lognormal distribution under these circumstances, the variance of simple returns is typically higher than the variance of log returns in this case. This clearly suggests that it would be useful to develop a procedure for constructing realized variances that are unbiased estimators of the variances of simple returns.
More broadly, it is important to note that the high-frequency data needed to construct daily realized variances may not be available for the full sample period of interest. The first year of the Trade and Quote data provided by the New York Stock Exchange is 1993. In contrast, the coverage of the daily stock file of the Center for Research in Security Prices begins in 1926. The widespread availability of daily historical data makes it well suited for estimating the unconditional and conditional variances of lower-frequency stock returns. I investigate this approach using a new technique for constructing realized measures. Unlike the conventional construction technique pioneered by Andersen and Bollerslev (1998), the new technique delivers realized measures that are unbiased estimators of the unconditional and conditional variances of simple returns in a discrete time setting under relatively mild assumptions that are frequently invoked in the volatility modeling literature.
I begin by conducting a Monte Carlo study of the relative estimation errors that result from using the new and conventional realized measures as estimators of the unconditional and conditional variances of simple returns and log returns for a range of different holding periods. The results of the study demonstrate that my technique for constructing realized measures of the variances of simple returns works as intended. I find no evidence of bias for any holding period and the proposed realized measures deliver improvements in estimation efficiency that are comparable to those produced by conventional realized measures of the variances of log returns.
To develop further insights, I use S&P 500 index data to investigate the performance of pseudo out-of-sample variance forecasts that are constructed using the new realized measures. The empirical analysis employs the generalized autoregressive conditional heteroskedasticity (GARCH) model of Bollerslev (1986) as a benchmark and is conducted in a manner that isolates the incremental gains from using the new realized measures for modeling purposes. First, I fit a GARCH(1,1) model to simple returns for both weekly and monthly holding periods. Next, I replace the squared demeaned simple returns in the recurrence relation for the conditional variance under the GARCH(1,1) model with the corresponding realized measures. Finally, If fit the resultant specification, which is a multiplicative error model (MEM) of the type introduced by Engle (2002), to the same sample of simple returns for weekly and monthly holding periods.
Because the only difference between the recurrence relations for the conditional variances under the GARCH(1,1) and MEM(1,1) specifications is that former employs squared demeaned simple returns and the latter employs realized measures, the performance advantage of the MEM(1,1) specification (if any) is due to the incremental gains from using realized measures. I use the Giacomini and White (2006) test of equal predictive ability to assess whether the differences in performance are statistically significant. As anticipated, I find that the MEM forecasts produce smaller mean errors, smaller mean absolute errors, and smaller root mean square errors than the GARCH forecasts for every forecast horizon under consideration at both the weekly frequency and the monthly frequency. However, the results for the monthly observations are stronger from the standpoint of statistical significance. I find that the smallest t-statistic produced by the test of equal predictive ability is 2.61 in this case. Because I reject the hypothesis that the GARCH forecasts of monthly variances are just as accurate as the MEM forecasts of monthly variances at the 1% significance level, irrespective of the forecast horizon, I conclude that the proposed realized measures of the variances of simple returns deliver meaningful performance gains.
Although these results are illustrative, the proposed realized measures could obviously be exploited in other ways. For example, researchers have developed a variety of specifications that use conventional realized measures to model volatility dynamics, such as the heterogeneous autoregressive volatility (HAR) model of Corsi (2009), the high-frequency-based (HEAVY) model of Shephard and Sheppard (2010), and the realized GARCH model of Hansen et al. (2012). By replacing the realized measures constructed from high-frequency log returns with realized measures constructed from daily simple returns, any of these specifications could be used to model and forecast the conditional variances of simple returns for weekly, monthly, or quarterly holding periods.
It is also clear that the proposed realized measures can be employed to model and forecast the conditional variances of lower-frequency returns for any asset or commodity for which daily price data are readily available (e.g., the Eurodollar exchange rate, crude oil, or Bitcoin). Of course, using this approach for seasonal commodities would require a volatility model that is capable of capturing seasonality, such as a periodic MEM analog of the periodic GARCH model of Bollerslev and Ghysels (1996). But implementing this extension should be relatively straightforward from an econometric perspective.
The rest of the article is organized as follows. Section 2 shows how to construct realized measures that are unbiased estimators of the unconditional and conditional variances of simple returns. Section 3 discusses the results of the Monte Carlo study. Section 4 describes the GARCH(1,1) and MEM(1,1) specifications used to forecast conditional variances for the S&P 500 index and presents the results of the pseudo out-of-sample performance comparisons. Finally, Section 5 offers some concluding remarks.

2. Realized Measures

Suppose that P ( t i ) denotes the price of a stock or stock index at time t i . Further suppose that the price is recorded at a fixed frequency such that there is always one time period between successive elements of the sequence { P ( t i ) } i = 0 K T , where K > 1 and T > 0 are integers to be specified later. To develop realized measures that are unbiased estimators of the variance of simple returns in a discrete-time setting, I invoke assumptions that eliminate the need to employ the type of fill-in asymptotics that underpin the arguments of Andersen et al. (2003). Henceforth, F ( t i ) denotes the information set that contains all prices realized prior to time t i + 1 . I presume throughout the discussion that log returns and simple returns are weakly-stationary random variables.

2.1. Realized Measures Computed from Log Returns

Andersen and Bollerslev (1998) pioneered the use of high-frequency log returns to construct realized measures. It is easy to formulate discrete-time analogs of the basic arguments that motivate their methodology. Let r ˜ ( t i , t i + k ) = log P ( t i + k ) log P ( t i ) denote the log return for the k-period interval that begins at time t i and ends at time t i + k , where 0 k K . I assume for simplicity that E [ r ˜ ( t i , t i + 1 ) ] = 0 and use σ ˜ K 2 : = var ( r ˜ ( t i , t i + K ) ) to denote the variance of K-period log returns.
The starting point is to consider a scenario in which the single-period log returns are serially uncorrelated. Because r ˜ ( t i , t i + K ) can be expressed as r ˜ ( t i , t i + K ) = j = 1 K r ˜ ( t i + j 1 , t i + j ) , it follows immediately that
σ ˜ K = E j = 1 K r ˜ 2 ( t i + j 1 , t i + j ) M M 1 / 2 ,
where r ˜ 2 ( t i + j 1 , t i + j ) denotes the square of r ˜ ( t i + j 1 , t i + j ) . Thus, v ˜ ( t i , t i + K ) = ( j = 1 K r ˜ 2 ( t i + j 1 , t i + j ) ) M M 1 / 2 is a realized measure of volatility that satisfies E [ v ˜ 2 ( t i , t i + K ) ] = σ ˜ K 2 .
It is also easy to see that T 1 j = 1 T v ˜ 2 ( t ( j 1 ) K , t j K ) and T 1 j = 1 T r ˜ 2 ( t ( j 1 ) K , t j K ) are unbiased estimators of σ ˜ K 2 . But the former is a lot more efficient than the latter in general. More broadly, v ˜ ( t i , t i + K ) satisfies E [ v ˜ 2 ( t i , t i + K ) | F ( t i ) ] = var ( r ˜ ( t i , t i + K ) | F ( t i ) ) under suitable assumptions about the dynamic properties of log returns. To grasp the basic requirements for conditional unbiasedness, let σ ˜ 2 ( t i , t i + K ) : = var ( r ˜ ( t i , t i + K ) | F ( t i ) ) and think about a data generating process (DGP) of the form
r ˜ ( t i , t i + 1 ) = σ ˜ ( t i , t i + 1 ) z ˜ ( t i , t i + 1 ) , i = 0 , 1 , , K T 1 ,
where σ ˜ ( t i , t i + 1 ) F ( t i ) , E [ z ˜ ( t i , t i + 1 ) | F ( t i ) ] = 0 , and E [ z ˜ 2 ( t i , t i + 1 ) | F ( t i ) ] = 1 for all i. For example, the DGP could be a GARCH(1,1) model (see Bollerslev 1986). Because a DGP of this form implies that E [ r ˜ ( t i , t i + 1 ) r ˜ ( t i + j , t i + j + 1 ) | F ( t i ) ] = 0 for all j 0 , it follows that σ ˜ ( t i , t i + K ) = ( j = 1 K E [ r ˜ 2 ( t i + j 1 , t i + j ) | F ( t i ) ] ) M M 1 / 2 by iterated expectations.

2.2. Realized Measures Computed from Simple Returns

In a typical finance application (portfolio optimization, risk management, etc.), the analysis focuses on simple returns rather than log returns. Furthermore, the simple returns of interest are often measured at relatively low frequencies (monthly observations, quarterly observations, etc.). I therefore propose a new strategy for constructing realized measures that are unbiased estimators of the unconditional and conditional variances of simple returns. Henceforth, simple returns are just called returns.
Let r ( t i , t i + k ) = P ( t i + k ) / P ( t i ) 1 denote the return for the k-period interval that begins at time t i and ends at time t i + k . By straightforward algebra, this quantity can be expressed as
r ( t i , t i + k ) = j = 1 k R ( t i , t i + j 1 ) r ( t i + j 1 , t i + j ) ,
where R ( t i , t i + k ) = P ( t i + k ) / P ( t i ) denotes the gross return for the k-period interval under consideration. I assume for simplicity that E [ r ( t i , t i + 1 ) ] = 0 and explain how to relax this assumption later on.
Now let σ K 2 : = var ( r ( t i , t i + K ) ) , σ 2 ( t i , t i + K ) : = var ( r ( t i , t i + K ) | F ( t i ) ) , and consider a scenario in which single-period returns satisfy
cov ( r ( t i + j 1 , t i + j ) , R ( t i , t i + j 1 ) r ( t i + k 1 , t i + k ) R ( t i , t i + k 1 ) ) = 0
for all j > k 1 . Under these circumstances,
v ( t i , t i + K ) = j = 1 K R 2 ( t i , t i + j 1 ) r 2 ( t i + j 1 , t i + j ) M M 1 / 2
is a realized measure of σ K that satisfies E [ v 2 ( t i , t i + K ) ] = σ K 2 . Furthermore, it is apparent that v ( t i , t i + K ) satisfies E [ v 2 ( t i , t i + K ) | F ( t i ) ] = σ 2 ( t i , t i + K ) under suitable assumptions about the dynamic properties of returns. This is the case, for example, if the DGP takes the form
r ( t i , t i + 1 ) = σ ( t i , t i + 1 ) z ( t i , t i + 1 ) , i = 0 , 1 , , K T 1 ,
where σ ( t i , t i + 1 ) F ( t i ) , E [ z ( t i , t i + 1 ) | F ( t i ) ] = 0 , and E [ z 2 ( t i , t i + 1 ) | F ( t i ) ] = 1 for all i. To see why, simply note that
E [ R ( t i , t i + j 1 ) r ( t i + j 1 , t i + j ) R ( t i , t i + k 1 ) r ( t i + k 1 , t i + k ) | F ( t i ) ] = 0
for all j 1 , k 1 , and j k under the DGP in Equation (6).

2.3. Some Useful Extensions

The methodology can easily be modified to address situations in which the maintained assumptions are deemed too restrictive. For instance, if single-period returns display serial correlation, then a realized kernel approach can be used to construct the realized measures. Barndorff-Nielsen et al. (2008) show that this is an effective way of addressing the presence of serial correlation that is due to microstructure effects.
To illustrate, let J denote the lag truncation value employed by the realized kernel estimator of Barndorff-Nielsen et al. (2008). An analogous estimator for simple returns can be obtained by specifying J < K and computing v ( t i , t i + K ) as
v ( t i , t i + K ) = j = J J w j J + 1 g j ( P ( t i ) , , P ( t i + K ) ) M M 1 / 2 ,
where
g j ( P ( t i ) , , P ( t i + K ) ) = l = | j | + 1 K P ( t i + l 1 ) P ( t i ) r ( t i + l 1 , t i + l ) P ( t i + l | j | 1 ) P ( t i ) r ( t i + l | j | 1 , t i + l | j | )
and
w ( x ) = 1 6 x 2 + 6 | x | 3 0 | x | 1 / 2 2 ( 1 | x | ) 3 1 / 2 | x | 1 .
is the weight function for the Parzen kernel.
The assumption that expected returns are equal to zero can also be relaxed. Suppose, for instance, that E [ r ( t i , t i + 1 ) | F ( t i ) ] = μ for all i. In this case,
E [ ( 1 + μ ) k R ( t i , t i + k ) 1 | F ( t i ) ] = 0
for all i and k 0 , so it is a simple matter to show that
var R ( t i , t i + K ) ( 1 + μ ) K F ( t i ) = E j = 1 K R ( t i , t i + j 1 ) ( 1 + μ ) j 1 2 r ( t i + j 1 , t i + j ) μ 1 + μ 2 F ( t i )
by mirroring the arguments for the μ = 0 case. The realized measure
v * ( t i , t i + K ) = j = 1 K ( 1 + μ ) 2 ( K j ) R 2 ( t i , t i + j 1 ) ( r ( t i + j 1 , t i + j ) μ ) 2 M M 1 / 2
therefore satisfies E [ v * 2 ( t i , t i + K ) | F ( t i ) ] = var ( r ( t i , t i + K ) | F ( t i ) ) .
It is clear that v 2 ( t i , t i + K ) is a biased estimator of σ 2 ( t i , t i + K ) for cases in which μ 0 , just as v ˜ 2 ( t i , t i + K ) is a biased estimator of σ ˜ 2 ( t i , t i + K ) for cases in which E [ r ˜ ( t i , t i + 1 ) | F ( t i ) ] 0 . But the results also show how to implement a simple bias correction for v 2 ( t i , t i + K ) . In particular, a bias-corrected realized measure for returns can be obtained by substituting a consistent estimator of E [ r ( t i , t i + 1 ) ] that is available at time t i for μ in Equation (13). The effect of this correction will typically be quite small for realized measures that are constructed from daily stock returns because the sample mean of daily returns is typically only a few basis points. This is the reason why studies that fit volatility models to daily stock returns often assume that expected returns are equal to zero (see, e.g., Visser 2011).

3. Monte Carlo Analysis

I use Monte Carlo integration to document the properties of the unbiased variance estimators discussed in Section 2. The DGP for the study is a well-known variant of the GARCH(1,1) model of Bollerslev (1986). In particular, I generate the single-period log returns from the model
r ˜ ( t i , t i + 1 ) = κ σ ˜ 2 ( t i , t i + 1 ) + σ ˜ ( t i , t i + 1 ) z ˜ ( t i , t i + 1 ) ,
σ ˜ 2 ( t i , t i + 1 ) = ω + β σ ˜ 2 ( t i 1 , t i ) + α ( z ˜ ( t i 1 , t i ) γ σ ˜ ( t i 1 , t i ) ) 2 ,
where ω 0 , β 0 , α 0 , ( β + α γ 2 ) < 1 , and z ˜ ( t i , t i + 1 ) is an NID ( 0 , 1 ) random variable. This specification is well suited to Monte Carlo work because it allows σ ˜ K 2 , σ K 2 , σ ˜ 2 ( t i , t i + K ) , and σ 2 ( t i , t i + K ) to be computed analytically.1
Daily S&P 500 index data for the years 1946 through 2023 (19,835 observations) are used to calibrate the DGP. The data are from two sources: the daily stock file of the Center for Research in Security Prices for 3 July 1962 to 29 December 2023 and a dataset compiled by Schwert (1990) for 2 January 1946 to 2 July 1962.2 First, I use the method of maximum likelihood to fit the model to daily log index returns subject to κ = 0 and ω = 0 .3 Second, I set the values of α , β , and γ in Equations (14) and (15) equal to their maximum likelihood estimates, generate { r ˜ ( t i , t i + 1 } i = 0 K T 1 with κ = 0 and ω = 0 , and construct { r ( t i , t i + 1 } i = 0 K T 1 by setting κ = 1 / 2 and computing r ( t i , t i + 1 ) = exp ( κ σ ˜ 2 ( t i , t i + 1 ) + r ˜ ( t i , t i + 1 ) ) 1 for all i.4 Third, I use the simulated data to calculate v ˜ 2 ( t i , t i + K ) and v 2 ( t i , t i + K ) for each i { 0 , K , 2 K , , ( T 1 ) K } . Because there are roughly 252 trading days per year for the S&P 500 index, I consider K = 5 , K = 21 , K = 63 , K = 126 , and K = 252 to approximate weekly, monthly, quarterly, semiannual, and annual holding periods.
Table 1 summarizes the results for 10 million simulated observations (i.e., T = 1,000,000). Panel A examines the properties of the relative estimation errors for unconditional variances. The initial six columns report the mean, mean absolute, and root mean square values of r ˜ 2 ( t i , t i + K ) / σ ˜ K 2 1 and r 2 ( t i , t i + K ) / σ K 2 1 for the six values of K under consideration (denoted by ME, MAE, and RMSE). For K = 1 , the results for log returns are nearly identical to those for returns. But differences emerge as K increases.
As anticipated, the mean errors are quite small (zero to three decimal places) because r ˜ 2 ( t i , t i + K ) and r 2 ( t i , t i + K ) are unbiased estimators of σ ˜ K 2 and σ K 2 . The largest RMSEs correspond to K = 21 for log returns and K = 5 for returns. An increase in the RMSE is always indicative of an increase in kurtosis, which can be expressed as 1 plus the mean square error. The smallest MAEs and RMSEs correspond to K = 252 .
Now consider the results in the final six columns of panel A, which contain the mean, mean absolute, and root mean square values of v ˜ 2 ( t i , t i + K ) / σ ˜ K 2 1 and v 2 ( t i , t i + K ) / σ K 2 1 for the six values of K under consideration.5 The realized measures show no indications of bias and are clearly much more efficient estimators of σ ˜ K 2 and σ K 2 for K > 1 than r ˜ 2 ( t i , t i + K ) and r 2 ( t i , t i + K ) . Notice, for example, that replacing r 2 ( t i , t i + K ) with v 2 ( t i , t i + K ) reduces the RMSE from 1.666 to 0.856 with K = 5 . This is a reduction of 48.6 % . Furthermore, the improvements in efficiency become more pronounced as K increases. The RMSE drops from 1.407 to 0.194 for the K = 252 case, which is a reduction of 86.2%.
Panel B examines the properties of the relative estimation errors for the conditional variances using the same layout as panel A. Once again, the mean errors are zero to three decimal places in all cases and there are large gains in efficiency from employing the realized measures. The reduction in the RMSEs relative to those reported in panel A is an indicator of the benefits exploiting conditioning information. The RMSEs drop by 0.203 (12.6%) in all cases for K = 1 . As K increases, the drop always becomes smaller in raw numerical terms. But the percentage drop in the RMSE does not display a monotonic relation with K. For example, the RMSE for v 2 ( t i , t i + K ) drops by 0.133 ( 15.5 % ) for K = 5 .
Overall, the Monte Carlo evidence indicates that the proposed technique for constructing realized measures that are unbiased estimators of the variances of holding-period returns works as intended. It achieves improvements in efficiency that are comparable to those achieved by the conventional technique for constructing realized measures of the variances of multiperiod log returns. I now turn to an empirical application that focuses on forecasting the conditional variances of weekly and monthly S&P 500 index returns.

4. Out-of-Sample Results for the S&P 500 Index

To lay the groundwork for the discussion, assume that the objective is to forecast the variance of a financial variable y ( t + s ) using a realization of the sequence { y ( 1 ) , , y ( t ) } for some s 1 . Because the GARCH(1,1) model of Bollerslev (1986) is known to perform well in a variety of settings, it is often used to construct such forecasts. If the DGP is a GARCH(1,1) specification, then y ( t + s ) can be expressed as
y ( t + s ) = μ + h M M 1 / 2 ( t + 1 , s ) z ( t + s ) ,
h ( t + 1 , s ) = ( 1 ϕ s 1 ) η + ϕ s 1 h ( t + 1 , 1 ) ,
h ( t + 1 , 1 ) = η + ϕ ( h ( t , 1 ) η ) + δ ( e 2 ( t ) h ( t , 1 ) ) ,
where E [ z ( t + s ) | y ( 1 ) , , y ( t ) ] = 0 , E [ z 2 ( t + s ) | y ( 1 ) , , y ( t ) ] = 1 , and e 2 ( t ) = ( y ( t ) μ ) 2 . Thus, h ( t + 1 , s ) is a conditionally-unbiased s-step-ahead forecast of e 2 ( t + s ) .
Now consider an alternative s-step-ahead forecast of e 2 ( t + s ) that is constructed from a realization of the sequence { x ( 1 ) , , x ( t ) } , where x ( t ) is a conditionally-unbiased realized measure of the variance of y ( t ) with dynamics that are described by an MEM specification of the type introduced by Engle (2002). If the DGP is an MEM(1,1) specification, then x ( t + s ) can be expressed as
x ( t + s ) = m ( t + 1 , s ) u ( t + s ) ,
m ( t + 1 , s ) = ( 1 φ s 1 ) ς + φ s 1 m ( t + 1 , 1 ) ,
m ( t + 1 , 1 ) = ς + φ ( m ( t , 1 ) ς ) + λ ( x ( t ) m ( t , 1 ) ) ,
where u ( t + s ) is strictly non-negative and satisfies E [ u ( t + s ) | x ( 1 ) , , x ( t ) ] = 1 . Because m ( t + 1 , s ) is a conditionally unbiased s-step-ahead forecast of x ( t + s ) , it clearly has the potential to outperform h ( t + 1 , s ) as a forecast of e 2 ( t + s ) .
I focus on the case in which y ( t + s ) is a weekly or monthly return on the S&P 500 index and the realized measure of its variance is constructed from daily returns. Presumably, variance forecasts based on realized measures should generally be more accurate than those based on weekly or monthly returns. I therefore use the pseudo out-of-sample forecasts produced by the GARCH(1,1) specification to benchmark the performance of the pseudo out-of-sample forecasts produced by the MEM(1,1) specification. As in Giacomini and White (2006), I conduct the analysis using limited-memory estimators of the parameters.
Because the GARCH(1,1) model implies that e 2 ( t + 1 ) = h ( t + 1 , 1 ) z 2 ( t + 1 ) , it is essentially a MEM(1,1) specification for e 2 ( t + 1 ) . Furthermore, the recurrence relation for h ( t + 1 , 1 ) in Equation (18) can be transformed into the recurrence relation for m ( t + 1 , 1 ) in Equation (21) by replacing e 2 ( t ) with x ( t ) and relabeling the parameters. It is apparent, therefore, that the research design ensures that performance advantage of the MEM(1,1) specification (if any) is due to the incremental gains from using realized measures as long as the approach used to fit the GARCH and MEM specifications puts them on an equal footing. This poses no issues because fitting the GARCH specification under the assumption that z ( t + 1 ) NID ( 0 , 1 ) produces the same results as treating e 2 ( t + 1 ) = h ( t + 1 , 1 ) z 2 ( t + 1 ) as an MEM specification and fitting it by assuming that z 2 ( t + 1 ) is a serially independent exponential random variable with a unit mean (see Engle 2002, for further elaboration).
To illustrate, suppose W + s 1 > 0 is the number of observations in a rolling window of weekly or monthly returns. For each choice of s and value of N { 1 , , T W s + 1 } , I construct an estimate of h ( t + 1 , s ) for t = N + W 1 using the estimate of θ : = ( μ , η , ϕ , δ ) obtained by minimizing
Q h ( θ ; s , N ) = t = N N + W 1 1 2 log ( h ( t , s ) ) + 1 2 e 2 ( t + s 1 ) h ( t , s )
subject to h ( N , 1 ) = η , μ = μ ^ , and η = η ^ , where μ ^ = W 1 t = N N + W 1 y ( t ) and η ^ = W 1 t = N N + W 1 ( y ( t ) μ ^ ) 2 . Similarly, for each choice of s and value of N, I construct an estimate of m ( t + 1 , s ) for t = N + W 1 using the estimate of ϑ : = ( ς , φ , λ ) obtained by minimizing
Q m ( ϑ ; s , N ) = t = N N + W 1 log ( m ( t , s ) ) + x ( t + s 1 ) m ( t , s )
subject to m ( N , 1 ) = ς and ς = ς ^ , where ς ^ = W 1 t = N N + W 1 x ( t ) . The resultant estimated values of μ , h ( t + 1 , s ) , and m ( t + 1 , s ) are denoted by μ ^ ( t + 1 , s ) , h ^ ( t + 1 , s ) , and m ^ ( t + 1 , s ) .
Several features of this procedure are worthy of further comment. First, apart from an additive constant, Q h ( θ ; s , N ) and Q m ( ϑ ; s , N ) are the log quasi-likelihood functions that result from treating z ( t ) as N ( 0 , 1 ) and u ( t ) as an exponential random variable with a rate parameter of one. Thus, the estimators of θ and ϑ are consistent under the usual regularity conditions for quasi-maximum likelihood estimation. Second, I use the sample mean of y ( t ) , sample variance of y ( t ) , and sample mean of x ( t ) that are computed from the initial W observations of the rolling window as estimators of μ , η , and ς . This targeting approach simplifies optimization. Third, the procedure produces horizon-tuned forecasts because the estimates of ϕ , δ , φ , and λ are specific to the value of s under consideration.6
To formally compare the accuracy of h ^ ( t + 1 , s ) and m ^ ( t + 1 , s ) as s-step-ahead forecasts of e ^ 2 ( t + s ) = ( y ( t + s ) μ ^ ( t + 1 , s ) ) 2 , I use the unconditional version of the Giacomini and White (2006) test of equal predictive ability. The test is based on the criterion
Δ L ( t + s ) = e ^ 2 ( t + s ) h ^ ( t + 1 , s ) 1 e ^ 2 ( t + s ) m ^ ( t + 1 , s ) 1 , t = W , W + 1 , , T s ,
which is the difference between the absolute error losses produced by h ^ ( t + 1 , s ) and m ^ ( t + 1 , s ) .7 The null hypothesis for the test is H 0 : E [ Δ L ( t + s ) ] = 0 . Hence, inference is conducted using the t-statistic for
Δ L ¯ ( s ) = 1 T W s + 1 t = W T s Δ L ( t + s ) .
If Δ L ¯ ( s ) is positive and statistically significant, then the test indicates that the s-step-ahead MEM forecasts outperform the s-step-ahead GARCH(1,1) forecasts under MAE loss.8
The weekly and monthly index returns along with their realized variances are computed from daily index data for the years 1946 through 2023. As is typical in the finance literature, I use the actual number of trading days in a given week or given month rather than a fixed value of K for the computations. Because the daily index returns display some evidence of negative first-order serial correlation, I account for the impact of this feature by computing the realized measures as
v 2 ( t i , t i + D ) = j = 1 D R 2 ( t i , t i + j 1 ) r 2 ( t i + j 1 , t i + j ) +                                               2 j = 1 D 1 R ( t i , t i + j 1 ) r ( t i + j 1 , t i + j ) R ( t i , t i + j ) r ( t i + j , t i + j + 1 )
rather than as shown in Section 2.9 Here D denotes the number of trading days for the week or month in question. I specify W = 2034 for the weekly data and W = 468 for the monthly data (50% of the number of available observations in each case). To aid in interpreting the findings, I also conduct tests of equal predictive ability using weekly and monthly observations of log returns and their realized variances.

4.1. Properties of the Rolling-Window Parameter Estimates

Table 2 examines the properties of the sequence of parameter estimates produced by the rolling-window optimizations for each specification. Panels A and B present the results for weekly log returns and weekly returns. Not surprisingly, the average estimates of ϕ and φ for s = 1 point to strong persistence in the conditional variances for both log returns and returns. The results also indicate that the estimates of ϕ and φ are quite stable over time. In panel A, for example, the estimate of ϕ for s = 1 ranges from 0.943 to 0.977 and the estimate of φ for s = 1 ranges from 0.959 to 0.975.
The results for δ and λ in panel A display some interesting patterns. First, the average estimate of δ is somewhat smaller than the average estimate of λ for s = 1 , s = 3 , and s = 6 . This finding suggests the conditional variance process of weekly log returns displays a weaker response to shocks under the GARCH specification than under the MEM specification. Second, the average estimate of δ declines monotonically with s, whereas the average estimate of λ does not. But there is a sharp drop in the average estimate of λ for s = 12 . Although the underlying mechanism that leads to this finding is not immediately apparent, the findings for weekly returns mirror those for weekly log returns in all respects.
Panels C and D present the results for monthly log returns and monthly returns. As anticipated, the average estimates of ϕ and φ are somewhat lower than the corresponding values in panels A and B, which is consistent with returns following a stationary stochastic process. But the results still point to a substantial degree of persistence in the conditional variances. There is also more variation in the estimates of ϕ and φ over time for the monthly observations, which is an expected consequence of the sharp reduction in the number of observations in the rolling window used for estimation purposes.
Perhaps the most intriguing aspect of the results in panels C and D is that the average estimate of δ is considerably smaller than the average estimate of λ for s = 1 , s = 6 , and s = 12 . This pattern suggests that the GARCH specification produces a smoother sequence of conditional variance forecasts than the MEM specification, which could indicate that the latter specification has an advantage in tracking the conditional variances. Notice that the average estimate of λ for s = 3 is relatively low by comparison. Because s = 3 for monthly observations is roughly equivalent to s = 12 for weekly observations, the relation between the average estimate of λ and the forecast horizon is similar at both frequencies.

4.2. Conditional Volatility Forecasts

To develop further insights, I plot the conditional volatility forecasts for weekly returns and monthly returns. Figure 1 shows side-by-side plots of the GARCH and MEM forecasts for weekly returns. The upper panels are for s = 1 and lower panels are for s = 12 . Although the side-by-side comparisons highlight the broad similarities in the forecasts for both forecast horizons, it is easy to spot a few differences. For instance, the spike in the one-step-ahead forecast of conditional volatility that follows the 1987 stock market crash is larger for the MEM specification than for the GARCH specification. But it is clear from the plots that the GARCH and MEM forecasts are highly correlated as a general rule.
Of course, this finding does not necessarily imply that the differences in the predictive ability of GARCH and MEM forecasts is negligible. If the MEM forecasts are more efficient than the GARCH forecasts, then they should have a performance advantage in formal statistical tests provided that the sample size is sufficiently large. Furthermore, the results of the Monte Carlo analysis indicate that gains from employing realized measures are inversely related to the investment horizon used for the analysis.
Consider the side-by-side plots of the GARCH and MEM forecasts for monthly returns, which are shown in Figure 2. The visual differences in the plots are certainly more pronounced in this case. Not only are the one-step-ahead GARCH forecasts relatively smooth, they are also confined to a much narrower range than one-step-ahead MEM forecasts. These features are broadly consistent with a scenario in which the realized measures are more efficient estimators of the conditional variances than the squared demeaned returns.

4.3. Hypothesis Tests

The tests of equal predictive ability provide formal evidence in this regard. The results of the tests are presented in Table 3. The initial eight columns of the table report the mean, mean absolute, root mean square, mean square values of e ^ 2 ( t + s ) / h ^ ( t + 1 , s ) 1 and e ^ 2 ( t + s ) / m ^ ( t + 1 , s ) 1 for the four choices of s: 1, 3, 6, and 12. The final three columns report Δ L ¯ ( s ) , its t-statistic, and the associated p-value.
The results in panel A are for weekly log returns. Notably, the MEM forecasts produce smaller MEs, MAEs, and RMSEs than the GARCH forecasts at every forecast horizon. The largest difference in the RMSE corresponds to s = 3 : 3.604 versus 2.440. But the test of equal predictive ability produces a p-value of 0.129 in this case. Broadly speaking, however, the test favors the MEM forecasts. Note that it produces a t-statistic of 1.75 ( p = 0.079 ) for s = 6 and 2.30 ( p = 0.021 ) for s = 12 .
The results in panel B are for weekly returns. Once again, the MEM forecasts produce smaller MEs, MAEs, and RMSEs than the GARCH forecasts at every forecast horizon. The other findings are also similar to those for weekly log returns. The test of equal predictive ability favors the MEM forecasts, yielding a t-statistic of 1.93 ( p = 0.054 ) for s = 6 and 2.31 ( p = 0.021 ) for s = 12 .
The results in panels C and D are for monthly log returns and monthly returns. The overall pattern of the MAEs and RMSEs mirrors that in panels A and B. However, the evidence regarding the superiority of the MEM forecasts is considerably stronger at the monthly frequency. The smallest t-statistics in panels C and D are 2.36 and 2.61, which have p-values of 0.018 and 0.009. Hence, the null hypothesis of equal predictive ability is rejected at the 1% level for every forecast horizon for monthly returns. This finding highlights the extent to which the new realized measures for lower-frequency returns deliver meaningful performance gains.

4.4. Broader Implications of the Analysis

The implications of the results for the MEM(1,1) specification extend beyond the specification itself because they suggest that replacing squared demeaned returns with the proposed realized measures can be adopted as a general strategy for improving the performance of volatility models for lower-frequency returns. Indeed, if the objective is to model the conditional variances of lower-frequency returns, then the proposed realized measures could be in place of conventional realized measures in any existing specification that uses conventional realized measures. Some prominent examples of such specifications include the heterogeneous HAR model of Corsi (2009), the HEAVY model of Shephard and Sheppard (2010), and the realized GARCH model of Hansen et al. (2012).
As for potential applications of the methodology, it can be implemented for any asset or commodity for which daily price data are readily available. This includes international equity indexes, exchange rates, commodities, and cryptocurrencies. It should therefore prove useful in many types of research, especially if the focus is on volatility modeling, asset pricing, or risk management over medium- to long-term horizons. Depending on the application, it might be necessary to address additional features of the DGP. One that immediately comes to mind is deterministic patterns in the volatility of seasonal commodity returns. In this case, the methodology could be implemented using a volatility model that is capable of capturing seasonality, such as a periodic MEM analog of the periodic GARCH model of Bollerslev and Ghysels (1996).

4.5. Caveats

The discussion in Section 2.3 addresses two of the key assumptions that underpin the methodology and outlines techniques for relaxing these assumptions should they fail to hold in the setting of interest. But it is worthwhile to mention a few other caveats about implementing the methodology using a given specification of the DGP. Although the basic MEM(1,1) specification is useful for illustrating the incremental improvements in forecasting performance that result from using the proposed realized measures in place of squared demeaned returns, it should clearly be subject to specification tests before being adopted in a particular application. A potential concern is that, like the benchmark GARCH(1,1) specification, the basic MEM(1,1) specification is incapable of capturing leverage effects. This concern could easily be addressed by replacing Equation (21) with
m ( t + 1 , 1 ) = ς + φ ( m ( t , 1 ) ς ) + ( λ 1 I ( y ( t ) 0 ) + λ 2 I ( y ( t ) > 0 ) ) ( x ( t ) m ( t , 1 ) ) ,
where I ( · ) is the indicator function. This would yield an asymmetric MEM(1,1) analog of the threshold GARCH(1,1) specification of Glosten et al. (1993).
It is also important to consider the potential impact of phenomena, such as structural breaks, that would invalidate the assumption that simple returns are weakly stationary. Note, however, that this caveat is applicable to any volatility model that assumes weak stationarity. If an MEM(1,1) model is misspecified due the presence of a structural break or breaks, then the same is true of a GARCH(1,1) model.

5. Conclusions

The availability of high-frequency data on stock prices has transformed the volatility modeling literature over the past two decades. But there are still good arguments for using daily returns to estimate the volatility of longer-horizon returns, especially for sample periods that begin prior to 1993. Because the statistical properties of log returns differ from those of returns and the differences increase with the investment horizon, I show how to construct realized measures that are unbiased estimators of the unconditional and conditional variances of returns in a discrete-time setting, provided that the DGP satisfies relatively mild assumptions that are often invoked in the volatility-modeling literature. The empirical evidence indicates that using the proposed realized measures to compute out-of-sample forecasts of the variances of weekly and monthly returns on the S&P 500 index leads to significant improvements in forecast accuracy. Hence, the measures should be useful in research that addresses asset pricing, portfolio optimization, and related topics, which is typically conducted using returns for weekly, monthly, or quarterly holding periods.
For example, suppose an investor wants to estimate the conditional value at risk for a long position in an equity index over a one-month holding period. This could be accomplished in two simple steps. First, use the realized measures constructed from daily index returns to fit an MEM specification to the monthly index returns. Second, use the resultant sequence of estimated conditional volatilities to standardize the sequence of monthly index returns, find the quantile of the standardized index returns that corresponds to the desired confidence level for the value-at-risk criterion, and compute the conditional value at risk using the estimated conditional volatility for the month that follows the final month in the sample period.
The methodology could also be exploited in macro-finance applications. Suppose, for instance, that a researcher is interested in assessing the influence of macroeconomic variables, such as industrial production, on the volatility of monthly stock market returns. One approach for doing so would be to fit an autoregressive model to the logarithm of the gross monthly growth rate of industrial production, use the resultant parameter estimates to compute the estimated multiplicative shocks to the growth rate, and augment the conditional variance recursion of an MEM specification for monthly stock market returns with lagged values of the estimated growth-rate shocks. This approach would allow a long sample period to be used for the analysis because a century’s worth of monthly data on U.S. industrial production and daily data on U.S. market returns are currently available.

Funding

This research received no external funding.

Data Availability Statement

The data for 2 January 1946 to 2 July 1962 are downloadable from https://www.billschwert.com/dstock.htm. The data for 3 July 1962 to 29 December 2023 are from a commercial data provider that charges a subscription fee to obtain data access. Please contact the Center for Research in Security Prices for subscription information.

Conflicts of Interest

The author declares no conflicts of interest.

Notes

1
See the note to Table 1 for specifics.
2
Downloadable from https://www.billschwert.com/dstock.htm, accessed on 16 August 2025.
3
The latter constraint is imposed because the nonnegativity restriction on ω is binding for the sample under consideration. This is a common finding for this model in the literature (see, e.g., Christoffersen et al. 2013).
4
The maximum likelihood estimates of the parameters are given in the note to Table 1. Notice that the simulated log returns and simulated returns have a population mean of zero by construction.
5
The results for K = 1 are identical to those in the first six columns because v ˜ 2 ( t i , t i + 1 ) = r ˜ 2 ( t i , t i + 1 ) and v 2 ( t i , t i + 1 ) = r 2 ( t i , t i + 1 ) .
6
This approach to constructing multi-step-ahead variance forecasts is discussed in detail by Shephard and Sheppard (2010).
7
I use absolute error loss rather than squared error loss to mitigate the impact of the pronounced excess kurtosis of S&P 500 index returns, which substantially inflates the variance of e ^ 2 ( t + s ) .
8
I use the Newey and West (1987) estimator with a lag length of s 1 to estimate the long-run variance of Δ L ¯ ( s ) .
9
Technically, the autocorrelation correction in Equation (26) could cause v 2 ( t i , t i + D ) to be negative. But this never occurs in the empirical application. Under the realized kernel approach with J = 1 , the second summation would be multiplied by 1 / 2 rather than by 2.

References

  1. Andersen, Torben, and Tim Bollerslev. 1998. Answering the skeptics: Yes, standard volatility models do provide accurate forecasts. International Economic Review 39: 885–905. [Google Scholar] [CrossRef]
  2. Andersen, Torben, Tim Bollerslev, Francis Diebold, and Paul Labys. 2003. Modeling and forecasting realized volatility. Econometrica 71: 579–625. [Google Scholar] [CrossRef]
  3. Avramov, Doron, Tarun Chordia, and Amit Goyal. 2006. Liquidity and autocorrelations in individual stock returns. The Journal of Finance 61: 2365–94. [Google Scholar] [CrossRef]
  4. Barndorff-Nielsen, Ole, Peter Hansen, Asger Lunde, and Neil Shephard. 2008. Designing realized kernels to measure the ex-post variation of equity prices in the presence of noise. Econometrica 76: 1481–536. [Google Scholar]
  5. Ben Ameur, Hachmi, Zied Ftiti, and Waël Louhichi. 2024. Interconnectedness of cryptocurrency markets: An intraday analysis of volatility spillovers based on realized volatility decomposition. Annals of Operation Research 341: 757–79. [Google Scholar]
  6. Bollerslev, Tim. 1986. Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics 31: 307–27. [Google Scholar] [CrossRef]
  7. Bollerslev, Tim, and Eric Ghysels. 1996. Periodic autoregressive conditional heteroscedasticity. Journal of Business and Economic Statistics 14: 139–51. [Google Scholar] [CrossRef]
  8. Bonato, Matteo, Oguzhan Cepni, Rangan Gupta, and Christian Pierdzioch. 2022. Forecasting realized volatility of international REITs: The role of realized skewness and realized kurtosis. Journal of Forecasting 41: 303–15. [Google Scholar] [CrossRef]
  9. Chen, Cathy, Hsiao-Yun Hsu, and Toshiaki Watanabe. 2023. Tail risk forecasting of realized volatility CAViaR models. Finance Research Letters 51: 103326. [Google Scholar] [CrossRef]
  10. Christoffersen, Peter, Steven Heston, and Kris Jacobs. 2013. Capturing option anomalies with a variance-dependent pricing kernel. Review of Financial Studies 26: 1963–2006. [Google Scholar] [CrossRef]
  11. Corsi, Fulvio. 2009. A simple approximate long-memory model of realized volatility. Journal of Financial Econometrics 7: 174–96. [Google Scholar] [CrossRef]
  12. Engle, Robert. 2002. New frontiers for ARCH models. Journal of Applied Econometrics 17: 425–46. [Google Scholar] [CrossRef]
  13. Giacomini, Raffaella, and Halbert White. 2006. Tests of conditional predictive ability. Econometrica 74: 1545–78. [Google Scholar] [CrossRef]
  14. Glosten, Lawrence, Ravi Jagannathan, and David Runkle. 1993. On the relation between the expected value and the volatility of the nominal excess return on stocks. Journal of Finance 48: 1779–801. [Google Scholar] [CrossRef]
  15. Gorgi, Paolo, Peter Hansen, Patryk Janus, and Siem Jan Koopman. 2019. Realized Wishart-GARCH: A score-driven multi-asset volatility model. Journal of Financial Econometrics 17: 1–32. [Google Scholar] [CrossRef]
  16. Hansen, Peter, Zhuo Huang, and Howard Shek. 2012. Realized GARCH: A joint model for returns and realized measures of volatility. Journal of Applied Econometrics 27: 877–906. [Google Scholar] [CrossRef]
  17. Hansen, Peter, Zhuo Huang, Chen Tong, and Tianyi Wang. 2024. Realized GARCH, CBOE VIX, and the volatility risk premium. Journal of Financial Econometrics 22: 187–223. [Google Scholar] [CrossRef]
  18. Heston, Steven, and Saikat Nandi. 2000. A closed-form GARCH option valuation model. The Review of Financial Studies 13: 585–625. [Google Scholar] [CrossRef]
  19. Kirby, Chris. 2024. Volatility shocks, leverage effects, and time-varying conditional skewness. Journal of Financial Econometrics 22: 1714–58. [Google Scholar] [CrossRef]
  20. Kirby, Chris, and Barbara Ostdiek. 2012. It’s all in the timing: Simple active portfolio strategies that outperform naive diversification. Journal of Financial and Quantitative Analysis 47: 437–67. [Google Scholar] [CrossRef]
  21. Markowitz, Harry. 1952. Portfolio selection. Journal of Finance 7: 77–91. [Google Scholar]
  22. Newey, Whitney, and Kenneth West. 1987. A simple, positive semi-definite, heteroskedasticity and autocorrelation consistent covariance matrix. Econometrica 55: 703–708. [Google Scholar] [CrossRef]
  23. Noureldin, Diaa. 2022. Volatility prediction using a realized-measure-based component model. Journal of Financial Econometrics 20: 76–104. [Google Scholar] [CrossRef]
  24. Noureldin, Diaa, Neil Shephard, and Kevin Sheppard. 2012. Multivariate high-frequency-based volatility (HEAVY) models. Journal of Applied Econometrics 27: 907–33. [Google Scholar] [CrossRef]
  25. Schwert, G. William. 1990. Indexes of U.S. stock prices from 1802 to 1987. The Journal of Business 63: 399–426. [Google Scholar] [CrossRef]
  26. Shephard, Neil, and Kevin Sheppard. 2010. Realising the future: Forecasting with high-frequency-based volatility (HEAVY) models. Journal of Applied Econometrics 25: 197–231. [Google Scholar] [CrossRef]
  27. Tong, Chen, and Zhuo Huang. 2021. Pricing VIX options with realized volatility. Journal of Futures Markets 41: 1180–200. [Google Scholar] [CrossRef]
  28. Visser, Marcel. 2011. GARCH parameter estimation using high-frequency data. Journal of Financial Econometrics 9: 162–97. [Google Scholar]
  29. Yogo, Motohiro. 2006. A consumption-based explanation of expected stock returns. The Journal of Finance 61: 539–80. [Google Scholar] [CrossRef]
  30. Zhu, Haibin, Lu Bai, Lidan He, and Zhi Liu. 2023. Forecasting realized volatility with machine learning: Panel data perspective. Journal of Empirical Finance 73: 251–71. [Google Scholar] [CrossRef]
Figure 1. Out-of-sample forecasts of conditional volatility from GARCH and MEM specifications for Weekly S&P 500 Index Returns. Note: I use a rolling window of 2034 weekly observations to estimate the parameters of the GARCH and MEM specifications via quasi-maximum likelihood. The forecasts of conditional volatility, which are expressed as annualized percentage rates, are for week one of January 1985 to week four of December 2023 in the upper two panels and for week three of March 1985 to week four of December 2023 in the lower two panels. The overall sample period is January 1946 to December 2023.
Figure 1. Out-of-sample forecasts of conditional volatility from GARCH and MEM specifications for Weekly S&P 500 Index Returns. Note: I use a rolling window of 2034 weekly observations to estimate the parameters of the GARCH and MEM specifications via quasi-maximum likelihood. The forecasts of conditional volatility, which are expressed as annualized percentage rates, are for week one of January 1985 to week four of December 2023 in the upper two panels and for week three of March 1985 to week four of December 2023 in the lower two panels. The overall sample period is January 1946 to December 2023.
Risks 13 00190 g001
Figure 2. Out-of-sample forecasts of conditional volatility from GARCH and MEM specifications for Monthly S&P 500 Index Returns. Note: I use a rolling window of 468 monthly observations to estimate the parameters of the GARCH and MEM specifications via quasi-maximum likelihood. The forecasts of conditional volatility, which are expressed as annualized percentage rates, are for January 1985 to December 2023 in the upper two panels and for December 1985 to December 2023 in the lower two panels. The overall sample period is January 1946 to December 2023.
Figure 2. Out-of-sample forecasts of conditional volatility from GARCH and MEM specifications for Monthly S&P 500 Index Returns. Note: I use a rolling window of 468 monthly observations to estimate the parameters of the GARCH and MEM specifications via quasi-maximum likelihood. The forecasts of conditional volatility, which are expressed as annualized percentage rates, are for January 1985 to December 2023 in the upper two panels and for December 1985 to December 2023 in the lower two panels. The overall sample period is January 1946 to December 2023.
Risks 13 00190 g002
Table 1. Monte Carlo study of the relative estimation errors for unconditional and conditional variances.
Table 1. Monte Carlo study of the relative estimation errors for unconditional and conditional variances.
Panel A
r ˜ 2 ( t i , t i + K ) σ ˜ K 2 1 r 2 ( t i , t i + K ) σ K 2 1 v ˜ 2 ( t i , t i + K ) σ ˜ K 2 1 v 2 ( t i , t i + K ) σ K 2 1
K  MEMAERMSEMEMAERMSEMEMAERMSEMEMAERMSE
1 0.000 1.015 1.617 0.000 1.015 1.618 0.000 1.015 1.617 0.000 1.015 1.618
5 0.000 1.027 1.692 0.000 1.025 1.666 0.000 0.615 0.864 0.000 0.612 0.856
21 0.000 1.013 1.734 0.000 1.005 1.635 0.000 0.432 0.576 0.000 0.417 0.548
63 0.000 0.997 1.698 0.000 0.984 1.532 0.000 0.317 0.412 0.000 0.284 0.362
126 0.000 0.987 1.617 0.000 0.972 1.446 0.000 0.244 0.313 0.000 0.204 0.258
252 0.000 0.978 1.537 0.000 0.966 1.407 0.000 0.181 0.229 0.000 0.153 0.194
Panel B
r ˜ 2 ( t i , t i + K ) σ ˜ 2 ( t i , t i + K ) 1 r 2 ( t i , t i + K ) σ 2 ( t i , t i + K ) 1 v ˜ 2 ( t i , t i + K ) σ ˜ 2 ( t i , t i + K ) 1 v 2 ( t i , t i + K ) σ 2 ( t i , t i + K ) 1
K  MEMAERMSEMEMAERMSEMEMAERMSEMEMAERMSE
1 0.000 0.968 1.414 0.000 0.968 1.415 0.000 0.968 1.414 0.000 0.968 1.415
5 0.000 0.990 1.559 0.000 0.988 1.533 0.000 0.537 0.732 0.000 0.534 0.723
21 0.000 0.995 1.676 0.000 0.988 1.579 0.000 0.380 0.506 0.000 0.362 0.476
63 0.000 0.993 1.684 0.000 0.980 1.519 0.000 0.300 0.391 0.000 0.265 0.338
126 0.000 0.985 1.613 0.000 0.971 1.441 0.000 0.238 0.306 0.000 0.197 0.249
252 0.000 0.978 1.536 0.000 0.965 1.406 0.000 0.178 0.227 0.000 0.150 0.191
Note: I use Monte Carlo integration to document the performance of competing estimators of σ ˜ K 2 : = var ( r ˜ ( t i , t i + K ) ) , σ K 2 : = var ( r ( t i , t i + K ) ) , σ ˜ 2 ( t i , t i + K ) : = var ( r ˜ ( t i , t i + K ) | F ( t i ) ) , and σ 2 ( t i , t i + K ) : = var ( r ( t i , t i + K ) | F ( t i ) ) , where r ˜ ( t i , t i + K ) and r ( t i , t i + K ) denote the log return and return for the K-period interval that begins at time t i and ends at time t i + K and F ( t i ) = { r ˜ ( t 0 , t 1 ) , , r ˜ ( t i 1 , t i ) } . The columns labeled ME, MAE, and RMSE report the mean, mean absolute, and root mean square values of the relative estimation errors for the indicated estimators. The analysis is carried out using a data generating process (DGP) of the form shown in Equations (14) and (15), where ω = 0 , β = 0.8754 , α = 4.554 × 10 6 , γ = 127.0 , and z ˜ ( t i , t i + 1 ) NID ( 0 , 1 ) . First, I generate the sequence { r ˜ ( t i , t i + 1 } i = 0 K T 1 with κ = 0 and T = 10000000 . Next, I construct the sequence { r ( t i , t i + 1 } i = 0 K T 1 by setting κ = 1 / 2 and computing r ( t i , t i + 1 ) = exp ( κ σ ˜ 2 ( t i , t i + 1 ) + r ˜ ( t i , t i + 1 ) ) 1 for all i. Finally, I calculate v ˜ 2 ( t i , t i + K ) = j = 1 K r ˜ 2 ( t i + j 1 , t i + j ) + 2 j = 1 K 1 r ˜ ( t i + j 1 , t i + j ) r ˜ ( t i + j , t i + j + 1 ) and v 2 ( t i , t i + K ) = j = 1 K R 2 ( t i , t i + j 1 ) r 2 ( t i + j 1 , t i + j ) + 2 j = 1 K 1 R ( t i , t i + j 1 ) r ( t i + j 1 , t i + j ) R ( t i , t i + j ) r ( t i + j , t i + j + 1 ) along with σ ˜ 2 ( t i , t i + K ) and σ 2 ( t i , t i + K ) for all i { 0 , K , 2 K , , ( T 1 ) K } , where R ( t i , t i + j ) = 1 + r ( t i , t i + j ) . Simple algebra yields σ ˜ 2 ( t i , t i + K ) = K σ ˜ 1 2 + ( 1 ρ ) 1 ( 1 ρ K ) ( σ ˜ 2 ( t i , t i + 1 ) σ ˜ 1 2 ) and σ ˜ K 2 = K σ ˜ 1 2 , where ρ = β + α γ 2 and σ ˜ 1 2 = ( 1 ρ ) 1 ( ω + α ) . To obtain analytic expressions for σ 2 ( t i , t i + K ) = E [ R 2 ( t i , t i + K ) | F ( t i ) ] 1 and σ K 2 = E [ R 2 ( t i , t i + K ) ] 1 , I rely on results from the option pricing literature (see Heston and Nandi 2000, for details). Specifically, it is well known that E [ R τ ( t i , t i + K ) | F ( t i ) ] = exp ( a K ( τ ) + b K ( τ ) σ ˜ 2 ( t i , t i + 1 ) ) under the DGP for the study, where a K ( τ ) and b K ( τ ) are given by the recurrence relations a K ( τ ) = a K 1 ( τ ) + ω b K 1 ( τ ) 1 2 log ( 1 2 α b K 1 ( τ ) ) and b K ( τ ) = τ ( κ + γ ) 1 2 γ 2 + β b K 1 ( τ ) + ( 1 / 2 ) ( τ γ ) 2 1 2 α b K 1 ( τ ) with a 0 ( τ ) = b 0 ( τ ) = 0 . Setting τ = 2 produces an expression for σ 2 ( t i , t i + K ) + 1 , which in turn yields σ K 2 + 1 = exp ( a K ( 2 ) ) E [ exp ( b K ( 2 ) σ ˜ 2 ( t i , t i + 1 ) ) ] by the law of iterated expectations. Because E [ exp ( ξ ( z + ν ) 2 ) ] = ( 1 2 ξ ) M M 1 / 2 exp ( ν 2 ξ ( 1 2 ξ ) 1 ) given that z N ( 0 , 1 ) , the law of iterated expectations also implies that E [ exp ( b K ( 2 ) σ ˜ 2 ( t i , t i + 1 ) ) ] = exp ( j = 1 ω c j 1 ( 1 / 2 ) log ( 1 2 α c j 1 ) ) , where c j satisfies the recurrence relation c j = β c j 1 + α c j 1 ( 1 2 α c j 1 ) 1 γ 2 with c 0 = b K ( 2 ) .
Table 2. Selected properties of rolling-window parameter estimates for GARCH and MEM specifications.
Table 2. Selected properties of rolling-window parameter estimates for GARCH and MEM specifications.
Panel A: Weekly log returns
GARCH with s-step-ahead conditional variancesMEM with s-step-ahead conditional variances
ϕ δ φ λ
sMinMeanMaxMinMeanMaxMinMeanMaxMinMeanMax
1 0.943 0.963 0.977 0.098 0.129 0.198 0.959 0.966 0.975 0.143 0.173 0.200
3 0.937 0.968 0.992 0.051 0.116 0.194 0.936 0.964 0.975 0.160 0.210 0.307
6 0.949 0.976 0.994 0.027 0.093 0.208 0.930 0.961 0.976 0.140 0.205 0.313
12 0.968 0.983 0.994 0.019 0.054 0.091 0.971 0.984 0.992 0.027 0.046 0.064
Panel B: Weekly returns
1 0.946 0.964 0.977 0.097 0.126 0.193 0.961 0.967 0.976 0.143 0.170 0.194
3 0.941 0.968 0.991 0.056 0.116 0.191 0.938 0.965 0.976 0.159 0.208 0.304
6 0.952 0.976 0.994 0.029 0.093 0.195 0.933 0.962 0.977 0.140 0.199 0.307
12 0.968 0.983 0.994 0.020 0.054 0.091 0.972 0.984 0.992 0.028 0.048 0.069
Panel C: Monthly log returns
1 0.874 0.940 0.975 0.056 0.087 0.132 0.821 0.867 0.922 0.472 0.552 0.667
3 0.867 0.942 0.971 0.091 0.129 0.173 0.872 0.936 0.963 0.151 0.196 0.251
6 0.861 0.934 0.964 0.084 0.141 0.175 0.882 0.929 0.960 0.212 0.449 0.812
12 0.749 0.898 0.939 0.204 0.347 0.760 0.874 0.921 0.955 0.242 0.351 0.423
Panel D: Monthly returns
1 0.880 0.942 0.975 0.060 0.091 0.136 0.845 0.889 0.937 0.404 0.493 0.613
3 0.861 0.941 0.971 0.102 0.137 0.180 0.875 0.937 0.963 0.154 0.211 0.276
6 0.853 0.931 0.963 0.094 0.145 0.185 0.888 0.934 0.963 0.214 0.356 0.569
12 0.749 0.890 0.938 0.178 0.321 0.762 0.885 0.923 0.954 0.245 0.342 0.416
Note: The table reports selected properties of limited-memory parameter estimates that are computed using s-step-ahead forecasts of the conditional variances of weekly and monthly S&P 500 index returns. The forecasts of the conditional variances are generated by GARCH(1,1) and MEM(1,1) specifications of the form shown in Equations (16) through (18) and Equations (19) through (21). I conduct the analysis using a quasi-maximum likelihood (QML) approach that employs a rolling window of W + s 1 observations to estimate the parameters. In particular, I construct the estimated values of h ( t + 1 , s ) and m ( t + 1 , s ) for t = N + W 1 using a window of observations that begins in period N and ends in period N + W + s 1 , where N { 1 , , T W s + 1 } . To compute the log quasi-likelihood functions, I treat z ( t ) as an NID ( 0 , 1 ) random variable and u ( t ) as a serially-independent exponential random variable with a rate parameter of one. Note that this methodology produces horizon-tuned forecasts of the conditional variances because the estimates of ϕ , δ , φ , and λ are specific to the value of s under consideration. I specify W = 2034 for the weekly data and W = 468 for the monthly data, which is 50% of the number of available observations in each case. The sample period is January 1946 to December 2023.
Table 3. Tests of equal predictive ability using realized measures constructed from daily observations.
Table 3. Tests of equal predictive ability using realized measures constructed from daily observations.
Panel A: Weekly log returns
GARCH forecasts (s-step-ahead)MEM forecasts (s-step-ahead) H 0 : E [ Δ L ( t + s ) ] = 0
sMEMAERMSEMSEMEMAERMSEMSE Δ L ¯ ( s ) t-statpval
1 0.067 1.083 2.172 4.716 0.039 1.061 2.127 4.522 0.022 1.41 0.157
3 0.167 1.198 3.604 12.992 0.096 1.124 2.440 5.953 0.075 1.52 0.129
6 0.211 1.262 3.802 14.459 0.170 1.216 3.481 12.118 0.046 1.75 0.079
12 0.205 1.269 3.671 13.476 0.153 1.235 3.615 13.068 0.034 2.30 0.021
Panel B: Weekly returns
1 0.062 1.077 2.063 4.258 0.033 1.054 2.010 4.042 0.023 1.58 0.115
3 0.150 1.179 3.178 10.100 0.088 1.115 2.293 5.258 0.064 1.55 0.120
6 0.197 1.244 3.462 11.987 0.151 1.195 3.150 9.921 0.049 1.93 0.054
12 0.193 1.255 3.401 11.567 0.144 1.223 3.345 11.191 0.032 2.31 0.021
Panel C: Monthly log returns
1 0.080 1.104 2.535 6.425 0.059 0.971 1.900 3.612 0.133 2.83 0.005
3 0.153 1.174 2.655 7.049 0.057 1.040 2.284 5.217 0.134 4.10 0.000
6 0.160 1.187 2.586 6.689 0.010 1.068 2.130 4.536 0.119 2.36 0.018
12 0.157 1.195 2.705 7.316 0.026 1.077 2.301 5.293 0.118 2.89 0.004
Panel D: Monthly returns
1 0.064 1.080 2.177 4.740 0.068 0.956 1.727 2.982 0.124 3.28 0.001
3 0.137 1.150 2.336 5.458 0.052 1.029 2.028 4.114 0.121 4.04 0.000
6 0.136 1.158 2.272 5.163 0.026 1.043 1.873 3.510 0.115 2.61 0.009
12 0.129 1.164 2.343 5.490 0.022 1.066 2.043 4.173 0.098 3.02 0.003
Note: The table reports the results of tests of equal predictive ability for the S&P 500 index. The tests are conducted using the s-step-ahead variance forecasts produced by GARCH(1,1) and MEM(1,1) models for weekly and monthly observations (see Equations (16) through (18) and Equations (19) through (21) for details). I use a quasi-maximum likelihood approach that employs a rolling window of W + s 1 observations to estimate the parameters, which produces horizon-tuned forecasts because the estimates of ϕ , δ , φ , and λ are specific to the value of s under consideration. In particular, I construct the estimated values of h ( t + 1 , s ) and m ( t + 1 , s ) for t = N + W 1 using a window of observations that begins in period N and ends in period N + W + s 1 , where N { 1 , , T W s + 1 } . The estimated values of h ( t + 1 , s ) and m ( t + 1 , s ) are denoted by h ^ ( t + 1 , s ) and m ^ ( t + 1 , s ) . I base the tests on the criterion Δ L ( t + s ) = | e ^ 2 ( t + s ) h ^ ( t + 1 , s ) 1 | | e ^ 2 ( t + s ) m ^ ( t + 1 , s ) 1 | , t = W , W + 1 , , T s , where e ^ 2 ( t + s ) = ( y ( t + s ) μ ^ ( t + 1 , s ) ) 2 and μ ^ ( t + 1 , s ) is the sample mean of { r ( t W + 1 ) , , r ( t ) } . The null hypothesis is H 0 : E [ Δ L ( t + s ) ] = 0 . Hence, inference is conducted using the t-statistic for Δ L ¯ ( s ) = 1 T W s + 1 t = W T s Δ L ( t + s ) . If Δ L ¯ ( s ) is positive and statistically significant, then the test indicates that the s-step-ahead MEM forecasts outperform the s-step-ahead GARCH(1,1) forecasts under the specified loss function. The initial eight columns report the mean, mean absolute, root mean square, and mean square values of e ^ 2 ( t + s ) / h ^ ( t + 1 , s ) 1 and e ^ 2 ( t + s ) / m ^ ( t + 1 , s ) 1 for the four choices of s: 1, 3, 6, and 12. I specify W = 2034 for the weekly data and W = 468 for the monthly data, which is 50% of the number of available observations in each case. The sample period is January 1946 to December 2023.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kirby, C. Using Daily Stock Returns to Estimate the Unconditional and Conditional Variances of Lower-Frequency Stock Returns. Risks 2025, 13, 190. https://doi.org/10.3390/risks13100190

AMA Style

Kirby C. Using Daily Stock Returns to Estimate the Unconditional and Conditional Variances of Lower-Frequency Stock Returns. Risks. 2025; 13(10):190. https://doi.org/10.3390/risks13100190

Chicago/Turabian Style

Kirby, Chris. 2025. "Using Daily Stock Returns to Estimate the Unconditional and Conditional Variances of Lower-Frequency Stock Returns" Risks 13, no. 10: 190. https://doi.org/10.3390/risks13100190

APA Style

Kirby, C. (2025). Using Daily Stock Returns to Estimate the Unconditional and Conditional Variances of Lower-Frequency Stock Returns. Risks, 13(10), 190. https://doi.org/10.3390/risks13100190

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop