Abstract
This paper suggests a new approach to evaluate realized covariance (RCOV) estimators via their predictive power on return density. By jointly modeling returns and RCOV measures under a Bayesian framework, the predictive density of returns and ex-post covariance measures are bridged. The forecast performance of a covariance estimator can be assessed according to its improvement in return density forecasting. Empirical applications to equity data show that several RCOV estimators consistently perform better than others and emphasize the importance of RCOV selection in covariance modeling and forecasting.
1. Introduction
The past two decades have seen dramatic growth in the amount of literature on estimating and modeling realized covariance (RCOV) measures. On the one hand, various methods have been proposed to extract covariation information from noisy and non-synchronous high-frequency data. On the other hand, the literature on RCOV modeling has focused on improving models’ flexibility and predictability. Which RCOV measure leads to superior out-of-sample forecasting is an important but not fully answered question. This paper suggests a joint return and RCOV modeling approach to assess RCOV measures based on return density forecasts. This direction of research contributes not only to the ex-post covariance estimation literature by proposing a new evaluation method but also to the RCOV modeling literature as we show that the choice of estimator matters to a model’s predictability.
Most studies regarding ex-post covariance estimation focus on developing a high-frequency covariance estimator that can accommodate market microstructure noise and non-synchronous trading without losing consistency and efficiency. Andersen et al. (2003) and Barndorff-Nielsen and Shephard (2004) build the theoretical foundation of RCOV in an ideal setting. Zhang et al. (2005) suggest the subsampling approach and Zhang (2011) designs a two-scales realized covariance (TSRC). Griffin and Oomen (2011) analyze the statistical properties of RCOV with lead-lag adjustments (RCLL). Barndorff-Nielsen et al. (2011) design a multivariate realized kernel (RK) based on refresh-time based returns. Christensen et al. (2010) extend the idea of pre-averaging in Jacod et al. (2009) to the multivariate setting and propose pre-averaged covariance estimators. Aït-Sahalia et al. (2010) introduce a quasi-maximum likelihood approach to estimate the ex-post covariance. Other estimation methods include those of Hayashi and Yoshida (2005), Voev and Lunde (2007), Hansen et al. (2008), Bannouh et al. (2009), Tao et al. (2011), Corsi et al. (2015), Peluso et al. (2015) and Lunde et al. (2016).
A common practice for assessing the accuracy of covariance estimators is via simulation-based exercises. For example, works by Voev and Lunde (2007), Jacod et al. (2009), Aït-Sahalia et al. (2010), Barndorff-Nielsen et al. (2011), Corsi et al. (2015), and Peluso et al. (2015) compare several RCOV estimators via simulation studies. This approach is useful in studying estimation accuracy and efficiency but cannot evaluate estimators’ empirica performance.
An alternative RCOV evaluation approach is based on out-of-sample portfolio performance. For example, among competing covariance estimators, the estimator that leads to the least volatile minimum-variance portfolio is preferred. In the context of portfolio optimization, de Pooter et al. (2008) evaluate the choice of sampling frequency in constructing RCOVs. Fan et al. (2012) study covariance matrix estimation from the perspective of portfolio selection with gross-exposure constraints. Corsi et al. (2015) and Lunde et al. (2016) conduct portfolio allocation experiments to compare their proposed estimators with benchmarks. Compared with statistical evaluation based on simulation studies, the mean-variance portfolio optimization provides an indirect criterion for assessing estimator performance from an economic perspective.
Recent developments in RCOV modeling facilitate the application of realized measures to forecast future covariance. Gourieroux et al. (2009) pioneer RCOV modeling by suggesting a non-central Wishart distribution to accommodate the positivity and symmetry of RCOV measures. Golosnoy et al. (2012) introduce a conditional autoregressive Wishart (CAW) model, and Yu et al. (2017) suggest a generalized CAW. Jin and Maheu (2013) propose a class of joint return-RCOV models based on Wishart distributions with additive or multiplicative components. Chiriac and Voev (2010), Bauer and Vorkink (2011) and Cech and Barunik (2017) apply standard time-series methods to model transformations of RCOV. Hansen et al. (2014) propose a multivariate GARCH model incorporating RCOV measures. Noureldin et al. (2012) design a multivariate high-frequency volatility (HEAVY) model and Opschoor et al. (2018) extend the HEAVY model to allow for better fitting. Jin and Maheu (2016) introduce a Bayesian nonparametric framework for RCOV modeling. Asai and McAleer (2015), Jin et al. (2019) and Shen et al. (2020) design factor models for RCOV. Amendola et al. (2020) propose a strategy based on the model confidence set to evaluate a group of multivariate volatility models. The literature surrounding RCOV modeling emphasizes the flexibility and predictability of models, while the practical importance of selecting RCOV measures has typically been ignored. In most works, RCOV constructed using low-frequency returns and realized kernel are the main choices of ex-post covariance measures.
Investigating the predictive power of RCOV measures is important to both academia and industry. However, directly measuring the accuracy of covariance prediction is infeasible as volatility is unobservable. In a univariate setting, Aït-Sahalia and Mancini (2008) rely on simulated data to compare out-of-sample forecasts of two realized volatility (RV) estimators. This paper suggests an approach based on return density forecasts to evaluate RCOV estimators. Our approach is inspired by several works on joint return-RCOV modeling, such as Noureldin et al. (2012), Jin and Maheu (2013) and Jin and Maheu (2016). By jointly modeling returns and RCOV measures under a Bayesian framework, the predictive density of returns and ex-post covariance measures are bridged, which allows the use of observed returns as the criterion for evaluating RCOV estimators. The density forecast improvement offered by a covariance estimator can be quantified using the predictive likelihood of returns. We evaluate a group of RCOV estimators using three joint return-RCOV models with different specifications. Empirical results support that the density-forecast-based method is an efficient way of assessing the out-of-sample performance of RCOV estimators. Our results also show that with regard to the pursuit of better predictability, the choice of RCOV estimator is as important as the model specification.
Compared with the evaluation method based on portfolio allocation, the density-forecast-based approach requires stochastic assumptions on returns and RCOV estimates, but offers several advantages. First, predictive likelihood reflects the accuracy of forecasting the return distribution. In contrast, portfolio analysis typically only considers the first two moments of the distribution. Second, portfolio exercise requires a reasonable long out-of-sample period to summarize portfolio performance, whereas the predictive likelihood measures the density forecast at each period and is not sensitive to the out-of-sample size. Third, forming a portfolio reduces the data dimension from multivariate to univariate. As a result, the difference among various RCOV measures may be averaged out and not be revealed based on portfolio returns. In contrast, the density forecast approach directly uses return vectors as the criterion, which reduces potential information loss. Overall, the density-forecast-based approach offers a direct and improved way to evaluate the out-of-sample performance of RCOV measures.
This paper is organized as follows. Section 2 provides a review of seven commonly used ex-post covariance estimation approaches. Section 3 discusses joint return-RCOV models, prediction and comparison criteria. Section 4 summarizes the data. Empirical results are reported in Section 5. Section 6 concludes, followed by an Appendix A.
2. Review of Ex-Post Covariance Estimation
Suppose the d dimensional log price follows a continuous stochastic process
where is a vector of drift terms, is a instantaneous volatility matrix and is a vector of standard Brownian motions. As shown in Andersen et al. (2003),
where is the quadratic covariation which measures the covariance of the log return over time .
Let , denotes the observed intraday price of asset j, where represents its arrival time and is the number of observations on day t. For most high-frequency covariance estimators, data synchronization is required. Under the previous-tick scheme with grid length h, regularly-spaced log prices are sampled as
Let , where , represents the intraday return vector over time on day t.
An alternative data synchronization approach is the refresh time scheme proposed by Barndorff-Nielsen et al. (2011), under which prices are sampled when all asset prices are refreshed. The refreshed price is sampled as
where , and denotes the number of asset j’s observations before time . Unlike previous-tick data, refresh time sampled prices are irregularly spaced. The return under the refresh time scheme is denoted as , where .
In reality, price observations are contaminated with market microstructure noise. Furthermore, different assets have different trading frequencies and their prices are not realized simultaneously. As a result, the estimation of the diagonal elements of the covariance matrix suffers from upward bias and the off-diagonal estimates have downward bias, especially when the sampling frequency is high. The remaining part of this section reviews seven ex-post covariance estimation approaches.
2.1. Realized Covariance
The simplest RCOV estimator based on synchronized returns is defined as
RC is a consistent estimator of the quadratic covariation if observations are free of error and arrived simultaneously. However, RC is not robust to microstructure noise and non-synchronous trading, which restricts forming RC using returns sampled at high frequencies.
2.2. Subsampled Realized Covariance
The formation of realized covariance using sparsely sampled data controls estimation bias, but eliminates considerable amounts of informative data. Zhang et al. (2005) suggest that an improved ex-post volatility estimator can be constructed by averaging subsampled estimators. Each subsampled estimator is formed using returns with the same sampling frequency but different starting points. For example, the 5-min time interval could start from 9:31 a.m. or 9:32 a.m., instead of 9:30 a.m. Subsampled realized covariance (SRC) with K subsampled groups is defined as
where is the return vector over an alternative subsample that shifts the grid in (3) by . As noted by Zhang et al. (2005), subsampling reduces the variance of covariance estimates but fails to eliminate the bias.
2.3. Two-Scales Realized Covariance
Zhang et al. (2005) propose a way to correct the bias of subsampled realized variance using information over two time scales. Zhang (2011) develops a multivariate extension and introduces the two-scales covariance estimator, which is robust to non-synchronous trading and microstructure noise. The two-scales covariance estimator between assets g and l is given as
where , and . The diagonal elements of are the two-scales realized variances (TSRV) defined in Zhang et al. (2005). TSRV of the asset is calculated as
where is the average of K subsampled RV of asset l.
2.4. Realized Covariance with Lead-Lag Adjustments
Adding lead-lag realized autocovariance terms to the RC defined in Equation (5) reduces the downward bias caused by non-synchronous trading (Scholes and Williams 1977; Dimson 1979), and mitigates the upward bias in realized variance due to microstructure noise (Hansen and Lunde 2006). The RCOV with lead and lag adjustments (RCLL) is defined as
where
is the realized autocovariance matrix and is the Bartlett-kernel weight.
2.5. Realized Kernel
Barndorff-Nielsen et al. (2011) design a multivariate realized kernel (RK) by integrating lead-lag autocovariance adjustments, suitably chosen kernel weight functions and a refresh-time sampling scheme. RK is a consistent and positive semi-definite covariance estimator. Based on refresh time sampled data, RK is defined as
where is the Parzen kernel function1 and the bandwidth H is determined as . For the Parzen kernel function, . is the noise-to-signal ratio for the lth asset, which can be estimated as , as suggested in Barndorff-Nielsen et al. (2009).
2.6. Pre-Averaged Realized Covariance
Christensen et al. (2010) extend the pre-averaging method introduced in Jacod et al. (2009) to the multivariate setting and propose a class of pre-averaged realized covariance (PARC) estimators. The idea behind the pre-averaging method is that noise can be averaged away by averaging high-frequency data. Christensen et al. (2010) show that PARC remains efficient in a setting with non-synchronous trading. Based on synchronized data, PARC is defined as
where is the pre-averaged return defined as
A conservative window length can be set as . More detailed discussion of the window length can be found in Christensen et al. (2010).
The cumulative covariance (HY) estimator introduced by Hayashi and Yoshida (2005) can be applied directly to raw observations without data synchronization. Christensen et al. (2010) developed the pre-averaged version of the HY estimator (PAHY), which estimates the covariance between assets g and l as
where is the indicator function.
2.7. Quasi-Maximum Likelihood Covariance Estimator
Aït-Sahalia et al. (2010) introduce a covariance estimator based on the quasi-maximum likelihood (QML) estimation. The quasi-maximum likelihood covariance (QMLC) estimator between assets g and l is based on the following:
where is estimated using the quasi-maximum likelihood method. The QMLC estimation method does not require adjustment of tuning parameters, but yields only pairwise estimates. Diagonal elements of the covariance matrix can be estimated using the QML volatility estimation method introduced in Xiu (2010).
2.8. Regularization
Portfolio allocation and covariance modeling typically require the covariance matrix to be positive definite. We apply the regularization method in Hautsch et al. (2012) to convert ill-conditioned matrices into positive definite matrices using the following steps.
- i.
- Decompose the non-positive definite covariance matrix as , where C is the correlation matrix and is a matrix with standard deviations on the diagonal. Decompose the correlation matrix as , where is the diagonal matrix of eigenvalues and Q is the matrix of eigenvectors.
- ii.
- Calculate threshold value . Eigenvalues less than are replaced by , where k is the number of eigenvalues greater than .
- iii.
- The positive definite matrix is reconstructed as , where and is the matrix with updated eigenvalues.
3. Joint Return-RCOV Models
We consider three joint return-RCOV models with different distributional assumptions and volatility specifications to evaluate the predictive power of RCOV estimators. Let represent the information set up to t, where represents the series of d-dimensional return vectors and denotes RCOV matrices over t periods.
3.1. Inverse-Wishart Additive Model
Jin and Maheu (2013) introduce a joint return-RCOV model based on Wishart distribution with additive components. They suggest decomposing the scale matrix in the Wishart density into several additive components formed by past RCOVs. Later, Jin and Maheu (2016) find that the inverse-Wishart framework offers superior out-of-sample performance over the Wishart version. The joint inverse-Wishart additive (IW-A) model is defined as
where denotes an inverse-Wishart distribution with degrees of freedom and scale matrix . The conditional mean of is , which is fully determined according to parameters and 2. is a positive-definite matrix. for and ’s are vectors. is the additive component defined as the average of the past over terms. The first component is equal to by setting . and are treated as parameters such that the sizes of past RCOVs in the second and third components can be determined endogenously.
In Bayesian inference, the model is estimated through Markov chain Monte Carlo (MCMC) techniques. The parameter set includes and . Following the prior specifications in Jin and Maheu (2016), we set the priors of and all elements of ’s as , the prior of as and the priors of and as discrete uniform distribution . To avoid identification issues, we impose and the first element of being positive as prior restrictions. A Metropolis-Hastings algorithm with a joint random walk proposal is used to sample and . The proposal for sampling and is a random walk with Poisson increments. is computed as , where is the sample average of RCOVs, following the RCOV targeting technique. Any draws with a singular matrix are dropped. Additional details of sampling steps are collected in the Appendix A.
3.2. Conditional Autoregressive Wishart Model
We extend the conditional autoregressive Wishart (CAW) model proposed in Golosnoy et al. (2012) to a joint return-RCOV model by linking RCOV estimates and returns via Equation (19). The joint CAW model is given as
where is a Wishart distribution with degrees of freedom and scale matrix . s and s and C are matrices and C is positive definite. In addition to the distributional difference, the CAW model differs from IW-A in that it assumes conditional covariance has an autoregressive structure. depends on its lagged value as well as the previous RCOVs, which technically suggests that all past are accountable for explaining .
We adapt the diagonal version of CAW with , which restricts and to be diagonal matrices. The priors of and diagonal elements of and are all and the first diagonal elements of and are restricted to be positive. The prior of is . Similar to the estimation of in the IW-A model, C is set to , where is the sample average of RCOVs. Other model parameters are sampled using the Metropolis-Hastings algorithm with a multivariate random walk proposal. Additional details of posterior sampling are presented in the Appendix A.
3.3. HEAVY Model
The multivariate high-frequency-based volatility (HEAVY) model introduced by Noureldin et al. (2012) is also considered. We add Student-t innovations to the HEAVY model as follows:
where and are positive definite matrices, and , , and are diagonal matrices. and are the degrees of freedom of the Student-t and Wishart distributions, respectively. The HEAVY model exploits the conditional covariance of low-frequency returns and the conditional mean of RCOV in a GARCH-like setting. Equations (22) and (24) are similar to a multivariate GARCH(1,1) model with replaced by . Unlike IW-A and CAW models that assume RCOV is an unbiased measure of return covariance, the HEAVY model estimates return covariance conditional on both return and RCOV information.
For Bayesian inference, we set the priors of all diagonal elements of , , and to be , and the priors of and are and . and are calculated using RCOV targeting. Metropolis-Hastings sampling steps with a multivariate random walk proposal are used for posterior simulation. Additional details of posterior sampling are presented in the Appendix A.
3.4. Prediction
Given that covariances are not observable, while returns are, the density forecast of returns provides a fair benchmark to evaluate the out-of-sample performance of RCOV estimators. Conditional on a particular model and information set , the predictive density of the next-period return is given as
where is the density conditional on parameter set and the information set at time t and is the posterior density. Based on G MCMC outputs, the predictive likelihood is computed by integrating out the parameter uncertainty as
where .
Let and represent the IW-A, CAW and HEAVY models, respectively. For the IW-A model, after integrating out , the conditional distribution of is a multivariate Student-t given as:
For the CAW model, the distribution of conditional on and is given as
can be approximated by averaging , where and .
For the HEAVY model, conditional on the draw of model parameters, the predictive density of is
There are several differences among the three models’ predictive densities of return. Under the HEAVY model, the degrees of freedom and covariance matrix are estimated conditional on both returns and RCOVs, while the IW-A or CAW model relies on RCOV data only to infer the covariance and kurtosis of return densities. Another distinction is that the three models capture time-series dependence in RCOVs differently. IW-A determines the size of historical RCOVs endogenously by learning from the data. In contrast, all past RCOVs are involved in explaining future covariance under the CAW and HEAVY models.
The density forecast improvement offered by a RCOV estimator can be measured by the predictive likelihood, which is the predictive density evaluated at . Let stand for the information set contains estimator . The log-predictive likelihood () conditional on over the out-of-sample period from to T is
One could compare the predictive likelihoods conditional on different information set within the same model. Based on the model , the log-Bayes factor (log-BF) of RCOV estimator versus is defined as , where is preferred if the log-Bayes factor is positive. To investigate subsample density forecast performance, the cumulative log-Bayes factor, which is a sequence of log-Bayes factors, is computed as follows.
An increasing trend suggests that estimator consistently outperforms over the out-of-sample period. Similarly, we could compare any two models via the log-Bayes factor by conditioning on the same information set .
Mean-variance portfolio analysis requires point predictions of the covariance matrix. Given G MCMC outputs, the predictive means of the next-period covariance matrix in IW-A, CAW and HEAVY models are computed as follows:
4. Data
The transaction prices of 20 equities from 2 January 2002 to 31 December 2014 are obtained from the TAQ database, and the data from 2 January 2015 to 31 December 2018 are obtained from Tick Data. The raw intraday data are cleaned following the method used by Barndorff-Nielsen et al. (2009).
The returns are defined as the difference between log prices and are scaled by 100. We compute ex-post covariance matrix estimates in both 10 and 20 dimensions. BAC, CAT, DIS, GS, IBM, JNJ, KO, PG, WMT and XOM compose the 10 assets group A, and the 20 assets group contains an additional 10 assets: AXP, C, CVX, HD, HON, JPM, MCD, NKE, PFE, and VZ3. The last 10 assets form the 10 assets group B.
Twenty RCOV measures are constructed using the seven estimation approaches summarized in Section 2. They are (i) RC based on 10-min, 5-min and 1-min (, and ), (ii) 10-min SRC with 10 and 20 subsamples (, ) and 5-min SRC with 5 and 10 subsamples (, ), (iii) two-scales estimators: , and , (iv) 1-min and 30-sec RCLL with and (, , and ), (v) RK, (vi) Pre-averaged RC based on 1-min, 30-s and refresh-time data ( and and ) and pre-averaged HY estimator, (vi) QMLC estimator. Table 1 lists the twenty estimators and their synchronization schemes and provides statistical summaries of the diagonal and off-diagonal elements of the covariance matrix estimates in the 20 assets case.
Table 1.
List of RCOV measures.
5. Empirical Results
Each of the twenty RCOV measures is jointly modeled with returns using the three models discussed in Section 3. The out-of-sample forecasts are computed recursively from 19 October 2006 to 31 December 2018, a total of 3070 days. The estimation on the initial day of the out-of-sample is based on 10,000 MCMC runs, after dropping 10,000 burn-in draws. As new data arrive, model parameters are re-estimated based on 5000 MCMC results, after 1000 burn-in draws4.
5.1. Density Forecasts
Table 2, Table 3 and Table 4 report the sum of log-predictive likelihoods of next-period returns for the out-of-sample period under IW-A, CAW and HEAVY models conditional on various RCOV measures. The performance of RCOV measures can be visualized in Figure 1, Figure 2 and Figure 3, which plot the log-predictive Bayes factors for RCOV measures against in the three asset cases. In almost all cases, pre-averaging estimators based on previous-tick returns provide the best density forecast improvement. For example, switching from to increases the log-predictive likelihood by a minimum of 187.0 (HEAVY, 10 assets—A) to a maximum of 1126.6 (IW-A, 20 assets). Two-scales and subsampling approaches lead to the second and third best-performing groups. RK and QMLC offer improved density forecast results compared with RCLLs, but are not significantly better than . The evaluation results are consistent across modeling frameworks, data dimensions and asset groups. In order to investigate the prior robustness of RCOV evaluation results, we compute the log-predictive likelihoods of the IW-A model under two additional sets of priors. As shown in Table 5, the rankings of RCOV measures remain unchanged under more informative or more sparse priors, which suggests the density-forecast-based RCOV evaluation method is robust to prior assumptions.
Table 2.
Predictive likelihoods of return (IW-A model).
Table 3.
Predictive likelihoods of return (CAW model).
Table 4.
Predictive likelihoods of return (HEAVY model).
Figure 1.
Log Bayes factors (10 assets—Group A).
Figure 2.
Log Bayes factors (10 assets—Group B).
Figure 3.
Log Bayes factors (20 assets).
Table 5.
Prior sensitivity check.
Figure 4 plots the cumulative log-Bayes factors of several representative estimators (, , and ) against in the three cases according to the IW-A model. The decreasing trend in Figure 4 suggests that the ranking of RCOV estimator is robust in subsample periods.
Figure 4.
Cumulative log-Bayes factors of vs. alternative RC estimators.
The density forecast results confirm several theoretical expectations and findings in the literature. The ranking of TSRC, SRC and RC is consistent with the conclusion in Zhang et al. (2005) that the two-scales estimator has a smaller bias than the subsampled or sparsely-sampled RC. A comparison of SRC estimators confirms that it is better to form a subsampled estimator with low-frequency data and more subsamples. The out-of-sample performance of the RC deteriorates as the sampling frequency increases, which validates the use of low-frequency RC in most empirical studies. The out-of-sample performance of RC and RCLL matches the theoretical results reported by Griffin and Oomen (2011), in which for a fixed sampling frequency, increasing the lead and lag terms reduces the estimation bias. Our results also show that PAHY underperforms PARCs, which is consistent with the finite sample result of PAHY documented by Christensen et al. (2010).
The variation in return density forecasts suggests that the choice of RCOV measure matters greatly with regard to prediction. For example, in the 20 assets application using the IW-A model, switching from to increases the predictive likelihood from −80,200.9 to −76,645.5, a log-Bayes factor of 3555.4. Most RCOV modeling works try to improve the forecasts by adjusting the model specifications and stochastic assumptions, while our results shed light on a different perspective; that is, the choice of RCOV estimator is also important in the pursuit of better predictability.
The density forecast results also show that the comparison of RCOV models could be sensitive to the choice of RCOV. For example, when using or as the RCOV data, the IW-A model produces the best density forecast results, followed by HEAVY and CAW. In contrast, HEAVY performs better than IW-A for most of the other measures. Among the three joint models, IW-A has the highest sensitivity to RCOV inputs. Taking the 10 assets group A as an example, the log Bayes factor between the best and worst estimators is over 1300 under the IW-A model, while the predictive likelihood ranges under CAW and HEAVY are around 1000 and 400, respectively. Different distributional assumptions and model specifications are potential reasons for the sensitivity difference. In the HEAVY model, the return covariance matrix is estimated conditional on both returns and RCOVs. Therefore, the additional information from returns mitigates poor RCOV measures’ negative influence but diminishes the prediction improvement offered by good RCOV measures. Compared with the IW-A model, the CAW model captures stronger volatility persistence, so its prediction is less sensitive to newly arrived RCOV data. The IW-A and CAW models also differ in RCOV distributional assumptions, which further contributes to the forecasting results differences.
In addition to the one-period ahead density forecasts, we investigate the performance of RCOV measures based on long horizon density forecasts. Table 6 shows 5-period and 10-period ahead log-predictive likelihoods5 under the IW-A model across the three asset groups. Multiple horizon density forecasts provide a similar ranking of the twenty estimators, compared with the results taken from Table 2, Table 3 and Table 4.
Table 6.
Predictive likelihoods of returns over long horizons.
5.2. Portfolio Allocation
In this section, we evaluate the out-of-sample performance of RCOV estimators from a portfolio optimization perspective. Through forming mean-variance portfolios using predicted covariance, the predictive performance of RCOV estimators can be indirectly assessed based on portfolio performance measures such as standard deviation or Sharpe ratio. Given the predictive covariance of next-period returns, the optimal weight can be obtained by solving the following optimization problem:
where . Under IW-A, CAW and HEAVY models, are calculated according to Equations (35)–(37). The realized portfolio return and the out-of-sample portfolio variance is given as
The estimator that leads to the smallest is considered to be the best6.
Table 7 reports the standard deviations of global minimum variance (GMV) portfolios based on the IW-A model7 with various RCOV measures. The comparison results based on portfolio exercises are generally consistent with those obtained from the density forecasts. For example, , , TSRC(60,1), and lead to portfolios with relatively low variance. However, the difference among standard deviations of GMV portfolios is marginal. To further investigate whether the difference is significant, we apply the model confidence set (MCS) introduced by Hansen et al. (2011) to obtain a set of estimators that includes the optimal one. Table 7 provides MCS test p-values for each covariance matrix estimator. In the 10-asset group A, estimators excluded are , , , , RK, PAHY and QMLC at the 25% significance level. The model confidence set could exclude underperforming RCOV measures, but fails to suggest the optimal one.
Table 7.
Standard deviations of global minimum variance portfolio returns (IW-A model).
Despite the fact that both the density-forecast and portfolio-based methods are able to eliminate several inferior RCOV estimators, the latter fails to suggest outperforming ones. Compared with the portfolio-based method, the density forecast method is more direct as it ranks RCOV based on density forecasts of multivariate return vectors, rather than univariate portfolio measures. Another drawback of the portfolio-based method is that ranking of covariance estimators is sensitive to the choice of out-of-sample size and significance measurement8.
5.3. Close-to-Close Data
Previous empirical works use open-to-close data, which only account for information during trading hours. To investigate the robustness of our approach, we evaluate RCOV measures based on density forecasts of close-to-close returns. Following Fleming et al. (2003) and de Pooter et al. (2008), the close-to-close covariance matrix is formed by summing intraday covariance and the outer product of overnight returns.
where is a realized measure over trading hours and stands for the overnight log returns, which are formed using the opening price on day t and the closing price on day .
Table 8 reports the log-predictive likelihood of close-to-close returns using the HEAVY and CAW models. Even though adding the common overnight covariance component makes the competing covariance estimators more similar, the proposed approach is still sufficiently robust to evaluate estimators, and the rankings remain relatively consistent. , , , and remain the top-performing RCOV measures.
Table 8.
Predictive likelihoods of close-to-close return.
6. Conclusions
Existing methods of evaluating RCOV estimators empirically rely on portfolio analysis, which compare RCOV measures indirectly using univariate measures such as portfolio standard deviation or Sharpe ratio. The comparison of RCOV measures’ predictive power on return density forecasts is not well investigated in the existing literature. This paper fills the gap by suggesting a density-forecast-based method to evaluate RCOV measures. Given that covariances are not observable, while returns are, the joint modeling of returns and RCOVs enables the evaluation of RCOV estimators via return density forecasts. We test the empirical predictive power of a list of popular RCOV estimators and found several estimators consistently outperform others. The density-forecast-based evaluation method is robust to various RCOV models, datasets, data dimensions and forecast horizons. Another important insight is that the RCOV measures should be carefully selected in covariance modeling, as the choice of RCOV measures can significantly impact a model’s forecasting performance.
Author Contributions
Conceptualization, J.L. and Q.Y.; methodology, X.J., J.L. and Q.Y.; software, X.J.; validation, X.J.; formal analysis, X.J., J.L. and Q.Y.; resources, X.J., J.L. and Q.Y.; data curation, J.L.; writing—original draft preparation, J.L. and Q.Y.; writing—review and editing, X.J., J.L. and Q.Y. All authors have read and agreed to the published version of the manuscript.
Funding
Jin’s research is supported by NSFC through Project 71773069. Liu thanks the FGSR internal grant of Saint Mary’s University for financial support. Yang thanks the start-up fund of ShanghaiTech and Young Scientists Fund of NSFC (Project 72103137) for financial support.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The data presented in this study are obtained from NYSE Trade and Quote (TAQ) database and Tick Data (https://www.tickdata.com/, accessed on April 2020).
Acknowledgments
Earlier versions of this paper were presented at the 2021 NBER-NSF SBIES Conference and 2021 China Meeting of the Econometric Society. We would like to thank those who participated in these meetings for their valuable comments and suggestions.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
This section illustrates the MCMC sampling steps for IW-A, CAW and HEAVY models.
Appendix A.1. IW-A Model Estimation Steps
The parameters to be sampled are . We divide them into the following three blocks and iteratively sample them from their conditional posterior distributions.
is sampled by a Gibbs sampler as its conditional posterior is a multivariate normal:
where and .
The joint conditional posterior of is
We sample by applying a Metropolis-Hastings (MH) algorithm with random walk sampler. A multivariate normal serves as the proposal distribution. Given the current values in the MCMC chain, the new proposal is accepted with probability
The joint conditional posterior of is
and are sequentially sampled by MH algorithms with the following proposal density:
and we set . The new proposal is accepted with probability
Appendix A.2. CAW Model Estimation Steps
The parameters set contains , where and are vectors of diagonal elements of and , respectively, for . We iteratively sample the following two blocks of parameters in Bayesian estimation.
The same Gibbs step of sampling in the IW-A model estimation is used to sample since the two models share the same conditional posterior of . The joint conditional posterior of is
We sample by applying a MH algorithm with random walk sampler similarly to step 2 in the estimation of the IW-A model. The new proposal is accepted with probability
Appendix A.3. HEAVY Model Estimation Steps
The parameter set to be sampled includes , where and are vectors of diagonal elements of and , respectively, for . We sample all the parameters in one block. The joint posterior is
where and . We apply a MH algorithm with random walker proposal to sample . The new proposal is accepted with probability
Notes
| 1 | Parzen kernel function:
|
| 2 | ⊙ denotes the element-by-element (Hadamard) product of two matrices. |
| 3 | The company names are: American Express, Bank of American, Citigroup, Caterpillar, Chevron, Disney, Goldman Sachs, Home Depot, Honeywell, International Business Machine, Johnson and Johnson, JPMorgan Chase, Coca-Cola, McDonald, Nike, Pfizer, Procter and Gamble, Verizon Communication, Walmart and Exxon Mobile. |
| 4 | Initial values of parameters in a new sample are set to be the posterior mean of the previous sample. This could make the Markov chain converge quickly and reduce the computation cost. |
| 5 | The h-period ahead predictive likelihood is the predictive density evaluated at the realized return . , which can be calculated based on MCMC outputs similar to Equation (27). |
| 6 | As indicated by Patton and Sheppard (2009), the true variance-covariance that generates the out-of-sample portfolio variance must be the smallest. |
| 7 | We only report the GMV portfolio results based on the IW-A model. CAW and HEAVY models provide similar results. |
| 8 | For example, how numerically small is will be seen as significant. Besides Equation (39), tracking error portfolios (Patton and Sheppard 2009) and utility-based framework (Fleming et al. 2003) are alternative measurements with different economic intuition. |
References
- Aït-Sahalia, Yacine, Jianqing Fan, and Dacheng Xiu. 2010. High-Frequency Covariance Estimates With Noisy and Asynchronous Financial Data. Journal of the American Statistical Association 105: 1504–17. [Google Scholar] [CrossRef]
- Amendola, Alessandra, Manuela Braione, Vincenzo Candila, and Giuseppe Storti. 2020. A model confidence set approach to the combination of multivariate volatility forecasts. International Journal of Forecasting 36: 873–91. [Google Scholar] [CrossRef]
- Andersen, Torben G., Tim Bollerslev, Francis X. Diebold, and Paul Labys. 2003. Modeling and Forecasting Realized Volatility. Econometrica 71: 579–625. [Google Scholar] [CrossRef]
- Asai, Manabu, and Michael McAleer. 2015. Forecasting co-volatilities via factor models with asymmetry and long memoryt in realized covariance. Journal of Econometrics 189: 251–62. [Google Scholar] [CrossRef]
- Aït-Sahalia, Yacine, and Loriano Mancini. 2008. Out of sample forecasts of quadratic variation. Journal of Econometrics 147: 17–33. [Google Scholar] [CrossRef]
- Bannouh, Karim, Dick van Dijk, and Martin Martens. 2009. Range-Based Covariance Estimation Using High-Frequency Data: The Realized Co-Range. Journal of Financial Econometrics 7: 341–72. [Google Scholar] [CrossRef][Green Version]
- Barndorff-Nielsen, Ole E., Peter Reinhard Hansen, Asger Lunde, and Neil Shephard. 2009. Realized kernels in practice: Trades and quotes. Econometrics Journal 12: C1–C32. [Google Scholar] [CrossRef]
- Barndorff-Nielsen, Ole E., Peter Reinhard Hansen, Asger Lunde, and Neil Shephard. 2011. Multivariate realised kernels: Consistent positive semi-definite estimators of the covariation of equity prices with noise and non-synchronous trading. Journal of Econometrics 162: 149–69. [Google Scholar] [CrossRef]
- Barndorff-Nielsen, Ole E., and Neil Shephard. 2004. Econometric analysis of realized covariation: High frequency based covariance, regression, and correlation in financial economics. Econometrica 72: 885–925. [Google Scholar] [CrossRef]
- Bauer, Gregory, and Keith Vorkink. 2011. Forecasting multivriate realized stock market volatility. Journal of Econometrics 160: 79–109. [Google Scholar] [CrossRef]
- Cech, Frantsek, and Jozef Barunik. 2017. On the modelling and forecasting of multivariate realzied volatility: Generalized heterogeneous autoregressive (GHAR) model. Journal of Forecasting 36: 181–206. [Google Scholar] [CrossRef]
- Chiriac, Roxana, and Valeri Voev. 2010. Modelling and forecasting multivariate realized volatility. Journal of Applied Econometrics 26: 922–47. [Google Scholar] [CrossRef]
- Christensen, Kim, Silja Kinnebrock, and Mark Podolskij. 2010. Pre-averaging estimators of the ex-post covariance matrix in noisy diffusion models with non-synchronous data. Journal of Econometrics 159: 116–33. [Google Scholar] [CrossRef]
- Corsi, Fulvio, Stefano Peluso, and Francesco Audrino. 2015. Missing in asynchronicity: A kalman-em approach for multivariate realized covariance estimation. Journal of Applied Econometrics 30: 377–97. [Google Scholar] [CrossRef]
- de Pooter, Michiel, Martin Martens, and Dick van Dijk. 2008. Predicting the daily covariance matrix for s&p 100 stocks using intraday data—But which frequency to use? Econometric Reviews 27: 199–229. [Google Scholar] [CrossRef]
- Dimson, Elroy. 1979. Risk measurement when shares are subject to infrequent trading. Journal of Financial Economics 7: 197–226. [Google Scholar] [CrossRef]
- Fan, Jianqing, Yingying Li, and Ke Yu. 2012. Vast volatility matrix estimation using high-frequency data for portfolio selection. Journal of the American Statistical Association 107: 412–28. [Google Scholar] [CrossRef]
- Fleming, Jeff, Chris Kirby, and Barbara Ostdiek. 2003. The economic value of volatility timing using realized volatility. Journal of Financial Economics 67: 473–509. [Google Scholar] [CrossRef]
- Golosnoy, Vasyl, Bastian Gribisch, and Roman Liesenfeld. 2012. The conditional autoregressive wishart model for multivariate stock market volatility. Journal of Econometrics 167: 211–23. [Google Scholar] [CrossRef]
- Gourieroux, Christian, Joann Jasiak, and Razvan Sufana. 2009. The wishart autoregressive process of multivariate stochastic volatility. Journal of Econometrics 150: 167–81. [Google Scholar] [CrossRef]
- Griffin, Jim E., and Roel C. A. Oomen. 2011. Covariance measurement in the presence of non-synchronous trading and market microstructure noise. Journal of Econometrics 160: 58–68. [Google Scholar] [CrossRef]
- Hansen, Peter, Jeremy Large, and Asger Lunde. 2008. Moving Average-Based Estimators of Integrated Variance. Econometric Reviews 27: 79–111. [Google Scholar] [CrossRef]
- Hansen, Peter R., and Asger Lunde. 2006. Realized variance and market microstructure noise. Journal of Business & Economic Statistics 24: 127–61. [Google Scholar]
- Hansen, Peter R., Asger Lunde, and James M. Nason. 2011. The model confidence set. Econometrica 79: 453–97. [Google Scholar] [CrossRef]
- Hansen, Peter Reinhard, Asger Lunde, and Valeri Voev. 2014. Realized beta GARCH: A multivariate GARCH model with realized measures of volatility. Journal of Applied Econometrics 29: 774–99. [Google Scholar] [CrossRef]
- Hautsch, Nikolaus, Lada M. Kyj, and Roel C. A. Oomen. 2012. A blocking and regularization approach to high-dimensional realized covariance estimation. Journal of Applied Econometrics 27: 625–45. [Google Scholar] [CrossRef]
- Hayashi, Takaki, and Nakahiro Yoshida. 2005. On covariance estimation of non-synchronously observed diffusion processes. Bernoulli 11: 359–79. [Google Scholar] [CrossRef]
- Jacod, Jean, Yingying Li, Per A. Mykland, Mark Podolskij, and Mathias Vetter. 2009. Microstructure noise in the continuous case: The pre-averaging approach. Stochastic Processes and Their Applications 119: 2249–76. [Google Scholar] [CrossRef]
- Jin, Xin, and John Maheu. 2013. Modeling realized covariances and returns. Journal of Financial Econometrics 11: 335–69. [Google Scholar] [CrossRef]
- Jin, Xin, and John Maheu. 2016. Bayesian semiparametric modeling of realized covariance matrices. Journal of Econometrics 192: 19–39. [Google Scholar] [CrossRef]
- Jin, Xin, John Maheu, and Qiao Yang. 2019. Bayesian parametric and semiparametric factor models for large realized covariance matrices. Journal of Applied Econometrics 34: 641–60. [Google Scholar] [CrossRef]
- Lunde, Asger, Neil Shephard, and Kevin Sheppard. 2016. Econometric analysis of vast covariance matrices using composite realized kernels and their application to portfolio choice. Journal of Business & Economic Statistics 34: 504–18. [Google Scholar]
- Noureldin, Diaa, Neil Shephard, and Kevin Sheppard. 2012. Multivariate high-frequency-based volatility (HEAVY) models. Journal of Applied Econometrics 27: 907–33. [Google Scholar] [CrossRef]
- Opschoor, Anne, Pawel Janus, Andre Lucas, and Dick Van Dijk. 2018. New Heavy models for fat-tailed realized covariance and returns. Journal of Business and Economic Statistics 36: 643–57. [Google Scholar] [CrossRef]
- Patton, Andrew, and Kevin Sheppard. 2009. Evaluating volatility and correlation forecasts. Chapter 15. In Handbook of Financial Time Series. Berlin/Heidelberg: Springer, pp. 801–38. [Google Scholar]
- Peluso, Stefano, Fulvio Corsi, and Antonietta Mira. 2015. A bayesian high-frequency estimator of the multivariate covariance of noisy and asynchronous returns. Journal of Financial Econometrics 13: 665–97. [Google Scholar] [CrossRef][Green Version]
- Scholes, Myron, and Joseph Williams. 1977. Estimating betas from nonsynchronous data. Journal of Financial Economics 5: 309–27. [Google Scholar] [CrossRef]
- Shen, Keren, Jianfeng Yao, and Wai Keung Li. 2020. Forecasting high-dimensional realized volatility matyrices using a factor model. Quantitative Finance 20: 1879–87. [Google Scholar] [CrossRef]
- Tao, Minjing, Yazhen Wang, Qiwei Yao, and Jian Zou. 2011. Large volatility matrix inference via combining low-frequency and high-frequency approaches. Journal of the American Statistical Association 106: 1025–40. [Google Scholar] [CrossRef]
- Voev, Valeri, and Asger Lunde. 2007. Integrated Covariance Estimation using High-frequency Data in the Presence of Noise. Journal of Financial Econometrics 5: 68–104. [Google Scholar] [CrossRef]
- Xiu, Dacheng. 2010. Quasi-maximum likelihood estimation of volatility with high frequency data. Journal of Econometrics 159: 235–50. [Google Scholar] [CrossRef]
- Yu, Philip, Wai Keung Li, and Fo Chun Ng. 2017. The generalized conditional autoregressive wishart model for multivariate realized volatility. Journal of Business and Economic Statistics 35: 513–27. [Google Scholar] [CrossRef]
- Zhang, Lan. 2011. Estimating covariation: Epps effect, microstructure noise. Journal of Econometrics 160: 33–47. [Google Scholar] [CrossRef]
- Zhang, Lan, Per A. Mykland, and Yacine Aït-Sahalia. 2005. A tale of two time scales: Determining integrated volatility with noisy high-frequency data. Journal of the American Statistical Association 100: 1394–11. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).