Does the Choice of Realized Covariance Measures Empirically Matter? A Bayesian Density Prediction Approach

Xin Jin; Jia Liu; Qiao Yang

doi:10.3390/econometrics9040045

¹ School of Economics, Shanghai University of Finance and Economics, Shanghai 200433, China

² Sobey School of Business, Saint Mary’s University, Halifax, NS B3H 3C3, Canada

³ School of Entrepreneurship and Management, ShanghaiTech University, Shanghai 201210, China

* Author to whom correspondence should be addressed.

Econometrics 2021, 9(4), 45; https://doi.org/10.3390/econometrics9040045

Abstract

This paper suggests a new approach to evaluate realized covariance (RCOV) estimators via their predictive power on return density. By jointly modeling returns and RCOV measures under a Bayesian framework, the predictive density of returns and ex-post covariance measures are bridged. The forecast performance of a covariance estimator can be assessed according to its improvement in return density forecasting. Empirical applications to equity data show that several RCOV estimators consistently perform better than others and emphasize the importance of RCOV selection in covariance modeling and forecasting.

Keywords: realized covariance; forecast comparison; density forecast; high-frequency data

1. Introduction

The past two decades have seen dramatic growth in the amount of literature on estimating and modeling realized covariance (RCOV) measures. On the one hand, various methods have been proposed to extract covariation information from noisy and non-synchronous high-frequency data. On the other hand, the literature on RCOV modeling has focused on improving models’ flexibility and predictability. Which RCOV measure leads to superior out-of-sample forecasting is an important but not fully answered question. This paper suggests a joint return and RCOV modeling approach to assess RCOV measures based on return density forecasts. This direction of research contributes not only to the ex-post covariance estimation literature by proposing a new evaluation method but also to the RCOV modeling literature as we show that the choice of estimator matters to a model’s predictability.

Most studies regarding ex-post covariance estimation focus on developing a high-frequency covariance estimator that can accommodate market microstructure noise and non-synchronous trading without losing consistency and efficiency. Andersen et al. (2003) and Barndorff-Nielsen and Shephard (2004) build the theoretical foundation of RCOV in an ideal setting. Zhang et al. (2005) suggest the subsampling approach and Zhang (2011) designs a two-scales realized covariance (TSRC). Griffin and Oomen (2011) analyze the statistical properties of RCOV with lead-lag adjustments (RCLL). Barndorff-Nielsen et al. (2011) design a multivariate realized kernel (RK) based on refresh-time based returns. Christensen et al. (2010) extend the idea of pre-averaging in Jacod et al. (2009) to the multivariate setting and propose pre-averaged covariance estimators. Aït-Sahalia et al. (2010) introduce a quasi-maximum likelihood approach to estimate the ex-post covariance. Other estimation methods include those of Hayashi and Yoshida (2005), Voev and Lunde (2007), Hansen et al. (2008), Bannouh et al. (2009), Tao et al. (2011), Corsi et al. (2015), Peluso et al. (2015) and Lunde et al. (2016).

A common practice for assessing the accuracy of covariance estimators is via simulation-based exercises. For example, works by Voev and Lunde (2007), Jacod et al. (2009), Aït-Sahalia et al. (2010), Barndorff-Nielsen et al. (2011), Corsi et al. (2015), and Peluso et al. (2015) compare several RCOV estimators via simulation studies. This approach is useful in studying estimation accuracy and efficiency but cannot evaluate estimators’ empirical performance.

An alternative RCOV evaluation approach is based on out-of-sample portfolio performance. For example, among competing covariance estimators, the estimator that leads to the least volatile minimum-variance portfolio is preferred. In the context of portfolio optimization, de Pooter et al. (2008) evaluate the choice of sampling frequency in constructing RCOVs. Fan et al. (2012) study covariance matrix estimation from the perspective of portfolio selection with gross-exposure constraints. Corsi et al. (2015) and Lunde et al. (2016) conduct portfolio allocation experiments to compare their proposed estimators with benchmarks. Compared with statistical evaluation based on simulation studies, the mean-variance portfolio optimization provides an indirect criterion for assessing estimator performance from an economic perspective.

Recent developments in RCOV modeling facilitate the application of realized measures to forecast future covariance. Gourieroux et al. (2009) pioneer RCOV modeling by suggesting a non-central Wishart distribution to accommodate the positivity and symmetry of RCOV measures. Golosnoy et al. (2012) introduce a conditional autoregressive Wishart (CAW) model, and Yu et al. (2017) suggest a generalized CAW. Jin and Maheu (2013) propose a class of joint return-RCOV models based on Wishart distributions with additive or multiplicative components. Chiriac and Voev (2010), Bauer and Vorkink (2011) and Cech and Barunik (2017) apply standard time-series methods to model transformations of RCOV. Hansen et al. (2014) propose a multivariate GARCH model incorporating RCOV measures. Noureldin et al. (2012) design a multivariate high-frequency volatility (HEAVY) model and Opschoor et al. (2018) extend the HEAVY model to allow for better fitting. Jin and Maheu (2016) introduce a Bayesian nonparametric framework for RCOV modeling. Asai and McAleer (2015), Jin et al. (2019) and Shen et al. (2020) design factor models for RCOV. Amendola et al. (2020) propose a strategy based on the model confidence set to evaluate a group of multivariate volatility models. The literature surrounding RCOV modeling emphasizes the flexibility and predictability of models, while the practical importance of selecting RCOV measures has typically been ignored. In most works, RCOV constructed using low-frequency returns and realized kernel are the main choices of ex-post covariance measures.

Investigating the predictive power of RCOV measures is important to both academia and industry. However, directly measuring the accuracy of covariance prediction is infeasible as volatility is unobservable. In a univariate setting, Aït-Sahalia and Mancini (2008) rely on simulated data to compare out-of-sample forecasts of two realized volatility (RV) estimators. This paper suggests an approach based on return density forecasts to evaluate RCOV estimators. Our approach is inspired by several works on joint return-RCOV modeling, such as Noureldin et al. (2012), Jin and Maheu (2013) and Jin and Maheu (2016). By jointly modeling returns and RCOV measures under a Bayesian framework, the predictive density of returns and ex-post covariance measures are bridged, which allows the use of observed returns as the criterion for evaluating RCOV estimators. The density forecast improvement offered by a covariance estimator can be quantified using the predictive likelihood of returns. We evaluate a group of RCOV estimators using three joint return-RCOV models with different specifications. Empirical results support that the density-forecast-based method is an efficient way of assessing the out-of-sample performance of RCOV estimators. Our results also show that with regard to the pursuit of better predictability, the choice of RCOV estimator is as important as the model specification.

Compared with the evaluation method based on portfolio allocation, the density-forecast-based approach requires stochastic assumptions on returns and RCOV estimates, but offers several advantages. First, predictive likelihood reflects the accuracy of forecasting the return distribution. In contrast, portfolio analysis typically only considers the first two moments of the distribution. Second, a portfolio exercise requires a reasonably long out-of-sample period to summarize portfolio performance, whereas the predictive likelihood measures the density forecast at each period and is not sensitive to the out-of-sample size. Third, forming a portfolio reduces the data dimension from multivariate to univariate. As a result, the differences among various RCOV measures may be averaged out and not be revealed by portfolio returns. In contrast, the density forecast approach directly uses return vectors as the criterion, which reduces potential information loss. Overall, the density-forecast-based approach offers a direct and improved way to evaluate the out-of-sample performance of RCOV measures.

This paper is organized as follows. Section 2 provides a review of seven commonly used ex-post covariance estimation approaches. Section 3 discusses joint return-RCOV models, prediction and comparison criteria. Section 4 summarizes the data. Empirical results are reported in Section 5. Section 6 concludes, followed by Appendix A.

2. Review of Ex-Post Covariance Estimation

Suppose the $d$-dimensional log price $P(t)$ follows a continuous stochastic process

d P (t) = m (t) d t + Π (t) d w (t),

(1)

where $m(t)$ is a $d \times 1$ vector of drift terms, $Π(t)$ is a $d \times d$ instantaneous volatility matrix and $w(t)$ is a $d \times 1$ vector of standard Brownian motions. As shown in Andersen et al. (2003),

P (t) - P (0) \sim N (\int_{0}^{t} m (τ) d τ, \int_{0}^{t} Π (τ) Π {(τ)}^{'} d τ),

(2)

where $\int_{0}^{t} Π(τ) Π(τ)^{'} dτ$ is the quadratic covariation, which measures the covariance of the log return over the interval $(0, t)$.

Let ${\tilde{p}}_{t, τ_{l}^{(j)}}^{(j)}$, $l = 1, \dots, n_{t}^{(j)}$, denote the $l^{th}$ observed intraday price of asset $j$, where $τ_{l}^{(j)}$ represents its arrival time and $n_{t}^{(j)}$ is the number of observations on day $t$. For most high-frequency covariance estimators, data synchronization is required. Under the previous-tick scheme with grid length $h$, regularly spaced log prices are sampled as

p_{t, i}^{(j)} = {\tilde{p}}_{t, max (τ_{l}^{(j)} | τ_{l}^{(j)} \leq i h)}^{(j)}, j = 1, \dots, d .

(3)

Let $R_{t, i} = (r_{t, i}^{(1)}, r_{t, i}^{(2)}, \dots, r_{t, i}^{(d)})$, where $r_{t, i}^{(j)} = p_{t, i}^{(j)} - p_{t, i - 1}^{(j)}$, represent the intraday return vector over the interval $((i - 1) h, i h)$ on day $t$.
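The previous-tick rule in Equation (3) can be sketched for a single asset as follows. This is a minimal Python/NumPy illustration under stated assumptions: the function name `previous_tick_returns`, the toy tick data and the 10-second grid are hypothetical and are not taken from the paper.

```python
import numpy as np

def previous_tick_returns(times, log_prices, grid):
    """Sample log prices on `grid` with the previous-tick rule, then difference.

    times      : sorted 1-D array of arrival times for one asset
    log_prices : log prices observed at those times
    grid       : regularly spaced sampling times (each >= times[0])
    """
    times = np.asarray(times, dtype=float)
    log_prices = np.asarray(log_prices, dtype=float)
    grid = np.asarray(grid, dtype=float)
    # Index of the last observation whose arrival time is <= the grid point.
    idx = np.searchsorted(times, grid, side="right") - 1
    if np.any(idx < 0):
        raise ValueError("a grid point precedes the first observation")
    sampled = log_prices[idx]
    # Intraday returns are first differences of the sampled log prices.
    return sampled, np.diff(sampled)

# Toy example (hypothetical data): irregular ticks sampled every 10 seconds.
tick_times = [0.0, 3.0, 11.0, 12.0, 27.0]
tick_log_prices = [1.00, 1.02, 1.01, 1.03, 1.05]
grid_prices, grid_returns = previous_tick_returns(
    tick_times, tick_log_prices, grid=[0.0, 10.0, 20.0, 30.0]
)
```

Applying the same function to each asset on a common grid and stacking the results column-wise gives the return vectors $R_{t, i}$.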

An alternative data synchronization approach is the refresh time scheme proposed by Barndorff-Nielsen et al. (2011), under which prices are sampled when all asset prices are refreshed. The $i^{th}$ refreshed price ${\dot{p}}_{t, i}^{(j)}$ is sampled as

{\dot{p}}_{t, i}^{(j)} = {\tilde{p}}_{t, max (τ_{l}^{(j)} | τ_{l}^{(j)} \leq s_{i})}^{(j)}, j = 1, \dots, d,

(4)

where $s_{1} = \max (τ_{1}^{(1)}, τ_{1}^{(2)}, \dots, τ_{1}^{(d)})$, $s_{i} = \max (τ_{{\dot{n}}_{s_{i - 1}}^{(1)} + 1}^{(1)}, τ_{{\dot{n}}_{s_{i - 1}}^{(2)} + 1}^{(2)}, \dots, τ_{{\dot{n}}_{s_{i - 1}}^{(d)} + 1}^{(d)})$ and ${\dot{n}}_{s_{i}}^{(j)}$ denotes the number of asset $j$’s observations before time $s_{i}$. Unlike previous-tick data, refresh time sampled prices are irregularly spaced. The return under the refresh time scheme is denoted as ${\dot{R}}_{t, i} = ({\dot{r}}_{t, i}^{(1)}, {\dot{r}}_{t, i}^{(2)}, \dots, {\dot{r}}_{t, i}^{(d)})$, where ${\dot{r}}_{t, i}^{(j)} = {\dot{p}}_{t, i}^{(j)} - {\dot{p}}_{t, i - 1}^{(j)}$.
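The refresh times $s_{1}, s_{2}, \dots$ can be computed with a simple pointer sweep over the assets' arrival times. The sketch below is illustrative only: the function name `refresh_times` and the toy arrival times are assumptions, and observations arriving exactly at a refresh time are treated as already used.

```python
import numpy as np

def refresh_times(arrival_times):
    """Return the refresh times for a list of sorted arrival-time arrays."""
    arrays = [np.asarray(t, dtype=float) for t in arrival_times]
    pointers = [0] * len(arrays)  # next unused observation of each asset
    out = []
    while all(p < len(a) for p, a in zip(pointers, arrays)):
        # First moment at which every asset has posted a new price.
        s = max(a[p] for p, a in zip(pointers, arrays))
        out.append(s)
        # Move each pointer past all observations at or before s.
        pointers = [int(np.searchsorted(a, s, side="right")) for a in arrays]
    return np.array(out)

# Toy example (hypothetical data) with two assets.
s = refresh_times([[1, 2, 5, 8], [3, 4, 9]])
```

Each asset's price at a refresh time is then its most recent observation at or before that time, exactly as in the previous-tick rule.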

In reality, price observations are contaminated with market microstructure noise. Furthermore, different assets have different trading frequencies and their prices are not realized simultaneously. As a result, the estimation of the diagonal elements of the covariance matrix suffers from upward bias and the off-diagonal estimates have downward bias, especially when the sampling frequency is high. The remaining part of this section reviews seven ex-post covariance estimation approaches.

2.1. Realized Covariance

The simplest RCOV estimator based on synchronized returns is defined as

{RC}_{t} = \sum_{i = 1}^{n} R_{t, i} R_{t, i}^{'} .

(5)

RC is a consistent estimator of the quadratic covariation if observations are free of error and arrive simultaneously. However, RC is not robust to microstructure noise and non-synchronous trading, which limits the use of returns sampled at high frequencies in forming RC.
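Equation (5) is a sum of outer products of the intraday return vectors, which reduces to a single matrix product when the returns are stored row-wise. A minimal sketch, assuming an $n \times d$ NumPy array of synchronized returns (the simulated returns below are placeholders, not data from the paper):

```python
import numpy as np

def realized_covariance(returns):
    """RC_t = sum_i R_{t,i} R_{t,i}' for an (n, d) array of intraday returns."""
    R = np.asarray(returns, dtype=float)
    return R.T @ R

# Placeholder data: 78 five-minute returns for 3 assets.
rng = np.random.default_rng(0)
R = rng.normal(scale=0.1, size=(78, 3))
rc = realized_covariance(R)
```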

2.2. Subsampled Realized Covariance

The formation of realized covariance using sparsely sampled data controls estimation bias, but eliminates considerable amounts of informative data. Zhang et al. (2005) suggest that an improved ex-post volatility estimator can be constructed by averaging subsampled estimators. Each subsampled estimator is formed using returns with the same sampling frequency but different starting points. For example, the 5-min time interval could start from 9:31 a.m. or 9:32 a.m., instead of 9:30 a.m. Subsampled realized covariance (SRC) with K subsampled groups is defined as

SRC {(K)}_{t} = \frac{1}{K} \sum_{k = 1}^{K} {RC}_{t}^{k}, \quad \text{where} \quad {RC}_{t}^{k} = \sum_{i = 1}^{n} R_{t, i_{k}} R_{t, i_{k}}^{'},

(6)

where $R_{t, i_{k}}$ is the return vector over an alternative subsample that shifts the grid in (3) by $h_{k / K}$. As noted by Zhang et al. (2005), subsampling reduces the variance of covariance estimates but fails to eliminate the bias.
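One way to implement the averaging in Equation (6) is to start from log prices on a finer grid and build the $K$ shifted grids by taking every $K$-th price from offsets $0, 1, \dots, K - 1$. The sketch below follows that assumption; the function name `subsampled_rc` and the toy price path are hypothetical.

```python
import numpy as np

def subsampled_rc(log_prices, K):
    """Average of K realized covariances computed on shifted sparse grids.

    log_prices : (m, d) array of log prices on a fine, regular grid
    K          : number of subsamples; each sparse grid uses every K-th price
    """
    P = np.asarray(log_prices, dtype=float)
    d = P.shape[1]
    total = np.zeros((d, d))
    for k in range(K):
        r = np.diff(P[k::K], axis=0)  # returns on the k-th shifted grid
        total += r.T @ r
    return total / K

# Toy one-asset example (hypothetical data).
P = np.array([[0.0], [1.0], [3.0], [6.0], [10.0]])
src = subsampled_rc(P, K=2)
```

With `K=1` the function reduces to the plain RC on the fine grid.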

2.3. Two-Scales Realized Covariance

Zhang et al. (2005) propose a way to correct the bias of subsampled realized variance using information over two time scales. Zhang (2011) develops a multivariate extension and introduces the two-scales covariance estimator, which is robust to non-synchronous trading and microstructure noise. The two-scales covariance estimator between assets g and l is given as

TSRC {(K, J)}_{t}^{(g, l)} = c_{n} (SRC {(K)}^{(g, l)} - \frac{{\bar{n}}_{K}}{{\bar{n}}_{J}} SRC {(J)}^{(g, l)}),

(7)

where ${\bar{n}}_{K} = \frac{n - K + 1}{K}$, ${\bar{n}}_{J} = \frac{n - J + 1}{J}$ and $c_{n} = \frac{n}{(K - J) \times {\bar{n}}_{K}}$. The diagonal elements of $TSRC {(K, J)}_{t}$ are the two-scales realized variances (TSRV) defined in Zhang et al. (2005). TSRV of the $l^{th}$ asset is calculated as

TSRV {(K, J)}_{t}^{(l)} = {(1 - \frac{{\bar{n}}_{K}}{{\bar{n}}_{J}})}^{- 1} (SRV {(K)}^{(l)} - \frac{{\bar{n}}_{K}}{{\bar{n}}_{J}} SRV {(J)}^{(l)}),

(8)

where $SRV {(K)}^{(l)}$ is the average of $K$ subsampled RVs of asset $l$.

2.4. Realized Covariance with Lead-Lag Adjustments

Adding lead-lag realized autocovariance terms to the RC defined in Equation (5) reduces the downward bias caused by non-synchronous trading (Scholes and Williams 1977; Dimson 1979), and mitigates the upward bias in realized variance due to microstructure noise (Hansen and Lunde 2006). The RCOV with lead and lag adjustments (RCLL) is defined as

RCLL {(U)}_{t} = \sum_{i = 1}^{n} R_{t, i} R_{t, i}^{'} + \sum_{l = - U}^{U} d_{l} Γ_{t, l},

(9)

where

Γ_{t, l} = \sum_{i = 1}^{n - l} R_{t, i + l} R_{t, i}^{'}

(10)

is the $l^{th}$ realized autocovariance matrix and $d_{l} = 1 - l / (U + 1)$ is the Bartlett-kernel weight.
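One reading of Equations (9) and (10) is sketched below in Python with NumPy. It adds each lag together with its transpose so that negative lags are covered, applies the weight $1 - l / (U + 1)$ for $l = 1, \dots, U$, and does not add the zero-lag term a second time. These choices are assumptions made for illustration, and the function name `rcll` is hypothetical.

```python
import numpy as np

def rcll(returns, U):
    """Realized covariance with U Bartlett-weighted lead and lag terms.

    returns : (n, d) array of synchronized intraday returns
    U       : number of leads/lags; U = 0 gives the plain RC
    """
    R = np.asarray(returns, dtype=float)
    out = R.T @ R  # zero-lag term (the RC of Equation (5))
    for l in range(1, U + 1):
        weight = 1.0 - l / (U + 1)        # Bartlett weight d_l
        gamma = R[l:].T @ R[:-l]          # sum_i R_{t,i+l} R_{t,i}'
        out += weight * (gamma + gamma.T) # lag l and its mirror image
    return out

# Toy one-asset check (hypothetical data).
R = np.array([[1.0], [2.0], [3.0]])
value = rcll(R, U=1)[0, 0]
```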

2.5. Realized Kernel

Barndorff-Nielsen et al. (2011) design a multivariate realized kernel (RK) by integrating lead-lag autocovariance adjustments, suitably chosen kernel weight functions and a refresh-time sampling scheme. RK is a consistent and positive semi-definite covariance estimator. Based on refresh time sampled data, RK is defined as

{RK}_{t} = \sum_{j = - H}^{H} (k (\frac{j}{H}) \sum_{i = j + 1}^{\dot{n}} {\dot{R}}_{t, i} {\dot{R}}_{t, i - j}^{'}),

(11)

where $k (\cdot)$ is the Parzen kernel function¹ and the bandwidth $H$ is determined as $H = c_{0} {\dot{n}}^{0.6} (d^{- 1} \sum_{l = 1}^{d} ξ_{l}^{0.8})$. For the Parzen kernel function, $c_{0} = 3.5143$. Here $ξ_{l}^{2}$ is the noise-to-signal ratio for the $l^{th}$ asset, which can be estimated as ${RV}_{dense}^{(l)} / (2 n {RV}_{sparse}^{(l)})$, as suggested in Barndorff-Nielsen et al. (2009).
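The kernel-weighted sum in Equation (11) can be sketched as follows, using the standard Parzen weight function. This is a simplified illustration under stated assumptions: the bandwidth `H` is passed in as a positive integer rather than computed from the noise-to-signal ratios, negative lags are taken as transposes of the positive-lag autocovariances, no end-point treatment is applied, and the function names are hypothetical.

```python
import numpy as np

def parzen(x):
    """Parzen kernel weight for a scalar argument."""
    x = abs(x)
    if x <= 0.5:
        return 1.0 - 6.0 * x**2 + 6.0 * x**3
    if x <= 1.0:
        return 2.0 * (1.0 - x) ** 3
    return 0.0

def realized_kernel(returns, H):
    """Parzen-weighted sum of realized autocovariances.

    returns : (n, d) array of refresh-time returns
    H       : positive integer bandwidth (assumed given)
    """
    R = np.asarray(returns, dtype=float)
    rk = R.T @ R  # lag 0, weight k(0) = 1
    for j in range(1, H + 1):
        w = parzen(j / H)
        if w == 0.0:
            continue
        gamma = R[j:].T @ R[:-j]      # sum_{i=j+1}^{n} R_i R_{i-j}'
        rk += w * (gamma + gamma.T)   # lags j and -j
    return rk

rng = np.random.default_rng(2)
X = rng.normal(scale=0.05, size=(200, 3))
rk = realized_kernel(X, H=5)
```

Because $k(1) = 0$, the lag $j = H$ receives zero weight, so `H=1` reproduces the plain realized covariance.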

2.6. Pre-Averaged Realized Covariance

Christensen et al. (2010) extend the pre-averaging method introduced in Jacod et al. (2009) to the multivariate setting and propose a class of pre-averaged realized covariance (PARC) estimators. The idea behind the pre-averaging method is that noise can be averaged away by averaging high-frequency data. Christensen et al. (2010) show that PARC remains efficient in a setting with non-synchronous trading. Based on synchronized data, PARC is defined as

{PARC}_{t} = \frac{n}{n - k_{n} + 2} \frac{12}{k_{n}} \sum_{i = 0}^{n - k_{n} + 1} {\bar{R}}_{t, i} {({\bar{R}}_{t, i})}^{'} .

(12)

where ${\bar{R}}_{t, i}$ is the pre-averaged return defined as

{\bar{R}}_{t, i} = \frac{1}{k_{n}} (\sum_{j = k_{n} / 2}^{k_{n} - 1} P_{t, i + j} - \sum_{j = 0}^{k_{n} / 2 - 1} P_{t, i + j}) .

(13)

A conservative window length can be set as $k_{n} = \sqrt{n}$. A more detailed discussion of the window length can be found in Christensen et al. (2010).

The cumulative covariance (HY) estimator introduced by Hayashi and Yoshida (2005) can be applied directly to raw observations without data synchronization. Christensen et al. (2010) developed the pre-averaged version of the HY estimator (PAHY), which estimates the covariance between assets g and l as

{PAHY}_{t}^{(g, l)} = \frac{16}{{(k_{n})}^{2}} \sum_{i = 0}^{n^{(g)} - k_{n} + 1} \sum_{j = 0}^{n^{(l)} - k_{n} + 1} {\bar{r}}_{t, i}^{(g)} {\bar{r}}_{t, j}^{(l)} \cdot 𝟙 ((τ_{i}^{(g)}, τ_{i + k_{n}}^{(g)}] \cap (τ_{j}^{(l)}, τ_{j + k_{n}}^{(l)}] \neq \emptyset),

(14)

where $𝟙 (\cdot)$ is the indicator function.

2.7. Quasi-Maximum Likelihood Covariance Estimator

Aït-Sahalia et al. (2010) introduce a covariance estimator based on the quasi-maximum likelihood (QML) estimation. The quasi-maximum likelihood covariance (QMLC) estimator between assets g and l is based on the following:

{QMLC}_{t}^{(g, l)} = \frac{1}{4} (\hat{var} (p_{t}^{(g)} + p_{t}^{(l)}) - \hat{var} (p_{t}^{(g)} - p_{t}^{(l)}))

(15)

where $\hat{var} (\cdot)$ is estimated using the quasi-maximum likelihood method. The QMLC estimation method does not require adjustment of tuning parameters, but yields only pairwise estimates. Diagonal elements of the covariance matrix can be estimated using the QML volatility estimation method introduced in Xiu (2010).

2.8. Regularization

Portfolio allocation and covariance modeling typically require the covariance matrix to be positive definite. We apply the regularization method in Hautsch et al. (2012) to convert ill-conditioned matrices into positive definite matrices using the following steps.

i. Decompose the non-positive definite covariance matrix as $Δ C Δ^{'}$, where $C$ is the correlation matrix and $Δ$ is a matrix with standard deviations on the diagonal. Decompose the correlation matrix as $C = Q Λ Q^{'}$, where $Λ = diag (λ_{1}, λ_{2}, \dots, λ_{d})$ is the diagonal matrix of eigenvalues and $Q$ is the matrix of eigenvectors.
ii. Calculate the threshold value $λ_{c} = (1 - \frac{λ_{max}}{d}) (1 + \frac{d}{n} + 2 \sqrt{\frac{d}{n}})$. Eigenvalues less than $λ_{c}$ are replaced by $\bar{λ} = \frac{1}{d - k} \sum_{λ_{i} < λ_{c}} \max (0, λ_{i})$, where $k$ is the number of eigenvalues greater than $λ_{c}$.
iii. The positive definite matrix is reconstructed as $Δ \tilde{C} Δ^{'}$, where $\tilde{C} = Q \tilde{Λ} Q^{'}$ and $\tilde{Λ}$ is the matrix with updated eigenvalues.
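The three steps above can be sketched in a few lines of NumPy. This is an illustrative reading of the steps as written here, not the implementation of Hautsch et al. (2012); the function name `regularize_covariance` and the example matrix are assumptions, and the input is assumed to have a strictly positive diagonal.

```python
import numpy as np

def regularize_covariance(cov, n):
    """Eigenvalue cleaning of an ill-conditioned covariance estimate.

    cov : (d, d) symmetric matrix with positive diagonal
    n   : number of intraday observations used to build `cov`
    """
    cov = np.asarray(cov, dtype=float)
    d = cov.shape[0]
    # Step i: split into standard deviations and a correlation matrix.
    std = np.sqrt(np.diag(cov))
    corr = cov / np.outer(std, std)
    eigval, eigvec = np.linalg.eigh(corr)
    # Step ii: threshold, then replace small eigenvalues by their
    # non-negative average.
    lam_c = (1 - eigval.max() / d) * (1 + d / n + 2 * np.sqrt(d / n))
    small = eigval < lam_c
    if small.any():
        lam_bar = np.maximum(0.0, eigval[small]).mean()
        eigval = np.where(small, lam_bar, eigval)
    # Step iii: rebuild the correlation and covariance matrices.
    corr_new = (eigvec * eigval) @ eigvec.T
    return corr_new * np.outer(std, std)

# Hypothetical indefinite input: its correlation matrix has eigenvalue -0.8.
std = np.array([1.0, 2.0, 3.0])
corr = np.array([[1.0, 0.9, 0.9], [0.9, 1.0, -0.9], [0.9, -0.9, 1.0]])
cleaned = regularize_covariance(corr * np.outer(std, std), n=78)
```

Note that if every replaced eigenvalue is non-positive, as in this toy input, the average $\bar{λ}$ is zero and the output is positive semi-definite rather than strictly positive definite.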

3. Joint Return-RCOV Models

We consider three joint return-RCOV models with different distributional assumptions and volatility specifications to evaluate the predictive power of RCOV estimators. Let $F_{t} \equiv \{R_{1 : t}, Σ_{1 : t}\}$ represent the information set up to $t$, where $R_{1 : t} = \{R_{1}, R_{2}, \dots, R_{t}\}$ represents the series of $d$-dimensional return vectors and $Σ_{1 : t} = \{Σ_{1}, Σ_{2}, \dots, Σ_{t}\}$ denotes RCOV matrices over $t$ periods.

3.1. Inverse-Wishart Additive Model

Jin and Maheu (2013) introduce a joint return-RCOV model based on the Wishart distribution with additive components. They suggest decomposing the scale matrix in the Wishart density into several additive components formed by past RCOVs. Later, Jin and Maheu (2016) find that the inverse-Wishart framework offers superior out-of-sample performance over the Wishart version. The joint inverse-Wishart additive (IW-A) model is defined as

\begin{matrix} R_{t} | μ, Σ_{t} & \sim & N (μ, Σ_{t}), \end{matrix}

(16)

\begin{matrix} Σ_{t} | V_{t}, ν & \sim & IW (ν, (ν - d - 1) V_{t}), \end{matrix}

(17)

\begin{matrix} V_{t} & = & B_{0} + \sum_{j = 1}^{3} B_{j} ⊙ Γ_{t - 1, ℓ_{j}}, Γ_{t - 1, ℓ} = \frac{1}{ℓ} \sum_{i = 1}^{ℓ} Σ_{t - i}, \end{matrix}

(18)

where $IW (ν, (ν - d - 1) V_{t})$ denotes an inverse-Wishart distribution with degrees of freedom $ν$ and scale matrix $(ν - d - 1) V_{t}$. The conditional mean of $Σ_{t}$ is $V_{t}$, which is fully determined by the parameters $B_{0 : 3}$, $ℓ_{1 : 3}$ and $Σ_{t - l : t - 1}$.² $B_{0}$ is a $d \times d$ positive-definite matrix. $B_{j} = b_{j} b_{j}^{'}$ for $j = 1, 2, 3$, where the $b_{j}$ are $d \times 1$ vectors. $Γ_{t - 1, ℓ_{j}}$ is the $j^{th}$ additive component, defined as the average of the past $Σ_{t}$ over $ℓ_{j}$ terms. The first component is equal to $Σ_{t - 1}$ by setting $ℓ_{1} = 1$. $ℓ_{2}$ and $ℓ_{3}$ are treated as parameters so that the number of past RCOVs in the second and third components is determined endogenously.

In Bayesian inference, the model is estimated through Markov chain Monte Carlo (MCMC) techniques. The parameter set $Θ$ includes $μ, ν, B_{0}, b_{1}, b_{2}, b_{3}, ℓ_{2}$ and $ℓ_{3}$. Following the prior specifications in Jin and Maheu (2016), we set the priors of $μ$ and all elements of the $b_{j}$ as $N (0, 100)$, the prior of $ν$ as $exp (100) I_{ν > d + 1}$ and the priors of $ℓ_{2}$ and $ℓ_{3}$ as the discrete uniform distribution $U (2, 200)$. To avoid identification issues, we impose $ℓ_{2} < ℓ_{3}$ and a positive first element of each $b_{j}$ as prior restrictions. A Metropolis-Hastings algorithm with a joint random walk proposal is used to sample $μ, ν, b_{1}, b_{2}$ and $b_{3}$. The proposal for sampling $ℓ_{2}$ and $ℓ_{3}$ is a random walk with Poisson increments. $B_{0}$ is computed as $B_{0} = (ι ι^{'} - B_{1} - B_{2} - B_{3}) ⊙ \bar{Σ}$, where $\bar{Σ}$ is the sample average of RCOVs, following the RCOV targeting technique. Any draws with a singular $B_{0}$ matrix are dropped. Additional details of the sampling steps are collected in Appendix A.
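Equation (18), combined with the RCOV targeting rule for $B_{0}$, can be evaluated for a single parameter draw as in the following sketch. The function name `iwa_conditional_mean`, the argument layout and the example values of $b_{j}$ and $ℓ_{j}$ are assumptions for illustration; the MCMC sampling itself is not shown.

```python
import numpy as np

def iwa_conditional_mean(rcov_history, b, lags, rcov_mean):
    """V_t = B_0 + sum_j B_j (Hadamard) Gamma_{t-1, l_j} with targeted B_0.

    rcov_history : (T, d, d) array of past RCOV matrices, most recent last
    b            : (3, d) array whose rows are b_1, b_2, b_3
    lags         : window lengths (l_1, l_2, l_3), each <= T
    rcov_mean    : (d, d) sample average of RCOVs used for targeting
    """
    hist = np.asarray(rcov_history, dtype=float)
    d = hist.shape[1]
    B = [np.outer(bj, bj) for bj in np.asarray(b, dtype=float)]
    # RCOV targeting: B_0 = (ii' - B_1 - B_2 - B_3) elementwise-times mean.
    V = (np.ones((d, d)) - sum(B)) * rcov_mean
    for Bj, lag in zip(B, lags):
        gamma = hist[-lag:].mean(axis=0)  # average of the last `lag` RCOVs
        V = V + Bj * gamma                # elementwise (Hadamard) product
    return V

# Sanity check with a constant RCOV history: V_t must equal that constant.
S = np.array([[2.0, 0.5], [0.5, 1.0]])
history = np.repeat(S[None, :, :], 50, axis=0)
b = 0.5 * np.ones((3, 2))
V = iwa_conditional_mean(history, b, lags=(1, 5, 22), rcov_mean=S)
```

The constant-history check shows why targeting is convenient: the unconditional level of $V_{t}$ is pinned to the sample average of the RCOVs regardless of the $b_{j}$.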

3.2. Conditional Autoregressive Wishart Model

We extend the conditional autoregressive Wishart (CAW) model proposed in Golosnoy et al. (2012) to a joint return-RCOV model by linking RCOV estimates and returns via Equation (19). The joint CAW model is given as

\begin{matrix} R_{t} | μ, Σ_{t} & \sim & N (μ, Σ_{t}), \end{matrix}

(19)

\begin{matrix} Σ_{t} | ν, V_{t} & \sim & W (ν, V_{t} / ν), \end{matrix}

(20)

\begin{matrix} V_{t} & = & C + \sum_{i = 1}^{p} B_{i} V_{t - i} B_{i}^{'} + \sum_{i = 1}^{q} A_{i} Σ_{t - i} A_{i}^{'}, \end{matrix}

(21)

where $W (ν, V_{t} / ν)$ is a Wishart distribution with degrees of freedom $ν$ and scale matrix $V_{t} / ν$. $A_{i}$, $B_{i}$ and $C$ are $d \times d$ matrices and $C$ is positive definite. In addition to the distributional difference, the CAW model differs from IW-A in that it assumes the conditional covariance has an autoregressive structure. $V_{t}$ depends on its lagged values as well as the previous RCOVs, which implies that all past $Σ_{t}$ contribute to explaining $V_{t}$.

We adopt the diagonal version of CAW with $p = q = 2$, which restricts $A_{i}$ and $B_{i}$ to be diagonal matrices. The priors of $μ$ and the diagonal elements of $A_{i}$ and $B_{i}$ are all $N (0, 100)$, and the first diagonal elements of $A_{i}$ and $B_{i}$ are restricted to be positive. The prior of $ν$ is $exp (100) I_{ν > d}$. Similar to the estimation of $B_{0}$ in the IW-A model, $C$ is set to $(ι ι^{'} - A_{1} - A_{2} - B_{1} - B_{2}) ⊙ \bar{Σ}$, where $\bar{Σ}$ is the sample average of RCOVs. Other model parameters are sampled using the Metropolis-Hastings algorithm with a multivariate random walk proposal. Additional details of posterior sampling are presented in Appendix A.

3.3. HEAVY Model

The multivariate high-frequency-based volatility (HEAVY) model introduced by Noureldin et al. (2012) is also considered. We add Student-t innovations to the HEAVY model as follows:

\begin{matrix} R_{t} | ν_{r}, H_{t} & \sim & St (0, \frac{ν_{r} - 2}{ν_{r}} H_{t}, ν_{r}), \end{matrix}

(22)

\begin{matrix} Σ_{t} | ν, V_{t} & \sim & W (ν, V_{t} / ν), \end{matrix}

(23)

\begin{matrix} H_{t} & = & C_{H} + B_{H} H_{t - 1} B_{H}^{'} + A_{H} Σ_{t - 1} A_{H}^{'}, \end{matrix}

(24)

\begin{matrix} V_{t} & = & C_{V} + B_{V} V_{t - 1} B_{V}^{'} + A_{V} Σ_{t - 1} A_{V}^{'}, \end{matrix}

(25)

where $C_{H}$ and $C_{V}$ are $d \times d$ positive definite matrices, and $A_{H}$, $B_{H}$, $A_{V}$ and $B_{V}$ are $d \times d$ diagonal matrices. $ν_{r}$ and $ν$ are the degrees of freedom of the Student-t and Wishart distributions, respectively. The HEAVY model exploits the conditional covariance of low-frequency returns and the conditional mean of RCOV in a GARCH-like setting. Equations (22) and (24) are similar to a multivariate GARCH(1,1) model with $R_{t - 1} R_{t - 1}^{'}$ replaced by $Σ_{t - 1}$. Unlike the IW-A and CAW models, which assume RCOV is an unbiased measure of return covariance, the HEAVY model estimates the return covariance $H_{t}$ conditional on both return and RCOV information.

For Bayesian inference, we set the priors of all diagonal elements of $A_{H}$, $B_{H}$, $A_{V}$ and $B_{V}$ to be $N (0, 100)$, and the priors of $ν$ and $ν_{r}$ are $exp (100) I_{ν > d}$ and $exp (100) I_{ν_{r} > 2}$, respectively. $C_{H}$ and $C_{V}$ are calculated using RCOV targeting. Metropolis-Hastings sampling steps with a multivariate random walk proposal are used for posterior simulation. Additional details of posterior sampling are presented in Appendix A.

3.4. Prediction

Given that covariances are not observable, while returns are, the density forecast of returns provides a fair benchmark to evaluate the out-of-sample performance of RCOV estimators. Conditional on a particular model $M$ and information set $F_{t}$, the predictive density of the next-period return is given as

p (R_{t + 1} | F_{t}, M) = \int p (R_{t + 1} | Θ, F_{t}, M) p (Θ | F_{t}, M) d Θ,

(26)

where $p (R_{t + 1} | Θ, F_{t}, M)$ is the density conditional on the parameter set $Θ$ and the information set at time $t$, and $p (Θ | F_{t}, M)$ is the posterior density. Based on $G$ MCMC draws, the predictive likelihood is computed by integrating out the parameter uncertainty as

\begin{matrix} p (R_{t + 1} | F_{t}, M) \approx \frac{1}{G} \sum_{i = 1}^{G} p (R_{t + 1} | Θ^{(i)}, F_{t}, M), \end{matrix}

(27)

where $Θ^{(i)} \sim p (Θ | F_{t}, M)$.
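In practice the average in Equation (27) is usually computed from per-draw log densities, since the densities themselves can underflow in higher dimensions. A minimal sketch using the log-sum-exp trick follows; the function name `log_predictive_likelihood` and the example inputs are hypothetical.

```python
import numpy as np

def log_predictive_likelihood(log_dens_draws):
    """log of (1/G) * sum_i exp(log p_i), computed stably.

    log_dens_draws : length-G array of log p(R_{t+1} | Theta^(i), F_t, M)
    """
    x = np.asarray(log_dens_draws, dtype=float)
    m = x.max()
    # Subtract the maximum before exponentiating to avoid underflow.
    return m + np.log(np.mean(np.exp(x - m)))

# Hypothetical per-draw densities 0.1, 0.2 and 0.3 average to 0.2.
lpl = log_predictive_likelihood(np.log([0.1, 0.2, 0.3]))
```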

Let $M_{1}$, $M_{2}$ and $M_{3}$ represent the IW-A, CAW and HEAVY models, respectively. For the IW-A model, after integrating out $Σ_{t + 1}$, the conditional distribution of $R_{t + 1}$ is a multivariate Student-t given as

\begin{matrix} p (R_{t + 1} | Θ^{(i)}, F_{t}, M_{1}) & = St (R_{t + 1} | μ, \frac{ν^{(i)} - d - 1}{ν^{(i)} - d + 1} V_{t + 1}^{(i)}, ν^{(i)} - d + 1), \end{matrix}

(28)

\begin{matrix} V_{t + 1}^{(i)} & = B_{0}^{(i)} + \sum_{j = 1}^{3} B_{j}^{(i)} ⊙ Γ_{t, ℓ_{j}^{(i)}} . \end{matrix}

(29)
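Equation (28) can be evaluated for one posterior draw with a multivariate Student-t log density in which the second argument is treated as a scale matrix. With degrees of freedom $ν - d + 1$ and scale $\frac{ν - d - 1}{ν - d + 1} V_{t + 1}$, the implied covariance is $V_{t + 1}$. The sketch below is illustrative; the function names `mvt_logpdf` and `iwa_log_predictive` are assumptions.

```python
import math
import numpy as np

def mvt_logpdf(x, mu, scale, df):
    """Log density of a multivariate Student-t with scale matrix `scale`."""
    x = np.atleast_1d(np.asarray(x, dtype=float))
    mu = np.atleast_1d(np.asarray(mu, dtype=float))
    scale = np.atleast_2d(np.asarray(scale, dtype=float))
    d = x.size
    L = np.linalg.cholesky(scale)
    z = np.linalg.solve(L, x - mu)       # z'z is the quadratic form
    quad = float(z @ z)
    logdet = 2.0 * np.log(np.diag(L)).sum()
    return (math.lgamma((df + d) / 2.0) - math.lgamma(df / 2.0)
            - 0.5 * d * math.log(df * math.pi) - 0.5 * logdet
            - 0.5 * (df + d) * math.log1p(quad / df))

def iwa_log_predictive(r_next, mu, V_next, nu):
    """Log of Equation (28) for one draw; requires nu > d + 1."""
    d = len(r_next)
    df = nu - d + 1
    scale = (nu - d - 1) / (nu - d + 1) * np.asarray(V_next, dtype=float)
    return mvt_logpdf(r_next, mu, scale, df)

# Hypothetical draw: two assets, identity conditional mean, nu = 10.
value = iwa_log_predictive([0.1, -0.2], [0.0, 0.0], np.eye(2), nu=10.0)
```

The per-draw values returned by `iwa_log_predictive` are the inputs that Equation (27) averages over the $G$ draws.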

For the CAW model, the distribution of $R_{t + 1}$ conditional on $Θ^{(i)}$ and $F_{t}$ is given as

p (R_{t + 1} | Θ^{(i)}, F_{t}, M_{2}) \propto p (R_{t + 1} | Σ_{t + 1}, Θ^{(i)}, F_{t}, M_{2}) p (Σ_{t + 1} | Θ^{(i)}, F_{t}, M_{2}) .

(30)

$p (R_{t + 1} | Θ^{(i)}, F_{t}, M_{2})$ can be approximated by averaging $p (R_{t + 1} | Σ_{t + 1}^{(i)}, Θ^{(i)}, F_{t}, M_{2})$, where $Σ_{t + 1}^{(i)} \sim W (ν^{(i)}, V_{t + 1}^{(i)} / ν^{(i)})$ and $V_{t + 1}^{(i)} = C^{(i)} + \sum_{k = 1}^{2} B_{k}^{(i)} V_{t + 1 - k}^{(i)} B_{k}^{{(i)}^{'}} + \sum_{k = 1}^{2} A_{k}^{(i)} Σ_{t + 1 - k} A_{k}^{{(i)}^{'}}$.

For the HEAVY model, conditional on the $i^{th}$ draw of model parameters, the predictive density of $R_{t + 1}$ is

\begin{matrix} p (R_{t + 1} | F_{t}, Θ^{(i)}, M_{3}) = & St (R_{t + 1} | 0, \frac{ν_{r}^{(i)} - 2}{ν_{r}^{(i)}} H_{t + 1}^{(i)}, ν_{r}^{(i)}), \end{matrix}

(31)

\begin{matrix} H_{t + 1}^{(i)} = & C_{H}^{(i)} + B_{H}^{(i)} H_{t}^{(i)} B_{H}^{{(i)}^{'}} + A_{H}^{(i)} Σ_{t} A_{H}^{{(i)}^{'}} . \end{matrix}

(32)

There are several differences among the three models’ predictive densities of returns. Under the HEAVY model, the degrees of freedom $ν_{r}$ and covariance matrix $H_{t + 1}$ are estimated conditional on both returns and RCOVs, while the IW-A and CAW models rely on RCOV data only to infer the covariance and kurtosis of return densities. Another distinction is that the three models capture time-series dependence in RCOVs differently. IW-A determines the number of historical RCOVs endogenously by learning from the data. In contrast, all past RCOVs are involved in explaining future covariance under the CAW and HEAVY models.

The density forecast improvement offered by an RCOV estimator can be measured by the predictive likelihood, which is the predictive density evaluated at the realized $R_{t + 1}$. Let $F_{t}^{A}$ stand for the information set that contains estimator $A$. The log-predictive likelihood (LPL) conditional on $F_{t}^{A}$ over the out-of-sample period from $t_{0} + 1$ to $T$ is

{LPL}_{A} = \sum_{t = t_{0}}^{T - 1} \log p (R_{t + 1} | F_{t}^{A}, M) .

(33)

One could compare the predictive likelihoods conditional on different information sets within the same model. Based on the model $M$, the log-Bayes factor (log-BF) of RCOV estimator $A$ versus $B$ is defined as ${LPL}_{A} - {LPL}_{B}$, where $A$ is preferred if the log-Bayes factor is positive. To investigate subsample density forecast performance, the cumulative log-Bayes factor, which is a sequence of log-Bayes factors, is computed as follows:

\text{cum log } {BF}_{s} = \sum_{t = t_{0}}^{s} [\log p (R_{t + 1} | F_{t}^{A}, M) - \log p (R_{t + 1} | F_{t}^{B}, M)], \quad s = t_{0}, \dots, T - 1 .

(34)

An increasing trend suggests that estimator $A$ consistently outperforms $B$ over the out-of-sample period. Similarly, we could compare any two models via the log-Bayes factor by conditioning on the same information set $F$.
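Given the per-period log-predictive likelihoods under two RCOV estimators, Equations (33) and (34) reduce to a running sum of their differences. A short sketch follows; the function name and the three-period inputs are hypothetical.

```python
import numpy as np

def cumulative_log_bayes_factor(lpl_a, lpl_b):
    """Running sum of per-period log predictive likelihood differences.

    lpl_a, lpl_b : arrays of log p(R_{t+1} | F_t^A, M) and
                   log p(R_{t+1} | F_t^B, M) over the out-of-sample period
    """
    diff = np.asarray(lpl_a, dtype=float) - np.asarray(lpl_b, dtype=float)
    return np.cumsum(diff)

# Hypothetical three-period example.
lpl_a = [-1.0, -2.0, -1.5]
lpl_b = [-1.5, -2.0, -2.5]
cum_bf = cumulative_log_bayes_factor(lpl_a, lpl_b)
```

The last element of the returned sequence equals ${LPL}_{A} - {LPL}_{B}$, the log-Bayes factor over the full out-of-sample period.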

Mean-variance portfolio analysis requires point predictions of the covariance matrix. Given G MCMC outputs, the predictive means of the next-period covariance matrix in IW-A, CAW and HEAVY models are computed as follows:

\hat{Cov} (R_{t + 1} | F_{t}, M_{1}) = \frac{1}{G} \sum_{i = 1}^{G} (B_{0}^{(i)} + \sum_{j = 1}^{3} B_{j}^{(i)} ⊙ Γ_{t, ℓ_{j}^{(i)}}) .

(35)

\hat{Cov} (R_{t + 1} | F_{t}, M_{2}) = \frac{1}{G} \sum_{i = 1}^{G} (C^{(i)} + \sum_{k = 1}^{2} B_{k}^{(i)} V_{t + 1 - k}^{(i)} B_{k}^{{(i)}^{'}} + \sum_{k = 1}^{2} A_{k}^{(i)} Σ_{t + 1 - k} A_{k}^{{(i)}^{'}}) .

(36)

\hat{Cov} (R_{t + 1} | F_{t}, M_{3}) = \frac{1}{G} \sum_{i = 1}^{G} (C_{H}^{(i)} + B_{H}^{(i)} H_{t}^{(i)} B_{H}^{{(i)}^{'}} + A_{H}^{(i)} Σ_{t} A_{H}^{{(i)}^{'}}) .

(37)

4. Data

The transaction prices of 20 equities from 2 January 2002 to 31 December 2014 are obtained from the TAQ database, and the data from 2 January 2015 to 31 December 2018 are obtained from Tick Data. The raw intraday data are cleaned following the method used by Barndorff-Nielsen et al. (2009).

The returns are defined as the difference between log prices and are scaled by 100. We compute ex-post covariance matrix estimates in both 10 and 20 dimensions. BAC, CAT, DIS, GS, IBM, JNJ, KO, PG, WMT and XOM compose the 10-asset group A, and the 20-asset group contains an additional 10 assets: AXP, C, CVX, HD, HON, JPM, MCD, NKE, PFE and VZ.³ The last 10 assets form the 10-asset group B.

Twenty RCOV measures are constructed using the seven estimation approaches summarized in Section 2. They are (i) RC based on 10-min, 5-min and 1-min returns (${RC}_{600 s}$, ${RC}_{300 s}$ and ${RC}_{60 s}$), (ii) 10-min SRC with 10 and 20 subsamples ($SRC {(10)}_{600 s}$, $SRC {(20)}_{600 s}$) and 5-min SRC with 5 and 10 subsamples ($SRC {(5)}_{300 s}$, $SRC {(10)}_{300 s}$), (iii) the two-scales estimators $TSRC {(60, 1)}_{5 s}$, $TSRC {(60, 10)}_{5 s}$ and $TSRC {(30, 1)}_{5 s}$, (iv) 1-min and 30-s RCLL with $U = 1$ and $U = 2$ ($RCLL {(1)}_{60 s}$, $RCLL {(2)}_{60 s}$, $RCLL {(1)}_{30 s}$ and $RCLL {(2)}_{30 s}$), (v) RK, (vi) pre-averaged RC based on 1-min, 30-s and refresh-time data (${PARC}_{60 s}$, ${PARC}_{30 s}$ and ${PARC}_{refresh}$) and the pre-averaged HY estimator, and (vii) the QMLC estimator. Table 1 lists the twenty estimators and their synchronization schemes and provides statistical summaries of the diagonal and off-diagonal elements of the covariance matrix estimates in the 20-asset case.

Table 1. List of RCOV measures.

5. Empirical Results

Each of the twenty RCOV measures is jointly modeled with returns using the three models discussed in Section 3. The out-of-sample forecasts are computed recursively from 19 October 2006 to 31 December 2018, a total of 3070 days. The estimation on the initial day of the out-of-sample period is based on 10,000 MCMC draws, after dropping 10,000 burn-in draws. As new data arrive, model parameters are re-estimated based on 5000 MCMC draws, after 1000 burn-in draws (see Note 4).

5.1. Density Forecasts

Table 2, Table 3 and Table 4 report the sum of log-predictive likelihoods of next-period returns for the out-of-sample period under the IW-A, CAW and HEAVY models conditional on various RCOV measures. The performance of RCOV measures can be visualized in Figure 1, Figure 2 and Figure 3, which plot the log-predictive Bayes factors for RCOV measures against ${RC}_{300 s}$ in the three asset cases. In almost all cases, pre-averaging estimators based on previous-tick returns provide the best density forecast improvement. For example, switching from ${RC}_{300 s}$ to ${PARC}_{30 s}$ increases the log-predictive likelihood by a minimum of 187.0 (HEAVY, 10 assets, group A) to a maximum of 1126.6 (IW-A, 20 assets). Two-scales and subsampling approaches lead to the second and third best-performing groups. RK and QMLC offer improved density forecast results compared with RCLLs, but are not significantly better than ${RC}_{300 s}$. The evaluation results are consistent across modeling frameworks, data dimensions and asset groups. In order to investigate the prior robustness of RCOV evaluation results, we compute the log-predictive likelihoods of the IW-A model under two additional sets of priors. As shown in Table 5, the rankings of RCOV measures remain unchanged under more informative or more sparse priors, which suggests the density-forecast-based RCOV evaluation method is robust to prior assumptions.

Table 2. Predictive likelihoods of return (IW-A model).

Table 3. Predictive likelihoods of return (CAW model).

Table 4. Predictive likelihoods of return (HEAVY model).

Figure 1. Log Bayes factors (10 assets—Group A).

Figure 2. Log Bayes factors (10 assets—Group B).

Figure 3. Log Bayes factors (20 assets).

Table 5. Prior sensitivity check.

Figure 4 plots the cumulative log-Bayes factors of several representative estimators (${RC}_{300 s}$, $SRC {(20)}_{600 s}$, $TSRC {(60, 1)}_{5 s}$ and ${PARC}_{refresh}$) against ${PARC}_{30 s}$ in the three cases according to the IW-A model. The decreasing trend in Figure 4 indicates that each alternative steadily loses ground to ${PARC}_{30 s}$, which suggests that the ranking of RCOV estimators is robust across subsample periods.

Figure 4. Cumulative log-Bayes factors of ${PARC}_{30 s}$ vs. alternative RC estimators.

The density forecast results confirm several theoretical expectations and findings in the literature. The ranking of TSRC, SRC and RC is consistent with the conclusion in Zhang et al. (2005) that the two-scales estimator has a smaller bias than the subsampled or sparsely-sampled RC. A comparison of SRC estimators confirms that it is better to form a subsampled estimator with low-frequency data and more subsamples. The out-of-sample performance of the RC deteriorates as the sampling frequency increases, which validates the use of low-frequency RC in most empirical studies. The out-of-sample performance of RC and RCLL matches the theoretical results reported by Griffin and Oomen (2011), in which for a fixed sampling frequency, increasing the lead and lag terms reduces the estimation bias. Our results also show that PAHY underperforms PARCs, which is consistent with the finite sample result of PAHY documented by Christensen et al. (2010).

The variation in return density forecasts suggests that the choice of RCOV measure matters greatly with regard to prediction. For example, in the 20 assets application using the IW-A model, switching from ${RC}_{60 s}$ to ${PARC}_{30 s}$ increases the log-predictive likelihood from −80,200.9 to −76,645.5, a log-Bayes factor of 3555.4. Most RCOV modeling works try to improve the forecasts by adjusting the model specifications and stochastic assumptions, while our results shed light on a different perspective: the choice of RCOV estimator is also important in the pursuit of better predictability.
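The log-Bayes factors quoted here are plain differences of the summed log-predictive likelihoods. As a small worked check, the snippet below reproduces two of them from the Table 2 values for the IW-A model with 20 assets; the dictionary keys and the helper name are illustrative choices, not notation from the paper.

```python
# Sums of log-predictive likelihoods taken from Table 2 (IW-A model, 20 assets).
lpl = {"RC_300s": -77772.1, "RC_60s": -80200.9, "PARC_30s": -76645.5}

def log_bayes_factor(lpl_a, lpl_b):
    """Log-Bayes factor of measure A against measure B under the same model."""
    return lpl_a - lpl_b

# PARC_30s against the baseline RC_300s, and against RC_60s.
bf_vs_base = round(log_bayes_factor(lpl["PARC_30s"], lpl["RC_300s"]), 1)
bf_vs_rc60 = round(log_bayes_factor(lpl["PARC_30s"], lpl["RC_60s"]), 1)
```

The two results, 1126.6 and 3555.4, match the log-BF column of Table 2 and the figure quoted in the text.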

The density forecast results also show that the comparison of RCOV models could be sensitive to the choice of RCOV. For example, when using ${PARC}_{30 s}$ or ${PARC}_{60 s}$ as the RCOV data, the IW-A model produces the best density forecast results, followed by HEAVY and CAW. In contrast, HEAVY performs better than IW-A for most of the other measures. Among the three joint models, IW-A has the highest sensitivity to RCOV inputs. Taking the 10-asset group A as an example, the log-Bayes factor between the best and worst estimators is over 1300 under the IW-A model, while the predictive likelihood ranges under CAW and HEAVY are around 1000 and 400, respectively. Different distributional assumptions and model specifications are potential reasons for the sensitivity difference. In the HEAVY model, the return covariance matrix is estimated conditional on both returns and RCOVs. Therefore, the additional information from returns mitigates the negative influence of poor RCOV measures but diminishes the prediction improvement offered by good RCOV measures. Compared with the IW-A model, the CAW model captures stronger volatility persistence, so its prediction is less sensitive to newly arrived RCOV data. The IW-A and CAW models also differ in RCOV distributional assumptions, which further contributes to the differences in forecasting results.

In addition to the one-period ahead density forecasts, we investigate the performance of RCOV measures based on long horizon density forecasts. Table 6 shows 5-period and 10-period ahead log-predictive likelihoods (see Note 5) under the IW-A model across the three asset groups. Multiple horizon density forecasts provide a similar ranking of the twenty estimators, compared with the results taken from Table 2, Table 3 and Table 4.

Table 6. Predictive likelihoods of returns over long horizons.

5.2. Portfolio Allocation

In this section, we evaluate the out-of-sample performance of RCOV estimators from a portfolio optimization perspective. Through forming mean-variance portfolios using predicted covariance, the predictive performance of RCOV estimators can be indirectly assessed based on portfolio performance measures such as standard deviation or Sharpe ratio. Given the predictive covariance $Σ_{t + 1}$ of next-period returns, the optimal weight $w_{t + 1}$ can be obtained by solving the following optimization problem:

\min w_{t + 1}^{'} Σ_{t + 1} w_{t + 1}, subject to w_{t + 1}^{'} ι = 1,

(38)

where $Σ_{t + 1} = \hat{Cov} (R_{t + 1} | F_{t}, M)$. Under the IW-A, CAW and HEAVY models, $Σ_{t + 1}$ is calculated according to Equations (35)–(37), respectively. The realized portfolio return is $r_{t + 1}^{p} = w_{t + 1}^{'} R_{t + 1}$, and the out-of-sample portfolio variance is given as

σ_{p}^{2} = \frac{1}{T - t_{0}} \sum_{t = t_{0} + 1}^{T} {(r_{t}^{p} - {\bar{r}}^{p})}^{2} .

(39)

The estimator that leads to the smallest $σ_{p}^{2}$ is considered to be the best (see Note 6).
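A minimal sketch of the portfolio exercise in Equations (38) and (39) follows. It relies on the standard closed-form solution of the minimum variance problem, $w = Σ^{- 1} ι / (ι^{'} Σ^{- 1} ι)$, which is not stated explicitly in the text; the function names and the toy covariance matrix are hypothetical.

```python
import numpy as np

def gmv_weights(cov):
    """Closed-form solution of min w' cov w subject to w' iota = 1."""
    iota = np.ones(cov.shape[0])
    x = np.linalg.solve(cov, iota)  # cov^{-1} iota without forming the inverse
    return x / (iota @ x)

def realized_portfolio_variance(weights, returns):
    """Equation (39): average squared deviation of r_t^p = w_t' R_t from its mean."""
    r_p = np.einsum("td,td->t", weights, returns)
    return np.mean((r_p - r_p.mean()) ** 2)

# Toy two-asset predictive covariance (hypothetical values).
cov = np.array([[4.0, 1.0], [1.0, 2.0]])
w = gmv_weights(cov)  # weights of 0.25 and 0.75

# Toy out-of-sample path: the same weights on three days of returns.
weights = np.tile(w, (3, 1))
returns = np.array([[1.0, 0.0], [-1.0, 1.0], [0.5, -0.5]])
sigma2_p = realized_portfolio_variance(weights, returns)
```

Solving the linear system instead of inverting the covariance matrix is a numerical design choice; both give the same weights when the matrix is well conditioned.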

Table 7 reports the standard deviations of global minimum variance (GMV) portfolios based on the IW-A model (see Note 7) with various RCOV measures. The comparison results based on portfolio exercises are generally consistent with those obtained from the density forecasts. For example, ${PARC}_{30 s}$, ${PARC}_{60 s}$, $TSRC {(60, 1)}_{5 s}$ and $SRC {(10)}_{600 s}$ lead to portfolios with relatively low variance. However, the difference among standard deviations of GMV portfolios is marginal. To further investigate whether the difference is significant, we apply the model confidence set (MCS) introduced by Hansen et al. (2011) to obtain a set of estimators that includes the optimal one. Table 7 provides MCS test p-values for each covariance matrix estimator. In the 10-asset group A, the estimators excluded at the 25% significance level are ${RC}_{60 s}$, $SRC {(10)}_{300 s}$, $RCLL {(2)}_{30 s}$, $RCLL {(1)}_{30 s}$, RK, PAHY and QMLC. The model confidence set could exclude underperforming RCOV measures, but fails to suggest the optimal one.

Table 7. Standard deviations of global minimum variance portfolio returns (IW-A model).

Although both the density-forecast and portfolio-based methods are able to eliminate several inferior RCOV estimators, the latter fails to suggest outperforming ones. Compared with the portfolio-based method, the density forecast method is more direct as it ranks RCOV based on density forecasts of multivariate return vectors, rather than univariate portfolio measures. Another drawback of the portfolio-based method is that the ranking of covariance estimators is sensitive to the choice of out-of-sample size and significance measurement (see Note 8).

5.3. Close-to-Close Data

Previous empirical works use open-to-close data, which only account for information during trading hours. To investigate the robustness of our approach, we evaluate RCOV measures based on density forecasts of close-to-close returns. Following Fleming et al. (2003) and de Pooter et al. (2008), the close-to-close covariance matrix is formed by summing the intraday covariance and the outer product of overnight returns:

{RCOV}_{t}^{c c} = R_{t}^{c o} R_{t}^{c o^{'}} + {RCOV}_{t}^{o c},

(40)

where ${RCOV}_{t}^{o c}$ is a realized measure over trading hours and $R_{t}^{c o}$ stands for the overnight log returns, which are formed using the opening price on day $t$ and the closing price on day $t - 1$.
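Equation (40) amounts to adding a rank-one overnight term to the intraday measure. The sketch below illustrates this with made-up numbers; the function name and inputs are hypothetical.

```python
import numpy as np

def close_to_close_rcov(rcov_oc, overnight_ret):
    """Equation (40): outer product of overnight returns plus the intraday RCOV."""
    r = np.asarray(overnight_ret, dtype=float).reshape(-1, 1)
    return r @ r.T + np.asarray(rcov_oc, dtype=float)

# Toy two-asset example (hypothetical values, returns in percent).
rcov_oc = np.array([[1.0, 0.2], [0.2, 0.5]])
r_co = np.array([0.3, -0.1])
rcov_cc = close_to_close_rcov(rcov_oc, r_co)
```

Because the overnight term is the same for every competing estimator on a given day, it shifts all measures by a common component, which is why the text expects the estimators to look more similar in this setting.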

Table 8 reports the log-predictive likelihood of close-to-close returns using the HEAVY and CAW models. Even though adding the common overnight covariance component makes the competing covariance estimators more similar, the proposed approach is still sufficiently robust to evaluate estimators, and the rankings remain relatively consistent. ${PARC}_{30 s}$, ${PARC}_{60 s}$, $TSRC {(60, 1)}_{5 s}$, $SRC {(20)}_{600 s}$ and $SRC {(10)}_{600 s}$ remain the top-performing RCOV measures.

Table 8. Predictive likelihoods of close-to-close return.

6. Conclusions

Existing methods of evaluating RCOV estimators empirically rely on portfolio analysis, which compares RCOV measures indirectly using univariate measures such as portfolio standard deviation or Sharpe ratio. The comparison of RCOV measures’ predictive power on return density forecasts is not well investigated in the existing literature. This paper fills the gap by suggesting a density-forecast-based method to evaluate RCOV measures. Given that covariances are not observable, while returns are, the joint modeling of returns and RCOVs enables the evaluation of RCOV estimators via return density forecasts. We test the empirical predictive power of a list of popular RCOV estimators and find that several estimators consistently outperform others. The density-forecast-based evaluation method is robust to various RCOV models, datasets, data dimensions and forecast horizons. Another important insight is that the RCOV measures should be carefully selected in covariance modeling, as the choice of RCOV measures can significantly impact a model’s forecasting performance.

Author Contributions

Conceptualization, J.L. and Q.Y.; methodology, X.J., J.L. and Q.Y.; software, X.J.; validation, X.J.; formal analysis, X.J., J.L. and Q.Y.; resources, X.J., J.L. and Q.Y.; data curation, J.L.; writing—original draft preparation, J.L. and Q.Y.; writing—review and editing, X.J., J.L. and Q.Y. All authors have read and agreed to the published version of the manuscript.

Funding

Jin’s research is supported by NSFC through Project 71773069. Liu thanks the FGSR internal grant of Saint Mary’s University for financial support. Yang thanks the start-up fund of ShanghaiTech and Young Scientists Fund of NSFC (Project 72103137) for financial support.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are obtained from NYSE Trade and Quote (TAQ) database and Tick Data (https://www.tickdata.com/, accessed on April 2020).

Acknowledgments

Earlier versions of this paper were presented at the 2021 NBER-NSF SBIES Conference and 2021 China Meeting of the Econometric Society. We would like to thank those who participated in these meetings for their valuable comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

This section illustrates the MCMC sampling steps for IW-A, CAW and HEAVY models.

Appendix A.1. IW-A Model Estimation Steps

The parameters to be sampled are $μ, ν, b_{1}, b_{2}, b_{3}, ℓ_{2}, ℓ_{3}$. We divide them into the following three blocks and iteratively sample them from their conditional posterior distributions.

\begin{matrix} \begin{matrix} 1 . & μ | R_{1 : T}, Σ_{1 : T}, \\ 2 . & ν, b_{1}, b_{2}, b_{3} | ℓ_{2}, ℓ_{3}, Σ_{1 : T}, \\ 3 . & ℓ_{2}, ℓ_{3} | ν, b_{1}, b_{2}, b_{3}, Σ_{1 : T} . \end{matrix} \end{matrix}

$μ$ is sampled by a Gibbs sampler as its conditional posterior is a multivariate normal:

\begin{matrix} μ | R_{1 : T}, Σ_{1 : T} \sim N ({\bar{m}}_{μ}, {\bar{V}}_{μ}), \end{matrix}

where ${\bar{V}}_{μ} = {(\frac{1}{100} I + \sum_{t = 1}^{T} Σ_{t}^{- 1})}^{- 1}$ and ${\bar{m}}_{μ} = {\bar{V}}_{μ} \sum_{t = 1}^{T} Σ_{t}^{- 1} R_{t}$.
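The Gibbs step for $μ$ can be sketched as below. The prior variance of 100 is read off the $\frac{1}{100} I$ term in ${\bar{V}}_{μ}$; the function name, argument layout and toy data are assumptions made for illustration.

```python
import numpy as np

def draw_mu(returns, sigmas, prior_var=100.0, rng=None):
    """Draw mu from N(m_bar, V_bar) given returns R_t and covariances Sigma_t."""
    rng = np.random.default_rng() if rng is None else rng
    d = returns.shape[1]
    precision = np.eye(d) / prior_var      # prior precision (1/100) I
    weighted_sum = np.zeros(d)
    for R_t, Sigma_t in zip(returns, sigmas):
        Sigma_inv = np.linalg.inv(Sigma_t)
        precision += Sigma_inv             # accumulates sum_t Sigma_t^{-1}
        weighted_sum += Sigma_inv @ R_t    # accumulates sum_t Sigma_t^{-1} R_t
    V_bar = np.linalg.inv(precision)
    m_bar = V_bar @ weighted_sum
    return rng.multivariate_normal(m_bar, V_bar), m_bar, V_bar

# Toy data: four days, two assets, identity covariances (hypothetical values).
returns = np.tile([1.0, 2.0], (4, 1))
sigmas = np.stack([np.eye(2)] * 4)
mu_draw, m_bar, V_bar = draw_mu(returns, sigmas, rng=np.random.default_rng(1))
```

With identity covariances the posterior mean reduces to the sample mean shrunk slightly toward zero, here by the factor 4 / 4.01.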

The joint conditional posterior of $ν, b_{1}, b_{2}, b_{3}$ is

\begin{matrix} p (ν, b_{1}, b_{2}, b_{3} | ℓ_{2}, ℓ_{3}, Σ_{1 : T}) & \propto & p (ν, b_{1}, b_{2}, b_{3}) \prod_{t = 1}^{T} I W (Σ_{t} | ν, (ν - d - 1) V_{t}) \\ & = & p (ν, b_{1}, b_{2}, b_{3}) \prod_{t = 1}^{T} \frac{| Σ_{t} |^{- \frac{ν + d + 1}{2}} {| (ν - d - 1) V_{t} |}^{\frac{ν}{2}}}{2^{\frac{ν d}{2}} \prod_{j = 1}^{d} Γ (\frac{ν + 1 - j}{2})} exp (- \frac{1}{2} T r (Σ_{t}^{- 1} (ν - d - 1) V_{t})) . \end{matrix}

We sample $ν, b_{1}, b_{2}, b_{3}$ by applying a random-walk Metropolis-Hastings (MH) algorithm with a multivariate normal proposal distribution. Given the current values $\{ν, b_{1}, b_{2}, b_{3}\}$ in the MCMC chain, the new proposal $\{ν^{'}, b_{1}^{'}, b_{2}^{'}, b_{3}^{'}\}$ is accepted with probability

\begin{matrix} min \{\frac{p (ν^{^{'}}, b_{1}^{^{'}}, b_{2}^{^{'}}, b_{3}^{^{'}} | ℓ_{2}, ℓ_{3}, Σ_{1 : T})}{p (ν, b_{1}, b_{2}, b_{3} | ℓ_{2}, ℓ_{3}, Σ_{1 : T})}, 1\} . \end{matrix}
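A generic random-walk MH step of this kind can be sketched as follows, working with log-densities for numerical stability. The function name, the step covariance and the toy standard normal target are assumptions for illustration; in the application the target would be the conditional posterior above.

```python
import numpy as np

def rw_mh_step(theta, log_post, step_cov, rng):
    """One random-walk MH update with a symmetric multivariate normal proposal."""
    proposal = rng.multivariate_normal(theta, step_cov)
    log_ratio = log_post(proposal) - log_post(theta)
    # Accept with probability min{ratio, 1}; the symmetric proposal cancels out.
    if np.log(rng.uniform()) < log_ratio:
        return proposal, True
    return theta, False

# Toy target: a two-dimensional standard normal (not the model posterior).
rng = np.random.default_rng(42)
log_target = lambda x: -0.5 * float(x @ x)
theta = np.zeros(2)
chain, n_acc = [], 0
for _ in range(5000):
    theta, accepted = rw_mh_step(theta, log_target, 0.5 * np.eye(2), rng)
    chain.append(theta)
    n_acc += accepted
chain = np.array(chain)
acc_rate = n_acc / len(chain)
```

In practice the step covariance is tuned so that the acceptance rate is neither close to zero nor close to one; the value used here is arbitrary.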

The joint conditional posterior of $ℓ_{2}, ℓ_{3}$ is

\begin{matrix} p (ℓ_{2}, ℓ_{3} | ν, b_{1}, b_{2}, b_{3}, Σ_{1 : T}) & \propto & p (ℓ_{2}, ℓ_{3}) \prod_{t = 1}^{T} I W (Σ_{t} | ν, (ν - d - 1) V_{t}) . \end{matrix}

$ℓ_{2}$ and $ℓ_{3}$ are sequentially sampled by MH algorithms with the following proposal density:

q (ℓ) = \frac{λ^{ℓ} e^{- λ}}{2 ℓ!}

and we set $λ = 2$. The new proposal $\{ℓ_{2}^{'}, ℓ_{3}^{'}\}$ is accepted with probability

\begin{matrix} min \{\frac{p (ℓ_{2}^{^{'}}, ℓ_{3}^{^{'}} | ν, b_{1}, b_{2}, b_{3}, Σ_{1 : T})}{p (ℓ_{2}, ℓ_{3} | ν, b_{1}, b_{2}, b_{3}, Σ_{1 : T})}, 1\} . \end{matrix}

Appendix A.2. CAW Model Estimation Steps

The parameter set contains $μ, ν, a_{1}, a_{2}, b_{1}, b_{2}$, where $a_{i}$ and $b_{i}$ are vectors of diagonal elements of $A_{i}$ and $B_{i}$, respectively, for $i = 1, 2$. We iteratively sample the following two blocks of parameters in Bayesian estimation.

\begin{matrix} \begin{matrix} 1 . & μ | R_{1 : T}, Σ_{1 : T}, \\ 2 . & ν, a_{1}, a_{2}, b_{1}, b_{2} | Σ_{1 : T} . \end{matrix} \end{matrix}

The same Gibbs step used for $μ$ in the IW-A model estimation is applied here, since the two models share the same conditional posterior of $μ$. The joint conditional posterior of $ν, a_{1}, a_{2}, b_{1}, b_{2}$ is

\begin{matrix} p (ν, a_{1}, a_{2}, b_{1}, b_{2} | Σ_{1 : T}) & \propto & p (ν, a_{1}, a_{2}, b_{1}, b_{2}) \prod_{t = 1}^{T} W (Σ_{t} | ν, V_{t} / ν) \\ & = & p (ν, a_{1}, a_{2}, b_{1}, b_{2}) \prod_{t = 1}^{T} \frac{| Σ_{t} |^{\frac{ν - d - 1}{2}} {| V_{t} / ν |}^{- \frac{ν}{2}}}{2^{\frac{ν d}{2}} \prod_{j = 1}^{d} Γ (\frac{ν + 1 - j}{2})} exp (- \frac{1}{2} T r (Σ_{t} ν V_{t}^{- 1})) . \end{matrix}

We sample $ν, a_{1}, a_{2}, b_{1}, b_{2}$ by applying an MH algorithm with a random-walk sampler, similarly to step 2 in the estimation of the IW-A model. The new proposal $\{ν^{'}, a_{1}^{'}, a_{2}^{'}, b_{1}^{'}, b_{2}^{'}\}$ is accepted with probability

\begin{matrix} min \{\frac{p (ν^{^{'}}, a_{1}^{^{'}}, a_{2}^{^{'}}, b_{1}^{^{'}}, b_{2}^{^{'}} | Σ_{1 : T})}{p (ν, a_{1}, a_{2}, b_{1}, b_{2} | Σ_{1 : T})}, 1\} . \end{matrix}
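Evaluating the acceptance ratio requires the log of the Wishart kernel written above. The sketch below codes that expression term by term; like the appendix formula, it omits the factor $π^{d (d - 1) / 4}$ of the multivariate gamma function, which cancels in MH ratios because $d$ is fixed. The function name and the one-dimensional check values are hypothetical.

```python
import math
import numpy as np

def wishart_logpdf(Sigma, nu, V):
    """Log of W(Sigma | nu, V / nu) as written in the appendix (pi term omitted)."""
    d = Sigma.shape[0]
    _, logdet_sigma = np.linalg.slogdet(Sigma)
    _, logdet_scale = np.linalg.slogdet(V / nu)
    log_norm = 0.5 * nu * d * math.log(2.0) + sum(
        math.lgamma(0.5 * (nu + 1 - j)) for j in range(1, d + 1)
    )
    trace_term = nu * np.trace(np.linalg.solve(V, Sigma))  # Tr(Sigma nu V^{-1})
    return (
        0.5 * (nu - d - 1) * logdet_sigma
        - 0.5 * nu * logdet_scale
        - log_norm
        - 0.5 * trace_term
    )

# One-dimensional check, where the Wishart is a scaled chi-square distribution.
lp = wishart_logpdf(np.array([[2.0]]), 5.0, np.array([[3.0]]))
```

Summing this quantity over $t$ and adding the log prior gives the log of the conditional posterior up to a constant.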

Appendix A.3. HEAVY Model Estimation Steps

The parameter set $Θ$ to be sampled includes $\{ν, ν_{r}, a_{H}, a_{V}, b_{H}, b_{V}\}$, where $a_{i}$ and $b_{i}$ are vectors of diagonal elements of $A_{i}$ and $B_{i}$, respectively, for $i = H, V$. We sample all the parameters in one block. The joint posterior is

\begin{matrix} p (ν, ν_{r}, a_{H}, a_{V}, b_{H}, b_{V} | R_{1 : T}, Σ_{1 : T}) & \propto & p (ν, ν_{r}, a_{H}, a_{V}, b_{H}, b_{V}) \prod_{t = 1}^{T} S t (R_{t} | 0, \frac{ν_{r} - 2}{ν_{r}} H_{t}, ν_{r}) W (Σ_{t} | ν, V_{t} / ν) \\ & = & p (ν, ν_{r}, a_{H}, a_{V}, b_{H}, b_{V}) \prod_{t = 1}^{T} \frac{Γ [(ν_{r} + d) / 2]}{Γ (ν_{r} / 2) ν_{r}^{d / 2} π^{d / 2} {| \frac{ν_{r} - 2}{ν_{r}} H_{t} |}^{1 / 2}} {[1 + \frac{1}{ν_{r} - 2} R_{t}^{'} H_{t}^{- 1} R_{t}]}^{- \frac{ν_{r} + d}{2}} \\ & & \cdot \frac{| Σ_{t} |^{\frac{ν - d - 1}{2}} {| V_{t} / ν |}^{- \frac{ν}{2}}}{2^{\frac{ν d}{2}} \prod_{j = 1}^{d} Γ (\frac{ν + 1 - j}{2})} exp (- \frac{1}{2} T r (Σ_{t} ν V_{t}^{- 1})) , \end{matrix}

where $H_{t} = C_{H} + B_{H} H_{t - 1} B_{H}^{'} + A_{H} Σ_{t - 1} A_{H}^{'}$ and $V_{t} = C_{V} + B_{V} V_{t - 1} B_{V}^{'} + A_{V} Σ_{t - 1} A_{V}^{'}$. We apply an MH algorithm with a random-walk proposal to sample $ν, ν_{r}, a_{H}, a_{V}, b_{H}, b_{V}$. The new proposal $\{ν^{'}, ν_{r}^{'}, a_{H}^{'}, a_{V}^{'}, b_{H}^{'}, b_{V}^{'}\}$ is accepted with probability

\begin{matrix} min \{\frac{p (ν^{^{'}}, ν_{r}^{^{'}}, a_{H}^{^{'}}, a_{V}^{^{'}}, b_{H}^{^{'}}, b_{V}^{^{'}} | R_{1 : T}, Σ_{1 : T})}{p (ν, ν_{r}, a_{H}, a_{V}, b_{H}, b_{V} | R_{1 : T}, Σ_{1 : T})}, 1\} . \end{matrix}
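Each posterior evaluation requires rebuilding the paths of $H_{t}$ and $V_{t}$ from the recursions above. The sketch below shows one such recursion with diagonal $A$ and $B$; the function name, the initial value and the toy parameters are assumptions, since the text does not state how the recursions are initialized.

```python
import numpy as np

def heavy_recursion(C, b, a, sigmas, init):
    """Path X_1, ..., X_T with X_t = C + B X_{t-1} B' + A Sigma_{t-1} A'."""
    B, A = np.diag(b), np.diag(a)
    path = [np.asarray(init, dtype=float)]   # X_1 is supplied by the caller
    for sigma_prev in sigmas[:-1]:           # Sigma_1, ..., Sigma_{T-1}
        path.append(C + B @ path[-1] @ B.T + A @ sigma_prev @ A.T)
    return np.stack(path)

# Toy parameters chosen so that the identity matrix is a fixed point:
# 0.1 + 0.9^2 + 0.3^2 = 1 on the diagonal.
d, T = 2, 3
C = 0.1 * np.eye(d)
b = np.array([0.9, 0.9])
a = np.array([0.3, 0.3])
sigmas = np.stack([np.eye(d)] * T)
H = heavy_recursion(C, b, a, sigmas, init=np.eye(d))
```

The same function serves for both $H_{t}$ and $V_{t}$ by passing the corresponding intercept and diagonal vectors.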

Notes

1	Parzen kernel function: $k (x) = \{\begin{matrix} 1 - 6 x^{2} + 6 x^{3}, & 0 \leq x \leq 1 / 2 \\ 2 {(1 - x)}^{3}, & 1 / 2 < x \leq 1 \\ 0, & x > 1 \end{matrix}$
2	⊙ denotes the element-by-element (Hadamard) product of two matrices.
3	The company names are: American Express, Bank of America, Citigroup, Caterpillar, Chevron, Disney, Goldman Sachs, Home Depot, Honeywell, International Business Machines, Johnson and Johnson, JPMorgan Chase, Coca-Cola, McDonald's, Nike, Pfizer, Procter and Gamble, Verizon Communications, Walmart and Exxon Mobil.
4	Initial values of parameters in a new sample are set to be the posterior mean of the previous sample. This could make the Markov chain converge quickly and reduce the computation cost.
5	The h-period ahead predictive likelihood is the predictive density evaluated at the realized return $R_{t + h}$: $p (R_{t + h} | F_{t}, M) = \int p (R_{t + h} | Σ_{t + h}, Θ, M) p (Σ_{t + h} | Θ, F_{t}, M) p (Θ | F_{t}, M) d Θ$, which can be calculated based on MCMC outputs similar to Equation (27).
6	As indicated by Patton and Sheppard (2009), the portfolio formed from the true covariance matrix must have the smallest out-of-sample portfolio variance.
7	We only report the GMV portfolio results based on the IW-A model. CAW and HEAVY models provide similar results.
8	For example, it is unclear how small a difference in $σ_{p}$ should be regarded as significant. Besides Equation (39), tracking error portfolios (Patton and Sheppard 2009) and the utility-based framework (Fleming et al. 2003) are alternative measurements with different economic intuition.

References

Aït-Sahalia, Yacine, Jianqing Fan, and Dacheng Xiu. 2010. High-Frequency Covariance Estimates With Noisy and Asynchronous Financial Data. Journal of the American Statistical Association 105: 1504–17.
Amendola, Alessandra, Manuela Braione, Vincenzo Candila, and Giuseppe Storti. 2020. A model confidence set approach to the combination of multivariate volatility forecasts. International Journal of Forecasting 36: 873–91.
Andersen, Torben G., Tim Bollerslev, Francis X. Diebold, and Paul Labys. 2003. Modeling and Forecasting Realized Volatility. Econometrica 71: 579–625.
Asai, Manabu, and Michael McAleer. 2015. Forecasting co-volatilities via factor models with asymmetry and long memory in realized covariance. Journal of Econometrics 189: 251–62.
Aït-Sahalia, Yacine, and Loriano Mancini. 2008. Out of sample forecasts of quadratic variation. Journal of Econometrics 147: 17–33.
Bannouh, Karim, Dick van Dijk, and Martin Martens. 2009. Range-Based Covariance Estimation Using High-Frequency Data: The Realized Co-Range. Journal of Financial Econometrics 7: 341–72.
Barndorff-Nielsen, Ole E., Peter Reinhard Hansen, Asger Lunde, and Neil Shephard. 2009. Realized kernels in practice: Trades and quotes. Econometrics Journal 12: C1–C32.
Barndorff-Nielsen, Ole E., Peter Reinhard Hansen, Asger Lunde, and Neil Shephard. 2011. Multivariate realised kernels: Consistent positive semi-definite estimators of the covariation of equity prices with noise and non-synchronous trading. Journal of Econometrics 162: 149–69.
Barndorff-Nielsen, Ole E., and Neil Shephard. 2004. Econometric analysis of realized covariation: High frequency based covariance, regression, and correlation in financial economics. Econometrica 72: 885–925.
Bauer, Gregory, and Keith Vorkink. 2011. Forecasting multivariate realized stock market volatility. Journal of Econometrics 160: 79–109.
Cech, Frantisek, and Jozef Barunik. 2017. On the modelling and forecasting of multivariate realized volatility: Generalized heterogeneous autoregressive (GHAR) model. Journal of Forecasting 36: 181–206.
Chiriac, Roxana, and Valeri Voev. 2010. Modelling and forecasting multivariate realized volatility. Journal of Applied Econometrics 26: 922–47.
Christensen, Kim, Silja Kinnebrock, and Mark Podolskij. 2010. Pre-averaging estimators of the ex-post covariance matrix in noisy diffusion models with non-synchronous data. Journal of Econometrics 159: 116–33.
Corsi, Fulvio, Stefano Peluso, and Francesco Audrino. 2015. Missing in asynchronicity: A Kalman-EM approach for multivariate realized covariance estimation. Journal of Applied Econometrics 30: 377–97.
de Pooter, Michiel, Martin Martens, and Dick van Dijk. 2008. Predicting the daily covariance matrix for S&P 100 stocks using intraday data - But which frequency to use? Econometric Reviews 27: 199–229.
Dimson, Elroy. 1979. Risk measurement when shares are subject to infrequent trading. Journal of Financial Economics 7: 197–226.
Fan, Jianqing, Yingying Li, and Ke Yu. 2012. Vast volatility matrix estimation using high-frequency data for portfolio selection. Journal of the American Statistical Association 107: 412–28.
Fleming, Jeff, Chris Kirby, and Barbara Ostdiek. 2003. The economic value of volatility timing using realized volatility. Journal of Financial Economics 67: 473–509.
Golosnoy, Vasyl, Bastian Gribisch, and Roman Liesenfeld. 2012. The conditional autoregressive Wishart model for multivariate stock market volatility. Journal of Econometrics 167: 211–23.
Gourieroux, Christian, Joann Jasiak, and Razvan Sufana. 2009. The Wishart autoregressive process of multivariate stochastic volatility. Journal of Econometrics 150: 167–81.
Griffin, Jim E., and Roel C. A. Oomen. 2011. Covariance measurement in the presence of non-synchronous trading and market microstructure noise. Journal of Econometrics 160: 58–68.
Hansen, Peter, Jeremy Large, and Asger Lunde. 2008. Moving Average-Based Estimators of Integrated Variance. Econometric Reviews 27: 79–111.
Hansen, Peter R., and Asger Lunde. 2006. Realized variance and market microstructure noise. Journal of Business & Economic Statistics 24: 127–61.
Hansen, Peter R., Asger Lunde, and James M. Nason. 2011. The model confidence set. Econometrica 79: 453–97.
Hansen, Peter Reinhard, Asger Lunde, and Valeri Voev. 2014. Realized beta GARCH: A multivariate GARCH model with realized measures of volatility. Journal of Applied Econometrics 29: 774–99.
Hautsch, Nikolaus, Lada M. Kyj, and Roel C. A. Oomen. 2012. A blocking and regularization approach to high-dimensional realized covariance estimation. Journal of Applied Econometrics 27: 625–45.
Hayashi, Takaki, and Nakahiro Yoshida. 2005. On covariance estimation of non-synchronously observed diffusion processes. Bernoulli 11: 359–79.
Jacod, Jean, Yingying Li, Per A. Mykland, Mark Podolskij, and Mathias Vetter. 2009. Microstructure noise in the continuous case: The pre-averaging approach. Stochastic Processes and Their Applications 119: 2249–76.
Jin, Xin, and John Maheu. 2013. Modeling realized covariances and returns. Journal of Financial Econometrics 11: 335–69.
Jin, Xin, and John Maheu. 2016. Bayesian semiparametric modeling of realized covariance matrices. Journal of Econometrics 192: 19–39.
Jin, Xin, John Maheu, and Qiao Yang. 2019. Bayesian parametric and semiparametric factor models for large realized covariance matrices. Journal of Applied Econometrics 34: 641–60.
Lunde, Asger, Neil Shephard, and Kevin Sheppard. 2016. Econometric analysis of vast covariance matrices using composite realized kernels and their application to portfolio choice. Journal of Business & Economic Statistics 34: 504–18.
Noureldin, Diaa, Neil Shephard, and Kevin Sheppard. 2012. Multivariate high-frequency-based volatility (HEAVY) models. Journal of Applied Econometrics 27: 907–33.
Opschoor, Anne, Pawel Janus, Andre Lucas, and Dick Van Dijk. 2018. New HEAVY models for fat-tailed realized covariance and returns. Journal of Business & Economic Statistics 36: 643–57.
Patton, Andrew, and Kevin Sheppard. 2009. Evaluating volatility and correlation forecasts. Chapter 15. In Handbook of Financial Time Series. Berlin/Heidelberg: Springer, pp. 801–38.
Peluso, Stefano, Fulvio Corsi, and Antonietta Mira. 2015. A Bayesian high-frequency estimator of the multivariate covariance of noisy and asynchronous returns. Journal of Financial Econometrics 13: 665–97.
Scholes, Myron, and Joseph Williams. 1977. Estimating betas from nonsynchronous data. Journal of Financial Economics 5: 309–27.
Shen, Keren, Jianfeng Yao, and Wai Keung Li. 2020. Forecasting high-dimensional realized volatility matrices using a factor model. Quantitative Finance 20: 1879–87.
Tao, Minjing, Yazhen Wang, Qiwei Yao, and Jian Zou. 2011. Large volatility matrix inference via combining low-frequency and high-frequency approaches. Journal of the American Statistical Association 106: 1025–40.
Voev, Valeri, and Asger Lunde. 2007. Integrated Covariance Estimation using High-frequency Data in the Presence of Noise. Journal of Financial Econometrics 5: 68–104.
Xiu, Dacheng. 2010. Quasi-maximum likelihood estimation of volatility with high frequency data. Journal of Econometrics 159: 235–50.
Yu, Philip, Wai Keung Li, and Fo Chun Ng. 2017. The generalized conditional autoregressive Wishart model for multivariate realized volatility. Journal of Business & Economic Statistics 35: 513–27.
Zhang, Lan. 2011. Estimating covariation: Epps effect, microstructure noise. Journal of Econometrics 160: 33–47.
Zhang, Lan, Per A. Mykland, and Yacine Aït-Sahalia. 2005. A tale of two time scales: Determining integrated volatility with noisy high-frequency data. Journal of the American Statistical Association 100: 1394–411.

Table 1. List of RCOV measures.

Estimator	Description	Synchronization	$\bar{mean (RV)}$	$\bar{var (RV)}$	$\bar{mean (RC)}$	$\bar{var (RC)}$
${RC}_{300 s}$	5-min realized covariance	Previous-tick	2.4980	6.6307	0.8518	2.5601
${RC}_{600 s}$	10-min realized covariance	Previous-tick	2.4127	6.6639	0.8502	2.6526
${RC}_{60 s}$	1-min realized covariance	Previous-tick	2.8160	8.3463	0.8063	2.4790
$SRC {(20)}_{600 s}$	Average of 20 subsampled 10-min RC	Previous-tick	2.3851	6.5081	0.8389	2.5811
$SRC {(10)}_{600 s}$	Average of 10 subsampled 10-min RC	Previous-tick	2.3920	6.5578	0.8412	2.5992
$SRC {(10)}_{300 s}$	Average of 10 subsampled 5-min RC	Previous-tick	2.4959	6.9792	0.8545	2.6526
$SRC {(5)}_{300 s}$	Average of 5 subsampled 5-min RC	Previous-tick	2.5081	7.0812	0.8590	2.6889
$TSRC {(60, 10)}_{5 s}$	Two-scale RC ( $K = 60$ and $J = 10$ )	Previous-tick	2.3301	6.5662	0.8676	2.7205
$TSRC {(60, 1)}_{5 s}$	Two-scale RC ( $K = 60$ and $J = 1$ )	Previous-tick	2.2979	6.4629	0.8578	2.6576
$TSRC {(30, 1)}_{5 s}$	Two-scale RC ( $K = 30$ and $J = 1$ )	Previous-tick	2.3880	6.8016	0.8561	2.6138
$RCLL {(1)}_{60 s}$	1-min RC with 1 lead and 1 lag	Previous-tick	2.8061	8.2185	0.7951	2.4085
$RCLL {(1)}_{30 s}$	30-s RC with 1 lead and 1 lag	Previous-tick	2.6857	7.8036	0.8499	2.6106
$RCLL {(2)}_{60 s}$	1-min RC with 2 leads and 2 lags	Previous-tick	2.7165	7.8183	0.8246	2.4848
$RCLL {(2)}_{30 s}$	30-s RC with 2 leads and 2 lags	Previous-tick	2.6185	7.5414	0.8632	2.6984
RK	Multivariate realized kernel	Refresh time	2.5072	7.0398	0.8375	2.4260
${PARC}_{60 s}$	1-min pre-averaged RC	Previous-tick	2.0961	5.7833	0.8171	2.4796
${PARC}_{30 s}$	30-s pre-averaged RC	Previous-tick	2.1581	5.9694	0.8312	2.5417
${PARC}_{refresh}$	Refresh-time pre-averaged RC	Refresh time	2.2563	6.8222	0.8552	2.6614
PAHY	Pre-averaged Hayashi-Yoshida	-	2.4172	7.1006	0.8622	2.6612
QMLC	Quasi-maximum likelihood covariance	Refresh time	2.4760	6.7113	0.8120	2.1383

This table reports the mean and variance of the diagonal and off-diagonal elements of the 20-asset ex-post covariance matrix estimated using the 20 estimators. The sample period spans from 2 January 2004 to 31 December 2018. $\bar{mean (RV)}$ is the average of the 20 RV means. $\bar{var (RV)}$ is the average of the 20 RV variances. Similarly, $\bar{mean (RC)}$ and $\bar{var (RC)}$ represent the average of the 190 realized covariance (off-diagonal) means and the average of the 190 variances of realized covariances, respectively.

Table 2. Predictive likelihoods of return (IW-A model).

	10 Assets—Group A		10 Assets—Group B		20 Assets
Estimators	$LPL$	log-BF	$LPL$	log-BF	$LPL$	log-BF
${RC}_{300 s}$	−39,829.0	0	−42,229.0	0	−77,772.1	0
${RC}_{600 s}$	−39,628.1	200.9	−42,055.1	173.9	−77,885.2	−113.1
${RC}_{60 s}$	−40,737.5	−908.5	−43,127.4	−898.4	−80,200.9	−2428.8
$SRC {(20)}_{600 s}$	−39,596.2	232.8 *	−41,919.6	309.4	−77,167.0	605.1
$SRC {(10)}_{600 s}$	−39,605.6	223.4	−41,951.9	277.1	−77,183.5	588.6
$SRC {(10)}_{300 s}$	−39,771.2	57.8	−42,072.7	156.3	−77,640.9	131.2
$SRC {(5)}_{300 s}$	−39,815.7	13.3	−42,124.6	104.4	−77,710.8	61.3
$TSRC {(60, 10)}_{5 s}$	−39,727.4	101.6	−41,834.6	394.4 *	−76,982.2	789.9 *
$TSRC {(60, 1)}_{5 s}$	−39,489.8	339.2 *	−41,780.0	449.0 *	−76,920.8	851.3 *
$TSRC {(30, 1)}_{5 s}$	−39,681.7	147.3	−41,955.4	273.6	−77,402.6	369.5
$RCLL {(1)}_{60 s}$	−40,306.3	−477.3	−42,640.9	−411.9	−79,044.6	−1272.5
$RCLL {(1)}_{30 s}$	−40,726.1	−897.1	−43,105.5	−876.5	−80,180.7	−2408.6
$RCLL {(2)}_{60 s}$	−40,105.7	−276.7	−42,423.3	−194.3	−78,494.5	−722.4
$RCLL {(2)}_{30 s}$	−40,439.1	−610.1	−42,796.5	−567.5	−79,427.2	−1655.1
RK	−40,045.9	−216.9	−42,255.7	−26.7	−78,065.6	−293.5
${PARC}_{60 s}$	−39,369.0	460.0 *	−41,734.3	494.7 *	−76,676.5	1095.6 *
${PARC}_{30 s}$	−39,368.6	460.4 *	−41,718.6	510.4 *	−76,645.5	1126.6 *
${PARC}_{refresh}$	−39,539.9	289.1 *	−41,780.0	449.0 *	−76,869.1	903.0 *
PAHY	−39,851.0	−22.0	−42,101.1	127.9	−77,796.4	−24.3
QMLC	−40,123.0	−303.0	−42,487.0	−258.0	−79,838.1	−2066.0

The base RCOV measure for log-BF computation is ${RC}_{300 s}$. Bold numbers indicate the highest log-BF values, and the top 5 results are labelled with *.
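The log-BF columns are differences in log predictive likelihood relative to the base measure. The sketch below reproduces a few Group A entries of Table 2 from their $LPL$ values; the dictionary keys and the helper names `log_bayes_factors` and `top_k` are illustrative choices, not part of the paper.

```python
# LPL values copied from Table 2, 10 Assets (Group A); only a subset of rows.
lpl_group_a = {
    "RC_300s": -39829.0,
    "RC_600s": -39628.1,
    "SRC(20)_600s": -39596.2,
    "TSRC(60,1)_5s": -39489.8,
    "PARC_60s": -39369.0,
    "PARC_30s": -39368.6,
    "PARC_refresh": -39539.9,
}

def log_bayes_factors(lpl, base="RC_300s"):
    """log-BF of each estimator against the base: LPL_estimator - LPL_base."""
    return {name: round(value - lpl[base], 1) for name, value in lpl.items()}

def top_k(log_bf, k=5):
    """Names of the k estimators with the largest log-BF."""
    return sorted(log_bf, key=log_bf.get, reverse=True)[:k]

log_bf = log_bayes_factors(lpl_group_a)
best = top_k(log_bf)
```

A positive log-BF means the estimator yields a higher log predictive likelihood of returns than ${RC}_{300 s}$; a negative value means it does worse.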

Table 3. Predictive likelihoods of return (CAW model).

	10 Assets—Group A		10 Assets—Group B		20 Assets
Estimators	$LPL$	log-BF	$LPL$	log-BF	$LPL$	log-BF
${RC}_{300 s}$	−40,029.9	0	−42,360.9	0	−78,462.0	0
${RC}_{600 s}$	−39,914.5	115.4	−42,224.1	136.8	−79,214.4	−752.4
${RC}_{60 s}$	−40,605.9	−576.0	−43,032.1	−671.2	−79,991.6	−1529.6
$SRC {(20)}_{600 s}$	−39,802.0	227.9	−42,066.1	294.8	−77,962.2	499.8
$SRC {(10)}_{600 s}$	−39,832.4	197.5	−42,102.5	258.4	−78,009.5	452.5
$SRC {(10)}_{300 s}$	−39,920.7	109.2	−42,172.6	188.3	−78,101.3	360.7
$SRC {(5)}_{300 s}$	−39,941.6	88.3	−42,238.0	122.9	−78,158.5	303.5
$TSRC {(60, 10)}_{5 s}$	−39,861.6	168.3	−41,943.1	417.8 *	−77,511.8	950.2 *
$TSRC {(60, 1)}_{5 s}$	−39,674.3	355.6 *	−41,887.2	473.7 *	−77,369.6	1092.4 *
$TSRC {(30, 1)}_{5 s}$	−39,783.0	246.9 *	−42,029.4	331.5	−77,625.9	836.1 *
$RCLL {(1)}_{60 s}$	−40,282.2	−252.3	−42,626.6	−265.7	−79,029.9	−567.9
$RCLL {(1)}_{30 s}$	−40,597.2	−567.3	−42,991.9	−631.0	−79,899.0	−1437.0
$RCLL {(2)}_{60 s}$	−40,119.6	−89.7	−42,457.5	−96.6	−78,609.0	−147.0
$RCLL {(2)}_{30 s}$	−40,370.5	−340.6	−42,736.9	−376.0	−79,282.5	−820.5
RK	−40,091.6	−61.7	−42,312.9	−48.0	−78,275.0	187.0
${PARC}_{60 s}$	−39,629.7	400.2 *	−41,862.1	498.8 *	−78,231.3	230.7
${PARC}_{30 s}$	−39,626.9	403.0 *	−41,858.5	502.4 *	−77,794.6	667.4 *
${PARC}_{refresh}$	−39,705.8	324.1 *	−41,895.4	465.5 *	−77,478.9	983.1 *
PAHY	−39,972.8	57.1	−42,195.6	165.3	−78,172.6	289.4
QMLC	−40,173.9	−144.0	−42,385.0	−24.1	−78,575.9	−113.9

The base RCOV measure for log-BF computation is ${RC}_{300 s}$. Bold numbers indicate the highest log-BF values, and the top 5 results are labelled with *.

Table 4. Predictive likelihoods of return (HEAVY model).

	10 Assets—Group A		10 Assets—Group B		20 Assets
Estimators	$LPL$	log-BF	$LPL$	log-BF	$LPL$	log-BF
${RC}_{300 s}$	−39,638.9	0	−42,052.3	0	−77,260.1	0
${RC}_{600 s}$	−39,647.4	−8.5	−42,031.0	21.3	−77,162.8	97.3
${RC}_{60 s}$	−39,809.6	−170.7	−42,319.3	−267.0	−77,927.1	−667.0
$SRC {(20)}_{600 s}$	−39,567.3	71.6	−41,908.1	144.2	−76,975.6	284.5
$SRC {(10)}_{600 s}$	−39,557.0	81.9	−41,922.8	129.5	−77,005.5	254.6
$SRC {(10)}_{300 s}$	−39,565.3	73.6	−41,933.7	118.6	−77,069.7	190.4
$SRC {(5)}_{300 s}$	−39,546.3	92.6	−41,950.0	102.3	−77,064.5	195.6
$TSRC {(60, 10)}_{5 s}$	−39,471.5	167.4 *	−41,792.4	259.9 *	−76,780.5	479.6 *
$TSRC {(60, 1)}_{5 s}$	−39,418.4	220.5 *	−41,761.3	291.0 *	−76,711.8	548.3 *
$TSRC {(30, 1)}_{5 s}$	−39,451.8	187.1 *	−41,833.4	218.9	−76,874.3	385.8
$RCLL {(1)}_{60 s}$	−39,654.6	−15.7	−42,101.0	−48.7	−77,398.2	−138.1
$RCLL {(1)}_{30 s}$	−39,792.6	−153.7	−42,282.6	−230.3	−77,809.0	−548.9
$RCLL {(2)}_{60 s}$	−39,601.7	37.2	−42,022.2	30.1	−77,201.1	59.0
$RCLL {(2)}_{30 s}$	−39,695.6	−56.7	−42,145.1	−92.8	−77,513.1	−253.0
RK	−39,592.3	46.6	−41,949.6	102.7	−77,105.7	154.4
${PARC}_{60 s}$	−39,475.1	163.8	−41,773.8	278.5 *	−76,710.0	550.1 *
${PARC}_{30 s}$	−39,451.9	187.0 *	−41,758.0	294.3 *	−76,721.6	538.5 *
${PARC}_{refresh}$	−39,445.7	193.2 *	−41,770.9	281.4 *	−76,762.3	497.8 *
PAHY	−39,592.2	46.7	−41,898.7	153.6	−77,178.3	81.8
QMLC	−39,677.7	−38.8	−42,016.6	35.7	−77,325.0	−64.9

The base RCOV measure for log-BF computation is ${RC}_{300 s}$. Bold numbers indicate the highest log-BF values, and the top 5 results are labelled with *.

Table 5. Prior sensitivity check.

			Less Sparse Priors		More Sparse Priors
	$N (0, 10^{2})$		$N (0, 1)$		$N (0, 100^{2})$
Estimators	$LPL$	log-BF	$LPL$	log-BF	$LPL$	log-BF
${RC}_{300 s}$	−39,829.0	0	−39,829.3	0	−39,828.8	0
${RC}_{600 s}$	−39,628.1	200.9	−39,626.1	203.2	−39,624.6	204.2
${RC}_{60 s}$	−40,737.5	−908.5	−40,734.6	−905.3	−40,735.6	−906.8
$SRC {(20)}_{600 s}$	−39,596.2	232.8	−39,588.8	240.5	−39,589.3	239.5
$SRC {(10)}_{600 s}$	−39,605.6	223.4	−39,600.0	229.3	−39,601.5	227.3
$SRC {(10)}_{300 s}$	−39,771.2	57.8	−39,771.7	57.6	−39,774.1	54.7
$SRC {(5)}_{300 s}$	−39,815.7	13.3	−39,823.3	6.0	−39,820.7	8.1
$TSRC {(60, 10)}_{5 s}$	−39,727.4	101.6	−39,728.4	100.9	−39,727.9	100.9
$TSRC {(60, 1)}_{5 s}$	−39,489.8	339.2	−39,487.0	342.3	−39,487.8	341.0
$TSRC {(30, 1)}_{5 s}$	−39,681.7	147.3	−39,684.3	145.0	−39,683.0	145.8
$RCLL {(1)}_{60 s}$	−40,306.3	−477.3	−40,310.8	−481.5	−40,311.1	−482.3
$RCLL {(1)}_{30 s}$	−40,726.1	−897.1	−40,723.5	−894.2	−40,721.2	−892.4
$RCLL {(2)}_{60 s}$	−40,105.7	−276.7	−40,105.6	−276.3	−40,107.4	−278.6
$RCLL {(2)}_{30 s}$	−40,439.1	−610.1	−40,439.3	−610.0	−40,440.6	−611.8
RK	−40,045.9	−216.9	−40,043.8	−214.5	−40,040.2	−211.4
${PARC}_{60 s}$	−39,369.0	460.0	−39,368.8	460.5	−39,367.7	461.1
${PARC}_{30 s}$	−39,368.6	460.4	−39,368.1	461.2	−39,372.4	456.4
${PARC}_{refresh}$	−39,539.9	289.1	−39,539.4	289.9	−39,541.4	287.4
PAHY	−39,851.0	−22.0	−39,852.1	−22.8	−39,850.4	−21.6
QMLC	−40,123.0	−303.0	−40,122.2	−292.9	−40,120.8	−292.0

This table reports the density forecast results based on the IW-A model under three sets of priors.

Table 6. Predictive likelihoods of returns over long horizons.

	10 Assets—Group A				10 Assets—Group B				20 Assets
Estimators	${LPL}_{5}$	log-BF	${LPL}_{10}$	log-BF	${LPL}_{5}$	log-BF	${LPL}_{10}$	log-BF	${LPL}_{5}$	log-BF	${LPL}_{10}$	log-BF
${RC}_{300 s}$	−40,448	0	−40,727	0	−42,847	0	−43,105	0	−78,725	0	−79,125	0
${RC}_{600 s}$	−40,099	349 *	−40,287	440 *	−42,531	316	−42,710	395 *	−78,280	445	−78,628	497
${RC}_{60 s}$	−41,621	−1173	−42,003	−1276	−43,961	−1114	−44,294	−1189	−81,783	−3058	−82,465	−3340
$SRC {(20)}_{600 s}$	−40,192	256 *	−40,416	311 *	−42,570	277	−42,812	293	−78,196	529	−78,546	579
$SRC {(10)}_{600 s}$	−40,203	245	−40,428	299	−42,587	260	−42,828	277	−78,181	544	−78,529	596 *
$SRC {(10)}_{300 s}$	−40,485	−37	−40,772	−45	−42,811	36	−43,092	13	−78,905	−180	−79,365	−240
$SRC {(5)}_{300 s}$	−40,531	−83	−40,823	−96	−42,845	2	−43,127	−22	−78,969	−244	−79,446	−321
$TSRC {(60, 10)}_{5 s}$	−40,527	−79	−40,896	−169	−42,507	340 *	−42,746	359 *	−78,077	648 *	−78,457	668 *
$TSRC {(60, 1)}_{5 s}$	−40,179	269 *	−40,435	292 *	−42,501	346 *	−42,761	344	−78,124	601 *	−78,532	593
$TSRC {(30, 1)}_{5 s}$	−40,467	−19	−40,772	−45	−42,716	131	−42,984	121	−78,781	−56	−79,278	−153
$RCLL {(1)}_{60 s}$	−41,127	−679	−41,471	−744	−43,433	−586	−43,751	−646	−80,534	−1809	−81,136	−2011
$RCLL {(1)}_{30 s}$	−41,624	−1176	−42,013	−1286	−43,965	−1118	−44,332	−1227	−81,822	−3097	−82,517	−3392
$RCLL {(2)}_{60 s}$	−40,881	−443	−41,202	−475	−43,172	−325	−43,471	−366	−79,887	−1162	−80,443	−1318
$RCLL {(2)}_{30 s}$	−41,304	−856	−41,667	−940	−43,634	−787	−43,983	−878	−81,014	−2289	−81,655	−2530
RK	−40,936	−488	−41,308	−581	−43,073	−226	−43,370	−265	−79,477	−752	−80,011	−886
${PARC}_{60 s}$	−39,868	580 *	−40,024	703 *	−42,286	561 *	−42,491	614 *	−77,479	1246 *	−77,729	1396 *
${PARC}_{30 s}$	−39,925	523 *	−40,117	610 *	−42,324	523 *	−42,543	562 *	−77,576	1149 *	−77,870	1255 *
${PARC}_{refresh}$	−40,311	137	−40,592	135	−42,493	354 *	−42,744	361 *	−78,100	625 *	−78,497	628 *
PAHY	−40,763	−315	−41,093	−366	−42,918	−71	−43,203	−98	−79,343	−618	−79,986	−861
QMLC	−40,971	−523	−41,378	−651	−43,553	−706	−43,906	−801	−81,557	−2832	−82,019	−2894

The base RCOV measure for log-BF computation is ${RC}_{300 s}$. Bold numbers indicate the highest log-BF values, and the top 5 results are labelled with *.

Table 7. Standard deviations of global minimum variance portfolio returns (IW-A model).

	10 Assets—Group A		10 Assets—Group B		20 Assets
Estimators	$σ_{GMV}$	$p_{MCS}$	$σ_{GMV}$	$p_{MCS}$	$σ_{GMV}$	$p_{MCS}$
${RC}_{300 s}$	0.6889	0.363 *	0.8003	0.151	0.6631	0.915 *
${RC}_{600 s}$	0.6850	0.548 *	0.7903	0.875 *	0.6867	0.050
${RC}_{60 s}$	0.7031	0.057	0.8028	0.072	0.6770	0.132
$SRC {(20)}_{600 s}$	0.6843	0.385 *	0.7922	0.631 *	0.6608	0.960 *
$SRC {(10)}_{600 s}$	0.6813	0.628 *	0.7914	0.860 *	0.6615	0.929 *
$SRC {(10)}_{300 s}$	0.6929	0.180	0.7930	0.474 *	0.6626	0.891 *
$SRC {(5)}_{300 s}$	0.6893	0.383 *	0.7898	0.875 *	0.6616	0.943 *
$TSRC {(60, 10)}_{5 s}$	0.6905	0.371 *	0.7892	0.875 *	0.6603	0.960 *
$TSRC {(60, 1)}_{5 s}$	0.6881	0.385 *	0.7869	1.000 *	0.6593	0.970 *
$TSRC {(30, 1)}_{5 s}$	0.6938	0.298 *	0.7890	0.875 *	0.6662	0.686 *
$RCLL {(1)}_{60 s}$	0.6962	0.245 *	0.7977	0.112	0.6706	0.431 *
$RCLL {(1)}_{30 s}$	0.7022	0.106	0.8090	0.013	0.6788	0.172
$RCLL {(2)}_{60 s}$	0.6940	0.332 *	0.7931	0.665 *	0.6661	0.792 *
$RCLL {(2)}_{30 s}$	0.6982	0.186	0.8039	0.041	0.6745	0.226 *
RK	0.6993	0.069	0.7987	0.274 *	0.6704	0.351 *
${PARC}_{60 s}$	0.6786	1.000 *	0.7911	0.875 *	0.6591	0.970 *
${PARC}_{30 s}$	0.6805	0.628 *	0.7899	0.875 *	0.6585	1.000 *
${PARC}_{refresh}$	0.6907	0.362 *	0.7892	0.875 *	0.6632	0.869 *
PAHY	0.6966	0.095	0.7968	0.194	0.6689	0.276 *
QMLC	0.7049	0.077	0.8161	0.022	0.6713	0.580 *

$σ_{GMV}$ is the standard deviation of the global minimum variance portfolio's returns. $p_{MCS}$ is the p-value from the model confidence set test. p-values marked with * indicate that the estimator belongs to the best group at the 75% confidence level.
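For reference, the textbook global minimum variance weights are $w = Σ^{-1} ι / (ι' Σ^{-1} ι)$, where $ι$ is a vector of ones. The sketch below computes these weights and the standard deviation of the resulting portfolio returns. It assumes the unconstrained case (short positions allowed); the function names and input layout are hypothetical, and the paper's exact portfolio construction may differ.

```python
import numpy as np

def gmv_weights(cov):
    """Unconstrained global minimum variance weights for one covariance matrix."""
    ones = np.ones(cov.shape[0])
    x = np.linalg.solve(cov, ones)   # Sigma^{-1} * iota without forming the inverse
    return x / x.sum()               # normalize so the weights sum to one

def gmv_return_std(cov_forecasts, returns):
    """Standard deviation of GMV portfolio returns.

    cov_forecasts: iterable of (N, N) predicted covariance matrices, one per day
    returns:       iterable of length-N realized return vectors for the same days
    (both layouts are assumptions of this sketch)
    """
    port = np.array([gmv_weights(S) @ r for S, r in zip(cov_forecasts, returns)])
    return port.std(ddof=1)
```

A lower $σ_{GMV}$ indicates that the covariance forecasts built on a given RCOV measure produced a less volatile minimum variance portfolio out of sample.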

Table 8. Predictive likelihoods of close-to-close return.

	HEAVY Model				CAW Model
	Group A		Group B		Group A		Group B
Estimators	$LPL$	log-BF	$LPL$	log-BF	$LPL$	log-BF	$LPL$	log-BF
${RC}_{300 s}$	−44,011.3	0	−46,244.7	0	−44,793.1	0	−46,941.3	0
${RC}_{600 s}$	−43,993.3	18.0	−46,221.2	23.5	−44,699.1	94.0	−46,852.9	88.4
${RC}_{60 s}$	−44,150.2	−138.9	−46,437.0	−192.3	−45,253.9	−460.8	−47,442.5	−501.2
$SRC {(20)}_{600 s}$	−43,966.0	45.3	−46,165.9	78.8	−44,706.1	87.0	−46,789.0	152.3
$SRC {(10)}_{600 s}$	−43,970.9	40.4	−46,170.4	74.3	−44,660.1	133.0	−46,795.7	145.6
$SRC {(10)}_{300 s}$	−43,972.4	38.9	−46,178.5	66.2	−44,755.3	37.8	−46,875.0	66.3
$SRC {(5)}_{300 s}$	−43,970.8	40.5	−46,188.0	56.7	−44,746.1	47.0	−46,855.5	85.8
$TSRC {(60, 10)}_{5 s}$	−43,952.6	58.7 *	−46,114.8	129.9 *	−44,755.7	37.4	−46,696.7	244.6 *
$TSRC {(60, 1)}_{5 s}$	−43,918.9	92.4 *	−46,107.0	137.7 *	−44,578.4	214.7 *	−46,658.9	282.4 *
$TSRC {(30, 1)}_{5 s}$	−43,937.1	74.2 *	−46,146.7	98.0	−44,640.9	152.2 *	−46,739.5	201.8
$RCLL {(1)}_{60 s}$	−44,037.0	−25.7	−46,278.4	−33.7	−44,984.4	−191.3	−47,152.1	−210.8
$RCLL {(1)}_{30 s}$	−44,141.2	−129.9	−46,419.4	−174.7	−45,225.7	−432.6	−47,408.8	−467.5
$RCLL {(2)}_{60 s}$	−44,002.9	8.4	−46,232.8	11.9	−44,870.4	−77.3	−47,024.7	−83.4
$RCLL {(2)}_{30 s}$	−44,071.2	−59.9	−46,314.9	−70.2	−45,076.9	−283.8	−47,215.3	−274.0
RK	−44,014.5	−3.2	−46,199.5	45.2	−44,850.0	−56.9	−46,881.7	59.6
${PARC}_{60 s}$	−43,962.9	48.4	−46,127.4	117.3 *	−44,636.2	156.9 *	−46,654.3	287.0 *
${PARC}_{30 s}$	−43,953.1	58.2 *	−46,120.6	124.1 *	−44,588.0	205.1 *	−46,650.7	290.6 *
${PARC}_{refresh}$	−43,932.8	78.5 *	−46,118.3	126.4 *	−44,609.7	183.4 *	−46,643.8	297.5 *
PAHY	−44,020.5	−9.2	−46,178.3	66.4	−44,780.9	12.2	−46,829.8	111.5
QMLC	−44,096.5	−85.2	−46,247.2	−2.5	−44,925.6	−132.5	−46,951.8	−10.5

This table reports the density forecast results for close-to-close returns in the two 10-asset cases. The base RCOV measure for log-BF computation is ${RC}_{300 s}$. Bold numbers indicate the highest log-BF values, and the top 5 results are labelled with *.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
