Next Article in Journal
Profitability Determinants of Financial Institutions: Evidence from Banks in Pakistan
Next Article in Special Issue
Multi-Factor Asset-Pricing Models under Markov Regime Switches: Evidence from the Chinese Stock Market
Previous Article in Journal
The Impact of Brexit on the Stock Markets of the Greater China Region
Previous Article in Special Issue
An Empirical Examination of the Incremental Contribution of Stock Characteristics in UK Stock Returns
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Risk-Based Portfolios with Large Dynamic Covariance Matrices

1
Nomura Asset Management Co., Ltd. 1-12-1 Nihonbashi, Chuo-ku, 103-0027 Tokyo, Japan
2
Graduate School of Business Sciences, University of Tsukuba, 3-29-1 Otsuka, Bunkyo-ku, 112-0012 Tokyo, Japan
3
Department of Risk Engineering, University of Tsukuba, 1-1-1 Tennodai, 305-8577 Tsukuba, Japan
*
Author to whom correspondence should be addressed.
Int. J. Financial Stud. 2018, 6(2), 52; https://doi.org/10.3390/ijfs6020052
Submission received: 15 February 2018 / Revised: 9 May 2018 / Accepted: 10 May 2018 / Published: 14 May 2018
(This article belongs to the Special Issue Asset Pricing and Portfolio Choice)

Abstract

:
In the field of portfolio management, practitioners are focusing increasingly on risk-based portfolios rather than on mean-variance portfolios. Risk-based portfolios are constructed based solely on covariance matrices, and include methods such as minimum variance (MV), risk parity (RP), and maximum diversification (MD). It is well known that the performance of a mean-variance portfolio depends on the accuracy of the estimations of the inputs. However, no studies have examined the relationship between the performance of risk-based portfolios and the estimated accuracy of covariance matrices. In this research, we compare the performance of risk-based portfolios for several estimation methods of covariance matrices in the Japanese stock market. In addition, we propose a highly accurate estimation method called cDCC-NLS, which incorporates nonlinear shrinkage into the cDCC-GARCH model. The results confirm that (1) the cDCC-NLS method shows the best estimation accuracy, (2) the RP and MD do not depend on the estimation accuracy of the covariance matrix, and (3) the MV does depend on the estimation accuracy of the covariance matrix.
JEL Classification:
C13; C32; G11

1. Introduction

In the field of portfolio management, practitioners are focusing increasingly on risk-based portfolios, which use methods such as the minimum variance (MV), risk parity (RP), and maximum diversification (MD) to construct portfolios based solely on covariance matrices. Risk-based portfolios outperform traditional mean-variance portfolios (Markowitz 1952) and market capitalization-weighted portfolio, as shown in various empirical studies on stock selection (De Carvalho et al. 2012) and asset allocation (Chaves et al. 2011). As a result, the number of investment trusts and pension funds designed based on this concept are increasing.
It is well known that the performance of a mean-variance portfolio depends on the accuracy of the estimations of the inputs (Michaud 1989). In contrast, risk-based portfolios do not need to estimate expected returns, which are notoriously unstable and difficult to predict (Merton 1980). This is one of the main motivations for using a risk-based portfolio rather than a mean-variance portfolio. Recently, Ardia et al. (2017) estimated the accuracy of covariance matrices in terms of the weights of several risk-based portfolios. However, to the best of our knowledge, no studies have examined the relation between the performance of risk-based portfolios and the estimated accuracy of covariance matrices.
In this research, we compare the performances of risk-based portfolios for several estimation methods of covariance matrices. In addition, we propose a highly accurate estimation method that combines nonlinear shrinkage and the cDCC-GARCH model, called the cDCC-NLS method.
To estimate the covariance matrix in financial time series, it is necessary consider two important aspects: the cross section and the time series. With regard to the cross section, we have the difficulty of correcting the biases of the sample covariance matrix eigenvalues in a large number of time series. A popular method used to achieve this is the linear shrinkage method (LS) and its successor, the nonlinear shrinkage method (NLS), proposed by Ledoit and Wolf (2012). With regard to the time series aspect, we have to account for volatility clustering and time-varying correlations. To introduce a time-varying volatility and correlation structure, Engle (2002) proposed using the dynamic conditional correlation GARCH (DCC) model. Aielli (2013) modified the DCC model, proposing the corrected DCC-GARCH (cDCC) model. We combine the cDCC model and the NLS method to deliver an improved estimation of the covariance matrix. We compare various (c)DCC models (DCC, cDCC, DCC-LS, cDCC-LS, DCC-NLS, and cDCC-NLS) to estimate the accuracy of each method. To test whether the estimation accuracy of the covariance matrices affects the performance of the risk-based portfolio, we compare the accuracy of each method, including the proposed method. Then, we check whether the improvement affects the performance of risk-based portfolios in the Japanese stock market.
The proposed method (cDCC-NLS) offers the best estimation accuracy of the six methods. Furthermore, we confirm that, unlike MV, RP and MD do not depend on the estimation accuracy of the covariance matrix.
The remainder of the paper is organized as follows. Section 2 gives a brief description of risk-based portfolios and the estimation method using covariance matrices. Section 3 describes the proposed method, which consists of a cDCC-GARCH model and the nonlinear shrinkage method. Section 4 discusses a simulation of a large number of stocks in Japanese markets. Lastly, Section 5 concludes the paper.

2. Related Works

2.1. Risk-Based Portfolios

In this section, we summarize the well-known risk-based portfolios examined in previous research. Suppose there are N risky assets, with rates of returns given by the random variables R = ( R 1 , , R N ) T . Let w = ( w 1 , , w N ) T be a weighting vector for each asset, μ = ( μ 1 , , μ N ) T be an expected return vector, and Σ = E [ ( R μ ) ( R μ ) T ] be a covariance matrix. Thus, a portfolio’s variance and risk (standard deviation) can be expressed as σ P 2 = w T Σ w .

2.1.1. The Minimum Variance Portfolio

An MV portfolio has the lowest risk of portfolios based on the mean-variance method developed by Markowitz (1952). Specifically, the optimization problem is to minimize the portfolio variance subject to short-sales constraints and a budget constraint in which the sum of the weights is 1 , denoting a vector of ones of dimension N × 1 :
min w σ P 2 = w T Σ w
s . t , w T 1 = 1
w > 0 .
In the absence of the short-sales constraints (Equation (3)), the analytical solution is w = Σ 1 1 1 T Σ 1 1 . The MV portfolio is an efficient portfolio in terms of the mean-variance if the expected returns of all assets are the same.
Haugen and Baker (1991) and Clarke et al. (2006) investigated the MV portfolio and showed that the cumulative excess returns of MV portfolios in the U.S. stock market have been slightly higher than those of the market over the previous 42 years. Scherer (2011) showed that an MV portfolio tends to hold a low beta and low residual-risk stocks. These results are known as low volatility anomalies.

2.1.2. The Risk Parity Portfolio

The RP portfolio assigns the same risk budget to each asset (Qian 2005). This way, no asset class can be dominant in driving the volatility of the portfolio. We present a rigorous mathematical definition for the RP portfolio using the marginal risk contribution (MRC) and the risk contribution (RC), following Maillard et al. (2010). The definition characterizes the RP portfolio in terms of an equal RC for each asset, as follows:
M R C = σ P w = Σ w σ P , M R C i = ( Σ w ) i σ P
σ P = i = 1 N w i × M R C i = w T M R C
R C i = w i × M R C i σ P .
Maillard et al. (2010) propose the following optimization problem (7)–(9) to determine the optimal RP portfolio weights when short selling and when leverage is not allowed. In addition, Maillard et al. (2010) show that the optimization program in (7)–(9) is a convex program and that it has a unique local minimum.
min w i = 1 N j = 1 N R C i R C j 2
s . t , w T 1 = 1
w > 0

2.1.3. The Maximum Diversification Portfolio

Choueifaty and Coignard (2008) proposed the MD portfolio to benefit most from the diversification effect of the portfolio. They define the diversification ratio of any portfolio as D R ( w ) . Here, D R ( w ) is the ratio of the weighted average of the volatilities divided by the portfolio volatility. The MD portfolio maximizes the diversification ratio. Intuitively, this portfolio incorporates more low-correlated assets. From the viewpoint of the Sharpe ratio-maximized portfolio, it tries to minimize the volatility and the correlation, rather than the expected return.
max w D R ( w ) = i = 1 N w i σ i σ P .
In order to maximize D R ( w ) , it is necessary to reduce the denominator or to increase the numerator. The numerator and the denominator are linked through the volatility term. Because the denominator has a correlation term, a portfolio with a low correlation is constructed as a result.

2.2. Estimation Method of Covariance Matrices

The difficulty of estimating covariance matrices in financial time series arises from two aspects: the cross section and time series.
In the cross section, the challenge is to correct the biases of the sample covariance matrix eigenvalues. When the number of assets under consideration is large, especially relative to the number of historical return observations, the sample covariance matrix is estimated with a large error. This is caused by the bias of the eigenvalues of the sample covariance matrix. In order to reduce the estimation error, the linear shrinkage method of Ledoit and Wolf (2004) is often used. This method uniformly corrects all biases of the eigenvalues of the sample covariance matrix.
In general, the eigenvalues of a small sample are estimated as too small, and those of a large sample are estimated as too large. Therefore, the nonlinear shrinkage method of Ledoit and Wolf (2012) individually corrects each bias of the sample eigenvalues using the random matrix theory. In time series, the challenge is to account for volatility clusteringand time-varying correlations.
A model that has been shown to be successful in capturing volatility clustering is the autoregressive conditional heteroscedasticity (ARCH) model, introduced by Engle (1982). Then, Bollerslev (1986) generalized the ARCH model (the GARCH model) to make it more realistic.
To introduce a time-varying correlation structure to these conditional variance models, Engle (2002) proposed the dynamic conditional correlation GARCH (DCC) model. However, Aielli (2013) reveals a weak point of the DCC model, in that it possesses a significant asymptotic bias in the estimator of the sample covariance matrix, which is a constituent of the correlation evolution process. Aielli (2013) subsequently modified the DCC model, proposing the corrected DCC-GARCH (cDCC) model.
On the other hand, it is well known that estimating the DCC model with a large number of assets is difficult. One of the reasons for this is that we need to invert the conditional correlation matrix when we calculate the log-likelihood function of the DCC model in order to estimate the parameters. Finding the inverse matrix has a computational load of O ( N 3 ) . Fortunately, Engle et al. (2008) discovered a way to overcome this hurdle, called the composite likelihood method. The composite likelihood is computed by summing the log-likelihood functions of all contiguous pairs. This requires a computational load of O ( N ) . We summarize each method in the next section.

2.2.1. Nonlinear Shrinkage

The population covariance matrix Σ is a non-random t by N-dimensional positive definite matrix. Let X t be a t by N matrix of real independent and identically distributed (i.i.d.) random variables, with a zero mean and unit variance. We only observe Y t = X t Σ 1 / 2 . Thus, neither X t nor Σ are observed on their own.
Let Σ ^ = Σ ^ ( Y t ) be an estimator of Σ . Then, the estimator is said to be rotation-equivariant if it satisfies Σ ^ ( AY t ) = A Σ ^ A T for any orthogonal matrix A . The class of rotation-equivariant estimators of the covariance matrix comprises all estimators that have the same eigenvectors as the sample covariance matrix. Thus, every rotation-equivariant estimator Σ ^ is of the form V t D t V t T where D t is a diagonal matrix with eigenvalues ( d 1 , , d t ) as its elements, and where V t is a matrix with the ith column as the sample eigenvector. Ledoit and Wolf (2012) found the best Σ ^ under the following loss function in the rotation-equivariant estimator.
min D t V t D t V t T Σ
Here, x is the Frobenius norm, defined as M = T r ( M M T ) r for any r × m matrix M . Its solution is D t * = diag ( d 1 , , d t ) , where d i = v i T Σ v i and v i is the i th column of V t . Therefore, the best rotation-equivariant estimator of the covariance matrix is S t = V t D t * V t T .
Ledoit and Péché (2011) showed that d i can be approximated under the following conditions. Let H T be the population spectral distribution and let F T be the limiting spectral distribution. We assume that H T and F T converge to the non-random limiting spectral distribution H and F, respectively. Here, τ i , T denotes the population eigenvalues of Σ , and λ i , T denotes the sample eigenvalues of Σ ^ .
H T ( x ) = 1 N i = 1 N 1 { x τ i , T } , x R
F T ( x ) = 1 N i = 1 N 1 { x λ i , T } , x R
H T ( x ) H
F T ( x ) F .
From the random matrix theory, m F is quantitatively related to H, where m F is the Stieltjes transform of F, and c is p / n (Silverstein 1995).
m F ( z ) = 1 λ z d F ( λ ) , z C +
m F ( z ) = 1 τ ( 1 c c z m F ( z ) ) z d H ( τ )
m ^ F ( λ ) = lim z C + λ m F ( z )
From the above, d i can be approximated as follows:
d i = λ i | 1 c c λ i m ^ F ( λ i ) | 2 .

2.2.2. DCC-GARCH Model

Suppose there are N risky assets. Let r t = ( r 1 t , , r N t ) T be the return vector at time t ( 1 t T ) . Then, E ( r t | F t 1 ) = 0 is defined as the conditional mean vector and V ( r t | F t 1 ) = H t is defined as the conditional covariance matrix. We assume that the joint distribution of returns follows a multivariate Gaussian distribution. Then, F t 1 is the σ -algebra that contains information of what happened up to time t 1 .
Engle (2002) defines the DCC model using Equations (21)–(25). Here, H t is decomposed into D t 2 , the conditional variance of each asset, and R t , the conditional correlation matrix.1
In the DCC model, both the variance and the correlations between the assets are designed to be time-varying. The elements in the diagonal matrix D t are standard deviations from univariate GARCH models. Note that the univariate GARCH models can have different orders. Often the simplest model, GARCH(1,1) is adequate. Hereafter, we assume D t is driven by GARCH(1,1). Then, R t is decomposed into Q t to ensure that all elements are less than or equal to one, by definition of the correlation. Several conditions are imposed on the parameters a and b in order to guarantee that H t is positive definite. The scalar values a and b must satisfy: a > 0 , b > 0 and a + b < 1 . In addition, the starting value of Q t has to be positive definite to guarantee that H t is positive definite. In addition, S is the unconditional covariance matrix of the standardized errors, ε t .
r t | F t 1 N ( 0 , H t )
H t = D t R t D t
D t 2 = diag ( ω i ) + diag ( α i ) r t 1 T r t 1 + diag ( β i ) D t 1 2
ε t = D t 1 r t
R t = Q t 1 2 Q t Q t 1 2
Q t = S ( 1 a b ) + a ε t 1 T ε t 1 + b Q t 1
Divide the parameters of DCC into two groups: θ = ( θ 1 , , θ N ) , where θ i = ( ω i , α i , β i ) are the parameters of the univariate GARCH model for the ith asset series, and the correlation structure in Equation (24) is driven by Equation (25), which has the parameters ϕ = ( a , b ) .
When r t follows a multivariate Gaussian distribution, the log-likelihood function for L ( θ , ϕ ) can be written as the sum of the volatility part L V ( θ ) and the correlation part L C ( θ , ϕ ) .
L ( θ , ϕ ) = L V ( θ ) + L C ( θ , ϕ )
L V ( θ ) = 1 2 t = 1 T ( n ln 2 π + ln | D t | 2 + r t T D t 2 r t )
L C ( θ , ϕ ) = 1 2 t = 1 T ( ln | R t | + ε t T R t 1 ε t ε t T ε t )
However, estimating the correctly specified log-likelihood is difficult. Hence, the DCC model is designed to allow for a two-step estimation.
Compared with the log-likelihood in the univariate case, we find that the log-likelihood in L V ( θ ) is the sum of the log-likelihoods of the univariate GARCH equations of N assets. Thus, the parameters of the different univariate models may be determined separately. In the first step, the parameters of the univariate GARCH models are estimated for each asset series. From this step, the parameter set θ is estimated. When estimating θ , we can also estimate the standardized errors ε t , which can then be used to estimate S . In the second step, the parameter ϕ is estimated using θ and S . The method used to estimated S with the standardized errors ε t is called correlation targeting (Engle 2009). Here, S can be estimated using the sample covariance matrix of ε t
Step 1
Maximize the log-likelihood L V ( θ ) to determine D t ^ .
θ ^ = max θ L V ( θ ) .
Step 2
After estimating θ , estimate the standardized errors ε t . Here, S can be estimated using the moment estimator S ^ = 1 2 t = 1 T ε ^ t ε ^ t T , ε ^ t = D ^ t 1 r t .
Then, maximize the log-likelihood L C ( θ , ϕ ) to determine ϕ .
ϕ ^ = max ϕ L C ( θ ^ , ϕ ) .

2.2.3. cDCC-GARCH Model

Aielli (2013) reveals a weak point in the DCC model. The correlation targeting estimation is based on the assumption that Q ¯ = 1 T t = 1 T Q t is equal to S ^ . However, this assumption cannot be justified because the unconditional expectation of Q t is not equal to the unconditional expectation of ε ^ t ε ^ t T .
Thus, Aielli (2013) proposes a corrected DCC-GARCH (cDCC) model, which is tractable and easy to implement. In this model, the conditional correlation driving the process Q t is defined as follows:
Q t = Ψ ( 1 a b ) + a ε t 1 ε t 1 T + b Q t 1
ε t = diag ( Q t ) 1 2 ε t
The form of the correlation driving the process of the DCC model is modified in such a way that the unconditional expectation of Q t is equal to the unconditional expectation of ε t ε t T . Then, Ψ can be estimated consistently using the sample covariance matrix of ε t .
The estimation of the cDCC model is based on maximizing the same log-likelihood function as that of the DCC model. In the first step, the parameter vector θ of the univariate GARCH models is estimated. Using θ , estimate the modified errors ε t , the sample covariance matrix of which can be used to estimate Ψ . In the second step, the parameter ϕ is estimated using θ ^ .
Aielli (2013) performed a simulation study, showing that there is a considerable bias in the estimator of S for the DCC, even when the cross-section size is small. However, he also shows there is a downward bias in a and b, which is a recognized problem when using DCC with large systems (Engle 2002).

2.2.4. Composite Likelihood

It is well known that estimating the (c)DCC model in the case of a large number of assets is difficult. One of the reasons for this is that we need to invert the conditional correlation matrix when we maximize the log-likelihood L C ( θ , ϕ ) . Finding the inverse matrix has a computational load of O ( N 3 ) . Engle et al. (2008) overcome this problem using what they call the composite likelihood method, which can be traced back to Lindsay (1988). The composite likelihood of the (c)DCC model is based on subsets of observations, for example, a subset of two assets’ returns, Y jt = ( r 1 j , t , r 2 j , t ) T , j = 1 , N . The conditional variance of Y jt is H jt .2 Subsets can be selected in various ways: all possible combinations of pairs of assets ( O ( N 2 ) ), contiguous pairs ( O ( N ) ), or subsets of pairs selected randomly ( O ( 1 ) ). The simulation study in Engle et al. (2008) shows that the results for different types of subsets are approximately the same. We use the composite likelihood estimator based on all contiguous pairs because it is the most scalable of those available.
Y jt | F t 1 N ( 0 , H jt )
H jt = D jt R jt D jt
D jt 2 = diag ( ω i ) + diag ( α i ) Y jt 1 T Y jt 1 + diag ( β i ) D jt 1 2
ε jt = D jt 1 Y jt
Q jt = S j ( 1 a b ) + a ε jt 1 T ε jt 1 + b Q jt 1
R jt = Q jt 1 2 Q jt Q jt 1 2
Here, S ^ can be estimated using the moment estimator S ^ = 1 2 t = 1 T ε ^ jt ε ^ jt T . The parameter vector ϕ ^ = ( a , b ) is estimated by maximizing the average of the log-likelihood function L C L ( θ ^ , ϕ ) among all contiguous pairs, rather than maximizing max ϕ L C ( θ ^ , ϕ ) in Step 2.
ϕ ^ = max ϕ L C L ( θ ^ , ϕ )
L C L ( θ ^ , ϕ ) = t = 1 T 1 N j = 1 N log L j , t ( ϕ )
L j , t ( ϕ ) = 1 2 t = 1 T ( ln | R jt | + ε jt T R jt 1 ε jt ε jt T ε jt )

3. Combining Nonlinear Shrinkage and the cDCC-GARCH Model

As described above, the estimation of the covariance matrix is problematic in terms of the cross section and the time series. To combine these two aspects, Hafner and Reznikova (2012) proposed an estimation method combining linear shrinkage and the DCC-GARCH model (DCC-LS), and Engle et al. (2017) proposed an estimation method combining nonlinear shrinkage and the DCC-GARCH model (DCC-NLS). We propose a highly accurate estimation method that combines nonlinear shrinkage and the cDCC-GARCH model, called the cDCC-NLS method. To summarize, the cDCC-NLS estimation follows a two-step process:
Step 1
For each asset, fit a univariate GARCH(1,1) model.
Step 2
Estimate the unconditional covariance matrix of the standardized errors using the nonlinear shrinkage method, and then maximize the composite likelihood of the cDCC model.
In the estimation of Step 2, Hafner and Reznikova (2012) used the sample covariance matrix with the linear shrinkage estimator (DCC-LS). Engle et al. (2017) later replaced this with the nonlinear shrinkage estimator (DCC-NLS). We combine nonlinear shrinkage and the cDCC-GARCH model and confirm its advantages in the next section.

4. Simulation Study

In order to check whether the performance of risk-based portfolios depends on the estimation accuracy of the covariance matrices, it is necessary to confirm (1) the estimation accuracy of each method, and (2) the effect of the estimation accuracy on performance.
In this section, we first compare the estimation accuracy of the covariance matrix using various (c)DCC models. We evaluate the covariance matrix estimation accuracy following Engle et al. (2017). Next, we examine the effect of the difference in the estimation accuracy on the performance of the risk-based portfolios.

4.1. Monte Carlo Study

In this section, we perform a Monte Carlo simulation to compare the estimation accuracy of the parameters of various (c)DCC models: the DCC, cDCC, DCC-LS, cDCC-LS, DCC-NLS, and cDCC-NLS models. Engle et al. (2017) defines an error function between the true covariance matrix generated by a (c)DCC model and the estimation. On the other hand, because of the sample estimator, the LS and NLS methods do not handle true covariance matrices at each time point, which means the errors cannot be calculated in the same way as in the (c)DCC model. Therefore, we limit our discussion to the (c)DCC model.
In order to compare our results with those of previous studies (Hafner and Reznikova 2012; Engle et al. 2017), we run a simulation study using the same setup. Let the covariance matrix at each time point of the DCC model, given a certain unconditional covariance and parameters, be a true covariance matrix. We generate random numbers according to the true DCC model, which we then use to estimate the covariance matrix. An error between the estimated covariance matrix and the true covariance matrix is defined as the estimation accuracy.
As in the previous studies, we estimate the unconditional population covariance matrix from the N { 100 , 500 , 1000 } most liquid stocks in the TOPIX index, using 60 months of daily data from January 2005 through December 2009. We then simulate a (c)DCC model using the unconditional population covariance matrix and the parameters a = 0.05 and b = 0.93 . The univariate volatility dynamics are GARCH(1,1) models with identical parameters ( ω i , α i , β i ) = ( 0.01 , 0.05 , 0.9 ) across all stocks i = 1 , , N . For each simulation, we generate an N × T matrix of simulated returns, where the sample size is T = 1250 . We run the simulation study for the same setup, with the exception of the unconditional population covariance matrix. Because we study the Japanese stock market in the next section, we use data on Japanese stocks to define the true unconditional covariance matrix.
First, we check the estimation accuracy of the (c)DCC model parameters a = 0.05 and b = 0.93 . The estimation results for each model are shown in Table 1 and Table 2. Table 1 and Table 2 show the average values and standard deviations of 10 5 / N estimates of each parameter ( a , b ) . There is no downward bias in the parameters described by Engle et al. (2008) or Aielli (2013) in either the estimation method or in the number of stocks. Therefore, it is considered that the difference in each method is the estimation accuracy of the standardized residual sample covariance matrix, which we use for the correlation targeting.
Next, we evaluate the accuracy of the covariance matrix estimation using the loss function of Engle et al. (2017). Let Σ t denote the N-dimensional “true” covariance matrix at time t, and Σ ^ t be an estimator of Σ t . The loss function L t ( Σ t ^ , Σ t ) and the average loss L ^ are defined below.
L t ( Σ t ^ , Σ t ) = T r ( Σ t ^ 1 Σ t Σ t ^ 1 ) / N [ T r ( Σ t ^ 1 / N ) ] 2 1 T r ( Σ t 1 / N )
L ^ = 1 T t = 1 T L t
Table 3 shows the results for L ^ , averaged across 10 5 / N Monte Carlo simulations.
We find that the average loss of the cDCC-NLS method is the lowest of the various methods for any number of assets. Therefore, the cDCC-NLS method shows the best estimation accuracy.

4.2. Performance of the Risk-Based Portfolios

In the previous section, we compared the accuracy of the covariance matrix estimation of each method using a Monte Carlo study. Thus, we were able to confirm that the cDCC-NLS method shows the best estimation accuracy. Next, we examine the effect of the difference in the estimation accuracy on the performance of risk-based portfolios. Here, we use the minimum variance (MV), minimum variance without short constraint (MVS), risk parity (RP), and maximum diversification (MD). We estimate the covariance matrices using the various (c)DCC models for the N { 100 , 500 , 1000 } most liquid stocks in the TOPIX index, as before. Each portfolio is re-balanced monthly, and the parameter estimation period of each method is 60 months. The out-of-sample period is from January 2000 to December 2015. In other words, we estimate the weights on the previous 60 months of data from January 2000, and then compute the value of the out-of-sample portfolio for the following month. Next, we roll the window forward by one month and repeat the process until December 2015.
We use an annualized Sharpe ratio to evaluate the performance of each portfolio. We also consider whether the cDCC-NLS (the most accurate method) delivers a higher out-of-sample Sharpe ratio than that of the DCC and the cDCC at a level that is statistically significant. For a given sample size N, a two-sided p-value for the null hypothesis of equal Sharpe ratios is obtained using the pre-whitened HACPW method, as described in (Ledoit and Wolf 2008). Table 4, Table 5, Table 6 and Table 7 show the annualized return, annualized volatility, and annualized Sharpe ratio for each risk-based portfolio.
All risk-based portfolios show improved performance as the number of stocks N increases. For the RP in Table 6 and the MD in Table 7, we confirm that performance does not depend on the estimation accuracy. There are no statistically significant differences between the DCC and cDCC-NLS or between the cDCC and cDCC-NLS. We further confirm that the weights of the RP and MD are similar across the models until December 2015. We consider that this explains why the performance of the RP and MD are similar.
On the other hand, for the MV in Table 4, the Sharpe ratio of the cDCC-NLS is an improvement over those of the DCC and the cDCC, especially for large numbers of stocks. In addition, the MVS in Table 5, which allows short selling, shows a significant improvement. For the MV and MVS, the cDCC-NLS records the highest Sharpe ratios for any numbers of stocks. According to Scherer (2011), the good performance of the MV portfolio is based on a low-risk anomaly. By being short in high volatility and long in low volatility, low-risk anomalies can be realized. Thus, we think the presence of short sales makes the low-risk anomaly possible. From these results, we conclude the RP and MD do not depend on the estimation accuracy of the covariance matrix, whereas the MV and MVS do depend on the estimation accuracy of the covariance matrix.

5. Conclusions

The contribution of this study is twofold. First, we propose the cDCC-NLS method, a highly accurate estimation method that combines nonlinear shrinkage and the cDCC-GARCH model. This method overcomes the difficulties that arise from the cross-section and time series aspects of estimating covariance matrices. We compared the performance of risk-based portfolios under several estimation methods for covariance matrices: the DCC, cDCC, DCC-LS, DCC-NLS, cDCC-LS, and cDCC-NLS. We performed an empirical analysis using a large number of stocks in Japanese markets and confirm the following results:
  • The cDCC-NLS method shows the best estimation accuracy.
  • The RP and MD do not depend on the estimation accuracy of the covariance matrix.
  • The MV does depend on the estimation accuracy of the covariance matrix.
Our recommendations for investors are as follows. When constructing a minimum variance portfolio, especially one that allows short selling, the covariance matrix should be estimated with good estimation accuracy using the cDCC-NLS method. However, for risk parity and maximum diversification portfolios, the estimation accuracy of the covariance matrix is not so important.
Although not considered in this research, many studies have examined the volatility asymmetry in which volatility tends to show a greater increase after negative returns than it does after positive returns. With regard to the conditional correlation, asymmetry is evident in the correlation between stocks tending to increase after negative returns, as in the case of volatility. In future research, we will investigate the impact on the performance of risk-based portfolios by introducing the asymmetry of volatility and the correlation.
In this study, we limited our scope to the (c)DCC model in order to estimate the error of covariance matrix. However, it is also important to examine the relationship between the performance of a risk-based portfolio and other estimation methods.

Author Contributions

K.N. provided the idea, constructed the hypotheses, analyzed the data and wrote the paper. M.I. performed statistical computations and helped analyzing the data. K.Y. supervised and edited the paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Aielli, Gian Piero. 2013. Dynamic conditional correlation: on properties and estimation. Journal of Business & Economic Statistics 31: 282–99. [Google Scholar]
  2. Ardia, David, Guido Bolliger, Kris Boudt, and Jean-Philippe Gagnon-Fleury. 2017. The impact of covariance misspecification in risk-based portfolios. Annals of Operations Research 254: 1–16. [Google Scholar] [CrossRef]
  3. Bollerslev, Tim. 1986. Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics 31: 307–27. [Google Scholar] [CrossRef]
  4. Chaves, Denis, Jason Hsu, Feifei Li, and Omid Shakernia. 2011. Risk parity portfolio vs. other asset allocation heuristic portfolios. The Journal of Investing 20: 108–18. [Google Scholar] [CrossRef]
  5. Choueifaty, Yves, and Yves Coignard. 2008. Toward maximum diversification. The Journal of Portfolio Management 35: 40–51. [Google Scholar] [CrossRef]
  6. Clarke, Roger G., Harindra De Silva, and Steven Thorley. 2006. Minimum-variance portfolios in the us equity market. The Journal of Portfolio Management 33: 10–24. [Google Scholar] [CrossRef]
  7. De Carvalho, Raul Leote, Xiao Lu, and Pierre Moulin. 2012. Demystifying equity risk-based strategies: A simple alpha plus beta description. The Journal of Portfolio Management 38: 56–70. [Google Scholar] [CrossRef]
  8. Engle, Robert. 2009. Anticipating Correlations: A New Paradigm for Risk Management. Princeton: Princeton University Press. [Google Scholar]
  9. Engle, Robert F. 1982. Autoregressive conditional heteroscedasticity with estimates of the variance of united kingdom inflation. Econometrica: Journal of the Econometric Society 50: 987–1007. [Google Scholar] [CrossRef]
  10. Engle, Robert F. 2002. Dynamic conditional correlation: A simple class of multivariate generalized autoregressive conditional heteroskedasticity models. Journal of Business & Economic Statistics 20: 339–50. [Google Scholar]
  11. Engle, Robert F, Olivier Ledoit, and Michael Wolf. 2017. Large dynamic covariance matrices. Journal of Business & Economic Statistics, 1–13. [Google Scholar] [CrossRef]
  12. Engle, Robert F., Neil Shephard, and Kevin Sheppard. 2008. Fitting Vast Dimensional Time-Varying Covariance Models. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=1293629 (accessed on 11 May 2018).
  13. Hafner, Christian M., and Olga Reznikova. 2012. On the estimation of dynamic conditional correlation models. Computational Statistics & Data Analysis 56: 3533–45. [Google Scholar]
  14. Haugen, Robert A., and Nardin L. Baker. 1991. The efficient market inefficiency of capitalization—Weighted stock portfolios. The Journal of Portfolio Management 17: 35–40. [Google Scholar] [CrossRef]
  15. Ledoit, Olivier, and Sandrine Péché. 2011. Eigenvectors of some large sample covariance matrix ensembles. Probability Theory and Related Fields 151: 233–64. [Google Scholar] [CrossRef]
  16. Ledoit, Olivier, and Michael Wolf. 2004. A well-conditioned estimator for large-dimensional covariance matrices. Journal of Multivariate Analysis 88: 365–411. [Google Scholar] [CrossRef]
  17. Ledoit, Oliver, and Michael Wolf. 2008. Robust performance hypothesis testing with the sharpe ratio. Journal of Empirical Finance 15: 850–59. [Google Scholar] [CrossRef] [Green Version]
  18. Ledoit, Olivier, and Michael Wolf. 2012. Nonlinear shrinkage estimation of large-dimensional covariance matrices. The Annals of Statistics 40: 1024–60. [Google Scholar] [CrossRef]
  19. Lindsay, Bruce G. 1988. Composite likelihood methods. Contemporary Mathematics 80: 221–39. [Google Scholar]
  20. Maillard, Sébastien, Thierry Roncalli, and Jérôme Teïletche. 2010. The properties of equally weighted risk contribution portfolios. The Journal of Portfolio Management 36: 60–70. [Google Scholar] [CrossRef]
  21. Markowitz, Harry. 1952. Portfolio selection. The Journal of Finance 7: 77–91. [Google Scholar]
  22. Merton, Robert C. 1980. On estimating the expected return on the market: An exploratory investigation. Journal of Financial Economics 8: 323–61. [Google Scholar] [CrossRef]
  23. Michaud, Richard O. 1989. The markowitz optimization enigma: Is ‘optimized’ optimal? Financial Analysts Journal 45: 31–42. [Google Scholar] [CrossRef]
  24. Qian, Edward. 2005. Risk Parity Portfolios: Efficient Portfolios through True Diversification. Boston: Panagora Asset Management. [Google Scholar]
  25. Scherer, Bernd. 2011. A note on the returns from minimum variance investing. Journal of Empirical Finance 18: 652–60. [Google Scholar] [CrossRef]
  26. Silverstein, Jack W. 1995. Strong convergence of the empirical distribution of eigenvalues of large dimensional random matrices. Journal of Multivariate Analysis 55: 331–39. [Google Scholar] [CrossRef]
1.
The operator ∘ represents the Hadamard product, which is a binary operation that takes two matrices of the same dimensions. For two matrices, A and B of the same dimension, the Hadamard product A B , is a matrix of the same dimension as the operands, with elements given by A B i , j = ( A ) i , j ( B ) i , j .
2.
Note that the parameter vector ϕ = ( a , b ) is the same among all H jt .
Table 1. The average and standard deviation of the (c)DCC parameter a = 0.05 using six correlation methods targeting dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 .
Table 1. The average and standard deviation of the (c)DCC parameter a = 0.05 using six correlation methods targeting dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 .
NDCCDCC-LSDCC-NLScDCCcDCC-LScDCC-NLS
1000.05060.05060.0510.04980.05010.0503
(0.0032)(0.0033)(0.0034)(0.0028)(0.0029)(0.003)
5000.04970.04970.04980.05020.05010.0504
(0.0011)(0.0011)(0.0015)(0.0025)(0.0027)(0.003)
10000.04960.04980.05010.04760.050.0501
(0.0016)(0.0013)(0.0015)(0.0017)(0.0018)(0.0018)
Table 2. The average and standard deviation of the (c)DCC parameter b = 0.93 using six correlation methods targeting dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 .
Table 2. The average and standard deviation of the (c)DCC parameter b = 0.93 using six correlation methods targeting dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 .
NDCCDCC-LSDCC-NLScDCCcDCC-LScDCC-NLS
1000.92790.92850.92870.92870.92840.9289
(0.0048)(0.0052)(0.0054)(0.0031)(0.0034)(0.0038)
5000.92820.92850.92920.9270.92750.9283
(0.0024)(0.0024)(0.0022)(0.004)(0.0046)(0.005)
10000.92780.92780.92770.92690.92710.9275
(0.0021)(0.0016)(0.0018)(0.0022)(0.0025)(0.0024)
Table 3. Average loss for (c)DCC estimators using six correlation methods targeting dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The unit is 10 3 .
Table 3. Average loss for (c)DCC estimators using six correlation methods targeting dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The unit is 10 3 .
NDCCDCC-LSDCC-NLScDCCcDCC-LScDCC-NLS
1008.602578.272057.470247.959067.65586.8779
50023.0454118.3661214.5573622.1663818.0739913.37496
100070.4423224.4669118.829363.2480124.493515.29444
Table 4. Annualized return, volatility, and Sharpe ratio for an MV portfolio using six methods for dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The out-of-sample period is from January 2000 to December 2015. In the rows labeled Sharpe Ratio, the largest number appears in bold. In the columns labeled DCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: *** denotes significance at the 0.01 level; ** denotes significance at the 0.05 level; * denotes significance at the 0.1 level. In the columns labeled cDCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: +++ denotes significance at the 0.01 level; ++ denotes significance at the 0.05 level; + denotes significance at the 0.1 level.
Table 4. Annualized return, volatility, and Sharpe ratio for an MV portfolio using six methods for dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The out-of-sample period is from January 2000 to December 2015. In the rows labeled Sharpe Ratio, the largest number appears in bold. In the columns labeled DCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: *** denotes significance at the 0.01 level; ** denotes significance at the 0.05 level; * denotes significance at the 0.1 level. In the columns labeled cDCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: +++ denotes significance at the 0.01 level; ++ denotes significance at the 0.05 level; + denotes significance at the 0.1 level.
DCCDCC-LSDCC-NLScDCCcDCC-LScDCC-NLS
N = 100
Return [%]8.088.238.297.828.098.62
Volatility [%]16.4016.4116.3616.2016.1616.18
Sharpe Ratio0.490.500.510.480.50 0.53 + + +
N = 500
Return [%]9.809.659.719.819.7410.09
Volatility [%]17.1517.1617.0016.9216.9316.90
Sharpe Ratio0.570.560.570.580.58 0.60 + +
N = 1000
Return [%]9.259.509.979.669.6410.05
Volatility [%]16.6116.6316.3716.2316.1716.15
Sharpe Ratio0.560.570.610.590.60 0.62 +
Table 5. Annualized return, volatility, and Sharpe ratio for an MV portfolio without short-sales constraints (MVS) using six methods for dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The out-of-sample period is from January 2000 to December 2015. In the rows labeled Sharpe Ratio, the largest number appears in bold. In the columns labeled DCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: *** denotes significance at the 0.01 level; ** denotes significance at the 0.05 level; * denotes significance at the 0.1 level. In the columns labeled cDCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: +++ denotes significance at the 0.01 level; ++ denotes significance at the 0.05 level; + denotes significance at the 0.1 level.
Table 5. Annualized return, volatility, and Sharpe ratio for an MV portfolio without short-sales constraints (MVS) using six methods for dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The out-of-sample period is from January 2000 to December 2015. In the rows labeled Sharpe Ratio, the largest number appears in bold. In the columns labeled DCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: *** denotes significance at the 0.01 level; ** denotes significance at the 0.05 level; * denotes significance at the 0.1 level. In the columns labeled cDCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: +++ denotes significance at the 0.01 level; ++ denotes significance at the 0.05 level; + denotes significance at the 0.1 level.
DCCDCC-LSDCC-NLScDCCcDCC-LScDCC-NLS
N = 100
Return [%]10.5410.639.4310.1410.1811.57
Volatility [%]22.6522.5420.9015.1915.1215.08
Sharpe Ratio0.470.470.450.670.67 0.77 + + +
N = 500
Return [%]9.489.277.419.379.3311.09
Volatility [%]12.6412.4713.9011.6311.4011.12
Sharpe Ratio0.750.740.530.810.82 1.00 + +
N = 1000
Return [%]6.016.526.986.267.789.30
Volatility [%]9.968.959.1610.568.998.21
Sharpe Ratio0.600.730.760.590.87 1.13 + + +
Table 6. Annualized return, volatility, and Sharpe ratio for an RP portfolio using six methods for dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The out-of-sample period is from January 2000 to December 2015. In the rows labeled Sharpe Ratio, the largest number appears in bold. In the columns labeled DCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: *** denotes significance at the 0.01 level; ** denotes significance at the 0.05 level; * denotes significance at the 0.1 level. In the columns labeled cDCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: +++ denotes significance at the 0.01 level; ++ denotes significance at the 0.05 level; + denotes significance at the 0.1 level.
Table 6. Annualized return, volatility, and Sharpe ratio for an RP portfolio using six methods for dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The out-of-sample period is from January 2000 to December 2015. In the rows labeled Sharpe Ratio, the largest number appears in bold. In the columns labeled DCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: *** denotes significance at the 0.01 level; ** denotes significance at the 0.05 level; * denotes significance at the 0.1 level. In the columns labeled cDCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: +++ denotes significance at the 0.01 level; ++ denotes significance at the 0.05 level; + denotes significance at the 0.1 level.
DCCDCC-LSDCC-NLScDCCcDCC-LScDCC-NLS
N = 100
Return [%]6.956.977.167.017.017.06
Volatility [%]19.3919.3919.4519.6019.6019.56
Sharpe Ratio0.360.360.370.360.360.36
N = 500
Return [%]9.469.359.769.829.819.68
Volatility [%]18.2618.3018.4118.5918.5718.48
Sharpe Ratio0.520.510.530.530.530.52
N = 1000
Return [%]10.4110.5410.5510.5410.5210.52
Volatility [%]16.8816.9317.0717.2317.1917.07
Sharpe Ratio0.620.620.620.610.610.62
Table 7. Annualized return, volatility, and Sharpe ratio for an MD portfolio using six methods for dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The out-of-sample period is from January 2000 to December 2015. In the rows labeled Sharpe Ratio, the largest number appears in bold. In the columns labeled DCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: *** denotes significance at the 0.01 level; ** denotes significance at the 0.05 level; * denotes significance at the 0.1 level. In the columns labeled cDCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: +++ denotes significance at the 0.01 level; ++ denotes significance at the 0.05 level; + denotes significance at the 0.1 level.
Table 7. Annualized return, volatility, and Sharpe ratio for an MD portfolio using six methods for dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The out-of-sample period is from January 2000 to December 2015. In the rows labeled Sharpe Ratio, the largest number appears in bold. In the columns labeled DCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: *** denotes significance at the 0.01 level; ** denotes significance at the 0.05 level; * denotes significance at the 0.1 level. In the columns labeled cDCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: +++ denotes significance at the 0.01 level; ++ denotes significance at the 0.05 level; + denotes significance at the 0.1 level.
DCCDCC-LSDCC-NLScDCCcDCC-LScDCC-NLS
N = 100
Return [%]7.067.077.067.007.047.02
Volatility [%]20.4820.4920.4920.4920.4920.49
Sharpe Ratio0.350.350.340.340.340.34
N = 500
Return [%]10.1310.1310.1210.1210.1210.12
Volatility [%]19.6419.6419.6419.6419.6419.64
Sharpe Ratio0.520.520.520.520.520.52
N = 1000
Return [%]10.6310.6310.6310.6610.6610.63
Volatility [%]18.6618.6618.6618.6618.6618.66
Sharpe Ratio0.570.570.570.570.570.57

Share and Cite

MDPI and ACS Style

Nakagawa, K.; Imamura, M.; Yoshida, K. Risk-Based Portfolios with Large Dynamic Covariance Matrices. Int. J. Financial Stud. 2018, 6, 52. https://doi.org/10.3390/ijfs6020052

AMA Style

Nakagawa K, Imamura M, Yoshida K. Risk-Based Portfolios with Large Dynamic Covariance Matrices. International Journal of Financial Studies. 2018; 6(2):52. https://doi.org/10.3390/ijfs6020052

Chicago/Turabian Style

Nakagawa, Kei, Mitsuyoshi Imamura, and Kenichi Yoshida. 2018. "Risk-Based Portfolios with Large Dynamic Covariance Matrices" International Journal of Financial Studies 6, no. 2: 52. https://doi.org/10.3390/ijfs6020052

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop