Next Article in Journal
On the Volatility Spillover between Agricultural Commodities and Latin American Stock Markets
Next Article in Special Issue
National Culture and Corporate Rating Migrations
Previous Article in Journal
On the Optimal Risk Sharing in Reinsurance with Random Recovery Rate
Previous Article in Special Issue
Numerical Ruin Probability in the Dual Risk Model with Risk-Free Investments
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Maximal Tail Dependence-Based Clustering Procedure for Financial Time Series and Its Applications in Portfolio Selection

1
School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai 200433, China
2
School of Economics, Central University of Finance and Economics, Beijing 100081, China
3
Department of Insurance and Actuary, Wuhan University, Wuhan 430072, Hubei, China
4
Department of Statistical and Actuarial Sciences, University of Western Ontario, London, ON N6A 5B7, Canada
*
Author to whom correspondence should be addressed.
Risks 2018, 6(4), 115; https://doi.org/10.3390/risks6040115
Submission received: 12 August 2018 / Revised: 26 September 2018 / Accepted: 27 September 2018 / Published: 9 October 2018
(This article belongs to the Special Issue Risk, Ruin and Survival: Decision Making in Insurance and Finance)

Abstract

:
In this paper, we propose a clustering procedure of financial time series according to the coefficient of weak lower-tail maximal dependence (WLTMD). Due to the potential asymmetry of the matrix of WLTMD coefficients, the clustering procedure is based on a generalized weighted cuts method instead of the dissimilarity-based methods. The performance of the new clustering procedure is evaluated by simulation studies. Finally, we illustrate that the optimal mean-variance portfolio constructed based on the resulting clusters manages to reduce the risk of simultaneous large losses effectively.

1. Introduction

It is of great interest in identifying the risk of simultaneous large losses in portfolio selection and financial risk management. If this type of risk is identified properly, candidate assets could be grouped such that asset prices or returns from different groups are unlikely to drop simultaneously. An investment strategy is called portfolio diversification if the portfolio is constructed by selecting one asset from each group. As we could see, the performance of portfolio diversification depends on how the assets are grouped.
In general, the observed prices or returns of assets are essentially time series. To group assets properly, time series clustering techniques are usually involved. Early works on time series clustering include interdependence measure between asset returns such as the (Pearson or Spearman type) cross-correlation coefficients (cf. Kaufman and Rousseeuw 1990). In particular, Mantegna (1999) and Bonanno et al. (2004) quantified the degree of interdependence between the synchronous time evolution of a pair of stock prices and used it in financial time series clustering. Moreover, as another extension of dependence-based method, Baragona (2001) and Brockwell and Davis (2002) developed a new measure of interdependence from the residuals obtained by fitting the data to acceptable time series. In addition, inspired by the dynamic conditional correlation (DCC) model developed by Engle and Sheppard (2001) and Engle (2002), Billio et al. (2006) and Billio and Caporin (2009) proposed the Flexible Dynamic Conditional Correlation (FDCC) multivariate GARCH model and provided an estimate of the dynamics of correlation coefficients within groups of financial assets for asset allocations.
However, cross-correlation coefficients do not always guarantee a sufficient degree of portfolio diversification because these coefficients cannot always capture the possible extreme co-movements of asset returns in lower tails. Extreme co-movement of asset returns in lower tails plays an important role in studying contagion of financial crisis. Bae et al. (2003) provided evidence of the existence of extreme co-movements in terms of coexceedances when studying the phenomenon of contagion. More formally, the contagion of financial crisis could be defined directly as a significant increase of extreme co-movements if financial crisis occurs in one of the markets (cf. Pericoli and Sbracia 2003, Definition 4). Hence, if a portfolio diversification arrangement fails to diversify the risk of extreme co-movement in the lower tail, it might be vulnerable to the contagion of financial crisis occurring in other markets.
Even when there is no contagion, extreme co-movements of asset returns may also exist due to the similarity of fundamentals from the traditional point of view, investor trading patterns(Barberis et al. 2005), or incomplete information (Veldkamp 2006). To diversify the risk of extreme co-movement in lower tail, De Luca and Zuccolotto (2011) proposed a dissimilarity measure based on tail dependence coefficients (TDC) instead of cross-correlation coefficients to obtain homogeneous groups of time series with an association between extreme low values. Inspired by this work, Durante et al. (2014) developed a time series clustering procedure with a conditional version of Spearman’s correlation coefficient for extremely low values introduced by Durante et al. (2014), and a non-parametric estimator of tail dependence provided in Durante et al. (2015). De Luca and Zuccolotto (2015) further proposed a dynamic clustering procedure so that the coefficient employed to measure the lower tail dependence can be time-varying on the basis of historical market volatility.
In this paper, we propose to cluster time series via the coefficients of maximum tail dependence introduced by Furman et al. (2015). The coefficients of maximal tail dependence are direct extensions of TDCs including the tail dependence coefficient λ , the weak tail dependence coefficient χ and the tail order κ . The major difference is that the coefficients of maximal tail dependence are calculated with convergence paths that are possibly other than the diagonal path. As a result, the matrix of coefficients of maximal tail dependence may not be symmetric and thus cannot be used as a similarity (or dissimilarity) matrix in clustering procedures. Instead, such a matrix may be seen as a type of affinity matrix representing directed relations between assets.
The paper is organized as follows. Section 2 is a brief introduction of the coefficients of maximal tail dependence. The proposed clustering procedure of time series is formally described in Section 3. The performance of the proposed procedure is evaluated in Section 4. An application to real exchange rates of G20 countries is presented and analyzed in Section 5. Section 6 concludes.

2. The Coefficients of Maximal Tail Dependence

Several coefficients have been introduced by researchers to measure the extreme co-movements in recent years. For example, one of the most important measures is the lower (upper) tail dependence, which is formally defined by
λ L : = lim u 0 + P X F X 1 ( u ) | Y F Y 1 ( u ) λ U : = lim u 1 P X > F X 1 ( u ) | Y > F Y 1 ( u ) ,
where random variables X and Y represent the potentially dependent risks. Since by Sklar’s Theorem (cf. Nelsen 2006) there is a uniquely determined copula function C : [ 0 , 1 ] 2 [ 0 , 1 ] such that
F X , Y ( x , y ) = C F X ( x ) , F Y ( y ) ,
the lower (upper) tail dependence could be defined as the limiting point of a functional of the copula function, namely,
λ L : = λ L ( C ) = lim u 0 + C ( u , u ) u , λ U : = λ U ( C ) = lim u 0 + C ^ ( u , u ) u ,
where C ^ is the survival copula with respect to C. Apart from the lower (upper) tail dependence, similar measures include the weak lower (upper) tail dependence
χ L : = χ L ( C ) = lim u 0 + 2 log u log C ( u , u ) 1 , χ U : = χ U ( C ) = lim u 0 + 2 log u log C ^ ( u , u ) 1 ,
(cf. Coles et al. 1999) and lower (upper) tail order κ L ( κ U ) defined via:
C ( u , u ) = L ( u ) u κ L , C ^ ( 1 u , 1 u ) = U ( u ) u κ U , u ( 0 , 1 )
where L and U are slowly varying functions of u at 0 (cf. Hua and Joe 2011).
The aforementioned measures of tail dependence are all limiting values of functionals of C as the arguments ( u , v ) shrink to ( 0 , 0 ) along the diagonal line of the square [ 0 , 1 ] 2 . However, as pointed out by Furman et al. (2015), these measures may sometimes underestimate the extent of extreme co-movements for dependent risks, and, for this reason, the authors proposed improved versions of these coefficients of tail dependence, named as the coefficients of the maximal tail dependence, which are more sensitive to extreme co-movements. Accordingly, a clustering procedure based on such coefficients may provide better clustering results than those based on other coefficients of tail dependence such as those proposed by De Luca and Zuccolotto (2011) and Durante et al. (2014), and the portfolios constructed based on such clustering results may also outperform.
The coefficients of the maximum tail dependence are limiting values of the usual versions of the corresponding functionals of C (or C ^ ) converging to the lower-left (or upper-right) vertex along paths of maximal tail dependence. To formally define the paths of maximal tail dependence, consider a function φ : [ 0 , 1 ] [ 0 , 1 ] satisfying the following admissible conditions (see Furman et al. 2015, Definition 2.1):
  • φ ( u ) [ u 2 , 1 ] for every u [ 0 , 1 ] ; and
  • both φ ( u ) and u 2 / φ ( u ) converge to 0 when u 0 .
The collection of such kind of functions is called the admissible set, denoted as A . Then, a path φ ( u ) , u 2 / φ ( u ) 0 u 1 shrinking to the lower-left (or upper-right) vertex is called admissible whenever φ belongs to A . Specifically, the diagonal path used to define the usual coefficients of tail dependence ( u , u ) 0 u 1 is admissible as the function φ 0 ( u ) = u , u [ 0 , 1 ] is admissible. According to (Furman et al. 2015, Definition 2.2), the paths of maximal tail dependence is denoted as φ * ( u ) , u 2 / φ * ( u ) 0 u 1 where
φ * ( u ) = arg max φ A C φ ( u ) , u 2 / φ ( u ) .
To simplify the notations, we denote Π * ( u ) = C φ * ( u ) , u 2 / φ * ( u ) if the optimal value exists. Then, the lower tail maximal dependence (LTMD) is defined via:
λ L * : = λ L * ( C ) = lim u 0 + Π * ( u ) u ,
the weak lower tail maximal dependence (WLTMD) is defined via:
χ L * : = χ L * ( C ) = lim u 0 + 2 log u log Π * ( u ) 1 ,
and the order of lower tail maximal dependence is defined via:
Π * ( u ) = L * ( u ) u κ L * , u ( 0 , 1 )
where L * is a slowly varying function of u at 0. In particular, for κ L * , we have the following result similar to that of the usual tail order κ L (cf. Hua and Joe 2011):
Proposition 1.
For any bivariate copula function C, if φ * ( u ) A exists, then the corresponding index of maximal tail dependence κ L * [ 1 , 2 ] .
The proof is given in Appendix A. As a result, χ L * [ 0 , 1 ] may also not be a desirable affinity measure for common clustering procedures such as k means clustering or hierarchical clustering. However, χ L * is a desirable weight for graphs. The larger χ L * of two assets is, the stronger the extreme co-movement between these assets is, and hence a bigger weight is posed by χ L * .
We refer to Furman et al. (2015) for examples of expressions for λ L * and κ L * with closed forms in the case of parametric families of distributions. Notably, Furman et al. (2016) proved that, in the Gaussian case, the classical and maximal tail dependence coefficients coincide. In the present paper, however, to speed up practical calculations, we resort to non-parametric approach in the following sections. In particular, we find a clustering procedure based on χ L * to be very attractive.

3. Clustering Procedure

Typically, clustering based on dissimilarity matrices such as given by De Luca and Zuccolotto (2011, sct. 3) could be achieved through the hierarchical clustering method directly. However, to cluster using the affinity matrix constructed with χ L * , we could not use the hierarchical clustering method because the affinity matrix may not be symmetric. Hence, we have to consider graph based clustering procedures.
Suppose we have n assets in total available for a portfolio construction. Then, the affinity matrix Δ = Δ i j n × n is given by
Δ i j = χ L , i j * = 2 κ L , i j * 1
(cf. Furman et al. 2015, sct. 5). Theoretically, Δ should be a symmetric matrix which could be seen as an affinity matrix consisting of edge weights for an undirected graph and thus could be used for clustering with the hierarchical clustering method. This is because Π * ( u ) is unique as long as it exists and thus κ L , i j * = κ L , j i * . However, for those asymmetric copulas such as the unexchangeable Marshall–Olkin copulas
C a , b MO ( u , v ) = min { u 1 a v , u v 1 b } , a , b ( 0 , 1 ) , a b
the estimated parameters may differ due to that for two series of observations there are actually two different copulas to be chosen for the parameter estimation procedure: C a , b MO ( u , v ) and C b , a MO ( u , v ) = C a , b MO ( v , u ) . In other words, for two arbitrary series of observations, it is impossible to determine which group should be regarded as “u” and the other group as “v” in practice, even though we are sure that these observations are generated from an unexchangeable Marshall–Olkin copula.
For simplicity, when constructing the affinity matrix Δ , we keep only one of the two possible copulas whenever we have to estimate the parameters of the copula from pairwise observations. The advantage of this idea is that, taking C a , b MO ( u , v ) for instance, the lower triangle part of Δ is calculated by assuming the pairwise observations are generated from C a , b MO ( u , v ) and then estimating the parameters a and b while the upper triangle part of Δ is actually calculated by assuming the same pairwise observations are generated from C b , a MO ( u , v ) = C a , b MO ( v , u ) and then estimating the parameters a and b. In other words, the resulting affinity matrix Δ contains information of estimated parameters from both C a , b MO ( u , v ) and C b , a MO ( u , v ) = C a , b MO ( v , u ) in fact.
Since such an affinity matrix may not necessarily be symmetric, the hierarchical clustering method fails to work. In this case, the resulting affinity matrix could be considered as a matrix of weighted edges in a directed graph instead of an undirected graph. The clustering task could be achieved with the Weighted Normalized Cuts (WNACut for short) introduced by Meilă and Pentney (2007), which is initially developed to analyze directed graphs related to the link data.
To understand the WNACut method, let C = { C 1 , , C K } 1 K n be an arbitrary partition of the set of all assets. Then, the cut of C k to C k represents the total influence of Cluster C k on Cluster C k , namely,
C u t ( C k , C k ) = i C k j C k Δ i j .
Hence, the total weighted cut of all clusters is defined via:
W C u t ( C ) = k = 1 K k k C u t ( C k , C k ) i C k V i ,
where V i = j = 1 n Δ i j for i = 1 , , n . Then, our target cluster C * is such that
W C u t ( C * ) = min C W C u t ( C ) .
This optimization problem could be solved through a spectral clustering algorithm named “Best WCut”. Similar to the clustering procedure proposed by De Luca and Zuccolotto (2011), this algorithm is also a two-stage clustering procedure, in which the non-metric multidimensional scaling (MDS) in the first stage is substituted by a process that transforms the asymmetric affinity matrix consisting of the WLTMDs into k orthonormal columns, where k is the predetermined number of clusters. The details are given in Algorithm A1. In Meilă and Pentney (2007), the WNACut method is shown to consistently outperform all other clustering methods chosen to be tested with synthetic data in their experiments. For this reason, we adopt this method to finish the clustering task in our proposed procedure.

4. A Simulation Study of Synthetic Data

As mentioned, our proposed clustering procedure is a two-stage procedure, as shown in Algorithm A1. However, there is no information related to the choice of clustering method for the second stage in this particular situation revealed. Hence, a simulation study is designed in this section to compare the performance of the second stage clustering method in WNACut with that of a list of commonly used clustering methods, including the classical k-means method and hierarchical clustering procedure with Ward’s minimum variance method (considering both Ward’s criterion and Ward’s criterion squared, the results are denoted as Ward.D2 and Ward.D, respectively), single linkage method, complete linkage method, average linkage method, McQuitty’s linkage method, median linkage method, and centroid linkage method. The performances are measured in two metrics: the misclassification error (ME) described in Verma and Meilă (2003) and variation in information (VI) introduced in Meilă (2003). Both criteria tend to be smaller if the resulting clusters are more similar to the K known clusters.
To begin with, we assume there are K different known clusters whose numbers of elements are N 1 , , N K , respectively. The dependence structure employed to generate the realizations in each cluster is a particular case of the asymmetric multivariate copula given by Liebscher (2008, eq. 3); namely, for cluster k, we have
C k ( u k , 1 , , u k , n 1 ) = u k , 1 γ 1 / ( γ 0 + γ 1 ) u k , 1 1 / ( γ 0 + γ 1 ) + i = 2 n 1 u k , i 1 / γ 0 1 γ 0 ,
where γ 0 > 0 and γ 1 0 . Then, the joint CDF of the distribution used to generate the realizations is given by
C ( u 1 , 1 , , u 1 , n 1 , , u K , 1 , , u K , n K ; γ 0 , γ 1 ) = k = 1 K C k ( u k , 1 , , u k , n 1 ) .
Notice that, with the above model settings, the pairwise marginal copulas could be written as either
C ( u , v ) = u γ 1 / ( γ 0 + γ 1 ) u 1 / ( γ 0 + γ 1 ) + v 1 / γ 0 1 γ 0 , γ 1 [ 0 , ] ,
or
C ( u , v ) = v γ 1 / ( γ 0 + γ 1 ) v 1 / ( γ 0 + γ 1 ) + u 1 / γ 0 1 γ 0 , γ 1 [ 0 , ] ,
which includes both the pairwise independent copula C ( u , v ) = u v as the particular case γ 1 = and the pairwise standard Clayton copula C ( u , v ) = u 1 / γ 0 + v 1 / γ 0 1 γ 0 as the particular case γ 1 = 0 . Then, as discussed in Section 3, we only keep (3) for further analysis. When the realizations are generated, the affinity matrix consisting of pairwise WLTMDs could be calculated by first estimating γ 0 and γ 1 with the maximal likelihood method based on (3) and then calculating the WLTMDs using
κ ^ L , i j * = 1 + γ ^ 1 , i j / ( γ ^ 1 , i j + 2 γ ^ 0 , i j )
(cf. Furman et al. 2015, eq. 6.2). Therefore, with all pairwise κ ^ L , i j * obtained, the affinity matrix Δ could be calculated through Equation (1). Then, given the predetermined number of clusters k, the first stage of the BestWCut will transform the affinity matrix Δ into k orthonormal columns.
The total number of iterations for our simulation study is 500, which is the same as De Luca and Zuccolotto (2011). Other values of the parameters for the simulation study are given below:
  • The number of known clusters K = 4 .
  • The number of objects in each cluster n 1 , , n K are independently sampled from { 3 , 4 , , 8 } at random for each iteration.
  • γ 0 = 4 .
  • γ 1 = 1 , 8 , 64 , which result in theoretical κ L * = 10 / 9 , 1 . 5 , 17 / 9 , respectively, if the two series of realizations are neither independent nor dependent with a classical Clayton copula.
  • The distances used in hierarchical clustering methods are all Euclidean distance in R K .
The results of MEs and VIs are given in Table 1 and Table 2, respectively, in which we can see that, with Ward’s criterion squared, the Ward’s minimum variance method is consistently competitive or even outperforms other methods. Moreover, the distributional properties of the simulated ME’s are shown in Figure 1, Figure 2 and Figure 3 for γ 1 = 1 , γ 1 = 8 , and γ 1 = 64 , respectively, and the distributional properties of the simulated VIs are shown in Figure 4, Figure 5 and Figure 6 for γ 1 = 1 , γ 1 = 8 , and γ 1 = 64 , respectively. All these figures indicate that the Ward’s minimum variance method might be the best choice for the second stage of the proposed clustering procedure.
The results shown in Table 1 and Table 2 also indicate the sensitivity of the ME/VI to κ L * . Taking the Ward’s minimum variance method as an example, when κ L * increases or decreases by ( 3.5 / 9 / 1.5 = ) 25.93 % (i.e., γ 1 increases from 8 to 64 or decreases from 8 to 1, correspondingly) the average ME increases by 179.25 % or decreases by 19.57 % , correspondingly, while the average VI increases by 79.8 % or decreases by 38.39 % , correspondingly. Thus, our proposed clustering procedure seems to perform better as the true WLTMD κ L * gets closer to 1, and as the true WLTMD κ L * moves towards 2, the performance of our proposed clustering procedure worsens rapidly.

5. Application to Real Data

In this section, we apply the proposed procedure on the log-returns of foreign exchange rates with respect to US dollars from Group of 20 (known as G20). Since US dollar is used as the underlying currency, there are 19 time series of exchange rates in total. The exchange rates of France, Germany and Italy are excluded from our analysis due to their perfect linear correlation with euros, which results in 16 time series of exchange rates available for our clustering analysis1. The data were collected weekly from 5 September 2012 to 17 August 2016, covering 207 × 16 active observations during these four years, which can be downloaded from “PACIFIC Exchange Rate Service”, 2016, by Werner Antweiler, University of British Columbia. The descriptive statistics and histograms of observations for the 16 currencies are given in Table 3 and Figure 7.

5.1. Preliminary Analysis

For each series of the exchange rates, the log-returns are obtained by taking logarithm of the fraction between two consecutive weekly exchange rates, part of which are shown in Figure 8. To eliminate the potential autocorrelation and heteroskedasticity of the log-returns, a univariate generalized error distribution (GED) ARMA-GARCH ( 1 , 1 ) model is applied to each time series of log-returns and the fitted standardized residuals are extracted for the purpose of clustering.
A preliminary test of correlation indicates that the standardized residuals of Argentine Pesos seem to be uncorrelated to those of all the other currencies. The details of the correlation tests are shown in Table 4, from which we could see that all p-values are greater than 0.05 . Thus, we suspect that all WLTMDs between the residuals of ARS and any other currency are approximately 0. To this end, let T X , i = min { R A R S , i , R X , i } where X represents some currency and R X , i is the random variable having the same distribution as the standard residuals obtained by fitting the log-returns of currency X with GED ARMA-GARCH ( 1 , 1 ) model. Then, by Coles et al. (1999, eq. 4.2), we have
P T X , i > t X ( t ) t η X
as t where η X = 2 / ( 1 + χ L , A R S ( X ) ) and χ L , A R S ( X ) is the weak lower-tail dependence between the standard residuals of the log-returns of ARS and currency X. Hence, our concern about whether χ L , A R S ( X ) = 0 is equivalent to test
H 0 : η X = 2 v . s . H 1 : η X 2 .
The test statistic, which is asymptotically standard normal, could be obtained using the improved OLS method given in Gabaix and Ibragimov (2011). When 50 pairs of observations are used, the resulting p-values are all greater than 0.1 , as shown in Table 4. Thus, H 0 could not be rejected, which leads to η ^ G I 2 and hence χ ^ L , A R S ( X ) 0 for all currencies other than ARS. Notice that by definition WLTMD should always be smaller than or equal to the corresponding weak lower-tail dependence, therefore the WLTMDs between the residuals of ARS and those of any other currency are approximately 0, which allows us to evaluate ARS as an isolated point that will not be taken into account in further analysis.
In fact, ARS is not the only currency that should be excluded from further clustering procedure. When testing the correlation between standardized residuals of the fitted GED ARMA-GARCH ( 1 , 1 ) model for log-returns, we discover that the residuals of SAR have negative correlation with majority of the residuals of the other currencies (see the columns entitled “Sign of estimated coefficients” in Table 5). Furthermore, we can see in Table 5 that only AUD, CNY, GBP and JPY are currencies whose standard residuals may not be negatively correlated with SAR. Unfortunately, none of the corresponding p-values show significant correlation between the standardized residuals of these currencies and SAR. Therefore, it is reasonable to suspect the WLTMDs between the residuals of SAR and any other currency are approximately 0. As expected, the last column in Table 5 verifies this result using the OLS test statistics with only 30 pairs of observations. Therefore, we do not include SAR in our next step of clustering procedure either.
Since there are no more currencies that could be evaluated as isolated points based on correlation tests or OLS test statistics, we retain all of the rest 14 exchange rates in our further clustering analysis.

5.2. Clustering the Exchange Rates Using WLTMD

For each pair of the residuals fitted from the log-return series, a bivariate Clayton copula function (see Liebscher 2008, for instance) is adopted with the form of Equation (3) on the estimated empirical distribution functions (pseudo observations), where γ 0 0 and γ 1 0 are unknown parameters to be specified. Notice that the Clayton-type copula defined in Equation (3) relaxes the restriction γ 1 = 0 in the classical Clayton copula and hence is not necessarily symmetric. With this Clayton-type copula, the tail order of maximal dependence is proved to have the following analytic form:
κ L * = 1 + γ 1 / ( γ 1 + 2 γ 0 )
(cf. Furman et al. 2015, eq. 6.2). The parameters can be estimated by the maximum likelihood method, and hence the estimate of the lower tail of the maximal dependence is given by κ ^ L * = 1 + γ 1 ^ / ( γ 1 ^ + 2 γ 0 ^ ) . To measure the goodness-of-fit of the assumed copula, we employ a test based on the Rosenblatt transformation (see Breymann et al. 2003) between two dependent random variables Y 1 and Y 2 , given by
S ( Y 1 , Y 2 ) = Φ 1 F 1 ( Y 1 ) 2 + Φ 1 C ( F 2 ( Y 2 ) | F 1 ( Y 1 ) ) 2
where Φ denotes the standard normal c.d.f. and C ( F 2 ( Y 2 ) | F 1 ( Y 1 ) ) the conditional copula. Then, the random variable S ( Y 1 , Y 2 ) should have a χ 2 2 distribution if C is the true copula, which can be tested by a Kolmogorov–Smirnov test between S as a function of pseudo observations and χ 2 2 for each pair of the standardized residuals for different currencies. The test statistics are given in Table A1 in Appendix C. Notice that n = 206 in our situation, thus the 5 % critical value for the Kolmogorov–Smirnov test is equal to 1.358 / 206 = 0.095 , indicating none of the fitted copulas should be rejected as the true copulas.
Using the fitted parameters γ ^ 0 and γ ^ 1 , we obtain the fitted tail order of maximal dependence κ ^ L as well as the fitted WLTMD χ ^ L . Then, we apply the WNACut algorithm for the first stage of the clustering procedure and Ward1 method for the second stage on the affinity matrix constructed by the ( 14 × 14 = 196) χ ^ L s for k = 2 , , 13 , respectively. The total weights for WNACut is k, and the percentages of weights cut of the total weights W C u t ( C * ) / k against k are shown in Figure 9, which indicates that a relatively stable clustering result is obtained when the number of clusters k 4 .
To get a closer look at the clustering result, first we consider the case k = 3 whose cluster dendrogram is given by Figure 10.
The resulting clusters are listed in Table 6 which provides a regional segmentation of the world: Northeast Asia, the East/Southeast Asia and the rest of the world. Obviously, such a cluster result has the lowest percentage of weighted cuts and seems to perform well for currencies of countries in the Northeast Asia, East Asia and Southeast Asia. However, it fails to distinguish the currencies of the rest of the world. For the purpose of comparison, in Table 6, we also provide the clustering results given by the method of De Luca and Zuccolotto (2011) for k = 3 .
Next, we compare the clustering result for k = 4 to k = 3 . When k = 4 , the clustering dendrogram is given by Figure 11.
The resulting clusters are listed in Table 7 which preserves the regional characteristics for currencies of countries in East/Northeast Asia while the partition of the rest of the world in Table A2 is provided by the IMF. For the purpose of comparison, in Table 7, we also provide a clustering result given by the method of De Luca and Zuccolotto (2011) for k = 4 . Here are some remarks on the clustering outcomes obtained by our clustering procedure.
  • All economies in the first group have strong economic connections with the US (under certain type of free trade agreements). Besides, the nominal per capita GDPs of these economies are all above $ 30,000 (see Table A2 in Appendix D). Thus, it is reasonable to include currencies of these economies in one group.
  • The second group of economies have neither strong economic connection to the US nor high nominal per capita GDPs (less than $ 10,000 ). Moreover, all of them are identified as emerging economies by IMF (see Table A2 in Appendix D).
  • China is the only member of the third group. Although there is no free trade agreements between US and China, it is well-acknowledged that China has a strong economic connection with the US. In addition, China has a very high nominal GDP (the third highest among all 20 economies) but very low nominal GDP per capita (less than $ 8000 ). As a result, it is reasonable to not include China in any other group.
  • It is reasonable to include the rest three economies in one group from the geographical perspective.
In conclusion, the clustering result for k = 4 seems to more reasonable compared to that of k = 3 . Besides, the clustering result for k = 4 could not be obtained through further partition based on the clustering result for k = 3 because it requires a combination of {Brazil, Mexico, South Africa, Turkey} and {India, Indonesia} as a new cluster. Therefore, the clustering result for k = 3 seems not to be sufficiently stable from our perspective.

5.3. An Example of Portfolio Management with the Clustering

As stressed in Section 1, one important application of time series clustering is risk management. Since the WLTMD represents the extreme co-movement downwards, the resulting clusters obtained through our proposed procedure represent groups of assets whose returns move in the same direction when both returns are extremely low. Hence, we should avoid including two assets from the same cluster in our portfolio. For instance, if we would like to construct a portfolio with four of the 14 aforementioned currencies, we may consider the following steps:
  • Perform our proposed clustering procedure for k = 4 . The clustering result is given in Table 7.
  • From the clustering result in Table 7, one has ( 4 × 6 × 1 × 3 = ) 72 choices if he/she tries to avoid selecting two assets from the same cluster in his/her portfolio. All 72 resulting portfolios are listed in Table A3 in Appendix E.
  • Construct portfolio using Markowitz’s procedure (minimum variance criteria), namely, computing the sample covariance matrix S of the log-returns for any combination of currencies in Table A3 and solving
    min w w S w
    subject to w 1 + w 2 + w 3 + w 4 = 1 and w i 0 for i = 1 , 2 , 3 , 4 , where w = w 1 w 2 w 3 w 4 . The resulting weights corresponding to the 72 choices of currencies are also given in Table A3 in Appendix E.
The paths of the returns for all 72 combinations are shown in Figure 12. Notice that there are ( 14 4 = ) 1001 different combinations of four currencies without considering the clustering result (the resulting weights as well as the mean and accumulative returns corresponding to these 1001 portfolios are listed in Table S1 in the Supplementary Materials). We also construct portfolios using Equation (5) for all of the 1001 combinations of currencies and plot the corresponding paths of returns in Figure 12. In contrast, in Figure 12, we also provide the paths of aggregated returns of all portfolios constructed by the method of De Luca and Zuccolotto (2011).
In Figure 12, the portfolios constructed through our proposed procedure are shown to have uniformly exceptional performance among all possible ways to construct portfolios with four currencies for a long period (September 2012–August 2016). Compared with the portfolios constructed using the method of De Luca and Zuccolotto (2011), all of our portfolios outperform most of their portfolios. Besides, the paths of return provided by our portfolios do not vary significantly across the 72 combinations of currencies and seem to have better performance in the long run (the average and accumulative returns of these 72 portfolios on 17 August 2016 (the last date of the observation period) are also provided in Table A3 in Appendix E, from which we can see that the accumulative returns are from 7 % to 4 % , corresponding to the range shown in the left of Figure 12).
We also show the path of risk adjusted returns provided by our portfolios in Figure 13. The results also show that our portfolios have outstanding performance most of the time.
For a retrospective study, the performance of the proposed portfolios as well as all possible portfolios from 17 August 2016 to 10 September 2018 are plotted in Figure 14. Although these portfolios fail to be the best choices again, the resulting returns are still shown to have very strong invulnerability against fluctuations and risks.
In contrast, in Figure 14, we also provide the paths of aggregate returns of all portfolios constructed by the method of De Luca and Zuccolotto (2011). Obviously, the performances are quite inconsistent and the returns seem to vary when volatility increases.
For the retrospective study, we also show the path of risk adjusted returns provided by our portfolios in Figure 15. The performance of our portfolios seems to be very good during early 2018 but relatively poor during late 2016 to 2017.

6. Conclusions

In this paper, we have proposed a clustering procedure based on a new affinity measure indicating the extreme co-movements for financial time series. Unlike the common distance-based affinity matrix, our proposed affinity matrix is not necessarily symmetric and hence cannot be used as the input in hierarchical clustering algorithms directly. As a result, clustering procedures based on weighted cuts are employed and examined, and the WNACut method is finally selected as a recommended clustering procedure, based on the performances of compared procedures applicable. The resulting clusters seem to be reasonable when applied to the real exchange rate time series, and the portfolios constructed based on the resulting clusters are shown to outperform those from other clustering procedures, particularly in the long run.
Future research should focus on seeking consistent nonparametric estimators for the coefficients of maximal tail dependence such as WLTMD, since parametric copulas having explicit forms of these coefficients are not always available for the standard residuals extracted from the data. As pointed out in (Furman et al. 2015, sct. 7), the achievement in the area of M-estimators may be a possible way to obtain such estimators as well as the relevant statistical inference. Moreover, the idea of the OLS estimator (see Gabaix and Ibragimov 2011, for instance) may also be a potential way to address this challenging problem. Furthermore, as indexes of maximal tail dependence such as WLTMD are measures of extreme co-movements, they might not reflect the potential causality between quantities and hence could not be directly applied to some important issues in the financial markets such as contagion effects. Nevertheless, our proposed clustering procedure provides an insight to analyze financial time series using asymmetric information matrices such as matrices of contagion measures.

Supplementary Materials

The following are available at https://www.mdpi.com/2227-9091/6/4/115/s1, Table S1: 1001 portfolios constructed for all combinations of 4 currencies out of 14 currencies. The portfolio weights obtained by Markowitz minimal variance criteria is given in the bracket. Also, mean returns and accumulative returns are listed.

Author Contributions

Conceptualization, X.L.; Methodology, J.W.; Formal Analysis, C.Y.; Data Curation, W.J.

Funding

This research was funded by National Natural Science Foundation of China (71603190) and Ministry of Education in China (10YJC790280).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Proposition 2.1

Proof. 
Since there exists a function φ * ( u ) A such that
Π * ( u ) = C ( φ * ( u ) , u 2 / φ * ( u ) ) ,
combined with the upper Fréchet–Hoeffding bound of copulas we must have for any x ( 0 , 2 ) ,
C ( u x , u 2 x ) Π * ( u ) min { φ * ( u ) , u 2 / φ * ( u ) }
for all u ( 0 , 1 ] because u x , u 2 x A . From Equation (A1), it is easy to obtain
Π * ( u ) u .
Given the Lipschitz condition of copula, we have
C ( u x , u 2 x ) C ( 1 , u 2 ) u x 1 + u 2 u x 1 0
as x 0 by noticing C ( 1 , u 2 ) = u 2 , then
lim x 0 C ( u x , u 2 x ) = u 2
for all u ( 0 , 1 ] . Combining Equations (A1) and (A3) yields
Π * ( u ) u 2
for all u ( 0 , 1 ] , and since Π * ( u ) = * ( u ) u κ L * , it further follows by Equations (A2) and (A4) that
1 log * ( u ) log ( u ) κ L * 2 log * ( u ) log ( u )
for all u ( 0 , 1 ] . Provided the Karamata’s Representation Theorem, the slowly varying function * ( u ) has the following representation:
* ( u ) = c * ( u ) exp 1 1 / u ϵ * ( t ) t t ,
where measurable functions c * ( u ) and ϵ * ( t ) satisfy
lim u 0 c * ( u ) = c ( 0 , )
and
lim t ϵ * ( t ) = 0 .
Hence, by Equation (A6), we have for all u ( 0 , 1 ]
log * ( u ) log ( u ) = 1 log ( u ) log c * ( u ) + 1 1 / u ϵ * ( t ) t t .
Notice that, in Equation (A7), it is obvious that
lim u 0 log c * ( u ) log ( u ) = 0 ,
the major discussion should focus on the limit of the rest part of Equation (A7) as u 0 . Since ϵ * ( t ) 0 as t , by defining T ϵ : = sup { t ( 1 , ) : | ϵ * ( t ) | ϵ } for arbitrary ϵ > 0 we know that T ϵ < . Hence, for small enough u ( 0 , 1 ] , we could rewrite
1 log ( u ) 1 1 / u ϵ * ( t ) t t = 1 log ( u ) 1 T ϵ ϵ * ( t ) t t + T ϵ 1 / u ϵ * ( t ) t t .
Then, for Equation (A9), it is obvious, that for u < U ϵ : = exp 1 ϵ 1 T ϵ ϵ * ( t ) t t , we have
1 log ( u ) 1 T ϵ ϵ * ( t ) t t < ϵ ,
while for u < T ϵ 0 . 5 ,
1 log ( u ) T ϵ 1 / u ϵ * ( t ) t t < 1 log ( u ) log ( u ) log T ϵ ϵ = 1 + log T ϵ log ( u ) ϵ ϵ .
Therefore, for all u < min { T ϵ 0 . 5 , U ϵ } we have
1 log ( u ) 1 1 / u ϵ * ( t ) t t < 2 ϵ ,
which is equivalent to
lim u 0 1 log ( u ) 1 1 / u ϵ * ( t ) t t = 0 .
Finally, combined with (A8) we deduce
lim u 0 log * ( u ) log ( u ) = 0 .
Therefore, we may conclude that κ L * [ 1 , 2 ] by letting u 0 for both inequalities of Equation (A9). ☐

Appendix B. Details of the Clustering Algorithm in Simulation

Algorithm A1 Best WCut (Meilă and Pentney 2007, Algorithm 4.1).
Require: Affinity matrix, Δ ; diagonal matrix of volume weights, V ; diagonal matrix of row weights, R ; number of clusters, k;
Ensure: The clustering, C * ;
1:
Update Δ through the linear transform R Δ ;
2:
Define D i = j = 1 d δ i j and D = diag { D i } i = 1 , , n ;
3:
Define H ( B ) = 1 2 B + B , where B = V 1 / 2 D Δ V 1 / 2 ;
4:
Compute Y the n × k matrix with orthonormal columns containing the eigenvectors of H ( B ) corresponding to the k smallest eigenvalues;
5:
Cluster the rows of X = V 1 / 2 Y as points in R k .

Appendix C. Goodness-of-Fit Test

Table A1. Estimated Kolmogorov statistics for the goodness-of-fit test.
Table A1. Estimated Kolmogorov statistics for the goodness-of-fit test.
AUDBRLCADCNYGBPINRIDRJPYKRWMXNRUBZARTRYEUR
AUD0.06340.07780.06360.07330.06140.04600.07470.06850.09220.05630.08150.06440.0487
BRL0.07940.06850.05390.04250.04930.04880.05010.05270.06680.07660.06650.08410.0454
CAD0.08990.06550.05020.05450.05120.05610.06730.06010.05680.06680.05630.05550.0629
CNY0.04950.04450.04690.07940.03750.06330.04890.06660.04960.04940.05160.05160.0470
GBP0.06030.05790.05170.07050.04810.05840.05800.05650.04930.04940.04860.07540.0414
INR0.06330.04330.04700.03830.04840.03670.04760.04720.02900.06040.04620.05770.0637
IDR0.04800.05310.05020.06120.05840.03710.06750.06860.03950.05770.07500.06650.0505
JPY0.06870.05220.06770.05340.06110.05160.07050.06450.06370.04970.05130.05870.0542
KRW0.07630.06190.05810.07970.08210.05690.06960.06430.06280.05780.07970.08960.0448
MXN0.08610.06480.07480.05110.05240.03530.03530.06370.05430.07180.05850.05890.0446
RUB0.05580.07480.05200.05500.04810.04930.05590.04850.03820.05120.05610.05650.0592
ZAR0.07150.06200.05000.05660.05180.05040.05650.05100.07260.07860.06470.06870.0465
TRY0.06750.06140.05420.05550.08900.05120.05990.06040.07190.06980.05760.07530.0483
EUR0.04150.05740.06630.04830.04180.06070.03990.05480.03060.03390.05400.04190.0506

Appendix D. Economic Summary of G20 Nations

Table A2. Part of the economic summary of G20 nations 2015 by (IMF (2014)).
Table A2. Part of the economic summary of G20 nations 2015 by (IMF (2014)).
MemberNom. GDP mil. USDPPP GDP mil. USDNom. GDP per capitaUSDPPP GDP per capltaUSDHDIPopulationAreaEconomic Classification (IMF)
Argentina585,623964,30013,58922,5540.83642,961,0002,780,400Emerging
Australia1,223,8871,489,00050,96247,3890.93523,599,0007,692,024Advanced
Brazil1,772,5893,166,000867016,1550.755202,768,0008,515,767Emerging
Canada1,552,3861,632,00043,33244,9670.91335,467,0009,984,670Advanced
China10,982,82919,510,000799014,1070.7271,367,520,0009,572,900Emerging
France2,421,5602,647,00037,67541,1810.88863,951,000640,679Advanced
Germany3,357,6143,842,00040,99746,8930.91680,940,000357,114Advanced
India2,090,7067,965,000161761620.6091,259,695,0003,287,263Emerging
Indonesia858,9532,839,000336211,1260.684251,490,0001,904,569Emerging
Italy1,815,7572,174,00029,86735,7080.87360,665,551301,336Advanced
Japan4,123,2584,658,00032,48638,0540.891127,061,000377,930Advanced
South Korea1,376,8681,849,00027,19536,5110.89850,437,000100,210Advanced
Mexico1,144,3342,220,000900917,5340.756119,581,7891,964,375Emerging
Russia1,324,7343,471,000905525,4110.798146,300,00017,098,242Emerging
Saudi Arabia653,2191,683,00020,81353,6240.83730,624,0002,149,690Emerging
South Africa312,957723,518569513,1650.66653,699,0001,221,037Emerging
Turkey733,6421,589,000943720,4380.76177,324,000783,562Emerging
United Kingdom2,849,3452,660,00043,77141,1590.90764,511,000242,495Advanced
United States17,947,00017,947,00055,80555,8050.915318,523,0009,526,468Advanced
European Union16,220,37019,180,00031,91837,8520.876505,570,7004,422,773N/A

Appendix E. Seventy-Two Portfolios Constructed Based on Our Clustering Result in Table 7

Table A3. Sevent-two portfolios constructed based on our clustering result in Table 7. Notice that for each cluster only one currency is selected. The portfolio weights obtained by Markowitz minimal variance criteria is given in the bracket. In addition, mean returns and accumulative returns are listed.
Table A3. Sevent-two portfolios constructed based on our clustering result in Table 7. Notice that for each cluster only one currency is selected. The portfolio weights obtained by Markowitz minimal variance criteria is given in the bracket. In addition, mean returns and accumulative returns are listed.
PortfolioCurrency from Cluster 1Currency from Cluster 2Currency from Cluster 3Currency from Cluster 4Mean ReturnAccumulative Return
1AUD (0.0000)BRL (0.0220)CNY (0.9177)JPY (0.0603)−0.0321%−6.6162%
2CAD (0.0122)BRL (0.0195)CNY (0.9089)JPY (0.0595)−0.0329%−6.7676%
3GBP (0.0221)BRL (0.0196)CNY (0.8984)JPY (0.0599)−0.0333%−6.8559%
4EUR (0.0392)BRL (0.0164)CNY (0.8944)JPY (0.0499)−0.0313%−6.4531%
5AUD (0.0000)INR (0.0539)CNY (0.8841)JPY (0.0620)−0.0316%−6.5051%
6CAD (0.0188)INR (0.0500)CNY (0.8709)JPY (0.0603)−0.0332%−6.8367%
7GBP (0.0236)INR (0.0508)CNY (0.8643)JPY (0.0613)−0.0331%−6.8180%
8EUR (0.0419)INR (0.0483)CNY (0.8596)JPY (0.0502)−0.0315%−6.4815%
9AUD (0.0000)IDR (0.0322)CNY (0.9072)JPY (0.0606)−0.0320%−6.5924%
10CAD (0.0227)IDR (0.0271)CNY (0.8913)JPY (0.0589)−0.0336%−6.9266%
11GBP (0.0278)IDR (0.0317)CNY (0.8807)JPY (0.0597)−0.0340%−6.9977%
12EUR (0.0441)IDR (0.0263)CNY (0.8810)JPY (0.0486)−0.0315%−6.4964%
13AUD (0.0023)MXN (0.0109)CNY (0.9221)JPY (0.0647)−0.0299%−6.1674%
14CAD (0.0300)MXN (0.0000)CNY (0.9093)JPY (0.0608)−0.0310%−6.3935%
15GBP (0.0269)MXN (0.0066)CNY (0.9029)JPY (0.0636)−0.0310%−6.3900%
16EUR (0.0468)MXN (0.0031)CNY (0.8995)JPY (0.0507)−0.0288%−5.9339%
17AUD (0.0085)ZAR (0.0000)CNY (0.9290)JPY (0.0626)−0.0290%−5.9725%
18CAD (0.0300)ZAR (0.0000)CNY (0.9093)JPY (0.0608)−0.0310%−6.3935%
19GBP (0.0282)ZAR (0.0000)CNY (0.9091)JPY (0.0627)−0.0301%−6.2074%
20EUR (0.0476)ZAR (0.0000)CNY (0.9024)JPY (0.0501)−0.0284%−5.8410%
21AUD (0.0085)TRY (0.0000)CNY (0.9290)JPY (0.0626)−0.0290%−5.9725%
22CAD (0.0300)TRY (0.0000)CNY (0.9093)JPY (0.0608)−0.0310%−6.3935%
23GBP (0.0282)TRY (0.0000)CNY (0.9091)JPY (0.0627)−0.0301%−6.2074%
24EUR (0.0476)TRY (0.0000)CNY (0.9024)JPY (0.0501)−-0.0284%−5.8410%
25AUD (0.0000)BRL (0.0264)CNY (0.9736)KRW (0.0000)−0.0271%−5.5828%
26CAD (0.0217)BRL (0.0218)CNY (0.9566)KRW (0.0000)−0.0285%−5.8756%
27GBP (0.0234)BRL (0.0238)CNY (0.9528)KRW (0.0000)−0.0284%−5.8427%
28EUR (0.0606)BRL (0.0166)CNY (0.9228)KRW (0.0000)−0.0272%−5.6042%
29AUD (0.0045)INR (0.0558)CNY (0.9397)KRW (0.0000)−0.0262%−5.3969%
30CAD (0.0304)INR (0.0511)CNY (0.9185)KRW (0.0000)−0.0286%−5.8954%
31GBP (0.0260)INR (0.0541)CNY (0.9200)KRW (0.0000)−0.0275%−5.6677%
32EUR (0.0635)INR (0.0480)CNY (0.8885)KRW (0.0000)−0.0273%−5.6167%
33AUD (0.0058)IDR (0.0391)CNY (0.9516)KRW (0.0036)−0.0276%−5.6778%
34CAD (0.0321)IDR (0.0346)CNY (0.9332)KRW (0.0000)−0.0299%−6.1540%
35GBP (0.0303)IDR (0.0416)CNY (0.9282)KRW (0.0000)−0.0296%−6.1004%
36EUR (0.0643)IDR (0.0307)CNY (0.9050)KRW (0.0000)−0.0281%−5.7784%
37AUD (0.0143)MXN (0.0000)CNY (0.9762)KRW (0.0094)−0.0233%−4.7940%
38CAD (0.0418)MXN (0.0000)CNY (0.9581)KRW (0.0001)−0.0264%−5.4331%
39GBP (0.0289)MXN (0.0000)CNY (0.9621)KRW (0.0089)−0.0238%−4.8943%
40EUR (0.0690)MXN (0.0000)CNY (0.9310)KRW (0.0000)−0.0242%−4.9823%
41AUD (0.0143)ZAR (0.0000)CNY (0.9762)KRW (0.0094)−0.0233%−4.7940%
42CAD (0.0418)ZAR (0.0000)CNY (0.9581)KRW (0.0001)−0.0264%−5.4331%
43GBP (0.0289)ZAR (0.0000)CNY (0.9621)KRW (0.0089)−0.0238%−4.8943%
44EUR (0.0690)ZAR (0.0000)CNY (0.9310)KRW (0.0000)−0.0242%−4.9823%
45AUD (0.0143)TRY (0.0000)CNY (0.9762)KRW (0.0094)−0.0233%−4.7940%
46CAD (0.0418)TRY (0.0000)CNY (0.9581)KRW (0.0001)−0.0264%−5.4331%
47GBP (0.0289)TRY (0.0000)CNY (0.9621)KRW (0.0089)−0.0238%−4.8943%
48EUR (0.0690)TRY (0.0000)CNY (0.9310)KRW (0.0000)−0.0242%−4.9823%
49AUD (0.0000)BRL (0.0264)CNY (0.9736)RUB (0.0000)−0.0271%−5.5828%
50CAD (0.0217)BRL (0.0218)CNY (0.9566)RUB (0.0000)−0.0285%−5.8756%
51GBP (0.0234)BRL (0.0238)CNY (0.9528)RUB (0.0000)−0.0284%−5.8427%
52EUR (0.0606)BRL (0.0166)CNY (0.9228)RUB (0.0000)−0.0272%−5.6042%
53AUD (0.0045)INR (0.0558)CNY (0.9397)RUB (0.0000)−0.0262%−5.3969%
54CAD (0.0304)INR (0.0511)CNY (0.9185)RUB (0.0000)−0.0286%−5.8954%
55GBP (0.0260)INR (0.0541)CNY (0.9200)RUB (0.0000)−0.0275%−5.6677%
56EUR (0.0635)INR (0.0480)CNY (0.8885)RUB (0.0000)−0.0273%−5.6167%
57AUD (0.0071)IDR (0.0394)CNY (0.9534)RUB (0.0000)−0.0279%−5.7458%
58CAD (0.0321)IDR (0.0346)CNY (0.9332)RUB (0.0000)−0.0299%−6.1540%
59GBP (0.0303)IDR (0.0416)CNY (0.9282)RUB (0.0000)−0.0296%−6.1004%
60EUR (0.0643)IDR (0.0307)CNY (0.9050)RUB (0.0000)−0.0281%−5.7784%
61AUD (0.0182)MXN (0.0000)CNY (0.9818)RUB (0.0000)−0.0240%−4.9542%
62CAD (0.0418)MXN (0.0000)CNY (0.9582)RUB (0.0000)−0.0264%−5.4349%
63GBP (0.0310)MXN (0.0000)CNY (0.9690)RUB (0.0000)−0.0242%−4.9888%
64EUR (0.0690)MXN (0.0000)CNY (0.9310)RUB (0.0000)−0.0242%−4.9823%
65AUD (0.0182)ZAR (0.0000)CNY (0.9818)RUB (0.0000)−0.0240%−4.9542%
66CAD (0.0418)ZAR (0.0000)CNY (0.9582)RUB (0.0000)−0.0264%−5.4349%
67GBP (0.0310)ZAR (0.0000)CNY (0.9690)RUB (0.0000)−0.0242%−4.9888%
68EUR (0.0690)ZAR (0.0000)CNY (0.9310)RUB (0.0000)−0.0242%−4.9823%
69AUD (0.0182)TRY (0.0000)CNY (0.9818)RUB (0.0000)−0.0240%−4.9542%
70CAD (0.0418)TRY (0.0000)CNY (0.9582)RUB (0.0000)−0.0264%−5.4349%
71GBP (0.0310)TRY (0.0000)CNY (0.9690)RUB (0.0000)−0.0242%−4.9888%
72EUR (0.0690)TRY (0.0000)CNY (0.9310)RUB (0.0000)−0.0242%−4.9823%

References

  1. Bae, Kee-Hong, G. Andrew Karolyi, and Reneé M. Stulz. 2003. A New Approach to Measuring Financial Contagion. Review of Financial Studies 16: 717–63. [Google Scholar] [CrossRef]
  2. Baragona, Roberto. 2001. A simulation study on clustering time series with metaheuristic methods. Quaderni di Statistica 3: 1–26. [Google Scholar]
  3. Barberis, Nicholas, Andrei Shleifer, and Jeffrey Wurgler. 2005. Comovement. Journal of Financial Economics 75: 283–317. [Google Scholar] [CrossRef] [Green Version]
  4. Billio, Monica, and Massimiliano Caporin. 2009. A generalized Dynamic Conditional Correlation model for portfolio risk evaluation. Mathematics and Computers in Simulation 79: 2566–78. [Google Scholar] [CrossRef]
  5. Billio, Monica, Massimiliano Caporin, and Michele Gobbo. 2006. Flexible Dynamic Conditional Correlation multivariate GARCH models for asset allocation. Applied Financial Economics Letters 2: 123–30. [Google Scholar] [CrossRef]
  6. Bonanno, Giovanni, Guido Caldarelli, Fabrizio Lillo, Salvatore Miccichè, Nicolas Vandewalle, and Rosario N. Mantegna. 2004. Networks of equities in financial markets. The European Physical Journal B - Condensed Matter 38: 363–71. [Google Scholar] [CrossRef]
  7. Breymann, Wolfgang, Alexandra Dias, and Paul Embrechts. 2003. Dependence structures for multivariate high-frequency data in finance. Quantitative Finance 3: 1–14. [Google Scholar] [CrossRef] [Green Version]
  8. Brockwell, Peter J., and Richard A. Davis. 2002. Introduction to Time Series and Forecasting. Springer Texts in Statistics. New York: Springer. [Google Scholar] [CrossRef]
  9. Coles, Stuart, Janet Heffernan, and Jonathan Tawn. 1999. Dependence measures for extreme value analyses. Extremes 2: 339–65. [Google Scholar] [CrossRef]
  10. De Luca, Giovanni, and Paola Zuccolotto. 2011. A tail dependence-based dissimilarity measure for financial time series clustering. Advances in Data Analysis and Classification 5: 323–40. [Google Scholar] [CrossRef]
  11. De Luca, Giovanni, and Paola Zuccolotto. 2015. Dynamic tail dependence clustering of financial time series. Statistical Papers 58: 641–57. [Google Scholar] [CrossRef]
  12. Durante, Fabrizio, Enrico Foscolo, Piotr Jaworski, and Hao Wang. 2014. A spatial contagion measure for financial time series. Expert Systems with Applications 41: 4023–34. [Google Scholar] [CrossRef]
  13. Durante, Fabrizio, Roberta Pappadà, and Nicola Torelli. 2014. Clustering of financial time series in risky scenarios. Advances in Data Analysis and Classification 8: 359–76. [Google Scholar] [CrossRef]
  14. Durante, Fabrizio, Roberta Pappadà, and Nicola Torelli. 2015. Clustering of time series via non-parametric tail dependence estimation. Statistical Papers 56: 701–21. [Google Scholar] [CrossRef]
  15. Engle, Robert. 2002. Dynamic Conditional Correlation: A Simple Class of Multivariate Generalized Autoregressive Conditional Heteroskedasticity Models. Journal of Business & Economic Statistics 20: 339–50. [Google Scholar]
  16. Engle, Robert, and Kevin Sheppard. 2001. Theoretical and Empirical properties of Dynamic Conditional Correlation Multivariate GARCH. Cambridge: National Bureau of Economic Research. [Google Scholar] [CrossRef]
  17. International Monetary Fund. 2014. World Economic Outlook Database. Washington: International Monetary Fund. [Google Scholar]
  18. Furman, Edward, Alexey Kuznetsov, Jianxi Su, and Ričardas Zitikis. 2016. Tail dependence of the Gaussian copula revisited. Insurance: Mathematics and Economics 69: 97–103. [Google Scholar] [Green Version]
  19. Furman, Edward, Jianxi Su, and Ričardas Zitikis. 2015. Paths and indices of maximal tail dependence. ASTIN Bulletin 45: 661–78. [Google Scholar] [CrossRef]
  20. Gabaix, Xavier, and Rustam Ibragimov. 2011. Rank −1/2: A Simple Way to Improve the OLS Estimation of Tail Exponents. Journal of Business & Economic Statistics 29: 24–39. [Google Scholar]
  21. Hua, Lei, and Harry Joe. 2011. Tail order and intermediate tail dependence of multivariate copulas. Journal of Multivariate Analysis 102: 1454–71. [Google Scholar] [CrossRef]
  22. Kaufman, Leonard, and Peter J. Rousseeuw. 1990. Finding Groups in Data: An Introduction to Cluster Analysis. Wiley Series in Probability and Statistics; Hoboken: John Wiley & Sons, Inc. [Google Scholar] [CrossRef]
  23. Liebscher, Eckhard. 2008. Construction of asymmetric multivariate copulas. Journal of Multivariate Analysis 99: 2234–50. [Google Scholar] [CrossRef]
  24. Mantegna, Rosario N. 1999. Hierarchical structure in financial markets. The European Physical Journal B 11: 193–97. [Google Scholar] [CrossRef]
  25. Meilă, Marina. 2003. Comparing Clusterings by the Variation of Information. In Learning Theory and Kernel Machines. Berlin and Heidelberg: Springer Nature Switzerland AG, pp. 173–87. [Google Scholar] [CrossRef]
  26. Meilă, Marina, and William Pentney. 2007. Clustering by weighted cuts in directed graphs. Paper presented at 2007 SIAM International Conference on Data Mining, Philadelphia, PA, USA, 26–28 April. [Google Scholar]
  27. Nelsen, Roger B. 2006. An Introduction to Copulas. Springer Series in Statistics; New York: Springer. [Google Scholar] [CrossRef]
  28. Pericoli, Marcello, and Massimo Sbracia. 2003. A Primer on Financial Contagion. Journal of Economic Surveys 17: 571–608. [Google Scholar] [CrossRef]
  29. Veldkamp, Laura L. 2006. Information markets and the comovement of asset prices. The Review of Economic Studies 73: 823–45. [Google Scholar] [CrossRef]
  30. Verma, Deepak, and Marina Meilă. A Comparison of Spectral Clustering Algorithms. Technical Report UW-CSE-03-05-01. Seattle: University of Washington.
1
The currencies include: Argentine Pesos, Australian Dollars, Brazilian Reals, British Pounds, Canadian Dollars, Chinese Renminbi, European Euros, Indian Rupees, Indonesian Rupiah, Japanese Yen, Mexican Pesos, Russian Rubles, Saudi Arabian Riyal, South African Rand, South Korean Won, and Turkish New Lira.
Figure 1. Box plots of simulated MEs using various clustering methods in the second stage of WNACut for γ 1 = 1 .
Figure 1. Box plots of simulated MEs using various clustering methods in the second stage of WNACut for γ 1 = 1 .
Risks 06 00115 g001
Figure 2. Box plots of simulated MEs using various clustering methods in the second stage of WNACut for γ 1 = 8 .
Figure 2. Box plots of simulated MEs using various clustering methods in the second stage of WNACut for γ 1 = 8 .
Risks 06 00115 g002
Figure 3. Box plots of simulated MEs using various clustering methods in the second stage of WNACut for γ 1 = 64 .
Figure 3. Box plots of simulated MEs using various clustering methods in the second stage of WNACut for γ 1 = 64 .
Risks 06 00115 g003
Figure 4. Box plots of simulated VIs using various clustering methods in the second stage of WNACut for γ 1 = 1 .
Figure 4. Box plots of simulated VIs using various clustering methods in the second stage of WNACut for γ 1 = 1 .
Risks 06 00115 g004
Figure 5. Box plots of simulated MEs for all hierarchical clustering methods when γ = 8 .
Figure 5. Box plots of simulated MEs for all hierarchical clustering methods when γ = 8 .
Risks 06 00115 g005
Figure 6. Box plots of simulated VIs using various clustering methods in the second stage of WNACut for γ 1 = 64 .
Figure 6. Box plots of simulated VIs using various clustering methods in the second stage of WNACut for γ 1 = 64 .
Risks 06 00115 g006
Figure 7. Histograms for the 16 selected currencies.
Figure 7. Histograms for the 16 selected currencies.
Risks 06 00115 g007
Figure 8. Log return of exchange rates for some of the G20 members.
Figure 8. Log return of exchange rates for some of the G20 members.
Risks 06 00115 g008
Figure 9. Percentage of weights cut against the number of clusters.
Figure 9. Percentage of weights cut against the number of clusters.
Risks 06 00115 g009
Figure 10. The dendrogram of the clustering procedure based on WNACut methods with affinity matrix as WLTMDs for k = 3 .
Figure 10. The dendrogram of the clustering procedure based on WNACut methods with affinity matrix as WLTMDs for k = 3 .
Risks 06 00115 g010
Figure 11. The dendrogram of the clustering procedure based on WNACut methods with affinity matrix as WLTMDs for k = 4 .
Figure 11. The dendrogram of the clustering procedure based on WNACut methods with affinity matrix as WLTMDs for k = 4 .
Risks 06 00115 g011
Figure 12. Returns (black lines) of the selected minimum variance portfolios based on our clustering procedure (left, 72 portfolios in total) and the procedure proposed by (De Luca and Zuccolotto (2011)) (right, 32 portfolios in total). Each portfolio consists of four currencies, selected by choosing one currency from each of the four resulting clusters, compared to returns (gray lines) of minimum variance portfolios constructed by all combinations (1001 in total) of four currencies out of total 14 currencies.
Figure 12. Returns (black lines) of the selected minimum variance portfolios based on our clustering procedure (left, 72 portfolios in total) and the procedure proposed by (De Luca and Zuccolotto (2011)) (right, 32 portfolios in total). Each portfolio consists of four currencies, selected by choosing one currency from each of the four resulting clusters, compared to returns (gray lines) of minimum variance portfolios constructed by all combinations (1001 in total) of four currencies out of total 14 currencies.
Risks 06 00115 g012
Figure 13. Risk-adjusted returns (black lines) of the 72 minimum variance portfolios selected based on our proposed clustering procedure, compared to risk-adjusted returns (gray lines) of minimum variance portfolios constructed by all combinations (1001 in total) of four currencies out of total 14 currencies.
Figure 13. Risk-adjusted returns (black lines) of the 72 minimum variance portfolios selected based on our proposed clustering procedure, compared to risk-adjusted returns (gray lines) of minimum variance portfolios constructed by all combinations (1001 in total) of four currencies out of total 14 currencies.
Risks 06 00115 g013
Figure 14. Returns (black lines) of the selected minimum variance portfolios based on our clustering procedure (left, 72 portfolios in total) and the procedure proposed by De Luca and Zuccolotto (2011) (right, 32 portfolios in total). Each portfolio consists of four currencies, selected by choosing one currency from each of the four clusters, compared to returns (gray lines) of minimum variance portfolios constructed by all combinations (1001 in total) of four currencies out of total 14 currencies.
Figure 14. Returns (black lines) of the selected minimum variance portfolios based on our clustering procedure (left, 72 portfolios in total) and the procedure proposed by De Luca and Zuccolotto (2011) (right, 32 portfolios in total). Each portfolio consists of four currencies, selected by choosing one currency from each of the four clusters, compared to returns (gray lines) of minimum variance portfolios constructed by all combinations (1001 in total) of four currencies out of total 14 currencies.
Risks 06 00115 g014
Figure 15. Risk-adjusted returns (black lines) of the 72 minimum variance portfolios selected based on our proposed clustering procedure, compared to returns (gray lines) of minimum variance portfolios constructed by all combinations (1001 in total) of four currencies out of total 14 currencies.
Figure 15. Risk-adjusted returns (black lines) of the 72 minimum variance portfolios selected based on our proposed clustering procedure, compared to returns (gray lines) of minimum variance portfolios constructed by all combinations (1001 in total) of four currencies out of total 14 currencies.
Risks 06 00115 g015
Table 1. Means and variances of simulated MEs using various clustering methods in the second stage of WNACut for γ 1 = 1 , γ 1 = 8 and γ 1 = 64 , respectively.
Table 1. Means and variances of simulated MEs using various clustering methods in the second stage of WNACut for γ 1 = 1 , γ 1 = 8 and γ 1 = 64 , respectively.
γ 1 = 1 γ 1 = 8 γ 1 = 64
MeanVarMeanVarMeanVar
k means0.12690.01430.17950.01320.38750.0109
ward.D0.10890.00950.13540.00120.37810.0113
ward.D20.12060.01170.13260.00140.39130.0111
single0.33410.02710.14770.00370.53110.0089
complete0.17150.02050.13490.00340.42440.0121
average0.19880.02420.13290.00210.45820.0127
McQuitty0.19950.02170.13450.00230.45360.0124
median0.34300.03240.14220.00330.52010.0088
centroid0.32230.03770.13490.00190.52840.0091
Table 2. Means and variances of simulated VIs using various clustering methods in the second stage of WNACut for γ 1 = 1 , γ 1 = 8 and γ 1 = 64 , respectively.
Table 2. Means and variances of simulated VIs using various clustering methods in the second stage of WNACut for γ 1 = 1 , γ 1 = 8 and γ 1 = 64 , respectively.
γ 1 = 1 γ 1 = 8 γ 1 = 64
MeanVarMeanVarMeanVar
k means0.63230.22361.02340.13471.88490.2088
ward.D0.63170.19591.02540.03461.84370.2202
ward.D20.64860.20431.00850.04621.87380.2095
single1.13350.29061.04460.03831.97810.1007
complete0.79140.28270.99770.06591.93270.1998
average0.82250.27501.00000.05511.89800.1622
McQuitty0.82850.26201.00850.05091.92950.1804
median1.21150.35181.03400.04051.99350.1115
centroid1.15040.39941.01580.04841.97830.1013
Table 3. Descriptive statistics of (log)returns of 16 selected currencies.
Table 3. Descriptive statistics of (log)returns of 16 selected currencies.
Min.1st Qu.MedianMean3rd Qu.Max.Std
ARS−0.1466−0.0048−0.0026−0.0056−0.00100.04150.0112
AUD−0.0294−0.0085−0.0005−0.00140.00600.02950.0175
BRL−0.0584−0.0128−0.0012-0.00220.00860.04740.0086
CAD−0.0239−0.0071−0.0008−0.00130.00380.02280.0028
CNY−0.0211−0.00110.0000-0.00020.00100.00790.0106
EUR−0.0379−0.0067−0.0006−0.00060.00580.02680.0087
GBP−0.0840−0.0063−0.0005−0.00100.00520.03040.0093
IDR−0.0366−0.0059−0.0015−0.00150.00160.05020.0118
INR−0.0408−0.0050−0.0010−0.00090.00370.04090.0089
JPY−0.0482−0.0078−0.0008−0.00120.00510.04090.0110
KRW−0.0273−0.00550.00040.00010.00590.02660.0245
MXN−0.0318−0.0077−0.0012−0.00160.00520.02800.0158
RUB−0.1215−0.0124−0.0022−0.00340.00800.12530.0126
SAR−0.0052−0.00080.0000−0.00000.00080.00450.0097
TRY−0.0484−0.0099−0.0006−0.00230.00570.03460.0112
ZAR−0.0487−0.0130−0.0017−0.00230.00790.04370.0175
Table 4. The p-values of correlation tests between the standardized residuals of ARS and those of the other 15 currencies based on Pearson’s product moment correlation coefficient, Kendall’s τ and Spearman’s ρ , respectively. The last column contains the p-values of the test of H 0 using the OLS test statistic.
Table 4. The p-values of correlation tests between the standardized residuals of ARS and those of the other 15 currencies based on Pearson’s product moment correlation coefficient, Kendall’s τ and Spearman’s ρ , respectively. The last column contains the p-values of the test of H 0 using the OLS test statistic.
CurrencyPearsonKendallSpearman η ^ GI
AUD0.34370.21110.22480.9268
BRL0.05300.62390.65450.5408
CAD0.09220.19320.21570.6553
CNY0.18250.94440.97180.1550
GBP0.39540.84850.82630.8356
INR0.64180.28160.32780.8877
IDR0.35420.38070.38550.2402
JPY0.63900.28890.29570.3472
KRW0.77970.97020.96600.6733
MXN0.87500.89620.97120.4198
RUB0.48550.99440.97390.3999
SAR0.75830.50400.47710.9567
ZAR0.84120.15080.17460.7425
TRY0.20610.06960.07410.6147
EUR0.40670.21030.21730.6303
Table 5. The sign of Estimated Correlation Coefficients and p-values of correlation tests between the standardized residuals of SAR and those of the other 14 currencies based on Pearson’s product moment correlation coefficient, Kendall’s τ and Spearman’s ρ , respectively. The last column is the p-values of the test of H 0 using the OLS test statistic.
Table 5. The sign of Estimated Correlation Coefficients and p-values of correlation tests between the standardized residuals of SAR and those of the other 14 currencies based on Pearson’s product moment correlation coefficient, Kendall’s τ and Spearman’s ρ , respectively. The last column is the p-values of the test of H 0 using the OLS test statistic.
CurrencySign of Estimated Coefficientsp-Values
PearsonKendallSpearmanPearsonKendallSpearman η ^ GI
AUD+0.87340.79810.80150.6997
BRL0.00210.00070.00080.9334
CAD0.21400.06410.05140.5117
CNY++0.87980.75930.76910.9141
GBP+0.25830.87550.83420.8694
INR0.05920.02400.02130.6743
IDR0.85430.75630.76760.9509
JPY+++0.43730.50780.52160.1236
KRW0.98620.94920.93910.9861
MXN0.00430.00440.00410.4444
RUB0.02900.00220.00210.2088
ZAR0.08990.02830.02330.7497
TRY0.11210.05490.04710.4727
EUR0.51230.39630.42090.7975
Table 6. Members for each of the clusters when k = 3 .
Table 6. Members for each of the clusters when k = 3 .
Cluster 1Cluster 2Cluster 3
Our proposed procedureAustraliaMexicoChinaRussia
BrazilSouth AfricaIndiaJapan
CanadaTurkeyIndonesiaSouth Korea
UKEU
De Luca and Zuccolotto (2011)AustraliaMexicoBrazilIndonesia
CanadaSouth AfricaJapanTurkey
ChinaRussia
IndiaSouth Korea
UKEU
Table 7. Members for each of the clusters when k = 4 .
Table 7. Members for each of the clusters when k = 4 .
Cluster 1Cluster 2Cluster 3Cluster 4
Our proposed procedureAustraliaBrazilChinaRussia
CanadaIndia Japan
UKIndonesia South Korea
EUMexico
South Africa
Turkey
De Luca and Zuccolotto (2011)AustraliaChinaUKJapan
CanadaIndia
BrazilIndonesia
EUMexico
Russia
South Korea
South Africa
Turkey

Share and Cite

MDPI and ACS Style

Liu, X.; Wu, J.; Yang, C.; Jiang, W. A Maximal Tail Dependence-Based Clustering Procedure for Financial Time Series and Its Applications in Portfolio Selection. Risks 2018, 6, 115. https://doi.org/10.3390/risks6040115

AMA Style

Liu X, Wu J, Yang C, Jiang W. A Maximal Tail Dependence-Based Clustering Procedure for Financial Time Series and Its Applications in Portfolio Selection. Risks. 2018; 6(4):115. https://doi.org/10.3390/risks6040115

Chicago/Turabian Style

Liu, Xin, Jiang Wu, Chen Yang, and Wenjun Jiang. 2018. "A Maximal Tail Dependence-Based Clustering Procedure for Financial Time Series and Its Applications in Portfolio Selection" Risks 6, no. 4: 115. https://doi.org/10.3390/risks6040115

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop