Bayesian Mixture Copula Estimation and Selection with Applications

Liu, Yujian; Xie, Dejun; Yu, Siyi

doi:10.3390/analytics2020029

Open AccessFeature PaperArticle

Bayesian Mixture Copula Estimation and Selection with Applications

by

Yujian Liu

^1,2

,

Dejun Xie

^1,* and

Siyi Yu

²

¹

School of Mathematics and Physics, Xi’an Jiaotong-Liverpool University, Suzhou 215123, China

²

School of Economics and Management, Shanghai University of Sport, Shanghai 200438, China

^*

Author to whom correspondence should be addressed.

Analytics 2023, 2(2), 530-545; https://doi.org/10.3390/analytics2020029

Submission received: 22 April 2023 / Revised: 25 May 2023 / Accepted: 12 June 2023 / Published: 15 June 2023

Download

Browse Figure

Versions Notes

Abstract

:

Mixture copulas are popular and essential tools for studying complex dependencies among variables. However, selecting the correct mixture models often involves repeated testing and estimations using criteria such as AIC, which could require effort and time. In this paper, we propose a method that would enable us to select and estimate the correct mixture copulas simultaneously. This is accomplished by first overfitting the model and then conducting the Bayesian estimations. We verify the correctness of our approach by numerical simulations. Finally, the real data analysis is performed by studying the dependencies among three major financial markets.

Keywords:

mixture copulas; copula selection; dependence modeling; Bayesian estimations

1. Introduction

Copula functions are important tools for detecting and modeling statistical relations. While the Pearson correlation remains popular among the applied sciences because of its simplicity, its shortcomings become undeniable in the task of constructing nonlinear behavior; see Murphy [1] (Section 2.5.1) for the undesirable results of using the Pearson correlation in the case of non-linearity. On the other hand, copula functions fully describe the connection between random variables by forming their joint distribution from their univariate margins. Mathematically speaking, following from McNeil et al. [2] (p. 221), a d-dimensional copula

C (u_{1}, u_{2}, \dots, u_{d})

:

{[0, 1]}^{d} \to [0, 1]

is a distribution function with uniform margins. That is, there exists

U_{1}, U_{2}, \dots, U_{d}

uniformly distributed such that

C (u_{1}, u_{2}, \dots, u_{d}) = P (U_{1} \leq u_{1}, U_{2} \leq u_{2}, \dots, U_{d} \leq u_{d})

. The important Sklar theorem [3] guarantees the validity and usefulness of the copula functions in modeling the co-movement among random variables. It states that for the random variables

X_{1}, X_{2}, \dots, X_{d}

with the distribution function

F_{d} (x_{1}, x_{2}, \dots, x_{d})

, there exists a copula function, uniquely defined up to Range

(F_{1})

× Range

(F_{2})

×…Range

(F_{d})

such that

F_{d} (x_{1}, x_{2}, \dots, x_{d}) = C (F_{1} (x_{1}), F_{2} (x_{2}), \dots, F_{d} (x_{d}))

, where

C (\cdot)

refers to a d-dimensional copula function. See Embrechts [4] for a careful introduction to the concepts and theorems related to the Sklar theorem with applications in financial risk management.

When applying the copula methods to real life, such as modeling the dependence between global trading markets, it is often insufficient to rely on a single parametric copula family because of the data complexity and heterogeneity. A possible remedy is to use mixture modeling; McLachlan et al. [5] give a recent comprehensive review of this classic statistical topic. In terms of the copula functions, we are therefore motivated to write a copula function as the mixture of others:

C_{mix} (u) = \sum_{j = 1}^{K} w_{j} C_{j} (u; α_{j}) u \in R^{d}, α_{j} \in Θ_{j}, \sum_{j} w_{j} = 1

(1)

where we denote

Θ_{j}

to be the parameter space of the copula j and it is straightforward to verify that (1) is also a copula. In most of the research papers regarding the topics of finite mixture models, it is common to assume that the mixed distributions come from the same parametric family. This is also mentioned in [5,6]. However, in the literature on copula methods, mixture copulas consisting of several parametric copula families are also common. Hu [7] is one of a few pioneering works of the mixture copula application in the field of finance. The author used mixed normal Gumbel/survival Gumbel copula with empirical marginal distributions to model the stock dependence between the FTSE 100 of the UK, Nikkei 225 of Japan, S&P 500 of the USA, and Hang Seng of Hong Kong, S.A.R., and quasi-likelihood was used for parameter estimation, and the chi-square test was applied for the goodness of fit. Arakelian and Karlis [8] use the expectation maximization (E.M.) approach to estimate the mixture copula with two components and use them to detect the changing dependence between financial markets, the combination of the Gaussian, Clayton, and Gumbel copulas is mainly considered there, and the model selection criteria is log-likelihood. Vrac et al. [9] combined the dynamic clustering with the gradient ascent to solve the mixture copula; Frank copula family with nonparametric margins are used in their geographical application, and the best model is selected by minimizing the approximate weight of evidence (A.W.E.), which is defined to be

A . W . E . = - MLE + \sum_{i}^{K} \dim | Θ_{i} | (l o g N + 3 / 2)

. Given the number of mixture components is correct, the asymptotic convergence of their methods is finally obtained. More recently, Liu et al. [10] proposed constructing the semi-parametric conditional mixture copulas to assess the global currency market; their best models are selected by comparing the Bayes information criterion (B.I.C.), and asymptotic consistency is obtained.

The topics discussed in this essay are mixture copula estimation and selection using the Bayesian approach. Some previous works regarding Bayesian copula selections include [11], where the author treats the copula parameters as a nuisance and selects the copula with the highest posterior. That is, for the model

M_{l}

and the data

D

M_{b e s t} = {argmax}_{M_{l}} P (M_{l} ∣ D) = {argmax}_{M_{l}} \int P (D ∣ θ, M_{l}) d F (θ) p (M_{l}) .

Their method is free from estimating the copula parameters. However, on the other hand, this selection approach may experience high variance in the case of small data sets with high dimensional copulas, and the computational round-off error becomes significant when we multiply the probability if the data set is large. Silva and Lopes [12] proposed to select the model using deviance information criteria (D.I.C.), expected Bayes information criteria (EBIC), and expected Akaike information criteria (EAIC). They also pointed out the importance of the joint estimation of copula parameters using the Bayesian approach from the perspective of considering parameter dependence. Their work can be viewed as the Bayesian version of the popular frequentist A.I.C. (B.I.C.) approaches. One potential concern here is the trade-off between the computational load and the selection efficiency compared with the classic approach. Wu et al. [13,14] proposed to use the Dirichlet process and select the correct mixture copula from the infinite model. Their methods unify the parameter estimation and the best model selection, and we consider this feature to be convenient and important.

On the other hand, Wu et al.’s [13,14] methods are more suitable for the mixture models where each component is from the same parametric family. For example,

C_{m i x} = \sum w_{j} C (u, θ_{j})

with

C (\cdot, θ_{j})

belongs to the normal copula for any j. In copula research, it is interesting and meaningful to consider the mixed model with heterogeneous mixture components because the dependence on real data sets is complex and ever-changing. Therefore, we propose to use the Bayesian Monte Carlo sampling approaches so that the copula parameters and the correct heterogeneous components can be determined at once. This is simply performed by writing out the saturated mixture models with all possible components included and estimating it with the Bayesian approaches, which is expected to have more stable behavior than the maximum-likelihood methods [15].

Following the introduction, we proceed to outline some classic parametric copula families, and in Section 3, we present our sampling methods here, including how we overfit the model first and estimate the parameters by using the Bayesian modeling. Connection of our approach with the penalized likelihood method from Wang [16], Cai, and Wang [17] has also been made through the E.M. method. The last two sections are for numerical simulations and real data analysis.

2. Parametric Copula Families

2.1. Elliptical Copulas

The elliptical copulas are one of the most common choices for modeling the dependence structures among variables, especially in high dimensional settings [18]. From the Sklar theorem, copulas are of the form

C (u_{1}, u_{2}, \dots, u_{d}) = F (F_{1}^{- 1} (u_{1}), F_{2}^{- 1} (u_{2}), \dots, F_{d}^{- 1} (u_{d})) .

(2)

Since the elliptical distribution is closed under the marginalization, we can therefore get the corresponding parametric copula implicitly defined by (2). For example, by inverting the marginal of the standard multivariate normal distribution, we obtain the normal copula, which is

\begin{matrix} \begin{matrix} C_{C} (u_{1}, u_{2}, \dots, u_{d}) = \int_{- \infty}^{ϕ^{- 1} (u_{d})} \dots \int_{- \infty}^{ϕ^{- 1} (u_{1})} {({(2 π)}^{d} | C |)}^{- 1 / 2} exp (- \frac{1}{2} x^{'} C^{- 1} x) d x, \end{matrix} \end{matrix}

(3)

where C is the positive definite correlation matrix, and

x = {(ϕ^{- 1} (u_{1}), ϕ^{- 1} (u_{2}), \dots, ϕ^{- 1} (u_{d}))}^{'}

with

ϕ (\cdot)

being the quantile function. On the other hand, taking the same action to the multivariate t distribution yields the t copula,

\begin{matrix} \begin{matrix} C_{v, C} (u_{1}, u_{2}, \dots, u_{d}) = \int_{- \infty}^{t_{v}^{- 1} (u_{d})} \dots \int_{- \infty}^{t_{v}^{- 1} (u_{1})} \frac{Γ (\frac{1}{2} (v + d)) / Γ (\frac{v}{2})}{\sqrt{{(π v)}^{d} | C |}} (1 + \frac{y^{'} C^{- 1} y}{v}) d y, \end{matrix} \end{matrix}

(4)

t_{v}^{- 1} (u_{1})

is the quantile function of the univariate standard t distribution with v degree of freedom and

y = {(t_{v}^{- 1} (u_{1}), t_{v}^{- 1} (u_{2}), t_{v}^{- 1} (u_{3}), \dots, t_{v}^{- 1} (u_{d}))}^{'}

, C is a correlation matrix. The respective copula density

c (\cdot)

can be obtained due to the differentiation

\begin{matrix} f (F_{1}^{- 1} (u_{1}), F_{2}^{- 1} (u_{2}), \dots, F_{d}^{- 1} (u_{d})) = c (u_{1}, u_{2}, \dots, u_{d}) \prod_{j = 1}^{d} f_{j} (F_{j}^{- 1} (u_{j})) . \end{matrix}

(5)

One potential advantage of using the t copula (4) over the normal copula is its ability to model the tail dependence. That is, we wish to measure the degree of dependence on the upper tail

ρ_{u}

and on the lower tail

ρ_{l}

. Taking two dimensional copulas as examples, we have for the corresponding random vector

(X_{1}, X_{2})

\begin{matrix} \begin{matrix} ρ_{l} = lim_{u \to 0} P (X_{2} < F_{2}^{- 1} (u) ∣ X_{1} < F_{1}^{- 1} (u)) = lim_{u \to 0} \frac{C (u, u)}{u} \\ ρ_{u} = lim_{u \to 1} P (X_{2} > F_{2}^{- 1} (u) ∣ X_{1} > F_{1}^{- 1} (u)) = lim_{u \to 1} \frac{1 - 2 u + C (u, u)}{u} . \end{matrix} \end{matrix}

Calculations lead to

ρ_{l} = ρ_{u} = 0

for the normal copula but for the t copula with v degree of freedom we have,

ρ_{l} = ρ_{u} = 2 F_{v + 1; t} (- \sqrt{(v + 1) (1 - c) / (1 + c)}),

where

F_{v + 1; t} (\cdot)

is the t distribution function with v degree of freedom. One criticism of the elliptical copula families is their symmetric property

c (u) = c (1 - u)

, which might be unrealistic for modeling the asymmetrical correlation that often occurs in the financial market [19]. Therefore, many authors have proposed the skewed elliptical copula. Smith et al. [20] proposed the skew t copula and the estimation of the parameters is performed by MCMC. Wu et al. [13] uses a nonparametric Bayesian approach to construct infinite mixture skew normal copula. Wei et al. [21] explored some theoretical properties of the skew-normal copula. Alternatively, Archimedean families of copulas are another solution for the issue.

2.2. Archimedean Copulas

Archimedean copulas have been widely researched and applied in the field of credit risk modeling [2]. They can be constructed by satisfying the following linear additive property, that is,

\begin{matrix} φ^{- 1} (C (u_{1}, u_{2}, \dots, u_{d})) = \sum_{i = 1}^{d} φ^{- 1} (u_{i}), \end{matrix}

(6)

where

φ (\cdot) : [0, + \infty) \to [0, 1]

is usually called the Archimedean copula generator satisfying convexity, continuity, and completely monotonicity with

φ (0) = 1

and

{lim}_{t \to \infty} φ (t) = 0

. The generator with such properties can be derived from the Laplace transform of the positive random variable X with its distribution function having

F_{X} (0) = 0

,

φ (t) = \int_{0}^{\infty} exp (- t x) d F_{X} (x) .

Taking different forms of

φ (t)

yields different Archimedean families of copulas, a comprehensive table can be found in the textbooks [22] (Table 4.1). We give several copulas that we will use with their generators in Table 1, and the corresponding distribution function is

C (u_{1}, \dots, u_{d}) = φ (\sum_{i = 1}^{d} φ^{- 1} (u_{i}))

.

One noticeable property of the Archimedean copula is its exchangeability. That is, for any permutation

σ (i)

of

{1, 2, \dots, d}

, we have

C (u_{1}, u_{2}, \dots, u_{d}) = C (u_{σ (1)}, u_{σ (2)}, \dots, u_{σ (d)}) .

This characteristic would be attractive for some applications, such as portfolio default modeling in the credit market. However, for the more general purpose, it might be undesirable when we have the copula dimension

d \geq 3

since this implies that the connection between variables is assumed to be homogeneous. Some improvement has been made on this problem, including nonexchangeable copulas named asymmetric Archimedean copulas [2,23]

C^{γ} (u_{1}, u_{2}, \dots, u_{d}) = (\prod_{j = 1}^{d} u_{j}^{γ_{i}}) C (u_{1}^{γ_{1}}, u_{2}^{γ_{2}}, \dots, u_{d}^{γ_{d}}) .

Otherwise, some amendment in (6) yields so-called nonexchangeable nested Archimedean copula [24].

Different copulas can model different kinds of dependence, and Figure 1 gives us a plot of four different types of copulas. In particular, normal and Frank copulas are symmetric in the sense of

c (1 - u_{1}, 1 - u_{2}) = c (u_{1}, u_{2})

with 0 tail dependence, but the Frank copula was, in addition, proved to be radial symmetric [25]. The Gumbel copula is able to depict the extreme upper tail dependence with

p_{u} = - 2^{θ_{Gumbel}^{- 1}} + 2

but

p_{l} = 0

. Oppositely, the Clayton copula is able to describe the extreme lower tail dependence with

p_{l} = 2^{- θ_{Clayton}^{- 1}}

for

θ_{Clayton} > 0

, but

ρ_{u} = 0

. Hence by mixing those four copulas, setting

\begin{matrix} C_{mix} (u_{1}, u_{2}) = w_{1} C_{fr} (u_{1}, u_{2}) + w_{2} C_{No} (u_{1}, u_{2}) + w_{3} C_{Cl} (u_{1}, u_{2}) + w_{4} C_{Gu} (u_{1}, u_{2}), \end{matrix}

(7)

We would be able to reconstruct vast amounts of nontrivial dependence from here [17].

3. Estimation and Selection

The main model we use to conduct simulations and real data analysis is (7). We also work with three-dimensional mixture Gaussian copulas in Section 4.3 to demonstrate the application of our approach in the higher dimensional situation. The general starting point is to construct the model by writing out

\begin{matrix} C_{m i x} (u) = \sum_{j = 1}^{K} w_{j} C_{j} (u; θ_{j}), \end{matrix}

(8)

with the knowledge that a true model has the form

C^{0} (u) = \sum_{j = 1}^{K^{'}} w_{j} C_{j} (u; θ_{j}^{0}), for K^{'} \leq K .

We then proceed to directly estimate (8) by the Bayesian approach, where

C_{j} (\cdot), C_{k} (\cdot)

for any

j, k \leq K

can either from the same parametric family or not, although for the former case, one needs to take extra measures for the label switching problems [6] (Section 22.3). Rousseau and Mengersen [15] showed that by applying this approach to the standard finite mixture distribution, it would clear out the redundant components asymptotically. In particular, they showed for

w = (w_{1}, w_{2}, \dots, w_{K}) \sim Dirichlet (α_{1}, α_{2}, \dots, α_{K})

with

\dim (θ_{j}) / 2

plus some other regularity conditions, the posterior estimation of weights has the property

\sum_{j = K^{'} + 1}^{K} E [w_{j} | D] = O_{P} (1 / \sqrt{n})

. This result has shown us the extra stability of the Bayesian estimation due to its shrinkage property compared with the maximum likelihood approach (MLE) since the MLE of an over-fitted model only guarantees the convergence to an unidentifiable set with the limiting distribution

C_{\infty} (\cdot) = C^{0} (\cdot)

in the domain as

n \to \infty

[26]. However, the asymptotic results do not guarantee sparsity. This would cause a failure to identify the correct number of components if, for example,

i, j, k \leq K

,

w_{i} C_{i} (u) + w_{j} C_{j} (u) = w_{k} C_{k} (u)

is achievable in the model setting.

On the other hand, Cai and Wang [17] approached the mixed copula estimation and selection problem using penalized MLE approach. In terms of its nature, this approach is quite similar to Bayesian estimation. However, the authors only applied penalties to the weighting parameters, whereas the Bayesian counterpart typically applies penalties to all parameters. The connection between these two approaches is established using the expectation maximization (EM) approach of the posterior, as outlined at the end of Section 3.2, where we compare the maximization form of the posterior mode and the penalized MLE.

3.1. Markov Chain Monte Carlo

We show the sampling algorithm of the model (7), but the spirit of the estimation remains the same for all forms of (8). It is especially straightforward to extend the work to the high dimensional implicit copulas [27], including some skew elliptical copulas by minor modifications. Hence, the validity of the approaches remains valid in high-dimensional settings.

In most cases, for the data

D_{n} = {X_{1}, X_{2}, \dots, X_{n}}

where

X_{i} \in R^{d}

, we do not have any additional information regarding the marginal distributions. Therefore, it would be necessary to estimate them together or treat them as nuisance parameters using the nonparametric approach. That is, we either specify the marginal parametric model so that the likelihood for the i.i.d data is

p (D_{n} ∣ α, θ_{m i x}) = \prod_{i = 1}^{n} c_{m i x} (F_{1} (X_{i 1}; α_{1}), F_{2} (X_{i 2}; α_{2}), \dots, F_{d} (X_{i d}; α_{d}); θ_{m i x}) \prod_{j = 1}^{d} f_{j} (X_{i j}; α_{1}) .

Or, to avoid misspecification of the marginal models, we use the semiparametric approach. The pseudo-likelihood for the i.i.d data is

p (D_{n} | θ_{m i x}) \propto \prod_{i = 1}^{n} c_{m i x} ({\hat{F}}_{n 1} (X_{i 1}), {\hat{F}}_{n 2} (X_{i 2}), \dots, {\hat{F}}_{n d} (X_{i d}); θ_{m i x})

where we have

{\hat{F}}_{n j} (x) = \frac{1}{n + 1} \sum_{i = 1}^{n} I (X_{i j} < x) .

Other alternatives of the margins

{\hat{F}}_{n} = ({\hat{F}}_{n 1}, {\hat{F}}_{n 2}, \dots, {\hat{F}}_{n d})

such as kernel density estimations are also available [28]. Therefore, only

θ_{m i x}

is estimated here. In this paper, we focus on the discussion of semiparametric cases.

We specifying the prior of

w

and

θ_{m i x}

with

\begin{matrix} π (w) \sim D i r (α_{1}, α_{2}, \dots, α_{K}) \\ π (θ_{m i x}) \sim N_{d} (0, I_{d}) \end{matrix}

Note that for any copula parameters which do not have the range

(- \infty, + \infty)

, when convenient, we transfer them from the original parameter space to

R

so that

θ = ϕ (θ_{0}) \in (- \infty, + \infty)

. Hence, we will be able to unify the prior to be normal. In case of the model (7), denote

θ_{m i x} \in {(- \infty, + \infty)}^{d}

, the original parameters can be obtained by

\begin{matrix} \begin{matrix} θ_{clayton}^{ori} = exp (θ_{mixclayton}) \\ θ_{gumbel}^{ori} = exp (θ_{mixgumbel}) + 1 \\ θ_{normal}^{ori} = \frac{1 - exp (- θ_{mixnormal})}{1 + exp (- θ_{mixnormal})} . \\ θ_{frank}^{ori} = θ_{mixfrank} \end{matrix} \end{matrix}

(9)

where

θ^{o r i}

refer to the parameters in the classical copula settings. We augment our data to

(X_{i}, Z_{i})

, where

Z_{i}

denotes the cluster of the point i, so that

p (X_{i} ∣ Z_{i} = k, θ_{m i x}) \propto c_{k} (F (X_{i}); θ_{k})

. The Metropolis–Hasting algorithm of sampling the posterior

p (θ_{m i x}, w ∣ D_{n})

follows as:

Setting initial values $θ_{m i x}^{(0)}, w^{(0)}$ .
Denote the current round to be t, iteratively updating $Z_{i}^{(t)}$ such that $p (Z_{i}^{(t)} ∣ Z_{∖ i}^{(t)}, D_{n}, w)$
$\propto p (X_{i} ∣ Z^{(t)}, w) p (Z_{i}^{(t)} ∣ w)$ for $i = 1, 2, \dots, n$ using Gibbs procedure; this can be sampled from the multinomial distribution with $p_{k} = \frac{w_{k} c_{k} ({\hat{F}}_{n} (X_{i}) ∣ θ_{k})}{\sum_{i} w_{i} c ({\hat{F}}_{n} (X_{i}) ∣ θ_{i})}$ with $k = 1, 2, \dots, K$ .
For all $i = 1, 2, \dots, K$ , we propose $f (θ_{i}^{*} ∣ θ_{i}^{t - 1}) \sim N_{d} (θ_{i}^{t - 1}, \hat{Σ})$ where $\hat{Σ}$ is updated every 50 iterations from the sample variance of previously accepted points. We accept the $θ_{i}^{*} = θ_{i}^{t}$ with the acceptance rate

$a_{i} = \frac{\prod_{j = 1}^{n_{i}} c_{i} (X_{i j}; θ_{i}^{*}) π (θ_{i}^{*}) f (θ_{i}^{t - 1} ∣ θ_{i}^{*})}{\prod_{j = 1}^{n_{i}} c_{i} (X_{i j}; θ_{i}^{t - 1}) π (θ_{i}^{t - 1}) f (θ_{i}^{*} ∣ θ_{i}^{t - 1})} .$
Update $w \sim D i r (α_{1} + \sum_{i = 1}^{n} I (Z_{i}^{(t)} = 1), \dots, α_{K} + \sum_{i = 1}^{n} I (Z_{i}^{(t)} = K))$ .
Repeat steps 2–4 until the stopping criteria are reached, for example, after 10,000 iterations. The MCMC method would be sufficient for our purpose. However, by setting up the EM method for the posterior mode, we can bridge between the Bayesian methods and the penalized likelihood methods discussed in [16,17]. In addition, if the gradient information of the copula is available, it would be faster to work with the EM to get the parameter estimations.

3.2. EM Algorithm

Start from the complete data

(X_{i}, Z_{i})

where

Z_{i}

is the cluster label as previously. Therefore, we denote

Q (Z) : = log p (w, θ_{m i x}, Z | X)

; our goal is to work iteratively so that

\begin{matrix} (w^{t + 1}, θ^{t + 1}) = {argmax}_{θ, w} \int Q (Z) p (Z ∣ X, θ_{m i x}^{t}, w^{t}) d Z = {argmax}_{θ, w} E_{p (Z | X, θ_{m i x}^{t}, w^{t})} (Q) \end{matrix}

(10)

In more detail,

\begin{matrix} Q (Z) = log p (w, θ_{m i x}, Z | X) \\ \propto log p (X | w, θ_{m i x}, Z) + log p (Z | w) + log p (w, θ_{m i x}) \\ \propto log \prod_{i = 1}^{n} \prod_{j = 1}^{K} c_{j} {(F_{n} (X_{i}); θ_{j})}^{I (Z_{i} = j)} + log \prod_{i = 1}^{n} \prod_{j = 1}^{K} w_{j}^{I (Z_{i} = j)} \\ + log \prod_{j = 1}^{K} w_{j}^{(α_{j} - 1)} - \sum_{j = 1}^{K} \frac{1}{2} | | θ_{j} {| |}^{2} + C, \end{matrix}

where we have denoted the irrelevant constant to be C, and

p (w, θ_{m i x}) = p (w) p (θ_{m i x})

.

Hence, we take the expectation so that the argmax of (10) would be equivalent as

\begin{matrix} \begin{matrix} {argmax}_{w, θ_{m i x}} \sum_{i, j} log c_{j} ({\hat{F}}_{n} (X_{i}); θ_{j}) E (I (Z_{i} = j)) \\ + \sum_{i, j} E (I (Z_{i} = j)) log w_{j} + \sum_{j = 1}^{K} (α_{j} - 1) log w_{j} - \sum_{j = 1}^{K} \frac{1}{2} | | θ_{j} {| |}^{2} \\ = \sum_{i, j} r_{i j}^{t} log c_{j} ({\hat{F}}_{n} (X_{i}); θ_{j}) + n \sum_{i, j} r_{i j}^{t} log w_{j} - (1 - \frac{1}{K}) \sum_{j} log w_{j} - \sum_{j} \frac{1}{2} | | θ_{j} {| |}^{2}, \end{matrix} \end{matrix}

(11)

where we have taken

α_{j} = 1 / K

to make it less informative while satisfying the regularity condition of [15] and

r_{i j}^{t} = \frac{w_{j}^{t} c_{j} (y_{i} | θ_{j}^{t})}{\sum_{j} w_{j}^{t} f_{j} (y_{i} | θ_{j}^{t})}

To achieve the maximum, we differentiate with respect to

w_{j}

while adding the Lagrange multiplier

λ (1 - \sum w_{j})

, we have

\begin{matrix} w_{j}^{t + 1} = \frac{1}{N + 1 - K} (\sum_{i} \frac{w_{j}^{t} c_{j} (y_{i} | θ_{j}^{t})}{\sum_{j} w_{j}^{t} f_{j} (y_{i} | θ_{j}^{t})} - (1 - \frac{1}{K})) . \end{matrix}

(12)

Differentiate with respect to

θ_{j}

, and it can be solved numerically using quasi-Newton methods.

We note that the goal of the EM method is to find the mode of the log posterior

\begin{matrix} log p (w, θ_{m i x} | X) & \propto \sum_{i} log \sum_{j} w_{j} c_{i j} ({\hat{F}}_{n} (X); θ_{j}) - (1 - \frac{1}{K}) \sum_{j} log w_{j} - \sum_{j} \frac{1}{2} | | θ_{j} {| |}^{2} \\ = \sum_{i} log \sum_{j} w_{j} c_{i j} ({\hat{F}}_{n} (X); θ_{j}) - n \sum_{j} Ω_{(1 - 1 / K)}^{w} (w_{j}) - n \sum_{j} Ω_{(1 / 2)}^{θ_{m i x}} (θ_{j}), \end{matrix}

(13)

This form shares a similar structure as (3.2) in [16] or (3) in [17] despite the fact that they do not penalize the copula parameters. Intuitively, it would be beneficial to penalize them in order to regularize the parameters of

C_{j} (\cdot)

when this copula has 0 weightings. Wang [16] proved the

\sqrt{n}

—asymptotic consistency and sparsity of their semiparametric SCAD-penalized likelihood approaches. The consistency of our Bayesian methods will be tested empirically in the next part. However, the theoretical demonstrations are more challenging to consider with Dirichlet distribution priors due to the singularity of

log w_{i}

at

w_{i} = 0

[29].

One shortcoming of using the EM method is the difficulty in obtaining the confidence interval of estimators. Bootstrap could be a very computationally intensive solution. On the other hand, one may consider the fisher information matrix

- \nabla \nabla_{w, θ} log p (X ∣ \hat{w}, {\hat{θ}}_{mix})

as an asymptotic approximation of the precision matrix. Gelman et al. [6] (p. 324) provide an approach to iteratively calculate the asymptotic variance matrix along with the parameter estimations.

4. Numerical Simulations

4.1. Markov Chain Monte Carlo

We perform two types of numerical simulations. Firstly, we assume that the marginal distributions of the data are perfectly known. Therefore, we focus on the estimation of the copula using the data

(U_{i 1}, U_{i 2}, \dots, U_{i d}) = (F_{i 1} (X_{1}), F_{i 2} (X_{2}), \dots, F_{i d} (X_{d}))

for

i = 1, 2, \dots, n

. the dimension d is set to be 2 for our simulation purpose. Our working model is (7). That is,

\begin{matrix} C_{mix} (u_{1}, u_{2}) = w_{1} C_{fr} (u_{1}, u_{2}) + w_{2} C_{No} (u_{1}, u_{2}) + w_{3} C_{Cl} (u_{1}, u_{2}) + w_{4} C_{Gu} (u_{1}, u_{2}) . \end{matrix}

We sample the data from different true models, which are submodels of (7), and we estimate them using the MCMC method of Section 3.1. Secondly, we assume that the marginal distributions are unknown, we hence estimate the margins empirically using

{\hat{F}}_{n p} (x) = \frac{1}{n + 1} \sum_{i = 1}^{n} I (X_{i p} \leq x)

. Thus, we have

({\hat{U}}_{i 1}, {\hat{U}}_{i 2}, \dots, {\hat{U}}_{i d}) = ({\hat{F}}_{i 1} (X_{1}), {\hat{F}}_{i 2} (X_{2}), \dots, {\hat{F}}_{i d} (X_{d}))

for

i = 1, 2, \dots, n

and the copula parameters can be estimated thereafter.

We simulate 3000 iterations for all models, with the first 2500 points discarded as the burning stage. The number of the sample points is

n = 400, 800, 2000

. Table 2 and Table 3 display the simulation results. In general, the weighting parameters as well as the copula parameters of non-zero weighting components approach the truth with decreasing Monte Carlo standard deviation. The mean and error estimations of the copula parameters with zero weightings remain close to its priors, which might be considered as an advantage over the penalized method used in [16,17] as they proved that the zero weighting copula parameters would end up randomly in their parameter spaces by using their penalized likelihood approach. Three major misidentification cases were found in tables, that is,

n = 400, 800

of Frank copulas simulations in Table 2 and

n = 800

of Frank copulas in the Table 3. All cases mentioned seem to be misidentified as normal copulas, which are understandable as the normal copula and Frank copula share very similar structures with zero tail dependence.

4.2. Expectaion Maximization

In this part, we investigate the performances of the EM algorithm introduced in Section 3.2. The approach is computationally demanding. Therefore, we only show the results with the sample size of

n = 200, 400, 800

for one-component copulas. Data are generated directly from the true copula models. More specifically, for each sample size of

n = 200, 400, 800

, we generate 10 batches of data from the true distribution. Every batch is learned by the EM method, and the stopping criteria are 1000 full iterations or the absolute sum of the parameters increase less than

0.001

for an iteration. We calculate the mean and variance estimators for each sample size. Table 4 displays the results of the EM approach. It shows comparable outcomes with the MCMC. Although all algorithms fail to distinguish the Frank copulas from the normal ones due to their similarities, other copulas are selected with satisfactory accuracy. One clear advantage of using the EM is its convenience in introducing an exit mechanism for unlikely copulas during the training process. That is, due to the shrinkage term of the weight in (12), we can eliminate components when their corresponding weights fall down to non-positive during the training. By adding this procedure, we can automatically consider fewer mixture components at later stages. As we can see from Table 4, there are many components with deterministic 0 weightings. However, the shortcomings of the EM approach are also very clear. It is more computationally demanding especially when we seek to obtain some estimation errors or work with high dimensional copulas. On the other hand, the EM seeks to find the posterior mode which is less favorable than the posterior mean in statistical decision theories, while the MCMC approach gives full posterior distributions, and it is well acknowledged that the performance of the EM could be affected by starting points.

4.3. Higer Dimensional Cases

We proceed to test the effectiveness of our approach in a higher-dimensional case. As the classic Archimedean families of copulas are rarely used in high-dimensional applications due to the restriction of their parameter spaces, we apply the more commonly used Gaussian mixture copulas with 3 components to perform the estimations, while the dimension of the data is set to be 3. That is, we use the MCMC sampler to estimate the model

c_{NormalMix} = w_{α} c_{α} (u; Σ_{α}) + w_{β} c_{β} (u; Σ_{β}) + w_{γ} c_{γ} (u; Σ_{γ}) .

(14)

A major obstacle to performing MCMC of such type is the sampling of the correlation matrices

(Σ_{α}, Σ_{β}, Σ_{γ})

. The valid sampler should generate symmetric positive definite matrices every time with every entry from 0 to 1 and 1 in their diagonal. Readers are referred to [30] for a detailed approach. On the other hand, when performing the MCMC sampling with mixture copulas from the same parametric families, it should also be noticed that label-switching problems often occurred. This is because that (14) has

3!

equivalent forms by just switching the labels; some engineering efforts should be made to mitigate the circumstances. After every round of iteration, one can post-process the model so that the component with the highest weighting always ranks first. In addition, if the weightings are too close to distinguish, further criteria such as

\det | Σ | + trace (Σ)

should be used.

In this study, we use the data sampled from

\begin{matrix} (Σ_{α}^{12}, Σ_{α}^{23}, Σ_{α}^{13}) = (0.7, 0.7, - 0.6) \\ (Σ_{β}^{12}, Σ_{β}^{23}, Σ_{β}^{13}) = (0.6, 0.6, 0.6) \\ (Σ_{γ}^{12}, Σ_{β}^{23}, Σ_{γ}^{13}) = (- 0.7, 0.7, 0.7) . \end{matrix}

Additionally, we set the true weighting of the experiments to be

(w_{α}, w_{β}, w_{γ}) = (1, 0, 0)

and

(w_{α}, w_{β}, w_{γ}) = (0, 0.7, 0.3)

, respectively. Therefore, the true model lies in the parameter spaces of (14). Table 5 displays the results of experiments. It shows good signs of convergence to the truth while we increase the sample size. For the 2000 sample size experiments, the true components are successfully filtered out with low uncertainty, indicating the effectiveness of our proposed approach beyond the classical two-dimensional copula applications.

5. Real Data Analysis

In the real data analysis, we use financial trading data from three major indices, that is, Standard & Poors 500 (SP500), Shanghai Composite Index (SSEC), and Hang Seng Index (HSI). Daily close prices from 9 October 2017 to 29 September 2022 were extracted; we aligned three series with the common trading days among them, and other days were omitted. To ease the analysis of the dependence pattern among them, we take the log returns respectively so that

R_{i} = log P_{i} - log P_{i - 1}, i = 1, 2, 3

. Table 6 shows the Pearson and Spearman correlation among markets. SSEC and HSI display strong levels of dependence, while their connection with SP500 is relatively weak for those two markets. However, as we argued previously, the single metric of correlation does not give the full picture of the dependence. It is therefore reasonable to apply the mixture copula models for further analysis. In addition, the Ljung–Box tests to the absolute values

| R_{i} |

of series indicate all series are correlated to themselves through time. Moreover, the augmented Dickey–Fuller tests show that they are covariance stationary. To apply the copula models to the autocorrelated data, we use the standard method of standardizing. That is, the autocorrelation is removed by rescaling the volatility of the GARCH(1,1) model; assume

\begin{matrix} \begin{matrix} R_{t} = μ + σ_{t} z_{t} i . i . d z_{t} \sim N (0, 1) \\ σ_{t}^{2} = α σ_{t - 1}^{2} + β z_{t - 1}^{2} + γ, \end{matrix} \end{matrix}

(15)

We apply the data

(Z_{1}, Z_{2}, \dots, Z_{T}) = (\frac{R_{1} - μ}{σ_{1}}, \frac{R_{2} - μ}{σ_{2}}, \dots, \frac{R_{T} - μ}{σ_{T}})

to copula model and use semiparametric approach of Section 3.1 to learn the parameters. The MCMC samplings are performed 5000 times with the last 500 times used for analyzing the parameters. Table 7 shows the results of the estimation with the insignificant components omitted. We observe the strong Clayton components in the first two columns, but the Gumbel components, on the other hand, are all very weak among the three markets due to the asymmetry nature of the Clayton copula at its left tail, which can be seen in Figure 1. This indicates the existence of asymmetry dependence among markets, especially at the lower left tail, and the dependence on the upper right tail is less obvious. Our finding means that the stock markets usually more easily have a downward co-movement but are much less likely to move upward together. In contrast, the dependence pattern between HSI and SP500 is more symmetrical, with dominating normal and Frank components and a very weak Gumbel component. Given the absence of extreme left tail dependence, a portfolio consisting of HSI and SP500 indexes is less likely to experience significant losses compared to other cross-market portfolios during extreme financial conditions.

6. Conclusions

In this paper, we discuss the method of selecting and estimating the mixture copula simultaneously. This is achieved by first overfitting the model with all potential mixture components and then estimating the parameters by Bayesian methods. The MCMC and EM methods are proposed to learn the parameters, and we have performed numerical simulations to validate the correctness. Furthermore, we apply the methodology to the financial markets to detect the asymmetry dependencies among them. For future research, the effectiveness of this method for general mixture models can be thoroughly investigated and tested. In addition, a full and thorough comparison among various model selection approaches can be studied. We also expect this method to be useful in improving other empirical studies, such as value at risk and conditional value at risk calculations in financial risk management.

Author Contributions

Conceptualization, D.X.; Methodology, Y.L.; Resources, S.Y.; Writing—original draft, Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Please contact the corresponding author to request the data.

Conflicts of Interest

The authors declare no conflict of interest.

References

Murphy, K.P. Machine Learning: A Probabilistic Perspective; MIT Press: Cambridge, MA, USA, 2012. [Google Scholar]
McNeil, A.J.; Frey, R.; Embrechts, P. Quantitative Risk Management: Concepts, Techniques and Tools-Revised Edition; Princeton University Press: Princeton, NJ, USA, 2015. [Google Scholar]
Sklar, M. Fonctions de repartition an dimensions et leurs marges. Publ. Inst. Statist. Univ. Paris 1959, 8, 229–231. [Google Scholar]
Embrechts, P. Copulas: A personal view. J. Risk Insur. 2009, 76, 639–650. [Google Scholar] [CrossRef]
McLachlan, G.J.; Lee, S.X.; Rathnayake, S.I. Finite mixture models. Annu. Rev. Stat. Its Appl. 2019, 6, 355–378. [Google Scholar] [CrossRef]
Gelman, A.; Carlin, J.B.; Stern, H.S.; Dunson, D.B.; Vehtari, A.; Rubin, D.B. Bayesian Data Analysis; CRC Press: Boca Raton, FL, USA, 2013. [Google Scholar]
Hu, L. Dependence patterns across financial markets: A mixed copula approach. Appl. Financ. Econ. 2006, 16, 717–729. [Google Scholar] [CrossRef]
Arakelian, V.; Karlis, D. Clustering dependencies via mixtures of copulas. Commun. Stat.-Simul. Comput. 2014, 43, 1644–1661. [Google Scholar] [CrossRef]
Vrac, M.; Billard, L.; Diday, E.; Chédin, A. Copula analysis of mixture models. Comput. Stat. 2012, 27, 427–457. [Google Scholar] [CrossRef] [Green Version]
Liu, G.; Long, W.; Yang, B.; Cai, Z. Semiparametric estimation and model selection for conditional mixture copula models. Scand. J. Stat. 2022, 49, 287–330. [Google Scholar] [CrossRef]
Huard, D.; Evin, G.; Favre, A.C. Bayesian copula selection. Comput. Stat. Data Anal. 2006, 51, 809–822. [Google Scholar] [CrossRef]
Silva, R.d.S.; Lopes, H.F. Copula, marginal distributions and model selection: A Bayesian note. Stat. Comput. 2008, 18, 313–320. [Google Scholar] [CrossRef]
Wu, J.; Wang, X.; Walker, S.G. Bayesian nonparametric inference for a multivariate copula function. Methodol. Comput. Appl. Probab. 2014, 16, 747–763. [Google Scholar] [CrossRef] [Green Version]
Wu, J.; Wang, X.; Walker, S.G. Bayesian nonparametric estimation of a copula. J. Stat. Comput. Simul. 2015, 85, 103–116. [Google Scholar] [CrossRef]
Rousseau, J.; Mengersen, K. Asymptotic behaviour of the posterior distribution in overfitted mixture models. J. R. Stat. Soc. Ser. B Stat. Methodol. 2011, 73, 689–710. [Google Scholar] [CrossRef] [Green Version]
Wang, X. Selection of Mixed Copulas and Finite Mixture Models with Applications in Finance. Ph.D. Thesis, The University of North Carolina at Charlotte, Charlotte, NC, USA, 2008. [Google Scholar]
Cai, Z.; Wang, X. Selection of mixed copula model via penalized likelihood. J. Am. Stat. Assoc. 2014, 109, 788–801. [Google Scholar] [CrossRef]
Smith, M.S.; Loaiza-Maya, R. Implicit copula variational inference. J. Comput. Graph. Stat. 2022, 2022, 1–28. [Google Scholar] [CrossRef]
Ang, A.; Chen, J. Asymmetric correlations of equity portfolios. J. Financ. Econ. 2002, 63, 443–494. [Google Scholar] [CrossRef]
Smith, M.S.; Gan, Q.; Kohn, R.J. Modelling dependence using skew t copulas: Bayesian inference and applications. J. Appl. Econom. 2012, 27, 500–522. [Google Scholar] [CrossRef]
Wei, Z.; Kim, S.; Choi, B.; Kim, D. Multivariate skew normal copula for asymmetric dependence: Estimation and application. Int. J. Inf. Technol. Decis. Mak. 2019, 18, 365–387. [Google Scholar] [CrossRef]
Nelsen, R.B. An Introduction to Copulas; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2007. [Google Scholar]
Genest, C.; Ghoudi, K.; Rivest, L.P. “Understanding relationships using copulas,” by Edward Frees and Emiliano Valdez, January 1998. North Am. Actuar. J. 1998, 2, 143–149. [Google Scholar] [CrossRef]
Joe, H. Multivariate Models and Multivariate Dependence Concepts; CRC Press: Boca Raton, FL, USA, 1997. [Google Scholar]
Frank, M.J. On the simultaneous associativity of F(x,y) and x+y-F(x,y). Aequationes Math. 1979, 19, 194–226. [Google Scholar] [CrossRef]
Feng, Z.D.; McCulloch, C.E. Using bootstrap likelihood ratios in finite mixture models. J. R. Stat. Soc. Ser. B Methodol. 1996, 58, 609–617. [Google Scholar] [CrossRef]
Smith, M.S. Implicit copulas: An overview. Econom. Stat. 2021, in press. [CrossRef]
Patton, A.J. A review of copula models for economic time series. J. Multivar. Anal. 2012, 110, 4–18. [Google Scholar] [CrossRef] [Green Version]
Fan, J.; Li, R. Variable selection via nonconcave penalized likelihood and its oracle properties. J. Am. Stat. Assoc. 2001, 96, 1348–1360. [Google Scholar] [CrossRef]
Smith, M.S. Bayesian approaches to copula modelling. arXiv 2011, arXiv:1112.4204. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Scatter plots of different families of copula with 2000 points,

θ = 0.6, 5, 3, a n d 3

for normal, Clayton, Frank, and Gumbel copulas, respectively.

Figure 1. Scatter plots of different families of copula with 2000 points,

θ = 0.6, 5, 3, a n d 3

for normal, Clayton, Frank, and Gumbel copulas, respectively.

Table 1. Some classic Archimedean families of copulas.

Copula Type	$φ (t)$	$θ$ Range
Frank	$θ^{- 1} ln (\frac{1}{1 + (exp (- θ - t) - exp (- t))})$	$R ∖ {0}$
Gumbel	$exp (- t^{θ^{- 1}})$	$[1, + \infty)$
Clayton ^$†$	${(1 + θ t)}_{+}^{- θ^{- 1}}$	$(0, + \infty)$

^† We denote

{(\cdot)}_{+} : = max (\cdot, 0)

.

Table 2. MCMC estimations of the copula with the marginal distributions fully known. The numbers inside parentheses indicate standard errors, and estimations of the true components are denoted in bold font.

True Copula (Param)
		MCMC Estimation
	$n$	Clayton		Gumbel		Normal		Frank
		$w$	$θ$	$w$	$θ$	$w$	$θ$	$w$	$θ$
Normal (0.5)	400	0.089 (0.088)	1.568 (1.670)	0.047 (0.036)	3.801 (2.750)	0.839 (0.105)	0.444 (0.046)	0.025 (0.032)	−0.029 (0.994)
	800	0.0439 (0.066)	1.173 (1.586)	0.007 (0.011)	2.494 (1.370)	0.940 (0.066)	0.514 (0.026)	0.009 (0.014)	0.113 (0.916)
	2000	0.039 (0.049)	1.363 (1.424)	0.028 (0.034)	3.087 (1.968)	0.913 (0.063)	0.494 (0.024)	0.020 (0.038)	0.043 (1.120)
Clayton (5)	400	0.990 (0.011)	4.914 (0.246)	0.005 (0.010)	2.490 (1.612)	0.003 (0.005)	0.526 (0.224)	0.002 (0.003)	0.012 (0.965)
	800	0.992 (0.009)	4.876 (0.185)	0.003 (0.006)	2.641 (1.774)	0.003 (0.005)	0.488 (0.207)	0.002 (0.004)	0.037 (0.944)
	2000	0.996 (0.003)	5.091 (0.133)	0.001 (0.002)	2.411 (1.569)	0.001 (0.002)	0.568 (0.205)	0.001 (0.001)	−0.198 (0.983)
Gumbel (2.5)	400	0.017 (0.027)	1.676 (1.505)	0.957 (0.038)	2.486 (0.105)	0.022 (0.033)	0.530 (0.210)	0.004 (0.007)	0.080 (1.002)
	800	0.002 (0.004)	1.480 (1.593)	0.991 (0.009)	2.701 (0.071)	0.004 (0.007)	0.545 (0.210)	0.002 (0.005)	0.070 (1.051)
	2000	0.006 (0.008)	1.442 (1.268)	0.988 (0.014)	2.470 (0.048)	0.005 (0.011)	0.533 (0.194)	0.001 (0.002)	−0.091 (0.968)
Frank (5)	400	0.061 (0.083)	1.903 (1.512)	0.030 (0.041)	2.386 (2.115)	0.875 (0.087)	0.647 (0.038)	0.033 (0.047)	0.416 (1.037)
	800	0.058 (0.041)	3.774 (2.751)	0.019 (0.039)	2.324 (2.043)	0.899 (0.055)	0.603 (0.031)	0.024 (0.031)	0.280 (0.984)
	2000	0.007 (0.012)	1.358 (1.223)	0.004 (0.007)	2.255 (1.394)	0.205 (0.055)	0.790 (0.041)	0.784 (0.058)	4.408 (0.285)
0.5 Gumbel (2.5) + 0.5 Clayton (5)	400	0.439 (0.057)	6.079 (0.761)	0.533 (0.059)	2.756 (0.242)	0.024 (0.036)	0.569 (0.207)	0.004 (0.007)	0.176 (1.003)
	800	0.567 (0.034)	5.332 (0.390)	0.429 (0.040)	2.328 (0.143)	0.002 (0.004)	0.514 (0.210)	0.002 (0.004)	0.126 (0.976)
	2000	0.509 (0.034)	5.111 (0.356)	0.480 (0.032)	2.505 (0.076)	0.005 (0.008)	0.523 (0.200)	0.006 (0.007)	0.182 (1.005)
0.5 Clayton (5) + 0.5 Normal (0.5)	400	0.513 (0.087)	5.150 (1.054)	0.061 (0.070)	2.606 (2.353)	0.383 (0.095)	0.554 (0.080)	0.044 (0.067)	0.280 (1.036)
	800	0.573 (0.041)	4.107 (0.336)	0.165 (0.079)	1.833 (0.534)	0.191 (0.144)	0.410 (0.126)	0.069 (0.086)	0.365 (1.028)
	2000	0.456 (0.035)	5.500 (0.372)	0.069 (0.046)	2.750 (0.788)	0.473 (0.035)	0.466 (0.035)	0.002 (0.003)	−0.105 (0.941)

Table 3. MCMC estimations of the copula with the marginal distributions estimated by empirical distribution. The numbers inside parentheses indicate standard errors, and estimations of the true components are denoted in bold font. The corresponding true marginal distribution is N(1, 1) and N(0.5, 1).

True Copula (Param)
		MCMC Estimation
	$n$	Clayton		Gumbel		Normal		Frank
		$w$	$θ$	$w$	$θ$	$w$	$θ$	$w$	$θ$
Normal (0.5)	400	0.003 (0.005)	1.637 (1.785)	0.114 (0.125)	2.015 (1.073)	0.878 (0.124)	0.590 (0.038)	0.005 (0.009)	0.064 (0.993)
	800	0.08 (0.08)	1.543 (1.321)	0.008 (0.011)	2.569 (1.758)	0.886 (0.102)	0.568 (0.033)	0.026 (0.037)	0.246 (1.054)
	2000	0.025 (0.039)	0.845 (1.039)	0.021 (0.021)	1.740 (0.786)	0.952 (0.048)	0.541 (0.022)	0.002 (0.004)	−0.072 (0.951)
Clayton(5)	400	0.987 (0.015)	4.856 (0.240)	0.006 (0.014)	2.648 (1.718)	0.004 (0.007)	0.530 (0.204)	0.002 (0.004)	0.061 (0.929)
	800	0.994 (0.006)	4.733 (0.185)	0.003 (0.005)	2.957 (1.793)	0.001 (0.002)	0.499 (0.226)	0.001 (0.003)	−0.050 (1.023)
	2000	0.996 (0.004)	5.423 (0.130)	0.002 (0.004)	3.438 (2.236)	0.001 (0.002)	0.539 (0.197)	0.001 (0.001)	−0.088 (0.959)
Gumbel (2.5)	400	0.009 (0.018)	1.589 (1.533)	0.971 (0.034)	2.830 (0.122)	0.018 (0.031)	0.554 (0.190)	0.002 (0.004)	−0.104 (1.088)
	800	0.005 (0.007)	2.293 (2.862)	0.981 (0.021)	2.652 (0.084)	0.012 (0.019)	0.484 (0.199)	0.002 (0.003)	0.076 (0.931)
	2000	0.004 (0.008)	2.295 (2.647)	0.993 (0.008)	2.530 (0.044)	0.001 (0.001)	0.522 (0.216)	0.002 (0.004)	0.156 (0.962)
Frank (5)	400	0.012 (0.016)	2.220 (2.809)	0.096 (0.099)	2.333 (1.019)	0.005 (0.010)	0.532 (0.197)	0.887 (0.094)	4.533 (0.376)
	800	0.147 (0.044)	4.664 (1.414)	0.006 (0.011)	2.430 (1.634)	0.836 (0.048)	0.599 (0.028)	0.011 (0.018)	0.145 (1.028)
	2000	0.005 (0.007)	2.334 (3.135)	0.050 (0.028)	2.659 (0.833)	0.016 (0.024)	0.453 (0.208)	0.929 (0.023)	5.107 (0.281)
0.5 Gumbel (2.5) + 0.5 Clayton (5)	400	0.551 (0.085)	4.327 (0.556)	0.363 (0.129)	2.586 (0.276)	0.080 (0.135)	0.603 (0.206)	0.006 (0.009)	0.072 (1.071)
	800	0.413 (0.046)	5.149 (0.526)	0.538 (0.060)	2.645 (0.167)	0.050 (0.050)	0.558 (0.211)	0.003 (0.007)	−0.031 (1.136)
	2000	0.531 (0.030)	4.792 (0.270)	0.464 (0.031)	2.500 (0.089)	0.004 (0.008)	0.508 (0.197)	0.001 (0.002)	0.104 (1.026)
0.5 Clayton (5) + 0.5 Normal (0.5)	400	0.502 (0.082)	4.871 (0.992)	0.044 (0.077)	2.376 (1.230)	0.450 (0.109)	0.488 (0.077)	0.005 (0.009)	−0.122 (0.960)
	800	0.526 (0.042)	5.321 (0.461)	0.015 (0.020)	2.841 (1.888)	0.444 (0.054)	0.485 (0.048)	0.015 (0.026)	0.065 (1.021)
	2000	0.534 (0.034)	4.725 (0.325)	0.106 (0.058)	1.751 (0.353)	0.351 (0.070)	0.512 (0.060)	0.009 (0.011)	0.015 (0.966)

Table 4. EM estimations of the copula with the marginal distributions fully known. The numbers inside parentheses indicate standard errors, and estimations of the true components are denoted in bold font. The starting value of the

E M

is

w = (0.25, 0.25, 0.25, 0.25), θ_{m i x} = (1, 1, 1, 1)

.

Table 4. EM estimations of the copula with the marginal distributions fully known. The numbers inside parentheses indicate standard errors, and estimations of the true components are denoted in bold font. The starting value of the

E M

is

w = (0.25, 0.25, 0.25, 0.25), θ_{m i x} = (1, 1, 1, 1)

.

True Copula (Param)
		EM Estimations
	$n$	Clayton		Gumbel		Normal		Frank
		$w$	$θ$	$w$	$θ$	$w$	$θ$	$w$	$θ$
Normal (0.5)	200	0.020 (0.060)	1.137 (0.435)	0.035 (0.110)	1.957 (0.135)	0.947 (0.117)	0.509 (0.030)	0 (0)	0.5 (0)
	400	0.031 (0.078)	1.479 (1.661)	0.050 (0.12)	2.031 (0.351)	0.921 (0.129)	0.506 (0.042)	0 (0)	0.5 (0)
	800	0.031 (0.068)	1.021 (0.134)	0 (0)	2 (0)	0.969 (0.068)	0.485 (0.021)	0 (0)	0.5 (0)
Clayton (5)	200	1 (0)	4.955 (0.525)	0 (0)	2 (0)	0 (0)	0.5 (0)	0 (0)	0.5 (0)
	400	0.989 (0.035)	4.972 (0.246)	0.012 (0.036)	2.628 (1.985)	0 (0)	0.5 (0)	0 (0)	0.5 (0)
	800	1 (0)	4.988 (0.162)	0 (0)	2 (0)	0 (0)	0.5 (0)	0 (0)	0.5 (0)
Gumbel (2.5)	200	0 (0)	1 (0)	1 (0)	2.486 (0.177)	0 (0)	0.5 (0)	0 (0)	0.5 (0)
	400	0 (0)	1 (0)	1 (0)	2.500 (0.088)	0 (0)	0.5 (0)	0 (0)	0.5 (0)
	800	0 (0)	1 (0)	0.979 (0.038)	2.562 (0.081)	0.02 (0.04)	0.513 (0.058)	0 (0)	0.5 (0)
Frank (5)	200	0.117 (0.138)	2.182 (1.415)	0.155 (0.268)	2.016 (0.118)	0.723 (0.244)	0.557 (0.102)	0.010 (0.030)	0.498 (0.007)
	400	0.054 (0.087)	1.619 (1.075)	0.184 (0.187)	2.283 (0.383)	0.764 (0.228)	0.555 (0.080)	0 (0)	0.5 (0)
	800	0.075 (0.085)	2.201 (1.456)	0.060 (0.103)	2.249 (0.498)	0.840 (0.089)	0.608 (0.047)	0.030 (0.050)	0.517 (0.029)

Table 5. MCMC estimations of the 3-dimensional mixture Gaussian copulas with the marginal distributions fully known. The numbers inside parentheses indicate standard errors, and estimations of the true components are denoted in bold font. Comp is the abbreviation for component and the components are ordered by their weightings.

True Copula (Param)
		MCMC Estimations
	n	Comp1		Comp2		Comp3
		$w$	$θ$	$w$	$θ$	$w$	$θ$
Normal (0.7, −0.7, −0.6)	400	0.815 (0.160)	0.691 (0.031), −0.676 (0.042), −0.602 (0.048)	0.173 (0.152)	0.685 (0.212), −0.658 (0.278), −0.430 (0.342)	0.019 (0.016)	0.414 (0.306), −0.050 (0.539), −0.152 (0.413)
	800	0.991 (0.014)	0.680 (0.017), −0.715 (0.015), −0.604 (0.023)	0.007 (0.012)	0.371 (0.429),−0.435 (0.318),−0.413 (0.571)	0.002 (0.003)	0.312 (0.445),−0.341 (0.377),−0.308 (0.566)
	2000	0.992 (0.007)	0.70 (0.009),−0.699 (0.010),−0.612 (0.012)	0.007 (0.007)	0.154 (0.350),−0.362 (0.498),−0.336 (0.252)	0.001 (0.001)	−0.225 (0.283),0.303 (0.445),−0.230 (0.239)
0.7 Normal (0.6, 0.6, 0.6) + 0.3 Normal (−0.7, −0.7, 0.7)	400	0.609 (0.097)	0.679 (0.069), 0.616 (0.048), 0.602 (0.043)	0.289 (0.047)	−0.595 (0.310), −0.592 (0.327), 0.713 (0.066)’	0.102 (0.085)	−0.224 (0.387), −0.170 (0.452), 0.468 (0.141)
	800	0.656 (0.074)	0.567 (0.035), 0.599 (0.042), 0.576 (0.038)	0.216 (0.042)	−0.535 (0.326), −0.594 (0.252), 0.687 (0.076)	0.128 (0.054)	−0.331 (0.392), −0.450 (0.342), 0.724 (0.087)
	2000	0.663 (0.030)	0.636 (0.017), 0.607 (0.021), 0.603 (0.014)	0.310 (0.020)	−0.677 (0.030), −0.690 (0.029), 0.740 (0.020)	0.026 (0.027)	0.190 (0.320), 0.138 (0.319), 0.257 (0.193)

Table 6. Pearson and Spearman correlation among three markets from October 2017 to September 2020.

	SSEC	HSI	SP500	SSEC	HSI	SP500
	Pearson Correlation			Spearman Correlation
SSEC	1	0.699	0.18	1	0.679	0.173
HSI	0.699	1	0.25	0.679	1	0.224
SP500	0.18	0.25	1	0.173	0.224	1

Table 7. Parameters estimation of the stocks data with mean estimator and

90 %

credible interval.

Table 7. Parameters estimation of the stocks data with mean estimator and

90 %

credible interval.

		SSEC-HSI	SSEC-SP500	HSI-SP500
Clayton	w	0.280 (0.144, 0.372)	0.685 (0.508, 0.814)	0
Clayton	$θ$	2.53 (1.65, 3.65)	0.168 (0.069, 0.247)
Gumbel	w	0	0	0.104 (0.015, 0.257)
Gumbel	$θ$			1.484 (1.130, 2.368)
Normal	w	0.668 (0.587, 0.785)	0.222 (0.062, 0.350)	0.528 (0.350, 0.672)
Normal	$θ$	0.722 (0.681, 0.764)	0.366 (0.190, 0.563)	0.400 (0.269, 0.561)
Frank	w	0	0	0.33 (0.201, 0.542)
Frank	$θ$			−0.534 (−1.557, 0.509)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, Y.; Xie, D.; Yu, S. Bayesian Mixture Copula Estimation and Selection with Applications. Analytics 2023, 2, 530-545. https://doi.org/10.3390/analytics2020029

AMA Style

Liu Y, Xie D, Yu S. Bayesian Mixture Copula Estimation and Selection with Applications. Analytics. 2023; 2(2):530-545. https://doi.org/10.3390/analytics2020029

Chicago/Turabian Style

Liu, Yujian, Dejun Xie, and Siyi Yu. 2023. "Bayesian Mixture Copula Estimation and Selection with Applications" Analytics 2, no. 2: 530-545. https://doi.org/10.3390/analytics2020029

APA Style

Liu, Y., Xie, D., & Yu, S. (2023). Bayesian Mixture Copula Estimation and Selection with Applications. Analytics, 2(2), 530-545. https://doi.org/10.3390/analytics2020029

Article Menu

Bayesian Mixture Copula Estimation and Selection with Applications

Abstract

1. Introduction

2. Parametric Copula Families

2.1. Elliptical Copulas

2.2. Archimedean Copulas

3. Estimation and Selection

3.1. Markov Chain Monte Carlo

3.2. EM Algorithm

4. Numerical Simulations

4.1. Markov Chain Monte Carlo

4.2. Expectaion Maximization

4.3. Higer Dimensional Cases

5. Real Data Analysis

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI