Abstract
A copula is a useful tool for constructing bivariate and/or multivariate distributions. In this article, we consider a new modified class of FGM (Farlie–Gumbel–Morgenstern) bivariate copula for constructing several different bivariate Kumaraswamy type copulas and discuss their structural properties, including dependence structures. It is established that construction of bivariate distributions by this method allows for greater flexibility in the values of Spearman’s correlation coefficient, and Kendall’s .
1. Introduction
Over the last decade or so, there has been a growing interest in constructing various bivariate distributions and study their dependence structure. For an excellent survey on this, an interested reader is suggested to see Balakrishnan and Lai (2009) and the references therein. Of late, copula based methods of construction have also gained a considerable amount of attention, mainly due to their analytical tractability in the sense of discussing dependence structure between two dependent random variables. A copula is a multivariate distribution function whose marginals are uniform on (see Sklar (1959), Nelsen (2006) for further details). It couples or links the marginal distributions to their joint distribution. In order to obtain a bivariate/multivariate distribution function, one needs to simply combine two (in the bivariate case) and/or several marginal distribution functions with any copula function. Consequently, for the purpose of statistical modeling, it is desirable to have a plethora of copulas at one’s disposal. One of the most important parametric family of copulas is the Farlie–Gumbel–Morgenstern (FGM, henceforth) family defined as , where This family of copulas have the following properties. Such family is derived from so called Farlie–Gumbel–Morgenstern distributions considered by Morgenstern (1956) and Gumbel (1960) and further developed by Farlie (1960).
- Symmetry: , , and have the lower and upper tail dependence coefficients equal to zero.
- It is positive quadrant dependent (PQD) for and negative quadrant dependent (NQD) for
However, the major drawback of FGM copula is that the range of values of Spearman’s correlation coefficient () and Kendal’s () is and , respectively. To overcome this limited nature of dependence, several authors proposed extensions of this family (for example, Bairamov and Kotz (2000), Rodriguez-Lallena and Ubeda-Flores (2004)). It is to be noted here that a good number of literary works are available for the FGM family and the associated dependence parameter. Huang and Kotz (1999) studied a polynomial type parameter extensions of the FGM bivariate distribution and have shown that the positive correlation between the marginal distributions can be increased up to , while the maximal negative correlation remains at . Lai and Xie (2000) used uniform representation of the FGM bivariate distributions having positive quadrant dependence (henceforth, PQD) with the association parameter between 0 and 1. Bairamov and Kotz (2000) showed that, for such a bivariate family, the related association parameter has a much wider range. In another article, Bairamov et al. (2001) developed a new generalization of the bivariate FGM distribution by introducing additional parameters. In their representation, with some specific choice of the functions , and (see Equation (1) of Bairamov et al. (2001), they have shown that the admissible range for the association parameter is between , while the Pearson correlation coefficient between X and Y will never exceed
This fuels working in this direction in the sense of considering a modified FGM class and using it as a pivot for constructing bivariate Kumaraswamy models.
The Kumaraswamy distribution (Kumaraswamy 1980) is a two parameter absolutely continuous distribution useful for double bounded random processes with hydrological applications. The Kumaraswamy distribution (hereafter the KW distribution) on the interval has its probability density function (pdf) and its cumulative distribution function (cdf) with two shape parameters and defined by
If a random variable X has Equation (1) as its density, then we will write (for details, see Jones (2009)). The density function in Equation (1) has similar properties to those of the beta distribution. The KW pdf is unimodal, uniantimodal, increasing, decreasing or constant depending (similar to the beta distribution) on the values of the parameters. However, the construction of bivariate KW distributions has received limited attention. Barreto-Souza and Lemonte (2013) introduced a bivariate KW distribution related to a Marshall–Olkin survival copula and discussed some structural properties of their bivariate KW distributions. Arnold and Ghosh (2017) discussed some different strategies for constructing legitimate bivariate KW models via Arnold–Ng type copula approach. Recently, Ghosh and Ray (2016) discussed some copula based approach to construct several bivariate KW type models along with an application to a real life data set focusing on financial risk assessment. This article is a follow up paper to Ghosh and Ray (2016), in which we examine in detail the utility of a well-known bivariate FGM copula by a slight modification to allow greater flexibility in modeling various types of data sets. In this article, we start with a standard KW quantile function from two independent KW distributions (with two different sets of shape parameters) and construct the corresponding bivariate copula with different shape parameters. The rest of the article is organized as follows: in Section 2, we define the modified FGM copula and discuss some structural properties. In Section 3, we consider four special classes of modified bivariate KW FGM type copulas for constructing bivariate KW distributions. In Section 4, we establish some dependence structures for those developed bivariate KW FGM type copulas. In Section 5, an outline of simulation from the proposed copula model is provided. In Section 6, some applications of the four bivariate KW-FGM type copula models on two real-life data insurance data sets are considered for illustrative purposes. In Section 7, some concluding remarks are presented.
2. Modified Bivariate FGM Copula
We consider the following modified version of the bivariate FGM copula defined as
where and and
For a detailed study on this family of bivariate copula, see Rodriguez-Lallena and Ubeda-Flores (2004), where and are two absolutely continuous functions on with the following conditions.
- . This is known as a boundary condition.
- wherewhereAgain, andwhere
Theorem 1.
Proof.
The proof immediately follows, since it matches with the form of bivariate copula (Equation (3), p. 316) in Rodriguez-Lallena and Ubeda-Flores (2004). ☐
First, we make a note of the following:
- The associated bivariate copula density from Equation (2) will be
Similarly, one can find the conditional copula density of V given
It is noteworthy to mention that copulas are instrumental for understanding the dependence between random variables. With them, we can separate the underlying dependence from the marginal distributions. It is well known that a copula that characterizes dependence is invariant under strictly monotone transformations. Subsequently, a better global measure of dependence would also be invariant under such transformations. Among other dependence measures, Kendall’s and Spearman’s are invariant under strictly monotone transformations of the random variables, and, as we will see in the next section, they can be expressed in terms of the associated copula.
- Kendall’s : This measures the amount of concordance present in a bivariate distribution. Suppose that and are two independent pairs of random variables from a joint distribution function. We say that these pairs are concordant if “large values of one tend to be associated” with “large values of the other”, and “small values of one” tend to be associated with “small values of the other”. The pairs are called discordant if large goes with small or vice versa. Algebraically we have concordant pairs if and discordant pairs if we reverse the inequality. Let X and Y be continuous random variables with copula Then, Kendall’s is given by
- Spearman’s : For two random variables, X and Y are equal to the linear correlation coefficient between and where and are the marginal distributions of X and Y, respectively. Then, Spearman’s is given bywhere is the linear correlation coefficient.Alternatively, can be written as . Also, as mentioned earlier, one can equivalently show that For details on such copula based measures of dependence, see Nelsen (2006).
Proposition 1.
Let be a random pair with copula given by Equation (2). Then, the expressions for Kendall’s tau and Spearman’s rho are
- where
- respectively.
Proof.
The proofs are almost similar in approach for the two coefficients. First, consider for the Spearman’s . For our copula model in Equation (2), the corresponding will be
Next, consider the integral in parenthesis, which, after some simplification, reduces to
Substituting Equation (8) in Equation (7), we get
after simple algebraic operation—hence the result.
☐
3. Bivariate KW-FGM Type Models
In this section, we discuss in detail two different types of bivariate FGM type copula models to construct bivariate KW-type distribution.
Bivariate KW-FGM (Type I) Model:
Here, we consider the following functional form for both and :
- for
- for
Note that this particular functional form does satisfy all the conditions stated earlier for and In that case, the corresponding bivariate copula (obtained from Equation (2)) will be given by
Next, suppose and they are independent. Then, using Equation (10), a bivariate dependent FGM-Kumaraswamy (Type I) distribution will be of the following form (replacing u and v by the quantiles of and , respectively):
for and
Bivariate KW-FGM (Type II) Model:
Here, we consider the following functional form for both and :
- for
- for
Note that this particular functional form does satisfy all the conditions stated earlier for and In that case, the corresponding bivariate copula (henceforth, BK-FGM(Type II) copula) will be given by
In this case, like the previous one, a bivariate dependent KW-FGM (Type II) distribution, arising from two independent KW variables, will be of the following form:
Bivariate KW-FGM (Type III) Model:
Here, we consider the following functional form for both and :
Note that this particular functional form does satisfy all the conditions stated earlier for and In that case, the corresponding BK-FGM (Type III) copula will be given by
In this case, one can also obtain a closed form expression for the associated distribution function.
Bivariate KW-FGM (Type-IV) Copula:
For the standard KW distribution with parameters , we have the pdf, cdf and the inverse cdf are given, respectively, by
Hence, the associated copula for suitable parameters a and b, and having two given marginal distributions that are the standard KW distributions, has the following form:
For details on this, see Ghosh and Ray (2016).
4. Some Properties of the Bivariate KW-FGM Type Copulas
Next, we have the following:
- For the BK-FGM (Type I) bivariate copula
- Closed form expression for Kendall’s is not available.
- Spearman’s correlation coefficient will beprovided
- For the BK-FGM (Type II) bivariate copula
- Kendall’s will beprovided .
- Corresponding Spearman’s correlation coefficient will beprovided .
- For the BK-FGM (Type III) copula, no closed form expressions for Kendall’s and Spearman’s are available. They need to be evaluated numerically.
- For the BK-FGM (Type III) copula
- Kendall’s will be(by straightforward integration).
- Spearman’s correlation coefficient will be
Dependence Properties
In this section, we focus on the following properties.
Tail Dependence Property:
Let X and Y be two continuous random variables with and The upper tail dependence coefficient (parameter) is the limit (if it exists) of the conditional probability that Y is greater than th percentile of G given that X is greater than the th percentile of F as approaches 1:
If , then X and Y are upper tail dependent and asymptotically independent otherwise. Similarly, the lower tail dependence coefficient is defined as
Let C be the copula of X and Y. Then, equivalently, we can write and where is the corresponding joint survival copula given by
Next, we consider the following.
- In our case (for the bivariate KW-FGM (type I) copula model),Thus, X and Y are asymptotically independent. The corresponding joint survival copula will be given byAgain,Thus, are asymptotically dependent.
- For the bivariate KW-FGM (type II) copula model,provided Hence, it is asymptotically independent provided Again,provided this again implying that are asymptotically dependent.
Similarly, one can establish these properties for the bivariate KW-FGM (type III) and (type IV) copula models.
Positive Quadrant Dependent (PQD) and Left-Tail Decreasing (LTD) Property:
According to Amblard and Girard (2002), (Theorem 3), for and a random pair with copula as defined in equation (2), we have the following result:
- X and Y are PQD if and only if either ∀ and ∀, or
- X and Y are LTD if and only if and is monotone. Next, consider the following:Proposition 2.The BK-FGM (Type I, Type II and Type III) copulas are PQDProof.For the modified BK-FGM (Type I) copula, we have and . Note that, for any real for all as well as for all Hence, are PQD. ☐Similarly, one can easily check the PQD property for the other two copula models.Proposition 3.The BK-FGM (Type I and Type III) copula exhibits LTD properties, while, for the BK-FGM (Type II), it is indeterministic.Proof.For the modified BK-FGM (Type I) copula, consider the ratio . It is monotonically decreasing provided, and for any and it is also true for any Similar results hold for the other ratio for any Hence, it is LTD for only and for any but not for any other possible choices of the constants and . ☐Again, for the modified BK-FGM (Type III)copula, the ratio It is monotonically decreasing for any Similar results will hold for the other ratio for any Hence, it is LTD.However, for the modified BK-FGM (Type II) copula, these ratios are not uniformly increasing and/or decreasing. This is why it is indeterministic in this sense.
5. Simulation from a Bivariate Copula
There are several different methods (for example, acceptance–rejection sampling for bivariate cases, via transformation to a known bivariate distribution, etc.) that are available to simulate/generate bivariate random samples from a bivariate copula. We can, in principle, use the following result Joe (1997), to simulate random samples from our modified BK-FGM type copula as follows. Let us define the conditional copula distribution function (say, of V given ), Next, if U and W are independent random variables, then will have the distribution This method, sometimes known as conditional distribution approach or iterative conditioning, is appealing because it involves only univariate simulation. In our case, we do have closed form expressions of for both types of modified BK-FGM bivariate copula available. For example, for the modified FGM BK (type I) copula, one can write (from Equation (10))
Consequently, we can easily apply this method. Needless to say, there are other distinct sampling procedures that are also available (for example, importance sampling, adaptive acceptance–rejection sampling, etc.), which is suitable for other classes of copulas.
6. Applications
6.1. Application in Risk Management
In practice, several risk managers employ VaR (Value at Risk) as a tool of risk measurement. Briefly speaking, VaR is the maximal potential loss of a position or a portfolio on some investment horizon at a given confidence level. Because of the enormous literature, we only provide its definition. Let be the market values of an asset or a portfolio of assets over n periods, and be the negative log return (loss) over the t-th period. Next, given a positive value close to 0, the VaR of X at confidence level is given by
For a detailed study on the computation of VaR used in the pure copula method, an interested reader is suggested to see Ouyang et al. (2009). Here, we will propose one idea based on bivariate KW-FGM copula (Type II). We list the steps as follows:
- Simulate U, V and W independently from standard uniform distribution,
- If , for the given bivariate KW-FGM (Type II) copula (say, ), take (
- If , for the given bivariate KW-FGM (Type II) copula (say, , ), take
Then, the random vector has the joint distribution
where and its marginal distributions are and and linear correlation is . After this, we consider the following formula to generate the random number of the negative log returns of portfolios. Here, and are the weights and must satisfy . Then, will be computed by calculating the -th quantile of R.
For illustrative purposes, we consider the portfolio composed of Nasdaq and S&P 500 stock indices. The database contains 2972 daily closing prices from 2 January 1992 to 1 October 2003. We denote the log-returns of Nasdaq as variable 1 (X, say) and the log-returns of S&P 500 as variable 2 (Y). For details on this data set, see Palaro and Hotta (2006).
From Table 1, it is evident that the annualized means of both series are positive. Both return series distributions are nearly symmetric and have large kurtosis, with the Nasdaq presenting the larger one. We do not present the autocorrelation functions of the series, but, for the Nasdaq returns, only the autocorrelations of lag 12 and 13 are significant at the 5% level (t statistic equals to 3.68 and 4.48, respectively). There is no significant correlation for the S&P 500 returns at the 5% level. In order to specify the bivariate model for these two returns, and to estimate the associated Var under several bivariate copula models, we will consider some specific Autoregressive integrated moving average-Generalized Autoregressive Conditional Heteroskedastic (or in short, ARMA-GARCH) models, the reason being that return series are usually successfully modeled by ARMA-GARCH models by many authors. As suggested in Palaro and Hotta (2006), we will mainly consider three different ARMA-GARCH models: GARCH-N, GARCH-t, and GARCH-E. In terms of modeling the dependence between the two series, we consider three copula functions that are quite popular among other authors: FGM, Gumbel–Hougaard, Bivariate Gaussian copula along with our bivariate KW-FGM type copulas. In order to asses the accuracy of the VaR estimates at 95%, and 99% confidence level, we followed the procedure as discussed in Palaro and Hotta (2006). In the table below, we present the proportion of observations (in brackets), for to 2971, where the portfolio loss exceeded the estimated VaR for .
Table 1.
Descriptive statistics of daily log-returns of Nasdaq and S&P 500 stock indices.
From Table 2, it appears that the Bivariate KW-FGM (Type III) copula model provided a better result in estimating VaR. This is quite expected, since, for the data, the estimated coefficients and for the Bivariate KW-FGM (Type III) copula appear to be very close to 1, which then behaves more like a symmetric copula. In addition, for this data, both of the return series are nearly symmetric.
Table 2.
Proportion of observations (number of observations in brackets), for t = 751 to 2971, where the portfolio loss exceeded the estimated Value at Risk for .
6.2. An Application to Insurance Data
Here, we consider one application for the four proposed bivariate KW copula models to a heavily used data set, originally considered by Genest et al. (2009), as well as in Ghosh and Ray (2016). This data set contains two variables:
- : an indemnity payment,
- : an allocated loss adjustment expense (comprising lawyers’ fees and claim investigation process).
This data set is comprised of 1500 general liability claims. Several other authors, among others, have used (for e.g., Chen and Fan (2005)) this data set to demonstrate copula-model selection and fitting in an insurance context. We conjecture that this data might well be explained by one or more bivariate Kumaraswamy copula models derived in this paper. For the sake of simplicity, we apply all four bivariate Kumaraswamy copula models to 1466 uncensored claims. As suggested by Genest et al. (2009), based on a comparative study on the numerical estimates of the dependence parameter (), this imposed restriction has a very little or no effect on it. For the uncensored sample, the observed value of Kendall’s tau is . In the table below, we provide results of the goodness-of-fit tests based on the statistics and with . For a detailed description on each of these goodness-of-fit statistics, see Genest et al. (2009).
Here, the dependence parameter is estimated in each case through inversion of Kendall’s The critical values and p-values reported in Table 1 are based on repetitions of the parametric bootstrap procedure discussed in Genest et al. (2009). From Table 3, it appears that bivariate Kumaraswamy (Type III and Type IV) copula provide a better fit as compared to other BK copula models.
Table 3.
Goodness of fit statistics for the insurance data.
7. Conclusions
In this paper, we consider a modified version of the FGM family of copulas and study some important structural properties including the dependence structure. With this modified version, we consider the construction of bivariate KW distributions and discuss some of their structural properties. It is evident from Equation (2), that, depending on suitable choices of and functions, satisfying associated boundary conditions as mentioned earlier, one can generate a plethora of such copula models and subsequently develop a wide spectrum of bivariate KW distributions. Our future work would focus on the following:
- Extension to the multivariate case and study several associated properties. It is noteworthy to mention that, albeit complex nature of these type of models (involving several parameters), we expect that multivariate KW distribution construction via such type of copula models will be much more interesting and computationally will be more easy to handle.
- For modeling large losses, asymmetric copulas are more useful as compared to symmetric copulas. Thus, we will consider a family of asymmetric copulas as introduced in Nelsen (2006), Chapter 4, which has the following form:Here, a and b are functions defined on the interval . The associated several types of dependence measures will also be considered. In addition, based on this, bivariate and subsequently multivariate KW distributions construction will be considered and then a comparison study will be made with those bivariate and multivariate KW models constructed under a symmetric class of copulas.
- Since a convex combination of any two (or more) valid copulas is also a copula, we would be interested in studying the role of such a mixture of copula in developing bivariate, and sub- sequently multivariate, Kumaraswamy type distributions. For example, one may start with the following:for
- A natural multivariate extension of the above asymmetric copula would bewith A natural question would be what judicious choices of the functions for would result in a tractable model. Associated model inference will be a challenging task due to the involvement of so many parameters. We plan to report all of these findings in a separate article somewhere else.
Acknowledgments
The author would like to thank two anonymous referees for their insightful comments and suggestions, which have greatly helped to improve on an earlier version of this manuscript.
Conflicts of Interest
The author declares no conflict of interest.
References
- Amblard, Cécile, and Stéphane Girard. 2002. Symmetry and dependence properties within a semiparametric family of bivariate copulas. Journal of Nonparametric Statistics 14: 715–27. [Google Scholar] [CrossRef]
- Arnold, Barry C., and Indranil Ghosh. 2017. Bivariate Kumaraswamy models involving use of Arnold–Ng copulas. Journal of Applied Statistical Science 22: 227–41. [Google Scholar]
- Bairamov, Ismihan G., and Samuel Kotz. 2000. On a New Family of Positive Quadrant Dependent Bivariate Distribution. Technical Report. Washington: The George Washington University. [Google Scholar]
- Bairamov, Ismihan G., Samuel Kotz, and Muhammet Bekci. 2001. New generalized Farlie–Gumbel–Morgenstern distributions and concomitants of order statistics. Journal of Applied Statistics 28: 521–36. [Google Scholar] [CrossRef]
- Balakrishnan, Narayanaswamy, and Chin-Diew Lai. 2009. Continuous Bivariate Distributions, 2nd ed. New York: Springer. [Google Scholar]
- Barreto-Souza, Wagner, and Artur J. Lemonte. 2013. Bivariate Kumaraswamy distribution: Properties and a new method to generate bivariate classes. Statistics 47: 1–22. [Google Scholar] [CrossRef]
- Chen, Xiaohong, and Yanqin Fan. 2005. Pseudo-likelihood ratio tests for model selection in semiparametric multivariate copula models. The Canadian Journal of Statistics 33: 389–414. [Google Scholar] [CrossRef]
- Farlie, Dennis J. G. 1960. The performance of some correlation coefficients for a general bivariate distribution. Biometrika 47: 307–23. [Google Scholar] [CrossRef]
- Ghosh, Indranil, and Samik Ray. 2016. Some alternative bivariate Kumaraswamy type distributions via copula with application in risk management. Journal of Statistical Theory and Practice 10: 693–706. [Google Scholar] [CrossRef]
- Genest, Christian, Michael Gendron, and Michael Bourdeau-Brien. 2009. The Advent of Copulas in Finance. The European Journal of Finance 15: 609–18. [Google Scholar] [CrossRef]
- Gumbel, Emil J. 1960. Bivariate exponential distributions. Journal of American Statistical Association 55: 698–707. [Google Scholar] [CrossRef]
- Huang, Jian Shan, and Samuel Kotz. 1999. Modifications of the Farlie–Gumbel–Morgenstern distributions: A tough hill to climb. Metrika 49: 307–23. [Google Scholar] [CrossRef]
- Joe, Harry. 1997. Multivariate Models and Multivariate Dependence Concepts. New York: Chapman & Hall/ CRC Monographs on Statistics & Applied Probability. [Google Scholar]
- Jones, Chris. 2009. Kumaraswamy’s distribution: A beta-type distribution with some tractability advantages. Statistical Methodology 6: 70–81. [Google Scholar] [CrossRef]
- Kumaraswamy, Poondi. 1980. Generalized probability density-function for double-bounded random-processes. Journal of Hydrology 462: 79–88. [Google Scholar] [CrossRef]
- Lai, Chin-Diew, and Min Xie. 2000. Stochastic Ageing and Dependence for Reliability. New York: Springer. [Google Scholar]
- Morgenstern, David. 1956. Einfache Beispiele zweidimensionaler Verteilungen. Mitteinlings fu Mathematische Statistik 8: 234–35. [Google Scholar]
- Nelsen, Roger. 2006. An Introduction to Copulas. New York: Springer. [Google Scholar]
- Ouyang, Zi-Sheng, Hui Liao, and Xiang-qun Yang. 2009. Modeling dependence based on mixture copulas and its application in risk management. Applied Mathematics—A Journal of Chinese Universities 24: 393–401. [Google Scholar] [CrossRef]
- Palaro, Helder P., and Luiz Koodi Hotta. 2006. Using Conditional Copula to Estimate Value at Risk. Journal of Data Science 4: 93–115. [Google Scholar] [CrossRef]
- Rodriguez-Lallena, Jose Antonio, and Manuel Ubeda-Flores. 2004. A new class of bivariate copulas. Statistics and Probability Letters 66: 315–25. [Google Scholar] [CrossRef]
- Sklar, Abe. 1959. Fonctions de Repartition ’a n Dimensions et Leurs Marges. Publications de I’Institut de Statistique de I’Universite de Paris 8: 229–31. [Google Scholar]
© 2017 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).