Abstract
In this paper, we propose a general class of bivariate proportional hazard distributions, which is based on the family of asymmetric proportional hazard distributions and the bivariate Pareto copula. Distributional properties of the bivariate proportional hazard distribution are derived. We specialize the bivariate proportional hazard family of distributions to the normal case, and so we introduce the bivariate proportional hazard normal distribution. Parameter estimation by the maximum likelihood method of the bivariate proportional hazard normal distribution is then discussed. Finally, an application of the new bivariate distribution to real data is considered for illustrative purposes.
MSC:
60E05
1. Introduction
The proportional hazard distribution was presented as an extension of the distribution of the minimum of a random sample replacing with in a random variable Z with probability density function (PDF) f, and cumulative distribution function (CDF) F. So, this distribution could be considered as a distribution of fractional order statistics ([1]). Here, we start defining the proportional hazard (PHF) distribution, which was studied by [2]; see also [3] where inference under progressively type II right-censored sampling for the PHF distribution is considered. Let F be a continuous CDF with PDF , and hazard function We say that Z has a PHF distribution associated with F and f, if its PDF is of the form
where is the shape parameter. We use the notation to refer to this distribution. The CDF of the PHF distribution is given by
The hazard function of this model is ; that is, the proportional hazard function of the PDF f. From Equation (2), we can use the inversion method for generating a random variable with PHF distribution; that is, if , i.e., a uniform distribution on the interval , then the random variable is distributed according to the PHF distribution with parameter vector , where denotes the inverse of . The distributional properties of the PHF distribution, as well as inferential procedures and information matrix for the model parameters are studied by [2].
The location-scale extension of model (1) is obtained using the linear transformation , where is a location parameter, is a scale parameter, and . The PDF of the random variable X is given by
If a random variable X follows the model (3), it is denoted by . If and , where and detone the PDF and CDF of a standard normal distribution, we write , and so we obtain the proportional hazard normal (PHN) distribution. Note that , and so the normal distribution is a special case of the PHN distribution. Hence, it is more flexible than the normal model in terms of skewness and kurtosis, due to the inclusion of the shape parameter . We have that the Kumaraswamy-G (KwG) distribution, proposed by [4], is defined by the PDF
where G is an arbitrary continuous CDF, and . Also, and are additional shape parameters to that of G. If Z is a random variable with PDF (4), we write . It is worth stressing that when , we obtain the distribution, i.e., , and so this kind of distribution arises as a possible solution when we have asymmetric data.
The construction of multivariate distributions from copula theory can be carried out using the Sklar’s theorem. The way to obtain a multivariate distribution from the Sklar’s theorem is as follows: Let be p random variables with continuous CDFs respectively. Then, according to Sklar’s theorem, has a unique copula representation:
Moreover, it is well known that many dependence properties of a multivariate distribution depend only on the corresponding copula. Therefore, many dependence properties of a multivariate distribution can be obtained by studying the corresponding copula. By using Clayton copula, Ref [5] proposed a bivariate extension of the power-normal distribution. Specifically, a bivariate random variable has a bivariate power-normal distribution, denoted by , if for and has the following joint CDF
where is the parameter that controls the dependence in the Clayton copula. The Clayton copula is usually attributed to [6], but it has also been studied by [7]. For , it is defined as
The first systematic study of this model was conducted by [8], who interpreted the parameter as a measure of the dependence between u and Therefore, the independence between the variables is obtained when tends to zero. Obviously there are other forms of constructing multivariate distributions, and so new multitariate distributions have been proposed in the statistical literature. To mention a few, but not limited to, we refer the reader to the works by [9,10,11,12], among others.
It is worth emphasizing that the univariate PHF family of distributions has received significant attention over the last years in the statistical literature, mainly due to its flexibility in considering different models in its construction given by f and F; see expression (1). On the other hand, multivariate extensions of this univariate family of distributions has been little explored. This paper fills this gap and provides a bivariate extension of this univariate distribution. The bivariate PHF distribution is quite simple and, hence, may be widely applied in analyzing bivariate real data in practice. Additionally, distributional properties of the bivariate PHF distribution are investigated in details. An important claim to introduce the bivariate PHF distribution relies on the fact that the practitioners will have new bivariate model to use in bivariate settings. Additionally, the formulae related with the new bivariate model are manageable and with the use of modern computer resources and its numerical capabilities, the bivariate PHF distribution may prove to be an useful addition to the arsenal of applied statisticians. We hope that the bivariate distribution introduced in this paper may serve as an alternative bivariate model to some well-known bivariate model available in the statistical literature. We also hope that the bivariate PHF distribution may work better (at least in terms of model fitting) than some bivariate distributions available in the literature in certain practical situations, although it cannot always be guaranteed. Here, the bivariate PHF family of distributions we introduce is obtained from a multivariate distribution which was constructed from Pareto copula (descried in the next section) coupled with PHF marginals. It is worth stressing that many types of copulas are available o construct multivariate distributions, namely: Clayton copula, Frank copula, Joe copula, and Gumbel copula, among others. Perhaps, the most common copula is the Clayton copula. In this paper, instead, we shall consider the Pareto copula to introduce the new bivariate model due to its simplicity and interesting properties (descried in the next section). In addition, the Pareto copula yields a simple form of the likelihood function, so it makes the estimation of the model parameters easy to deal with. In short, it is evident that few works about bivariate generalizations of the PHF distribution have been published in the statistical literature and, hence, a new, simple and tractable bivariate PHF extension of the univariate PHF distribution through Pareto copula is welcome.
The paper is organized as follows. Section 2 presents the Pareto copula and discusses with details the structural properties derived from it. The bivariate proportional hazard distribution is proposed in Section 3. Distributional properties of this bivariate family of distributions are also derived in this section. In Section 4, we study the bivariate PHN distribution, where the univariate normal distribution is taken into account. Location-scale extension, as well as parameter estimation for the bivariate PHN distribution are also discussed in this section. An empirical appliction of the bivariate PHN distribution that considers real data is provided in Section 5 for illustrative purposes. Finally, Section 6 concludes the paper.
2. Pareto Copula
The Pareto distribution, or Pareto type I distribution, has been extensively used in economic literature and in reliability theory. Its CDF is given by
where and . Another type of Pareto distribution is the Pareto type II distribution, whose CDF takes the form
Its survival function is given by
The bivariate Pareto distribution of the random vector has joint CDF in the form
where . Then, using the Sklar’s theorem for continuous marginal distributions, the bivariate Pareto copula is given by
Next, we shall consider some properties of the bivariate Pareto copula:
- 1.
- The joint PDF of the Pareto copula is
- 2.
- The joint survival function of the Pareto copula is
- 3.
- For all the bivariate Pareto copula is a concave function on for fixed . This result follows from
- 4.
- The bivariate dependency measures for continuous variables, usually used in copulas, are Kendall’s tau correlation coefficient (), Spearman’s rho (), and the medial correlation coefficient. The first two coefficients are invariant to increasing transformations. The Kendall’s tau measures the difference between the probability of two concordant random pairs and the probability of two discordant random pairs. For a continuous bivariate distribution function F, is defined bySpearman’s rho correlation coefficient measures the correlation between the two CDFs, and Then, given that and are continuous uniform random variables on with mean and variance and , respectively, it can be shown thatIf and denote the medians of and respectively, the medial correlation coefficient of and , say , is given by (see [13])For the bivariate Pareto copula, we have that
- 5.
- A non-negative function b is totally positive of order 2 () if for all and with , if is verified thatThis is a condition of positive dependence when it is performed on the PDF, and means that two pairs with components matching high-low and low-low are more likely than two pairs with high-low and low-high components. For all the bivariate Pareto copula is a non-negative function of order 2 (or ).
- 6.
- The bivariate tail dependence concept is related to the amount of dependency on the tail of the bivariate distribution, in the upper or lower quadrant. The symbol is used to determine a tail dependence parameter. If a bivariate copula C, with survival copula , is such thatexists, then C has an upper tail dependence if and has no upper tail dependence if It can be shown thatwhere v refers to the component Similarly, if a bivariate copula C is such thatexists, then C has lower tail dependence if and has no lower tail dependence if The reasoning behind these definitions is thatFor the bivariate Pareto copula, we have that and . Then, the Pareto copula has a lower tail dependence. That is, for a value u arbitrarily close to zero, there is a positive probability that one of the variables or takes values smaller than u, given that the other is smaller than u.
3. Bivariate Proportional Hazard Distribution
In what follows, the joint CDF of , , is constructed from the bivariate Pareto copula, with marginal distributions and , where and . Then, from the Sklar’s theorem, the joint CDF of is given by
We have the following definition.
Definition 1.
A bivariate random vector has a bivariate proportional hazard distribution, if its joint CDF is given by
where , , and .
Remark 1.
We denote the bivariate proportional hazard distribution in Definition 1 by .
Remark 2.
Let . Then, the joint PDF of has the form
We have the following propositions.
Proposition 1.
Let . We have that
- (i)
- for
- (ii)
- The CDF of , given , is
- (iii)
- The PDF of , given , is
- (iv)
- The joint survival function of is
Proposition 2.
Let . We have that is stochastically decreasing in , and vice versa, for any value of , , and α.
Proof.
Since Pareto copula is a concave function on for fixed , the result holds. □
The bivariate hazard rate of and was defined by [14] and it is given by
Proposition 3.
Let . We have that
- (i)
- For fixed , is an increasing function of
- (ii)
- For fixed , is an increasing function of
Proof.
This result can be verified as follows. Let and . It can be shown that
where and, hence, the result follows. □
Proposition 4.
Let . We have that the Kendall’s tau correlation coefficient, Spearman’s rho and the medial correlation coefficient, are given by the expressions provided in (5).
Proof.
Kendall’s tau and Spearman’s rho correlation coefficients are invariant to increasing transformations, and so the resut follows. Note that is enough to take the transformation and which leads to the PDF Pareto copula. Since the medial coefficient is defined directly from the Pareto copula, the result is obtained directly. □
Remark 3.
The upper and lower limits for are
Proof.
Since the Pareto copula has a lower tail dependence, the result holds. □
Proposition 5.
The bivariate distribution has a lower tail dependence.
4. Bivariate Proportional Hazard Normal Distribution
When , i.e., the standard normal CDF, we obtain the bivariate PHN distribution, denoted by . The joint PDF of is given by
where , , and . The joint CDF takes the form
The marginal distributions of are for . The PDF of , given , is given by
and the CDF of , given , is
The joint survival function of has the form
4.1. Location-Scale Extension
The family of BPHN distributions with location-scale parameters is defined as the joint distribution of and , where , and for The corresponding joint PDF is given by
where is the location parameter, is the scale parameter, and
We use the notation to denote this location-scale extension. Figure 1 and Figure 2 display contour plots of the BPHN distribution for some parameter values.
Figure 1.
Contour plots: (a) BPHN(0,1,1.75,0,1,2.25,2), and (b) BPHN(0,1,0.5,0,1,0.75,0.75).
Figure 2.
Contour plots: (a) BPHN(0,1,1.5,0,1,0.75,3.5), and (b) BPHN(0,1,0.75,0,1,2,0.5).
4.2. Parameter Estimation
In what follows, we consider the issue of estimating the parameters of the BPHN distribution. Given the dependency structure induced by Pareto copula, we use the maximum likelihood (ML) method, and the two-stage method (see [15]) to estimate the parameter vector . Initially, as in [5], we perform the following reparameterizations: , and , so that the interest is focused on the parameter vector . Thus, for a sample of size n from the distribution, say , the log-likelihood function is given by
where
The ML estimate of can be obtained from the direct maximization of the log-likelihood function with respect to the model parameters. Using the R programming language, specifically applying the optim function, the maximization of the log-likelihood function can be performed. Also, using the invariance property of the ML estimator, we can easily obtain the ML estimates and of and , respectively.
The score functions are
where . The ML estimate can also be obtained by solving simultaneously the nonlinear system of equations
Numerical approaches are required for solving the above nonlinear system of equations, which requires the specification of initial values. To obtain initial values for the model parameters to be used in the iterative process, we can use the two-stage estimation procedure proposed by [15], since this procedure was defined for multivariate copula-based models. The first stage considers ML estimation from univariate marginals, while the second stage considers ML estimation of the dependence parameter with the other parameters held fixed from the first stage:
- First stage. Considering that for . We have that the ML estimaties of , and are obtained from the solutions of the nonlinear equationswhereThe solution of the previous system of equations for each j allows us to obtain the ML estimates and of , and , respectively, for .
- Second stage. In this stage, we estimate the parameter by replacing the unknown parameters with the ML estimates from the previous stage and then maximize the resulting log-likelihood function; that is, we need to maximize the functionwhereso that we have the score functionThe solution of the resulting equation lead us to the estimate of .
We can also consider another alternative to obtain initial values for the model parameters by using the method proposed by [16], where we initially normalize each variable to obtain a model that depends only on , and ; that is, we have the log-likelihood function
After doing
we obtain
where
Thus, the new profiled log-likelihood function is reached. After obtaining the estimates and of and , respectively, we then obtain an estimate for as . We can take as initial values for and the sample means, and for and the sample standard deviations, or we can replace , and in the original log-likelihood to obtain the initial values for , , and .
When n is large, we have that , where “” means approximately distributed, is the expected Fisher information matrix for , and is the Hessian matrix, whose elements are provided in the Appendix A. Using the transformation method, we have that , where with denoting its inverse, and
The elements of are obtained in terms of numerical integration. On the other hand, the observed Fisher information matrix given by can also be used for computing asymptotic standard errors for the ML estimates of the model parameters (see Appendix A). After computing , we obtain the observed Fisher information matrix under -parametrization as , where . It is worth emphisizing that the observed information matrix can be computed numerically from standard maximization routines, which now provide the observed information matrix as part of their output. For example, one can use the R functions optim or nlm to compute the observed Fisher information matrix numerically.
4.3. Monte Carlo Simulation
In the following, we consider Monte Carlo simulation experiments to evaluate the performance of the ML estimation procedure discussed in the previous section to estimate the BPHN distribution parameters. It is worth emphasizing that is rather easy to generate random variates from the BPHN distribution. We consider the following steps to generate from distribution:
- Step 1.
- Generate two independent random variates and , where means a uniform distribution on .
- Step 2.
- Computewhere is the standard normal quantile function.
- Step 3.
- Define and . Let . For each value of , computeThen, .
The Monte Carlo experiments were performed using the R programming language, and we consider 10,000 Monte Carlo replications. The performance of the ML procedure to estimate the BPHN distribution parameters was evaluated on the basis of the following quantities for each sample size: the empirical mean, and the empirical standard deviation, which are computed from 10,000 replications. The sample sizes we consider are and 150. Without loss of generality, we consider , , , , , and , , , , , and . The emprical mean is listed in Table 1, while the empirical standard deviation is listed in Table 2. In general, from Table 1, note that the ML estimates of , , , , and are close to their respective true values, while the parameter that controls the dependence in the Pareto copula, , is overestimated, mainly as this parameter increases. As expected, the standard deviation of the ML estimates of all parameters decreases as the sample size increases (see Table 2). In short, the Monte Carlo simulation experiments reveal that the ML method can be used effectively to estimate the BPHN parameters.
Table 1.
Parameter estimates; empirical mean.
Table 2.
Parameter estimates; empirical standard deviation.
5. Real Data Application
Next, we consider an application of the BPHN distribution to real data for illustrative purposes. The data set contains two different measures of stiffness of each of the 30 boards. The considered stiffness measures are the Shock () and Vibration () of each of the 30 boards. The data were reported by [17]. The first measurement involves sending a shock wave down the board, and the second measurement is determined while vibrating the board. All numerical computations provided in this were done by using the R program.
We assume that for . To estimate the BPHN model parameters, we initially estimate the marginal PDFs corresponding to each variable. We thus obtain the following estimates: , , , , , and . Figure 3a,b display the QQ-plots for the estimated marginal distributions. Note the goodness-of-fit of each estimated marginal distribution to the real data, which means that the BPHN distribution may be a good choice for modeling these data. Now, using the above estimates in the copula function, we obtain an estimate for the parameter , that is, . Finally, taking the above estimates as initial values for the iterative process, we obtain the following ML estimates (standard errors in parentheses): , , , , , , and . Visual inspection of the contour plot in Figure 3c confirms a satisfactory fit of the BPHN distribution to the data.
Figure 3.
QQ-plots and contour plot: (a) Shock, (b) Vibration, and (c) contour plot of the BPHN distribution.
Next, for the sake of comparison, we also consider the following bivariate distributions to fit the current real data: the bivariate skew-normal conditional (BVSNC) distribution introduced by [9], denoted by BVSNC, and the bivariate Birnbaum-Saunders (BVBS) distribution studied by [10], denoted by BVBS. The ML estimates (standard errors in parentheses) of the BVSNC parameters are and , whereas the ML estimates (standard errors in parentheses) of the BVBS parameters are and . To compare the bivariate distributions, we make use of the Akaike information criterion (AIC), and Bayesian information criterion (BIC). The smaller the values of AIC and BIC, the better the fitted distribution to the real data. Table 3 lists the AIC and BIC values for all bivariate distributions, which leads to the conclusion that the BPHN distribution is better than the other bivariate distributions to model the current bivariate real data.
Table 3.
AIC and BIC values.
6. Concluding Remarks
The univariate proportional hazard distribution has found several applications in the literature and has many attractive properties. However, the extension of the univariate proportional hazard distribution in a multivariate setting has been so neglected in the statistical literature. In this paper, based on the Pareto copula, we have introduced a simple and tractable bivariate proportional hazard family of distributions that may be very useful to deal with bivariate data in practice. We also derive several distribution properties for this bivariate family of distributions. In addition, we particularize this general bivariate distribution to the case where the univariate normal distribution is considered, so that the bivariate proportional hazard normal distribution is obtained. The estimation of the bivariate proportional hazard normal model parameters is approached by the maximum likelihood method, and the observed and expected Fisher information matrixes are derived. Finally, an application to real data set is presented to illustrate the bivariate proportional hazard normal distribution in practice.
Author Contributions
Conceptualization, G.M.-F., A.J.L. and C.B.-C.; data curation, G.M.-F., A.J.L. and C.B.-C.; formal analysis, G.M.-F., A.J.L. and C.B.-C.; funding acquisition, C.B.-C.; investigation, G.M.-F. and A.J.L. and C.B.-C.; methodology, G.M.-F., A.J.L. and C.B.-C.; project administration, G.M.-F., A.J.L. and C.B.-C.; resources, G.M.-F., A.J.L. and C.B.-C.; supervision, G.M.-F., A.J.L. and C.B.-C.; visualization, C.B.-C.; writing—original draft, G.M.-F. and A.J.L. and C.B.-C.; writing—review and editing, A.J.L. and C.B.-C. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the Instituto Tecnológico Metropolitano (ITM).
Data Availability Statement
Not applicable.
Acknowledgments
G.M.-F. acknowledges the support given by Universidad de Córdoba, Montería, Colombia. A.J.L. acknowledges the financial support of the Brazilian agency Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq grant 304776/2019–0). C.B.-C. extends their sincere gratitude to the Instituto Tecnológico Metropolitano (ITM). The authors would like to thank the anonymous reviewers for their insightful comments and suggestions.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Observed Fisher Information Matrix
In the following, we calculate the elements of the Hessian matrix and their respective expected values. For and , let
We have the following derivatives ():
Let
and , where is the quantile function of the standard normal distribution. We have the expected values:
References
- Durrans, S.R. Distributions of fractional order statistics in hydrology. Water Resour. Res. 1992, 28, 1649–1655. [Google Scholar] [CrossRef]
- Martínez-Flórez, G.; Moreno-Arenas, G.; Vergara-Cardoso, S. Properties and Inference for Proportional Hazard Models. Colomb. J. Stat. 2013, 36, 95–114. [Google Scholar]
- Wang, B.; Yu, K.; Jones, M. Inference under progressively type II right-censored sampling for certain lifetime distributions. Technometrics 2010, 52, 453–460. [Google Scholar] [CrossRef]
- Cordeiro, G.M.; de Castro, M. A new family of generalized distributions. J. Stat. Comput. Simul. 2011, 81, 883–898. [Google Scholar] [CrossRef]
- Kundu, D.; Gupta, R. Power normal distribution. Statistics 2013, 47, 110–125. [Google Scholar] [CrossRef]
- Clayton, D.G. A model for association in bivariate life tables and its application in epidemiological studies of familial tendency in chronic disease incidence. Biometrika 1978, 65, 141–152. [Google Scholar] [CrossRef]
- Kimeldorf, G.; Sampson, A.R. Uniform representations of bivariate distributions. Commun. Stat. Theory Methods 1975, 4, 617–627. [Google Scholar] [CrossRef]
- Cook, R.D.; Johnson, M.E. A family of distributions for modelling non-elliptically symmetric multivariate data. J. R. Stat. Soc. B 1981, 43, 210–218. [Google Scholar] [CrossRef]
- Arnold, B.C.; Castilho, E.; Sarabia, J.M. Conditionally specified multivariate skewed distributions. Sankhya A 2002, 64, 206–226. [Google Scholar]
- Lemonte, A.J.; Martínez-Flórez, G.; Moreno-Arenas, G. Multivariate Birnbaum–Saunders distribution: Properties and associated inference. J. Stat. Comput. Simul. 2015, 85, 374–392. [Google Scholar] [CrossRef]
- Martínez-Flórez, G.; Lemonte, A.J.; Salinas, H.S. Multivariate skew-power-normal distributions: Properties and associated inference. Symmetry 2019, 11, 1509. [Google Scholar] [CrossRef]
- Zhang, L.; Xu, A.; An, L.; Li, M. Bayesian inference of system reliability for multicomponent stress-strength model under Marshall-Olkin Weibull distribution. Systems 2022, 10, 196. [Google Scholar] [CrossRef]
- Nelsen, R.B. An Introduction to Copulas; Springer: New York, NY, USA, 2006. [Google Scholar]
- Marshall, A.W. Some comments on the hazard gradient. Stoch. Process. Their Appl. 1975, 3, 293–300. [Google Scholar] [CrossRef]
- Joe, H. Asymptotic efficiency of two-stage estimation method for copula-based models. J. Multivar. Anal. 2005, 94, 401–419. [Google Scholar] [CrossRef]
- Gupta, D.; Gupta, R.C. Analyzing skewed data by power normal model. Test 2008, 17, 197–210. [Google Scholar] [CrossRef]
- Johnson, R.A.; Wichern, D.W. Applied Multivariate Statistical Analysis; Pearson Prentice Hall: Upper Saddle River, NJ, USA, 2007. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).