Article

A Generator of Bivariate Distributions: Properties, Estimation, and Applications

1 Department of Statistics and Operations Research, University of Murcia, CEIR Campus Mare Nostrum, IMIB-Arrixaca, 30100 Murcia, Spain
2 Department of Mathematics and Statistics, Indian Institute of Technology, Kanpur 208016, India
* Author to whom correspondence should be addressed.
Mathematics 2020, 8(10), 1776; https://doi.org/10.3390/math8101776
Submission received: 3 September 2020 / Revised: 6 October 2020 / Accepted: 8 October 2020 / Published: 14 October 2020
(This article belongs to the Section Probability and Statistics)

Abstract:
In 2020, El-Morshedy et al. introduced a bivariate extension of the Burr type X generator (BBX-G) of distributions, and Muhammed presented a bivariate generalized inverted Kumaraswamy (BGIK) distribution. In this paper, we propose a more flexible generator of bivariate distributions based on the maximization process from an arbitrary three-dimensional baseline distribution vector, which is of interest for maintenance and stress models, and expands the BBX-G and BGIK distributions, among others. This proposed generator allows one to generate new bivariate distributions by combining non-identically distributed baseline components. The bivariate distributions belonging to the proposed family have a singular part due to the latent component, which makes them suitable for modeling two-dimensional data sets with ties. Several distributional and stochastic properties are studied for such bivariate models, as well as for their marginals, conditional distributions, and order statistics. Furthermore, we analyze its copula representation and some related association measures. An EM algorithm is proposed to compute the maximum likelihood estimations of the unknown parameters, which is illustrated by using two particular distributions of this bivariate family for modeling two real data sets.

1. Introduction

Gumbel [1], Freund [2], and Marshall and Olkin [3] in their pioneering papers developed bivariate exponential distributions. Since then, an extensive amount of work has been done on these models and their different generalizations, which have played a crucial role in the construction of multivariate distributions and modeling in a wide variety of applications, such as physics, economics, biology, health, engineering, and computer science. Several continuous bivariate distributions can be found in Balakrishnan and Lai [4], and some generalizations and multivariate extensions have been studied by Franco and Vivo [5], Kundu and Gupta [6], Franco et al. [7], Gupta et al. [8], Kundu et al. [9], among others, and recently by Muhammed [10], Franco et al. [11], and El-Morshedy et al. [12]; see also the references cited therein.
Kundu and Gupta [13] introduced a bivariate generalized exponential (BGE) distribution by using the trivariate reduction technique with generalized exponential (GE) random variables, which is based on the maximization process between components with a latent random variable, suitable for modeling of some stress and maintenance models. This procedure has also been applied in the literature to generate other bivariate distributions, for example, the bivariate generalized linear failure rate (BGLFR) given by Sarhan et al. [14], the bivariate log-exponentiated Kumaraswamy (BlogEK) introduced by Elsherpieny et al. [15], the bivariate exponentiated modified Weibull extension (BEMWE) given by El-Gohary et al. [16], the bivariate inverse Weibull (BIW) studied by Muhammed [17] and Kundu and Gupta [18], the bivariate Dagum (BD) provided by Muhammed [19], the bivariate generalized Rayleigh (BGR) depicted by Sarhan [20], the bivariate Gumbel-G (BGu-G) presented by Eliwa and El-Morshedy [21], the bivariate generalized inverted Kumaraswamy (BGIK) given by Muhammed [10], and the bivariate Burr type X-G (BBX-G) proposed by El-Morshedy et al. [12]. Some associated inferential issues have been discussed in these articles, and all of them are based on considering the same kind of baseline components. In each of these bivariate models, the baseline components belong to the proportional reversed hazard rate (PRH) family with a certain underlying distribution (Gupta et al. [22] and Di Crescenzo [23]). It is worth mentioning that Kundu and Gupta [24] extended the BGE model by using components within the PRH family, called a bivariate proportional reversed hazard rate (BPRH) family, and a multivariate extension of the BPRH model was studied by Kundu et al. [9].
The main aim of this paper is to provide a more flexible generator of bivariate distributions based on the maximization process from an arbitrary three-dimensional baseline continuous distribution vector, i.e., not necessarily identical continuous distributions. Hence, this proposed generator allows researchers and practitioners to generate new bivariate distributions even by combining non-identically distributed baseline components, which may be interpreted as a stress model or as a maintenance model. We refer to the bivariate models from this generator as the generalized bivariate distribution (GBD) family, which contains as special cases the aforementioned bivariate distributions. Note that a two-dimensional random variable ( X 1 , X 2 ) , belonging to the GBD family, has dependent components due to a latent factor, and its joint cumulative distribution function (cdf) is not absolutely continuous, i.e., the joint cdf is a mixture of an absolutely continuous part and a singular part due to the positive probability of the event X 1 = X 2 , whereas the line x 1 = x 2 has two-dimensional Lebesgue measure zero. In general, the maximum likelihood estimation (MLE) of the unknown parameters of a GBD model cannot be obtained in closed form, and we propose using an EM algorithm to compute the MLEs of such parameters.
The rest of the paper is organized as follows. The construction of the GBD family is given in Section 2, and we obtain its decomposition in absolutely continuous and singular parts and its joint probability density function (pdf). In Section 3, several special bivariate models are presented. The cdf and pdf of the marginals and conditional distributions are derived in Section 4, as well as for its order statistics. Some dependence and two-dimensional ageing properties for the GBD family, and stochastic properties of their marginals and order statistics are studied in Section 5, as well as its copula representation and some related association measures. The EM algorithm is proposed in Section 6, which is applied in Section 7, for illustrative purposes, to find the MLEs of particular models of the GBD family in the analysis of two real data sets. Finally, the multivariate extension is discussed in Section 8, as well as the concluding remarks. Some of the proofs are relegated to Appendix A for a fluent presentation of the results, and some technical details of the applications can be found in Appendix B.

2. The GBD Family

In this section, we define the generalized bivariate distribution family as a generator system from any three-dimensional baseline continuous distribution, and then we provide its joint cdf, decomposition, and joint pdf.
Let U 1 , U 2 , and U 3 be mutually independent random variables with any continuous distribution functions F U 1 , F U 2 and F U 3 , respectively. Let X 1 = max ( U 1 , U 3 ) and X 2 = max ( U 2 , U 3 ) . Then, the random vector ( X 1 , X 2 ) is said to be a GBD model with baseline distribution vector ( F U 1 , F U 2 , F U 3 ) .
Theorem 1.
Let ( X 1 , X 2 ) be a GBD model with baseline distribution vector ( F U 1 , F U 2 , F U 3 ) , then its joint cdf is given by
F ( x 1 , x 2 ) = F U 1 ( x 1 ) F U 2 ( x 2 ) F U 3 ( z ) ,
where z = min ( x 1 , x 2 ) , for all x 1 , x 2 R .
Proof. 
It is immediate since
F ( x 1 , x 2 ) = P ( X 1 x 1 , X 2 x 2 ) = P ( max ( U 1 , U 3 ) x 1 , max ( U 2 , U 3 ) x 2 ) = P ( U 1 x 1 , U 2 x 2 , U 3 min ( x 1 , x 2 ) ) = F U 1 ( x 1 ) F U 2 ( x 2 ) F U 3 ( min ( x 1 , x 2 ) ) .
For instance, a stress model may lead to the GBD family, as in Kundu and Gupta [13]. Suppose a two-component system where each component is subject to an individual independent stress, say U 1 and U 2 , respectively. The system has an overall stress U 3 which has been equally transmitted to both the components, independent of their individual stresses. Then, the observed stress for each component is the maximum of both, individual and overall stresses, i.e., X 1 = max ( U 1 , U 3 ) and X 2 = max ( U 2 , U 3 ) , and ( X 1 , X 2 ) is a GBD model.
Analogously, a GBD model is also plausible for a maintenance model. Suppose a system has two components, and it is assumed that each component has been maintained independently and there is also an overall maintenance. Due to component maintenance, the lifetime of the individual component is increased by a random time, say U 1 and U 2 respectively, and, because of the overall maintenance, the lifetime of each component is increased by another random time U 3 . Then, the increased lifetime of each component is the maximum of both individual and overall maintenances, X 1 = max ( U 1 , U 3 ) and X 2 = max ( U 2 , U 3 ) , respectively.
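Both interpretations suggest a direct way to simulate from a GBD model: draw the three baseline components independently and take componentwise maxima with the latent variable. The following Python sketch illustrates this under hypothetical baseline choices (two exponential components and a Weibull latent component); the distributions and parameter values are for illustration only and are not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_gbd(n, sample_u1, sample_u2, sample_u3):
    """Draw n pairs (X1, X2) from a GBD model via X_i = max(U_i, U_3)."""
    u1, u2, u3 = sample_u1(n), sample_u2(n), sample_u3(n)
    return np.maximum(u1, u3), np.maximum(u2, u3)

# Hypothetical baselines: two exponentials and a Weibull latent component.
x1, x2 = sample_gbd(
    10_000,
    lambda n: rng.exponential(scale=1 / 0.03, size=n),   # U1: exponential, rate 0.03
    lambda n: rng.exponential(scale=1 / 0.05, size=n),   # U2: exponential, rate 0.05
    lambda n: 25.0 * rng.weibull(6, size=n),              # U3: Weibull, shape 6, scale 25
)
print("proportion of ties X1 == X2:", np.mean(x1 == x2))
```

Note that the simulated sample contains exact ties X_1 = X_2 with positive frequency, which reflects the singular part of the joint distribution discussed next.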
As mentioned before, a bivariate model belonging to the GBD family does not have an absolutely continuous cdf. Let us see now the decomposition of a GBD model as a mixture of bivariate absolutely continuous and singular cdfs; the proof is provided in Appendix A.
Theorem 2.
Let ( X 1 , X 2 ) be a GBD model with baseline distribution vector ( F U 1 , F U 2 , F U 3 ) . Then,
F(x_1, x_2) = α F_s(x_1, x_2) + (1 − α) F_{ac}(x_1, x_2)
where
F_s(x_1, x_2) = (1/α) ∫_{−∞}^{z} F_{U_1}(u) F_{U_2}(u) dF_{U_3}(u)
and
F_{ac}(x_1, x_2) = (1/(1 − α)) [ F_{U_1}(x_1) F_{U_2}(x_2) F_{U_3}(z) − ∫_{−∞}^{z} F_{U_1}(u) F_{U_2}(u) dF_{U_3}(u) ]
with z = min(x_1, x_2), are the singular and absolutely continuous parts, respectively, and
α = ∫_{−∞}^{∞} F_{U_1}(u) F_{U_2}(u) dF_{U_3}(u).
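As a numerical illustration of Theorem 2, the weight α of the singular part can be computed by one-dimensional integration and compared with a Monte Carlo estimate of P(X_1 = X_2) = P(U_3 ≥ max(U_1, U_2)). The sketch below assumes hypothetical exponential baselines; it is not the authors' code.

```python
import numpy as np
from scipy import stats
from scipy.integrate import quad

# Hypothetical exponential baselines U_j with rates lam_j.
lam1, lam2, lam3 = 0.03, 0.05, 0.04
U1, U2, U3 = (stats.expon(scale=1 / l) for l in (lam1, lam2, lam3))

# alpha = integral of F_{U1}(u) F_{U2}(u) dF_{U3}(u): weight of the singular part,
# i.e. P(X1 = X2) = P(U3 >= max(U1, U2)).
alpha, _ = quad(lambda u: U1.cdf(u) * U2.cdf(u) * U3.pdf(u), 0, np.inf)

# Monte Carlo check of the same probability.
rng = np.random.default_rng(1)
n = 200_000
u1 = U1.rvs(n, random_state=rng)
u2 = U2.rvs(n, random_state=rng)
u3 = U3.rvs(n, random_state=rng)
print(alpha, np.mean(u3 >= np.maximum(u1, u2)))
```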
In addition, due to the singular part F s in (2), the GBD family does not have a pdf with respect to the two-dimensional Lebesgue measure even when the distribution functions F U 1 , F U 2 , and F U 3 are absolutely continuous. However, it is possible to construct a joint pdf for ( X 1 , X 2 ) through a mixture between a pdf with respect to the two-dimensional Lebesgue measure and a pdf with respect to the one-dimensional Lebesgue measure (the proof is provided in Appendix A).
Theorem 3.
If ( X 1 , X 2 ) is a GBD model with joint cdf given by (1), then the joint pdf with respect to μ, the measure associated with F, is
f(x_1, x_2) = f_1(x_1, x_2), if x_1 < x_2; f_2(x_1, x_2), if x_1 > x_2; f_0(x), if x_1 = x_2 = x,
where
f_i(x_1, x_2) = f_{U_j}(x_j) [ f_{U_i}(x_i) F_{U_3}(x_i) + F_{U_i}(x_i) f_{U_3}(x_i) ], with i ≠ j ∈ {1, 2},
and
f_0(x) = f_{U_3}(x) F_{U_1}(x) F_{U_2}(x),
when the pdf f U i of U i exists, i = 1 , 2 , 3 .
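A direct implementation of the joint density of Theorem 3 only requires the baseline pdfs and cdfs. The following sketch, again with hypothetical exponential baselines, evaluates the two absolutely continuous pieces and the density on the diagonal.

```python
import numpy as np
from scipy import stats

# Hypothetical exponential baselines; any scipy.stats frozen laws work the same way.
U = [stats.expon(scale=1 / l) for l in (0.03, 0.05, 0.04)]

def gbd_pdf(x1, x2):
    """Joint density of Theorem 3: 2-D density off the diagonal, 1-D density on it."""
    if x1 < x2:    # f_1(x1, x2)
        return U[1].pdf(x2) * (U[0].pdf(x1) * U[2].cdf(x1) + U[0].cdf(x1) * U[2].pdf(x1))
    if x1 > x2:    # f_2(x1, x2)
        return U[0].pdf(x1) * (U[1].pdf(x2) * U[2].cdf(x2) + U[1].cdf(x2) * U[2].pdf(x2))
    # f_0(x) on the diagonal x1 = x2 = x
    return U[2].pdf(x1) * U[0].cdf(x1) * U[1].cdf(x1)

print(gbd_pdf(10.0, 25.0), gbd_pdf(25.0, 10.0), gbd_pdf(15.0, 15.0))
```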

3. Special Cases

In this section, we derive new bivariate models from Theorem 1, taking into account particular baseline distribution vectors ( F U 1 , F U 2 , F U 3 ) .
Note that, if the baseline components U i s belong to the same distribution family, say F U , then the proposed generator provides novel extended bivariate versions of that distribution F U . Furthermore, under certain restrictions on the underlying parameters of each U i , bivariate distributions given in the literature are obtained. From now on, it is assumed that all parameters of each F U i are positive unless otherwise mentioned.
Extended bivariate generalized exponential model. A random variable U follows a GE distribution, U G E ( θ , λ ) (see Gupta and Kundu [25]), if its cdf is given by
F_{GE}(u; θ, λ) = (1 − e^{−λu})^{θ}, for u > 0.
If U i G E ( θ i , λ i ) i = 1 , 2 , 3 , then the GBD model with the GE baseline distribution vector is an extended BGE model with θ = ( θ 1 , θ 2 , θ 3 ) and λ = ( λ 1 , λ 2 , λ 3 ) parameter vectors, denoted as ( X 1 , X 2 ) E B G E ( θ , λ ) , and its joint cdf is
F E B G E ( x 1 , x 2 ) = F G E ( x 1 ; θ 1 , λ 1 ) F G E ( x 2 ; θ 2 , λ 2 ) F G E ( z ; θ 3 , λ 3 ) , for x 1 > 0 , x 2 > 0 ,
where z = min ( x 1 , x 2 ) .
As a particular case, if λ = λ i , i = 1 , 2 , 3 , ( X 1 , X 2 ) B G E ( θ , λ ) given by Kundu and Gupta [13].
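Any special case of Section 3 can be coded in the same way, by plugging the chosen baseline cdfs into the product form of Theorem 1. The sketch below builds the EBGE joint cdf from three GE components; the parameter values are hypothetical.

```python
import numpy as np

def gbd_cdf(F1, F2, F3):
    """Joint cdf of Theorem 1: F(x1, x2) = F1(x1) F2(x2) F3(min(x1, x2))."""
    return lambda x1, x2: F1(x1) * F2(x2) * F3(np.minimum(x1, x2))

def ge_cdf(theta, lam):
    """GE(theta, lam) cdf: (1 - e^{-lam u})^theta for u > 0, and 0 otherwise."""
    def F(u):
        u = np.asarray(u, dtype=float)
        return np.clip(1.0 - np.exp(-lam * u), 0.0, None) ** theta
    return F

# Hypothetical EBGE parameters (theta_j, lambda_j), j = 1, 2, 3.
F_ebge = gbd_cdf(ge_cdf(2.0, 1.0), ge_cdf(1.5, 0.8), ge_cdf(1.0, 1.2))
print(F_ebge(1.0, 2.0), F_ebge(2.0, 1.0), F_ebge(3.0, 3.0))
```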
Extended bivariate proportional reversed hazard rate model. If U i P R H ( θ i ) with base distribution F B i i = 1 , 2 , 3 , i.e., its cdf can be expressed as F U i = F B i θ i (see Gupta et al. [22] and Di Crescenzo [23]), then the GBD model with PRH baseline distribution vector provides an extended BPRH model, ( X 1 , X 2 ) E B P R H ( θ , λ ) , with θ = ( θ 1 , θ 2 , θ 3 ) parameter vector of the PRH components and λ = ( λ 1 , λ 2 , λ 3 ) parameter vector of the underlying distributions F B i ’s. From (1), its joint cdf is given by
F E B P R H ( x 1 , x 2 ) = F B 1 θ 1 ( x 1 ; λ 1 ) F B 2 θ 2 ( x 2 ; λ 2 ) F B 3 θ 3 ( z ; λ 3 ) , for x 1 > 0 , x 2 > 0 ,
where z = min ( x 1 , x 2 ) .
In particular, if the PRH components have the same base distribution, F B = F B i i = 1 , 2 , 3 , then ( X 1 , X 2 ) B P R H ( θ , λ ) with baseline distribution F B ( · ; λ ) introduced by Kundu and Gupta [24].
Extended bivariate generalized linear failure rate model. It is said that a random variable U follows a GLFR distribution, U G L F R ( θ , λ , γ ) (see Sarhan and Kundu [26]), if its cdf is given by
F_{GLFR}(u; θ, λ, γ) = (1 − exp(−(λu + (γ/2)u^2)))^{θ}, for u > 0.
If U i G L F R ( θ i , λ i , γ i ) i = 1 , 2 , 3 , then the GBD model with GLFRs baseline distribution vector is an extended BGLFR model, ( X 1 , X 2 ) E B G L F R ( θ , λ , γ ) , with parameters θ = ( θ 1 , θ 2 , θ 3 ) , λ = ( λ 1 , λ 2 , λ 3 ) , and γ = ( γ 1 , γ 2 , γ 3 ) , having joint cdf
F E B G L F R ( x 1 , x 2 ) = F G L F R ( x 1 ; θ 1 , λ 1 , γ 1 ) F G L F R ( x 2 ; θ 2 , λ 2 , γ 2 ) F G L F R ( z ; θ 3 , λ 3 , γ 3 ) , for x 1 > 0 , x 2 > 0 ,
where z = min ( x 1 , x 2 ) .
When λ i = λ and γ i = γ , i = 1 , 2 , 3 , it is obtained that ( X 1 , X 2 ) B G L F R ( θ , λ , γ ) given by Sarhan et al. [14].
Extended bivariate log-exponentiated Kumaraswamy model. Let U be a random variable with logEK distribution, U l o g E K ( θ , λ , γ ) (see Lemonte et al. [27]), then its cdf
F_{logEK}(u; θ, λ, γ) = (1 − (1 − (1 − e^{−u})^{λ})^{γ})^{θ}, for u > 0.
If U i l o g E K ( θ i , λ i , γ i ) i = 1 , 2 , 3 , then the GBD model with logEKs baseline distribution vector is an extended BlogEK model, ( X 1 , X 2 ) E B l o g E K ( θ , λ , γ ) with parameters θ = ( θ 1 , θ 2 , θ 3 ) , λ = ( λ 1 , λ 2 , λ 3 ) , and γ = ( γ 1 , γ 2 , γ 3 ) , and its joint cdf is given by
F E B l o g E K ( x 1 , x 2 ) = F l o g E K ( x 1 ; θ 1 , λ 1 , γ 1 ) F l o g E K ( x 2 ; θ 2 , λ 2 , γ 2 ) F l o g E K ( z ; θ 3 , λ 3 , γ 3 ) , for x 1 > 0 , x 2 > 0 ,
where z = min ( x 1 , x 2 ) .
Clearly, it can be seen that ( X 1 , X 2 ) B l o g E K ( θ , λ , γ ) given by Elsherpieny et al. [15], when λ i = λ and γ i = γ , i = 1 , 2 , 3 .
Extended bivariate exponentiated modified Weibull extension model. A random variable U follows an EMWE distribution, U E M W E ( θ , α , β , λ ) (see Sarhan and Apaloo [28]), if its cdf can be expressed as
F_{EMWE}(u; θ, α, β, λ) = (1 − exp(αλ(1 − e^{(u/α)^{β}})))^{θ}, for u > 0.
If U i E M W E ( θ i , α i , β i , λ i ) i = 1 , 2 , 3 , then the GBD model with EMWEs baseline distribution vector is an extended BEMWE model, ( X 1 , X 2 ) E B E M W E ( θ , α , β , λ ) with θ = ( θ 1 , θ 2 , θ 3 ) and α = ( α 1 , α 2 , α 3 ) , β = ( β 1 , β 2 , β 3 ) , and λ = ( λ 1 , λ 2 , λ 3 ) parameter vectors, and its joint cdf is given by
F E B E M W E ( x 1 , x 2 ) = F E M W E ( x 1 ; θ 1 , α 1 , β 1 , λ 1 ) F E M W E ( x 2 ; θ 2 , α 2 , β 2 , λ 2 ) F E M W E ( z ; θ 3 , α 3 , β 3 , λ 3 ) ,
for x 1 > 0 and x 2 > 0 , where z = min ( x 1 , x 2 ) .
Note that, if α i = α , β i = β and λ i = λ , i = 1 , 2 , 3 , then ( X 1 , X 2 ) B E M W E ( θ , α , β , λ ) given by El-Gohary et al. [16].
Extended bivariate inverse Weibull model. The cdf of the IW distribution (e.g., see Keller et al. [29]) is defined by
F_{IW}(u; θ, λ) = e^{−λ u^{−θ}}, for u > 0.
If U i I W ( θ i , λ i ) i = 1 , 2 , 3 , then the GBD model with IWs baseline distribution vector is an extended BIW model with θ = ( θ 1 , θ 2 , θ 3 ) and λ = ( λ 1 , λ 2 , λ 3 ) parameter vectors, denoted as ( X 1 , X 2 ) E B I W ( θ , λ ) , and its joint cdf can be written as
F_{EBIW}(x_1, x_2) = exp(−λ_1 x_1^{−θ_1} − λ_2 x_2^{−θ_2} − λ_3 z^{−θ_3}), for x_1 > 0, x_2 > 0,
where z = min ( x 1 , x 2 ) .
In particular, ( X 1 , X 2 ) B I W ( θ , λ ) studied by Muhammed [17] and Kundu and Gupta [18], when θ i = θ for i = 1 , 2 , 3 .
Extended bivariate Dagum model. It is said that a random variable U follows a Dagum distribution [30], U D ( θ , λ , γ ) , if its cdf is given by
F_D(u; θ, λ, γ) = (1 + λ u^{−γ})^{−θ}, for u > 0.
If U i D ( θ i , λ i , γ i ) i = 1 , 2 , 3 , then the GBD model with Dagum baseline distribution vector is an extended BD model with θ = ( θ 1 , θ 2 , θ 3 ) , λ = ( λ 1 , λ 2 , λ 3 ) and γ = ( γ 1 , γ 2 , γ 3 ) parameter vectors, denoted as ( X 1 , X 2 ) E B D ( θ , λ , γ ) , having joint cdf
F E B D ( x 1 , x 2 ) = F D ( x 1 ; θ 1 , λ 1 , γ 1 ) F D ( x 2 ; θ 2 , λ 2 , γ 2 ) F D ( z ; θ 3 , λ 3 , γ 3 ) , for x 1 > 0 , x 2 > 0 ,
where z = min ( x 1 , x 2 ) .
Note that, when λ i = λ and γ i = γ for i = 1 , 2 , 3 , it is simplified to the model ( X 1 , X 2 ) B D ( θ , λ , γ ) defined by Muhammed [19].
Extended bivariate generalized Rayleigh model. The cdf of the GR distribution, also called Burr type X model [31], is
F_{GR}(u; θ, λ) = (1 − e^{−(λu)^2})^{θ}, for u > 0.
If U i G R ( θ i , λ i ) i = 1 , 2 , 3 , then the GBD model with a GR baseline distribution vector is an extended BGR model with θ = ( θ 1 , θ 2 , θ 3 ) and λ = ( λ 1 , λ 2 , λ 3 ) parameter vectors, ( X 1 , X 2 ) E B G R ( θ , λ ) , with joint cdf
F E B G R ( x 1 , x 2 ) = F G R ( x 1 ; θ 1 , λ 1 ) F G R ( x 2 ; θ 2 , λ 2 ) F G R ( z ; θ 3 , λ 3 ) , for x 1 > 0 , x 2 > 0 ,
where z = min ( x 1 , x 2 ) .
Hence, if λ i = λ , i = 1 , 2 , 3 , it is obtained that ( X 1 , X 2 ) B G R ( θ , λ ) given by Sarhan [20].
Extended bivariate Gumbel-G model. Alzaatreh et al. [32] proposed a transformed-transformer method for generating families of continuous distributions. Using this method, it is said that a random variable U follows a Gumbel-G model, U G u - G ( θ , α , λ ) , if its cdf can be expressed as
F_{Gu-G}(u; G, θ, α, λ) = exp(−θ ((1 − G(u; λ))/G(u; λ))^{α}), for u > 0
where G is the transformer distribution with parameter vector λ . If U i G u - G ( θ i , α i , λ i ) i = 1 , 2 , 3 , then the GBD model with Gu-Gs baseline distribution vector is an extended BGu-G model, ( X 1 , X 2 ) E B G u - G ( θ , α , λ G ) , with parameters θ = ( θ 1 , θ 2 , θ 3 ) , α = ( α 1 , α 2 , α 3 ) , and λ G = ( λ 1 , λ 2 , λ 3 ) , where λ G encompasses all parameter vectors of G in each baseline component. Thus, its joint cdf is given by
F E B G u G ( x 1 , x 2 ) = F G u G ( x 1 ; G , θ 1 , α 1 , λ 1 ) F G u G ( x 2 ; G , θ 2 , α 2 , λ 2 ) F G u G ( z ; G , θ 3 , α 3 , λ 3 ) ,
for x 1 > 0 , x 2 > 0 , where z = min ( x 1 , x 2 ) .
In particular, when α i = α and λ i = λ for i = 1 , 2 , 3 , ( X 1 , X 2 ) B G u - G ( θ , α , λ ) given by Eliwa and El-Morshedy [21].
Extended bivariate generalized inverted Kumaraswamy model. A random variable U is said to be a GIK distribution defined by Iqbal et al. [33], if its cdf is given by
F_{GIK}(u; θ, α, γ) = (1 − (1 + u^{γ})^{−α})^{θ}, for u > 0.
If U i G I K ( θ i , α i , γ i ) i = 1 , 2 , 3 , then the GBD model with GIKs baseline distribution vector is an extended BGIK model, ( X 1 , X 2 ) E B G I K ( θ , α , γ ) , with parameters θ = ( θ 1 , θ 2 , θ 3 ) , α = ( α 1 , α 2 , α 3 ) , and γ = ( γ 1 , γ 2 , γ 3 ) , and its joint cdf can be written as
F E B G I K ( x 1 , x 2 ) = F G I K ( x 1 ; θ 1 , α 1 , γ 1 ) F G I K ( x 2 ; θ 2 , α 2 , γ 2 ) F G I K ( z ; θ 3 , α 3 , γ 3 ) , for x 1 > 0 , x 2 > 0 ,
where z = min ( x 1 , x 2 ) .
It is straightforward to see that ( X 1 , X 2 ) B G I K ( θ , α , γ ) analyzed by Muhammed [10] when α = α i and γ = γ i for i = 1 , 2 , 3 .
Extended bivariate Burr type X-G model. From the transformed-transformer method of Alzaatreh et al. [32], it is said that a random variable U follows a Burr X-G model, U B X - G ( θ , λ ) , if its cdf can be expressed as
F_{BX-G}(u; G, θ, λ) = (1 − exp(−(G(u; λ)/(1 − G(u; λ)))^2))^{θ}, for u > 0
where λ is the parameter vector of the transformer distribution G.
If U i B X - G ( θ i , λ i ) i = 1 , 2 , 3 , then the GBD model with BX-Gs baseline distribution vector is an extended BBX-G model, ( X 1 , X 2 ) E B B X - G ( θ , λ G ) , with parameters θ = ( θ 1 , θ 2 , θ 3 ) , and λ G = ( λ 1 , λ 2 , λ 3 ) , where λ G encompasses all parameter vectors of G in each baseline component. Then, its joint cdf can be expressed as
F E B B X G ( x 1 , x 2 ) = F B X G ( x 1 ; θ 1 , λ 1 ) F B X G ( x 2 ; θ 2 , λ 2 ) F B X G ( z ; θ 3 , λ 3 ) , for x 1 > 0 , x 2 > 0 ,
where z = min ( x 1 , x 2 ) .
In particular, if λ = λ i for i = 1 , 2 , 3 , then ( X 1 , X 2 ) B B X - G ( θ , λ ) introduced by El-Morshedy et al. [12].
GBD models from different baseline components. In addition, a GBD model can be derived from baseline components U i s belonging to different distribution families, which allows one to generate new bivariate distributions.
For illustrative purposes, Figure 1a–d display 3D surfaces of different joint pdfs given by Theorem 3, along with their contour plots. Here, U 1 and U 2 are taken identically distributed G E ( θ , λ ) with different shape and scale parameter values, and U 3 having a Weibull distribution with scale parameter λ 3 and shape parameter α = 6 , W ( λ 3 , 6 ) .
Figure 1 shows that some of these GBD models are multi-modal bivariate models. It indicates a variety of shapes for the GBD family depending on the different baseline distribution components and for different parameter values.

4. Distributional Properties

Here, we derive the marginal and conditional distributions of the GBD family, and the order statistics. Furthermore, some properties for particular baseline distribution vectors are provided.

4.1. Marginal and Conditional Distributions

From Theorem 1, it is easy to obtain the marginal cdfs of the components X i ’s, which can be written as
F X i ( x i ) = F U i ( x i ) F U 3 ( x i ) , with i = 1 , 2 ,
and, when the pdf f U i of U i exists, i = 1 , 2 , 3 , the corresponding pdfs are given by
f X i ( x i ) = f U i ( x i ) F U 3 ( x i ) + F U i ( x i ) f U 3 ( x i ) , with i = 1 , 2 .
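For a quick numerical check of these marginal formulas, one can combine arbitrary scipy.stats distributions as baseline components; the choices below (exponential, gamma, and Weibull components) are hypothetical and only meant to show the computation.

```python
import numpy as np
from scipy import stats

# Hypothetical baseline components.
U1 = stats.expon(scale=10.0)
U2 = stats.gamma(a=2.0, scale=5.0)
U3 = stats.weibull_min(c=6.0, scale=20.0)

def marginal_cdf(Ui, U3, x):
    # Marginal cdf F_{X_i}(x) = F_{U_i}(x) F_{U_3}(x).
    return Ui.cdf(x) * U3.cdf(x)

def marginal_pdf(Ui, U3, x):
    # Marginal pdf: product rule applied to F_{U_i} F_{U_3}.
    return Ui.pdf(x) * U3.cdf(x) + Ui.cdf(x) * U3.pdf(x)

x = np.linspace(0.1, 60.0, 5)
print(marginal_cdf(U1, U3, x), marginal_pdf(U1, U3, x))
```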
For instance, we shall now suppose that U i s have PRH distributions, in order to provide some preservation results of the PRH property on the marginals, and its closure under exponentiation of the underlying distributions.
Proposition 1.
If ( X 1 , X 2 ) has a GBD model formed by U_i ∼ PRH(θ_i) with baseline distribution F_{B_i} (i = 1, 2, 3), then X_i ∼ PRH(θ_i + θ_3) with base distribution F_{B_i}^* = F_{B_i}^{θ_i/(θ_i + θ_3)} F_{B_3}^{θ_3/(θ_i + θ_3)}. Moreover, when the base distribution is common, F_B = F_{B_i}, then the X_i s also have the same baseline distribution F_B.
Proof. 
It immediately follows from (5) and the EBPRH model, since F U i = F B i θ i . □
Corollary 1.
If U_i ∼ PRH(θ_i) with base distribution F_{B_i}, having F_{B_i} ∼ PRH(λ_i) with base distribution F_{B̃_i} (i = 1, 2, 3), then X_i ∼ PRH(θ_i λ_i + θ_3 λ_3) with base distribution F_{B_i}^* = F_{B̃_i}^{θ_i λ_i/(θ_i λ_i + θ_3 λ_3)} F_{B̃_3}^{θ_3 λ_3/(θ_i λ_i + θ_3 λ_3)}. Moreover, if F_{B̃} = F_{B̃_i} (i = 1, 2, 3), then the X_i s also have the same base distribution F_{B̃}.
In addition, Figure 2 displays the plots of the marginal pdfs of the GBD models depicted in Figure 1a–d.
Note that Figure 2a–d show some bimodal shapes for the marginal pdfs given by (6) of the GBD models represented in Figure 1a–d, which also exhibit some multi-modal shapes of the joint pdfs. In this setting, Proposition 1 might be used to generate bimodal distributions from the marginals of the GBD family by mixing different baseline distribution components as in Figure 1.
Furthermore, we provide some results about the conditional distributions of a GBD model whose proof can be found in Appendix A.
Theorem 4.
If ( X 1 , X 2 ) has a GBD model with baseline distribution vector ( F U 1 , F U 2 , F U 3 ) , then
1. 
The conditional distribution of X_i given X_j ≤ x_j (i ≠ j), say F_{i|X_j ≤ x_j}, is an absolutely continuous cdf given by
F_{i|X_j ≤ x_j}(x_i) = F_{U_i}(x_i) F_{U_3}(x_i)/F_{U_3}(x_j), if x_i < x_j; F_{U_i}(x_i), if x_i ≥ x_j.
2. 
The conditional pdf of X_i given X_j = x_j (i ≠ j), say f_{i|X_j = x_j}, is a convex combination of a degenerate part and an absolutely continuous part, given by
f_{i|X_j = x_j}(x_i) = α_j I_{x_j}(x_i) + (1 − α_j) f_{i|x_j, ac}(x_i),
where I_{x_j} is the indicator function of the given point x_j, and f_{i|x_j, ac} is the absolutely continuous part
f_{i|X_j = x_j, ac}(x_i) = (1/(1 − α_j)) { f_{X_i}(x_i) f_{U_j}(x_j)/f_{X_j}(x_j), if x_i < x_j; f_{U_i}(x_i), if x_i > x_j; 0, if x_i = x_j }
and the mixing weight α_j is constant with respect to x_i:
α_j = F_{U_1}(x_j) F_{U_2}(x_j) f_{U_3}(x_j)/f_{X_j}(x_j).

4.2. Minimum and Maximum Order Statistics

Now, we provide the cdfs of the maximum and minimum order statistics of a GBD model, which may be interpreted as the lifetimes of parallel and series systems based on the components of ( X 1 , X 2 ) .
Theorem 5.
If T 1 = min ( X 1 , X 2 ) and T 2 = max ( X 1 , X 2 ) of a GBD model ( X 1 , X 2 ) with baseline distribution vector ( F U 1 , F U 2 , F U 3 ) , then their cdfs are given by
F T 1 ( x ) = F U 3 ( x ) F U 1 : 2 ( x ) and F T 2 ( x ) = F U 3 : 3 ( x )
where U 1 : 2 = min ( U 1 , U 2 ) and U 3 : 3 = max ( U 1 , U 2 , U 3 ) .
Proof. 
It is trivial from (1) and (5) by taking into account that F T 2 ( x ) = F ( x , x ) and F T 1 ( x ) = F X 1 ( x ) + F X 2 ( x ) F T 2 ( x ) . □
The pdfs f T 1 and f T 2 of the minimum and maximum statistics can be readily obtained by differentiation of (7).
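The cdfs of Theorem 5 are again simple products of the baseline cdfs; the sketch below (with hypothetical exponential baselines) also verifies the identity F_{T_1} = F_{X_1} + F_{X_2} − F_{T_2} used in the proof.

```python
import numpy as np
from scipy import stats

# Hypothetical exponential baselines.
U1, U2, U3 = (stats.expon(scale=1 / l) for l in (0.03, 0.05, 0.04))

def cdf_T2(x):
    # T2 = max(X1, X2) = max(U1, U2, U3), so F_{T2} = F_{U1} F_{U2} F_{U3}.
    return U1.cdf(x) * U2.cdf(x) * U3.cdf(x)

def cdf_T1(x):
    # T1 = min(X1, X2): F_{T1} = F_{U3} * F_{U_{1:2}}, with F_{U_{1:2}} = 1 - (1 - F_{U1})(1 - F_{U2}).
    return U3.cdf(x) * (1.0 - (1.0 - U1.cdf(x)) * (1.0 - U2.cdf(x)))

x = 30.0
fx1, fx2 = U1.cdf(x) * U3.cdf(x), U2.cdf(x) * U3.cdf(x)
print(cdf_T1(x), fx1 + fx2 - cdf_T2(x))  # the two values should agree
```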
Furthermore, the PRH property is preserved by the maximum order statistic of a GBD model, which is immediately derived from Theorem 5.
Corollary 2.
If U_i ∼ PRH(θ_i) with baseline distribution F_{B_i} (i = 1, 2, 3), then T_2 ∼ PRH(θ) with base F_B^{(2)} = F_{B_1}^{θ_1/θ} F_{B_2}^{θ_2/θ} F_{B_3}^{θ_3/θ} and θ = θ_1 + θ_2 + θ_3. Moreover, when F_B = F_{B_i} (i = 1, 2, 3), then T_2 also has the same base distribution F_B.

5. Dependence and Stochastic Properties

In this section, we study various dependence and stochastic properties on the GBD family, its marginals and order statistics, and its copula representation. Notions of dependence and ageing for bivariate distributions can be found in Lai and Xie [34] and Balakrishnan and Lai [4]; see also Shaked and Shantikumar [35] for univariate and multivariate stochastic orders.

5.1. GBD Model

Proposition 2.
If ( X 1 , X 2 ) follows a GBD model, then ( X 1 , X 2 ) is positive quadrant dependent (PQD).
Proof. 
From (1) and (5), it is readily obtained that F(x_1, x_2) ≥ F_{X_1}(x_1) F_{X_2}(x_2), which is equivalent to saying that any random vector ( X 1 , X 2 ) having a GBD model is PQD. □
An immediate consequence of the PQD property is that C o v ( X 1 , X 2 ) > 0 . Other important bivariate dependence properties are the following, whose proofs are provided in Appendix A.
Proposition 3.
Let ( X 1 , X 2 ) be a random vector having a GBD model:
1. 
( X 1 , X 2 ) is left tail decreasing (LTD).
2. 
( X 1 , X 2 ) is left corner set decreasing (LCSD).
3. 
Its joint cdf F is totally positive of order 2 ( T P 2 ).
Proof. 
Note that F being T P 2 is equivalent to ( X 1 , X 2 ) being LCSD, which implies LTD (e.g., see Balakrishnan and Lai [4]). Thereby, we only have to prove (3). From the definition of the T P 2 property, it is equivalent to check that the following inequality holds:
F(x) F(x′)/(F(x ∨ x′) F(x ∧ x′)) ≤ 1,
for all x and x′, where x ∨ x′ = (max(x_1, x_1′), max(x_2, x_2′)), and x ∧ x′ = (min(x_1, x_1′), min(x_2, x_2′)). Hence, from (1), the inequality (8) can be expressed as
F_{U_3}(u) F_{U_3}(v)/(F_{U_3}(w) F_{U_3}(y)) ≤ 1,
where u = x_1 ∧ x_2, v = x_1′ ∧ x_2′, w = (x_1 ∨ x_1′) ∧ (x_2 ∨ x_2′) and y = u ∧ v. Moreover, one can observe that y ≤ u ∨ v = max(u, v) ≤ w.
Therefore, when u ≤ v, i.e., y = u ∧ v = u ≤ w, the inequality (8) can be simplified as follows:
F_{U_3}(v)/F_{U_3}(w) ≤ 1,
which is trivial, since v ≤ w and F_{U_3} is a cdf. An analogous development follows for u > v, which completes the proof. □
Let us see now some results related to the reversed hazard gradient of a random vector from the GBD family, which is defined as an extension of the univariate case, see Domma [36],
r(x) = (r_1(x), r_2(x)) = (∂/∂x_1, ∂/∂x_2) ln F(x_1, x_2)
where each r_i(x) represents the reversed hazard function of (X_i | X_j ≤ x_j), i ≠ j = 1, 2, and assuming that F is differentiable. In addition, it is said that ( X 1 , X 2 ) has a bivariate decreasing (increasing) reversed hazard gradient, BDRHG (BIRHG), if all components r_i s are decreasing (increasing) functions in the corresponding variables.
Proposition 4.
If ( X 1 , X 2 ) has a GBD model with baseline distribution vector ( F U 1 , F U 2 , F U 3 ) , then its reversed hazard gradient r ( x ) is given by
r_i(x) = r_{U_i}(x_i) + r_{U_3}(x_i), if x_i < x_j; r_{U_i}(x_i), if x_i ≥ x_j,
for i ≠ j = 1, 2, when the reversed hazard function of U_i, r_{U_i} = f_{U_i}/F_{U_i}, exists, i = 1, 2, 3.
Proof. 
The proof is straightforward from the definition of reversed hazard rate function corresponding to the conditional cdf F i | X j x j given by (1) of Theorem 4. □
Theorem 6.
Let ( X 1 , X 2 ) be a random vector having a GBD model. If U i s have decreasing reversed hazard functions (DRH), then ( X 1 , X 2 ) B D R H G .
Proof. 
It is straightforward from Proposition 4. □
Note that Theorem 6 provides the closure of the DRH property under the formation of a GBD model. Thus, the bivariate extension of a DRH distribution F U generated by the GBD family is BDRHG.
Nevertheless, it does not hold for the increasing reversed hazard (IRH) property, since both r i ( x ) given in Proposition 4 have a negative jump discontinuity at x i = x j for i j = 1 , 2 . Therefore, if U i I R H , then ( X 1 , X 2 ) cannot be BIRHG.
Finally, we present some interesting stochastic ordering results between bivariate random vectors of GBD type.
Theorem 7.
Let X = ( X 1 , X 2 ) and Y = ( Y 1 , Y 2 ) have GBD models with baseline distribution vectors ( F U 1 , F U 2 , F U 3 ) and ( F V 1 , F V 2 , F V 3 ) , respectively. If U_i ≤_{st} V_i (i = 1, 2, 3), then X ≤_{lo} Y.
Proof. 
The result immediately follows from the stochastic ordering between the components and (1), since U_i ≤_{st} V_i is equivalent to F_{U_i}(x) ≥ F_{V_i}(x), and the lower orthant ordering is defined by the inequality F_X(x) ≥ F_Y(x) for all x = (x_1, x_2). □
Corollary 3.
Let X ∼ EBPRH(θ, λ) and Y ∼ EBPRH(θ*, λ*) with base distributions F_{B_i} and F_{B_i}^* (i = 1, 2, 3), respectively. If θ_i ≤ θ_i^* and F_{B_i} ≤_{st} F_{B_i}^* (i = 1, 2, 3), then X ≤_{lo} Y.
Proof. 
It is obvious that F_{B_i}^{θ_i}(x_i) ≥ F_{B_i}^{θ_i^*}(x_i) ≥ F_{B_i^*}^{θ_i^*}(x_i), i.e., U_i ≤_{st} V_i, and then the proof readily follows from Theorem 7. □
Remark 1.
From Corollary 3, if both EBPRH models are based on a common base distribution vector, F_{B_i} = F_{B_i}^* (i = 1, 2, 3), then it is only necessary that θ_i ≤ θ_i^* for the lower orthant ordering to hold.

5.2. Marginals and Order Statistics

Now, we study some stochastic properties of the marginals and the minimum and maximum order statistics of the GBD model.
Firstly, from (5) and (6), the reversed hazard function of the marginal X i s can be expressed as
r_{X_i}(x) = f_{X_i}(x)/F_{X_i}(x) = r_{U_i}(x) + r_{U_3}(x), i = 1, 2.
Therefore, the DRH (IRH) property is preserved by the marginals.
Theorem 8.
If ( X 1 , X 2 ) has a GBD model formed by U i D R H ( i = 1 , 2 , 3 ), then X i D R H ( i = 1 , 2 ).
Remark 2.
Note that IRH distributions have an upper bounded support [37]. Thus, if any U_i is not upper bounded, its reversed hazard function is eventually decreasing, and then the marginal cannot be IRH. Therefore, it is necessary that U_i ∈ IRH (i = 1, 2, 3) and that they share the same upper bound for X_i ∈ IRH (i = 1, 2).
Example 1.
Suppose U i s have extreme value distributions of type 3 with a common support, U i E V 3 ( β , λ i , k i ) , whose cdf is defined by
F_{U_i}(u) = exp(−λ_i (β − u)^{k_i}), for u ≤ β
and F_{U_i}(u) = 1 otherwise. Its reversed hazard function is given by
r_{U_i}(u) = λ_i k_i (β − u)^{k_i − 1}, for u ∈ (−∞, β],
which is increasing (decreasing) in its support for k_i ≤ (≥) 1. Thus, if k_i ≤ (≥) 1 (i = 1, 2, 3), then U_i ∈ IRH (DRH), and, consequently, X_i ∈ IRH (DRH) (i = 1, 2).
Example 2.
If ( X 1 , X 2 ) has an EBGE model, then its marginals are DRH, since r_{X_i} given by (9) is the sum of two decreasing functions: each U_i ∼ GE(θ_i, λ_i) is a PRH(θ_i) model with exponential baseline distribution, whose reversed hazard function is
r_{U_i}(u) = θ_i r_{Exp(λ_i)}(u) = θ_i λ_i/(e^{λ_i u} − 1),
which is evidently a decreasing function. Here, E x p ( λ ) denotes an exponential random variable with mean 1 / λ .
Remark 3.
From (9), when the U i s have a common distribution F U , then the marginals X i P R H ( 2 ) with base distribution F U . Therefore, r X i ( x ) = 2 r U ( x ) has the same monotonicity. In particular, if F U D R H ( I R H ) , then X i D R H ( I R H ) .
Remark 4.
From (9), if U i P R H ( θ i ) with the same base distribution F B , then X i P R H ( θ i + θ 3 ) with base F B , i.e., r X i ( x ) = ( θ i + θ 3 ) r B ( x ) . Thus, Remark 3 also holds by using F B instead of F U .
Secondly, the mean inactivity time (MIT), also called mean waiting time [37], of a random variable X is defined as
m_X(x) = E(x − X | X ≤ x) = ∫_{−∞}^{x} F_X(y)/F_X(x) dy.
Thus, from (5), the MIT of the marginal X i s of a GBD model can be derived by
m_{X_i}(x) = (1/(F_{U_i}(x) F_{U_3}(x))) ∫_{−∞}^{x} F_{U_i}(y) F_{U_3}(y) dy, i = 1, 2.
Here, we shall focus on two particular cases of GBD models, having baseline components with monotonous MIT, which is preserved by the marginals.
Example 3.
Suppose U i E x p ( λ ) , then its MIT can be expressed as
m_{U_i}(u) = u/(1 − e^{−λu}) − 1/λ,
which is an increasing MIT function (IMIT), i.e., U_i ∈ IMIT. From (10), we obtain the MIT function of the marginals X_i s for the bivariate exponential version of GBD type,
m_{X_i}(x) = (2λx − 3 + 4e^{−λx} − e^{−2λx})/(2λ(1 − e^{−λx})^2).
Then, upon differentiation, m_{X_i}′(x) has the same sign as the expression 1 − e^{−2λx} − 2λx e^{−λx}, which is positive, and therefore X_i ∈ IMIT (i = 1, 2).
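The closed-form MIT of Example 3 can be checked numerically against its integral definition; the rate value used below is hypothetical.

```python
import numpy as np
from scipy.integrate import quad

lam = 0.05  # hypothetical common rate of U1, U2, U3

def F_Xi(x):
    # Marginal cdf when all baselines are Exp(lam): (1 - e^{-lam x})^2.
    return (1.0 - np.exp(-lam * x)) ** 2

def mit_numeric(x):
    # m_{X_i}(x) = integral_0^x F_{X_i}(y) dy / F_{X_i}(x)
    return quad(F_Xi, 0.0, x)[0] / F_Xi(x)

def mit_closed_form(x):
    # Closed form of Example 3.
    return (2 * lam * x - 3 + 4 * np.exp(-lam * x) - np.exp(-2 * lam * x)) / (
        2 * lam * (1 - np.exp(-lam * x)) ** 2
    )

for x in (5.0, 20.0, 60.0):
    print(x, mit_numeric(x), mit_closed_form(x))  # values agree and increase in x
```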
Example 4.
Suppose U i E V 3 ( β , λ i , k = 2 ) , then its MIT can be expressed as
m_{U_i}(u) = (1/e^{−λ_i(β − u)^2}) ∫_{−∞}^{u} e^{−λ_i(β − y)^2} dy = Φ(u; μ, σ_i)/ϕ(u; μ, σ_i), for u ≤ β,
where Φ(u; μ, σ_i) and ϕ(u; μ, σ_i) are the cdf and pdf of a normal model with μ = β and σ_i = 1/√(2λ_i), respectively. Moreover, taking into account that a random variable and its standardized version have PRH functions, and the standard normal distribution has the DRH property [38], we obtain that U_i ∈ IMIT.
Upon considering the cdf of the U_i s and (5), the marginal X_i ∼ EV3(β, λ_i + λ_3, 2) (i = 1, 2). Thus, their MIT can be written as
m_{X_i}(x) = Φ(x; μ, σ̃_i)/ϕ(x; μ, σ̃_i), for x ≤ β,
where σ̃_i = 1/√(2(λ_i + λ_3)) for i = 1, 2, and, consequently, X_i ∈ IMIT.
On the other hand, the following stochastic orderings among the three baseline components of two GBD models are preserved by their corresponding marginals. The proof immediately follows from the definitions of the stochastic orderings.
Theorem 9.
Let ( X 1 , X 2 ) and ( Y 1 , Y 2 ) have GBD models with base distribution vectors ( F U 1 , F U 2 , F U 3 ) and ( F V 1 , F V 2 , F V 3 ) , respectively.
1. 
If U_i ≤_{st} V_i (i = 1, 2, 3), then X_i ≤_{st} Y_i (i = 1, 2).
2. 
If U_i ≤_{rh} V_i (i = 1, 2, 3), then X_i ≤_{rh} Y_i (i = 1, 2).
Finally, we discuss some stochastic properties of the minimum and maximum order statistics of the GBD family. In this setting, from (7), the reversed hazard function of the maximum statistic T 2 of ( X 1 , X 2 ) of GBD type is determined by the sum of the reversed hazard rates of the baseline distribution vector:
r T 2 ( x ) = r U 1 ( x ) + r U 2 ( x ) + r U 3 ( x )
when the pdf f U i of U i exists, i = 1 , 2 , 3 . Hence, the following result is immediate.
Theorem 10.
If U i D R H ( I R H ) ( i = 1 , 2 , 3 ), then T 2 D R H ( I R H ) .
Example 5.
Suppose U i E V 3 ( β , λ i , k i ) ( i = 1 , 2 , 3 ). Then, the reversed hazard function of T 2 is given by
r_{T_2}(x) = Σ_{i=1}^{3} λ_i k_i (β − x)^{k_i − 1},
and, therefore, if every k_i ≤ (≥) 1, i = 1, 2, 3, then r_{T_2} is increasing (decreasing) in x, i.e., T_2 ∈ IRH (DRH).
Example 6.
If U i G E ( θ i , λ i ) , then the maximum statistic of the EBGE model is D R H , T 2 D R H , since (11) is the sum of three decreasing functions.
Remark 5.
When the U_i s have a common distribution F_U, the GBD model has a maximum statistic whose cdf is F_U^3, and (11) can be written as r_{T_2}(x) = 3 r_U(x). In particular, if F_U ∈ DRH (IRH), then T_2 ∈ DRH (IRH).
Remark 6.
From Corollary 2, if U i P R H ( θ i ) with the same base distribution F B , T 2 P R H ( θ ) with base F B and θ = θ 1 + θ 2 + θ 3 , i.e., r T 2 ( x ) = θ r B ( x ) . Thus, T 2 D R H ( I R H ) if and only if F B D R H ( I R H ) .
Furthermore, the MIT of the maximum statistic of a GBD model ( X 1 , X 2 ) can be derived by
m_{T_2}(x) = (1/(F_{U_1}(x) F_{U_2}(x) F_{U_3}(x))) ∫_{−∞}^{x} F_{U_1}(y) F_{U_2}(y) F_{U_3}(y) dy,
for each specific baseline distribution vector ( F U 1 , F U 2 , F U 3 ) , when the integral exists. For instance, we will consider a particular case, similar to one used in Example 4.
Example 7.
Suppose ( X 1 , X 2 ) has a GBD model with U i P R H ( θ i ) and base distributions F B i E V 3 ( β , λ i , k = 2 ) for i = 1 , 2 , 3 , then each component U i E V 3 ( β , θ i λ i , 2 ) , and consequently, U i I M I T for i = 1 , 2 , 3 . Moreover, from Corollary 2, the maximum statistic T 2 E V 3 ( β , θ * , 2 ) with θ * = θ 1 λ 1 + θ 2 λ 2 + θ 3 λ 3 . Thus, T 2 I M I T which is obtained along the same line as Example 4, since
m_{T_2}(x) = Φ(x; β, (2θ*)^{−1/2})/ϕ(x; β, (2θ*)^{−1/2}), for x ≤ β.
Regarding the minimum statistic T_1 of ( X 1 , X 2 ) of GBD type, some preservation results are also obtained based on its reversed hazard rate r_{T_1}; the proofs are given in Appendix A. From (7), r_{T_1} can be written as
r T 1 ( x ) = r U 1 : 2 ( x ) + r U 3 ( x ) .
Theorem 11.
If U_i ∈ DRH (i = 1, 2, 3) and U_{1:2} ≤_{rh} U_i (i = 1, 2), then T_1 ∈ DRH.
Corollary 4.
If U i D R H ( i = 1 , 2 , 3 ) and U 1 = s t U 2 , then T 1 D R H .
Example 8.
Suppose U i G E ( θ , λ ) for i = 1 , 2 and U 3 G E ( θ 3 , λ 3 ) , then U i D R H , and, consequently, T 1 D R H from Corollary 4.
Remark 7.
Note that, when the U_i s have a common distribution F_U, (12) can be expressed as r_{T_1}(x) = r_U(x) (3 − 2/(2 − F_U(x))), and from Corollary 4, it is immediate that, if F_U ∈ DRH, then T_1 ∈ DRH.
Theorem 12.
Let ( X 1 , X 2 ) be a GBD model. Then, T_1 ≤_{rh} T_2.
Proof. 
From (11) and (12), the statement is equivalent to r_{U_{1:2}}(x) ≤ r_{U_{2:2}}(x), which readily follows from Theorem 1.B.56 of Shaked and Shanthikumar [35], since the baseline components U_i s are independent. □

5.3. Copula and Related Association Measures

Let us see now the copula representation of the GBD family and some related dependence measures of interest in the analysis of two-dimensional data.
It is well known that the dependence between the random variables X 1 and X 2 is completely described by the joint cdf F ( x 1 , x 2 ) , and it is often represented by a copula which describes the dependence structure in a separate form from the marginal behaviour. In this setting, from Sklar’s theorem (e.g., see [39]), if its marginal cdfs F X i s are absolutely continuous, then the joint cdf has a unique copula representation for
F ( x 1 , x 2 ) = C F X 1 ( x 1 ) , F X 2 ( x 2 ) ,
and reciprocally, if F X i 1 is the inverse function of F X i ( i = 1 , 2 ), then there exists a unique copula C in [ 0 , 1 ] 2 , such that
C(u_1, u_2) = F(F_{X_1}^{−1}(u_1), F_{X_2}^{−1}(u_2)).
Now, we can derive the copula representation for the joint cdf of the GBD family as a function of its base distribution vector ( F U 1 , F U 2 , F U 3 ) . In order to do this, by using (5), the joint cdf (1) can be expressed as
F(x_1, x_2) = F_{X_1}(x_1) F_{X_2}(x_2) F_{U_3}(min(x_1, x_2))/(F_{U_3}(x_1) F_{U_3}(x_2))
and taking u i = F X i ( x i ) , the associated copula for an arbitrary base distribution vector ( F U 1 , F U 2 , F U 3 ) can be written as
C(u_1, u_2) = u_1 u_2 min(A_1(u_1), A_2(u_2))/(A_1(u_1) A_2(u_2)),
where
A_i(u_i) = F_{U_3}((F_{U_i} × F_{U_3})^{−1}(u_i)), i = 1, 2,
which allows us to give an additional result.
Theorem 13.
Let X = ( X 1 , X 2 ) and Y = ( Y 1 , Y 2 ) be two GBD models with baseline distribution vectors ( F U 1 , F U 2 , F U 3 ) and ( F V 1 , F V 2 , F V 3 ) , respectively. If X and Y have the same associated copula and U_i ≤_{st} V_i, then X ≤_{st} Y.
Proof. 
It is immediate by using Theorem 6.B.14 of Shaked and Shanthikumar [35] and (5), since U_i ≤_{st} V_i implies X_i ≤_{st} Y_i. □
Corollary 5.
Let X = ( X 1 , X 2 ) and Y = ( Y 1 , Y 2 ) be two GBD models with common baseline distributions, F_U and F_V, respectively. If U ≤_{st} V, then X ≤_{st} Y.
Note that (13) provides a general formula to establish the specific copula upon considering two particular continuous and increasing bijective functions A 1 and A 2 from [ 0 , 1 ] onto [ 0 , 1 ] . Fang and Li [40] analyzed some stochastic orderings for an equivalent copula representation to (13) with interesting applications in network security and insurance. In the last section, we shall use the bivariate copula representation (13) to discuss the multivariate extension of the GBD family.
Furthermore, (13) may be considered a generalization of the Marshall–Olkin copula, as displayed in the following results whose proofs are omitted.
Corollary 6.
If ( X 1 , X 2 ) has a GBD model with a common base distribution F U , then the copula representation of its joint cdf is
C ( u 1 , u 2 ) = min ( u 1 u 2 1 / 2 , u 1 1 / 2 u 2 ) .
Corollary 7.
If ( X 1 , X 2 ) has a GBD model with PRHs baseline distribution vector of the same base F B , i.e., ( X 1 , X 2 ) B P R H ( θ 1 , θ 2 , θ 3 ) , then the copula representation of its joint cdf is
C ( u 1 , u 2 ) = min u 1 u 2 θ 2 / ( θ 2 + θ 3 ) , u 1 θ 1 / ( θ 1 + θ 3 ) u 2 .
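Both copulas are straightforward to evaluate; the sketch below implements them and checks the uniform-margin property on part of the boundary (the parameter values are hypothetical).

```python
import numpy as np

def copula_common_base(u1, u2):
    # Corollary 6: C(u1, u2) = min(u1 * u2^{1/2}, u1^{1/2} * u2).
    return np.minimum(u1 * np.sqrt(u2), np.sqrt(u1) * u2)

def copula_bprh(u1, u2, t1, t2, t3):
    # Corollary 7: C(u1, u2) = min(u1 * u2^{t2/(t2+t3)}, u1^{t1/(t1+t3)} * u2).
    return np.minimum(u1 * u2 ** (t2 / (t2 + t3)), u1 ** (t1 / (t1 + t3)) * u2)

# Boundary checks: C(u, 1) = u and C(1, u) = u for a copula.
u = np.linspace(0.0, 1.0, 6)
print(copula_common_base(u, 1.0))
print(copula_bprh(1.0, u, 2.0, 1.0, 3.0))
```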
Some association measures for a bivariate random vector ( X 1 , X 2 ) of GBD type can be derived from the dependence structure described by the general expression (13) for each particular pair of continuous and increasing bijective functions A 1 and A 2 determined by the specific baseline distribution vector. For instance, for the special GBD models given in Corollaries 6 and 7, the measures of dependence namely Kendall’s tau, Spearman’s rho, Blomqvist’s beta, and tail dependence coefficients, see Nelsen [39] among others, can be calculated as follows.
Kendall’s tau. The Kendall’s τ is defined as the probability of concordance minus the probability of discordance between two pairs of independent and identically distributed random vectors, ( X 1 , X 2 ) and ( Y 1 , Y 2 ) , as follows:
τ = P[(X_1 − Y_1)(X_2 − Y_2) > 0] − P[(X_1 − Y_1)(X_2 − Y_2) < 0],
and it can be calculated through its copula representation C ( u 1 , u 2 ) by
τ = 4E[C(U_1, U_2)] − 1 = 1 − 4 ∫_{[0,1]^2} (∂C(u_1, u_2)/∂u_1)(∂C(u_1, u_2)/∂u_2) du_1 du_2
with U i s uniform [ 0 , 1 ] random variables whose joint cdf is C.
For example, if ( X 1 , X 2 ) has a GBD model with a common baseline F U , upon substituting from the copula of Corollary 6 in (14), it is easy to check that Kendall’s τ = 1 / 3 .
Analogously, from the copula given in Corollary 7 of the GBD model for P R H ( θ i ) components with a common base F B , the Kendall’s τ coefficient (14) can be written as
τ = θ_3/(θ_1 + θ_2 + θ_3).
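This value of Kendall's τ can be checked by simulation, using the definition in terms of two independent copies of the random vector. The sketch below samples a BPRH model with a common exponential base (so U_i ∼ GE(θ_i, λ)) by inverse transform; the parameter values are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(2)

def sample_bprh_exp_base(n, thetas, lam=1.0):
    # U_i ~ PRH(theta_i) with a common Exp(lam) base, i.e. U_i ~ GE(theta_i, lam),
    # sampled by inverse transform; then X1 = max(U1, U3), X2 = max(U2, U3).
    p = rng.uniform(size=(3, n))
    u = -np.log1p(-p ** (1.0 / np.asarray(thetas)[:, None])) / lam
    return np.maximum(u[0], u[2]), np.maximum(u[1], u[2])

thetas = (1.0, 2.0, 3.0)
n = 100_000
x = sample_bprh_exp_base(n, thetas)
y = sample_bprh_exp_base(n, thetas)   # an independent copy, as in the definition of tau

sign = np.sign((x[0] - y[0]) * (x[1] - y[1]))
tau_mc = np.mean(sign > 0) - np.mean(sign < 0)
print(tau_mc, thetas[2] / sum(thetas))   # Monte Carlo value vs. theta3/(theta1+theta2+theta3)
```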
Spearman’s rho. The Spearman’s ρ coefficient measures the dependence by three pairs of independent and identically distributed random vectors, ( X 1 , X 2 ) , ( Y 1 , Y 2 ) and ( Z 1 , Z 2 ) . It is defined as
ρ = 3[P((X_1 − Y_1)(X_2 − Z_2) > 0) − P((X_1 − Y_1)(X_2 − Z_2) < 0)],
which can be computed from its copula representation C ( u 1 , u 2 ) as
ρ = 12E[U_1 U_2] − 3.
Thus, if there is a common base distribution as in Corollary 6, the Spearman’s ρ coefficient between X 1 and X 2 is ρ = 3 / 7 .
In the case of U i P R H ( θ i ) with a common base distribution F B , from (15) and Corollary 7, this association measure is
ρ = 3θ_3/(2θ_1 + 2θ_2 + 3θ_3)
which coincides with the one obtained by Kundu et al. [9] for this specific GBD model, ( X 1 , X 2 ) ∼ BPRH(θ_1, θ_2, θ_3). As remarked by Kundu et al. [9] for the BPRH model, both coefficients, τ and ρ, vary between 0 and 1 as θ_3 varies from 0 to ∞.
Blomqvist’s Beta. The Blomqvist’s β coefficient, also called the medial correlation coefficient, is defined as the probability of concordance minus the probability of discordance between ( X 1 , X 2 ) and its median point, say ( m 1 , m 2 ) , taking the following form:
β = P[(X_1 − m_1)(X_2 − m_2) > 0] − P[(X_1 − m_1)(X_2 − m_2) < 0] = 4F(m_1, m_2) − 1,
and from the copula of its joint cdf F, it can be expressed as
β = 4C(1/2, 1/2) − 1.
In the case of Corollary 6, it is immediate that the medial correlation coefficient between X_1 and X_2 is β = √2 − 1 when it follows a GBD model with a common baseline distribution.
In the other case, from Corollary 7, the Blomqvist’s β coefficient (16) is also readily obtainable between the marginals of a BPRH model:
β = 2^{θ_3/(θ_2 + θ_3)} − 1, if θ_1 ≤ θ_2; 2^{θ_3/(θ_1 + θ_3)} − 1, if θ_1 > θ_2,
which takes values between 0 and 1 as θ_3 varies from 0 to ∞.
Tail Dependence. The tail dependence measures the association of extreme events in both directions; the upper (lower) tail dependence λ_U (λ_L) provides an asymptotic association measure in the upper (lower) quadrant tail of a bivariate random vector, given by (if it exists)
λ_U (λ_L) = lim_{u→1^− (0^+)} P(X_2 > (≤) F_{X_2}^{−1}(u) | X_1 > (≤) F_{X_1}^{−1}(u)).
Similar to the above association coefficients, the tail dependence indexes can be calculated from the copula representation C ( u 1 , u 2 ) of the joint cdf of ( X 1 , X 2 ) , as follows:
λ_U = 2 − lim_{u→1^−} (1 − C(u, u))/(1 − u) and λ_L = lim_{u→0^+} C(u, u)/u.
In particular, if ( X 1 , X 2 ) follows a GBD model with a common baseline distribution, upon substituting from the copula of Corollary 6 in (17), it is easy to check that λ L = 0 and λ U = 1 / 2 .
In the case of U i P R H ( θ i ) with the same base, from (17) and Corollary 7, it is clear that the tail dependence indexes of the BPRH model are λ L = 0 and
λ_U = θ_3/(θ_2 + θ_3), if θ_1 ≤ θ_2; θ_3/(θ_1 + θ_3), if θ_1 > θ_2,
which takes values between 0 and 1 as θ_3 varies from 0 to ∞.

6. Maximum Likelihood Estimation

In this section, we address the problem of computing the maximum likelihood estimations (MLEs) of the unknown parameters based on a random sample. The problem can be formulated as follows. Suppose { ( x_{1i} , x_{2i} ); i = 1, …, n } is a random sample of size n from a GBD model, where it is assumed that, for j = 1, 2, 3, U_j has the pdf f_{U_j}(u; θ_j) and θ_j is of dimension p_j. The objective is to estimate the unknown parameter vector θ = (θ_1, θ_2, θ_3). We use the following partition of the sample:
I 1 = { i : x 1 i < x 2 i } , I 2 = { i : x 1 i > x 2 i } , I 0 = { i : x 1 i = x 2 i = x i } .
Based on the above observations, the log-likelihood function becomes
ℓ(θ) = Σ_{i ∈ I_0} ln f_0(x_i; θ) + Σ_{i ∈ I_1} ln f_1(x_{1i}, x_{2i}; θ) + Σ_{i ∈ I_2} ln f_2(x_{1i}, x_{2i}; θ),
where f 0 ( x i ; θ ) , f 1 ( x 1 i , x 2 i ; θ ) , f 2 ( x 1 i , x 2 i ; θ ) have been defined in Theorem 3.
Here, it is difficult to compute the MLEs of the unknown parameter vector θ by solving a (p_1 + p_2 + p_3)-dimensional optimization problem. To avoid that, we suggest using the EM algorithm; the basic idea is to consider a random sample of size n from ( U_1 , U_2 , U_3 ) instead of the random sample of size n from ( X_1 , X_2 ). From the observed sample { ( x_{1i} , x_{2i} ) }, the sample { ( u_{1i} , u_{2i} , u_{3i} ); i = 1, …, n } has missing values as shown in Table 1. It is immediate that the MLEs of θ_1, θ_2 and θ_3 can be obtained by solving the following three optimization problems of dimensions p_1, p_2 and p_3, respectively,
ℓ_j(θ_j) = Σ_{i=1}^{n} ln f_{U_j}(u_{ji}; θ_j); j = 1, 2, 3,
which are computationally more tractable.
From Table 1, if i ∈ I_0, then u_{3i} is known, and u_{1i} and u_{2i} are unknown. Similarly, if i ∈ I_1 (i ∈ I_2), then u_{2i} (u_{1i}) and max{u_{1i}, u_{3i}} (max{u_{2i}, u_{3i}}) are known. Hence, in the E-step of the EM algorithm, the ‘pseudo’ log-likelihood function is formed by replacing each missing u_{ji} by its expected value, u_{ji}^m(θ), for i = 1, …, n and j = 1, 2, 3:
  • If i ∈ I_0, then
    u_{ji}^m(θ) = E(U_j | U_j < x_i) = (1/F_{U_j}(x_i)) ∫_{−∞}^{x_i} u f_{U_j}(u) du, j = 1, 2.
  • If i ∈ I_1 and j, k ∈ {1, 3}, j ≠ k, then
    u_{ji}^m(θ) = E(U_j | max{U_1, U_3} = x_{1i}) = x_{1i} P(U_j > U_k) + P(U_j < U_k) (1/F_{U_j}(x_{1i})) ∫_{−∞}^{x_{1i}} u f_{U_j}(u) du = x_{1i} ∫_{−∞}^{∞} f_{U_j}(u) F_{U_k}(u) du + (1/F_{U_j}(x_{1i})) ∫_{−∞}^{∞} f_{U_k}(u) F_{U_j}(u) du · ∫_{−∞}^{x_{1i}} u f_{U_j}(u) du.
  • If i ∈ I_2 and j, k ∈ {2, 3}, j ≠ k, then
    u_{ji}^m(θ) = E(U_j | max{U_2, U_3} = x_{2i}) = x_{2i} P(U_j > U_k) + P(U_j < U_k) (1/F_{U_j}(x_{2i})) ∫_{−∞}^{x_{2i}} u f_{U_j}(u) du = x_{2i} ∫_{−∞}^{∞} f_{U_j}(u) F_{U_k}(u) du + (1/F_{U_j}(x_{2i})) ∫_{−∞}^{∞} f_{U_k}(u) F_{U_j}(u) du · ∫_{−∞}^{x_{2i}} u f_{U_j}(u) du.
Therefore, we propose the following EM algorithm to compute the MLEs of θ . Suppose at the k-th iteration of the EM algorithm, the value of θ is θ ( k ) = ( θ 1 ( k ) , θ 2 ( k ) , θ 3 ( k ) ), then the following steps can be used to compute θ ( k + 1 ) :
E-step
  • At the k-th step for i I 0 , obtain the missing u 1 i and u 2 i as u 1 i m ( θ ( k ) ) and u 2 i m ( θ ( k ) ) , respectively. For i I 1 obtain the missing u 1 i and u 3 i as u 1 i m ( θ ( k ) ) and u 3 i m ( θ ( k ) ) , respectively. Similarly, for i I 2 , obtain the missing u 2 i and u 3 i as u 2 i m ( θ ( k ) ) and u 3 i m ( θ ( k ) ) , respectively.
  • Form the ‘pseudo’ log-likelihood function as ℓ_s^{(k)}(θ) = ℓ_{1s}^{(k)}(θ_1) + ℓ_{2s}^{(k)}(θ_2) + ℓ_{3s}^{(k)}(θ_3), where
    ℓ_{1s}^{(k)}(θ_1) = Σ_{i ∈ I_0} ln f_{U_1}(u_{1i}^m(θ^{(k)}); θ_1) + Σ_{i ∈ I_1} ln f_{U_1}(u_{1i}^m(θ^{(k)}); θ_1) + Σ_{i ∈ I_2} ln f_{U_1}(u_{1i}; θ_1)
    ℓ_{2s}^{(k)}(θ_2) = Σ_{i ∈ I_0} ln f_{U_2}(u_{2i}^m(θ^{(k)}); θ_2) + Σ_{i ∈ I_1} ln f_{U_2}(u_{2i}; θ_2) + Σ_{i ∈ I_2} ln f_{U_2}(u_{2i}^m(θ^{(k)}); θ_2)
    ℓ_{3s}^{(k)}(θ_3) = Σ_{i ∈ I_0} ln f_{U_3}(u_{3i}; θ_3) + Σ_{i ∈ I_1} ln f_{U_3}(u_{3i}^m(θ^{(k)}); θ_3) + Σ_{i ∈ I_2} ln f_{U_3}(u_{3i}^m(θ^{(k)}); θ_3).
M-step
  • θ^{(k+1)} = (θ_1^{(k+1)}, θ_2^{(k+1)}, θ_3^{(k+1)}) can be obtained by maximizing ℓ_{1s}^{(k)}(θ_1), ℓ_{2s}^{(k)}(θ_2) and ℓ_{3s}^{(k)}(θ_3) with respect to θ_1, θ_2 and θ_3, respectively.
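The following sketch specializes the above EM scheme to Model I of Section 7 (exponential baselines U_j ∼ Exp(λ_j)), for which the E-step expectations have simple closed forms and the M-step reduces to the exponential MLE λ_j = n/Σ_i u_{ji}. It is a rough illustration run on simulated data, not the authors' implementation, and the parameter values are hypothetical.

```python
import numpy as np

def cond_mean_below(lam, x):
    # E(U | U < x) for U ~ Exp(lam): 1/lam - x e^{-lam x} / (1 - e^{-lam x}).
    return 1.0 / lam - x * np.exp(-lam * x) / (1.0 - np.exp(-lam * x))

def e_step_pair(lam_j, lam_k, x):
    # Imputation mirroring the E-step above for a missing U_j when max(U_j, U_k) = x is observed:
    # x * P(U_j > U_k) + P(U_j < U_k) * E(U_j | U_j < x), with P(U_j > U_k) = lam_k / (lam_j + lam_k).
    p_j_wins = lam_k / (lam_j + lam_k)
    return x * p_j_wins + (1.0 - p_j_wins) * cond_mean_below(lam_j, x)

def em_model_I(x1, x2, lam_init=(1.0, 1.0, 1.0), n_iter=200, tol=1e-8):
    """EM sketch for the GBD model with Exp(lam_j) baselines (Model I)."""
    x1, x2 = np.asarray(x1, float), np.asarray(x2, float)
    n = len(x1)
    I0, I1, I2 = (x1 == x2), (x1 < x2), (x1 > x2)
    lam = np.asarray(lam_init, float)
    for _ in range(n_iter):
        u = np.empty((3, n))
        # i in I0: u3 = x observed; u1 and u2 imputed by E(U_j | U_j < x).
        u[2, I0] = x1[I0]
        u[0, I0] = cond_mean_below(lam[0], x1[I0])
        u[1, I0] = cond_mean_below(lam[1], x1[I0])
        # i in I1: u2 = x2 observed, max(u1, u3) = x1 observed; u1 and u3 imputed.
        u[1, I1] = x2[I1]
        u[0, I1] = e_step_pair(lam[0], lam[2], x1[I1])
        u[2, I1] = e_step_pair(lam[2], lam[0], x1[I1])
        # i in I2: u1 = x1 observed, max(u2, u3) = x2 observed; u2 and u3 imputed.
        u[0, I2] = x1[I2]
        u[1, I2] = e_step_pair(lam[1], lam[2], x2[I2])
        u[2, I2] = e_step_pair(lam[2], lam[1], x2[I2])
        # M-step: exponential MLE on each completed column, lam_j = n / sum_i u_ji.
        lam_new = n / u.sum(axis=1)
        if np.max(np.abs(lam_new - lam)) < tol:
            return lam_new
        lam = lam_new
    return lam

# Hypothetical usage on data simulated from the model itself.
rng = np.random.default_rng(3)
u = rng.exponential(scale=[[1 / 0.03], [1 / 0.05], [1 / 0.04]], size=(3, 500))
print(em_model_I(np.maximum(u[0], u[2]), np.maximum(u[1], u[2])))
```

The stopping rule and starting values here are arbitrary; in practice, the pseudo log-likelihood can be monitored across iterations, as in the examples of Section 7.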
Mainly for illustrative purposes, two particular GBD models will be applied in the next section to show the usefulness of the above EM algorithm. Firstly, we shall consider a GBD model with baseline components having the same distribution type and different underlying parameters. Secondly, we shall use a GBD model with baseline components from different distribution families. The technical details of both of them can be found in Appendix B.

7. Data Analysis

In this section, we present the analysis of two-dimensional data sets in order to show how the proposed EM algorithm can be applied to fit particular GBD models. For that, we shall suppose the following two models described in Appendix B: Model I is the GBD model with the exponential baseline distributions and different underlying parameters, U j E x p ( λ j ) ( j = 1 , 2 , 3 ). Model II is the GBD model with baseline components from Weibull and generalized exponential distributions, U 1 W ( λ 1 , α 1 ) , U 2 W ( λ 2 , α 2 ) and U 3 G E ( α 3 , λ 3 ) .

7.1. Soccer Data

We have analyzed a UEFA Champions League data set [41], played during the seasons 2004–2005 and 2005–2006. This data set contains the matches in which at least one goal was scored by a direct kick (penalty kick, foul kick or any other direct kick) by either team and at least one goal was scored by the home team. Here, in the bivariate data ( X 1 , X 2 ) , X 1 represents the time in minutes of the first kick goal scored by any team and X 2 represents the time in minutes of the first goal scored by the home team. Clearly, all possibilities exist in the data set, namely X 1 < X 2 , X 1 > X 2 and X 1 = X 2 .
Meintanis [41] analyzed this data set using the Marshall–Olkin bivariate exponential model. The marginals of the Marshall–Olkin bivariate exponential distribution are exponential, and then they have constant hazard functions. A preliminary data analysis indicated that the empirical hazard function of both the marginals are increasing and their reversed hazard functions are decreasing. Hence, it may not be proper to use the Marshall–Olkin bivariate exponential model to analyze this data.
Example 9.
In order to fit Model I, we have started with the initial guess λ 1 ( 0 ) = λ 2 ( 0 ) = λ 3 ( 0 ) = 1 . The algorithm stops after eight iterations; the final estimates and the associated 95% confidence intervals are λ ^ 1 = 0.03126 ( ± 0.01121 ) , λ ^ 2 = 0.04630 ( ± 0.01563 ) and λ ^ 3 = 0.04269 ( ± 0.01875 ) , with 257.8871 being the pseudo log-likelihood value. To check whether it has converged to the maximum or not, the performance of the EM algorithm may be compared with the experimental results obtained by using a quasi-Newton method for solving the constrained nonlinear optimization problem; these results, together with the corresponding ones for the subsequent examples, are summarized in Appendix C.
One natural question is whether Model I fits the bivariate data or not. We have computed the Kolmogorov–Smirnov (KS) distances, with the corresponding p-values, between the empirical and fitted cdfs for the marginals and the maximum order statistic. The results are reported in Table 2, and, from them, we cannot reject the null hypothesis that these data come from the GBD model with exponential baseline distributions.
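For reference, a goodness-of-fit check of this kind can be carried out with scipy.stats.kstest and the fitted marginal and maximum-statistic cdfs of Model I; since the soccer data are not reproduced here, the arrays below are placeholders only.

```python
import numpy as np
from scipy import stats

# Fitted Model I rates from Example 9 (treated here as given).
l1, l2, l3 = 0.03126, 0.04630, 0.04269

def cdf_X1(x):  # marginal of X1: F_{U1}(x) F_{U3}(x)
    return (1 - np.exp(-l1 * x)) * (1 - np.exp(-l3 * x))

def cdf_T2(x):  # maximum statistic: product of the three baseline cdfs
    return (1 - np.exp(-l1 * x)) * (1 - np.exp(-l2 * x)) * (1 - np.exp(-l3 * x))

# 'x1' and 't2' stand for the observed marginal and maximum samples (placeholder values only).
x1 = np.array([26.0, 63.0, 19.0, 66.0, 40.0, 49.0])
t2 = np.maximum(x1, np.array([20.0, 18.0, 19.0, 85.0, 40.0, 51.0]))
print(stats.kstest(x1, cdf_X1))
print(stats.kstest(t2, cdf_T2))
```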
Example 10.
Let us consider now Model II. We have started the EM algorithm with the initial guesses as α 1 ( 0 ) = α 2 ( 0 ) = α 3 ( 0 ) = 1 , λ 1 ( 0 ) = 0.03 , λ 2 ( 0 ) = 0.05 and λ 3 ( 0 ) = 0.04 . The algorithm converges in nineteen iterations, the final estimates and the associated 95% confidence intervals are α ^ 1 = 1.2987 ( ± 0.3124 ) , λ ^ 1 = 0.0097 ( ± 0.0005 ) , α ^ 2 = 0.8047 ( ± 0.2823 ) , λ ^ 2 = 0.0093 ( ± 0.0021 ) , α ^ 3 = 1.0037 ( ± 0.2879 ) , λ ^ 3 = 0.0369 ( ± 0.008 ) , with 201.1141 being the pseudo log-likelihood value.
The KS distances with the corresponding p-values for the marginals and the maximum statistic are reported in Table 2. Thus, based on the p-values, we can say that the GBD model with two baseline Weibull distributions and the third GE one fits the data reasonably well.
Summarizing, it is clear that both GBD models provide a good fit to the given data set and the EM algorithm also works quite effectively in both cases. Now, to compare Models I and II of Examples 9 and 10 and determine which provides a better fit, we compute the Akaike information criterion (AIC) and Bayesian information criterion (BIC) values, which are also presented in Table 2. Therefore, based on the AIC and BIC values, it is clear that Model I provides a better fit than Model II to the UEFA Champions League data set.

7.2. Diabetic Retinopathy Data

Let us consider now the diabetic retinopathy data set [42], available in the R package “SurvCor” [43]. Such data were investigated by the National Eye Institute to assess the effect of laser photocoagulation in delaying the onset of severe visual loss such as blindness in 197 patients with diabetic retinopathy. For each patient, one eye was randomly selected for laser photocoagulation and the other was given no treatment, being used as the control. The times to blindness in both eyes were recorded in months and the censoring was caused by death, dropout, or the end of the study.
For illustrative purposes, we have considered those patients for whom complete data are available. Here, $X_1$ denotes the time to blindness of the untreated (control) eye and $X_2$ denotes the time to blindness of the treated eye. Out of the 197 patients, we have complete information on $X_1$ and $X_2$ for 38 patients.
Example 11.
As in Example 9, we have used Model I to analyze the data set, with the same initial guess $\lambda_1^{(0)}=\lambda_2^{(0)}=\lambda_3^{(0)}=1$. The proposed EM algorithm stopped after 14 iterations, and the estimates of the unknown parameters with the corresponding 95% confidence intervals are $\hat{\lambda}_1=0.0653\ (\pm 0.0175)$, $\hat{\lambda}_2=0.0737\ (\pm 0.0210)$ and $\hat{\lambda}_3=0.1345\ (\pm 0.3879)$, with an associated pseudo log-likelihood value of −172.2314.
The KS distances with the corresponding p-values between the empirical and fitted cdfs for the marginals and the maximum statistic are presented in Table 3.
Example 12.
As in Example 10, we have analyzed the data set by using Model II. We started the EM algorithm with the initial guesses $\alpha_1^{(0)}=\alpha_2^{(0)}=\alpha_3^{(0)}=1$, $\lambda_1^{(0)}=0.06$, $\lambda_2^{(0)}=0.07$ and $\lambda_3^{(0)}=0.13$. The algorithm stopped after 27 iterations; the final estimates and the corresponding 95% confidence intervals are $\hat{\alpha}_1=1.0937\ (\pm 0.2563)$, $\hat{\lambda}_1=0.0447\ (\pm 0.0146)$, $\hat{\alpha}_2=0.5851\ (\pm 0.1345)$, $\hat{\lambda}_2=0.2369\ (\pm 0.0763)$, $\hat{\alpha}_3=0.8995\ (\pm 0.2787)$ and $\hat{\lambda}_3=0.1898\ (\pm 0.0478)$, with an associated pseudo log-likelihood value of −125.4519.
The KS distances with the corresponding p-values for the marginals and the maximum order statistic are presented in Table 3.
From Table 3, we can say that the estimated GBD models fit the diabetic retinopathy data reasonably well in both cases. We also present the AIC and BIC values of the two models in Table 3. Based on these values, it is clear that Model I provides a better fit than Model II for the diabetic retinopathy data.

8. Discussion and Conclusions

In this paper, we have presented the generalized bivariate distribution (GBD) family, built by a generator based on the maximization process applied to an arbitrary three-dimensional vector of independent continuous baseline components, which produces bivariate models with a dependence structure.
For the proposed GBD family, several distributional and stochastic properties have been established. The preservation of the PRH property for the marginals and for the maximum order statistic has been obtained. Positive dependence between the two marginals of a GBD model has been shown, together with results on stochastic orders and on the preservation of the monotonicity of the reversed hazard function and of the mean inactivity time. Furthermore, the copula representation of the GBD model has been discussed, providing a general formula, and some related dependence measures have been calculated for specific copulas of particular bivariate distributions of the GBD family. In addition, new bivariate distributions can be generated by combining independent baseline components from different distribution families, and several bivariate distributions given in the literature are derived as particular cases of the GBD family.
Note that, even in the simplest case, the MLEs cannot be obtained in explicit form, and a multidimensional nonlinear optimization problem must be solved. We have proposed an EM algorithm to compute the MLEs of the unknown parameters, and the proposed EM algorithm performs quite satisfactorily in the two data analyses using two different models of the GBD family. The experimental results summarized in Table A1 show the efficiency of the EM algorithm with respect to a conventional Newton-type numerical iterative procedure. In more detail, Table A1 presents the results obtained with the Broyden–Fletcher–Goldfarb–Shanno algorithm for maximizing the log-likelihood function, available in the R package "maxLik" [44].
It is worth mentioning that the bivariate copula representation (13) allows us to discuss its multivariate extension. Let $U_i$, $i=1,\dots,q+1$, be $q+1$ mutually independent random variables with arbitrary continuous distribution functions, and denote by $F_{U_i}$ the cdf of each $U_i$. Similarly to (1), the joint cdf of the $q$-dimensional random vector $(X_1,\dots,X_q)$ with $X_i=\max(U_i,U_{q+1})$ is given by
$$F(x_1,\dots,x_q)=F_{U_{q+1}}\big(\min(x_1,\dots,x_q)\big)\prod_{i=1}^{q}F_{U_i}(x_i),$$
which can be considered as a generator of $q$-dimensional distribution models, called the generalized multivariate distribution (GMD) family with baseline distribution vector $(F_{U_1},\dots,F_{U_{q+1}})$. Hence, the $q$-dimensional copula representation of this GMD family can be expressed as
$$C(u_1,\dots,u_q)=\prod_{i=1}^{q}u_i\,\frac{\min_{i=1,\dots,q}A_i(u_i)}{\prod_{i=1}^{q}A_i(u_i)},$$
where
$$A_i(u_i)=F_{U_{q+1}}\big((F_{U_i}\times F_{U_{q+1}})^{-1}(u_i)\big),\quad\text{for } i=1,\dots,q.$$
From these q-dimensional joint cdf and copula, many distributional and stochastic properties established for the GBD family are extensible to the GMD family. Furthermore, by using this generator of multivariate distributions, the special bivariate models given in Section 3 can be easily extended to the multivariate case, which contain multivariate versions of bivariate distributions given in the literature.
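To make the construction concrete, the following Python sketch simulates from a GMD model via $X_i=\max(U_i,U_{q+1})$ and checks the joint cdf formula above by Monte Carlo; the exponential baselines and the rate values used here are illustrative assumptions only and are not part of the paper's analysis.

```python
import numpy as np

rng = np.random.default_rng(0)

def gmd_sample(n, baseline_rvs):
    """Draw n observations from a GMD model.

    baseline_rvs is a list of q+1 functions, each returning n independent
    draws from one baseline distribution F_{U_i}; the last entry plays the
    role of the shared latent component U_{q+1}.
    """
    u = np.column_stack([rv(n) for rv in baseline_rvs])   # n x (q+1)
    shared = u[:, -1][:, None]                             # latent U_{q+1}
    return np.maximum(u[:, :-1], shared)                   # X_i = max(U_i, U_{q+1})

def gmd_cdf(x, baseline_cdfs):
    """Joint cdf F(x_1,...,x_q) = F_{U_{q+1}}(min x) * prod_i F_{U_i}(x_i)."""
    x = np.asarray(x, dtype=float)
    *marginals, shared_cdf = baseline_cdfs
    value = shared_cdf(x.min())
    for F, xi in zip(marginals, x):
        value *= F(xi)
    return value

# Example with q = 3 and exponential baselines (illustrative rates only)
rates = [0.03, 0.05, 0.04, 0.02]
rvs  = [lambda n, r=r: rng.exponential(1.0 / r, size=n) for r in rates]
cdfs = [lambda t, r=r: 1.0 - np.exp(-r * np.asarray(t)) for r in rates]

sample = gmd_sample(5000, rvs)
x0 = np.array([30.0, 25.0, 40.0])
print("theoretical F(x0):", round(gmd_cdf(x0, cdfs), 4))
print("empirical   F(x0):", round(np.mean(np.all(sample <= x0, axis=1)), 4))
```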

Author Contributions

Conceptualization, M.F., J.-M.V., and D.K.; methodology, M.F. and J.-M.V.; software, M.F. and D.K.; validation, M.F. and D.K.; formal analysis, M.F., J.-M.V., and D.K.; investigation, M.F., J.-M.V., and D.K.; writing—original draft preparation, M.F. and J.-M.V.; writing—review and editing, M.F., J.-M.V., and D.K.; supervision, M.F.; project administration, M.F. and J.-M.V. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially supported by the Spanish Ministry of Economy, Industry and Competitiveness, the European Regional Development Fund Program through grant TIN2017-85949-C2-1-R.

Acknowledgments

The authors would like to thank the editors and the anonymous reviewers for their comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof of Theorem 2.
First, taking into account the event $A=\{U_3>\max(U_1,U_2)\}$, the joint cdf can be expressed as
$$F(x_1,x_2)=P\big(U_1\le x_1,U_2\le x_2,U_3\le\min(x_1,x_2)\mid A\big)P(A)+P\big(U_1\le x_1,U_2\le x_2,U_3\le\min(x_1,x_2)\mid \bar{A}\big)P(\bar{A}),$$
where $\bar{A}$ is the complementary event of $A$. For $z=\min(x_1,x_2)$, note that
$$P\big(U_1\le x_1,U_2\le x_2,U_3\le z\mid A\big)=P\big(U_1\le x_1,U_2\le x_2,U_3\le z\mid U_1<U_3,U_2<U_3\big)=\frac{1}{P(A)}P\big(U_1\le U_3,U_2\le U_3,U_3\le z\big)=\frac{1}{P(A)}\int_{-\infty}^{z}F_{U_1}(u)F_{U_2}(u)\,dF_{U_3}(u).$$
Hence, it is immediate that $F_s(x_1,x_2)$ given by (3) is a singular cdf, as its mixed second partial derivatives are zero when $x_1\neq x_2$.
Thus, $\alpha=P(A)$ may be established as follows:
$$\alpha=P\big(U_3>\max(U_1,U_2)\big)=\int_{-\infty}^{\infty}P(U_1<u,U_2<u)\,dF_{U_3}(u)=\int_{-\infty}^{\infty}F_{U_1}(u)F_{U_2}(u)\,dF_{U_3}(u),$$
and, consequently, the bivariate cdf $F(x_1,x_2)$ can be rewritten as (2), where the absolutely continuous part $F_{ac}(x_1,x_2)$ can be obtained by subtraction:
$$F_{ac}(x_1,x_2)=P\big(U_1\le x_1,U_2\le x_2,U_3\le\min(x_1,x_2)\mid \bar{A}\big)=\frac{1}{1-\alpha}\big(F(x_1,x_2)-\alpha F_s(x_1,x_2)\big)=\frac{1}{1-\alpha}\Big(F_{U_1}(x_1)F_{U_2}(x_2)F_{U_3}(z)-\int_{-\infty}^{z}F_{U_1}(u)F_{U_2}(u)\,dF_{U_3}(u)\Big),$$
which completes the proof of the theorem. □
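As a numerical illustration of the role of $\alpha$ in this decomposition (not part of the original proof): for exponential baselines $Exp(\lambda_i)$, the integral defining $\alpha$ has the closed form $\alpha=1-\frac{\lambda_3}{\lambda_1+\lambda_3}-\frac{\lambda_3}{\lambda_2+\lambda_3}+\frac{\lambda_3}{\lambda_1+\lambda_2+\lambda_3}$, and $\alpha$ is exactly the probability of a tie $X_1=X_2$. A minimal Python check, using the Model I estimates of Example 9 purely as illustrative parameter values:

```python
import numpy as np

rng = np.random.default_rng(1)
lam1, lam2, lam3 = 0.03126, 0.04630, 0.04269   # illustrative values (Example 9)

# Closed-form alpha = P(U3 > max(U1, U2)) for exponential baselines
alpha = 1 - lam3/(lam1 + lam3) - lam3/(lam2 + lam3) + lam3/(lam1 + lam2 + lam3)

# Monte Carlo check: alpha equals the probability of a tie X1 = X2
n = 200_000
u1 = rng.exponential(1/lam1, n)
u2 = rng.exponential(1/lam2, n)
u3 = rng.exponential(1/lam3, n)
x1, x2 = np.maximum(u1, u3), np.maximum(u2, u3)
print(round(alpha, 4), round(np.mean(x1 == x2), 4))   # the two values should agree
```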
Proof of Theorem 3.
Let $\mu$, $\mu_s$ and $\mu_{ac}$ be the measures associated with $F$, $F_s$ and $F_{ac}$, respectively. Obviously, $\mu_{ac}$ is an absolutely continuous measure with respect to the two-dimensional Lebesgue measure, since
$$\mu_{ac}\big((-\infty,x_1]\times(-\infty,x_2]\big)=F_{ac}(x_1,x_2)=\int_{-\infty}^{x_1}\int_{-\infty}^{x_2}f_{ac}(u,v)\,dv\,du,$$
where the pdf associated with $F_{ac}$ in (4), $f_{ac}(u,v)=\frac{\partial^2}{\partial u\,\partial v}F_{ac}(u,v)$, can be written as
$$f_{ac}(x_1,x_2)=\begin{cases}\dfrac{1}{1-\alpha}f_1(x_1,x_2), & \text{if } x_1<x_2,\\[4pt] \dfrac{1}{1-\alpha}f_2(x_1,x_2), & \text{if } x_1>x_2,\\[4pt] 0, & \text{if } x_1=x_2=x.\end{cases}$$
On the other hand, $\mu_s$ is given by
$$\mu_s\big((-\infty,x_1]\times(-\infty,x_2]\big)=F_s(x_1,x_2)=F_s(z,z)=\frac{1}{\alpha}\int_{-\infty}^{z}F_{U_1}(u)F_{U_2}(u)\,dF_{U_3}(u),$$
where $z=\min(x_1,x_2)$, and so it can be expressed as an absolutely continuous measure $\mu_s^*$ with respect to the one-dimensional Lebesgue measure on the projection onto the line $\mathbb{R}$ of the intersection between $(-\infty,x_1]\times(-\infty,x_2]$ and the line $x_1=x_2$:
$$\mu_s\big((-\infty,x_1]\times(-\infty,x_2]\big)=\mu_s^*\big((-\infty,z]\big)=\int_{-\infty}^{z}f_s^*(u)\,du,$$
where $f_s^*(u)=\frac{1}{\alpha}F_{U_1}(u)F_{U_2}(u)f_{U_3}(u)$, which can also be written as $f_s^*(u)=\frac{1}{\alpha}f_0(u)$.
Furthermore, it is trivial that the line $x_1=x_2$ is a null set under the two-dimensional Lebesgue measure, and hence with respect to $\mu_{ac}$. In addition, its complement $\{(x_1,x_2)\in\mathbb{R}^2\mid x_1\neq x_2\}$ is a null set with respect to $\mu_s$, since its projection onto the line $\mathbb{R}$ is the empty set,
$$\mu_s\big(\{(x_1,x_2)\in\mathbb{R}^2\mid x_1\neq x_2\}\big)=\mu_s^*(\varnothing)=0,$$
and, consequently, the measures $\mu_s$ and $\mu_{ac}$ are mutually singular. Therefore, the measure associated with $F$,
$$\mu\big((-\infty,x_1]\times(-\infty,x_2]\big)=F(x_1,x_2)=\alpha\,\mu_s\big((-\infty,x_1]\times(-\infty,x_2]\big)+(1-\alpha)\,\mu_{ac}\big((-\infty,x_1]\times(-\infty,x_2]\big),$$
allows us to obtain the pdf of a GBD model with respect to $\mu$, given by
$$f(x_1,x_2)=\alpha f_s^*(x_1)I_{(x_1=x_2)}(x_1,x_2)+(1-\alpha)f_{ac}(x_1,x_2),$$
where $I_{(x_1=x_2)}$ is the indicator function of the set $\{x_1=x_2\}$. Hence, it is easy to check that
$$\int_{(-\infty,x_1]\times(-\infty,x_2]}f(u,v)\,d\mu=F(x_1,x_2)$$
for all $(x_1,x_2)\in\mathbb{R}^2$. □
Proof of Theorem 4.
From (1) and (5), the proof of (1) of Theorem 4 is straightforward.
In order to prove (2) of Theorem 4, from the joint pdf of a GBD model given in Theorem 3 and its marginal pdf (6), the conditional pdf $f_{i|X_j=x_j}$ can be expressed as
$$f_{i|X_j=x_j}(x_i)=\begin{cases}\dfrac{f_{X_i}(x_i)f_{U_j}(x_j)}{f_{X_j}(x_j)}, & \text{if } x_i<x_j,\\[6pt] f_{U_i}(x_i), & \text{if } x_i>x_j,\\[6pt] \dfrac{F_{U_1}(x_j)F_{U_2}(x_j)f_{U_3}(x_j)}{f_{X_j}(x_j)}, & \text{if } x_i=x_j.\end{cases}$$
By using the notation $\alpha_j=f_{i|X_j=x_j}(x_j)$, this conditional pdf can be readily rewritten as in the statement of Theorem 4. □
Proof of Theorem 11.
The reversed hazard function (12) of the minimum statistic can be rewritten as
$$r_{T_1}(x)=r_{U_1}(x)\,g_2(x)+r_{U_2}(x)\,g_1(x)+r_{U_3}(x),$$
where each $g_i$ is a positive function ($i=1,2$) defined by
$$g_i(x)=1-\frac{F_{U_i}(x)}{F_{U_{1:2}}(x)}.$$
Here, observe that $U_{1:2}\le_{rh}U_i$ implies the decreasing monotonicity of $g_i(x)$, and therefore $r_{T_1}$ is a sum of three decreasing functions, which completes the proof. □
Proof of Corollary 4.
The proof readily follows along the same line as Theorem 11, taking into account that (12) can be simplified by using
$$r_{U_{1:2}}(x)=2\,r_{U_i}(x)\,g_i(x),$$
where $g_i(x)=1-\dfrac{1}{2-F_{U_i}(x)}$ decreases in $x$. □

Appendix B

For the practical implementation of the EM algorithm in the data analysis applications, we give its technical details for two particular GBD models: first, one whose baseline components share the same distribution family (Model I), and then one with different baseline distributions (Model II).
Model I.
Suppose $U_1\sim Exp(\lambda_1)$, $U_2\sim Exp(\lambda_2)$ and $U_3\sim Exp(\lambda_3)$. To compute the MLEs of the unknown parameter vector $\theta=(\lambda_1,\lambda_2,\lambda_3)$, one needs to solve a three-dimensional optimization problem.
For implementation of the EM algorithm, we need the following expected values:
  • If $i\in I_0$, then
$$u_{1i}^{m}(\theta)=E(U_1\mid U_1<x_i)=H(x_i;\lambda_1),\qquad u_{2i}^{m}(\theta)=E(U_2\mid U_2<x_i)=H(x_i;\lambda_2),$$
    where
$$H(x;\lambda)=\frac{1}{\lambda}-\frac{x\,e^{-\lambda x}}{1-e^{-\lambda x}}.$$
  • If $i\in I_1$, then
$$u_{1i}^{m}(\theta)=E(U_1\mid\max\{U_1,U_3\}=x_{1i})=\frac{\lambda_3}{\lambda_1+\lambda_3}x_{1i}+\frac{\lambda_1}{\lambda_1+\lambda_3}H(x_{1i};\lambda_1),\qquad u_{3i}^{m}(\theta)=E(U_3\mid\max\{U_1,U_3\}=x_{1i})=\frac{\lambda_1}{\lambda_1+\lambda_3}x_{1i}+\frac{\lambda_3}{\lambda_1+\lambda_3}H(x_{1i};\lambda_3).$$
  • If $i\in I_2$, then
$$u_{2i}^{m}(\theta)=E(U_2\mid\max\{U_2,U_3\}=x_{2i})=\frac{\lambda_3}{\lambda_2+\lambda_3}x_{2i}+\frac{\lambda_2}{\lambda_2+\lambda_3}H(x_{2i};\lambda_2),\qquad u_{3i}^{m}(\theta)=E(U_3\mid\max\{U_2,U_3\}=x_{2i})=\frac{\lambda_2}{\lambda_2+\lambda_3}x_{2i}+\frac{\lambda_3}{\lambda_2+\lambda_3}H(x_{2i};\lambda_3).$$
Hence, the ’pseudo’ log-likelihood function in this case becomes
$$\ell_s^{(k)}(\lambda_1,\lambda_2,\lambda_3)=\ell_{1s}^{(k)}(\lambda_1)+\ell_{2s}^{(k)}(\lambda_2)+\ell_{3s}^{(k)}(\lambda_3),$$
where
$$\ell_{1s}^{(k)}(\lambda_1)=n\ln\lambda_1-\lambda_1\Big(\sum_{i\in I_0\cup I_1}u_{1i}^{m(k)}+\sum_{i\in I_2}x_{1i}\Big),\qquad \ell_{2s}^{(k)}(\lambda_2)=n\ln\lambda_2-\lambda_2\Big(\sum_{i\in I_0\cup I_2}u_{2i}^{m(k)}+\sum_{i\in I_1}x_{2i}\Big),\qquad \ell_{3s}^{(k)}(\lambda_3)=n\ln\lambda_3-\lambda_3\Big(\sum_{i\in I_1\cup I_2}u_{3i}^{m(k)}+\sum_{i\in I_0}x_{i}\Big),$$
and the $u_{ji}^{m(k)}$'s are obtained from $u_{ji}^{m}(\theta)$, $j=1,2,3$, by replacing $\theta=(\lambda_1,\lambda_2,\lambda_3)$ with $\theta^{(k)}=(\lambda_1^{(k)},\lambda_2^{(k)},\lambda_3^{(k)})$. Therefore,
$$\lambda_1^{(k+1)}=\frac{n}{\sum_{i\in I_0\cup I_1}u_{1i}^{m(k)}+\sum_{i\in I_2}x_{1i}},\qquad \lambda_2^{(k+1)}=\frac{n}{\sum_{i\in I_0\cup I_2}u_{2i}^{m(k)}+\sum_{i\in I_1}x_{2i}},\qquad \lambda_3^{(k+1)}=\frac{n}{\sum_{i\in I_1\cup I_2}u_{3i}^{m(k)}+\sum_{i\in I_0}x_{i}}.$$
Note that, in this case, the maximization can be performed analytically at each M-Step.
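A minimal Python sketch of one EM iteration for Model I, following the expectations $H(x;\lambda)$ and the closed-form M-step above, is given below. The data arrays x1, x2 and the boolean masks I0, I1, I2 (for ties, $x_1<x_2$ and $x_1>x_2$, respectively) are hypothetical names assumed to be available.

```python
import numpy as np

def H(x, lam):
    """E(U | U < x) for U ~ Exp(lam)."""
    return 1.0/lam - x*np.exp(-lam*x)/(1.0 - np.exp(-lam*x))

def em_step_model1(lam, x1, x2, I0, I1, I2):
    """One EM iteration for Model I; lam = (lam1, lam2, lam3)."""
    lam1, lam2, lam3 = lam
    n = len(x1)
    x = x1[I0]                                  # tied values x1 = x2 = x

    # E-step: conditional expectations of the unobserved baseline components
    w13 = lam3/(lam1 + lam3)                    # P(U1 > U3) for exponentials
    w23 = lam3/(lam2 + lam3)                    # P(U2 > U3)
    u1 = np.concatenate([H(x, lam1),            # i in I0: E(U1 | U1 < x_i)
                         w13*x1[I1] + (1 - w13)*H(x1[I1], lam1)])
    u2 = np.concatenate([H(x, lam2),
                         w23*x2[I2] + (1 - w23)*H(x2[I2], lam2)])
    u3 = np.concatenate([(1 - w13)*x1[I1] + w13*H(x1[I1], lam3),
                         (1 - w23)*x2[I2] + w23*H(x2[I2], lam3)])

    # M-step: closed-form maximizers of the pseudo log-likelihood
    lam1_new = n / (u1.sum() + x1[I2].sum())
    lam2_new = n / (u2.sum() + x2[I1].sum())
    lam3_new = n / (u3.sum() + x.sum())
    return lam1_new, lam2_new, lam3_new
```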
Model II.
Suppose $U_1\sim W(\lambda_1,\alpha_1)$, $U_2\sim W(\lambda_2,\alpha_2)$ and $U_3\sim GE(\alpha_3,\lambda_3)$. The pdf of a Weibull distribution $W(\lambda,\alpha)$ with scale parameter $\lambda>0$ and shape parameter $\alpha>0$ can be written as
$$f_W(u;\lambda,\alpha)=\alpha\lambda u^{\alpha-1}e^{-\lambda u^{\alpha}},\quad\text{for } u>0,$$
and zero otherwise. Similarly, the $GE(\alpha,\lambda)$ model defined in Section 3 has the pdf
$$f_{GE}(u;\alpha,\lambda)=\alpha\lambda e^{-\lambda u}(1-e^{-\lambda u})^{\alpha-1},\quad\text{for } u>0,$$
and zero otherwise. Hence, one needs to solve a six-dimensional optimization problem to compute the MLEs of the unknown parameter vector $\theta=(\theta_1,\theta_2,\theta_3)$, where each $\theta_i$ represents the parameter vector of $U_i$.
We need the following expected values for implementation of the EM algorithm:
  • If $i\in I_0$, then
$$u_{ji}^{m}(\theta)=E(U_j\mid U_j<x_i)=H_W(x_i;\alpha_j,\lambda_j),\quad j=1,2,$$
    where
$$H_W(x;\alpha,\lambda)=\frac{1}{1-e^{-\lambda x^{\alpha}}}\int_0^{\lambda x^{\alpha}}\Big(\frac{u}{\lambda}\Big)^{1/\alpha}e^{-u}\,du.$$
  • If $i\in I_1$, then
$$u_{1i}^{m}(\theta)=E(U_1\mid\max\{U_1,U_3\}=x_{1i})=p_{13}\,x_{1i}+(1-p_{13})\,H_W(x_{1i};\alpha_1,\lambda_1),\qquad u_{3i}^{m}(\theta)=E(U_3\mid\max\{U_1,U_3\}=x_{1i})=(1-p_{13})\,x_{1i}+p_{13}\,H_G(x_{1i};\alpha_3,\lambda_3),$$
    where $p_{13}=P(U_1>U_3)=K(\alpha_1,\lambda_1)$ and
$$K(\alpha,\lambda)=\int_0^{\infty}\alpha\lambda x^{\alpha-1}e^{-\lambda x^{\alpha}}\big(1-e^{-\lambda_3 x}\big)^{\alpha_3}\,dx,\qquad H_G(x;\alpha,\lambda)=x-\frac{1}{\lambda\big(1-e^{-\lambda x}\big)^{\alpha}}\int_0^{1-e^{-\lambda x}}\frac{t^{\alpha}}{1-t}\,dt.$$
  • If $i\in I_2$, then
$$u_{2i}^{m}(\theta)=E(U_2\mid\max\{U_2,U_3\}=x_{2i})=p_{23}\,x_{2i}+(1-p_{23})\,H_W(x_{2i};\alpha_2,\lambda_2),\qquad u_{3i}^{m}(\theta)=E(U_3\mid\max\{U_2,U_3\}=x_{2i})=(1-p_{23})\,x_{2i}+p_{23}\,H_G(x_{2i};\alpha_3,\lambda_3),$$
    where $p_{23}=P(U_2>U_3)=K(\alpha_2,\lambda_2)$.
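The quantities $H_W$, $H_G$ and $K$ above have no closed form but are straightforward to evaluate numerically. A minimal sketch using scipy's quad (an assumed numerical tool, with the GE parameters passed to $K$ explicitly for clarity) could read:

```python
import numpy as np
from scipy.integrate import quad

def H_W(x, alpha, lam):
    """E(U | U < x) for U ~ W(lam, alpha), via the integral above."""
    integrand = lambda u: (u / lam) ** (1.0 / alpha) * np.exp(-u)
    value, _ = quad(integrand, 0.0, lam * x ** alpha)
    return value / (1.0 - np.exp(-lam * x ** alpha))

def H_G(x, alpha, lam):
    """E(U | U < x) for U ~ GE(alpha, lam)."""
    top = 1.0 - np.exp(-lam * x)
    integrand = lambda t: t ** alpha / (1.0 - t)
    value, _ = quad(integrand, 0.0, top)
    return x - value / (lam * top ** alpha)

def K(alpha, lam, alpha3, lam3):
    """P(U_W > U_GE) for U_W ~ W(lam, alpha) and U_GE ~ GE(alpha3, lam3)."""
    integrand = lambda t: (alpha * lam * t ** (alpha - 1.0) * np.exp(-lam * t ** alpha)
                           * (1.0 - np.exp(-lam3 * t)) ** alpha3)
    value, _ = quad(integrand, 0.0, np.inf)
    return value
```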
In this case, the terms of the ‘pseudo’ log-likelihood function s ( k ) ( θ ) can be written as
$$\ell_{1s}^{(k)}(\alpha_1,\lambda_1)=n\ln\alpha_1+n\ln\lambda_1+(\alpha_1-1)\Big(\sum_{i\in I_0\cup I_1}\ln u_{1i}^{m(k)}+\sum_{i\in I_2}\ln x_{1i}\Big)-\lambda_1\Big(\sum_{i\in I_0\cup I_1}\big(u_{1i}^{m(k)}\big)^{\alpha_1}+\sum_{i\in I_2}x_{1i}^{\alpha_1}\Big), \tag{A1}$$
$$\ell_{2s}^{(k)}(\alpha_2,\lambda_2)=n\ln\alpha_2+n\ln\lambda_2+(\alpha_2-1)\Big(\sum_{i\in I_0\cup I_2}\ln u_{2i}^{m(k)}+\sum_{i\in I_1}\ln x_{2i}\Big)-\lambda_2\Big(\sum_{i\in I_0\cup I_2}\big(u_{2i}^{m(k)}\big)^{\alpha_2}+\sum_{i\in I_1}x_{2i}^{\alpha_2}\Big), \tag{A2}$$
$$\ell_{3s}^{(k)}(\alpha_3,\lambda_3)=n\ln\alpha_3+n\ln\lambda_3+(\alpha_3-1)\Big(\sum_{i\in I_1\cup I_2}\ln\big(1-e^{-\lambda_3 u_{3i}^{m(k)}}\big)+\sum_{i\in I_0}\ln\big(1-e^{-\lambda_3 x_i}\big)\Big)-\lambda_3\Big(\sum_{i\in I_0}x_i+\sum_{i\in I_1\cup I_2}u_{3i}^{m(k)}\Big). \tag{A3}$$
Therefore, $u_{1i}^{m(k)}$, $u_{2i}^{m(k)}$ and $u_{3i}^{m(k)}$ can be obtained from $u_{1i}^{m}(\theta)$, $u_{2i}^{m}(\theta)$ and $u_{3i}^{m}(\theta)$ by replacing $\theta=(\alpha_1,\lambda_1,\alpha_2,\lambda_2,\alpha_3,\lambda_3)$ with $\theta^{(k)}=(\alpha_1^{(k)},\lambda_1^{(k)},\alpha_2^{(k)},\lambda_2^{(k)},\alpha_3^{(k)},\lambda_3^{(k)})$. Thus, $\theta_1^{(k+1)}=(\alpha_1^{(k+1)},\lambda_1^{(k+1)})$, $\theta_2^{(k+1)}=(\alpha_2^{(k+1)},\lambda_2^{(k+1)})$ and $\theta_3^{(k+1)}=(\alpha_3^{(k+1)},\lambda_3^{(k+1)})$ can be obtained by maximizing (A1)–(A3), respectively. Hence, we obtain them as follows:
$$\lambda_1^{(k+1)}=\frac{n}{\sum_{i\in I_0\cup I_1}\big(u_{1i}^{m(k)}\big)^{\alpha_1^{(k+1)}}+\sum_{i\in I_2}x_{1i}^{\alpha_1^{(k+1)}}},\qquad \lambda_2^{(k+1)}=\frac{n}{\sum_{i\in I_0\cup I_2}\big(u_{2i}^{m(k)}\big)^{\alpha_2^{(k+1)}}+\sum_{i\in I_1}x_{2i}^{\alpha_2^{(k+1)}}},$$
$$\alpha_3^{(k+1)}=\frac{-n}{\sum_{i\in I_1\cup I_2}\ln\big(1-e^{-\lambda_3^{(k+1)}u_{3i}^{m(k)}}\big)+\sum_{i\in I_0}\ln\big(1-e^{-\lambda_3^{(k+1)}x_i}\big)},$$
$$\alpha_1^{(k+1)}=\arg\max p_1(\alpha_1),\qquad \alpha_2^{(k+1)}=\arg\max p_2(\alpha_2),\qquad \lambda_3^{(k+1)}=\arg\max p_3(\lambda_3),$$
where
$$p_1(\alpha_1)=n\ln\alpha_1-n\ln\Big(\sum_{i\in I_0\cup I_1}\big(u_{1i}^{m(k)}\big)^{\alpha_1}+\sum_{i\in I_2}x_{1i}^{\alpha_1}\Big)+(\alpha_1-1)\Big(\sum_{i\in I_0\cup I_1}\ln u_{1i}^{m(k)}+\sum_{i\in I_2}\ln x_{1i}\Big),$$
$$p_2(\alpha_2)=n\ln\alpha_2-n\ln\Big(\sum_{i\in I_0\cup I_2}\big(u_{2i}^{m(k)}\big)^{\alpha_2}+\sum_{i\in I_1}x_{2i}^{\alpha_2}\Big)+(\alpha_2-1)\Big(\sum_{i\in I_0\cup I_2}\ln u_{2i}^{m(k)}+\sum_{i\in I_1}\ln x_{2i}\Big),$$
$$p_3(\lambda_3)=n\ln\lambda_3-n\ln\Big(-\sum_{i\in I_1\cup I_2}\ln\big(1-e^{-\lambda_3 u_{3i}^{m(k)}}\big)-\sum_{i\in I_0}\ln\big(1-e^{-\lambda_3 x_i}\big)\Big)-\lambda_3\Big(\sum_{i\in I_0}x_i+\sum_{i\in I_1\cup I_2}u_{3i}^{m(k)}\Big)-\Big(\sum_{i\in I_1\cup I_2}\ln\big(1-e^{-\lambda_3 u_{3i}^{m(k)}}\big)+\sum_{i\in I_0}\ln\big(1-e^{-\lambda_3 x_i}\big)\Big).$$
Note that, in this case, one needs to solve three one-dimensional optimization problems numerically at each M-Step.
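As an illustration of one of these one-dimensional maximizations, the following Python sketch profiles $p_1(\alpha_1)$ with a bounded scalar optimizer and then recovers $\lambda_1^{(k+1)}$ in closed form. The use of scipy and the search bounds are assumptions made for the sketch, not the implementation used in the paper.

```python
import numpy as np
from scipy.optimize import minimize_scalar

def weibull_profile_update(u1m, x1_obs, n, bounds=(1e-3, 10.0)):
    """M-step update of (alpha_1, lambda_1) for Model II.

    u1m    : conditional expectations u_{1i}^{m(k)} for i in I0 and I1
    x1_obs : observed values x_{1i} for i in I2
    n      : total sample size
    """
    v = np.concatenate([u1m, x1_obs])
    sum_log = np.log(v).sum()

    # Profile pseudo log-likelihood p_1(alpha_1), maximized numerically
    def neg_p1(alpha):
        s = np.power(v, alpha).sum()
        return -(n * np.log(alpha) - n * np.log(s) + (alpha - 1.0) * sum_log)

    alpha_new = minimize_scalar(neg_p1, bounds=bounds, method="bounded").x
    lam_new = n / np.power(v, alpha_new).sum()   # closed form given alpha_1^(k+1)
    return alpha_new, lam_new
```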

Appendix C

Table A1. Summary of the fitted GBD models for the two real data sets. EM rows give the parameter estimates obtained with the EM algorithm maximizing the pseudo log-likelihood function, together with the log-likelihood, AIC, and BIC values; BFGS rows give the results obtained by applying the Broyden–Fletcher–Goldfarb–Shanno algorithm to maximize the log-likelihood function.

| Data set / GBD Model | $\hat{\alpha}_1$ | $\hat{\lambda}_1$ | $\hat{\alpha}_2$ | $\hat{\lambda}_2$ | $\hat{\alpha}_3$ | $\hat{\lambda}_3$ | $\ell(\hat{\theta})$ | AIC | BIC |
|---|---|---|---|---|---|---|---|---|---|
| Soccer data, Model I, EM | – | 0.03126 | – | 0.04630 | – | 0.04269 | −299.4331 | 604.8663 | 609.6990 |
| Soccer data, Model I, BFGS | – | 0.03116 | – | 0.04636 | – | 0.04283 | −299.4328 | 604.8656 | 609.6984 |
| Soccer data, Model II, EM | 1.2987 | 0.0097 | 0.8047 | 0.0093 | 1.0037 | 0.0369 | −348.2715 | 708.5430 | 718.2085 |
| Soccer data, Model II, BFGS | 1.3808 | 0.00698 | 0.5652 | 0.25469 | 1.53813 | 0.05219 | −295.3057 | 602.6114 | 612.2770 |
| Diabetic retinopathy data, Model I, EM | – | 0.0653 | – | 0.0737 | – | 0.1345 | −289.9878 | 585.9757 | 590.8884 |
| Diabetic retinopathy data, Model I, BFGS | – | 0.06290 | – | 0.07181 | – | 0.14282 | −289.9144 | 585.8288 | 590.7415 |
| Diabetic retinopathy data, Model II, EM | 1.0937 | 0.0447 | 0.5851 | 0.2369 | 0.8995 | 0.1898 | −290.0758 | 592.1515 | 601.9770 |
| Diabetic retinopathy data, Model II, BFGS | 1.1477 | 0.03920 | 0.7917 | 0.13923 | 0.41913 | 0.08272 | −285.5795 | 583.1590 | 592.9846 |

References

  1. Gumbel, E.J. Bivariate exponential distributions. J. Am. Stat. Assoc. 1960, 55, 698–707.
  2. Freund, J.E. A bivariate extension of the exponential distribution. J. Am. Stat. Assoc. 1961, 56, 971–977.
  3. Marshall, A.W.; Olkin, I. A multivariate exponential distribution. J. Am. Stat. Assoc. 1967, 62, 30–44.
  4. Balakrishnan, N.; Lai, C.D. Continuous Bivariate Distributions, 2nd ed.; Springer: New York, NY, USA, 2009.
  5. Franco, M.; Vivo, J.M. A multivariate extension of Sarhan and Balakrishnan's bivariate distribution and its ageing and dependence properties. J. Multivar. Anal. 2010, 101, 491–499.
  6. Kundu, D.; Gupta, R.D. Modified Sarhan–Balakrishnan singular bivariate distribution. J. Stat. Plan. Inference 2010, 40, 526–538.
  7. Franco, M.; Kundu, D.; Vivo, J.M. Multivariate extension of the modified Sarhan–Balakrishnan bivariate distribution. J. Stat. Plan. Inference 2011, 141, 3400–3412.
  8. Gupta, R.C.; Kirmani, S.N.U.A.; Balakrishnan, N. On a class of generalized Marshall–Olkin bivariate distributions and some reliability characteristics. Probab. Engrg. Inform. Sci. 2013, 27, 261–275.
  9. Kundu, D.; Franco, M.; Vivo, J.M. Multivariate distributions with proportional reversed hazard marginals. Comput. Stat. Data Anal. 2014, 77, 98–112.
  10. Muhammed, H.Z. On a bivariate generalized inverted Kumaraswamy distribution. Phys. A 2020, 553, 124281.
  11. Franco, M.; Vivo, J.M.; Kundu, D. A generalized Freund bivariate model for a two-component load sharing system. Reliab. Eng. Syst. Saf. 2020, 203, 107096.
  12. El-Morshedy, M.; Ali-Alhussain, Z.; Atta, D.; Almetwally, E.M.; Eliwa, M.S. Bivariate Burr X generator of distributions: Properties and estimation methods with applications to complete and type-II censored samples. Mathematics 2020, 8, 264.
  13. Kundu, D.; Gupta, R.D. Bivariate generalized exponential distribution. J. Multivar. Anal. 2009, 100, 581–593.
  14. Sarhan, A.M.; Hamilton, D.C.; Smith, B.; Kundu, D. The bivariate generalized linear failure rate distribution and its multivariate extension. Comput. Stat. Data Anal. 2011, 55, 644–654.
  15. Elsherpieny, E.A.; Ibrahim, S.A.; Bedar, R.E. A new bivariate distribution with log-exponentiated Kumaraswamy marginals. Chil. J. Stat. 2014, 5, 55–69. Available online: http://www.soche.cl/chjs/volumes/05/02/Elsherpieny_etal(2014).pdf (accessed on 31 August 2020).
  16. El-Gohary, A.; El-Bassiouny, A.H.; El-Morshedy, M. Bivariate exponentiated modified Weibull extension distribution. J. Stat. Appl. Probab. 2016, 5, 67–78.
  17. Muhammed, H.Z. Bivariate inverse Weibull distribution. J. Stat. Comput. Simul. 2016, 86, 2335–2345.
  18. Kundu, D.; Gupta, A.K. On bivariate inverse Weibull distribution. Braz. J. Probab. Stat. 2017, 31, 275–302.
  19. Muhammed, H.Z. Bivariate Dagum distribution. Int. J. Reliab. Appl. 2017, 18, 65–82. Available online: https://www.koreascience.or.kr/article/JAKO201715565837044.pdf (accessed on 31 August 2020).
  20. Sarhan, A.M. The bivariate generalized Rayleigh distribution. J. Math. Sci. Model. 2019, 2, 99–111.
  21. Eliwa, M.S.; El-Morshedy, M. Bivariate Gumbel-G family of distributions: Statistical properties, Bayesian and non-Bayesian estimation with application. Ann. Data Sci. 2019, 6, 39–60.
  22. Gupta, R.C.; Gupta, P.L.; Gupta, R.D. Modeling failure time data by Lehman alternatives. Commun. Stat. Theory Methods 1998, 24, 887–904.
  23. Di Crescenzo, A. Some results on the proportional reversed hazards model. Stat. Probab. Lett. 2000, 50, 313–321.
  24. Kundu, D.; Gupta, R.D. A class of bivariate models with proportional reversed hazard marginals. Sankhya B 2010, 72, 236–253.
  25. Gupta, R.D.; Kundu, D. Generalized exponential distribution. Aust. N. Z. J. Stat. 1999, 41, 173–188.
  26. Sarhan, A.; Kundu, D. Generalized linear failure rate distribution. Commun. Stat. Theory Methods 2009, 38, 642–660.
  27. Lemonte, A.J.; Cordeiro, G.M.; Barreto-Souza, W. The exponentiated Kumaraswamy distribution and its log-transform. Braz. J. Probab. Stat. 2013, 27, 31–53.
  28. Sarhan, A.M.; Apaloo, J. Exponentiated modified Weibull extension distribution. Reliab. Eng. Syst. Saf. 2013, 112, 137–144.
  29. Keller, A.Z.; Giblin, M.T.; Farnworth, N.R. Reliability analysis of commercial vehicle engines. Reliab. Eng. 1985, 10, 15–25.
  30. Dagum, C. A new model of personal income distribution: Specification and estimation. Econ. Appl. 1977, 30, 413–437.
  31. Burr, I.W. Cumulative frequency functions. Ann. Math. Stat. 1942, 13, 215–232.
  32. Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79.
  33. Iqbal, Z.; Tahir, M.M.; Riaz, N.; Ali, S.A.; Ahmad, M. Generalized inverted Kumaraswamy distribution: Properties and application. Open J. Stat. 2017, 7, 645–662.
  34. Lai, C.D.; Xie, M. Stochastic Ageing and Dependence for Reliability; Springer: New York, NY, USA, 2006.
  35. Shaked, M.; Shanthikumar, J.G. Stochastic Orders; Springer: New York, NY, USA, 2007.
  36. Domma, F. Bivariate reversed hazard rate, notions, and measures of dependence and their relationships. Commun. Stat. Theory Methods 2011, 40, 989–999.
  37. Finkelstein, M.S. On the reversed hazard rate. Reliab. Eng. Syst. Saf. 2002, 78, 71–75.
  38. Gupta, R.C.; Balakrishnan, N. Log-concavity and monotonicity of hazard and reversed hazard functions of univariate and multivariate skew-normal distributions. Metrika 2012, 75, 181–191.
  39. Nelsen, R.B. An Introduction to Copulas, 2nd ed.; Springer: New York, NY, USA, 2006.
  40. Fang, R.; Li, X. A note on bivariate dual generalized Marshall–Olkin distributions with applications. Probab. Engrg. Inform. Sci. 2013, 27, 367–374.
  41. Meintanis, S.G. Test of fit for Marshall–Olkin distribution with applications. J. Stat. Plan. Inference 2007, 137, 3954–3963.
  42. Huster, W.J.; Brookmeyer, R.; Self, S.G. Modelling paired survival data with covariates. Biometrics 1989, 45, 145–156.
  43. Ploner, M.; Kaider, A.; Heinze, G. SurvCorr: Correlation of bivariate survival times. R package version 1.0. 2015. Available online: https://CRAN.R-project.org/package=SurvCorr (accessed on 31 August 2020).
  44. Henningsen, A.; Toomet, O. maxLik: A package for maximum likelihood estimation in R. Comput. Stat. 2011, 26, 443–458.
Figure 1. Surface and contour plots of the joint pdf of GBD models $(X_1,X_2)$ with different components $(U_1,U_2,U_3)$.
Figure 2. Plots of the marginal pdfs of the GBD models $(X_1,X_2)$ with different components $(U_1,U_2,U_3)$.
Table 1. Relation between $(x_{1i},x_{2i})$ and $(u_{1i},u_{2i},u_{3i})$.

| $I_k$ | Ordering of $U_j$ | $X_1$ | $X_2$ | Missing |
|---|---|---|---|---|
| $I_0$ | $u_{1i}<u_{2i}<u_{3i}$ | $u_{3i}$ | $u_{3i}$ | $u_{1i},u_{2i}$ |
| $I_0$ | $u_{2i}<u_{1i}<u_{3i}$ | $u_{3i}$ | $u_{3i}$ | $u_{1i},u_{2i}$ |
| $I_1$ | $u_{1i}<u_{3i}<u_{2i}$ | $u_{3i}$ | $u_{2i}$ | $u_{1i}$ |
| $I_1$ | $u_{3i}<u_{1i}<u_{2i}$ | $u_{1i}$ | $u_{2i}$ | $u_{3i}$ |
| $I_2$ | $u_{2i}<u_{3i}<u_{1i}$ | $u_{1i}$ | $u_{3i}$ | $u_{2i}$ |
| $I_2$ | $u_{3i}<u_{2i}<u_{1i}$ | $u_{1i}$ | $u_{2i}$ | $u_{3i}$ |
Table 2. Goodness-of-fit results for the UEFA Champions League data.

| GBD Model | KS (p-value) $X_1$ | KS (p-value) $X_2$ | KS (p-value) $\max\{X_1,X_2\}$ | AIC | BIC |
|---|---|---|---|---|---|
| Model I | 0.1491 (0.3830) | 0.1099 (0.7622) | 0.1530 (0.3517) | 604.8663 | 609.6990 |
| Model II | 0.0976 (0.8719) | 0.0839 (0.9565) | 0.1139 (0.7228) | 708.5430 | 718.2085 |
Table 3. Goodness-of-fit results for the diabetic retinopathy data.

| GBD Model | KS (p-value) $X_1$ | KS (p-value) $X_2$ | KS (p-value) $\max\{X_1,X_2\}$ | AIC | BIC |
|---|---|---|---|---|---|
| Model I | 0.1033 (0.8244) | 0.1848 (0.1598) | 0.1229 (0.6310) | 585.9757 | 590.8884 |
| Model II | 0.0920 (0.8960) | 0.0952 (0.8706) | 0.1152 (0.6778) | 592.1515 | 601.9770 |