A New Kumaraswamy Generalized Family of Distributions with Properties, Applications, and Bivariate Extension

Muhammad H. Tahir; Muhammad Adnan Hussain; Gauss M. Cordeiro; M. El-Morshedy; M. S. Eliwa

doi:10.3390/math8111989

,

and

¹

Department of Statistics, The Islamia University of Bahawalpur, Bahawalpur 63100, Pakistan

²

Department of Statistics, Federal University of Pernambuco, Recife 50740-540, PE, Brazil

³

Department of Mathematics, College of Science and Humanities in Al-Kharj, Prince Sattam bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia

⁴

Department of Mathematics, Faculty of Science, Mansoura University, Mansoura 35516, Egypt

Mathematics2020, 8(11), 1989;https://doi.org/10.3390/math8111989

This article belongs to the Special Issue Statistics 2020

Version Notes

Order Reprints

Abstract

For bounded unit interval, we propose a new Kumaraswamy generalized (G) family of distributions through a new generator which could be an alternate to the Kumaraswamy-G family proposed earlier by Cordeiro and de Castro in 2011. This new generator can also be used to develop alternate G-classes such as beta-G, McDonald-G, Topp-Leone-G, Marshall-Olkin-G, and Transmuted-G for bounded unit interval. Some mathematical properties of this new family are obtained and maximum likelihood method is used for the estimation of G-family parameters. We investigate the properties of one special model called the new Kumaraswamy-Weibull (NKwW) distribution. Parameters of NKwW model are estimated by using maximum likelihood method, and the performance of these estimators are assessed through simulation study. Two real life data sets are analyzed to illustrate the importance and flexibility of the proposed model. In fact, this model outperforms some generalized Weibull models such as the Kumaraswamy-Weibull, McDonald-Weibull, beta-Weibull, exponentiated-generalized Weibull, gamma-Weibull, odd log-logistic-Weibull, Marshall-Olkin-Weibull, transmuted-Weibull and exponentiated-Weibull distributions when applied to these data sets. The bivariate extension of the family is also proposed, and the estimation of parameters is dealt. The usefulness of the bivariate NKwW model is illustrated empirically by means of a real-life data set.

Keywords:

bivariate family; Kumaraswamy-G family; Marshall and Olkin shock model; maximum likelihood method; parameter induction; T-X family; Weibull distribution

1. Introduction

The twenty-first century begun with establishing and extending new tools for modern statistics. In terms of distribution theory, one of the important developments is to define new useful models and then these are tested on real-life data sets available from simple to complex phenomenons. The modern distribution theory has also motivated statisticians and practitioners to propose new generalized (G) families and to investigate their special models, which can effectively be used in different fields, in particular, medicine, reliability engineering, agriculture, survival analysis, demography, actuarial study and others. The G-families proposed by Azzalini [1] (skew-Normal-G (SN-G)), Marshall and Olkin [2] (Marshall-Olkin-G (MO-G)), Gupta et al. [3] (exponentiated-G (exp-G) [Lehmann alternative 1 (LA1) and Lehmann alternative 2 (LA2)]), Eugene et al. [4] (beta-G), Gleaton and Lynch [5] (odd log-logistic-G (OLL-G)), Shaw and Buckley [6] (transmuted-G), Zografos and Balakrishnan [7] (ZBgamma-G), Cordeiro and de-Castro [8](Kumaraswamy-G (Kw-G)), Alexander et al. [9] (McDonald-G (Mc-G)), Ristić and Balakrishnan [10] (RBgamma-G), Cordeiro et al. [11] (exponentiated-generalized-G (EG-G)), Bourguignon et al. [12] (odd Weibull-G (OW-G)), Tahir et al. [13] (odd generalized-exponential (OGE-G)), Tahir et al. [14] (logistic-X) and Rezaei et al. [15] (Topp-Leone-G (TL-G)) have received increased attention in recent statistical literature. For more G-families, the reader is referred to Tahir and Nadarajah [16], and Tahir and Cordeiro [17].

Kumaraswamy [18] pioneered a two-parameter model for bounded unit interval

(0, 1)

which we denote here by using random variable (rv)

T \sim K w (a, b)

. The cumulative distribution function (cdf) and probability density function (pdf) of T are

R (t) = 1 - {(1 - t^{a})}^{b}, t \in (0, 1)

(1)

and

r (t) = a b t^{a - 1} {(1 - t^{a})}^{b - 1},

(2)

respectively, where

a > 0

and

b > 0

are shape parameters.

Cordeiro and de-Castro [8] defined the cdf and pdf of the Kw-G family by

F_{K w G} (x; a, b, ξ) = 1 - {[1 - G {(x; ξ)}^{a}]}^{b}, x \in (0, 1)

(3)

and

f_{K w G} (x; a, b, ξ) = a b g (x; ξ) G {(x; ξ)}^{a - 1} {[1 - G {(x; ξ)}^{a}]}^{b - 1},

(4)

where

a > 0

and

b > 0

are two additional shape parameters, and

ξ

is the vector of baseline parameters.

The Kw-G family has received wide-spread recognition and more than sixty special models have been studied so far, namely: exponential, exponentiated-exponential, Weibull, exponentiated-Weibull, modified Weibull (Lai et al. [19]), flexible-Weibull (Bebbington et al. [20]), generalized power Weibull, log-logistic, half-logistic, Lomax, Burr, Kumaraswamy, generalized gamma, exponentiated-gamma, generalized Rayleigh, Pareto, generalized Pareto, Pareto-IV, Gumbel, exponentiated-Gumbel (type-II), Fréchet, Laplace, Gompertz, Gompertz-Makeham, normal, inverse Gaussian, skew-normal, generalized half-normal, Birnbaum-Saunders, skew-t, Nadarajah-Haghighi, linear failure rate, quadratic hazard rate, Lindley, quasi-Lindley, Lindley-Poisson, Sushila, half-Cauchy, inverse exponential, inverse Rayleigh, inverse Weibull, inverse Weibull-Poisson, inverse flexible-Weibull, modified inverse Weibull (using LA2), Fisher-Snedecor, compound-Rayleigh, exponential-Rayleigh, exponential-Weibull (compounded), exponentiated-Chen, generalized Kappa, generalized extreme-value, Weibull-geometric (WG), complementary WG, Marshall-Olkin exponential, Marshall-Olkin Fréchet (MOFr), Marshall-Olkin Lindley, transmuted Weibull, transmuted Pareto, transmuted modified-Weibull (Sarhan-Zaindin), transmuted exponentiated modified Weibull, transmuted exponentiated additive Weibull and transmuted MOFr.

Some other special models of the Kw-G family were also reported in the literature but these suffer non-identifiability issue (when two parameters appear, for example, in a product and it is impossible to determine their individual effects). These special models are: power function, Burr III, generalized linear failure rate, exponentiated-Pareto, exponentiated Burr and exponentiated-Lomax.

Note 1. The citations and the references of the authors of special models of the Kw-G family [8] are avoided in this section and in references to save some space.

Alzaatreh et al. [21] proposed a general method for constructing G-families by using the transformed-transformer (T-X) approach. Let

r (t)

be the pdf and

R (t)

be the cdf of a rv

T \in [a, b]

for

- \infty < a < b < \infty

and let

W [G (x)]

be a function of the cdf

G (x)

or survival function (sf)

\bar{G} (x) = 1 - G (x)

of any baseline rv (

W (\cdot)

is known as generator) such that

W [G (x)]

satisfies three conditions:

(i): $W [G (x)] \in [a, b]$ ,
(ii): $W [G (x)]$ is differentiable and monotonically non-decreasing, and
(iii): $lim_{x \to - \infty} W [G (x)] = a$ and $lim_{x \to \infty} W [G (x)] = b$ .

The cdf of the T–X family is

F_{T X} (x) = \int_{a}^{W [G (x)]} r (t) d t = R (W [G (x)]),

(5)

where

W [G (x)]

satisfies the conditions (i)–(iii).

The pdf corresponding to Equation (5) is

f_{T X} (x) = r (W [G (x)]) \frac{d}{d x} W [G (x)] .

(6)

The main motivation for proposing new G-family are:

(i): Constructing new and novel G-families as a function of a cdf, $W [G (x)]$ , is a difficult task in these days. A few pioneer G-families were developed in the literature considering $W [G (x)]$ viz. exponentiated-G with power parameter $α > 0$ (LA1 and LA2) (Gupta et al. [3]) [ $G {(x)}^{α} and 1 - \bar{G} {(x)}^{α}$ ], beta-G (Eugene et al. [4]) [ $G (x)$ ], ZBgamma-G (Zografos and Balakrishnan [7]) [ $- log \bar{G} (x)$ ], odd log-logistic-G (Gleaton and Lynch [5]) [ $G (x) / \bar{G} (x)$ ], RBgamma-G (Ristic and Balakrishnan [10]) [ $- log G (x)$ ], log-odd logistic-G (Torabi and Montazeri [22]) [ $log {G (x) / \bar{G} (x)}$ ], Gumbel-X (Al-Aqtash et al. [23]) [ $log {- log \bar{G} (x)}$ ], Weibull-X (T-X approach) and Weibull-X (Ahmad et al. [24]) [ ${- log \bar{G} (x)} / \bar{G} (x)$ ] are the pioneer works. Other G-families either non-composite (alone based on well-established parent model) or composite (mixture of two G-families) and compounded G-families are the extensions or modifications of the above described pioneer G-families. For example, the generator $G (x)$ , where $T \in (0, 1)$ was pioneered by (Eugene et al. [4]) for defining the beta-G family, and later this generator was adopted by (Cordeiro and de-Castro [8]; Alexander et al. [9]; Rezaei et al. [15]) for defining the Kw-G, Mc-G and TL-G families, respectively. Similarly, the odd generator $G (x) / \bar{G} (x)$ (where $T \in (0, \infty)$ ) was suggested by (Gleaton and Lynch, [5]) for proposing the odd log-logistic-G family, and it was adopted by (Bourguignon et al. [12]; Torabi and Montazeri [25]; Tahir et al. [13]; Silva et al. [26]; Cordeiro et al. [27]; Alizadeh et al. [28]; Cordeiro et al. [29]; Hassan et al. [30]; Hassan and Nassr [31]; Maiti and Pramanik [32]); El-Morshedy and Eliwa [33], Alizadeh et al. [34]; El-Morshedy et al. [35]; Eliwa et al. [36] for defining the odd Weibull-G, odd gamma-G, odd generalized-exponential-G, odd Lindley-G, odd Burr-G, odd power-Cauchy-G, odd half-Cauchy, odd additive Weibull-G, odd power-Lindley-G, odd Xgamma-G, odd flexible Weibull-H, odd log-logistic Lindley-G, odd Chen generator and exponentiated odd Chen-G, respectively, among others.
(ii): The proposed extension of the Kumaraswamy-G model is based on a new generator $W [G (x)] = 1 - \bar{G} {(x)}^{G (x)}$ for $T \in (0, 1)$ instead of the only existing generator $G (x)$ for which the beta-G, Kw-G, Mc-G and TL-G classes were developed so far.
(iii): The proposed generator $1 - \bar{G} {(x)}^{G (x)}$ seems little complicated in comparison to earlier well-established generator for the unit interval but it has the ability to produce better estimates and goodness-of-fit (GoF) tests results that can make it distinguishable and attractive for applied researchers (as evident from the results in Section 5 and Section 7).
(iv): For most of the families and models, if the cdf is in closed form, then the quantile function (qf) can be straightforward to obtain. In some families and models, where the qf is based on some special functions such as beta, gamma, and others, then the qfs can only be determined by using power series. In our case, the cdf of the family is in closed form but the qf can be obtained only numerically.

Note 2. A complete and independent investigation of the properties and application of our proposed generator

F (x) = 1 - \bar{G} {(x)}^{G (x)}

as a new family such as transmuted-G (Tr-G) and exponentiated-generalized-G (EG-G) will appear in another outlet very soon. It is noted here that the two G-families (Tr-G and EG-G) have not been developed from any existing parent model similar to our proposed one.

The paper is unfolded as follows. In Section 2, we define the new Kumaraswamy generalized (NKw-G) family. In Section 3, some of its mathematical properties are determined from a useful linear representation of the family density. We investigate the asymptotics and shapes of the density and hazard rate, ordinary and incomplete moments, generating function, mean deviations and estimation of the model parameters. Several properties of a special model viz. new Kumaraswamy Weibull (NKwW) distribution are discussed in Section 4. A simulation study is also conducted to assess the performance of maximum likelihood estimators of the newly proposed model in this section. In Section 5, the usefulness of new model is illustrated by means of two real-life data sets. In Section 6, we define the Bivariate New Kumaraswamy G-family of distributions. In Section 7, the usefulness of the new bivariate model is illustrated by means of a real-life data set. In fact, we prove empirically that our proposed model outperforms some well-known univariate and bivariate distributions. Finally, Section 8 offers some concluding remarks.

The important feature of our article is that the proposed model from this new generalized family, NKwW, is better in performance as compared to some well-known (or well-established) generalized Weibull models selected from the statistical literature. It can be noted from Section 5 that the Kolmogrov-Smirnov GoF statistic yields minimum GoF values along with high p-values of this statistic. Furthermore, it can also been observed from Section 5 that the GoF values of some other well-established statistics such as Akaike information Criterion, Bayesian Information Criterion, Hannan-Quinn Information Criterion, Anderson-Darling and Cramér-von Mises for our propose model are smallest as compared to some important generalized Weibull models. This fact reveals that the performance and flexibility of our proposed model is better in comparison to all other competitive models, when applied to these selected real-life data sets. The same fact is valid for our proposed bivariate model (see, Section 7).

2. The NKw-G Family

For

W [G (x)] = G (x)

and

T \in (0, 1)

just only the beta-G, Kw-G, Mc-G and TL-G families were reported so far. No other generators for

T \in (0, 1)

were published until now. Therefore, our main objective is to introduce a new family of distributions for

T \in (0, 1)

called the NKw-G family and to study its main structural properties.

Let

r (t)

be the Kumaraswamy density. By inserting Equation (2) in Equation (5) and letting

W [G (x)] = 1 - {\bar{G} (x)}^{G (x)}

, the cdf of the NKw-G family is given by

\begin{matrix} F (x) = F (x; a, b, ξ) & = & a b \int_{0}^{1 - \bar{G} {(x; ξ)}^{G (x; ξ)}} t^{a - 1} {(1 - t^{a})}^{b - 1} d t \\ = & 1 - {\{1 - {[1 - \bar{G} {(x; ξ)}^{G (x; ξ)}]}^{a}\}}^{b}, \end{matrix}

(7)

where

a > 0

and

b > 0

are two shape parameters of the Kw distribution and

ξ

is the vector of the baseline parameters.

The pdf corresponding to Equation (7) becomes

\begin{matrix} f (x) = f (x; a, b, ξ) & = & a b g (x; ξ) {[\bar{G} (x; ξ)]}^{G (x)} {[1 - \bar{G} {(x; ξ)}^{G (x; ξ)}]}^{a - 1} \\ \times {\{1 - {[1 - \bar{G} {(x; ξ)}^{G (x; ξ)}]}^{a}\}}^{b - 1} [\frac{G (x; ξ)}{\bar{G} (x; ξ)} - log \bar{G} (x; ξ)] . \end{matrix}

(8)

Henceforth, a rv X with the density (8) is denoted by

X \sim N K w - G (a, b, ξ)

. The survival function (sf)

S (x)

and hazard rate function (hrf)

h (x)

of X are, respectively,

\begin{matrix} S (x) & = & {\{1 - {[1 - \bar{G} {(x; ξ)}^{G (x; ξ)}]}^{a}\}}^{b} \end{matrix}

and

\begin{matrix} h (x) & = & \frac{a b g (x; ξ) {[\bar{G} (x; ξ)]}^{G (x)} {[1 - \bar{G} {(x; ξ)}^{G (x; ξ)}]}^{a - 1} [\frac{G (x; ξ)}{\bar{G} (x; ξ)} - log \bar{G} (x; ξ)]}{1 - {[1 - \bar{G} {(x; ξ)}^{G (x; ξ)}]}^{a}} . \end{matrix}

(9)

3. Properties of the NKw-G Family

In this section, we obtain some mathematical properties of the NKw-G family.

3.1. Quantile Function

The most common and simplest method for generating random variates is based on the inverse cdf. For an arbitrary cdf, the quantile function (qf) is define as

Q (u) = F^{- 1} (u) = \min {x; F (x) \geq u}

. The qf of the NKw-G family can be determined by inverting (7) and then solving the two non-linear equations numerically. We can use the following procedure:

(i): Set $z = z (u) = 1 - {[1 - {(1 - u)}^{1 / b}]}^{1 / a}$ ;
(ii): Find $w = w (u)$ numerically in $w log (1 - w) = log (z)$ using any Newton-Raphson algorithm;
(iii): Solving numerically for x in $G (x; ξ) = w$ yields the qf $x = Q (u)$ of X.

3.2. Asymptotics

The following asymptotics for the density, distribution function and hrf of X hold.

Corollary 1.

The asymptotics of Equations (7)–(9) when

x \to 0

or (

G (x) \to 0

) are

\begin{matrix} F (x) \sim - b {[G (x; ξ)]}^{- a}, \\ f (x) \sim a b g (x; ξ) {[G (x; ξ)]}^{- (a + 1)}, \\ h (x) \sim a b g (x; ξ) {[G (x; ξ)]}^{- (a + 1)} . \end{matrix}

Corollary 2.

The asymptotics of Equations (7)–(9) when

x \to \infty

or (

G (x; ξ) \to 1

) are

\begin{matrix} 1 - F (x) \sim a {[log \bar{G} (x; ξ)]}^{- b}, \\ f (x) \sim \frac{a b g (x; ξ)}{\bar{G} (x; ξ)} {[log \bar{G} (x; ξ)]}^{- (b + 1)}, \\ h (x) \sim \frac{b g (x; ξ)}{\bar{G} (x; ξ) log \bar{G} (x; ξ)} . \end{matrix}

3.3. Analytic Shapes of the Density and Hazard Rate Function

The shapes of the density and hrf of X can be described analytically. The critical points of the density of X are the roots of the equation:

\begin{matrix} \frac{g^{'} (x; ξ)}{g (x; ξ)} - \frac{G (x; ξ) g (x; ξ)}{\bar{G} (x; ξ)} + \frac{g (x; ξ) \{G (x; ξ) - 2\}}{{G (x; ξ) - 1} [G (x; ξ) + {G (x; ξ) - 1} log \bar{G} (x; ξ)]} \\ + g (x; ξ) log \bar{G} (x; ξ) - \frac{(a - 1) g (x; ξ) M [G (x; ξ) + {G (x; ξ) - 1} log \bar{G} (x; ξ)]}{(M - 1) \bar{G} (x; ξ)} \\ + \frac{a (b - 1) g (x; ξ) M {(1 - M)}^{a - 1} [\frac{1}{G (x; ξ) - 1} + log \bar{G} (x; ξ) + 1]}{1 - {(1 - M)}^{a}} = 0, \end{matrix}

where

M = M (x; ξ) = {[1 - G (x; ξ)]}^{G (x; ξ)}

.

The critical points of the hrf of X are obtained from the equation:

\begin{matrix} \frac{g^{'} (x; ξ)}{g (x; ξ)} + \frac{1}{(M - 1) [G (x; ξ) - 1] [{(1 - M)}^{a} - 1] \{G (x; ξ) + [G (x; ξ) - 1] log \bar{G} (x; ξ)\}} \\ \times g (x; ξ) [[1 - {(1 - M)}^{a}] [G (x; ξ) - 1] \{- {log}^{2} \bar{G} (x; ξ) + G (x; ξ) {[log \bar{G} (x; ξ) + 1]}^{2} + 2\} \\ + M {2 - 2 {(1 - M)}^{a} - a {log}^{2} \bar{G} (x; ξ) - a G {(x; ξ)}^{2} {[log \bar{G} (x; ξ) + 1]}^{2} \\ + G (x; ξ) [{(1 - M)}^{a} + 2 a log \bar{G} (x; ξ) \{log \bar{G} (x; ξ) + 1\} - 1]}] = 0 . \end{matrix}

3.4. Linear Representation of the NKw-G Density

Here, we derive useful expansions for Equations (7) and (8) based on the concept of exponentiated distributions. For an arbitrary baseline cdf

G (x; ξ)

, a rv is said to have the exponentiated-G (exp-G) distribution with power parameter

a > 0

if its cdf and pdf are

H_{a} (x; ξ) = G {(x; ξ)}^{a}, h_{a} (x; ξ) = a g (x; ξ) G {(x; ξ)}^{a - 1},

(10)

respectively.

The properties of the exponentiated distributions were studied by many authors in recent years. We consider the generalized binomial expansion

{(1 - z)}^{b} = \sum_{k = 0}^{\infty} {(- 1)}^{k} (\binom{b}{k}) z^{k},

(11)

which holds for any real non-integer b and

| z | < 1

. Using (11) twice in the following expression

T (x; ξ) = {\{1 - {[1 - P (x; ξ)]}^{a}\}}^{b}

in Equation (7), where

P (x; ξ) = \bar{G} {(x; ξ)}^{G (x; ξ)}

, we can write

T (x; ξ) = \sum_{j = 0}^{\infty} w_{j + 1} P {(x; ξ)}^{j + 1}

, where

w_{j + 1} = \sum_{m = 1}^{\infty} {(- 1)}^{j + m + 1} (\binom{b}{m}) (\binom{m a}{j + 1})

. Then, we can expand Equation (7) as

F (x) = 1 - \sum_{j = 0}^{\infty} w_{j + 1} {[1 - G (x; ξ)]}^{(j + 1) G (x; ξ)} .

(12)

Furthermore, using Mathematica, the power series holds

{[1 - G (x; ξ)]}^{(j + 1) G (x; ξ)} = 1 + \sum_{i = 2}^{\infty} q_{i} (j + 1) G {(x; ξ)}^{i},

(13)

where

q_{2} (j + 1) = - (j + 1)

,

q_{3} (j + 1) = - (j + 1) / 2

,

q_{4} (j + 1) = (j + 1) (3 j + 1) / 6

,

q_{5} (j + 1) = (j + 1) (2 j + 1) / 4

, etc.

By inserting Equation (13) in Equation (12) and noting that

\sum_{j = 0}^{\infty} w_{j + 1} = 1

, we obtain

F (x) = \sum_{i = 2}^{\infty} t_{i} G {(x; ξ)}^{i},

(14)

where

t_{i} = - \sum_{j = 0}^{\infty} w_{j + 1} q_{i} (j + 1) for i \geq 2 .

(15)

By differentiating

F (x)

, the NKwG density has the form

f (x) = \sum_{i = 1}^{\infty} t_{i + 1} h_{i + 1} (x; ξ),

(16)

where

h_{i + 1} (x; ξ)

is the exp-G density with power parameter (

i + 1

). Equation (16) reveals that the NKw-G density function is a linear combination of exp-G densities. Then, some of its mathematical properties can be determined directly from those of the exp-G distribution.

3.5. Mathematical Properties

The formulae derived throughout the paper can be easily handled in most symbolic computation platforms such as

Maple

,

Mathematica

and

Matlab

which have the ability to deal with analytic expressions of formidable size and complexity. Henceforth, let

Y_{i + 1}

be a rv with the exp-G distribution with power parameter

(i + 1)

. We obtain some mathematical quantities of the NKw-G family from (16) and those properties of the exp-G distribution. The exp-G properties are known for at least fifty distributions; see those distributions listed in Tahir and Nadarajah [16].

First, the nth ordinary moment of X, say

E (X^{n})

, can be expressed from (16) as

E (X^{n}) = \sum_{i = 1}^{\infty} t_{i + 1} E (Y_{i + 1}^{n}) = \sum_{i = 1}^{\infty} (i + 1) t_{i + 1} τ_{n, i},

(17)

where

τ_{n, i} = \int_{- \infty}^{\infty} x^{n} G {(x; ξ)}^{i} g (x; ξ) d x = \int_{0}^{1} Q_{G} {(u; ξ)}^{n} u^{i} d u

, and

Q_{G} (u; ξ)

is the qf of the baseline G. The quantities

E (Y_{i + 1}^{n})

are known for many G distributions as can been seen in those papers cited in Tahir and Nadarajah (2015).

Moments are important in any statistical analysis. Some of the most important features of a distribution can be studied through moments. For instance, the first four moments can be used to describe some characteristics of a distribution. Clearly, the central moments and cumulants of X can be determined from (17) using well-known relationships.

Second, the nth lower incomplete moment of X, say

m_{n} (y) = \int_{- \infty}^{y} x^{n} f (x) d x

, is

\begin{matrix} m_{n} (y) & = & \sum_{i = 1}^{\infty} t_{i + 1} \int_{- \infty}^{y} x^{n} h_{i + 1} (x) d x = \sum_{i = 1}^{\infty} (i + 1) t_{i + 1} \int_{0}^{G (y; ξ))} Q_{G} {(u; ξ)}^{n} u^{i} d u . \end{matrix}

(18)

The last two integrals can be evaluated numerically for most G distributions.

The first incomplete moment

m_{1} (y)

is used to construct the Bonferroni and Lorenz curves (popular measures in economics, reliability, demography, insurance, and medicine) and to determine the totality of deviations from the mean and median of X (important statistics in statistical applications).

Third, for a given probability

π

, the Bonferroni and Lorenz curves (popular measures in economics, reliability, demography, insurance and medicine) of X are given by

B (π) = m_{1} (q) / (π μ_{1}^{'})

and

L (π) = m_{1} (q) / μ_{1}^{'}

, respectively, where

q = Q (π; ξ)

can be found from the procedure described at the last paragraph of Section 2.

Fourth, the total deviations from the mean and median are

δ_{1} = 2 μ_{1}^{'} F (μ_{1}^{'}) - 2 m_{1} (μ_{1}^{'})

and

δ_{2} = μ_{1}^{'} - 2 m_{1} (M)

, where

F (μ_{1}^{'})

comes from (7).

Fifth, the moment generating function (mgf)

M (t) = E (e^{t X})

of X follows from (16) as

\begin{matrix} M (t) = \sum_{i = 1}^{\infty} t_{i + 1} M_{i + 1} (t) = \sum_{i = 0}^{\infty} (i + 1) t_{i + 1} ρ_{i} (t), \end{matrix}

(19)

where

M_{i + 1} (t)

is the mgf of

Y_{i + 1}

and

ρ_{i} (t) = \int_{0}^{1} exp [t Q_{G} (u; ξ)] u^{i} d u

. Hence, we can obtain the mgfs of many special NKw-G distributions directly from exp-G generating function and Equation (19).

3.6. Estimation

Here, we consider the estimation of the unknown parameters of the NKw-G family by the maximum likelihood method. The MLEs enjoy desirable properties and deliver simple approximations that work well in finite samples when constructing confidence intervals. The normal approximation for the MLEs can be handled either analytically or numerically.

The log-likelihood function

ℓ (θ)

for the vector of parameters

θ = {(a, b, ξ)}^{⊤}

from n observations

x_{1}, \dots, x_{n}

has the form

\begin{matrix} ℓ = ℓ (θ) & = & n log (a b) + \sum_{i = 1}^{n} log [g (x_{i}; ξ)] + \sum_{i = 1}^{n} G (x_{i}; ξ) log [1 - G (x_{i}; ξ)] \\ + (a - 1) \sum_{i = 1}^{n} log \{1 - {[1 - G (x_{i}; ξ)]}^{G (x_{i}; ξ)}\} \\ + (b - 1) \sum_{i = 1}^{n} log [1 - {\{1 - {[1 - G (x_{i}; ξ)]}^{G (x_{i}; ξ)}\}}^{a}] \\ + \sum_{i = 1}^{n} log [\frac{G (x_{i}; ξ)}{1 - G (x_{i}; ξ)} - log [1 - G (x_{i}; ξ)]] . \end{matrix}

The MLE

\hat{θ}

of

θ

can be evaluated by maximizing

ℓ (θ)

. There are several routines for numerical maximization of

ℓ (θ)

in the R program (optim function), SAS (PROC NLMIXED), Ox (sub-routine MaxBFGS), among others.

All distributions belonging to the NKw-G family can be fitted to real data using the AdequacyModel package for the R statistical computing environment (https://www.r-project.org/). An important advantage of this package is that it is not necessary to define the log-likelihood function and that it computes the MLEs, their standard errors and some GoF statistics. We only need to provide the pdf and cdf of the distribution to be fitted to a data set.

Alternatively, we can differentiate the log-likelihood and solving the resulting nonlinear likelihood equations. Then, the score components with respect to a, b and

ξ

are

\begin{matrix} \frac{\partial ℓ}{\partial a} & = & \frac{n}{a} + \sum_{i = 1}^{n} log \{1 - {[1 - G (x_{i}; ξ)]}^{G (x_{i}; ξ)}\} \\ - (b - 1) \sum_{i = 1}^{n} \frac{{\{1 - {[1 - G (x_{i}; ξ)]}^{G (x_{i}; ξ)}\}}^{a} log \{1 - {[1 - G (x_{i}; ξ)]}^{G (x_{i}; ξ)}\}}{1 - {[1 - {(1 - G (x_{i}; ξ))}^{G (x_{i}; ξ)}]}^{a}}, \\ \frac{\partial ℓ}{\partial b} & = & \frac{n}{b} + \sum_{i = 1}^{n} log [1 - {\{1 - {[1 - G (x_{i}; ξ)]}^{G (x_{i}; ξ)}\}}^{a}], \\ \frac{\partial ℓ}{\partial ξ} & = & \sum_{i = 1}^{n} \frac{g_{i}^{ξ}}{g (x_{i}; ξ)} - \sum_{i = 1}^{n} \{\frac{G (x_{i}; ξ)}{1 - G (x_{i}; ξ)} - log (1 - G (x_{i}; ξ))\} G_{i}^{ξ} \\ - \sum_{i = 1}^{n} \frac{G_{i}^{ξ} [G (x_{i}; ξ) - 2]}{(1 - G (x_{i}; ξ)) [- log {1 - G (x_{i}; ξ)} + G (x_{i}; ξ) \{1 + log (1 - G (x_{i}; ξ))\}]} \\ + (a - 1) \sum_{i = 1}^{n} \frac{{(1 - G (x_{i}; ξ))}^{G (x_{i}; ξ) - 1}}{{[1 - G (x_{i}; ξ)]}^{G (x_{i}; ξ)}} [- log {1 - G (x_{i}; ξ)} + G (x_{i}; ξ) \\ \times \{1 + log (1 - G (x_{i}; ξ))\}] G_{i}^{ξ} - (b - 1) \sum_{i = 1}^{n} \frac{a {\{1 - {(1 - G (x_{i}; ξ))}^{G (x_{i}; ξ)}\}}^{a - 1}}{1 - {[1 - {(1 - G (x_{i}; ξ))}^{G (x_{i}; ξ)}]}^{a}} \\ \times {(1 - G (x_{i}; ξ))}^{G (x_{i}; ξ) - 1} [- log {1 - G (x_{i}; ξ)} + G (x_{i}; ξ) \{1 + log (1 - G (x_{i}; ξ))\}] G_{i}^{ξ}, \end{matrix}

where

g_{i}^{ξ} = \frac{\partial g (x_{i}; ξ)}{\partial ξ}

and

G_{i}^{ξ} = \frac{\partial G (x_{i}; ξ)}{\partial ξ}

are column vectors of the same dimension of

ξ

.

Setting the score components to zero and solving them simultaneously yields the MLEs of the model parameters. The resulting equations cannot be solved analytically, but some statistical softwares can be used to solve them numerically through iterative Newton-Raphson type algorithms.

For interval estimation and hypothesis tests on the model parameters, we can obtain the

(p + 2) \times (p + 2)

observed information matrix

J (θ)

numerically (p is the dimension of

ξ

) since the expected information matrix

K (θ)

is very complicated and requires numerical integration.

Under standard regularity conditions, we have

(\hat{θ} - θ) \overset{a}{\sim} N_{p + 2} (0, K {(θ)}^{- 1})

, where

\overset{a}{\sim}

means approximately distributed and

K (θ)

is the expected information matrix. The asymptotic behavior remains valid if

K (θ)

is replaced by the observed information matrix

J (θ)

evaluated at

\hat{θ}

, i.e.,

J (\hat{θ})

. The multivariate normal

N_{p + 2} (0, J {(\hat{θ})}^{- 1})

distribution can be used to construct approximate confidence intervals for the model parameters.

4. The NKwW Distribution

We now define the NKwW distribution by taking the Weibull baseline with cdf

G (x) = 1 - exp (- α x^{β})

and pdf

g (x) = α β x^{β - 1} exp (- α x^{β})

. Then, the cdf and pdf of the NKwW distribution are, respectively, given by

\begin{matrix} F_{N K w W} (x) & = & 1 - {\{1 - {[1 - exp \{- α x^{β} (1 - exp (- α x^{β}))\}]}^{a}\}}^{b} \end{matrix}

(20)

and

\begin{matrix} f_{N K w W} (x) & = & a b α β x^{β - 1} exp \{- α x^{β} (2 - exp (- α x^{β}))\} \\ \times {[1 - exp \{- α x^{β} (1 - exp (- α x^{β}))\}]}^{a - 1} \\ \times {\{1 - {[1 - exp \{- α x^{β} (1 - exp (- α x^{β}))\}]}^{a}\}}^{b - 1} \\ \times [\frac{1 - exp (- α x^{β})}{exp (- α x^{β})} + α x^{β}] . \end{matrix}

(21)

Henceforth, a rv with density (21) is denoted by

X \sim N K w W (a, b, α, β)

. The hrf of X has the form

\begin{matrix} h (x) & = & a b α β x^{β - 1} exp \{- α x^{β} (2 - exp (- α x^{β}))\} \\ \times {[1 - exp \{- α x^{β} (1 - exp (- α x^{β}))\}]}^{a - 1} \\ \times {\{1 - {[1 - exp \{- α x^{β} (1 - exp (- α x^{β}))\}]}^{a}\}}^{- 1} \\ \times [\frac{1 - exp (- α x^{β})}{exp (- α x^{β})} + α x^{β}] . \end{matrix}

Figure 1 and Figure 2 display some plots of the pdf and hrf of X for selected parameter values. Figure 1 reveals that the NKwW distribution is right-skewed and reversed-J shaped. Also, Figure 2 shows that the NKwW hrf can produce increasing, decreasing, bathtub and upside-down bathtub shapes.

Figure 1. Plots of the NKwW densities for some parameter values.

Figure 2. Plots of the NKwW hazard rate for some parameter values.

4.1. Linear Representation

The cdf of the NKwW distribution follows from Equation (14) is

F_{N K w W} (x) = \sum_{i = 2}^{\infty} t_{i} {[1 - exp (- α x^{β})]}^{i} .

(22)

By expanding the binomial term in (22) and noting that

\sum_{i = 2}^{\infty} t_{i} = 1

, we can write

F_{N K w W} (x) = 1 + \sum_{i = 2}^{\infty} t_{i} \sum_{p = 1}^{i} {(- 1)}^{p} (\binom{i}{p}) exp (- p α x^{β})

and then by changing the index p by

(p + 1)

we get

F_{N K w W} (x) = 1 + \sum_{i = 2}^{\infty} t_{i} \sum_{p = 0}^{i} {(- 1)}^{p + 1} (\binom{i}{p + 1}) exp [- (p + 1) α x^{β}] .

Let

δ_{p} = 2

for

p = 0, 1, 2

and

δ_{p} = p

for

p \geq 3

. We can interchange the sums conveniently to obtain

F_{N K w W} (x) = 1 + \sum_{p = 0}^{\infty} (- v_{p}) exp [- (p + 1) α x^{β}],

where

v_{p} = {(- 1)}^{p} \sum_{i = δ_{p}}^{\infty} (\binom{i}{p + 1}) t_{i}

.

By differentiating the last expression, the NKwW density can be expressed as

f_{N K w W} (x) = \sum_{p = 0}^{\infty} v_{p} π (x; (p + 1) α, β),

(23)

where

π (x; (p + 1) α, β) = (p + 1) α β x^{β - 1} exp (- (p + 1) α x^{β})

denotes the Weibull density with scale parameter

(p + 1) α

and shape parameter

β

.

Equation (23) shows that the NKwW density is a linear combination of Weibull densities. Therefore, several NKwW mathematical properties can be derived from those of the Weibull distribution.

4.2. Properties

Let

Z_{p}

be a rv with density

π (x; (p + 1) α, β)

. Then, several properties of X can follow from those of

Z_{p}

. First, the nth ordinary moment of X can be written as

μ_{n}^{'} = Γ (\frac{n}{β} + 1) \sum_{p = 0}^{\infty} \frac{v_{p}}{{[(p + 1) α]}^{n / β}} .

(24)

Second, the cumulants (

κ_{n}

) of X can be determined recursively from (24) as

κ_{s} = μ_{s}^{'} - \sum_{k = 1}^{s - 1} (\binom{s - 1}{k - 1}) κ_{k} μ_{s - k}^{'}

, respectively, where

κ_{1} = μ_{1}^{'}

.

The skewness

γ_{1} = κ_{3} / κ_{2}^{3 / 2}

and kurtosis

γ_{2} = κ_{4} / κ_{2}^{2}

of X can be calculated from the third and fourth standardized cumulants. The skewness and kurtosis plots of the NKwW distribution are displayed in Figure 3. These plots reveal that the parameters a and b play a significant role in modeling the skewness and kurtosis behaviors of X.

Figure 3. Skewness and kurtosis plots of the NKwW model for some values of

α

= 1.5, 0.5, 1.5 and

β

= 2.5, 3.5, 0.5.

Third, we derive an approximation for the density of the sample average

\bar{X} = \sum_{i = 1}^{n} X_{i} / \sqrt{n}

of independent and identically (iid) rvs

X_{1}, \dots, X_{n}

with density (21). Without loss of generality, we can replace each

X_{i}

by

(X_{i} - μ_{1}^{'}) / Var (X_{i})

in order to simplify the approximation. By doing this, the previous third and fourth standardized cumulants are

γ_{1} = μ_{3}^{'}

and

γ_{2} = μ_{4}^{'} - 3

. Furthermore, we require the first six Hermite polynomials defined by

{(- 1)}^{n} \partial^{n} ϕ (x) / \partial x^{n} = H_{r} (x) ϕ (x)

for

n \geq 0

, where

ϕ (x)

is the standard normal pdf. They satisfy the recurrence equation

H_{r} (x) = y H_{r - 1} (x) - (r - 1) H_{r - 2} (x) (r \geq 2)

and follow as

H_{0} (x) = 1, H_{1} (x) = x, H_{2} (x) = x^{2} - 1, H_{3} (x) = x^{3} - 3 x, H_{4} (x) = x^{4} - 6 x^{2} + 3, H_{5} (x) = x^{5} - 10 x^{3} + 15 x

and

H_{6} (x) = x^{6} - 15 x^{4} + 45 x^{2} - 15

.

The second-order Edgeworth expansion for the sample mean

\bar{X}

of standardized NKwW rvs can be expressed as

f_{\bar{X}} (x) = ϕ (x) \{1 + \frac{μ_{3}^{'}}{6 \sqrt{n}} H_{3} (x) + \frac{(μ_{4}^{'} - 3)}{24 n} H_{4} (x) + \frac{μ_{3}^{' 2}}{72 n} H_{6} (x)\} + O (n^{- 3 / 2}) .

(25)

It is much more frequent in statistical applications to compute distribution functions than density functions. By integrating Equation (25), the cdf of

\bar{X}

has the form

F_{\bar{X}} (x) = Φ (x) - ϕ (x) \{\frac{μ_{3}^{'}}{6 \sqrt{n}} H_{2} (x) + \frac{(μ_{4}^{'} - 3)}{24 n} H_{3} (x) + \frac{μ_{3}^{' 2}}{72 n} H_{5} (x)\} + O (n^{- 3 / 2}),

(26)

where

Φ (x)

is the standard normal cdf. Equation (26) provides highly accurate results for the probabilities associated with

\bar{Y}

.

Fourth, the nth incomplete moment of X, denoted by

m_{n} (y) = E (X^{n} ∣ X \leq y) = \int_{0}^{y} x^{n} f_{N K w W} (x) d x

, can easily be obtained by changing variables from the lower incomplete gamma function

γ (s, x) = \int_{0}^{\infty} x^{s - 1} e^{- x} dx

when calculating the corresponding moment of

Z_{p}

. Then, we obtain

m_{n} (z) = \sum_{p = 0}^{\infty} \frac{v_{p}}{{[(p + 1) α]}^{n / β}} γ (\frac{n}{β} + 1, (p + 1) α z^{β}) .

(27)

Fifth, the first incomplete moment

m_{1} (z)

is used to to determine the totality of deviations from the mean and median of a distribution and construct the Bonferroni and Lorenz curves. The total deviations from the mean and median M of X can be expressed as

δ_{1} = 2 μ_{1}^{'} F_{N K w W} (μ_{1}^{'}) - 2 m_{1} (μ_{1}^{'})

and

δ_{2} = μ_{1}^{'} - 2 m_{1} (M)

, where M can be determined from

F_{N K w W} (M) = 0.5

. The Bonferroni and Lorenz curves of X for a given probability

π

are given by

B (π) = m_{1} (q) / (π m u_{1}^{'})

and

L (π) = π B (π)

, respectively, where

q = Q_{(} π)

is the qf of X discussed in Section 4.1.

4.3. Quantile Function and Simulation Study

The qf of the NKwW distribution cannot be obtained explicitly. However, we can use Newton-Raphosn algorithm to generate NKwW variates as follows:

Set n, $α$ , $β$ , a, b and initial value $x^{0}$ .
Generate U ∼Uniform $(0, 1)$ .
Update $x^{0}$ by using the Newton’s formula

$x * = x^{0} - R (x^{0}; α, β, a, b),$

where $R (x^{0}; α, β, a, b) = \frac{F_{N K w W} (x^{0}; α, β, a, b)}{f_{N K w W} (x^{0}; α, β, a, b)}$ , and $F_{N K w W}$ and $f_{N K w W}$ are obtained from Equations (20) and (21), respectively.
If $| x^{0} - x^{*} | \leq ϵ$ , ( $ϵ > 0$ , very small tolerance limit), then store $x^{0} = x^{*}$ as a variate from the NKwW( $α, β, a, b$ ) distribution.
If $| x^{0} - x^{*} | > ϵ$ , then, set $x^{0} = x^{*}$ and go to step 3.
Repeat steps (2)–(5) n times to generate $x_{1}, \dots x_{n}$ .

The R script to generate observations from the NKwW distribution is given in the Appendix A.

Here we study the performance and accuracy of maximum likelihood estimates of the NKwW parameters using Monte Carlo simulations. The simulation study is carried out for sample sizes

n = 25, 50, 75, 100, 200

and parameter scenarios: I:

α = 0.5

,

β = 0.5

,

a = 2.5

, and

b = 1.5

, II:

α = 1.5

,

β = 1.5

,

a = 1.5

, and

b = 1.5

and III:

α = 1.1

,

β = 5.5

,

a = 0.5

, and

b = 0.5

. We used the above algorithm for sample generation whose R-codes ae given in Appendix A. The simulation study is repeated for N = 1000 times each with given sample size and computed the average estimates (AE) along with their average biases (Bias)of the MLEs and mean squared errors (MSE).

B i a s (\hat{θ}) = \sum_{i = 1}^{N} \frac{\hat{θ_{i}}}{N} - θ and M S E (\hat{θ}) = \sum_{i = 1}^{N} \frac{{(\hat{θ_{i}} - θ)}^{2}}{N} .

We display Bias and MSE for the parameters

α

,

β

, a and b in Figure 4 and Figure 5, respectively, which indicate that as sample size increases the bias and MSE decreases. Thus, MLEs perform well in estimating the parameters of the NKwW distribution.

Figure 4. Plots of estimated MSEs for selected parameter values.

Figure 5. Plots of estimated biases for selected parameter values.

4.4. Estimation

Let

x_{1}, \dots, x_{n}

be a sample of size n from the NKwW distribution given in Equation (21). The log-likelihood function

ℓ = ℓ (θ)

for the vector of parameters

θ = {(α, β, a, b)}^{⊤}

is

\begin{matrix} ℓ & = & n log (a b α β) - 2 α \sum_{i = 1}^{n} x_{i}^{β} + α \sum_{i = 1}^{n} x_{i}^{β} exp (- α x_{i}^{β}) \\ + \sum_{i = 1}^{n} log [α x_{i}^{β} + exp (- α x_{i}^{β}) (1 - exp (- α x_{i}^{β}))] \\ + (β - 1) \sum_{i = 1}^{n} log x_{i} + (a - 1) \sum_{i = 1}^{n} log [1 - exp (- α x_{i}^{β}) (1 - exp (- α x_{i}^{β}))] \\ + (b - 1) \sum_{i = 1}^{n} log [1 - {\{1 - exp (- α x_{i}^{β}) (1 - exp (- α x_{i}^{β}))\}}^{a}] . \end{matrix}

The function ℓ can be easily maximized using the

AdequacyModel

package. The components of the score vector

U (θ)

are

\begin{matrix} U_{α} & = & \frac{n}{α} + \sum_{i = 1}^{n} (\frac{x_{i}^{β} [1 + exp (α x_{i}^{β})]}{α x_{i}^{β} + exp (α x_{i}^{β}) - 1}) - \sum_{i = 1}^{n} x_{i}^{β} exp (- α x_{i}^{β}) [α x_{i}^{β} + 2 exp (α x_{i}^{β}) - 1] \\ + (a - 1) \sum_{i = 1}^{n} (\frac{z_{i, α}}{z_{i}}) - (b - 1) \sum_{i = 1}^{n} (\frac{a z_{i}^{a - 1} z_{i, α}}{1 - z_{i}^{a}}), \\ U_{β} & = & \frac{n}{β} + \sum_{i = 1}^{n} log x_{i} + \sum_{i = 1}^{n} (\frac{α x_{i}^{β} log (x) [1 + exp (α x_{i}^{β})]}{α x_{i}^{β} + exp (α x_{i}^{β}) - 1}) - \sum_{i = 1}^{n} [α x_{i}^{β} log (x) exp (- α x_{i}^{β})] \\ \times [α x_{i}^{β} + 2 exp (α x_{i}^{β}) - 1] + (a - 1) \sum_{i = 1}^{n} (\frac{z_{i, β}}{z_{i}}) - (b - 1) \sum_{i = 1}^{n} (\frac{a z_{i}^{a - 1} z_{i, β}}{1 - z_{i}^{a}}), \\ U_{a} & = & \frac{n}{a} + \sum_{i = 1}^{n} log z_{i} - (b - 1) \sum_{i = 1}^{n} (\frac{z_{i}^{a} log z_{i}}{1 - z_{i}^{a}}), \\ U_{b} & = & \frac{n}{b} + \sum_{i = 1}^{n} log (1 - z_{i}^{a}), \end{matrix}

where

z_{i} = 1 - exp \{- α x_{i}^{β} [1 - exp (- α x_{i}^{β})]\}

,

z_{i α} = x_{i}^{β} [α x_{i}^{β} + exp (α x_{i}^{β}) - 1] exp \{- α x_{i}^{β} [2 - exp (- α x_{i}^{β})]\}

,

z_{i β} = α x_{i}^{β} log x_{i} [α x_{i}^{β} + exp (α x_{i}^{β}) - 1] exp \{- α x_{i}^{β} [2 - exp (- α x_{i}^{β})]\} .

The MLE

\hat{θ}

of

θ

can be obtained by solving the nonlinear equations

U_{α} = 0

,

U_{β} = 0

,

U_{a} = 0

and

U_{b} = 0

. These equations cannot be solved analytically and statistical software can be used to obtain the estimates numerically. We can use iterative techniques such as a Newton-Raphson type algorithm to obtain

\hat{θ}

using a wide range of initial values. The initial values for the parameters are important but are not hard to obtain from the fit of the Weibull distribution. This process often results or leads to more than one maximum. However, in these cases, we consider the MLEs corresponding to the largest value of the maximum. In a few cases, no maximum is identified for the selected initial values. In these cases, a new initial value is tried in order to obtain a maximum.

5. Empirical Illustrations of NKwW Model

In this section, we compare the NKwW distribution with some well-known extended (or generalized) Weibull distributions. To check the potentiality of the new distribution, we use two real data sets representing different hydrological events such as precipitation and flood. We compare the NKwW model with the Kumaraswamy-Weibull (KwW) (Cordeiro et al. [37]), beta-Weibull (BW) (Lee et al. [38]), exponentiated-generalized Weibull (EGW) (Oguntunde et al. [39]), McDonald-Weibull (McW) (Cordeiro et al. [40]), gamma-Weibull (GaW) (Cordeiro et al. [41]), odd log-logistc Weibull (OLLW) (da-Cruz et al. [42]), Marshall-Olkin Weibull (MOW) (Ghitany et al. [43]), transmuted-Weibull (TrW) (Khan et al. [44]) and Weibull (W) models by means of two real-life data sets which are described below:

Data Set 1. Precipitation data. The data were taken from Katz et al. [45] which represent the annual maximum precipitation (inches) for one rain gauge in Fort Collins, Colorado from 1900 through 1999. The data are: 239, 232, 434, 85, 302, 174, 170, 121, 193, 168, 148, 116, 132, 132, 144, 183, 223, 96, 298, 97, 116, 146, 84, 230, 138, 170, 117, 115, 132, 125, 156, 124, 189, 193, 71, 176, 105, 93, 354, 60, 151, 160, 219, 142, 117, 87, 223, 215, 108, 354, 213, 306, 169, 184, 71, 98, 96, 218, 176, 121, 161, 321, 102, 269, 98, 271, 95, 212, 151, 136, 240, 162, 71, 110, 285, 215, 103, 443, 185, 199, 115, 134, 297, 187, 203, 146, 94, 129, 162, 112, 348, 95, 249, 103, 181, 152, 135, 463, 183, 241.

Data set 2. Flood data. The data were taken from Asgharzadeh et al. [46] which represent the maximum annual flood discharges (in units of 1000 cubic feet per second) of the North Saskachevan River at Edmonton, over a period of 48 years. The data are: 19.885, 20.940, 21.820, 23.700, 24.888, 25.460, 25.760, 26.720, 27.500, 28.100, 28.600, 30.200, 30.380, 31.500, 32.600, 32.680, 34.400, 35.347, 35.700, 38.100, 39.020, 39.200, 40.000, 40.400, 40.400, 42.250, 44.020, 44.730, 44.900, 46.300, 50.330, 51.442, 57.220, 58.700, 58.800, 61.200, 61.740, 65.440, 65.597, 66.000, 74.100, 75.800, 84.100, 106.600, 109.700, 121.970, 121.970, 185.560.

All the calculations in these two applications are performed using the

AdequacyModel

package in R. The unknown parameters of the models are estimated by the maximum likelihood method. The log-likelihood function is evaluated at the MLEs (

\hat{ℓ}

). The well-known GoFS such as the Akaike information criterion (AIC), Bayesian Information Criterion (BIC), Hannan-Quinn Information Criterion (HQIC), Anderson-Darling (

A^{*}

), Cramér–von Mises (

W^{*}

) and Kolmogrov-Smirnov (K-S) are adopted for model comparisons. The lower values of GoFS and higher p-values of the K-S statistic indicate good fits.

Table 1 and Table 2 list the MLEs and their standard errors (SEs) for the NKwW distribution and other competitive models (KwW, BW, EGW, McW, GaW, OLLW, MOW, TrW and W) fitted to the two hydrological data sets. The values of the GoFS in Table 3 and Table 4 indicate that the NKwW model shows small values of these statistics and hence it provides the best fit as compared to the other models. The plots in Figure 6 and Figure 7 also support our claim.

Table 1. MLEs and their SEs (in parentheses) for data set 1.

Table 2. MLEs and their standard errors (in parentheses) for data set 2.

Table 3. The statistics AIC, BIC, HQIC,

A^{*}

,

W^{*}

, K-S and p-value for data set 1.

Table 4. The statistics AIC, BIC, HQIC,

A^{*}

,

W^{*}

, K-S and p-value for data set 2.

Figure 6. Estimated (a) density (b) K-M (c) hazard rate, and (d) Box-plots for the data set 1.

Figure 7. Estimated (a) density (b) K-M (c) hazard rate, and (d) Box-plots for the data set 2.

6. Bivariate New Kumaraswamy G-Family

In this Section, we introduce a bivariate extension of the NKw-G family according to Marshall and Olkin shock model (see, Marshall and Olkin, [47]). Several authors used the Marshall and Olkin approach as a method to generate bivariate distributions, see for example Sarhan and Balakrishnan, [48], Kundu and Dey [49], El-Gohary et al. [50], Muhammed [51], El-Bassiouny et al. [52], Ghosh and Hamedani [53], El-Morshedy et al. [54,55], Eliwa et al. [56], among others. The bivariate new Kumaraswamy (BvNKw) G-family is constructed from three independent NKw-G families by using a minimization process. Assume three independent rvs

Y_{k} \sim

NKw-G

(a, b_{k}, ξ);

k = 1, 2, 3

and defining

X_{j} = min {Y_{j}, Y_{3}}; j = 1, 2

, the bivaraite random vector

X

is said to have the BvNKw-G family with parameters vector

Υ =

(

a, b_{1}, b_{2}

,

b_{3}, ξ

) if its joint reliability function (jrf) is given by

S_{X_{1}, X_{2}} (x_{1}, x_{2}; Υ) = \{\begin{matrix} S_{NKw - G} (x_{1}; a, b_{1}, ξ) S_{NKw - G} (x_{2}; a, b_{2} + b_{3}, ξ) if x_{1} < x_{2}, \\ S_{NKw - G} (x_{1}; a, b_{1} + b_{3}, ξ) S_{NKw - G} (x_{2}; a, b_{2}, ξ) if x_{1} \geq x_{2} . \end{matrix}

(28)

The marginal reliability functions (rfs) corresponding to (28) can be written as

S_{X_{i}} (x_{i}) = S_{NKw - G} (x_{i}; a, b_{i} + b_{3}, ξ); i = 1, 2 .

(29)

The corresponding joint pdf (jpdf) to (28) can be formulated as

f_{X_{1}, X_{2}} (x_{1}, x_{2}; Υ) = \{\begin{matrix} f_{NKw - G} (x_{1}; a, b_{1}, ξ) f_{NKw - G} (x_{2}; a, b_{2} + b_{3}, ξ) if x_{1} < x_{2}, \\ f_{NKw - G} (x_{1}; a, b_{1} + b_{3}, ξ) f_{NKw - G} (x_{2}; a, b_{2}, ξ) if x_{1} > x_{2}, \\ \frac{b_{3}}{b_{1} + b_{2} + b_{3}} f_{NKw - G} (x; a, b_{1} + b_{2} + b_{3}, ξ) if x_{1} = x_{2} = x, \end{matrix}

(30)

where the jpdf in Equation (30) can be derived from a well-known formula (see Eliwa and El-Morshedy, [57]). The marginal pdfs corresponding to Equation (29) can be proposed as

f_{X_{i}} (x_{i}) = f_{NKw - G} (x_{i}; a, b_{i} + b_{3}, ξ); i = 1, 2 .

(31)

If

X

have the BvNKw-G family, then the distributions of

max {X_{1}, X_{2}}

and

min {X_{1}, X_{2}}

are

F_{max {X_{1}, X_{2}}} (w) = \prod_{i = 1}^{3} F_{N K w - G} (w; a, b_{i}, ξ) and F_{min {X_{1}, X_{2}}} (w) = 1 - \prod_{i = 1}^{3} S_{NKw - G} (w; a, b_{i}, ξ),

respectively. If

X_{j} \sim

NKw-G

(a, b_{j} + b_{3}, ξ); j = 1, 2

,then the coefficient of correlation between

X_{1}

and

X_{2}

is

Q {(u)}_{X_{1}, X_{2}} = \{\begin{matrix} 4 F_{NKw - G} (Q {(u)}_{X_{1}}; a, b_{1}, ξ) F_{NKw - G} (Q {(u)}_{X_{2}}; a, b_{2} + b_{3}, ξ) - 1 if x_{1} < x_{2}, \\ 4 F_{NKw - G} (Q {(u)}_{X_{1}}; a, b_{1} + b_{3}, ξ) F_{NKw - G} (Q {(u)}_{X_{2}}; a, b_{2}, ξ) - 1 if x_{1} > x_{2} . \end{matrix}

(32)

The BvNKw-G family has a singular part along the line

x_{1} = x_{2}

with weight

b_{3} {(b_{1} + b_{2} + b_{3})}^{- 1}

, whereas on

x_{1} \neq x_{2}

with weight

(b_{1} + b_{2}) {(b_{1} + b_{2} + b_{3})}^{- 1}

, the BvNKw-G family has an absolute continuous part. Assume

δ_{i}

= S_{X_{i}} (x_{i})

where

X_{j} \sim

NKw-G

(a, b_{j} + b_{3}, ξ); j = 1, 2

, the jrf of the proposed family can be derived by using copula of the Marshall-Olkin model as

S_{X_{1}, X_{2}} (x_{1}, x_{2}; Υ) = δ_{1}^{1 - τ_{1}} δ_{2}^{1 - τ_{2}} max (δ_{1}^{τ_{1}}, δ_{2}^{τ_{2}}), for 0 < τ_{1}, τ_{2} < 1,

where

τ_{j} = \frac{b_{3}}{b_{j} + b_{3}}

. For more details on copula property, see Gijbels et al. [58] and Husková and Maciak [59].

Using Equations (28) and (30), the joint hrf (jhrf) can easily be reported by using

h_{X_{1}, X_{2}} (x_{1}, x_{2}; Υ) = \frac{f_{X_{1}, X_{2}} (x_{1}, x_{2}; Υ)}{S_{X_{1}, X_{2}} (x_{1}, x_{2}; Υ)}

. Figure 8, Figure 9 and Figure 10 show the jpdf, jhrf, and jrf for different values of the BvNKw-Weibull (BvNKwW) parameters.

Figure 8. The surface plots of the jpdf, jhrf and jrf of the BvNKwW model for

a = 0.6, b_{1} = 4, b_{2} = 4

,

b_{3} = 4, α = 0.6

and

β = 2.9

.

Figure 9. The surface plots of the jpdf, jhrf and jrf of the BvNKwW model for

a = 1.6, b_{1} = 2, b_{2} = 2

,

b_{3} = 2, α = 1.6

and

β = 2.9

.

Figure 10. The surface plots of the jpdf, jhrf and jrf of the BvNKwW model for

a = 0.8, b_{1} = 1.5, b_{2} = 1.5

,

b_{3} = 1.5, α = 0.9

and

β = 1.9

.

The MLE for the BvNKw-G Family

In this section, the unknown parameters of the BvNKw-G family are estimated by using the maximum likelihood approach. Assuming that

(x_{11}, x_{21})

,

(x_{12}, x_{22}), . . .,

(x_{1 p}, x_{2 p})

is a sample of size p from the BvNKw-G family where

Λ_{1} = {x_{1 i} < x_{2 i}}

,

Λ_{2} = {x_{1 i} > x_{2 i}}

,

Λ_{3} = {x_{1 i} = x_{2 i} = x_{i}}

,

p_{s} = |Λ_{s}|; s = 1, 2, 3

and

|Λ| = p = p_{1} + p_{2} + p_{3}

. Using Equation (30), the likelihood function

l (Υ)

can be expressed as

\begin{matrix} l (Υ) & = & \prod_{i = 1}^{p_{1}} f_{NKw - G} (x_{1 i}; a, b_{1}, ξ) f_{NKw - G} (x_{2 i}; a, b_{2} + b_{3}, ξ) \prod_{i = 1}^{p_{2}} f_{NKw - G} (x_{1 i}; a, b_{1} + b_{3}, ξ) \\ \times f_{NKw - G} (x_{2 i}; a, b_{2}, ξ) {(\frac{b_{3}}{b_{1} + b_{2} + b_{3}})}^{p_{3}} \prod_{i = 1}^{p_{3}} f_{NKw - G} (x_{i}; a, b_{1} + b_{2} + b_{3}, ξ) . \end{matrix}

(33)

Through differentiation of the term

L (Y) = log

l (Y)

with respect to

a, b_{1}, b_{2}, b_{3}

and

ξ

, and then equating the resulting equations to zeros, we get the non-linear normal equations. An iterative procedure such as Newton–Raphson technique is required to solve them numerically.

7. Empirical Illustrations of BvNKwW Model Through Motors Data

In this Section, the flexibility of the BvNKwW model is shown through a real-life data application. This data is reported in Relia and Staff [60] which represents the failure times of a parallel system constituted by two identical motors in days. The fitted bivariate models are compared using some statistical criteria, namely AIC, CAIC, BIC and HQIC. To fit the marginals of the BvNKwW model, the K-S with its p-value are used. The BvNKwW model is compared with other bivariate distributions such as: bivariate generalized power Weibull (BvGPW), bivariate exponentiated Weibull (BvEW), bivariate Weibull (BvW), bivariate generalized exponential (BvGEx), bivariate exponential (BvEx), and bivariate generalized linear failure rate (BvGLFR) distributions when applied to this data set. At first, the marginals

X_{1}

,

X_{2}

and

max (X_{1}, X_{2})

are fitted separately to this data set. The MLEs of the parameters

(a, b, α, β)

of the corresponding NKwW distribution for

X_{1}

,

X_{2}

and

max (X_{1}, X_{2})

are (

60.5030

,

732.6059

,

0.9694

,

0.17440

), (

1.8182

,

84.4223

,

0.0019

,

0.9434

) and (

27.8306

,

263.4773

,

0.4649

,

0.2626

), respectively. The

- L

, K-S, p-value for the marginals are reported in Table 5.

Table 5. The -L, K-S and p-values for

X_{1}

,

X_{2}

and

max (X_{1}, X_{2})

.

Table 5 lists that the NKwW model fits to the real data for the marginals. Figure 11, Figure 12 and Figure 13 show the fitted pdf, cdf and probability-probability (pp) plots, which support our empirical results in Table 5.

Figure 11. The fitted pdfs plots for

X_{1}

,

X_{2}

and

max (X_{1}, X_{2})

.

Figure 12. The estimated cdfs for

X_{1}

,

X_{2}

and

max (X_{1}, X_{2})

.

Figure 13. The pp-plots for

X_{1}

,

X_{2}

and

max (X_{1}, X_{2})

.

Figure 14 show the box and TTT plots for the

X_{1}

,

X_{2}

and

max (X_{1}, X_{2})

, and the scatter plot for the motors data.

Figure 14. The box, TTT and scatter plots of the marginal models.

It is noted that the BvNKwW model can be used to analyze and the real data on motors. The MLEs,

- L

, AIC, CAIC, BIC and HQIC values for the BvNKwW model and some competitive models are listed in Table 6.

Table 6. The MLEs with its (SE) and goodness of fit measures for Motors data.

From Table 6, it is observed that the BvNKwW model provides a better fit as compared to competitive models.

8. Concluding Remarks

Proposing new and flexible models through G-classes is an active research area in distribution theory. The new era has proved that flexible models can prove very helpful to researchers and practitioners in investigating data genertated from different phenomenons. G-classes are one of the basic source which provides a paradigm to data related science and its investigation.

The purpose of our article is to contribute a new G-family and hence a new Kumaraswamy-G family of distributions is introduced from a new generator

1 - \bar{G} {(x)}^{G (x)}

for support (0,1), that has ability to serve as an alternative to well-known Kumaraswamy-G family (pioneered in 2011) and other classes of distributions for

T \in (0, 1)

. The proposed generator adopted here involves a different function of the cumulative function instead of existing generator which is only based on

G (x)

. In the literature, beta-G, Kw-G, Mc-G and TL-G families were introduced from the existing generator

G (x)

for bounded unit interval. Therefore, similar G-families can be developed from our proposed generator

1 - \bar{G} {(x)}^{G (x)}

. We obtain some structural properties of this new Kumaraswamy-G family, and also study some properties of the special model called the new Kumaraswamy-Weibull (NKwW) distribution. We compare this distribution with the well-known generalized Weibull models (Kumaraswamy-Weibull, McDonald-Weibull, beta-Weibull, exponentiated-generalized Weibull, gamma-Weibul, odd log-logistic-Weibull, Marshall-Olkin-Weibull, transmuted-Weibull and Weibull) using six popular GoF test-statistics. We found that the new distribution provides better estimates and minimum GoF-tests values. Thus, the NKwW distribution outperforms the well-established competitive models on the basis of numerical and graphical analysis. Similarly, the BvNKwW distribution is introduced, and is compared with other well-known bivariate models such as bivariate generalized power Weibull, bivariate exponentiated Weibull, bivariate Weibull, bivariate generalized exponential, bivariate exponential, and bivariate generalized linear failure rate distributions. The results of popular goodness-of-fit statistics showed that our proposed bivariate model is better as compared to other well-known bivariate models. We expect that this new family will be able to attract readers and applied statisticians.

Author Contributions

The main results are a part of M.A.H.’s dissertation under M.H.T.’s supervision. The other authors G.M.C., M.E.-M. and M.S.E. contributed to the mathematical properties, and bivariate section and its application. In general, all authors have contributed in manuscript refinement, preparation and revision. All authors have read and agreed to the published version of the manuscript.

Funding

This publication was supported by the Deanship of Scientific Research (DSR) at Prince Sattam bin Abdulaziz University, Al-Kharj, Saudi Arabia.

Acknowledgments

The authors would like to thank the editor and the two worthy reviewers for useful comments and suggestions, which improved the earlier version of the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

The R script to generate NKwW variates is given below:

n=20; alpha=1; beta=1.5; a=2.5;b=2.5;

f=function(x,alpha,beta,a,b)

{

g=alpha*beta*x^(beta-1))*exp(-alpha*x^{beta})

G=1-exp(-alpha*x^{beta})

F=1-(1-(1-(1-G)^G)^a)^b

D =a*b*g*(1-G)^G*(1-(1-G)^G)^(a-1))

*((G)/(1-G)-log(1-G))*(1-(1-(1-G)^G)^a)^(b-1)

return(D)

};

F=function(x,alpha,beta,a,b)

{

g=alpha*beta*x^(beta-1))*exp(-alpha* x^{beta})

G=1-exp(-alpha*x^{beta})

F=1-(1-(1-(1-G)^G)^a)^b

D =a*b*g*(1-G)^G*(1-(1-G)^G)^(a-1))*((G)/(1-G)-log(1-G))

*(1-(1-(1-G)^G)^a)^(b-1)

return(d)

};

u=runif(n,0,1);

x=rep(0,n);

for(i in 1:n)

{

x0=1

xnew=x0-((F(x0,alpha,beta,a,b)-u[i])/f(x0,alpha,beta,a,b))

while(abs(xnew-x0) > 0.0001)

{

x0=xnew

xnew=x0-((F(x0,alpha,beta,a,b)-u[i])/f(x0,alpha,beta,a,b))

}

x[i]=xnew

}

print(x)

References

Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
Marshall, A.W.; Olkin, I. A new method for adding parameters to a family of distributions with application to the exponential and Weibull families. Biometrika 1997, 84, 641–652. [Google Scholar] [CrossRef]
Gupta, R.C.; Gupta, P.L.; Gupta, R.D. Modeling failure time data by Lehman alternatives. Commun. Stat. Theory Methods 1998, 27, 887–904. [Google Scholar] [CrossRef]
Eugene, N.; Lee, C.; Famoye, F. Beta-normal distribution and its applications. Commun. Stat. Theory Methods 2002, 31, 497–512. [Google Scholar] [CrossRef]
Gleaton, J.U.; Lynch, J.D. Properties of generalized log-logistic families of lifetime distributions. J. Prob. Stat. Sci. 2006, 4, 51–64. [Google Scholar]
Shaw, W.T.; Buckley, I.R. The alchemy of probability distributions: Beyond Gram-Charlier expansions, and a skew-kurtotic-normal distribution from a rank transmutation map. arXiv 2009, arXiv:0901.0434. [Google Scholar]
Zografos, K.; Balakrishnan, N. On families of beta- and generalized gamma generated distributions and associated inference. Stat. Methodol. 2009, 6, 344–362. [Google Scholar] [CrossRef]
Cordeiro, G.M.; de-Castro, M. A new family of generalized distributions. J. Stat. Comput. Simul. 2011, 81, 883–898. [Google Scholar] [CrossRef]
Alexander, C.; Cordeiro, G.M.; Ortega, E.M.M.; Sarabia, J.M. Generalized beta-generated distributions. Comput. Stat. Data Anal. 2012, 56, 1880–1897. [Google Scholar] [CrossRef]
Ristić, M.M.; Balakrishnan, N. The gamma-exponentiated exponential distribution. J. Stat. Comput. Simul. 2012, 82, 1191–1206. [Google Scholar] [CrossRef]
Cordeiro, G.M.; Ortega, E.M.M.; Cunha, D.C.C. The exponentiated generalized class of distributions. J. Data Sci. 2013, 11, 1–27. [Google Scholar]
Bourguignon, M.; Silva, R.B.; Cordeiro, G.M. The Weibull-G family of probability distributions. J. Data Sci. 2014, 12, 53–68. [Google Scholar]
Tahir, M.H.; Cordeiro, G.M.; Alizadeh, M.; Mansoor, M.; Zubair, M.; Hamedani, G.G. The odd generalized exponential family of distributions with applications. J. Stat. Distrib. App. 2015, 2. [Google Scholar] [CrossRef]
Tahir, M.H.; Cordeiro, G.M.; Alizadeh, M.; Mansoor, M.; Zubair, M. The logistic-X family of distributions and its applications. Commun. Stat. Theory Methods 2016, 45, 7326–7349. [Google Scholar] [CrossRef]
Rezaei, S.; Sadr, B.B.; Alizadeh, M.; Nadarajah, S. Topp-Leone generated family of distributions: Properties and applications. Commun. Stat. Theory Methods 2017, 46, 2893–2909. [Google Scholar] [CrossRef]
Tahir, M.H.; Nadarajah, S. Parameter induction in continuous univariate distributions: Well-established G families. An. Acad. Bras. Ciênc 2015, 87, 539–568. [Google Scholar] [CrossRef]
Tahir, M.H.; Cordeiro, G.M. Compounding of distributions: A survey and new generalized classes. J. Stat. Distrib. App. 2016, 3. [Google Scholar] [CrossRef]
Kumaraswamy, P. Generalized probability density function for double bounded random processes. J. Hydrol. 1980, 46, 79–88. [Google Scholar] [CrossRef]
Lai, C.D.; Xie, M.; Murthy, D.N.P. A modified Weibull distribution. IEEE Trans. Reliab. 2003, 52, 33–37. [Google Scholar] [CrossRef]
Bebbington, M.; Lai, C.D.; Zitikis, R. A flexible Weibull extension. Reliab. Eng. Syst. Saf. 2007, 92, 719–726. [Google Scholar] [CrossRef]
Alzaatreh, A.; Famoye, F.; Lee, C. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef]
Torabi, H.; Montazeri, N.H. The logistic-uniform distribution and its application. Commun. Stat. Simul. Comput. 2014, 43, 2551–2569. [Google Scholar] [CrossRef]
Al-Aqtash, R.; Lee, C.; Famoye, F. Gumbel-Weibull distribution: Properties and applications. J. Mod. Appl. Stat. Methods 2014, 13, 201–225. [Google Scholar] [CrossRef]
Ahmad, Z.; Elgarhy, M.; Hamedani, G.G. A new Weibull-X family of distributions: Properties, characterizations and applications. J. Stat. Distrib. Appl. 2018, 5. [Google Scholar] [CrossRef]
Torabi, H.; Montazeri, N.H. The gamma-uniform distribution and its application. Kybernetika 2012, 48, 16–30. [Google Scholar]
Silva, F.S.; Percontini, A.; de-Brito, E.; Ramos, M.W.; Venancio, R.; Cordeiro, G.M. The odd Lindley-G family of distribution. Austrian J. Stat. 2017, 46, 65–87. [Google Scholar] [CrossRef]
Cordeiro, G.M.; Alizadeh, M.; Ramires, T.G.; Ortega, E.M.M. The generalized odd half-Cauchy family of distributions: Properties and applications. Commun. Stat. Theory Methods 2018, 46, 5685–5705. [Google Scholar] [CrossRef]
Alizadeh, M.; Altun, E.; Cordeiro, G.M.; Rasekhi, M. The odd power-Cauchy family of distributions: Properties, regression models and applications. J. Stat. Comput. Simul. 2018, 88, 785–807. [Google Scholar] [CrossRef]
Cordeiro, G.M.; Yousof, H.M.; Ramires, T.G.; Ortega, E.M.M. The Burr XII system of densities: Properties, regression model and applications. J. Stat. Comput. Simul. 2017, 88, 432–456. [Google Scholar] [CrossRef]
Hassan, A.S.; Hemeda, S.E.; Maiti, S.S.; Pramanik, S. The generalized additive Weibull-G family of distributions. Int. J. Stat. Prob. 2017, 6, 65–83. [Google Scholar] [CrossRef]
Hassan, A.S.; Nassr, S.G. Power Lindley-G family of distributions. Ann. Data Sci. 2019, 6, 189–210. [Google Scholar] [CrossRef]
Maiti, S.S.; Pramanik, S. A generalized Xgamma generator family of distributions. arXiv 2009, arXiv:1805.03892. [Google Scholar]
El-Morshedy, M.; Eliwa, M.S. The odd flexible Weibull-H family of distributions: Properties and estimation with applications to complete and upper record data. Filomat 2019, 33, 2635–2652. [Google Scholar] [CrossRef]
Alizadeh, M.; Afify, A.Z.; Eliwa, M.S.; Ali, S. The odd log-logistic Lindley-G family of distributions: Properties, Bayesian and non-Bayesian estimation with applications. Comput. Stat. 2020, 35, 281–308. [Google Scholar] [CrossRef]
El-Morshedy, M.; Eliwa, M.S.; Afify, A.Z. The odd Chen generator of distributions: Properties and estimation methods with applications in medicine and engineering. J. Natl. Sci. Found. Sri Lanka 2020, 48, 113–130. [Google Scholar]
Eliwa, M.S.; El-Morshedy, M.; Ali, S. Exponentiated odd Chen-G family of distributions: Statistical properties, Bayesian and non-Bayesian estimation with applications. J. Appl. Stat. 2020. [Google Scholar] [CrossRef]
Cordeiro, G.M.; Ortega, E.M.M.; Nadarajah, S. The Kumaraswamy Weibull distribution with application to failure data. J. Frankl. Inst. 2010, 347, 1399–1429. [Google Scholar] [CrossRef]
Lee, C.; Famoye, F.; Olumolade, O. Beta-Weibull distribution: Some properties and applications to censored data. J. Mod. Appl. Stat. Methods 2007, 6, 173–186. [Google Scholar] [CrossRef]
Oguntunde, P.E.; Odetunmibi, O.A.; Adejum, A.O. On the exponentiated generalized Weibull distribution: A generalization of the Weibull distribution. Indian J. Sci. Technol. 2015, 8, 1–7. [Google Scholar] [CrossRef]
Cordeiro, G.M.; Hashimoto, E.M.; Ortega, E.M.M. The McDonald Weibull model. Statistics 2012, 48, 256–278. [Google Scholar] [CrossRef]
Cordeiro, G.M.; Aristizábal, W.D.; Suárez, D.M.; Lozano, S. The gamma modified Weibull distribution. Chil. J. Stat. 2011, 6, 37–48. [Google Scholar]
Da-Cruz, J.N.; Ortega, E.M.M.; Cordeiro, G.M. The log-odd log-logistic Weibull regression model: Modelling, estimation, influential diagnostics and residual analysis. J. Stat. Comput. Simul. 2016, 86, 1516–1538. [Google Scholar] [CrossRef]
Ghitany, M.E.; Al-Hussaini, E.K.; Al-Jarallah, R.A. Marshall-Olkin extended Weibull distribution and its application to censored data. J. Appl. Stat. 2005, 32, 1025–1034. [Google Scholar] [CrossRef]
Khan, M.S.; King, R.; Hudson, I.L. Transmuted Weibull distribution: Properties and estimation. Commun. Stat. Theory Methods 2017, 46, 5394–5418. [Google Scholar] [CrossRef]
Katz, R.W.; Parlange, M.B.; Naveau, P. Statistics of extremes in hydrology. Adv. Water Res. 2002, 25, 1287–1304. [Google Scholar] [CrossRef]
Asgharzadeh, A.; Bakouch, H.S.; Habibi, M. A generalized binomial exponential 2 distribution: Modeling and applications to hydrologic events. J. Appl. Stat. 2017, 44, 2368–2387. [Google Scholar] [CrossRef]
Marshall, A.W.; Olkin, I. A multivariate exponential model. J. Am. Stat. Assoc. 1967, 62, 30–44. [Google Scholar] [CrossRef]
Sarhan, A.M.; Balakrishnan, N. A new class of bivariate distributions and its mixture. J. Multivar. Anal. 2007, 98, 1508–1527. [Google Scholar] [CrossRef][Green Version]
Kundu, D.; Dey, A.K. Estimating the parameters of the Marshall-Olkin bivariate Weibull distribution by EM algorithm. Comput. Stat. Data Anal. 2009, 53, 956–965. [Google Scholar] [CrossRef]
El-Gohary, A.; El-Bassiouny, A.H.; El-Morshedy, M. Bivariate exponentiated modified Weibull extension distribution. J. Stat. Appl. Prob. 2016, 5, 67–78. [Google Scholar] [CrossRef]
Muhammed, H.Z. Bivariate inverse Weibull distribution. J. Stat. Comput. Simul. 2016, 86, 2335–2345. [Google Scholar] [CrossRef]
El-Bassiouny, A.H.; EL-Damcese, M.; Abdelfattah, M.; Eliwa, M.S. Bivariate exponentaited generalized Weibull-Gompertz distribution. J. Appl. Probab. Stat. 2016, 11, 25–46. [Google Scholar]
Ghosh, I.; Hamedani, G.G. On the Ristic-Balakrishnan distribution: Bivariate extension and characterizations. J. Stat. Theory Pract. 2018, 12, 436–449. [Google Scholar] [CrossRef]
El-Morshedy, M.; Alhussain, Z.A.; Atta, D.; Almetwally, E.M.; Eliwa, M.S. Bivariate Burr X generator of distributions: Properties and estimation methods with applications to complete and type-II censored samples. Mathematics 2020, 8, 264. [Google Scholar] [CrossRef]
El-Morshedy, M.; Eliwa, M.S.; El-Gohary, A.; Khalil, A.A. Bivariate exponentiated discrete Weibull distribution: Statistical properties, estimation, simulation and applications. Math. Sci. 2020, 1–14. [Google Scholar] [CrossRef]
Eliwa, M.S.; Alhussain, Z.A.; Ahmed, E.A.; Salah, M.M.; Ahmed, H.H.; El-Morshedy, M. Bivariate Gompertz generator of distributions: Statistical properties and estimation with application to model football data. J. Natl. Sci. Found. 2020, 48, 54–72. [Google Scholar] [CrossRef]
Eliwa, M.S.; El-Morshedy, M. Bivariate Gumbel-G family of distributions: Statistical properties, Bayesian and non-Bayesian estimation with application. Ann. Data Sci. 2019, 6, 39–60. [Google Scholar] [CrossRef]
Gijbels, I.; Omelka, M.; Pesta, M.; Veraverbeke, N. Score tests for covariate effects in conditional copulas. J. Multivar. Anal. 2017, 159, 111–133. [Google Scholar] [CrossRef]
Husková, M.; Maciak, M. Discontinuities in robust nonparametric regression with alpha-mixing dependence. J. Nonparametr. Stat. 2017, 29, 447–475. [Google Scholar] [CrossRef]
Relia Softś, R.; Staff, D. Using QALT models to analyze system configurations with load sharing. Reliab. Edge 2002, 3, 1–4. Available online: http://www.reliasoft.com/pubs/reliabilityedge_4q2002.pdf (accessed on 9 January 2020).

Figure 1. Plots of the NKwW densities for some parameter values.

Figure 2. Plots of the NKwW hazard rate for some parameter values.

Figure 3. Skewness and kurtosis plots of the NKwW model for some values of

α

= 1.5, 0.5, 1.5 and

β

= 2.5, 3.5, 0.5.

Figure 4. Plots of estimated MSEs for selected parameter values.

Figure 5. Plots of estimated biases for selected parameter values.

Figure 6. Estimated (a) density (b) K-M (c) hazard rate, and (d) Box-plots for the data set 1.

Figure 7. Estimated (a) density (b) K-M (c) hazard rate, and (d) Box-plots for the data set 2.

Figure 8. The surface plots of the jpdf, jhrf and jrf of the BvNKwW model for

a = 0.6, b_{1} = 4, b_{2} = 4

,

b_{3} = 4, α = 0.6

and

β = 2.9

.

Figure 9. The surface plots of the jpdf, jhrf and jrf of the BvNKwW model for

a = 1.6, b_{1} = 2, b_{2} = 2

,

b_{3} = 2, α = 1.6

and

β = 2.9

.

Figure 10. The surface plots of the jpdf, jhrf and jrf of the BvNKwW model for

a = 0.8, b_{1} = 1.5, b_{2} = 1.5

,

b_{3} = 1.5, α = 0.9

and

β = 1.9

.

Figure 11. The fitted pdfs plots for

X_{1}

,

X_{2}

and

max (X_{1}, X_{2})

.

Figure 12. The estimated cdfs for

X_{1}

,

X_{2}

and

max (X_{1}, X_{2})

.

Figure 13. The pp-plots for

X_{1}

,

X_{2}

and

max (X_{1}, X_{2})

.

Figure 14. The box, TTT and scatter plots of the marginal models.

Table 1. MLEs and their SEs (in parentheses) for data set 1.

Distribution	$α$	$β$	a	b	$θ$
NKwW	0.0089	1.1514	5.0192	0.5054	–
	(0.0023)	(0.0730)	(1.6716)	(0.1807)	–
KwW	0.0160	1.3962	5.7590	0.3381	–
	(0.0027)	(0.2113)	(1.9264)	(0.1554)	–
BW	0.0219	0.7969	13.2183	1.2340	–
	(0.0076)	(0.1903)	(5.1706)	(0.9787)	–
EGW	0.0086	0.9045	2.0898	10.5512	–
	(0.0022)	(0.1288)	(0.6257)	(4.4416)	–
McW	0.0132	1.2859	1.7925	0.5887	2.5824
	(0.0032)	(0.2544)	(0.6126)	(0.3831)	(0.9474)
GaW	1.1306	0.5469	17.6158	–	–
	(0.0496)	(0.0154)	(1.4970)	–	–
OLLW	0.0045	1.0313	2.4836	–	–
	(0.0003)	(0.1759)	(0.4532)	–	–
MOW	0.0032	3.4739	–	–	0.1039
	(0.0002)	(0.3436)	–	–	(0.0489)
TrW	0.0043	2.4549	–	–	0.6144
	(0.0003)	(0.1755)	–	–	(0.2078)
W	0.0050	2.2745	–	–	–
	(0.0002)	(0.1629)	–	–	–

Table 2. MLEs and their standard errors (in parentheses) for data set 2.

Distribution	$α$	$β$	a	b	$θ$
NKwW	0.1742	0.9887	59.0160	0.2183	–
	(0.0316)	(0.0619)	(0.4024)	(0.0585)	–
KwW	0.1609	1.0252	54.7825	0.2041	–
	(0.0153)	(0.0276)	(0.1358)	(0.0382)	–
BW	0.1320	1.1080	23.0602	0.1940	–
	(0.0073)	(0.0068)	(8.7941)	(0.0324)	–
EGW	0.0090	0.7774	5.5966	10.5493	–
	(0.0041)	(0.1370)	(2.0458)	(5.6821)	–
McW	0.1608	1.0049	14.5078	0.2210	2.5180
	(0.0340)	(0.0466)	(9.8227)	(0.0757)	(0.0895)
GaW	4.6144	0.4983	14.7225	–	–
	(0.1518)	(0.0217)	(1.7239)	–	–
OLLW	0.0154	0.9508	2.3925	–	–
	(0.0021)	(0.3378)	(0.9487)	–	–
MOW	0.0065	3.3556	–	–	0.0145
	(0.0014)	(0.4292)	–	–	(0.0146)
TrW	0.0137	1.9476	–	–	0.7003
	(0.0016)	(0.1941)	–	–	(0.2483)
W	0.0171	1.7719	–	–	–
	(0.0015)	(0.1776)	–	–	–

Table 3. The statistics AIC, BIC, HQIC,

A^{*}

,

W^{*}

, K-S and p-value for data set 1.

Table 3. The statistics AIC, BIC, HQIC,

A^{*}

,

W^{*}

, K-S and p-value for data set 1.

								K-S
Distribution	$\hat{ℓ}$	AIC	BIC	HQIC	$A^{*}$	$W^{*}$	K-S	P-Value
NKwW	565.2337	1138.4670	1148.8880	1142.6850	0.1722	0.0207	0.0454	0.9863
KwW	566.6253	1141.2510	1151.6710	1145.4680	0.3678	0.0477	0.0572	0.8987
BW	566.2292	1140.4580	1150.8790	1144.6760	0.3149	0.0411	0.0489	0.9707
EGW	566.2248	1140.4500	1150.8700	1144.6670	0.3266	0.0427	0.0487	0.9718
McW	567.4362	1144.8720	1157.8980	1150.1440	0.4868	0.0655	0.0596	0.8695
GaW	567.2618	1140.5240	1148.3390	1143.6870	0.5071	0.0689	0.0547	0.9257
OLLW	569.6909	1145.3820	1153.1970	1148.5450	0.6649	0.0932	0.0807	0.5335
MOW	568.4818	1142.9640	1150.7790	1146.1270	0.6431	0.0866	0.0595	0.8713
TrW	573.7855	1153.5710	1161.3870	1156.7340	1.4659	0.2183	0.0872	0.4321
W	576.1180	1156.2360	1161.4460	1158.3450	1.8275	0.2767	0.0936	0.3450

Table 4. The statistics AIC, BIC, HQIC,

A^{*}

,

W^{*}

, K-S and p-value for data set 2.

Table 4. The statistics AIC, BIC, HQIC,

A^{*}

,

W^{*}

, K-S and p-value for data set 2.

								K-S
Distribution	$\hat{ℓ}$	AIC	BIC	HQIC	$A^{*}$	$W^{*}$	K-S	P-Value
NKwW	215.1742	438.3485	445.8333	441.1770	0.2003	0.0277	0.0776	0.9346
KwW	215.5195	439.0389	446.5238	441.8675	0.2495	0.0347	0.0834	0.8924
BW	216.1573	440.3147	447.7995	443.1432	0.3387	0.0477	0.0973	0.7538
EGW	218.1801	444.3601	451.8449	447.1887	0.6147	0.0913	0.0973	0.7543
McW	215.7566	441.5132	450.8692	445.0489	0.2699	0.0374	0.0837	0.8895
GaW	219.4700	444.9401	450.5537	447.0615	0.8278	0.1250	0.1176	0.5203
OLLW	220.4104	446.8208	452.4344	448.9422	0.9051	0.1388	0.0934	0.7966
MOW	218.2594	442.5187	448.1323	444.6401	0.5773	0.0868	0.0791	0.9247
TrW	224.0997	454.1994	459.8130	456.3208	1.5006	0.2372	0.1291	0.4001
W	225.7065	455.4131	459.1555	456.8273	1.7286	0.2765	0.1399	0.3048

Table 5. The -L, K-S and p-values for

X_{1}

,

X_{2}

and

max (X_{1}, X_{2})

.

Table 5. The -L, K-S and p-values for

X_{1}

,

X_{2}

and

max (X_{1}, X_{2})

.

	$X_{1}$			$X_{2}$			$max (X_{1}, X_{2})$
Model	−L	K-S	P-Value	−L	K-S	P-Value	−L	K-S	P-Value
NKwW	$100.2890$	$0.2376$	$0.2614$	$102.9142$	$0.0902$	$0.9956$	$101.1965$	$0.1372$	$0.8871$

Table 6. The MLEs with its (SE) and goodness of fit measures for Motors data.

	Model
Statistic	BvNKwW	BvGPW	BvEW	BvW	BvGEx	BvEx	BvGLFR
$\hat{a}$	$1.6395$	$0.0291$	$0.5203$	$0.0389$	$0.0137$	−	$6.99 \times 10^{- 5}$
	$(0.0651)$	$(0.0557)$	$(0.0511)$	$(0.0158)$	$(0.0023)$	−	$(1.09 \times 10^{- 5})$
$\hat{b_{1}}$	$3.1333$	1.5591	30.1381	0.2004	2.4541	0.0023	0.4171
	$(0.2364)$	$(3.0428)$	$(9.6756)$	$(0.0511)$	$(1.0189)$	$(0.0005)$	$(9.71 \times 10^{- 7})$
$\hat{b_{2}}$	$3.3989$	$1.8581$	$24.1351$	$0.2383$	$2.8803$	$0.0021$	$0.4864$
	$(0.1896)$	$(3.6787)$	$(7.6763)$	$(0.0513)$	$(1.1158)$	$(0.0005)$	$(1.05 \times 10^{- 6})$
$\hat{b_{3}}$	$4.2869$	$3.7191$	$61.8051$	$0.3381$	$6.0641$	$0.0051$	$1.0188$
	$(0.0985)$	$(7.2630)$	$(6.3779)$	$(0.0622)$	$(1.8113)$	$(0.0009)$	$(1.33 \times 10^{- 6})$
$\hat{α}$	$0.0008$	$0.0291$	$0.5203$	$0.0389$	$0.0137$	−	$6.99 \times 10^{- 5}$
	$(0.0001)$	$(0.0557)$	$(0.0511)$	$(0.0158)$	$(0.0023)$	−	$(1.09 \times 10^{- 5})$
$\hat{β}$	$1.2532$	−	−	−	−	−	−
	$(0.3421)$	−	−	−	−	−	−
$- L$	$211.1711$	$431.7909$	$339.2656$	$422.9532$	$335.2312$	$355.7323$	$331.7681$
AIC	$434.3422$	$871.5818$	$688.5312$	$853.9064$	$678.4624$	$717.4646$	$673.5362$
CAIC	$441.9786$	$874.6587$	$693.5312$	$856.9833$	$681.5393$	$719.1789$	$678.5362$
BIC	$439.6844$	$875.14328$	$692.9831$	$857.4679$	$682.0239$	$720.1357$	$677.9881$
HQIC	$435.0788$	$872.0729$	$689.1451$	$854.3975$	$678.9535$	$717.8329$	$674.1501$

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

A New Kumaraswamy Generalized Family of Distributions with Properties, Applications, and Bivariate Extension

Abstract

1. Introduction

2. The NKw-G Family

3. Properties of the NKw-G Family

3.1. Quantile Function

3.2. Asymptotics

3.3. Analytic Shapes of the Density and Hazard Rate Function

3.4. Linear Representation of the NKw-G Density

3.5. Mathematical Properties

3.6. Estimation

4. The NKwW Distribution

4.1. Linear Representation

4.2. Properties

4.3. Quantile Function and Simulation Study

4.4. Estimation

5. Empirical Illustrations of NKwW Model

6. Bivariate New Kumaraswamy G-Family

The MLE for the BvNKw-G Family

7. Empirical Illustrations of BvNKwW Model Through Motors Data

8. Concluding Remarks

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Appendix A

References

Article Metrics

Citations

Article Access Statistics