The Weibull-Gamma Distribution: Properties and Applications

Klakattawi, Hadeel S.

doi:10.3390/e21050438

Open AccessArticle

The Weibull-Gamma Distribution: Properties and Applications

by

Hadeel S. Klakattawi

Department of Statistics, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia

Entropy 2019, 21(5), 438; https://doi.org/10.3390/e21050438

Submission received: 28 March 2019 / Revised: 16 April 2019 / Accepted: 23 April 2019 / Published: 26 April 2019

(This article belongs to the Section Information Theory, Probability and Statistics)

Download

Browse Figures

Versions Notes

Abstract

:

A new member of the Weibull-generated (Weibull-G) family of distributions—namely the Weibull-gamma distribution—is proposed. This four-parameter distribution can provide great flexibility in modeling different data distribution shapes. Some special cases of the Weibull-gamma distribution are considered. Several properties of the new distribution are studied. The maximum likelihood method is applied to obtain an estimation of the parameters of the Weibull-gamma distribution. The usefulness of the proposed distribution is examined by means of five applications to real datasets.

Keywords:

weibull distribution; gamma distribution; transformed-transformer family of distributions; maximum likelihood estimation; simulation study

1. Introduction

Probability distributions provide important information about the statistical inference and analysis of data. These results can be used for considering some well-grounded decisions. Hence, it is essential to have a distribution that accurately reflects the data. Different classical distributions have been broadly applied for the description of real-world phenomena and the modeling of data in different disciplines, including insurance, economics, finance, engineering, biology, industry, and medical sciences. However, many of these standard distributions have limitations in fitting some of the real data accurately. Therefore, it is of great importance to identify which distribution should be applied to model the data. Knowledge of the appropriate distribution greatly improves the efficiency of any statistical inference related to data sets. Hence, researchers are always trying to extend existing classical distributions, to improve their goodness-of-fit and obtain more flexibility and adaptability in modeling data in practice. This paper proposes a new modified distribution, which has increased flexibility to fit various data in practice. Many approaches to extend the existing standard models and develop generalized classes of distributions have been suggested in the literature; for an excellent review of various approaches in generating distributions, one may refer to [1].

Recently, Alzaatreh et al. [2] proposed a new approach for generating families of distributions, namely the transformed-transformer (T-X) family of distributions. These families can be defined for any baseline distribution with probability density function (pdf) g, cumulative density function (cdf) G, and a parameter vector

ζ

, by applying a function to its cdf. To illustrate, assume a continuous generator random variable (RV) T is defined on

[a, b]

with pdf

r (t)

and cdf

R (t)

. Then, the cdf of the T-X family of distributions for a RV X is defined as

F (x) = \int_{a}^{W (G (x; ζ))} r (t) d t,

which can be written as

F (x) = R (W (G (x; ζ))),

(1)

where

W (G (x; ζ)

is a function of the cdf G which satisfies

W (G (x; ζ) \in [a, b]

,

W (G (x; ζ)

is differentiable and monotonically non-decreasing,

W (G (x; ζ) \to a

as

x \to - \infty

, and

W (G (x; ζ) \to b

as

x \to \infty

. The corresponding pdf, associated with Equation (1), can be found as

f (x) = \{\frac{d}{d x} W (G (x; ζ))\} r (W (G (x; ζ))) .

Different forms of the upper limit W can be used to generate different types of the T-X family of distributions. Additionally, the term “generated” (which we will denote G, for short) illustrates that for each baseline distribution G, a different distribution F can be obtained; that is, for each family, several sub-models can be derived according to the choice of the distribution G. For more details, see [2], where they choose

W (G (x ζ)) = - l o g (1 - (G (x ζ)))

in order to introduce the gamma-G, beta-exponential-G, and Weibull-G families. Many T-X generated distributions have been proposed recently. For example, the gamma-G family was introduced in [3], the Lomax-G family was introduced in [4], the Lindley-G family was introduced in [5], the Gompertz-G family was introduced in [6], the generalized Burr-G family was introduced in [7], the power Lindley-G family was introduced in [8], and the odd Lomax-G family was introduced in [9], among others.

The gamma distribution can be considered to be one of the most commonly applied lifetime distributions in different fields. A RV X is said to have a gamma distribution, with shape parameter

k > 0

and a scale parameter

s > 0

, if the pdf and cdf of X are, respectively, given as

g (x; k, s) = \frac{1}{s^{k} Γ (k)} x^{k - 1} e^{- \frac{x}{s}}; x \geq 0

(2)

and

G (x; k, s) = \frac{γ (k, \frac{x}{s})}{Γ (k)},

(3)

where

Γ (k)

is the gamma function and

γ (k, x)

is the incomplete gamma function. given respectively as

Γ (k) = \int_{0}^{\infty} t^{k - 1} e^{- t} d t a n d γ (k, \frac{x}{s}) = \int_{0}^{\frac{x}{s}} t^{k - 1} e^{- t} d t .

(4)

Many attempts have been made to increase the flexibility of this distribution by introducing some new distribution with additional parameter(s), such as the exponentiated gamma (EG). This distribution can be considered to be a member of the exponentiated class of distributions proposed in [10], where the cdf of any classical distribution is raised to a power (shape) parameter. Thus, the cdf of the EG takes the form

F (x; s, k, α) = {[\frac{γ (k, \frac{x}{s})}{Γ (k)}]}^{α} .

(5)

See, for example [11], where the one-parameter gamma (with scale parameter

s = 1

), is exponentiated.

This paper proposes a new modification of the gamma distribution, based on the Weibull-G (W-G) family of distributions. The additional parameters presented by the Weibull generator might enhance the flexibility of the distribution. The rest of the paper is organized as follows. In Section 2, the W-G family is discussed. A member of this family—namely the Weibull-gamma (W-g) distribution—is introduced in Section 3. Some special cases of the W-g distribution are examined in Section 4. In Section 5, some of the properties of the new distribution are briefly discussed. The method of maximum likelihood for estimating the parameters of the W-g distribution is discussed in Section 6. Then, the consistency and precision of these estimates, by means of some Monte Carlo simulation studies, are investigated in Section 7. Finally, five real datasets are applied, in Section 8, to investigate the flexibility and usefulness of the W-g distribution.

2. The Weibull-Generated Family

A RV T is said to have a Weibull distribution, with shape parameter

c > 0

and scale parameter

β > 0

, if its pdf and cdf are, respectively, given by

r (t; c, β) = \frac{c}{β} {(\frac{t}{β})}^{c - 1} e^{- {(\frac{t}{β})}^{c}}; t \geq 0, and

(6)

R (t; c, β) = 1 - e^{- {(\frac{t}{β})}^{c}} .

(7)

Assuming that

G (x; ζ)

is a cdf of the baseline distribution with parameter vector

ζ

, the cdf of the W-G distribution can be derived by replacing t in Equation (7) by

W (G (x; ζ))

, as follows

\begin{matrix} F (x, c, β, ζ) & = \int_{0}^{W (G (x; ζ))} \frac{c}{β} {(\frac{t}{β})}^{c - 1} e^{- {(\frac{t}{β})}^{c}} d t \\ = 1 - e^{- {(\frac{W (G (x; ζ))}{β})}^{c}} . \end{matrix}

(8)

Then, the W-G distribution is obtained with two extra parameters, c and

β

, for any baseline distribution G distribution. Different types of W-G can be found, based on the choice of the upper limit of the integral

W (G (x; ζ))

. In other words, ref [12] assumed

W (G (x)) = - l o g (1 - G^{α} (x; ζ))

for

α > 0

to introduce the exponentiated W-G, ref [13] considered

W (G (x; ζ)) = \frac{G (x; ζ)}{1 - G (x; ζ)}

, ref [14] used the form of

W (G (x; ζ)) = \frac{1}{1 - G (x; ζ)}

, and [15] assumed

W (G (x; ζ)) = - l o g (G (x; ζ))

. In this paper,

W (G (x; ζ)) = - l o g (1 - G (x; ζ))

, which was discussed by [2,16,17], is considered in particular. The cdf of the W-G distribution is defined as

\begin{matrix} F (x; c, β, ζ) & = \int_{0}^{- l o g (1 - G (x; ζ))} \frac{c}{β} {(\frac{t}{β})}^{c - 1} e^{- {(\frac{t}{β})}^{c}} d t \\ = 1 - e^{- {(\frac{- l o g (1 - G (x; ζ))}{β})}^{c}}, \end{matrix}

(9)

with the corresponding pdf

f (x; c, β, ζ) = \frac{c}{β} (\frac{g (x; ζ)}{1 - G (x; ζ)}) {(\frac{- l o g (1 - G (x; ζ))}{β})}^{c - 1} e^{- {(\frac{- l o g (1 - G (x; ζ))}{β})}^{c}} .

(10)

In [18], a transformer X distributed as Pareto distribution was considered, to introduce the Weibull-Pareto distribution. In [16], the logistic distribution was used as a baseline distribution, providing the Weibull-logistic model. The Weibull-log-logistic distribution was discussed in [17] as a special case of the W-G family of distributions. Additionally, ref [19] applied this form of the upper limit for the Rayleigh and discussed the Weibull-Rayleigh distribution.

3. The Weibull-Gamma Distribution

The W-g distribution is derived as a member of the W-G family of distributions in Equation (9); that is, T is a Weibull RV and X is a gamma RV. Then, the pdf and cdf of the W-g distribution, with a vector of parameters

ζ = \{c, β, k, s\}

, can be found by substituting with Equations (2) and (3) in Equations (9) and (10), as follows

\begin{matrix} f (x; c, β, k, s) = & \frac{c}{β s^{k} Γ (k)} \frac{x^{k - 1} e^{- \frac{x}{s}}}{w (x; k)} {(\frac{- l o g (w (x; k))}{β})}^{c - 1} \times \\ e x p (- {(\frac{- l o g (w (x; k))}{β})}^{c}) \end{matrix}

(11)

and

F (x; c, β, k, s) = 1 - e x p (- {(\frac{- l o g (w (x; k))}{β})}^{c}) .

(12)

The reliability function of the W-g can be obtained, consequently, as

R (x; c, β, k, s) = 1 - F (x; c, β, k, s) = e x p (- {(\frac{- l o g (w (x; k))}{β})}^{c}),

(13)

where

x \geq 0

,

c, β, k, s > 0

, and

w (x; k) = 1 - \frac{γ (k, \frac{x}{s})}{Γ (k)} .

(14)

The hazard function can be defined as

h (x; c, β, k, s) = \frac{f (x; c, β, k, s)}{1 - F (x; c, β, k, s)},

(15)

where

f (x)

and

F (x)

are, respectively, defined by Equations (11) and (12).

Different plots of the pdf and the hazard functions for the W-g distribution are displayed, respectively, in Figure 1 and Figure 2, for some specific parameter values. The density and hazard functions show differing behaviors, based on the values of the parameters. The various possible shapes of the density function, including (approximately) symmetric, skewed, and bimodal, were produced. Additionally, several shapes, including monotonically decreasing, monotonically increasing, unimodal, bathtub, and U shapes, can be obtained for the hazard function of the W-g, for different combinations of the values of the parameters. This illustrates the great flexibility of the W-g distribution, which make it suitable for various real data.

4. Some Special Cases of the Weibull-Gamma Distribution

If $c = β = k = s = 1$ , the W-g distribution reduces to the standard exponential distribution, with pdf as follows

$f (x) = e^{- x}; x > 0 .$
When $c = β = 1$ in the W-g model, the gamma distribution in Equation (2) with shape parameter k and scale parameter s is obtained.
If $c = 1$ , the W-g distribution reduces to the exponential-gamma distribution, with pdf as follows

$f (x; β, k, s) = \frac{x^{k - 1} e^{- \frac{x}{s}}}{β s^{k} Γ (k)} {(1 - \frac{γ (k, \frac{x}{s})}{Γ (k)})}^{\frac{1}{β} - 1}; x > 0,$

where $Γ (k)$ and $γ (k, x)$ are, respectively, defined by Equation (4).

5. Properties

Providing some mathematical expansions to find some characteristics of the W-g distribution might be more reasonable than numerically solving the integrals of the pdf given by Equation (11) to derive these properties. Hence, some mathematical properties are provided here using algebraic expansions, which can be carried out using any computational software platform which can deal with analytic expressions.

5.1. Useful Expansions

In the following, we show an alternative formula for the pdf of the W-g distribution given in Equation (11). Using the power series for the exponential function

e^{- x} = \sum_{a_{1} = 0}^{\infty} \frac{{(- 1)}^{a_{1}}}{a_{1}!} x^{a_{1}},

we obtain

f (x; c, β, k, s) = \frac{c}{β s^{k} Γ (k)} \frac{x^{k - 1} e^{- \frac{x}{s}}}{w (x; k)} \sum_{a_{1} = 0}^{\infty} \frac{{(- 1)}^{a_{1}}}{a_{1}!} {(\frac{- l o g (w (x; k))}{β})}^{a_{1} c + c - 1} .

Applying the binomial theorem, which defines

{(1 - x)}^{- 1}

as

{(1 - x)}^{- 1} = \sum_{a_{2} = 0}^{\infty} x^{a_{2}},

to expand

w {(x; k)}^{- 1}

(where the definition of

w (x; k)

is given in Equation (14)), the pdf can be reduced to

\begin{matrix} f (x; c, β, k, s) = \frac{c}{β s^{k} Γ (k)} & x^{k - 1} e^{- \frac{x}{s}} \sum_{a_{2} = 0}^{\infty} {(\frac{γ (k, \frac{x}{s})}{Γ (k)})}^{a_{2}} \times \\ \sum_{a_{1} = 0}^{\infty} \frac{{(- 1)}^{a_{1}}}{a_{1}!} {(\frac{1}{β})}^{a_{1} c + c - 1} {(- l o g (1 - \frac{γ (k, \frac{x}{s})}{Γ (k)}))}^{a_{1} c + c - 1} . \end{matrix}

Furthermore, ref [17,20] applied the generalized binomial theorem to prove that

{(- l o g (1 - x))}^{a} = a \sum_{a_{3} = 0}^{\infty} \sum_{a_{4} = 0}^{a_{3}} \frac{{(- 1)}^{a_{4} + a_{3}} (\binom{a_{3} - a}{a_{3}}) (\binom{a_{3}}{a_{4}}) p_{a_{4}, a_{3}}}{(a - a_{4})} x^{a + a_{3}} .

Consequently, the pdf can be obtained as

\begin{matrix} f (x; c, β, k, s) = & \frac{c}{β s^{k} Γ (k)} x^{k - 1} e^{- \frac{x}{s}} \times \\ \sum_{a_{1}, a_{2}, a_{3} = 0}^{\infty} \sum_{a_{4} = 0}^{a_{3}} & \frac{{(- 1)}^{a_{1} + a_{3} + a_{4}} (a_{1} c + c - 1) (\binom{a_{3} - a_{1} c - c + 1}{a_{3}}) (\binom{a_{3}}{a_{4}}) p_{a_{4}, a_{3}}}{(a_{1}!) β^{a_{1} c + c - 1} (a_{1} c + c - a_{4} - 1)} \times \\ {(\frac{γ (k, \frac{x}{s})}{Γ (k)})}^{a_{1} c + c + a_{2} + a_{3} - 1}, \end{matrix}

where the constant

p_{a_{4}, a_{3}}

can be found recursively by

p_{a_{4}, a_{3}} = a_{3}^{- 1} \sum_{l = 1}^{a_{3}} (a_{3} - l (a_{4} + 1)) c_{l} p_{a_{4}, a_{3} - l},

for

a_{3} = 1, 2, \dots

,

p_{a_{4}, 0} = 1

, and

c_{a_{3}} = {(- 1)}^{a_{3} + 1} {(a_{3} + 1)}^{- 1}

.

By application of the expansion for the incomplete gamma function

γ (a, x)

using the power series, presented in [21] as

γ (a, x) = x^{a} \sum_{a_{5} = 0}^{\infty} \frac{{(- 1)}^{a_{5}} x^{a_{5}}}{a_{5}! (a + a_{5})},

we obtain

\begin{matrix} f (x; c, β, k, s) = & \frac{c}{β s^{k} Γ (k)} x^{k - 1} e^{- \frac{x}{s}} \times \\ \sum_{a_{1}, a_{2}, a_{3} = 0}^{\infty} \sum_{a_{4} = 0}^{a_{3}} & \frac{{(- 1)}^{a_{1} + a_{3} + a_{4}} (a_{1} c + c - 1) (\binom{a_{3} - a_{1} c - c + 1}{a_{3}}) (\binom{a_{3}}{a_{4}}) p_{a_{4}, a_{3}}}{(a_{1}!) β^{a_{1} c + c - 1} (a_{1} c + c - a_{4} - 1)} {(\frac{x}{s})}^{k (a_{1} c + c + a_{2} + a_{3} - 1)} \times \\ {(\frac{1}{Γ (k)})}^{a_{1} c + c + a_{2} + a_{3} - 1} {(\sum_{a_{5} = 0}^{\infty} \frac{{(- 1)}^{a_{5}} {(\frac{x}{s})}^{a_{5}}}{a_{5}! (k + a_{5})})}^{a_{1} c + c + a_{2} + a_{3} - 1} . \end{matrix}

Again, according to [21], the power series raised to an integer m can be simplified as

{(\sum_{a_{5} = 0}^{\infty} a_{a_{5}} x^{a_{5}})}^{m} = \sum_{a_{5} = 0}^{\infty} q_{a_{5}} x^{a_{5}},

where

q_{0} = {a_{0}}^{m}

and

q_{v} = \frac{1}{v a_{0}} \sum_{a_{5} = 1}^{v} (a_{5} m - v + a_{5}) a_{a_{5}} q_{v - a_{5}}

for

v \geq 1

.

Hence, assuming

a_{a_{5}} = \frac{{(- 1)}^{a_{5}}}{a_{5}! (k + a_{5})}

yields

\begin{matrix} f (x; c, β, k, s) = & c e^{- \frac{x}{s}} \times \\ \sum_{a_{1}, a_{2}, a_{3}, a_{5} = 0}^{\infty} \sum_{a_{4} = 0}^{a_{3}} & \frac{{(- 1)}^{a_{1} + a_{3} + a_{4}} (a_{1} c + c - 1) (\binom{a_{3} - a_{1} c - c + 1}{a_{3}}) (\binom{a_{3}}{a_{4}}) p_{a_{4}, a_{3}} q_{a_{5}} x^{k (a_{1} c + c + a_{2} + a_{3}) + a_{5} - 1)}}{{(Γ (k))}^{a_{1} c + c + a_{2} + a_{3}} (a_{1}!) s^{k (a_{1} c + c + a_{2} + a_{3}) + a_{5}} β^{a_{1} c + c} (a_{1} c + c - a_{4} - 1)}, \end{matrix}

where

q_{0} = {a_{0}}^{a_{1} c + c + a_{2} + a_{3} - 1}

and

q_{v} = \frac{1}{v a_{0}} \sum_{a_{5} = 1}^{v} (a_{5} (a_{1} c + c + a_{2} + a_{3}) - v) a_{a_{5}} q_{v - a_{5}}

for

v \geq 1

.

If we define

A_{a_{1}, a_{2}, a_{3}, a_{4}, a_{5}} = \frac{c {(- 1)}^{a_{1} + a_{3} + a_{4}} (a_{1} c + c - 1) (\binom{a_{3} - a_{1} c - c + 1}{a_{3}}) (\binom{a_{3}}{a_{4}}) p_{a_{4}, a_{3}} q_{a_{5}}}{{(Γ (k))}^{a_{1} c + c + a_{2} + a_{3}} (a_{1}!) s^{k (a_{1} c + c + a_{2} + a_{3}) + a_{5}} β^{a_{1} c + c} (a_{1} c + c - a_{4} - 1)},

(16)

and using A instead of

A_{a_{1}, a_{2}, a_{3}, a_{4}, a_{5}}

for short, the pdf of the W-g can be rewritten as

f (x; c, β, k, s) = \sum_{a_{1}, a_{2}, a_{3}, a_{5} = 0}^{\infty} \sum_{a_{4} = 0}^{a_{3}} A x^{k (a_{1} c + c + a_{2} + a_{3}) + a_{5} - 1)} e^{- \frac{x}{s}} .

(17)

Using a similar technique, the cdf of the W-g distribution can be obtained as

\begin{matrix} F (x; & c, β, k, s) = 1 - \\ \sum_{a_{1}, a_{2}, a_{3}, a_{5} = 0}^{\infty} \sum_{a_{4} = 0}^{a_{3}} \frac{{(- 1)}^{a_{1} + a_{3} + a_{4}} (a_{1} c) (\binom{a_{3} - a_{1} c}{a_{3}}) (\binom{a_{3}}{a_{4}}) p_{a_{4}, a_{3}} q_{a_{5}}}{{(Γ (k))}^{a_{1} c + a_{3}} (a_{1}!) β^{a_{1} c} (a_{1} c + c - a_{4} - 1)} {(\frac{x}{s})}^{k (a_{1} c + a_{3}) + a_{5})} . \end{matrix}

(18)

5.2. Quantile Function

The pth quantile function (

0 < p < 1

) of the RV X which follows the W-g distribution is obtained by inverting Equation (12) and solving the non-linear equation

γ (k, \frac{x}{s}) = Γ (k) (1 - e^{- β {(- l o g (1 - p))}^{\frac{1}{c}}}) .

(19)

5.3. Moments

From Equation (17), the rth moment of a RV X which follows the W-g distribution can be obtained as

\begin{matrix} μ_{r} = E (X^{r}) = & \sum_{a_{1}, a_{2}, a_{3}, a_{5} = 0}^{\infty} \sum_{a_{4} = 0}^{a_{3}} A \int_{0}^{\infty} x^{r + k (a_{1} c + c + a_{2} + a_{3}) + a_{5} - 1)} e^{- \frac{x}{s}} d x \\ = \sum_{a_{1}, a_{2}, a_{3}, a_{5} = 0}^{\infty} & \sum_{a_{4} = 0}^{a_{3}} A s^{r + k (a_{1} c + c + a_{2} + a_{3}) + a_{5}} Γ (r + k (a_{1} c + c + a_{2} + a_{3}) + a_{5}) . \end{matrix}

(20)

5.4. Moment Generating Function

The moment generating function for the W-g distribution follows, from Equation (17), as

\begin{matrix} M_{x} (t) = & E (e^{t X}) = \sum_{a_{1}, a_{2}, a_{3}, a_{5} = 0}^{\infty} \sum_{a_{4} = 0}^{a_{3}} A \int_{0}^{\infty} x^{k (a_{1} c + c + a_{2} + a_{3}) + a_{5} - 1)} e^{- x (\frac{1}{s} - t)} d x \\ = \sum_{a_{1}, a_{2}, a_{3}, a_{5} = 0}^{\infty} \sum_{a_{4} = 0}^{a_{3}} A {(\frac{s}{1 - t s})}^{k (a_{1} c + c + a_{2} + a_{3}) + a_{5}} Γ (k (a_{1} c + c + a_{2} + a_{3}) + a_{5}) . \end{matrix}

(21)

5.5. Characteristic Function

We can obtain the characteristic function for the W-g distribution, from Equation (17), as follows

\begin{matrix} ϕ_{x} (t) = & E (e^{i t X}) = \sum_{a_{1}, a_{2}, a_{3}, a_{5} = 0}^{\infty} \sum_{a_{4} = 0}^{a_{3}} A \int_{0}^{\infty} x^{k (a_{1} c + c + a_{2} + a_{3}) + a_{5} - 1)} e^{- x (\frac{1}{s} - i t)} d x \\ = \sum_{a_{1}, a_{2}, a_{3}, a_{5} = 0}^{\infty} \sum_{a_{4} = 0}^{a_{3}} A {(\frac{s}{1 - i t s})}^{k (a_{1} c + c + a_{2} + a_{3}) + a_{5}} Γ (k (a_{1} c + c + a_{2} + a_{3}) + a_{5}) . \end{matrix}

(22)

6. Parameter Estimation for Weibull-Gamma Distribution

Assuming a random sample of size n is taken from the W-g distribution in Equation (11). Then, to find the maximum likelihood errors (MLEs) of the vector of parameters

θ = (c, β, k, s)

, we need to find the log-likelihood function, then obtain the partial derivative with respect to each parameter and set these derivatives to zero.

The log-likelihood function ℓ for the W-g can be given as

\begin{matrix} ℓ = & n l o g (c) - n c l o g (β) - n k l o g (s) - n l o g (Γ (k)) \\ + (k - 1) \sum_{i = 1}^{n} l o g (x_{i}) - \frac{1}{s} \sum_{i = 1}^{n} x_{i} - \sum_{i = 1}^{n} l o g (w (x_{i}; k)) \\ + (c - 1) \sum_{i = 1}^{n} l o g (- l o g (w (x_{i}; k))) \\ - β^{- c} \sum_{i = 1}^{n} {(- l o g (w (x_{i}; k)))}^{c} \end{matrix},

(23)

where

w (x; k)

is defined by Equation (14).

The derivatives of Equation (23), with respect to

c, β, k

, and s, respectively, are given by

\begin{matrix} \frac{\partial ℓ}{\partial c} = & \frac{n}{c} - n l o g (β) + \sum_{i = 1}^{n} l o g (- l o g (w (x_{i}; k))) \\ - \sum_{i = 1}^{n} l o g (\frac{- l o g (w (x_{i}; k))}{β}) {(\frac{- l o g (w (x_{i}; k))}{β})}^{c} \end{matrix},

(24)

\frac{\partial ℓ}{\partial β} = \frac{- n c}{β} + \frac{c}{β^{c + 1}} \sum_{i = 1}^{n} {(- l o g (w (x; k)))}^{c},

(25)

\begin{matrix} \frac{\partial ℓ}{\partial k} = & - n l o g (s) - \frac{n}{Γ (k)} \frac{d}{d k} Γ (k) + \sum_{i = 1}^{n} l o g (x_{i}) - \sum_{i = 1}^{n} \frac{\frac{d}{d k} w (x_{i}; k)}{w (x_{i}; k)} \\ + (c - 1) \sum_{i = 1}^{n} \frac{\frac{d}{d k} w (x_{i}; k)}{w (x_{i}; k) l o g (w (x_{i}; k))} + \frac{c}{β^{c}} \sum_{i = 1}^{n} \frac{{(- l o g (w (x_{i}; k)))}^{c - 1} \frac{d}{d k} w (x_{i}; k)}{w (x_{i}; k)}, and \end{matrix}

(26)

\frac{\partial ℓ}{\partial s} = \frac{- n k}{s} + \frac{1}{s^{2}} \sum_{i = 1}^{n} x_{i} .

(27)

Thus, the MLEs of the parameters

c, β, k

, and s can be obtained by setting Equations (24)–(27) to zero and solving them iteratively, using numerical methods such as the Newton-Raphson iteration method. Alternatively, the log-likelihood in Equation (23) can be directly maximized, using any standard non-linear optimization tool.

7. Simulation Study

This section considers some simulation studies to evaluate the performance of the MLEs of the parameters of the W-g distribution. The simulation is considered over several iterations equal to

n s i m = 1000

, and for different sample sizes n with the following cases for the true parameters

θ_{t r}

Case I: $c = 1.5, β = 0.5, k = 0.5, s = 0.4$ , and
Case II: $c = 1.8, β = 0.3, k = 0.5, s = 0.4$

The MLE,

\hat{θ}

, for each parameter can be evaluated using two accuracy measures—the bias and the root mean square error (RMSE)—which can be calculated, respectively, as follows

bias (\hat{θ}) = \frac{\sum_{i = 1}^{n s i m} {\hat{θ}}_{i}}{n s i m} - θ_{t r}

(28)

and

RMSE (\hat{θ}) = \sqrt{\frac{\sum_{i = 1}^{n s i m} {({\hat{θ}}_{i} - θ_{t r})}^{2}}{n s i m}} .

(29)

The Monte Carlo simulation studies were conducted using the R programming language. Table 1 shows the results for the MLE of the parameters of W-g, along with their corresponding average bias and RMSE, respectively. As expected for the method of maximum likelihood, it can be seen that both criteria, bias, and RMSE, generally decreases as the size of the sample n increases and the estimates become closer to the true parameters on average.

8. Applications

This section illustrates the usefulness of the W-g distribution through five different real data sets. The fit of the W-g is compared with some related distributions; namely the gamma distribution in Equation (2) with shape parameter k and scale parameter s, and the Weibull distribution in Equation (6) with shape parameter c and scale parameter

β

. The gamma and Weibull distributions are fitted using the “fitdistr” function from the MASS package in R. Additionally, the fitting is compared with the EG in Equation (5) with power parameter

α

, shape parameter k, and scale parameter s. Also, the data is fitted by the exponentiated exponential (EE), introduced in [22] as an alternative to the gamma and Weibull distributions. The EE is obtained by exponentiating the classical exponential distribution to a power (shape) parameter

α

as

F (x) = {[1 - e^{- s x}]}^{α}

, where s is the scale parameter and

α

is the shape parameter. The results for EG, EE, and W-g were obtained using the package Newdistns, given in [23], in the statistical software R.

In particular, the MLEs of the parameters for each of the distributions with the value of the log-likelihood were computed. Then, to choose the best model among these various models, the Akaike Information Criterion (AIC), given in [24], was computed and the best model is the model with the minimum AIC values. The plots of the expected frequencies for the fitted gamma, Weibull, EE, EG, and W-g were compared with the histograms of the observed frequencies. Furthermore, the empirical cdf was plotted and compared with the estimated cdf for each of the distributions.

8.1. First Dataset

First, we will consider the dataset discussed in [25], which concerns with a large system with 30 units, in which the failure and running times are 2.75, 0.13, 1.47, 0.23, 1.81, 0.30, 0.65, 0.10, 3.00, 1.73, 1.06, 3.00, 3.00, 2.12, 3.00, 3.00, 3.00, 0.02, 2.61, 2.93, 0.88, 2.47, 0.28, 1.43, 3.00, 0.23, 3.00, 0.80, 2.45, and 2.66.

Table 2, Table 3, Table 4, Table 5 and Table 6 shows a summary of the MLEs of the parameters, the log-likelihood, and the AIC for each model. It can be seen that the W-g can be selected as the best model, according to its low AIC when compared to the other fitted distributions. The histogram of the data and plots of the estimated pdf and cdf for each model are displayed in Figure 3, Figure 4, Figure 5, Figure 6 and Figure 7. It is clear that the proposed W-g distribution is the closest to the actual distribution of the data. Therefore, the W-g distribution can be selected as the best model for all datasets.

8.2. Second Dataset

The second dataset consists of the lifetimes of

n = 50

components, given in [26] as: 0.1, 0.2, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 3.0, 6.0, 7.0, 11.0, 12.0, 18.0, 18.0, 18.0, 18.0, 18.0, 21.0, 32.0, 36.0, 40.0, 45.0, 46.0, 47.0, 50.0, 55.0, 60.0, 63.0, 63.0, 67.0, 67.0, 67.0, 67.0, 72.0, 75.0, 79.0, 82.0, 82.0, 83.0, 84.0, 84.0, 84.0, 85.0, 85.0, 85.0, 85.0, 85.0, 86.0, and 86.0.

8.3. Third Dataset

The third dataset also gives the failure and running times of a sample of

n = 30

devices, given in [25] as: 2, 10, 13, 23, 23, 28, 30, 65, 80, 88, 106, 143, 147, 173, 181, 212, 245, 247, 261, 266, 275, 293, 300, 300, 300, 300, 300, 300, 300, and 300.

8.4. Fourth Dataset

The dataset considered here, discussed by [27], presents the waiting times between 65 consecutive eruptions of a blowhole, called the Kiama Blowhole, as follows: 83, 51, 87, 60, 28, 95, 8, 27, 15, 10, 18, 16, 29, 54, 91, 8, 17, 55, 10, 35,47, 77, 36, 17, 21, 36, 18, 40 , 10, 7, 34, 27, 28, 56, 8, 25, 68, 146, 89, 18, 73, 69, 9, 37, 10, 82, 29, 8, 60, 61, 61, 18, 169, 25, 8, 26, 11, 83, 11, 42, 17, 14, 9, and 12.

8.5. Fifth Dataset

The fifth dataset is from [28], and is the monthly actual tax revenues in Egypt between January 2006 and November 2010. These actual taxes, in 1000 million Egyptian pounds, are as follows: 5.9, 20.4, 14.9, 16.2, 17.2, 7.8, 6.1, 9.2, 10.2, 9.6, 13.3, 8.5, 21.6, 18.5, 5.1, 6.7, 17, 8.6, 9.7, 39.2, 35.7, 15.7, 9.7, 10, 4.1, 36, 8.5, 8, 9.2, 26.2, 21.9, 16.7, 21.3, 35.4, 14.3, 8.5, 10.6, 19.1, 20.5, 7.1, 7.7, 18.1, 16.5, 11.9, 7, 8.6, 12.5, 10.3, 11.2, 6.1, 8.4, 11, 11.6, 11.9, 5.2, 6.8, 8.9, 7.1, and 10.8.

9. Conclusions

The W-g distribution, a member of the W-G family, is proposed and discussed. This distribution is introduced as a new four-parameter distribution which extends the classical gamma distribution. This generalization can provide more flexibility in analyzing real data. Some special cases of this distribution are presented. Furthermore, some characteristics of this new distribution are obtained. The maximum likelihood method is applied to estimate the model parameters. Different simulation studies are conducted, with different sample sizes, to verify the consistency of the estimates in terms of the bias and RMSE. The results indicate the good performance of the proposed estimators. The usefulness of the suggested distribution is illustrated by means of five real-life datasets. The proposed W-g distribution can consistently provide a better fit than some other common competitive models. Hence, the new W-g distribution can be applied as a competitive model to fit different real data.

Funding

This research received no external funding.

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

cdf	cumulative distribution function
pdf	probability density function
RV	random variable
MLE	maximum likelihood estimator
W-G	Weibull-generated
W-g	Weibull-gamma
EG	exponentiated gamma
EE	exponentiated exponential
AIC	Akaike Information Criterion
RMSE	root mean squared error

References

Lee, C.; Famoye, F.; Alzaatreh, A.Y. Methods for generating families of univariate continuous distributions in the recent decades. Wiley Interdiscip. Rev. Comput. Stat. 2013, 5, 219–238. [Google Scholar] [CrossRef]
Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef]
Alzaatreh, A.; Famoye, F.; Lee, C. The gamma-normal distribution: Properties and applications. Comput. Stat. Data Anal. 2014, 69, 67–80. [Google Scholar] [CrossRef]
Cordeiro, G.M.; Ortega, E.M.; Popović, B.V.; Pescim, R.R. The Lomax generator of distributions: Properties, minification process and regression model. Appl. Math. Comput. 2014, 247, 465–486. [Google Scholar] [CrossRef]
Cakmakyapan, S.; Ozel, G. The Lindley family of distributions: Properties and applications. Hacet. J. Math. Stat. 2016, 46, 1–27. [Google Scholar] [CrossRef]
Alizadeh, M.; Cordeiro, G.M.; Pinho, L.G.B.; Ghosh, I. The Gompertz-G family of distributions. J. Stat. Theory Pract. 2017, 11, 179–207. [Google Scholar] [CrossRef]
Nasir, M.A.; Tahir, M.; Jamal, F.; Ozel, G. A new generalized Burr family of distributions for the lifetime data. J. Stat. Appl. Probab. 2017, 6, 401–417. [Google Scholar] [CrossRef]
Hassan, A.S.; Nassr, S.G. Power Lindley-G Family of Distributions. In Annals of Data Science; Springer: Berlin/Heidelberg, Germany, 2018; pp. 1–22. [Google Scholar]
Cordeiro, G.M.; Afify, A.Z.; Ortega, E.M.; Suzuki, A.K.; Mead, M.E. The odd Lomax generator of distributions: Properties, estimation and applications. J. Comput. Appl. Math. 2019, 347, 222–237. [Google Scholar] [CrossRef]
Gupta, R.C.; Gupta, P.L.; Gupta, R.D. Modeling failure time data by Lehman alternatives. Commun. Stat. Theory Methods 1998, 27, 887–904. [Google Scholar] [CrossRef]
Nadarajah, S.; Gupta, A.K. The exponentiated gamma distribution with application to drought data. Calcutta Stat. Assoc. Bull. 2007, 59, 29–54. [Google Scholar] [CrossRef]
Alzaghal, A.; Famoye, F.; Lee, C. Exponentiated T-X family of distributions with some applications. Int. J. Stat. Probab. 2013, 2, 31. [Google Scholar] [CrossRef]
Bourguignon, M.; Silva, R.B.; Cordeiro, G.M. The Weibull-G family of probability distributions. J. Data Sci. 2014, 12, 53–68. [Google Scholar]
Nasiru, S.; Luguterah, A. The new weibull-pareto distribution. Pak. J. Stat. Oper. Res. 2015, 11, 103–114. [Google Scholar] [CrossRef]
Tahir, M.; Zubair, M.; Mansoor, M.; Cordeiro, G.M.; Alizadeh, M.; Hamedani, G. A new Weibull-G family of distributions. Hacet. J. Math. Stat. 2016, 45, 629–647. [Google Scholar] [CrossRef]
Alzaatreh, A.; Ghosh, I. On the Weibull-X family of distributions. J. Stat. Theory Appl. 2014, 14, 169–183. [Google Scholar]
Cordeiro, G.M.; Ortega, E.M.; Ramires, T.G. A new generalized Weibull family of distributions: Mathematical properties and applications. J. Stat. Distrib. Appl. 2015, 2, 13. [Google Scholar] [CrossRef]
Alzaatreh, A.; Famoye, F.; Lee, C. Weibull-Pareto distribution and its applications. Commun. Stat. Theory Methods 2013, 42, 1673–1691. [Google Scholar] [CrossRef]
Ahmad, A.; Ahmad, S.; Ahmed, A. Characterization and Estimation of Weibull-Rayleigh Distribution with Applications to Life Time Data. Appl. Math. Inf. Sci. Lett. 2017, 5, 71–79. [Google Scholar] [CrossRef]
Nadarajah, S.; Cordeiro, G.M.; Ortega, E.M. The Zografos–Balakrishnan-G family of distributions: Mathematical properties and applications. Commun. Stat. Theory Methods 2015, 44, 186–215. [Google Scholar] [CrossRef]
Gradshteyn, I.S.; Ryzhik, I.M. Table of Integrals, Series, and Products; Academic Press: Cambridge, MA, USA, 2014. [Google Scholar]
Gupta, R.D.; Kundu, D. Exponentiated exponential family: An alternative to gamma and Weibull distributions. Biom. J. J. Math. Methods Biosci. 2001, 43, 117–130. [Google Scholar] [CrossRef]
Nadarajah, S.; Rocha, R. Newdistns: An R package for new families of distributions. J. Stat. Softw. 2016, 69, 1–32. [Google Scholar] [CrossRef]
Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
Meeker, W.Q.; Escobar, L.A. Statistical Methods for Reliability Data; John Wiley & Sons Inc.: New York, NY, USA, 1998. [Google Scholar]
Aarset, M.V. How to identify a bathtub hazard rate. IEEE Trans. Reliab. 1987, 36, 106–108. [Google Scholar] [CrossRef]
Pinho, L.G.B.; Cordeiro, G.M.; Nobre, J.S. The Harris extended exponential distribution. Commun. Stat. Theory Methods 2015, 44, 3486–3502. [Google Scholar] [CrossRef]
Nassar, M.; Nada, N. The beta generalized Pareto distribution. J. Stat. Adv. Theory Appl. 2011, 6, 1–17. [Google Scholar]

Figure 1. The Weibull-gamma (W-g) probability density functions (pdfs) for various values of c,

β

, k, and s.

Figure 1. The Weibull-gamma (W-g) probability density functions (pdfs) for various values of c,

β

, k, and s.

Figure 2. The W-g hazard functions for various values of c,

β

, k, and s.

Figure 2. The W-g hazard functions for various values of c,

β

, k, and s.

Figure 3. Comparison of W-g distribution with the other distributions for the first dataset. (Left): cdf for each of the fitted distributions. (Right): observed and expected frequencies for each model.

Figure 4. Comparison of W-g distribution with the other distributions for the second dataset. (Left): cdf for each of the fitted distributions. (Right): observed and expected frequencies for each model.

Figure 5. Comparison of W-g distribution with the other distributions for the third dataset. (Left): cdf for each of the fitted distributions. (Right): observed and expected frequencies for each model.

Figure 6. Comparison of W-g distribution with the other distributions for the fourth dataset. (Left): cdf for each of the fitted distributions. (Right): observed and expected frequencies for each model.

Figure 7. Comparison of W-g distribution with the other distributions for the fifth dataset. (Left): cdf for each of the fitted distributions. (Right): observed and expected frequencies for each model.

Table 1. Simulation study: W-g parameter estimates, together with bias and root mean square error (RMSE), for two different cases with different sample sizes. MLE, maximum likelihood error.

Sample Size	Parameter	Case I			Case II
Sample Size	Parameter	MLE	Bias	RMSE	MLE	Bias	RMSE
$n = 30$	c	1.8567	0.3567	1.3371	2.4955	0.6955	1.8919
	$β$	0.9846	0.4846	1.2006	0.8371	0.5371	1.1462
	k	0.8473	0.3473	0.9991	0.7038	0.2038	0.7729
	s	0.3492	−0.0508	1.0025	0.3349	−0.0651	0.9125
$n = 100$	c	1.6269	0.1269	0.8466	1.9732	0.1732	1.0898
	$β$	0.7970	0.2970	0.8186	0.5303	0.2303	0.6201
	k	0.6396	0.1396	0.5129	0.6788	0.1788	0.5824
	s	0.4629	0.0629	0.9147	0.3931	−0.0069	0.8175
$n = 500$	c	1.5168	0.0168	0.4565	1.7818	−0.0182	0.4919
	$β$	0.7535	0.2535	0.7530	0.3760	0.0760	0.2767
	k	0.5373	0.0373	0.2011	0.5534	0.0534	0.2481
	s	0.4577	0.0577	0.4976	0.4053	0.0053	0.4096

Table 2. Estimation for the first dataset.

Distribution	gamma	Weibull	EE	EG	W-g
Parameter estimates	$\hat{k} = 1.1894$	$\hat{c} = 1.265$	$\hat{α} = 1.1543$	$\hat{α} = 0.0210$	$\hat{c} = 6.0638$
	$\hat{s} = 1.4884$	$\hat{β} = 1.8805$	$\hat{s} = 1.6231$	$\hat{k} = 54.853$	$\hat{β} = 6.4448$
				$\hat{s} = 0.0849$	$\hat{k} = 0.0085$
					$\hat{s} = 1.891$
Log-likelihood	−46.8656	−46.1587	−46.9569	−44.3009	−42.1281
AIC	97.7311	96.3175	97.9139	94.6018	92.2563

Table 3. Estimation for the second dataset.

Distribution	gamma	Weibull	EE	EG	W-g
Parameter estimates	$\hat{k} = 0.7991$	$\hat{c} = 0.9492$	$\hat{α} = 0.7802$	$\hat{α} = 0.0655$	$\hat{c} = 5.1941$
	$\hat{s} = 57.1717$	$\hat{β} = 44.9466$	$\hat{s} = 53.4185$	$\hat{k} = 11.8984$	$\hat{β} = 7.5083$
				$\hat{s} = 10.875$	$\hat{k} = 0.0044$
					$\hat{s} = 39.4314$
Log-likelihood	−240.1902	−241.0018	−239.9952	−237.314	−231.7916
AIC	484.3804	486.0037	483.9903	480.628	471.5832

Table 4. Estimation for the third dataset.

Distribution	gamma	Weibull	EE	EG	W-g
Parameter estimates	$\hat{k} = 1.1892$	$\hat{c} = 1.2651$	$\hat{α} = 1.1659$	$\hat{α} = 0.0236$	$\hat{c} = 6.2924$
	$\hat{s} = 148.8595$	$\hat{β} = 188.0556$	$\hat{s} = 161.1306$	$\hat{k} = 47.957$	$\hat{β} = 6.3784$
				$\hat{s} = 9.8477$	$\hat{k} = 0.0084$
					$\hat{s} = 197.8811$
Log-likelihood	−185.0207	−184.3138	−185.113	−182.4996	−180.267
AIC	374.0413	372.6277	374.2259	370.9992	368.5341

Table 5. Estimation for the fourth dataset.

Distribution	gamma	Weibull	EE	EG	W-g
Parameter estimates	$\hat{k} = 1.6208$	$\hat{c} = 1.2744$	$\hat{α} = 1.7317$	$\hat{α} = 7.3683$	$\hat{c} = 0.7086$
	$\hat{s} = 24.5738$	$\hat{β} = 43.205$	$\hat{s} = 28.5766$	$\hat{k} = 0.2578$	$\hat{β} = 4.0834$
				$\hat{s} = 36.8853$	$\hat{k} = 4.6704$
					$\hat{s} = 3.6166$
Log-likelihood	−295.8994	−296.9001	−295.666	−295.2987	−293.5914
AIC	595.7988	597.8003	595.332	596.5974	595.1828

Table 6. Estimation for the fifth dataset.

Distribution	gamma	Weibull	EE	EG	W-g
Parameter estimates	$\hat{k} = 3.6782$	$\hat{c} = 1.8404$	$\hat{α} = 5.5309$	$\hat{α} = 33.1003$	$\hat{c} = 0.5883$
	$\hat{s} = 3.667$	$\hat{β} = 15.306$	$\hat{s} = 5.5966$	$\hat{k} = 0.207$	$\hat{β} = 2.8474$
				$\hat{s} = 7.0303$	$\hat{k} = 16.5821$
					$\hat{s} = 0.5763$
Log-likelihood	−193.0820	−197.2905	−191.2235	−190.3999	−188.3944
AIC	390.164	398.5811	386.4471	386.7998	384.7887

© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Klakattawi, H.S. The Weibull-Gamma Distribution: Properties and Applications. Entropy 2019, 21, 438. https://doi.org/10.3390/e21050438

AMA Style

Klakattawi HS. The Weibull-Gamma Distribution: Properties and Applications. Entropy. 2019; 21(5):438. https://doi.org/10.3390/e21050438

Chicago/Turabian Style

Klakattawi, Hadeel S. 2019. "The Weibull-Gamma Distribution: Properties and Applications" Entropy 21, no. 5: 438. https://doi.org/10.3390/e21050438

APA Style

Klakattawi, H. S. (2019). The Weibull-Gamma Distribution: Properties and Applications. Entropy, 21(5), 438. https://doi.org/10.3390/e21050438

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

The Weibull-Gamma Distribution: Properties and Applications

Abstract

1. Introduction

2. The Weibull-Generated Family

3. The Weibull-Gamma Distribution

4. Some Special Cases of the Weibull-Gamma Distribution

5. Properties

5.1. Useful Expansions

5.2. Quantile Function

5.3. Moments

5.4. Moment Generating Function

5.5. Characteristic Function

6. Parameter Estimation for Weibull-Gamma Distribution

7. Simulation Study

8. Applications

8.1. First Dataset

8.2. Second Dataset

8.3. Third Dataset

8.4. Fourth Dataset

8.5. Fifth Dataset

9. Conclusions

Funding

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI