Abstract
A new member of the Weibull-generated (Weibull-G) family of distributions—namely the Weibull-gamma distribution—is proposed. This four-parameter distribution can provide great flexibility in modeling different data distribution shapes. Some special cases of the Weibull-gamma distribution are considered. Several properties of the new distribution are studied. The maximum likelihood method is applied to obtain an estimation of the parameters of the Weibull-gamma distribution. The usefulness of the proposed distribution is examined by means of five applications to real datasets.
1. Introduction
Probability distributions provide important information about the statistical inference and analysis of data. These results can be used for considering some well-grounded decisions. Hence, it is essential to have a distribution that accurately reflects the data. Different classical distributions have been broadly applied for the description of real-world phenomena and the modeling of data in different disciplines, including insurance, economics, finance, engineering, biology, industry, and medical sciences. However, many of these standard distributions have limitations in fitting some of the real data accurately. Therefore, it is of great importance to identify which distribution should be applied to model the data. Knowledge of the appropriate distribution greatly improves the efficiency of any statistical inference related to data sets. Hence, researchers are always trying to extend existing classical distributions, to improve their goodness-of-fit and obtain more flexibility and adaptability in modeling data in practice. This paper proposes a new modified distribution, which has increased flexibility to fit various data in practice. Many approaches to extend the existing standard models and develop generalized classes of distributions have been suggested in the literature; for an excellent review of various approaches in generating distributions, one may refer to [1].
Recently, Alzaatreh et al. [2] proposed a new approach for generating families of distributions, namely the transformed-transformer (T-X) family of distributions. These families can be defined for any baseline distribution with probability density function (pdf) g, cumulative density function (cdf) G, and a parameter vector , by applying a function to its cdf. To illustrate, assume a continuous generator random variable (RV) T is defined on with pdf and cdf . Then, the cdf of the T-X family of distributions for a RV X is defined as
which can be written as
where is a function of the cdf G which satisfies , is differentiable and monotonically non-decreasing, as , and as . The corresponding pdf, associated with Equation (1), can be found as
Different forms of the upper limit W can be used to generate different types of the T-X family of distributions. Additionally, the term “generated” (which we will denote G, for short) illustrates that for each baseline distribution G, a different distribution F can be obtained; that is, for each family, several sub-models can be derived according to the choice of the distribution G. For more details, see [2], where they choose in order to introduce the gamma-G, beta-exponential-G, and Weibull-G families. Many T-X generated distributions have been proposed recently. For example, the gamma-G family was introduced in [3], the Lomax-G family was introduced in [4], the Lindley-G family was introduced in [5], the Gompertz-G family was introduced in [6], the generalized Burr-G family was introduced in [7], the power Lindley-G family was introduced in [8], and the odd Lomax-G family was introduced in [9], among others.
The gamma distribution can be considered to be one of the most commonly applied lifetime distributions in different fields. A RV X is said to have a gamma distribution, with shape parameter and a scale parameter , if the pdf and cdf of X are, respectively, given as
and
where is the gamma function and is the incomplete gamma function. given respectively as
Many attempts have been made to increase the flexibility of this distribution by introducing some new distribution with additional parameter(s), such as the exponentiated gamma (EG). This distribution can be considered to be a member of the exponentiated class of distributions proposed in [10], where the cdf of any classical distribution is raised to a power (shape) parameter. Thus, the cdf of the EG takes the form
See, for example [11], where the one-parameter gamma (with scale parameter ), is exponentiated.
This paper proposes a new modification of the gamma distribution, based on the Weibull-G (W-G) family of distributions. The additional parameters presented by the Weibull generator might enhance the flexibility of the distribution. The rest of the paper is organized as follows. In Section 2, the W-G family is discussed. A member of this family—namely the Weibull-gamma (W-g) distribution—is introduced in Section 3. Some special cases of the W-g distribution are examined in Section 4. In Section 5, some of the properties of the new distribution are briefly discussed. The method of maximum likelihood for estimating the parameters of the W-g distribution is discussed in Section 6. Then, the consistency and precision of these estimates, by means of some Monte Carlo simulation studies, are investigated in Section 7. Finally, five real datasets are applied, in Section 8, to investigate the flexibility and usefulness of the W-g distribution.
2. The Weibull-Generated Family
A RV T is said to have a Weibull distribution, with shape parameter and scale parameter , if its pdf and cdf are, respectively, given by
Assuming that is a cdf of the baseline distribution with parameter vector , the cdf of the W-G distribution can be derived by replacing t in Equation (7) by , as follows
Then, the W-G distribution is obtained with two extra parameters, c and , for any baseline distribution G distribution. Different types of W-G can be found, based on the choice of the upper limit of the integral . In other words, ref [12] assumed for to introduce the exponentiated W-G, ref [13] considered , ref [14] used the form of , and [15] assumed . In this paper, , which was discussed by [2,16,17], is considered in particular. The cdf of the W-G distribution is defined as
with the corresponding pdf
In [18], a transformer X distributed as Pareto distribution was considered, to introduce the Weibull-Pareto distribution. In [16], the logistic distribution was used as a baseline distribution, providing the Weibull-logistic model. The Weibull-log-logistic distribution was discussed in [17] as a special case of the W-G family of distributions. Additionally, ref [19] applied this form of the upper limit for the Rayleigh and discussed the Weibull-Rayleigh distribution.
3. The Weibull-Gamma Distribution
The W-g distribution is derived as a member of the W-G family of distributions in Equation (9); that is, T is a Weibull RV and X is a gamma RV. Then, the pdf and cdf of the W-g distribution, with a vector of parameters , can be found by substituting with Equations (2) and (3) in Equations (9) and (10), as follows
and
The reliability function of the W-g can be obtained, consequently, as
where , , and
The hazard function can be defined as
where and are, respectively, defined by Equations (11) and (12).
Different plots of the pdf and the hazard functions for the W-g distribution are displayed, respectively, in Figure 1 and Figure 2, for some specific parameter values. The density and hazard functions show differing behaviors, based on the values of the parameters. The various possible shapes of the density function, including (approximately) symmetric, skewed, and bimodal, were produced. Additionally, several shapes, including monotonically decreasing, monotonically increasing, unimodal, bathtub, and U shapes, can be obtained for the hazard function of the W-g, for different combinations of the values of the parameters. This illustrates the great flexibility of the W-g distribution, which make it suitable for various real data.

Figure 1.
The Weibull-gamma (W-g) probability density functions (pdfs) for various values of c, , k, and s.
Figure 2.
The W-g hazard functions for various values of c, , k, and s.
4. Some Special Cases of the Weibull-Gamma Distribution
- If , the W-g distribution reduces to the standard exponential distribution, with pdf as follows
- When in the W-g model, the gamma distribution in Equation (2) with shape parameter k and scale parameter s is obtained.
- If , the W-g distribution reduces to the exponential-gamma distribution, with pdf as followswhere and are, respectively, defined by Equation (4).
5. Properties
Providing some mathematical expansions to find some characteristics of the W-g distribution might be more reasonable than numerically solving the integrals of the pdf given by Equation (11) to derive these properties. Hence, some mathematical properties are provided here using algebraic expansions, which can be carried out using any computational software platform which can deal with analytic expressions.
5.1. Useful Expansions
In the following, we show an alternative formula for the pdf of the W-g distribution given in Equation (11). Using the power series for the exponential function
we obtain
Applying the binomial theorem, which defines as
to expand (where the definition of is given in Equation (14)), the pdf can be reduced to
Furthermore, ref [17,20] applied the generalized binomial theorem to prove that
Consequently, the pdf can be obtained as
where the constant can be found recursively by
for , , and .
By application of the expansion for the incomplete gamma function using the power series, presented in [21] as
we obtain
Again, according to [21], the power series raised to an integer m can be simplified as
where and for .
Hence, assuming yields
where and for .
If we define
and using A instead of for short, the pdf of the W-g can be rewritten as
Using a similar technique, the cdf of the W-g distribution can be obtained as
5.2. Quantile Function
The pth quantile function () of the RV X which follows the W-g distribution is obtained by inverting Equation (12) and solving the non-linear equation
5.3. Moments
5.4. Moment Generating Function
The moment generating function for the W-g distribution follows, from Equation (17), as
5.5. Characteristic Function
We can obtain the characteristic function for the W-g distribution, from Equation (17), as follows
6. Parameter Estimation for Weibull-Gamma Distribution
Assuming a random sample of size n is taken from the W-g distribution in Equation (11). Then, to find the maximum likelihood errors (MLEs) of the vector of parameters , we need to find the log-likelihood function, then obtain the partial derivative with respect to each parameter and set these derivatives to zero.
Thus, the MLEs of the parameters , and s can be obtained by setting Equations (24)–(27) to zero and solving them iteratively, using numerical methods such as the Newton-Raphson iteration method. Alternatively, the log-likelihood in Equation (23) can be directly maximized, using any standard non-linear optimization tool.
7. Simulation Study
This section considers some simulation studies to evaluate the performance of the MLEs of the parameters of the W-g distribution. The simulation is considered over several iterations equal to , and for different sample sizes n with the following cases for the true parameters
- Case I: , and
- Case II:
The MLE, , for each parameter can be evaluated using two accuracy measures—the bias and the root mean square error (RMSE)—which can be calculated, respectively, as follows
and
The Monte Carlo simulation studies were conducted using the R programming language. Table 1 shows the results for the MLE of the parameters of W-g, along with their corresponding average bias and RMSE, respectively. As expected for the method of maximum likelihood, it can be seen that both criteria, bias, and RMSE, generally decreases as the size of the sample n increases and the estimates become closer to the true parameters on average.
Table 1.
Simulation study: W-g parameter estimates, together with bias and root mean square error (RMSE), for two different cases with different sample sizes. MLE, maximum likelihood error.
8. Applications
This section illustrates the usefulness of the W-g distribution through five different real data sets. The fit of the W-g is compared with some related distributions; namely the gamma distribution in Equation (2) with shape parameter k and scale parameter s, and the Weibull distribution in Equation (6) with shape parameter c and scale parameter . The gamma and Weibull distributions are fitted using the “fitdistr” function from the MASS package in R. Additionally, the fitting is compared with the EG in Equation (5) with power parameter , shape parameter k, and scale parameter s. Also, the data is fitted by the exponentiated exponential (EE), introduced in [22] as an alternative to the gamma and Weibull distributions. The EE is obtained by exponentiating the classical exponential distribution to a power (shape) parameter as , where s is the scale parameter and is the shape parameter. The results for EG, EE, and W-g were obtained using the package Newdistns, given in [23], in the statistical software R.
In particular, the MLEs of the parameters for each of the distributions with the value of the log-likelihood were computed. Then, to choose the best model among these various models, the Akaike Information Criterion (AIC), given in [24], was computed and the best model is the model with the minimum AIC values. The plots of the expected frequencies for the fitted gamma, Weibull, EE, EG, and W-g were compared with the histograms of the observed frequencies. Furthermore, the empirical cdf was plotted and compared with the estimated cdf for each of the distributions.
8.1. First Dataset
First, we will consider the dataset discussed in [25], which concerns with a large system with 30 units, in which the failure and running times are 2.75, 0.13, 1.47, 0.23, 1.81, 0.30, 0.65, 0.10, 3.00, 1.73, 1.06, 3.00, 3.00, 2.12, 3.00, 3.00, 3.00, 0.02, 2.61, 2.93, 0.88, 2.47, 0.28, 1.43, 3.00, 0.23, 3.00, 0.80, 2.45, and 2.66.
Table 2, Table 3, Table 4, Table 5 and Table 6 shows a summary of the MLEs of the parameters, the log-likelihood, and the AIC for each model. It can be seen that the W-g can be selected as the best model, according to its low AIC when compared to the other fitted distributions. The histogram of the data and plots of the estimated pdf and cdf for each model are displayed in Figure 3, Figure 4, Figure 5, Figure 6 and Figure 7. It is clear that the proposed W-g distribution is the closest to the actual distribution of the data. Therefore, the W-g distribution can be selected as the best model for all datasets.
Table 2.
Estimation for the first dataset.
Table 3.
Estimation for the second dataset.
Table 4.
Estimation for the third dataset.
Table 5.
Estimation for the fourth dataset.
Table 6.
Estimation for the fifth dataset.
Figure 3.
Comparison of W-g distribution with the other distributions for the first dataset. (Left): cdf for each of the fitted distributions. (Right): observed and expected frequencies for each model.
Figure 4.
Comparison of W-g distribution with the other distributions for the second dataset. (Left): cdf for each of the fitted distributions. (Right): observed and expected frequencies for each model.
Figure 5.
Comparison of W-g distribution with the other distributions for the third dataset. (Left): cdf for each of the fitted distributions. (Right): observed and expected frequencies for each model.
Figure 6.
Comparison of W-g distribution with the other distributions for the fourth dataset. (Left): cdf for each of the fitted distributions. (Right): observed and expected frequencies for each model.
Figure 7.
Comparison of W-g distribution with the other distributions for the fifth dataset. (Left): cdf for each of the fitted distributions. (Right): observed and expected frequencies for each model.
8.2. Second Dataset
The second dataset consists of the lifetimes of components, given in [26] as: 0.1, 0.2, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 3.0, 6.0, 7.0, 11.0, 12.0, 18.0, 18.0, 18.0, 18.0, 18.0, 21.0, 32.0, 36.0, 40.0, 45.0, 46.0, 47.0, 50.0, 55.0, 60.0, 63.0, 63.0, 67.0, 67.0, 67.0, 67.0, 72.0, 75.0, 79.0, 82.0, 82.0, 83.0, 84.0, 84.0, 84.0, 85.0, 85.0, 85.0, 85.0, 85.0, 86.0, and 86.0.
8.3. Third Dataset
The third dataset also gives the failure and running times of a sample of devices, given in [25] as: 2, 10, 13, 23, 23, 28, 30, 65, 80, 88, 106, 143, 147, 173, 181, 212, 245, 247, 261, 266, 275, 293, 300, 300, 300, 300, 300, 300, 300, and 300.
8.4. Fourth Dataset
The dataset considered here, discussed by [27], presents the waiting times between 65 consecutive eruptions of a blowhole, called the Kiama Blowhole, as follows: 83, 51, 87, 60, 28, 95, 8, 27, 15, 10, 18, 16, 29, 54, 91, 8, 17, 55, 10, 35,47, 77, 36, 17, 21, 36, 18, 40 , 10, 7, 34, 27, 28, 56, 8, 25, 68, 146, 89, 18, 73, 69, 9, 37, 10, 82, 29, 8, 60, 61, 61, 18, 169, 25, 8, 26, 11, 83, 11, 42, 17, 14, 9, and 12.
8.5. Fifth Dataset
The fifth dataset is from [28], and is the monthly actual tax revenues in Egypt between January 2006 and November 2010. These actual taxes, in 1000 million Egyptian pounds, are as follows: 5.9, 20.4, 14.9, 16.2, 17.2, 7.8, 6.1, 9.2, 10.2, 9.6, 13.3, 8.5, 21.6, 18.5, 5.1, 6.7, 17, 8.6, 9.7, 39.2, 35.7, 15.7, 9.7, 10, 4.1, 36, 8.5, 8, 9.2, 26.2, 21.9, 16.7, 21.3, 35.4, 14.3, 8.5, 10.6, 19.1, 20.5, 7.1, 7.7, 18.1, 16.5, 11.9, 7, 8.6, 12.5, 10.3, 11.2, 6.1, 8.4, 11, 11.6, 11.9, 5.2, 6.8, 8.9, 7.1, and 10.8.
9. Conclusions
The W-g distribution, a member of the W-G family, is proposed and discussed. This distribution is introduced as a new four-parameter distribution which extends the classical gamma distribution. This generalization can provide more flexibility in analyzing real data. Some special cases of this distribution are presented. Furthermore, some characteristics of this new distribution are obtained. The maximum likelihood method is applied to estimate the model parameters. Different simulation studies are conducted, with different sample sizes, to verify the consistency of the estimates in terms of the bias and RMSE. The results indicate the good performance of the proposed estimators. The usefulness of the suggested distribution is illustrated by means of five real-life datasets. The proposed W-g distribution can consistently provide a better fit than some other common competitive models. Hence, the new W-g distribution can be applied as a competitive model to fit different real data.
Funding
This research received no external funding.
Conflicts of Interest
The author declares no conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| cdf | cumulative distribution function |
| probability density function | |
| RV | random variable |
| MLE | maximum likelihood estimator |
| W-G | Weibull-generated |
| W-g | Weibull-gamma |
| EG | exponentiated gamma |
| EE | exponentiated exponential |
| AIC | Akaike Information Criterion |
| RMSE | root mean squared error |
References
- Lee, C.; Famoye, F.; Alzaatreh, A.Y. Methods for generating families of univariate continuous distributions in the recent decades. Wiley Interdiscip. Rev. Comput. Stat. 2013, 5, 219–238. [Google Scholar] [CrossRef]
- Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef]
- Alzaatreh, A.; Famoye, F.; Lee, C. The gamma-normal distribution: Properties and applications. Comput. Stat. Data Anal. 2014, 69, 67–80. [Google Scholar] [CrossRef]
- Cordeiro, G.M.; Ortega, E.M.; Popović, B.V.; Pescim, R.R. The Lomax generator of distributions: Properties, minification process and regression model. Appl. Math. Comput. 2014, 247, 465–486. [Google Scholar] [CrossRef]
- Cakmakyapan, S.; Ozel, G. The Lindley family of distributions: Properties and applications. Hacet. J. Math. Stat. 2016, 46, 1–27. [Google Scholar] [CrossRef]
- Alizadeh, M.; Cordeiro, G.M.; Pinho, L.G.B.; Ghosh, I. The Gompertz-G family of distributions. J. Stat. Theory Pract. 2017, 11, 179–207. [Google Scholar] [CrossRef]
- Nasir, M.A.; Tahir, M.; Jamal, F.; Ozel, G. A new generalized Burr family of distributions for the lifetime data. J. Stat. Appl. Probab. 2017, 6, 401–417. [Google Scholar] [CrossRef]
- Hassan, A.S.; Nassr, S.G. Power Lindley-G Family of Distributions. In Annals of Data Science; Springer: Berlin/Heidelberg, Germany, 2018; pp. 1–22. [Google Scholar]
- Cordeiro, G.M.; Afify, A.Z.; Ortega, E.M.; Suzuki, A.K.; Mead, M.E. The odd Lomax generator of distributions: Properties, estimation and applications. J. Comput. Appl. Math. 2019, 347, 222–237. [Google Scholar] [CrossRef]
- Gupta, R.C.; Gupta, P.L.; Gupta, R.D. Modeling failure time data by Lehman alternatives. Commun. Stat. Theory Methods 1998, 27, 887–904. [Google Scholar] [CrossRef]
- Nadarajah, S.; Gupta, A.K. The exponentiated gamma distribution with application to drought data. Calcutta Stat. Assoc. Bull. 2007, 59, 29–54. [Google Scholar] [CrossRef]
- Alzaghal, A.; Famoye, F.; Lee, C. Exponentiated T-X family of distributions with some applications. Int. J. Stat. Probab. 2013, 2, 31. [Google Scholar] [CrossRef]
- Bourguignon, M.; Silva, R.B.; Cordeiro, G.M. The Weibull-G family of probability distributions. J. Data Sci. 2014, 12, 53–68. [Google Scholar]
- Nasiru, S.; Luguterah, A. The new weibull-pareto distribution. Pak. J. Stat. Oper. Res. 2015, 11, 103–114. [Google Scholar] [CrossRef]
- Tahir, M.; Zubair, M.; Mansoor, M.; Cordeiro, G.M.; Alizadeh, M.; Hamedani, G. A new Weibull-G family of distributions. Hacet. J. Math. Stat. 2016, 45, 629–647. [Google Scholar] [CrossRef]
- Alzaatreh, A.; Ghosh, I. On the Weibull-X family of distributions. J. Stat. Theory Appl. 2014, 14, 169–183. [Google Scholar]
- Cordeiro, G.M.; Ortega, E.M.; Ramires, T.G. A new generalized Weibull family of distributions: Mathematical properties and applications. J. Stat. Distrib. Appl. 2015, 2, 13. [Google Scholar] [CrossRef]
- Alzaatreh, A.; Famoye, F.; Lee, C. Weibull-Pareto distribution and its applications. Commun. Stat. Theory Methods 2013, 42, 1673–1691. [Google Scholar] [CrossRef]
- Ahmad, A.; Ahmad, S.; Ahmed, A. Characterization and Estimation of Weibull-Rayleigh Distribution with Applications to Life Time Data. Appl. Math. Inf. Sci. Lett. 2017, 5, 71–79. [Google Scholar] [CrossRef]
- Nadarajah, S.; Cordeiro, G.M.; Ortega, E.M. The Zografos–Balakrishnan-G family of distributions: Mathematical properties and applications. Commun. Stat. Theory Methods 2015, 44, 186–215. [Google Scholar] [CrossRef]
- Gradshteyn, I.S.; Ryzhik, I.M. Table of Integrals, Series, and Products; Academic Press: Cambridge, MA, USA, 2014. [Google Scholar]
- Gupta, R.D.; Kundu, D. Exponentiated exponential family: An alternative to gamma and Weibull distributions. Biom. J. J. Math. Methods Biosci. 2001, 43, 117–130. [Google Scholar] [CrossRef]
- Nadarajah, S.; Rocha, R. Newdistns: An R package for new families of distributions. J. Stat. Softw. 2016, 69, 1–32. [Google Scholar] [CrossRef]
- Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
- Meeker, W.Q.; Escobar, L.A. Statistical Methods for Reliability Data; John Wiley & Sons Inc.: New York, NY, USA, 1998. [Google Scholar]
- Aarset, M.V. How to identify a bathtub hazard rate. IEEE Trans. Reliab. 1987, 36, 106–108. [Google Scholar] [CrossRef]
- Pinho, L.G.B.; Cordeiro, G.M.; Nobre, J.S. The Harris extended exponential distribution. Commun. Stat. Theory Methods 2015, 44, 3486–3502. [Google Scholar] [CrossRef]
- Nassar, M.; Nada, N. The beta generalized Pareto distribution. J. Stat. Adv. Theory Appl. 2011, 6, 1–17. [Google Scholar]
© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).







