Abstract
In this paper, we introduce a new continuous probability distribution with five parameters called the modified beta Gompertz distribution. It is derived from the modified beta generator proposed by Nadarajah, Teimouri and Shih (2014) and the Gompertz distribution. By investigating its mathematical and practical aspects, we prove that it is quite flexible and can be used effectively in modeling a wide variety of real phenomena. Among others, we provide useful expansions of crucial functions, quantile function, moments, incomplete moments, moment generating function, entropies and order statistics. We explore the estimation of the model parameters by the obtained maximum likelihood method. We also present a simulation study testing the validity of maximum likelihood estimators. Finally, we illustrate the flexibility of the distribution by the consideration of two real datasets.
MSC:
60E05; 62E15; 62F10
1. Introduction
The Gompertz distribution is a continuous probability distribution introduced by Gompertz [1]. The literature about the use of the Gompertz distribution in applied areas is enormous. A nice review can be found in [2], and the references therein. From a mathematical point of view, the cumulative probability density function (cdf) of the Gompertz distribution with parameters and is given by
The related probability density function (pdf) is given by
It can be viewed as a generalization of the exponential distribution (obtained with ) and thus an alternative to the gamma or Weibull distribution. A feature of the Gompertz distribution is that is unimodal and has positive skewness, whereas the related hazard rate function (hrf) given by is increasing. To increase the flexibility of the Gompertz distribution, further extensions have been proposed. A natural one is the generalized Gompertz distribution introduced by El-Gohary et al. [3]. By introducing an exponent parameter , the related cdf is given by
The related applications show that a plays an important role in term of model flexibility. This idea was then extended by Jafari et al. [4] who used the so-called beta generator introduced by Eugene et al. [5]. The related cdf is given by
where , and , . This distribution has been recently extended by Benkhelifa [6] with a five-parameter distribution. It is based on the beta generator and the generalized Gompertz distribution.
Motivated by the emergence of complex data from many applied areas, other extended Gompertz distributions have been proposed in the literature. See for instance, El-Damcese et al. [7] who considered the Odd Generalized Exponential generator introduced by Tahir et al. [8]; Roozegar et al. [9] who used the McDonald generator introduced by Alexander et al. [10]; Refs. [11,12] who applied the transmuted generator introduced by Shaw and Buckley [13]; Chukwu and Ogunde [14] and Lima et al. [15] who used the Kumaraswamy generator; and Benkhelifa [16] and Yaghoobzadeh [17] who considered the Marshall–Olkin generator introduced by Marshall and Olkin [18]; and Shadrokh and Yaghoobzadeh [19] who considered the Beta-G and Geometric generators.
In this paper, we present and study a distribution with five parameters extending the Gompertz distribution. It is based on the modified beta generator developed by Nadarajah et al. [20] (which can also be viewed as a modification of the beta Marshall–Olkin generator developed by Alizadeh et al. [21]). The advantage of this generator is to nicely combine the advantages of the beta generator of Eugene et al. [5] and the Marshall–Olkin generator of Marshall and Olkin [18]. To the best of our knowledge, its application to the Gompertz distribution has never been considered before. We provide a comprehensive description of its general mathematical properties (expansions of the cdf and pdf, quantile function, various kinds of moments, moment generating function, entropies and order statistics). The estimation of the model parameters by maximum likelihood is then discussed. Finally, we explore applications to real datasets that illustrate the usefulness of the proposed model.
The structure of the paper is as follows. Section 2 describes the considered modified beta Gompertz distribution. Some mathematical properties are investigated in Section 3. Section 4 provides the necessary to the estimation of the unknown parameters with the maximum likelihood method. A simulation study is presented, which tests validity of the obtained maximum likelihood estimators. Applications to two real datasets are also given.
2. The Modified Beta Gompertz Distribution
Let , be a cdf and be a related pdf. The modified beta generator introduced by Nadarajah et al. [20] is characterized by the cdf given by
By differentiation of , a pdf is given by
The hrf is given by
Let us now present our main distribution of interest. Using the cdf of the Gompertz distribution with parameters and as baseline, the cdf given by Equation (2) becomes
The related distribution is called the modified beta Gompertz distribution (MBGz distribution), also denoted by MBGz(). The related pdf in Equation (3) is given by
The hrf is given by
Figure 1 shows the plots for and for selected parameter values . We observe that these functions can take various curvature forms depending on the parameter values, showing the increasing of the flexibility of the former Gompertz distribution.
Figure 1.
Some plots of the pdf (a); and some plots for the hrf (b).
A strong point of the MBGz distribution is to contain different useful distributions in the literature. The most popular of them are listed below.
- When with ( is a proportion parameter), we obtain the beta Gompertz geometric distribution introduced by Shadrokh and Yaghoobzadeh [19], i.e., with cdfHowever, this distribution excludes the case , which is of importance since it contains well-known flexible distributions, as developed below. Moreover, the importance of small values for c can also be determinant in the applications (see Section 4).
- When , we get the beta Gompertz distribution with four parameters introduced by Jafari et al. [4], i.e., with cdf
- When , we get the generalized Gompertz distribution studied by El-Gohary et al. [3], i.e., with cdf
- When and with , we get the a particular case of the Marshall–Olkin extended generalized Gompertz distribution introduced by Benkhelifa [16], i.e., with cdf
- When , we get the Gompertz distribution introduced by Gompertz [1], i.e., with cdf
- When and , we get beta exponential distribution studied by Nadarajah and Kotz [22], i.e., with cdf
- When and , we get the generalized exponential distribution studied by Gupta and Kundu [23], i.e., with cdf
- When and we get the exponential distribution, i.e., with cdf
3. Some Mathematical Properties
3.1. On the Shapes of the pdf
The shapes of given by Equation (5) can be described analytically. As usual, the critical points of the pdf satisfies , with
The point is a local maximum if , a local minimum if and a point of inflection if .
Let us now study the asymptotic properties of . We have
Thus, for , we have ; for , we have ; and, for , we have . We have
Thus, in all cases. Figure 1a illustrates these points for selected parameters.
3.2. On the Shapes of the hrf
Similar to the pdf, the critical points of the hrf given by Equation (6) satisfy , with
Again, the point is a local maximum if , a local minimum if and a point of inflection if .
We also have
Thus, for , we have ; for , we have ; and, for , we have . We have
Thus, in all cases. Figure 1b illustrates these points for selected parameters.
3.3. Linear Representation
Let us determine useful linear representations for given by Equation (4) and given by Equation (5). First, let us suppose that . It follows from the generalized binomial formula, i.e., for and , with , that
On the other hand, using again the generalized binomial formula, we obtain
In a similar manner, we have
where is the cdf of a Gompertz distribution with parameters and . Combining these equalities, we obtain the following series expansion:
where
By derivation of , can be expressed as
where and is the pdf of a Gompertz distribution with parameters and .
For the case , we must do some transformation for Equation (7) to apply the generalized binomial formula. We can write
On the other hand, we have
3.4. Quantile Function
The quantile function of X is given by
where denotes the inverse of . It satisfies . Using [20], one can show that
From , we can simulate the MBGz distribution. Indeed, let U be a random variable following the uniform distribution over . Then, the random variable follows the MBGz distribution.
The median of X is given by . We can also use to define skewness measures. Let us just introduce the Bowley skewness based on quartiles and the Moors kurtosis respectively defined by
Contrary to and , these quantities have the advantage to be always defined. We refer to [24,25].
3.5. Moments
Let r be a positive integer. The rth ordinary moment of X is defined by . Using the linear representation given by Equation (9), we can express as
By doing the change of variables , we obtain
This integral has connections with the so-called generalized integro-exponential function. Further developments can be found in [26,27]. Therefore, we have
Obviously, the mean of X is given by and the variance of X is given by .
3.6. Skewness
The rth central moment of X is given by . It follows from the binomial formula that
On the other side, the rth cumulants of X can be obtained via the equation:
with . The skewness of X is given by and the kurtosis of X is given by . One can also introduce the MacGillivray skewness given by
It illustrates the effects of the parameters a, b, and on the skewness. Further details can be found in [28].
3.7. Moment Generating Function
The moment generating function of X is given by . Using Equation (9), we have
where , the moment generating function of . Doing successively the change of variables and the change of variable , we obtain
where denotes the complementary incomplete gamma function defined by . Therefore, we can write
Alternatively, using the moments of X, one can write
3.8. Incomplete Moments and Mean Deviations
Doing successively the change of variables , we obtain
The mean deviation of X about the mean is given by
where denote the first incomplete moment. The mean deviation of X about the median is given by
3.9. Entropies
Let us now investigate different kinds of entropies. For the sake of simplicity in exposition, we suppose that (the case can be considered in a similar way). The Rényi entropy of X is defined by
with and . It follows from (3) that
The generalized binomial formula implies that
Similarly, we have
Therefore,
By doing the change of variable and the change of variable , we get
By putting the above equalities together, we have
The Shannon entropy of X is defined by . It can be obtained by the formula .
The -entropy is defined by
Using the expansion above, we obtain
3.10. Order Statistics
Let be the random sample from X and be the ith order statistic. Then, the pdf of is given by
Using a result from [29], power series raised to a positive power as follows
where the coefficients are determined from the recurrence equation: and, for any , . Therefore, noticing that , we have
where and, for any , . By combining the equalities above, we obtain
Finally, one can observe that , where denotes the pdf of the Gompertz distribution with parameters and . Thus, the pdf of ith order statistic of the MBGz distribution can be expressed as a linear combination of Gompertz pdfs, i.e.,
Let r be a positive integer. Then, the rth ordinary moment of can be expressed as
4. Statistical Inference
4.1. Maximum Likelihood Estimation
We now investigate the estimation of the parameters of the MBGz distribution. Let be n observed values from the MBGz distribution and be the vector of unknown parameters. The log likelihood function is given by
The maximum likelihood estimators of the parameters are obtained by maximizing the log likelihood function. They can be determined by solving the non-linear equations: , , , , with
by setting and (one can remark that and , where denotes the so called digamma function),
and
We can solve the above non-linear equations simultaneously. A mathematical package can be used to get the maximum likelihood estimators of the unknown parameters. In addition, all the second-order derivatives exist. As usual, the asymptotic normality of the maximum likelihood estimators can be used to construct informative objects (approximate confidence intervals, confidence regions, and testing hypotheses of , etc.).
4.2. Simulation
From a theoretical point of view, the performances of the different estimates (MLEs) for the MBGz distribution are difficult to compare. We therefore propose a simulation study that uses their mean square errors (MSEs) for different sample sizes as benchmarks. The software package Mathematica (version 9) was used. Different sample sizes were considered through the experiments at size , 100 and 150. The experiment was repeated 3000 times. In each experiment, the estimates of the parameters were obtained by maximum likelihood methods of estimation. The means and MSEs for the different estimators can be found in Table 1. We observed that MSEs are decreasing with increasing n.
Table 1.
The MLEs and MSEs of MBGz distribution.
4.3. Applications
This section provides an application to show how the MBGz distribution can be applied in practice. We compared MBGz to Exponentaited Generalized Weibull–Gompertz distribution (EGWGz) by El-Bassiouny et al. [30] and other well known distributions in literature, Kumaraswamy–Gompertz (Kw-Gz), beta Gompertz (BGz) and Gompertz (Gz) models. The MLEs are computed using Quasi-Newton Code for Bound Constrained Optimization and the log-likelihood function evaluated. The goodness-of-fit measures, Anderson–Darling (A*), Cramer–von Mises (W*), Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and log-likelihood values are computed. As usual, the lower are the values of these criteria, the better is the fit. In addition, the value for the Kolmogorov–Smirnov (KS) statistic and its p-value are reported. The required computations are carried out in the R software (version 3).
4.3.1. Dataset 1
The first dataset was given in [31]. It represents the time to failure of turbocharger of a certain type of engine. The dataset is as follows: 0.0312, 0.314, 0.479, 0.552, 0.700, 0.803, 0.861, 0.865, 0.944, 0.958, 0.966, 0.977, 1.006, 1.021, 1.027, 1.055, 1.063, 1.098, 1.140, 1.179, 1.224, 1.240, 1.253, 1.270, 1.272, 1.274, 1.301, 1.301, 1.359, 1.382, 1.382, 1.426, 1.434, 1.435, 1.478, 1.490, 1.511, 1.514, 1.535, 1.554, 1.566, 1.570, 1.586, 1.629, 1.633, 1.642, 1.648, 1.684, 1.697, 1.726, 1.770, 1.773, 1.800, 1.809, 1.818, 1.821, 1.848, 1.880, 1.954, 2.012, 2.067, 2.084, 2.090, 2.096, 2.128, 2.233, 2.433, 2.585, 2.585.
4.3.2. Dataset 2
The second dataset was considered in [32]. It corresponds to a single fiber with 20 and 101 mm of gauge length, respectively. The dataset is as follows: 1.6, 2.0, 2.6, 3.0, 3.5, 3.9, 4.5, 4.6, 4.8, 5.0, 5.1, 5.3, 5.4, 5.6, 5.8, 6.0, 6.0, 6.1, 6.3, 6.5, 6.5, 6.7, 7.0, 7.1, 7.3, 7.3, 7.3, 7.7, 7.7, 7.8, 7.9, 8.0, 8.1, 8.3, 8.4, 8.4, 8.5, 8.7, 8.8, 9.0.
Table 2 and Table 3 present the maximum likelihood estimates, with the corresponding standard errors in parentheses, of the unknown parameters () of the MBGz distribution for Datasets 1 and 2, respectively. Table 4 and Table 5 show the statistics AIC, BIC, W*, A*, KS, and p-Value values for all the considered models. We then see that the proposed MBGz model fits the considered data better than the other models. Thus, the proposed MBGz model provides an interesting alternative to other existing models for modeling positive real data. To complete this fact, PP, QQ, epdf and ecdf plots of the MBGz distribution are given in Figure 2 and Figure 3 for Datasets 1 and 2, respectively.
Table 2.
MLEs (standard errors in parentheses) for Dataset 1.
Table 3.
MLEs (standard errors in parentheses) for Dataset 2.
Table 4.
The AIC, BIC, W*, A*, KS, and p-Value values for Dataset 1.
Table 5.
The AIC, BIC, W*, A*, KS, and p-Value values for Dataset 2.
Figure 2.
PP, QQ, epdf and ecdf plots of the MBGz distribution for Dataset 1.
Figure 3.
PP, QQ, epdf and ecdf plots of the MBGz distribution for Dataset 2.
Author Contributions
I.E., F.J., C.C., M.E. and S.A. have contributed equally to this work.
Funding
This research received no external funding.
Acknowledgments
The authors would like to thank the referees for their valuable comments which helped to improve the manuscript.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Gompertz, B. On the nature of the function expressive of the law of human mortality and on the new mode of determining the value of life contingencies. Philos. Trans. R. Soc. A 1825, 115, 513–580. [Google Scholar] [CrossRef]
- Tjørve, K.M.C.; Tjørve, E. The use of Gompertz models in growth analyses, and new Gompertz-model approach: An addition to the Unified-Richards family. PLoS ONE 2017, 12. [Google Scholar] [CrossRef]
- El-Gohary, A.; Alshamrani, A.; Al-Otaibi, A. The generalized Gompertz distribution. Appl. Math. Model. 2013, 37, 13–24. [Google Scholar] [CrossRef]
- Jafari, A.A.; Tahmasebi, S.; Alizadeh, M. The beta-Gompertz distribution. Revista Colombiana de Estadistica 2014, 37, 141–158. [Google Scholar] [CrossRef]
- Eugene, N.; Lee, C.; Famoye, F. Beta-normal distribution and its applications. Comm. Statist. Theory Methods 2002, 31, 497–512. [Google Scholar] [CrossRef]
- Benkhelifa, L. The beta generalized Gompertz distribution. Appl. Math. Model. 2017, 52, 341–357. [Google Scholar] [CrossRef]
- El-Damcese, M.A.; Mustafa, A.; El-Desouky, B.S.; Mustafa, M.E. Generalized Exponential Gompertz Distribution. Appl. Math. 2015, 6, 2340–2353. [Google Scholar] [CrossRef]
- Tahir, M.H.; Cordeiro, G.M.; Alizadeh, M.; Mansoor, M.; Zubair, M.; Hamedani, G.G. The Odd Generalized Exponential Family of Distributions with Applications. J. Stat. Distrib. Appl. 2015, 2, 1–28. [Google Scholar] [CrossRef]
- Roozegar, R.; Tahmasebi, S.; Jafari, A.A. The McDonald Gompertz distribution: Properties and applications. Commun. Stat. Simul. Comput. 2017, 46, 3341–3355. [Google Scholar] [CrossRef]
- Alexander, C.; Cordeiro, G.M.; Ortega, E.M.M.; Sarabia, J.M. Generalized beta-generated distributions. Comput. Stat. Data Anal. 2012, 56, 1880–1897. [Google Scholar] [CrossRef]
- Khan, M.S.; King, R.; Hudson, I.L. Transmuted Generalized Gompertz distribution with application. J. Stat. Theory Appl. 2017, 16, 65–80. [Google Scholar] [CrossRef]
- Moniem, A.I.B.; Seham, M. Transmuted Gompertz Distribution. Comput. Appl. Math. 2015, 1, 88–96. [Google Scholar]
- Shaw, W.; Buckley, I. The Alchemy of Probability Distributions: Beyond Gram-Charlier Expansions and a Skewkurtotic-Normal Distribution from a Rank Transmutation Map; Research Report; King’s College: London, UK, 2007. [Google Scholar]
- Chukwu, A.U.; Ogunde, A.A. On kumaraswamy gompertz makeham distribution. Am. J. Math. Stat. 2016, 6, 122–127. [Google Scholar]
- Lima, F.P.; Sanchez, J.D.; da Silva, R.C.; Cordeiro, G.M. The Kumaraswamy Gompertz distribution. J. Data Sci. 2015, 13, 241–260. [Google Scholar]
- Benkhelifa, L. Marshall-Olkin extended generalized Gompertz distribution. J. Data Sci. 2017, 15, 239–266. [Google Scholar]
- Yaghoobzadeh, S. A new generalization of the Marshall-Olkin Gompertz distribution. Int. J. Syst. Assur. Eng. Manag. 2017, 8, 1580–1587. [Google Scholar] [CrossRef]
- Marshall, A.W.; Olkin, I. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 1997, 84, 641–652. [Google Scholar] [CrossRef]
- Shadrokh, A.; Yaghoobzadeh, S.S. The Beta Gompertz Geometric Distribution: Mathematical Properties and Applications. Andishe_ye Amari 2018, 22, 81–91. [Google Scholar]
- Nadarajah, S.; Teimouri, M.; Shih, S.H. Modified Beta Distributions. Sankhya Ser. B 2014, 76, 19–48. [Google Scholar] [CrossRef]
- Alizadeh, M.; Cordeiro, G.M.; Brito, E. The beta Marshall-Olkin family of distributions. J. Stat. Distrib. Appl. 2015, 2, 4. [Google Scholar] [CrossRef]
- Nadarajah, S.; Kotz, S. The beta exponential distribution. Reliab. Eng. Syst. Saf. 2006, 91, 689–697. [Google Scholar] [CrossRef]
- Gupta, R.D.; Kundu, D. Generalized Exponential Distributions. Aust. N. Z. J. Stat. 1999, 41, 173–188. [Google Scholar] [CrossRef]
- Kenney, J.F.; Keeping, E.S. Mathematics of Statistics, 3rd ed.; Chapman and Hall Ltd.: Rahway, NJ, USA, 1962. [Google Scholar]
- Moors, J.J. A quantile alternative for kurtosis. J. R. Stat. Soc. Ser. D 1988, 37, 25–32. [Google Scholar] [CrossRef]
- Milgram, M. The generalized integro-exponential function. Math. Comput. 1985, 44, 443–458. [Google Scholar] [CrossRef]
- Lenart, A. The moments of the Gompertz distribution and maximum likelihood estimation of its parameters. Scand. Actuar. J. 2014, 3, 255–277. [Google Scholar] [CrossRef]
- MacGillivray, H.L. Skewness and Asymmetry: Measures and Orderings. Ann. Stat. 1986, 14, 994–1011. [Google Scholar] [CrossRef]
- Gradshteyn, I.S.; Ryzhik, I.M. Table of Integrals, Series and Products; Academic Press: New York, NY, USA, 2000. [Google Scholar]
- El-Bassiouny, H.; EL-Damcese, M.; Mustafa, A.; Eliwa, M.S. Exponentiated Generalized Weibull-Gompertz Distribution with Application in Survival Analysis. J. Stat. Appl. Probab. 2017, 6, 7–16. [Google Scholar] [CrossRef]
- Xu, K.; Xie, M.; Tang, L.C.; Ho, S.L. Application of Neural Networks in forecasting Engine Systems Reliability. Appl. Soft Comput. 2003, 2, 255–268. [Google Scholar] [CrossRef]
- Badar, M.G.; Priest, A.M. Statistical aspects of fiber and bundle strength in hybrid composites. In Progress in Science and Engineering Composites, Proceedings of the ICCM-IV, Tokyo, Japan, 25–28 October 1982; Hayashi, T., Kawata, K., Umekawa, S., Eds.; Japan Society for Composite Materials: Tokyo, Japan, 1982; pp. 1129–1136. [Google Scholar]
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).