Abstract
Statistical methods are essential for describing, predicting, and modeling natural phenomena in numerous application areas. These methods are helpful for modeling and predicting data in medicine, reliability engineering, actuarial science, and other fields. This paper presents a novel, simple, and fully flexible modified gamma model. The new model provides various forms of densities, including symmetric, asymmetric, unimodal, and reversed-J shapes, as well as a bathtub-shaped failure rate, which is suitable for modeling the lifespan of patients with an increased risk of death. Some basic and dynamic properties of the model are examined. Four methods for estimating its parameters are discussed, and a simulation study is used to examine the consistency and efficiency of these estimators. Finally, the usefulness of the proposed model is demonstrated in the analysis of some data sets.
Keywords:
lifetime models; asymmetrical; gamma distribution; stochastic orders; parameter estimation MSC:
60E05
1. Introduction
The gamma model has wide applications in probability theory, reliability theory, survival analysis, durability testing, physics, econometrics, hydrology, engineering, and many other scientific fields. It is used to model data from a variety of sources, such as the size of atmospheric particles (Petty and Huang [1]), the age distribution of cancers (Belikov [2]), the size of insurance claims (Boland [3]), the amount of precipitation accumulated in a reservoir (Aksoy [4]), or the interspike intervals in neuroscience (Wright et al. [5]). The Gamma model is also considered a well-known conjugate prior of Bayesian statistics.
Many authors have extended the Gamma family to develop more flexible models to describe natural phenomena. Williams [6] introduced a modified gamma model for analyzing particle size spectra in coagulating aerosols. Ong and Shan [7] applied a generalized power gamma model to analyze raindrop data from Singapore. Muralidharan and Kale [8] introduced a modified version of the gamma distribution with a singularity at zero. Gebrenegus [9] considered an extended gamma model and Nadarajah and Gupta [10] defined an exponentiated gamma extension and applied it to model data on hydrological processes. Shawky and Bakoban [11] discussed the problem of estimating exponentiated gamma extensions. Cordeiro et al. [12] proposed a generalization of gamma by applying the Weibull model. Feroze and Elbatal [13] introduced and studied a beta-exponentiated generalization of the gamma model. Barriga et al. [14] used the Marshal–Olkin [15] model to generalize the gamma distribution. Mead et al. [16] defined a generalization version of the gamma model. Altun et al. [17] used a mixture of the gamma and xgamma distributions in their study. Saboor et al. [18] defined a further extended model from the gamma baseline model and proved some interesting results. The gamma model and many of its extended or modified forms have eventually increased risk.
This paper aims to define a new modified gamma (MG) model with a risk intensity control parameter at the right tail. We propose a modified gamma (MG) model with various forms of densities, including symmetric, asymmetric, unimodal, and reversed-J forms, as well as a bathtub-shaped failure rate (FR). It is particularly suitable when increased risks are involved, such as in patients with severe diseases that progress over time. We add a coefficient that attenuates the distribution’s right tail to the ordinary gamma reliability function. From this point of view, the proposed MG is suitable for data that initially follow a gamma model but have a thinner right tail than the corresponding gamma model. The model is relatively simple but completely flexible.
The paper is organized as follows. Section 2 introduces the new model, and some essential properties are explored in addition to its dynamic features. Section 3 discusses four methods for estimating the model parameters. In Section 4, a simulation study is conducted to investigate the behavior of these estimators. An illustrative example in Section 5 shows that this model can describe the data better than alternatives. Finally, Section 6 concludes the paper with further extended explanations, descriptions, and recommendations for further research.
2. Model Formulation
The gamma model is known by the probability distribution function (PDF)
where and are shape and rate parameters, respectively.
The new modified gamma model can be characterized by the reliability function
where shows the upper incomplete gamma function. The PDF of this model is
This model reduces to the gamma model when . For . It is a special case of the modified Weibull model defined by Lai et al. [19]. As for the shape of the PDF, as shown in Figure 1a, it shows a decreasing and unimodal shape. The coefficient ensures that the reliability function falls below the reliability of the baseline gamma distribution. If an object’s lifetime or life regularly follows the gamma model, it is exposed to a risk that increases with time under certain conditions. Such conditions may be the survival times of patients with serious diseases in which the disease progresses over time.
Figure 1.
The PDF (a) and the FR function (b) of the MG model for some parameters.
The proposed MG model has a light right tail, and, in comparison with the gamma distribution, we have
where and are the reliability functions of the MG and baseline gamma distributions, respectively. Thus, the MG model could properly fit to data with light right tail.
The th moment of a random variable from is
where follows from . The function is defined by , , and is its inverse. The last inequality follows from the fact that . Thus, since the moments of the gamma distribution are finite, the moments of the MG distribution are also finite.
Figure 1 below shows various forms of densities at (a), including symmetric, asymmetric, unimodal, and reversed-J shapes. At (b), the FR function accommodates increasing or bathtub-shaped forms.
The th quantile function , which determines the point for which holds, can be computed numerically by solving the following equation as a function of :
Dynamic Measures
There are many characterizations of the various lifespan distributions in the literature. These characterizations are essential because they offer new insights into the meaning of reliability features. For a random event at time , the failure rate (FR) function at time indicates the instantaneous risk for the occurrence of the event and is mathematically defined by
which for simplifies to
The function of the mean residual life (MRL) of a random life at time represents the conditional remaining lifetime, given it has been survived until , i.e.,
It could not be simplified further and should be computed numerically.
Similarly, the p-quantile residual life (p-QRL) function of a random life at time gives the pth quantile of its conditional residual life if it has survived to t and is formally written as follows:
The special case is called the median residual life function and is a good alternative to the MRL function. Figure 2 shows the MRL and median residual life functions for some parameters and has decreasing and unimodal forms.
Figure 2.
The MRL (a) and the median residual life (b) functions of MG model for some parameters.
As a valuable tool to develop different properties of models, stochastic orders by quantifying the concept of one random variable being more significant than another are considered. These, however, are partial orders, thus one random variable X may be neither stochastically greater than, less than, nor equal to another random variable Y. Many different orders have various applications, as per Lai and Xie [20]. The following results show that the MG distribution is ordered under the right conditions concerning the usual stochastic and failure rate orders.
Proposition 1.
Let, , and. Then,in stochastic ordering.
Proof.
Suppose that and represent the reliability functions of and , respectively. Following the definition of stochastic order (see Lai and Xie [20]), we should show that for all which is clear. □
Specially, if and , then in stochastic ordering.
Proposition 2.
Let,, , and. Then, in FR ordering.
Proof.
Suppose that and are the reliability functions of and , respectively. Following the definition of FR order (see Lai and Xie [20]), we should show that the following expression is increasing for all :
To reduce the complexity of differentiation, take and . Thus
By differentiating with respect to , it can be verified that the sign of is the same as the sign of the following expression, where represents for simplicity.
since and
Thus, by , is increasing and, consequently, in FR ordering. □
As a direct consequence of Proposition 2, and with the assumptions of this proposition, we can write that in MRL and -QRL orderings (see Lai and Xie [20]).
3. Parameters Estimation
Parameters estimation is essential in machine learning, statistics, communication systems, radar, and many other fields. Suppose we have an ordered independent and identically distributed realization of . In this section, we discuss some methods for estimating the parameters.
3.1. Maximum Likelihood Method
Model parameters are estimated using the MG distribution’s maximum likelihood estimators (MLEs). The log–likelihood function corresponding to the MG model is
To estimate the parameters, we can maximize the log–likelihood function with respect to . In another approach, the answer to the following log–likelihood equations can be considered as maximum likelihood (ML) estimation.
and
To approximate the asymptotic variance of the maximum likelihood estimator (MLE), we can calculate the observed Fisher information matrix to be
Then, converges weakly to the multivariate normal Therefore, approximate confidence intervals for the MLE can be calculated using the inverse of the observed information matrix and the standard normal quantiles.
3.2. Least Squared Error Method
In the least squared error (LSE) approach to estimate the parameters, we should minimize the sum of squared distances between the estimated distribution function and the empirical distribution function, i.e., the following expression should be minimized.
which can be simplified to
Thus, the LSE estimates are computed by
3.3. Anderson-Darling Method
In the Anderson–Darling (AD) method, a weighted sum of the squares of the distances between the estimated distribution function and the empirical distribution function is minimized. The proper weights are . Thus, the estimate could be obtained as in the following.
3.4. Quantile Based Method
The quantile-based (QB) method considers the distances between the quantiles of the estimated model and the empirical quantiles of the data. Thus, the QB estimate of the parameters is
where the quantile function is defined by (5).
4. Investigation of the Estimator’s Behavior
The efficiency and consistency of the discussed estimators of the parameters are investigated by simulations. To generate an instance of , we solve the following equation:
where is a random instance of the standard uniform model.
In each run, replicates of samples of size or from MG are simulated with selected parameters. Then, the parameters for each sample are estimated using ML, LSE, AD, or QB methods. The optimizations are performed using the built-in function “optim” or the programming language R for all methods. The initial values needed for the optimization are chosen randomly from a uniform distribution, for example, for α on the interval and similarly for and . Table 1 shows the bias (B) and mean square error (MSE) of the ML and QB estimators for all parameters. Similarly, Table 2 shows the results for the LSE and AD estimators. From these tables, the following observations can be made.
Table 1.
Simulation results for methods ML and QB. The first, second, and third lines of each cell refer to , , and , respectively.
Table 2.
Simulation results for the methods LSE and AD. The first, second, and third lines of each cell refer to , , and , respectively.
- All estimators are consistent and efficient for estimating the model parameters.
- The AD estimator, a weighted form of the LSE method, outperforms the LSE estimator.
- The QB estimator has a very small MSE but does not improve significantly with sample size.
5. Applications
In this section, we present two applications with real data to demonstrate the MG model’s flexibility empirically.
5.1. Application 1 Survival Times of AG Positive Patients Data
Table 3 shows 17 survival times in weeks for a group of patients who died from acute myelogenous leukemia. These patients are designated as AG positive, meaning that they are identified by the presence of Auer rods and significant granulation of leukemic cells in the bone marrow at diagnosis. For more details on this dataset and patient specifications, see Feigl and Zelen [21].
Table 3.
Survival times (weeks) of a group of AG positive patients.
In a comparative analysis, the alternative distributions gamma, Lehmann gamma (LG), exponentiated gamma (EG), Marshal–Olkin gamma (MOG), and gamma-exponential-competitive-risk (GEC) models, respectively, with the following reliability functions are considered.
and
where . Then, the parameters of all models are estimated using the ML method. The results of the analysis are summarized in Table 4. Indicated are the Akaike information criterion (AIC), Bayesian information criterion (BIC), the Kolmogorov–Smirnov (KS) statistic, the Cramer-von Mises (CVM) statistic, and the Anderson–Darling (AD) statistic, as well as the corresponding p-values. Based on these benchmarks, MG performs significantly better than the other candidates. Figure 3 shows the Total Time on Test (TTT) plot indicating a bathtub-shaped FR function for this dataset. This plot shows the empirical and the fitted CDF for the selected models, which graphically confirms that the MG model provides a better fit than the other models considered. Figure 4 shows the estimated PDF and FR functions for the MG model. It provides a bathtub-shaped FR function and a decreasing shape for PDF.
Table 4.
Results of fitting the MG distribution and some alternatives to survival times data.
Figure 3.
(a): The TTT representation of the survival times data. (b): The empirical and adjusted CDF to MG and alternative models for this data set.
Figure 4.
The PDF (a) and the FR function (b) of the fitted MG model for survival times data.
5.2. Application 2 The Gauge Lengths Data
Table 5 reports 74 observations of the gauge lengths of 20 mm. The data set was previously studied by Alfaer et al. [22] and Ahmed et al. [23].
Table 5.
The gauge lengths of 20 mm.
The parameters of the MG model and all alternatives considered in the previous example are estimated by the ML method. The results of the analysis are included in Table 6.
Table 6.
Results of fitting the MG distribution and some alternatives to gauge lengths data.
The AIC, BIC, KS, CVM, AD, as well as the corresponding p-values, are reported. Based on the results, the MG proves to be better than others. Figure 5a shows the TTT plot, which shows an increasing failure rate function. Figure 5b draws the empirical and fitted CDFs for the selected models and shows a very close competition for describing the data between selected models. Additionally, Figure 6 provides the estimated PDF and FR functions for the MG model.

Figure 5.
(a): The TTT representation of the gauge lengths data. (b): The empirical and adjusted CDF to MG and alternative models for this data set.

Figure 6.
The PDF (a) and the FR function (b) of the fitted MG model for gauge lengths data.
6. Conclusions
Statistical models play an essential role in survival analysis and the medical field. These models provide satisfactory results in modeling various types of data sets. In this paper, we have implemented a novel model to update the flexibility level of the existing models. A modified gamma model is introduced, and some basic properties are investigated. The FR function can be increasing or bathtub shaped. In addition, the MRL and p-QRL functions are studied. Four methods, ML, LSE, AD, and QB, to estimate the proposed model parameters are discussed. The simulation results show that these estimators are consistent and efficient. The results of the comparative analysis show that the proposed model is useful for modeling survival data. In general, the new model could be presented as a flexible and simple model for the analysis of survival data when the upper tail seems to be lighter than that of an ordinary gamma model (due to the increased risk in the occurrence of the event), and the data have a bathtub-shaped FR function. The results offer new concepts and applications in survival analysis, medical statistics, and risk theory. The new model will be beneficial to researchers in the future and will be considered a better choice over the baseline model. Further properties and applications of the new model may be considered in the future of this research. In particular, the following topics are exciting and remain open problems:
- Bayesian and E-Bayesian estimation based on complete and different censoring schemes;
- Proposing a bivariate family of this model to extend the univariate case.
Author Contributions
Conceptualization, M.A.A. and M.K.; methodology, M.K.; software, M.A.A.; validation, M.K. and M.A.A.; formal analysis, M.K.; investigation, M.A.A.; resources, M.K.; data curation, M.A.A.; writing—original draft preparation, M.A.A.; writing—review and editing, M.K.; visualization, M.A.A.; supervision, M.K.; project administration, M.K.; funding acquisition, M.K. All authors have read and agreed to the published version of the manuscript.
Funding
Researchers Supporting Project number RSP2023R392, King Saud University, Riyadh, Saudi Arabia.
Data Availability Statement
The data used to support the findings of this study are included in the article.
Acknowledgments
The authors would like to thank the anonymous reviewers for their suggestions and very constructive comments, which improved the presentation and readability of the article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Petty, G.W.; Huang, W. The modified gamma size distribution applied to inhomogeneous and nonspherical particles: Key relationships and conversions. J. Atmos. Sci. 2011, 68, 1460–1473. [Google Scholar] [CrossRef]
- Belikov, A.V. The number of key carcinogenic events can be predicted from cancer incidence. Sci. Rep. 2017, 7, 12170. [Google Scholar] [CrossRef] [PubMed]
- Boland, P.J. Statistical and Probabilistic Methods in Actuarial Science; Chapman and Hall (CRC): Boca Raton, FL, USA, 2007. [Google Scholar]
- Aksoy, H. Use of gamma distribution in hydrological analysis. Turk. J. Eng. Environ. Sci. 2000, 24, 419–428. [Google Scholar]
- Wright, M.C.M.; Winter, I.M.; Forster, J.J.; Bleeck, S. Response to best-frequency tone bursts in the ventral cochlear nucleus is governed by ordered inter-spike interval statistics. Hear. Res. 2014, 317, 23–32. [Google Scholar] [CrossRef] [PubMed]
- Williams, M.M.R. On the modified gamma distribution for representing the size spectra of coagulating aerosol particles. J. Colloid Interface Sci. 1985, 103, 516–527. [Google Scholar] [CrossRef]
- Ong, J.T.; Shan, Y.Y. Modified gamma model for Singapore rain drop size distribution. IGARSS’97. In Proceedings of the 1997 IEEE International Geoscience and Remote Sensing Symposium Proceedings. Remote Sensing—A Scientific Vision for Sustainable Development, Singapore, 3–8 August 1997; Volume 4, pp. 1757–1759. [Google Scholar]
- Muralidharan, K.; Kale, B.K. Modified gamma distribution with singularity at zero. Commun. Stat.-Simul. Comput. 2002, 31, 143–158. [Google Scholar] [CrossRef]
- Gebrenegus, G. The extended generalized gamma model and its special cases: Applications to modeling marriage durations. Qual. Quant. 2005, 39, 71–85. [Google Scholar]
- Nadarajah, S.; Gupta, A.K. The exponentiated gamma distribution with application to drought data. Calcutta Stat. Assoc. Bull. 2007, 59, 29–54. [Google Scholar] [CrossRef]
- Shawky, A.I.; Bakoban, R.A. Exponentiated gamma distribution: Different methods of estimations. J. Appl. Math. 2012, 2012, 284296. [Google Scholar] [CrossRef]
- Cordeiro, G.M.; Lima, M.S.; Gomes, A.E. The gamma extended Weibull distribution. J. Stat. Distrib. Appl. 2016, 5, 1–19. [Google Scholar] [CrossRef]
- Feroze, N.; Elbatal, I. Beta exponentiated gamma distribution: Some properties and estimation. Pak. J. Stat. Oper. Res. 2016, 12, 141–154. [Google Scholar] [CrossRef][Green Version]
- Barriga, G.D.; Cordeiro, G.M.; Dey, D.K.; Cancho, V.G.; Louzada, F.; Suzuki, A.K. The Marshall-Olkin generalized gamma distribution. Commun. Stat. Appl. Methods 2018, 25, 245–261. [Google Scholar] [CrossRef]
- Marshall, A.W.; Olkin, I. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 1997, 84, 641–652. [Google Scholar] [CrossRef]
- Mead, M.; Nassar, M.M.; Dey, S. A generalization of generalized gamma distributions. Pak. J. Stat. Oper. Res. 2018, 14, 121–138. [Google Scholar] [CrossRef]
- Altun, E.; Korkmaz, M.C.; El-Morshedy, M.; Eliwa, M.S. The extended gamma distribution with regression model and applications. AIMS Math. 2021, 6, 2418–2439. [Google Scholar] [CrossRef]
- Saboor, A.; Rahman, G.; Ali, H.; Nisar, K.S.; Abdeljawad, T. Properties and Applications of a New Extended Gamma Function Involving Confluent Hypergeometric Function. J. Math. 2021, 2021, 2491248. [Google Scholar] [CrossRef]
- Lai, C.D.; Xie, M.; Murthy, D.N.P. A modified Weibull distribution. IEEE Trans. Reliab. 2003, 52, 33–37. [Google Scholar] [CrossRef]
- Lai, C.D.; Xie, M. Stochastic Ageing and Dependence for Reliability; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
- Feigl, P.; Zelen, M. Estimation of exponential survival probabilities with concomitant information. Biometrics 1965, 21, 826–838. [Google Scholar] [CrossRef] [PubMed]
- Alfaer, N.M.; Gemeay, A.M.; Aljohani, H.M.; Afify, A.Z. The extended log-logistic distribution: Inference and actuarial applications. Mathematics 2021, 9, 1386. [Google Scholar] [CrossRef]
- Afify, A.Z.; Cordeiro, G.M.; Shafique Butt, N.; Ortega, E.M.; Suzuki, A.K. A new lifetime model with variable shapes for the hazard rate. Braz. J. Probab. Stat. 2017, 31, 516–541. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).