Abstract
The univariate Topp–Leone distribution introduced by with closed forms of the cumulative distribution function, i.e., [0, 1], was extended to an unbounded limit called the log-Topp–Leone distribution, where the shapes of the hazard function can increase, decrease or remain constant; therefore, this distribution can serve as an alternative distribution to the gamma, Weibull and exponential distributions. The bivariate form of this proposed distribution was introduced by joining the probability density function using three distinct copulas. The MLE, IFM and Bayesian estimation methods were employed to estimate the parameters. The Plackett copula was the best when using the MLE and IFM estimation methods, while the Clayton copula was the best when using the Bayesian method.
1. Introduction
The development and use of statistical distributions are not new matters in statistics. Generating statistical distributions began with the use of systems of differential equations, as in [1], the method of generating systems of frequency curves and the quantile method introduced by [2]. Since then, the trend has changed to the addition of parameters to an existing distribution, as in [3], or the combination of existing standard distributions, as in [4]. Other methods include the beta-generated method and transformed-transformer method, proposed by [5,6], respectively.
Many real-life phenomena, for example, engineering, science and economics are presented in bivariate datasets, in which one component may influence the lifetime of the other component, i.e., in science, one may study the age and resting heart rate of an individual. To model these datasets, several bivariate distributions have been introduced by [7,8,9,10,11] and many others.
2. Methods
This section provides the structural form of the proposed unbounded univariate and bivariate log-Topp–Leone distribution using three different copula functions.
The probability distribution and the density function of the Topp–Leone distribution introduced by [12] are, respectively, given by
where is the shape parameter.
2.1. Log-Topp–Leone Distribution
The newly proposed log-Topp–Leone distribution is introduced by transforming in Equation (1). When T serves as a random variable denoting the time to the occurrence of an event of interest:
the parameter will maintain its status as a shape parameter. The corresponding pdf is obtained by differentiating Equation (3) as
The survival and hazard functions of the log-Topp–Leone distribution are, respectively, given by
Figure 1 (1st and 2nd) depicts the pdf and hazard function of the proposed distribu-tion, respectively. The shapes of the hazard function can increase, decrease or remain constant; therefore, this distribution can serve as an alternative distribution to the gamma, Weibull and exponential distributions.
Figure 1.
Plots of the pdf (1st) and hazard function (2nd) of the log-Topp–Leone distribution for some parameter values.
2.2. Copula
Sklar [13] first introduced the copula function to connect the multivariate distribution function with individual marginals.
2.2.1. Model Based on Farlie–Gumbel–Morgenstern Copula
The joint survival function based on the FGM copula for and is given by
where .
2.2.2. Model Based on Clayton Copula
The joint survival function based on the Clayton copula for and is given by
where is the dependence parameter, taking values in the interval .
2.2.3. Model Based on Plackett Copula
The joint survival function based on the Plackett copula for and is given by
where
2.3. Inference Methods
This section provides the parameter estimates of the bivariate log-Topp–Leone distribution using the MLE, IFM and Bayesian estimation methods.
Bayesian Method of Estimation
This section considers cases where both and are censored and uncensored observations.
- (a)
- When both and are censored observations.
Assuming that when and are both censored observations, where and , the likelihood function is given by
where and are two indicator variables, and .
- (b)
- When both and are uncensored or complete observations.
When both and are uncensored or complete observations, i.e., , the likelihood function in Equation (8) will reduce to
2.4. Deviance Information Criterion
The deviance information criterion (DIC) proposed by [14] is defined as
where is the deviance, and is the posterior deviance.
3. Results and Discussion
This section provides the goodness-of-fit results for all the copulas.
In Table 1, all the results of the p-values for the copulas are statistically significant. This proves the suitability of all the copulas for the dataset. Goodness-of-fit measures, i.e., AIC and BIC, were employed to select the best model, where the model with the lowest AIC and BIC values was regarded as the best model. The results from the Plackett copula were the lowest for all criteria; therefore, the Plackett copula was better than the FGM copula. Regarding the estimation methods, the MLE estimates were better than the IFM estimates for the two models, based on the standard error values.
Table 1.
Standard error, p-value and goodness-of-fit measure results for the copulas.
As shown in Table 2 and Table 3, the joint posterior distribution was obtained by combining the likelihood function with the joint prior distribution to obtain some information of interest. This information was obtained by generating different Gibbs samples for each parameter. The different samples generated helped in observing the DIC values as the sample size increased, and it is clearly shown that this process requires a large sample size for small DIC values and a better model selection. Here, the Clayton copula with the lowest DIC value for different sample sizes was regarded as a better model than the FGM copula for the censored and uncensored cases.
Table 2.
Posterior summary statistics for censored dataset using FGM and Clayton copula functions.
Table 3.
Posterior summary statistics for uncensored dataset using FGM and Clayton copula functions.
4. Conclusions
The univariate Topp–Leone distribution introduced by [12] with closed forms of the cumulative distribution function, i.e., [0, 1], was extended to an unbounded limit called the log-Topp–Leone distribution, where the shapes of the hazard function can increase, decrease or remain constant; therefore, this distribution can serve as an alternative distribution to the gamma, Weibull and exponential distributions. The bivariate form of this proposed distribution was introduced by joining the probability density function using three distinct copulas. First, two models were studied based on the FGM and Plackett copulas, and the parameters were estimated using the MLE and IFM estimation methods. The Plackett copula with the lowest AIC and BIC values for both of the estimation methods fit the dataset very well compared to the FGM copula. Two copulas, namely the Clayton and FGM copulas, were implemented using the Bayesian estimation method. This method is based on the Markov chain Monte Carlo simulation technique, and the criterion used is the deviance information criterion (DIC). The Clayton copula with the lowest DIC value for different sample sizes was regarded as a better model than the FGM copula for the censored and uncensored cases.
Author Contributions
Conceptualization, A.U., A.I.I., H.D. and A.A.S.; Methodology, A.U., A.I.I., H.D. and A.A.S.; Software, A.U., H.D., Y.A., M.O. and A.I.I.; Validation, A.U., H.D., M.O. and Y.A.; Supervision, H.D. and M.O.; Formal analysis, A.U., H.D. and A.I.I.; Writing—original draft, A.U. and A.I.I.; Data curation, A.U. and A.I.I.; Writing— review and editing, M.O., A.A.S., H.D., A.U. and A.I.I.; Visualization, A.A.S., M.O., H.D. and Y.A. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the Yayasan Universiti Teknologi PETRONAS (YUTP) with cost center 015LC0-401 and INTI International University, Malaysia.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
The authors would like to thank the Universiti Teknologi PETRONAS for providing support to this project. Finally, the authors express their gratitude to the referees for their insightful comments.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Pearson, K. Contributions to the mathematical theory of evolution. II. Skew variation in homogeneous material. Philos. Trans. R. Soc. Lond. A 1895, 186, 343–414. [Google Scholar] [CrossRef]
- Hastings, J.C.; Mosteller, F.; Tukey, J.W.; Windsor, C. Low moments for small samples: A comparativestudy of order statistics. Ann. Stat. 1947, 18, 413–426. [Google Scholar] [CrossRef]
- Gupta, R.D.; Kundu, D. Generalized exponential distributions. Aust. N. Z. J. Stat. 1998, 41, 173–188. [Google Scholar] [CrossRef]
- Usman, A.; Doguwa, S.I.S.; Alhaji, B.B.; Imam, A.T. A New Generalized Weibull Odd Frechet Family of Distributions: Statistical Properties and Applications. Asian J. Probab. Stat. 2020, 9, 25–43. [Google Scholar] [CrossRef]
- Eugene, N.; Lee, C.; Famoye, F. The beta-normal distribution and its applications. Commun. Stat. Theory Methods 2002, 31, 497–512. [Google Scholar] [CrossRef]
- Alzaatreh, A.; Famoye, F.; Lee, C. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef]
- Morgenstern, D. Einfache beispiele zweidimensionaler verteilungen. Mitteilings Bl. Math. Stat. 1956, 8, 234–235. [Google Scholar]
- Farlie, D.J. The performance of some correlation coe_cients for a general bivariate distribution. Biometrika 1960, 47, 307–323. [Google Scholar] [CrossRef]
- Gumbel, E.J. Bivariate exponential distributions. J. Am. Stat. Assoc. 1960, 55, 698–707. [Google Scholar] [CrossRef]
- Plackett, R.L. A class of bivariate distributions. J. Am. Stat. Assoc. 1965, 60, 516–522. [Google Scholar] [CrossRef]
- Clayton, D.G. A model for association in bivariate life tables and its application in epidemiological studies of familial tendency in chronic disease incidence. Biometrika 1978, 65, 141–151. [Google Scholar] [CrossRef]
- Topp, C.W.; Leone, F.C. A Family of J-Shaped Frequency Functions. J. Am. Stat. Assoc. 1955, 50, 209–219. [Google Scholar] [CrossRef]
- Sklar, A. Fonctions de répartition à n dimensions et leurs marges. Publ. Inst. Statist. Univ. Paris 1959, 8, 229–231. [Google Scholar]
- Spiegelhalter, D.; Best, N.; Carlin, B.; van der Linde, A. Bayesian measures of model complexity and fit. J. R. Stat. Soc. Ser. B 2002, 64, 583–639. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).