Abstract
The inverse Lomax distribution has been widely used in many applied fields such as reliability, geophysics, economics and engineering sciences. In this paper, an unexplored practical problem involving the inverse Lomax distribution is investigated: the estimation of its entropy when multiple censored data are observed. To reach this goal, the entropy is defined through the Rényi and q-entropies, and we estimate them by combining the maximum likelihood and plugin methods. Then, numerical results are provided to show the behavior of the estimates at various sample sizes, with the determination of the mean squared errors, two-sided approximate confidence intervals and the corresponding average lengths. Our numerical investigations show that, when the sample size increases, the values of the mean squared errors and average lengths decrease. Also, when the censoring level decreases, the considered of Rényi and q-entropies estimates approach the true value. The obtained results validate the usefulness and efficiency of the method. An application to two real life data sets is given.
MSC:
60E05; 62E15; 62F10
1. Introduction
Entropy is one of the most popular measure of uncertainty. As former mathematical work, Reference [1] proposed a theory on the concept of entropy, with numerical indicators as well. This theory was enhanced by numerous other entropy-like measures, arising from various applied fields. In this regard, a complete survey can be found in Reference [2]. Here, we focus our attention on two of the most famous entropy measures—the Rényi entopy by Reference [3] and the q-entropy by Reference [4] (also called Tsallis entropy). The Rényi entropy finds its source in the information theory and the q-entropy comes from the statistical physics, with a plethora of applications in their respective fields. For a random variable X having the probability density function (pdf) , where represents the corresponding parameters, these two entropy measures are, respectively, defined by
where and , and
where and . In particular, the Rényi entropy contains several well-known entropy measures, such as the Hartley entropy given by , the Shannon entropy obtained as and the collision entropy given as .
In most of the observed phenomena, at least is unknown, and the entropy as well. For this reason, the theoretical or practical statistical treatment of the entropy have been the object of all the attentions, in various settings. Among the notable studies in this regard, we may refer the reader to Reference [5] discussing the entropy of ordered sequences and order statistic, Reference [6] focusing on the entropy of upper record values, Reference [7] proposing the entropy of hybrid censoring schemes, Reference [8] discussing the entropy of progressively censored samples, Reference [9] investigating the estimation of the entropy for the Weibull distribution under the progressive censoring scheme, Reference [10] using the maximum likelihood and Bayes estimators via doubly-generalized Type II hybrid censored samples to estimate the entropy for the Rayleigh distribution, Reference [11] studying the Bayes estimation of the entropy for the Weibull distribution under the generalized progressive hybrid censoring scheme, Reference [12] providing the maximum likelihood and Bayes estimators of the entropy for the Weibull distribution under a generalized progressive hybrid censoring scheme, Reference [13] studying the estimation of the entropy for the generalized exponential distribution based on record values, Reference [14] discussing the Shannon entropy for the Lomax distribution based on the generalized progressively hybrid censoring scheme and Reference [15] investigating point and interval estimation of the Shannon entropy for the inverse Weibull distribution under multiple censored data.
This paper provides a contribution to the estimation of the Rényi and q-entropies for the inverse Lomax distribution when multiple censored data are observed. Let us now motivate the consideration of the inverse Lomax distribution in this setting, as well as the multiple censored data. First and foremost, the inverse Lomax (IL) distribution is a lifetime distribution, defined as the distribution of the reciprocal of a random variable following the famous Lomax (L) distribution. Mathematically, the L distribution is defined by the cumulative distribution function (cdf) and pdf given as
respectively. Here, is a shape parameter and is a scale parameter. The essential of the L distribution can be found in References [16,17,18]. Thus, for a random variable X having the L distribution, the random variable follows the IL distribution with cdf and pdf given by
respectively. Among the advantages of the IL distribution, the corresponding probability functions are tractable, it is parsimonious in parameters and possesses a non-monotonic hazard rate function; it possesses decreasing and upside-down bathtub shapes. The practical usefulness of the related model is illustrated in Reference [19] for its application to the analyses of geophysical data and in Reference [20] for its application in economics and actuarial sciences. Also, we refer to Reference [21] for the estimation of the reliability parameter via Type II censoring samples, in Reference [22] for the estimation of the parameters based on hybrid censored samples, and in Reference [23] for the Bayesian estimation of the two-component mixture of the IL distribution under the Type I censoring scheme. Also, recent studies have proposed extensions of the IL distributions for further purposes. In this regard, let us cite Reference [24] for the inverse power Lomax (or power IL) distribution, Reference [25] for the Weibull IL distribution, Reference [26] for the alpha power transformed IL distribution, Reference [27] for the Marshall-Olkin IL distribution and Reference [28] for the odd generalized exponentiated IL distribution.
However, as far as we know, despite its interest, the estimation of entropy measures for the IL distribution, such as the Rényi and q-entropies, remains an unexplored aspect. This study fills this gap by considering this problem under the realistic scenario of multiple censored data. This scenario commonly occurs where several censoring levels logically exist, which is often the case for many applications in life testing and survival analysis. We refer the reader to References [29,30,31], as well as the recent estimation studies of Reference [32] and Reference [15]. In our statistical framework, after investigating the maximum likelihood estimates of and , estimates for the Rényi and q-entropies are derived. Then, two-sided approximate confidence intervals of the Rényi and q-entropies are discussed. A complete numerical study is performed, showing the favorable behavior of the obtained estimates at various sample sizes. In particular, the mean squared errors, approximate confidence intervals along with the corresponding average lengths are used as benchmarks. Our numerical investigations show that, when the sample size increases, the values of the mean squared errors and average lengths decrease. Also, when the censoring level decreases, the considered of Rényi and q-entropies estimates approach the true value. Two real life data sets, one physiological data set and one economic data set, are used to illustrate the findings.
The rest of the article is arranged as follows. The Rényi and q-entropies for the IL distribution are expressed in Section 2. Section 3 studies their estimation under multiple censored data. Simulation and numerical results are given in Section 4. An application to real data sets is presented in Section 5. The article ends with some concluding remarks in Section 6.
2. Expressions of the Rényi and q-Entropies
Let X be a random variable following the IL distribution with parameters and . Then, by (1) and (3) with , the Rényi entropy of X is given by
where and . It follows from the standard equivalence results and criterion for Riemann integrability that exists if and only if and . So the final conditions are , and . Note that the last condition is alway satisfied if , which will be considered in the coming simulation study.
Under these conditions, by applying the change of variables , we can rewrite the integral term as
where refers to the beta function defined by , or, alternatively, .
Therefore, after some algebraic manipulations, we get
with , and .
Similarly, based on (2) with and (4) applied with , the q-entropy of X is given by
with , and . For practical purposes, (5) and (6) are the required expressions of Rényi and q-entropies of X, written as simple functions of the parameters and .
3. Estimation of Rényi and q-Entropies
3.1. Mathematical Basics on Multiple Censored Data Setting
For our estimation study, we consider the situation of multiple censored data (including the type I and type II censoring), following the setting of Reference [30] (Section 1.3.2). We may also refer to Reference [25] in the context of the inverse Weibull distribution. The general framework can be summarized as follows. Let X be a random variable having the cdf and pdf given by and , respectively. Based on n units under a certain test, we get n values of which
- values are (independent) observations of X for failed units,
- values are (independent) observations of X for censored (nonfailed) units,
with . Then, the likelihood function for can be expressed as
where if the ith unit failed, and 0 otherwise (so ), if the ith unit censored, and 0 otherwise (so ), and K denotes a secondary constant (independent of ). Then, the maximum likelihood estimates (MLEs) of are obtained by maximizing with respect to . All the details in this regard can be found in Reference [30] (Section 1.3.2).
3.2. Considered Estimates
Hence, if a random variable X follows the IW distribution with parameters and , based on (3), the likelihood function for can be expressed as
The MLEs are thus obtained by maximizing with respect to and . In this regard, the log-likelihood function is useful, and can be expressed as
Then, the first partial derivatives of with respect to and , are obtained as follows
and
Hence, the MLEs of and are determined by solving the following equations: and , simultaneously. These equations cannot be solved analytically, so numerical iterative techniques must be applied in this regard. For the purpose of this study, the MLEs of and are denoted by and , respectively. Hence, based on (5) and (6), owing to the plugging approach, natural estimates for the entropies and , are, respectively, given by
with , and , and
with , and .
3.3. Confidence Intervals
Owing to the invariance property, and are also the MLEs of and , respectively. Therefore, the well-known theory of the maximum likelihood method can be applied to and . In particular, invoking the so-called Delta theorem, under some technical regularity conditions, the subjacent asymptotic distribution of can be approximated by the normal distribution with mean and variance , where denotes the inverse of the observed information matrix and , both can be determined from (5). Therefore, the two-sided approximate confidence interval for the Rényi entropy at the confidence level with is given by (so ), where
and is the standard normal percentile. A similar result holds for . We also refer to References [15,31] for more detail.
4. Simulation Study
Here, a simulation study is assessed to investigate the performance of the Rényi and q-entropies estimates given by (7) and (8), respectively. In this regard, we use the mean squared errors (MSEs), two-sided approximate confidence intervals along with their corresponding average lengths (ALs) (i.e., defined AL is the average value of U-L, where L and U denotes the lower and upper bounds of the corresponding interval, respectively) based on multiple censored data (or samples). We adopt the methodology of Reference [15]. Thus, the following procedure is conducted:
- 3000 random samples of sizes , 100, 150, 200 and 300 are generated from the IL distribution based on multiple censored sample.
- The values of parameters are selected asFor the failures at censoring level (CL), we arbitrary chose CL and (for instance, CL means that the observations are based on failed units and censored units).
- The true values for and given by (5) and (6), and the average estimates and given by (7) and (8) are calculated, respectively. Different values for and q are considered.
- Finally, the average of the obtained estimates, MSEs and ALs with level (so ) are computed.
All the numerical results are presented in Table 1, Table 2, Table 3 and Table 4 for the Rényi entropy, and Table 5, Table 6, Table 7 and Table 8 for the q-entropy. All is calculated by the use of the mathematical software Mathcad.
Table 1.
Rényi entropy estimates at Set1 and censoring level (CL) .
Table 2.
Rényi entropy estimates at Set1 and CL .
Table 3.
Rényi entropy estimates at Set2 and CL .
Table 4.
Rényi entropy estimates at Set2 and CL .
Table 5.
q-entropy estimates at Set1 and CL .
Table 6.
q-entropy estimates at Set1 and CL .
Table 7.
q-entropy estimates at Set2 and CL .
Table 8.
q-entropy estimates at Set2 and CL .
Figure 1, Figure 2, Figure 3 and Figure 4 provide a graphical approach of the MSEs and ALs of the Rényi entropy estimates, whereas Figure 5, Figure 6, Figure 7 and Figure 8 provide a graphical approach of the MSEs and ALs of the q-entropy estimates.
Figure 1.
(a) Mean squared errors (MSEs) and (b) average lengths (ALs) of Rényi entropy estimates for different sample sizes at Set1 and CL .
Figure 2.
(a) MSEs and (b) ALs of Rényi entropy estimates for different sample sizes at Set1 and CL .
Figure 3.
(a) MSEs and (b) ALs of Rényi entropy estimates for different sample sizes at Set2 and CL .
Figure 4.
(a) MSEs and (b) ALs of Rényi entropy estimates for different sample sizes at Set2 and CL .
Figure 5.
(a) MSEs and (b) ALs of q-entropy estimates for different sample sizes at Set1 and CL .
Figure 6.
(a) MSEs and (b) ALs of q-entropy estimates for different sample sizes at Set1 and CL .
Figure 7.
(a) MSEs and (b) ALs of q-entropy estimates for different sample sizes at Set2 and CL .
Figure 8.
(a) MSEs and (b) ALs of q-entropy estimates for different sample sizes at Set2 and CL .
Here, some remarks can be formulated about the behavior of the Rényi and q-entropies estimates according to Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8, and Figure 1, Figure 2, Figure 3, Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8:
- The MSEs of decrease as the sample size increases.
- The ALs of decrease as the sample size increases.
- The MSEs of increase when the value of increases.
- The ALs of increase when the value of increases.
- The MSEs of decrease as the sample size increases.
- The ALs of decrease as the sample size increases.
- The MSEs of decrease when the value of q increases.
- The ALs of decrease when the value of q increases.
- In almost situations the MSE of at CL is less than the MSE of at CL .
- In almost situations the MSE of at CL is less than the MSE of at CL .
These facts prove the good accuracy of our entropy estimates, which are logically recommended for further practical purposes.
5. Application
In this Section, two real life data sets are used to illustrate the finding, both described below.
The first data set is a physiological data set extracted from Reference [33]. It concerns twenty Duchenne patients (6–18 years age) with classical type of the muscular dystrophy. The Electrocardiography of these 20 patients based on the heart rate is given below in Table 9.
Table 9.
First data set: Heart rate data for twenty Duchenne patients.
The second data set is an economic data set extracted from the following electronic address: https://tradingeconomics.com/pakistan/consumer-price-index-cpi. It refers to Pakistan Consumer Price Index (CPI) in Pakistan from May 2019 to April 2020. The data are collected in Table 10.
Table 10.
Second data set: Consumer Price Index (CPI) in Pakistan from May 2019 to April 2020.
Then, based on the data, adopting the multiple censored data scheme, we apply and to estimate and where X denotes the considered random variables of interest, assuming to follow the IL distribution. Different values for CL, and q are considered. The obtained numerical results are displayed in Table 11 and Table 12 for the first and second data sets, respectively.
Table 11.
Estimated of Rényi and q-entropies at CL and CL for the first data set.
Table 12.
Estimated of Rényi and q-entropies at CL = and CL = for the second data set.
Thus, Table 11 and Table 12 show some numerical values of estimated entropies in a concrete scenario, following the multiple censored data scheme. We see that the results depends on the entropy parameters ( or q), and also, the value for CL, beyond the standard complete standard (which can be obtained by taking CL ).
6. Concluding Remarks
This article studies the estimation of the Rényi and q-entropies for inverse Lomax distribution under multiple censored data. We propose an efficient estimation strategy by using the maximum likelihood and plugging methods. The behavior of the Rényi and q-entropies estimates are calculated in terms of their mean squared errors and average lengths (depending on two-sided approximate confidence intervals). Numerical results are provided, showing that, as the sample size increases, the mean squared errors of our estimates decrease. Also, it can be observed that, as the sample size increases, the average lengths of our estimates decreases. Thus, the proposed estimates reveal to be efficient, providing new useful tools with potential applicability in many applied situations dealing with entropy of the inverse Lomax distribution. The article ends by presenting an application to two real life data sets.
Author Contributions
R.A.R.B., M.E., C.C., and F.J. contributed equally to this work. All authors have read and agreed to the published version of the manuscript.
Funding
This work was funded by the Deanship of Scientific Research (DSR), King AbdulAziz University, Jeddah, under grant No. (RG-2-150-37).
Acknowledgments
The authors are very grateful to the two reviewers which have provided very constructive comments, improving significantly some parts of the paper. This project was funded by the Deanship of Scientific Research (DSR), at King Abdulaziz University, Jeddah, under grant No. (RG-2-150-37). The authors, therefore, acknowledge with thanks to DSR technical and financial support.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
- Amigo, J.M.; Balogh, S.G.; Hernandez, S. A brief review of generalized entropies. Entropy 2018, 20, 813. [Google Scholar] [CrossRef]
- Rényi, A. On measures of entropy and information. In Proceedings of the 4th Fourth Berkeley Symposium on Mathematical Statistics and Probability; University of California Press: Berkeley, CA, USA, 1961; pp. 547–561. [Google Scholar]
- Tsallis, C. Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 1988, 52, 479–487. [Google Scholar] [CrossRef]
- Wong, K.M.; Chan, S. The entropy of ordered sequences and order statistics. IEEE Trans. Inf. Theory 1990, 6, 276–284. [Google Scholar] [CrossRef]
- Baratpour, S.; Ahmadi, J.; Arghami, N.R. Entropy properties of record statistics. Stat. Pap. 2007, 48, 197–213. [Google Scholar] [CrossRef]
- Morabbi, H.; Razmkhah, M. Entropy of hybrid censoring schemes. J. Stat. Res. 2010, 6, 161–176. [Google Scholar] [CrossRef]
- Abo-Eleneen, Z.A. The entropy of progressively censored samples. Entropy 2011, 13, 437–449. [Google Scholar] [CrossRef]
- Cramer, E.; Bagh, C. Minimum and maximum information censoring plans in progressive censoring. Commun. Stat. Theory Methods 2011, 40, 2511–2527. [Google Scholar] [CrossRef]
- Cho, Y.; Sun, H.; Lee, K. An estimation of the entropy for a Rayleigh distribution based on doubly generalized Type II hybrid censored samples. Entropy 2014, 16, 3655–3669. [Google Scholar] [CrossRef]
- Cho, Y.; Sun, H.; Lee, K. Estimating the entropy of a Weibull distribution under generalized progressive hybrid censoring. Entropy 2015, 17, 102–122. [Google Scholar] [CrossRef]
- Lee, K. Estimation of entropy of the inverse Weibull distribution under generalized progressive hybrid censored data. J Korea Inf. Sci. Soc. 2017, 28, 659–668. [Google Scholar]
- Manoj, C.; Asha, P.S. Estimation of entropy for generalized exponential distribution based on record values. J Indian Soc. Probab. Stat. 2018, 19, 79–96. [Google Scholar]
- Liu, S.; Gui, W. Estimating the entropy for Lomax distribution based on generalized progressively hybrid censoring. Symmetry 2019, 11, 1219. [Google Scholar] [CrossRef]
- Hassan, A.S.; Zaki, A.N. Estimation of entropy for inverse Weibull distribution under multiple censored data. J. Taibah Univ. Sci. 2019, 13, 331–337. [Google Scholar] [CrossRef]
- Hassan, A.; Al-Ghamdi, A. Optimum step stress accelerated life testing for Lomax distribution. J. Appl. Sci. 2009, 5, 2153–2164. [Google Scholar]
- Kilany, N.M. Weighted Lomax distribution. SpringerPlus 2016, 5, 1862. [Google Scholar] [CrossRef]
- Lomax, K.S. Business failures: Another example of the analysis of failure data. J. Am. Stat. Assoc. 1954, 49, 847–852. [Google Scholar] [CrossRef]
- McKenzie, D.; Miller, C.; Falk, D.A. The Landscape Ecology of Fire; Springer: Dordrecht, The Netherlands; Heidelberg, Germany; New York, NY, USA, 2011. [Google Scholar]
- Kleiber, C.; Kotz, S. Statistical Size Distributions in Economics and Actuarial Sciences; John Wiley and Sons, Inc.: Hoboken, NJ, USA, 2003. [Google Scholar]
- Singh, S.K.; Singh, U.; Yadav, A.S. Reliability estimation for inverse Lomax distribution under type II censored data using Markov chain Monte Carlo method. Int. Math. Stat. 2016, 17, 128–146. [Google Scholar]
- Yadav, A.S.; Singh, S.K.; Singh, U. On hybrid censored inverse Lomax distribution: Application to the survival data. Statistica 2016, 76, 185–203. [Google Scholar]
- Reyad, H.M.; Othman, S.A. E-Bayesian estimation of two-component mixture of inverse Lomax distribution based on type-I censoring scheme. J. Adv. Math. Comput. Sci. 2018, 26, 1–22. [Google Scholar] [CrossRef]
- Hassan, A.S.; Abd-Allah, M. On the inverse power Lomax distribution. Ann. Data Sci. 2018, 6, 259–278. [Google Scholar] [CrossRef]
- Hassan, A.S.; Mohamed, R.E. Weibull inverse Lomax distribution. Pak. J. Stat. Oper. 2019, 15, 587–603. [Google Scholar] [CrossRef]
- ZeinEldin, R.A.; Haq, M.A.U.; Hashmi, S.; Elsehety, M. Alpha power transformed inverse Lomax distribution with different methods of estimation and applications. Complexity 2020, 2020, 1860813. [Google Scholar] [CrossRef]
- Maxwell, O.; Chukwu, A.U.; Oyamakin, O.S.; Khaleel, M.A. The Marshall-Olkin inverse Lomax distribution (MO-ILD) with application on cancer stem cell. J. Adv. Math. Comput. Sci. 2019, 33, 1–12. [Google Scholar] [CrossRef]
- Maxwell, O.; Kayode, A.A.; Onyedikachi, I.P.; Obi-Okpala, C.I.; Victor, E.U. Useful generalization of the inverse Lomax distribution: Statistical properties and application to lifetime data. Am. J. Biomed. Sci. Res. 2019, 6, 258–265. [Google Scholar] [CrossRef]
- Lawless, F.J. Statistical Models and Methods for Lifetime Data; Wiley: New York, NY, USA, 2003. [Google Scholar]
- Pham, H. Springer Handbook of Engineering Statistics; Springer: London, UK, 2006. [Google Scholar]
- Tobias, P.A.; Trindade, D.C. Applied Reliability, 2nd ed.; Chapman and Hall/CRC: New York, NY, USA, 1995. [Google Scholar]
- Wang, F.K.; Cheng, Y. EM algorithm for estimating the Burr XII parameters with multiple censored data. Qual. Reliab. Eng. Int. 2010, 6, 615–630. [Google Scholar] [CrossRef]
- Ansari, N.; Siddiqui, P.Q.R.; Kirmani, S.R. Electrocardiographic studies in the patients of Duchenne muscular dystrophy. J. Pak. Med. Assoc. 1980, 34, 117–121. [Google Scholar]
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).