Abstract
The Tsallis entropy is an extension of the Shannon entropy and is used extensively in physics. The cumulative residual Tsallis entropy, which is a generalization of the Tsallis entropy, plays an important role in the measurement uncertainty of random variables and has simple relationships with other important information and reliability measures. In this paper, some novel properties of the cumulative residual Tsallis entropy are disclosed. Moreover, this entropy measure is applied to testing the uniformity, where the limit distribution and an approximation of the distribution of the test statistic are derived. In addition, the property of stability is discussed. Furthermore, the percentage points and power against seven alternative distributions of this test statistic are presented. Finally, to compare the power of the suggested test with that of other tests of uniformity, a simulation study is conducted.
Keywords:
cumulative residual Tsallis entropy; stability; empirical cumulative distribution function; testing uniformity; Monte Carlo method; test power MSC:
62G05; 62G30; 94A15; 94A17
1. Introduction
The classical measure of uncertainty in a discrete distribution (Shannon [1]) has been used in many areas, such as computer science [2], communication theory [2], the physical and chemical fields [3], fuzzy sets [4], and finance [5,6]. A straightforward extension of the discrete case to continuous distributions based on a probability density function (PDF) of a continuous random variable (RV) , called differential entropy, reads (cf. [7])
Many generalizations of the Shannon entropy have been published by inserting some additional parameters, making these generalizations more responsive to diverse shapes of probability distributions. Rao et al. [7] (see also, Wang et al. [8]) suggested a non-negative measure of uncertainty and referred to it as the cumulative residual entropy (CRE). This suggested measure is obtained by replacing the PDF in (1) by the survival function . Thus, for any continuous RV X with a cumulative distribution function (CDF) , the CRE is specified by
where
The Tsallis entropy of order is a generalization of the Shannon entropy that was first given by Havrda and Charvat [9]. Then, Tsallis [10] used its properties and placed it in a physical context. This measure is defined for any continuous RV X as
where . Clearly, as , .
Motivated by the wide applicability of the Tsallis entropy, Sati and Gupta [11] proposed the cumulative residual Tsallis entropy (CRTE) of order , which is given by
Rajesh and Sunoj [12] introduced an alternate measure of CRTE of order , which possesses certain interesting properties with , as
when , then but , and , for more details see Mohamed [13]. In this work we focus only on the measure (2).
Rajesh and Sunoj [12] detected several eminent features of the CRTE (2). For example, the CRTE has more interesting mathematical features than the CRE, it can be easily estimated from sample data, and these estimates asymptotically converge to the true values. Moreover, the CRTE handles the information in residual life. For the standard uniform distribution, denoted by , Rajesh and Sunoj [12] determined the value of the CRTE, which is . The literature teems with several results of the Shannon entropy and its related measures. Interested readers may refer to [14,15,16,17,18] for the Shannon entropy, Kullback–Leibler divergence, and Fisher information number; [19,20] for fractional cumulative residual entropy and cumulative residual entropy, respectively; [13,21,22,23,24] for the Tsallis entropy and its related measures; and finally [25,26] for the extropy and Rényi entropy and its applications, respectively.
Stephens [27] provided a useful guide to goodness-of-fit tests using statistics based on the empirical CDF. Furthermore, power comparisons of several uniformity tests were performed in [27]. The power attributes of an entropy-based test when employed for measuring uniformity were examined by Dudewicz and Van der Meulen [28]. Furthermore, Dudewicz and Van der Meulen [28] demonstrated that the entropy-based test has good power qualities for various alternatives by comparing it to other uniformity tests. Noughabi [29] developed a test for uniformity based on the CRE and studied some of its features. In addition, he compared the percentage points and power of seven alternative distributions. Mohamed et al. [30] used the fractional and weighted CRE measures to test the uniformity.
In this paper, we study the CRTE (2) for testing uniformity. The outcome of a simulation study reveals that the test under CRTE is competing with the test based on CRE in terms of power. In addition, some interesting statistical properties of the CRTE are revealed. We also use the Monte Carlo method via simulation and normality asymptotic, as well as the beta approximation, to derive the percentage points under the CRTE. In addition, the CRTE and other tests are compared in terms of power analysis.
Work Motivation
The Tsallis entropy of order , which was introduced by Tsallis [10], plays an important role in the measurement uncertainty of RVs and leads nonextensive statistics. The Tsallis entropy is the basis of the so-called nonextensive statistical mechanics, which generalizes the Boltzmann-Gibbs theory (cf. [31]). Tsallis statistics have found applications in a wide range of phenomena in diverse disciplines such as physics, chemistry, biology, medicine, economics, geophysics, etc. For example, Cartwright [32] proposed applications of the Tsallis entropy in various fields, such as describing the fluctuation of the magnetic field in the solar wind and signs of breast cancer in mammograms. Sati and Gupta [11] introduced a cumulative residual Tsallis entropy of order and studied its various properties in the context of reliability modeling. After one year, Rajesh and Sunoj [12] introduced an alternate measure of CRTE (defined by (2)) and studied its properties. Unlike the CRTE of Sati and Gupta [11], the proposed measure had some additional features and had simple relationships with other important information and reliability measures.
There are many different types of probability distributions, and the uniform distribution is perhaps the simplest of them all. For a continuous distribution, the uniform distribution defines equal probability over a given range. As a result, it is valuable as a reference distribution. Random number generation is one of the most important uses of uniform distribution. Moreover, in the field of economics, usually, demand and replenishment may not follow the expected normal distribution. As a result, different distribution models are employed to better anticipate probabilities and trends. According to Wanke [33], uniform distribution is more effective when evaluating lead-time for inventory management at the beginning of the lifecycle when a brand new product is being studied. Furthermore, social scientists use uniform distribution to represent a lack of knowledge. For example, in a simulation where distribution is not known, uniform random variates are often used. Uniform distribution is also used to describe the measurement error of some instruments or measuring systems. All of these factors (cf. [34]) explain the increasing interest in the choice of simple and computationally efficient tests for hypotheses about the uniform law of analyzed samples.
The aforesaid theoretical and practical importance of the statistic CRTE defined in (2) and the tests for uniformity provides a sufficient motivation to study and reveal some important properties of that statistic and use it for testing the uniformity.
The rest of the paper is organized as follows. In Section 2, we obtain some new findings of the CRTE. In Section 3, we propose the CRTE test statistic for uniformity and discuss some of its properties, including the property of stability. In Section 5, we propose the methods of finding the percentage points of CRTE. In addition, we estimate the percentage points of CRTE. In Section 6, we use a Monte Carlo simulation to carry out the power comparison of the uniformity of different tests against seven alternative distributions.
2. Some Properties of CRTE
In what follows, the symbols (), (), and () stand for convergence in probability, convergence in distribution and almost surely, as . In this section, we derive some properties of the measure , which is defined in (2).
Theorem 1.
Let be a random vector in . Furthermore, for all and some , let , i.e., . Then, , .
Proof.
For all and , we can easily check that the function attains its maximum value at . Moreover,
On the other hand, for each , we are now going to prove the inequality
Clearly, in view of the relation (3), the inequality (4) holds if . Since, for , we obtain , we consider the ratio in the interval , where . By using (3), we obtain
Thus, the inequality (4) holds if or, equivalently, . On the other hand, since p may be arbitrarily chosen in the interval , we can choose it to be sufficiently close to 1, in order that the inequality (4) be held for any . This proves the inequality (4). It is worth mentioning that this inequality is not satisfied for , in general, Figure 1 shows this fact.
Figure 1.
The relation between the functions and with different values of and .
Remark 1.
For any RV , it is well-known that the existence of Var implies . Thus, the existence of Var is a sufficient condition for , .
Theorem 2 (Weak convergence).
Let the sequence of N-dimensional random vectors converge in distribution to a random vector . Furthermore, for all , let , . Then,
Proof.
Since , we have
Meanwhile, from (4), we obtain
where is the ith component of the random vector and is the indicator function, i.e., , . Therefore, if , is bounded by an integrable function. Meanwhile, for any , we can choose sufficiently close to one to satisfy . The use of the dominated convergence theorem completes the proof. □
We show below that the measure dominates the differential entropy (1), which may exist when X has density.
Theorem 3.
Suppose that X is a non-negative RV with CDF ; then,
where , and is the differential entropy defined in (1).
3. Further Theoretical Aspects and Test Statistic
To establish the test with a null hypothesis of uniformity, we need the following theorem.
Theorem 4.
Let X be a non-negative RV with a continuous CDF F with a support . Then, . Moreover, the value is uniquely attained by the uniform distribution for all .
Proof.
The proof of inequality follows directly from (3). Meanwhile, using the strict concavity of , we obtain as a concave function of distributions (with support ). Thus, is uniquely acquired by the distribution . This completes the proof. □
Let be a random sample with a continuous CDF F defined on . Furthermore, let be the corresponding order statistics. Clearly, we can suggest an estimator of by , where and is the empirical CDF, which is given by
Moreover, in order to get a consistent test of the hypothesis of uniformity, we propose the consistent statistic test
where , , and , , .
Theorem 5.
The test based on the sample estimate is consistent.
Proof.
From the Glivenko-Cantelli theorem, see Howard [35], we have . Moreover, it is easily asserted that , which proves the theorem. □
Remark 2.
Since we obtain , under the null hypothesis . On the other hand, under the alternative hypothesis (that F is any continuous CDF defined on , which is not the uniform) we have , where q is a smaller or larger number than .
Theorem 6.
Let the random sample be drawn from an unknown continuous CDF F defined on . Then, , .
Proof.
In view of (3), we obtain
This completes the proof. □
Theorem 7.
Under , the mean and variance of are given, respectively, by
Proof.
Clearly, for any , the RV , based on the uniform distribution has a beta distribution with parameter-vector , written (cf. [36]). This completes the proof. □
Remark 3.
Under , we have , and .
The critical region, which specifies the uniformity test, is defined by
where is the desired level of significance, and is the quantile of the asymptotic, or approximated, CDF of the test statistic , under .
The Stability of CRTE
The stability of measures of information has been studied by several works of literature, see [19,37,38,39,40]. Analogously, we define the stability of the CRTE as the following.
Definition 1.
Let be a random sample with a continuous CDF F and be any small deformation of . Then, the empirical CRTE is stable if , and , we have .
The next theorem gives a sufficient condition of the stability of the empirical CRTE.
Theorem 8.
For any continuous RV , the empirical CRTE is stable if X is distributed on a finite interval.
Proof.
Suppose that the RV X is supported in the finite interval such that and . In view of (8), the empirical CRTE can be derived as
For , the stability of empirical CRTE is obvious. In brief, denote , , and . Thus, when , we obtain
where the second term in the second inequality in (10) is legitimated in view of (3). On the other hand, the first term in that inequality is legitimated from the fact that for any , , and arbitrary small , , such that , whenever (cf. [19]), which implies , whenever, . Now, choose , we obtain . This completes the proof. □
4. Percentage Points of the Test Statistic
In this section, we obtain the asymptotic distribution of under . From (8), we can write , where , , and . Thus, the RV has the PDF
The mean and variance of are and , respectively. By using the Lyapunov central limit theorem (cf. Billingsley [41]), we obtain , where Z is the standard normal RV. Therefore, under , the percentage point (quantile) is estimated for large n by using the asymptotic normality of as follows
where is the quantile of the standard normal distribution .
Johannesson and Giri [42] suggested an approximation of the CDF of the linear combination of the finite number of beta RVs. Noughabi [29] utilized this result to approximate the percentage points of the CRE for finite . By following a similar method, an approximation of for finite n can be obtained as follows:
where the RV has the beta distribution with
According to (12), the mean and variance of are given, respectively, by
Using this approximation of , the quantiles of order and of the approximated CDF of the test statistic under are given, respectively, by
where is the quantile function of the beta distribution, and the parameter-vector is defined in (13).
Percentage Points
We generated 50,000 samples of sizes = 10, 20, 30, 40, 50, 70, and 100 from . Utilizing (8), the test statistic was estimated by the empirical CRTE for each sample. Moreover, we can see that , , , , , and , where is the CRTE of the CDF . Consequently, for , we present the percentage points of the Monte Carlo method, asymptotic normality, and beta approximation by using (9), (11), and (14), respectively. Table 1 shows that as n increased, the difference between percentage points (upper–lower) decreased. Furthermore, the Monte Carlo approach was more accurate than the other two methods for , because it had almost minimum differences between percentage points.
Table 1.
Percentage points of the suggested test statistic at level .
Figure 2 and Figure 3 depict the empirical PDFs of the test statistics via Monte Carlo samples for = 10, 20, 30, 50, and 100, and and = 10, 20, 30, 50, and 100, and 10, respectively. It is noted that the means of the empirical PDFs of became nearer to the exact values () as n increased, which indicates that the bias and variance decreased with an increase in . Moreover, the six corresponding figures to and 10 in Figure 2 and Figure 3 reveal that the improvement in the bias and variance does not depend almost on .
Figure 2.
The estimated PDFs of under , for .
Figure 3.
The estimated PDFs of under , for .
5. Power Analysis
In this section, we examine the power test of the Monte Carlo method under alternative distributions. The power of is estimated by the proportion of the generated samples that are in the critical region. Under seven alternative distributions, the power of is calculated by the Monte Carlo method for the generated 50,000 samples each of size , and . The alternative CDFs introduced by Stephens [27] in the power study of uniformity tests are
On the report of Stephens [27], the family gives points nearer to zero than predictable under the hypothesis of uniformity and is interpreted as a change in the mean, the family gives points near 0.5 and is interpreted as a change toward a smaller variance, and the family shows two clusters close to and is interpreted as a change toward a larger variance.
In Table 2, we recorded the power values of the proposed test statistics , Kolmogorov–Smirnov (K-S), Kuiper (V), Cramer-von Mises (), Watson (), and Anderson-Darling (), for , and 40, and . From Table 2, we can draw the following conclusions:
Table 2.
Power estimates of the tests at the level .
- For a fixed and as n increases, we see that the power of increases.
- For the alternatives and , the power of increases and gives better performance against the other tests when tends to 1 ().
- For the alternative , when n and increases, the power of increases and gives a better performance than the other tests.
6. Conclusions
Some novel properties of the CRTE quantity were presented such as sufficient conditions for the CRTE to be finite, the weak convergence of the CRTE, the connections between the CRTE, the CRE, and classic differential entropy, and the stability of the empirical CRTE. Furthermore, for the CDFs with support , we exhibited that the value of was within . Moreover, the test of uniformity was proposed by calculating the percentage points and power analysis of . In addition, for , we obtained the percentage points by using the Monte Carlo method via simulation and normality asymptotic, as well as the beta approximation. A power comparison was performed between the CRTE and other tests, where, by changing the value of , we indicated when the test had higher and lower power compared with the other tests.
When we talk about the prospects for future research, we consider here two problems. The first one is to extend the result of this work to a multivariate version of the entropy measures; see, for example, the Formulas (4), (5), and (6) of [43], as a starting point for that future work. The second future research goal is to apply the proposed test to a recent real-world dataset to help solve one of society’s practical concerns.
Author Contributions
The authors contributed equally to the paper. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The simulated data used to support the findings of this study are included within the article.
Acknowledgments
The authors are grateful to Mata Wang and the four anonymous reviewers for their careful and diligent reading, which improved the readability and presentation substantially.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Shannon, C.E. A mathematical theory of communication. Bell. Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef] [Green Version]
- Li, Z.; Li, W.; Liu, R. Applications of entropy principles in power systems: A survey. In Proceedings of the 2005 IEEE/PES Transmission & Distribution Conference & Exposition: Asia and Pacific, Dalian, China, 15–18 August 2005; pp. 1–4. [Google Scholar]
- Schmid, R.; Miah, A.M.; Sapunov, V.N. A new table of the thermodynamic quantities of ionic hydration: Values and some applications (enthalpy–entropy compensation and Born radii). Phys. Chem. Chem. Phys. 2000, 2, 97–102. [Google Scholar] [CrossRef]
- Song, Y.; Fu, Q.; Wang, Y.F.; Wang, X. Divergence-based cross entropy and uncertainty measures of Atanassov’s intuitionistic fuzzy sets with their application in decision making. Appl. Soft Comput. 2019, 84, 105703. [Google Scholar] [CrossRef]
- Gu, R. Multiscale Shannon entropy and its application in the stock market. Physica A 2017, 484, 215–224. [Google Scholar] [CrossRef]
- Zhou, R.; Cai, R.; Tong, G. Applications of entropy in finance: A review. Entropy 2013, 15, 4909–4931. [Google Scholar] [CrossRef]
- Rao, M.; Chen, Y.; Vemuri, B.C.; Wang, F. Cumulative residual entropy: A new measure of information. IEEE Trans. Inf. Theory 2004, 50, 1220–1228. [Google Scholar] [CrossRef]
- Wang, F.; Vemuri, B.C. Non-rigid multi-model image registration using cross-cumulative residual entropy. Int. J. Comp. Vision 2007, 74, 201–215. [Google Scholar] [CrossRef] [Green Version]
- Havrda, J.; Charvat, F. Quantification method of classification process: Concept of structural α-entropy. Kybernetika 1967, 3, 30–35. [Google Scholar]
- Tsallis, C. Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 1988, 52, 479–487. [Google Scholar] [CrossRef]
- Sati, M.M.; Gupta, N. Some characterization results on dynamic cumulative residual Tsallis entropy. J. Probab. Stat. 2015, 2015, 694203. [Google Scholar] [CrossRef] [Green Version]
- Rajesh, G.; Sunoj, S.M. Some properties of cumulative Tsallis entropy of order α. Stat. Pap. 2019, 60, 933–943. [Google Scholar] [CrossRef]
- Mohamed, M.S. On cumulative Tsallis entropy and its dynamic past version. Indian J. Pure Appl. Math. 2020, 51, 1903–1917. [Google Scholar] [CrossRef]
- Abd Elgawad, M.A.; Alawady, M.A.; Barakat, H.M.; Xiong, S. Concomitants of generalized order statistics from Huang-Kotz Farlie-Gumbel-Morgenstern bivariate distribution: Some information measures. Bull. Malays. Math. Sci. Soc. 2020, 43, 2627–2645. [Google Scholar] [CrossRef]
- Abd Elgawad, M.A.; Barakat, H.M.; Xiong, S.; Alyami, S.A. Information measures for generalized order statistics and their concomitants under general framework from Huang-Kotz FGM bivariate distribution. Entropy 2021, 23, 335. [Google Scholar] [CrossRef]
- Alawady, M.A.; Barakat, H.M.; Abd Elgawad, M.A. Concomitants of generalized order statistics from bivariate Cambanis family of distributions under a general setting. Bull. Malays. Math. Sci. Soc. 2021, 44, 3129–3159. [Google Scholar] [CrossRef]
- Barakat, H.M.; Husseiny, I.A. Some information measures in concomitants of generalized order statistics under iterated Farlie-Gumbel-Morgenstern bivariate type. Quaest. Math. 2021, 44, 581–598. [Google Scholar] [CrossRef]
- Park, S. Information measure in terms of the hazard function and its estimate. Entropy 2021, 23, 298. [Google Scholar] [CrossRef]
- Xiong, H.; Shang, P.; Zhang, Y. Fractional cumulative residual entropy. Comm. Nonlin. Sci. Num. Simul. 2019, 78, 104879. [Google Scholar] [CrossRef]
- Zhang, Y.; Shang, P.; He, J.; Xiong, H. Cumulative Tsallis entropy based on power spectrum of financial time series. Chaos 2019, 29, 103–118. [Google Scholar] [CrossRef]
- Irshad, M.R.; Maya, R.; Buono, F.; Longobardi, M. Kernel estimation of cumulative residual Tsallis entropy and its dynamic version under ρ-mixing dependent data. Entropy 2022, 24, 9. [Google Scholar] [CrossRef]
- Mohamed, M.S. On cumulative residual Tsallis entropy and its dynamic version of concomitants of generalized order statistics. Commun. Stat. Theory Methods 2020. [CrossRef]
- Mohamed, M.S.; Abdulrahman, A.T.; Almaspoor, Z.; Yusuf, M. Ordered variables and their concomitants under extropy via COVID-19 data application. Complexity 2021, 2021, 114. [Google Scholar] [CrossRef]
- Toomaj, A.; Atabay, H.A. Some new findings on the cumulative residual Tsallis entropy. J. Comput. Appl. Math. 2021, 400, 113669. [Google Scholar] [CrossRef]
- Mohamed, M.S. A measure of inaccuracy in concomitants of ordered random variables under Farlie-Gumbel-Morgenstern family. Filomat 2019, 33, 4931–4942. [Google Scholar] [CrossRef]
- Mohamed, M.S. Some new findings on the survival Rényi entropy and application of COVID-19 data. Results Phys. 2021, 31, 104966. [Google Scholar] [CrossRef] [PubMed]
- Stephens, M.A. EDF statistics for goodness of fit and some comparisons. J. Am. Stat. Assoc. 1974, 69, 730–737. [Google Scholar] [CrossRef]
- Dudewicz, E.J.; Van der Meulen, E.C. Entropy-based tests of uniformity. J. Am. Stat. Assoc. 1981, 76, 967–974. [Google Scholar] [CrossRef]
- Noughabi, H.A. Cumulative residual entropy applied to testing uniformity. Commun. Stat. Theory Methods 2020, 50, 1811339. [Google Scholar] [CrossRef]
- Mohamed, M.S.; Barakat, H.M.; Alyami, S.A.; Abd Elgawad, M.A. Fractional entropy-based test of uniformity with power comparisons. J. Math. 2021, 2021, 5331260. [Google Scholar] [CrossRef]
- Anastasiadis, A. Special Issue: Tsallis Entropy. Entropy 2012, 14, 174–176. [Google Scholar] [CrossRef] [Green Version]
- Cartwright, J. Roll over, Boltzmann. Phys. World 2014, 27, 31–35. [Google Scholar] [CrossRef]
- Wanke, P. The uniform distribution as a first practical approach to new product inventory management. Int. J. Prod. Econ. 2008, 114, 811–819. [Google Scholar] [CrossRef]
- Blinov, P.Y.; Lemeshko, B.Y. A review of the properties of tests for uniformity. In Proceedings of the 2014 12th International Conference on Actual Problems of Electronic Instrument Engineering, Novosibirsk, Russia, 2–4 October 2014. [Google Scholar] [CrossRef]
- Howard, G.T. A generalization of the Glivenko-Cantelli theorem. Ann. Math. Stat. 1959, 30, 828–830. [Google Scholar] [CrossRef]
- Arnold, B.C.; Balakrishnan, N.; Nagaraja, H.N. A First Course in Order Statistics; Wiley: New York, NY, USA, 1992. [Google Scholar]
- Abe, S. Stability of Tsallis entropy and instabilities of Renyi and normalized Tsallis entropies: A basis for q-exponential distributions. Phys. Rev. E 2002, 66, 046134. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Abe, S.; Kaniadakis, G.; Scarfone, A.M. Stabilities of generalized entropies. J. Phys. A Math. Gen. 2004, 37, 10513. [Google Scholar] [CrossRef] [Green Version]
- Lesche, B. Instabilities of Renyi entropies. J. Stat. Phys. 1982, 27, 419–422. [Google Scholar] [CrossRef]
- Ubriaco, M.R. Entropies based on fractional calculus. Phys. Lett. A 2009, 373, 2516–2519. [Google Scholar] [CrossRef] [Green Version]
- Billingsley, P. Probability and Measure; John Wiley & Sons: New York, NY, USA, 2008. [Google Scholar]
- Johannesson, B.; Giri, N. On approximations involving the beta distribution. Commun. Stat. Simul. Comput. 1995, 24, 489–503. [Google Scholar] [CrossRef]
- Mesiar, R.; Sheikhi, A. Nonlinear random forest classification, a copula-based approach. Appl. Sci. 2021, 11, 7140. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).