1. Introduction
The concept of entropy first appeared as a thermodynamic state function, which is a macroscopic state quantity of a thermodynamic system [
1]. The significance of entropy is that it gives a quantitative criterion for the second law of thermodynamics: any change in an isolated system cannot cause a decrease in the total value of entropy.
The study of entropy in mathematics originated from the uncertainty measurement of discrete distribution proposed by Shannon in 1948 [
2]. Shannon defined the entropy function,
where
represents the probability of a discrete random event. Shannon entropy has been successfully applied to other fields [
3,
4,
5,
6]. In the continuous case, Shannon entropy, also known as differential entropy, can be given as follows [
2],
where
is the probability density function.
In 2004, Murali Rao et al. creatively proposed the definition of cumulative residual entropy (CRE) [
7],
by applying the survival function
of
, where
is the cumulative distribution function of
. The CRE replaces the probability density in information entropy with a distribution function, which successfully improves the shortcoming that Shannon entropy cannot be uniformly defined in discrete and continuous cases. Hence, this approach is applied to many new forms of entropy [
8,
9,
10,
11,
12,
13].
In 2009, Murali Rao et al. become interested in the dependence on parameter
in those new entropies generalized from Shannon entropy and proposed entropy based on fractional calculus [
14]
Obviously, Shannon entropy is the case of
, and the addition of fractional computing made entropy more useful [
15,
16,
17,
18].
Shannon also proposed the concept of relative entropy in Ref. [
2], called Kullback–Leibler (KL) information [
19], which measures the information discrepancy between
and
. The definition of KL information is as follows,
where
and
are two density functions.
In statistics, mean squared error (MSE) is a common measure to estimate the parameter accuracy [
20,
21,
22,
23]. Let us suppose that
is a random variable satisfying
The above formula indicates that
is distributed
with parameter
, and
is a parameter space. Assume that
when
and
are any estimator of parameter
. MRE is calculated as follows,
This is similar to the accuracy of the prediction results in the KL entropy measurement data analysis [
24,
25,
26,
27,
28,
29]. However, MSE has some shortcomings, which often cause confusion in the evaluation estimator. To avoid those confusions and make Kullback–Leibler divergence a finite measure in any case, Zhang Jin et al. modified relative entropy as below [
30]. Let
denote the support of
for any
,
for any
, where
is the
confined in
with probability density function (pdf)
,
. Combining the above modifications, the definition of mean relative entropy (MRE) is as follows,
MRE is only defined for distributions with density functions, but not every set of data has a clear distribution function, which complicates the calculation of MRE. In order to overcome the deficiency of MRE and explore the influence of q-order calculation form on MRE, we substitute the distributed function for the density function in MRE and add the fractional calculation to define the fractional cumulative residual mean relative entropy (FCMRE), which is easier to calculate from empirical entropy. In addition, fractional order makes MRE more sensitive to dynamic system changes and can explore more details of complex systems. The rest of the paper is organized as follows.
Section 2 introduces the transformation invariance and defines the fractional cumulative residual mean relative entropy (FCMRE). The statistical properties and example demonstration of the FCMRE are given in
Section 3.
Section 4 confirms that the empirical FCMRE converges to the value of theoretical FCMRE.
Section 5 describes the application of FCMRE in aeroengine gas path system. Finally,
Section 6 draws some conclusions.
2. Fractional Cumulative Residual Mean Relative Entropy
For the proof of the new property, we first state some statistical properties [
20,
31], which are necessary conditions for a logical measure of the estimation error.
A measure is invariant under any one-to-one mapping for parameter. That is, , where .
A measure is invariant under any one-to-one mapping for data. That is, , where , , and .
If is a sufficient statistic for , then the minimum of lies on the sample only through , which is the sufficiency principle.
MSE satisfies the sufficiency principle but does not meet the above two invariants. Inspired by the MRE and the fractional entropy, we substitute the probability density function in MRE with the distribution function and add the form of fractional calculation. The definition of fractional cumulative residual mean relative entropy (FCMRE) is given below.
Definition 1. Let be a random variable from Equation (6) and with a pdf . Let be any estimator of . Fractional cumulative residual mean relative entropy (FCMRE) is defined aswhere , is the confined in , represents the conditional relative entropy for given , and denotes a conditional probability for given . The first is the expectation of , and the second is the expectation taken of and . FCMRE can measure the estimation error of . If has common support, then Next, some properties of FCMRE are exhibited. Proposition 1 and 2 demonstrate the sufficiency principle and changeless properties, where the FCMRE is superior to MSE. In all the following propositions, we assume that is a nonnegative random variable satisfying Equation (6) and is any estimator of .
3. Some Properties of FCMRE
Proposition 1. The fractional cumulative residual MRE (FCMRE) is invariant under any one-to-one mapping of parameter . That is, , where .
The FCMRE is unaltered for any one-to-one mapping of data: , . That is, , where, , , and .
Proof. For
, the proof of parameter invariance under transformation is as follows,
and data invariance under transformation comes from
,
where
,
and
. □
Proposition 2. Given as a nonnegative random variable, and for any estimator ,
if is a sufficient statistic for θ, thenwhere represents the conditional expectation of for .
Proof. Based on the concavity of
for
,
, by the law of double expectation and Jensen’s inequality, gives
which means
; that is,
.
We prove that
for any given sample
. In fact, if
, the density function
. Using the concavity of
and Jensen’s inequality, we can derive that
Therefore, and . □
Proposition 3. Let be a nonnegative random variable, a full and sufficient statistic for , and the sole minimum FMRE estimator of with mean for any estimator .
According to Proposition 2 and the proof process of the Lemamann–Scheffé theorem [
16], Proposition 3 can be proved.
Proposition 4. .
Proof. It follows from Jensen’s inequality that
Using Jensen’s inequality again,
Hence, we proved the proposition. □
Example 1. Let and follow the exponential distribution with and .
The values of FCMRE
are shown in Figure 1 when takes a different value on .
The next proposition explains the effect of linear transformation on FCMRE. That is, the linear invariance of FCMRE.
Proposition 5. Let and be two nonnegative and independent random variables from Equation (6) and with a pdf .
If ,
where and ,
then Proof. Recalling that , , from Equation (9), we can prove this equality. □
Example 2. Consider the simplest case where follows a uniform distribution on and ,
and .
Figure 2 shows the linear invariance of FCMRE.
Figure 3 uses aeroengine time series to verify that FCMRE also has linear properties. EPR and N1 represent engine pressure ratio and high-pressure respectively. The entropy graphs of randomly generated sequences and the entropy curves of aeroengine gas path data prove the universality of property 5.
In traditional information theory, the sum of information entropy of two independent variables is greater than any one of them. The following proposition confirms that FCMRE has an analogous property.
Proposition 6. For two nonnegative and independent random variables and from Equation (6) and with a pdf ,
Proof. Due to
and
being independent,
, then
Since
for
and
,
Using Jensen’s inequality and the negative of Equation (21), we obtain
Both sides are integrated at the same time.
Because of the randomness of , this proposition can be proved. □
Example 3. In Figure 4, let and follow the exponential distribution with and , then let from the exponential distribution with . As shown in the Figure 4, for any , the FCMRE value between and is greater than that between and , indicating that the difference between multiple data points is greater than that between a single data point. Proposition 7. Let be a nonnegative random variable and . It holds that
where the
is cumulative residual mean relative entropy.
Proof. Since
when
, we have
where the second inequality applies Jensen’s inequality. □
Example 4. In Figure 5, given from the exponential distribution with and following exponential distribution with , Figure 6 uses the engine pressure ratio (EPR) and high-pressure rotor speed (N1) to draw the relationship between FCMRE and q-order CMRE. Figure 5 and
Figure 6 show that the value of q-order CMRE is much greater than the value of FCMRE, which reveals that the q-order calculation is not simply equivalent to the q-power of FCMRE.
The next proposition presents the connection between the FCMRE and differential entropy.
Proposition 8. For a nonnegative random variable ,
where
is the differential entropy, and
is a finite function of
.
Proof. By using the log-sum inequality,
Then, the first formula on the left can be expanded:
Exponentiating both sides, we proved Proposition 6. □
Example 5. Let follow an exponential distribution with and from an exponential distribution with in Figure 7. Figure 8 shows the engine pressure ratio (EPR) and high-pressure rotor speed (N1). The relationship between FCMRE and differential entropy is demonstrated by Figure 7 and Figure 8.
Figure 7 and
Figure 8 prove the correctness of Property 8 and also show that FCMRE has a lower limit. The closer q is to 1, the closer FCMRE is to its lower limit.
The following content explains that the empirical value of FCMRE converges to the theoretical value. It proves the value of FCMRE in practical applications.
4. Empirical Fractional Cumulative Residual Mean Relative Entropy
Let
be nonnegative and independent and subject to the same distribution function
. On the basis of Equation (9), let
be the empirical distribution of the sample
with mass
at each point, then the FCMRE with the empirical distribution is as follows,
where
.
Proposition 9. For , given a random variable in , the empirical FCMRE converges to the FCMRE of , i.e., as Proof. By using the dominated convergence proposition, it holds that
Therefore, we just need to illustrate that as
Recall that
where
represents the probability distribution on
. For every sample point
, we assume the mass is
. Then
where
is expectation relative to
. Then using the strong law, we obtain
In particular, almost surely.
Combining Equations (35) and (36), we obtain that
By using the dominated convergence theorem and Equation (35) in Ref. [
7], the proposition is proved. □
5. Application
In this section, we use FCMRE to analyze the complexity of the inherent dynamic characteristics of aeroengine time series and compare the information differences between different aeroengine data. The aeroengine gas path data we selected are shown in
Table 1.
Figure 9 presents a clear monotonic relationship between fractional cumulative residual mean relative entropy and q value, which reveals the internal dynamic characteristics of the aeroengine gas path time series. In
Figure 10, the fractional cumulative residual MRE of three groups of gas path series decreases first with the increase in parameter q and increases slightly when q approaches 1.
At the same time, like the relative entropy, FCMRE can also reflect the difference between different information. After the q value is determined, In
Figure 9, the longitudinal comparison shows that the FCMRE value between engine pressure ratio (EPR) and high-pressure rotor speed (N1) is much larger than that between fuel flow (WF) and high-pressure rotor speed (N1) and that between exhaust gas temperature (EGT) and engine pressure ratio (EPR), which indicates that the information difference between fuel flow (WF) and high-pressure rotor speed (N1), exhaust gas temperature (EGT) and engine pressure ratio (EPR) is much smaller than the FCMRE value between engine pressure ratio (EPR) and high-pressure rotor speed (N1). It also reflects that the information correlation between fuel flow (WF) and high-pressure rotor speed (N1), exhaust gas temperature (EGT), and engine pressure ratio (EPR) is stronger.
The FCMRE values between EPR, WF, EPR + WF, and N1, respectively, were calculated and the calculated results are displayed in
Figure 10.
It can be seen from
Figure 10 that the FCMRE values between engine pressure ratio (EPR) and high-pressure rotor speed (N1), fuel flow (WF) and high-pressure rotor speed (N1), and engine pressure ratio and fuel flow (EPR + WF) and high-pressure rotor speed (N1) also decrease gradually with the increase in q value. Observing the images from a longitudinal perspective, it can also be seen that for any determined value of q, the FCMRE value between engine pressure ratio and fuel flow (EPR + WF) and high-pressure rotor speed (N1) is larger than that between engine pressure ratio (EPR) and high-pressure rotor speed (N1) and between fuel flow (WF) and high-pressure rotor speed (N1). This difference is more obvious when q < 0.5, and when q > 0.5, the difference gradually decreases.
Based on the above analysis, the advantages of FCMRE compared with MRE are summarized in
Table 2.
Compared with FCMRE, neither mean relative entropy nor cumulative residual entropy can measure the difference between multiple pieces of information and amplify the difference at the same time. Moreover, the probability density function is replaced by the residual distribution function so that the calculation of MRE is not limited to the existing distribution function. Therefore, we have reason to think that FCMRE is superior to MRE and CRE.
6. Conclusions
In this work, in consideration of the properties of cumulative residual entropy and fractional entropy, we defined the fractional cumulative residual mean relative entropy by combining it with the average relative entropy. Then, some propositions of the fractional cumulative residual mean relative entropy were derived. Moreover, these properties of the new measure were manifested by numerical simulation. Finally, we prove that the empirical fractional cumulative residual mean relative entropy converges to the theoretical fractional cumulative residual mean relative entropy value.
To explore the practical application value of FCMRE, we selected multiple groups of aeroengine gas path data for analysis and comparison. The results show that FCMRE can analyze the complexity of an aeroengine system and the information difference of different aeroengine data. It has been proved that FCMRE has the advantages of both reflecting the internal complexity of the system and analyzing the differences between various kinds of information. In the future, FCMRE could potentially be used in aircraft internal system failure detection.
Author Contributions
Conceptualization, funding acquisition and formal analysis, K.D.; methodology and writing, S.L. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the Ministry of Education (MOE) in China, Project of Humanities and Social Sciences, under grant No. 19YJC910001 and the Key Laboratory of Civil Aircraft Airworthiness Technology under grant No. SH2020112701.
Data Availability Statement
The data presented in this study are available on request from the corresponding author.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Mackay, D. Information Theory, Inference, and Learning Algorithms. IEEE Trans. Inf. Theory 2003, 50, 2315–2330. [Google Scholar] [CrossRef] [Green Version]
- Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 623–656. [Google Scholar] [CrossRef]
- Wu, J.; Sun, J.; Liang, L.; Zha, Y. Determination of weights for ultimate cross efficiency using Shannon entropy. Expert Syst. Appl. 2011, 38, 5162–5165. [Google Scholar] [CrossRef]
- Bruhn, J.; Lehmann, L.E.; Roepcke, H.; Bouillon, T.W.; Hoeft, A. Shannon entropy applied to the measurement of the electroencephalographic effects of desflurane. Anesthesiology 2001, 95, 30–35. [Google Scholar] [CrossRef]
- Silva, M.; Piqueira, J.; Vielliard, J. Using Shannon entropy on measuring the individual variability in the Rufous-bellied thrush Turdus rufiventris vocal communication. J. Theor. Biol. 2000, 207, 57–64. [Google Scholar] [CrossRef] [Green Version]
- Lee, R.; Jonathan, P.; Ziman, P. Pictish symbols revealed as a written language through application of Shannon entropy. Proc. R. Soc. A Math. Phys. Eng. Sci. 2010, 38, 5162–5165. [Google Scholar] [CrossRef] [Green Version]
- Ubriaco, M.R.; Chen, Y.; Vemuri, B.C.; Wang, F. Cumulative residual entropy: A new measure of information. IEEE Trans. Inf. Theory 2004, 50, 1220–1228. [Google Scholar]
- Asadi, M.; Zohrevand, Y. On the dynamic cumulative residual entropy. J. Stat. Plan. Inference 2007, 137, 1931–1941. [Google Scholar] [CrossRef]
- Navarro, J.; Aguila, Y.; Asadi, M. Some new results on the cumulative residual entropy. J. Stat. Plan. Inference 2010, 140, 310–322. [Google Scholar] [CrossRef]
- Psarrakos, G.; Navarro, J. Generalized cumulative residual entropy and record values. Metrika 2013, 76, 623–640. [Google Scholar] [CrossRef]
- Rajesh, G.; Abdul-Sathar, E.; Nair, K.M.; Reshmi, K. Bivariate extension of dynamic cumulative residual entropy. Stat. Methodol. 2014, 16, 72–82. [Google Scholar] [CrossRef]
- Baratpour, S.; Bami, Z. On the discrete cumulative residual entropy. J. Iran. Stat. Soc. 2012, 2, 203–215. [Google Scholar]
- Park, S.; Kim, I. On cumulative residual entropy of order statistics. Stat. Probab. Lett. 2014, 94, 170–175. [Google Scholar] [CrossRef]
- Ubriaco, M.R. Entropies based on fractional calculus. Phys. Lett. A 2009, 373, 2516–2519. [Google Scholar] [CrossRef] [Green Version]
- Baskonus, H.M.; Mekkaoui, T.; Hammouch, Z.; Bulut, H. Active Control of a Chaotic Fractional Order Economic System. Entropy 2015, 17, 5771–5783. [Google Scholar] [CrossRef] [Green Version]
- Magin, R.L.; Ingo, C. Entropy and Information in a Fractional Order Model of Anomalous Diffusion. IFAC Proc. Vol. 2012, 45, 428–433. [Google Scholar] [CrossRef]
- Crescenzo, A.D.; Kayal, S.; Meoli, A. Fractional generalized cumulative entropy and its dynamic version. Commun. Nonlinear Sci. Numer. Simul. 2021, 102, 105899. [Google Scholar] [CrossRef]
- Karci, A. Fractional order entropy: New perspectives. Opt.-Int. J. Light Electron. Opt. 2016, 127, 9172–9177. [Google Scholar] [CrossRef]
- Kullback, S.; Leibler, R.A. On Information and Sufficiency. Ann. Math. Stat. 1951, 22, 79–86. [Google Scholar] [CrossRef]
- Waerden, B.L.V.D. Mathematical Statistics; Intext Educational Publishers: New York, NY, USA, 1971. [Google Scholar]
- Bickel, P.J.; Doksum, K.A. Mathematical Statistics: Basic Ideas and Selected Topics, Volume II; Chapman and Hall/CRC: Boca Raton, FL, USA, 2015. [Google Scholar]
- Casella, G.; Berger, R.L. Statistical inference. Technometrics 1990, 33, 493. [Google Scholar] [CrossRef]
- Lehmann, E.L.; Casella, G. Theory of Point Estimation; Wiley: Hoboken, NJ, USA, 1983. [Google Scholar]
- Dragalin, V.; Fedorov, V.; Patterson, S.; Jones, B. Kullback-Leibler divergence for evaluating bioequivalence. Stat. Med. 2010, 22, 913–930. [Google Scholar] [CrossRef] [PubMed]
- Ludovisi, A.; Taticchi, M.I. Investigating beta diversity by Kullback-Leibler information measures. Ecol. Model. 2006, 192, 299–313. [Google Scholar] [CrossRef]
- Smith, A.; Naik, P.A.; Tsai, C.L. Markov-Switching Model Selection Using Kullback-Leibler Divergence. SSRN Electron. J. 2005, 134, 553–577. [Google Scholar] [CrossRef] [Green Version]
- Harmouche, J.; Delpha, C.; Diallo, D. Incipient fault detection and diagnosis based on Kullback–Leibler divergence using Principal Component Analysis: Part I—ScienceDirect. Signal Process. 2014, 94, 278–287. [Google Scholar] [CrossRef]
- Zhang, W.; Shan, S.; Chen, X. Local Gabor Binary Patterns Based on Kullback–Leibler Divergence for Partially Occluded Face Recognition. IEEE Signal Process. Lett. 2007, 14, 875–878. [Google Scholar] [CrossRef]
- Chung, Y.; Kim, C.; Dey, D.K. Simultaneous Estimation of Poisson Means under Weighted Entropy Loss. Calcutta Stat. Assoc. Bull. 1994, 44, 175. [Google Scholar] [CrossRef]
- Zhang, J.; Sampson, E. The Mean Relative Entropy: An Invariant Measure of Estimation Error. Am. Stat. 2021, 75, 117–123. [Google Scholar] [CrossRef]
- Ray, W.D. The Foundation of Statistical Inference. J. Oper. Res. Soc. 1963, 14, 92–94. [Google Scholar] [CrossRef]
Figure 1.
The fractional cumulative residual MRE of exponential distribution with and .
Figure 1.
The fractional cumulative residual MRE of exponential distribution with and .
Figure 2.
The fractional cumulative residual MRE of follows a uniform distribution on and , and .
Figure 2.
The fractional cumulative residual MRE of follows a uniform distribution on and , and .
Figure 3.
The fractional cumulative residual MRE of N1, 0.5 N1, and EPR time series.
Figure 3.
The fractional cumulative residual MRE of N1, 0.5 N1, and EPR time series.
Figure 4.
The fractional cumulative residual MRE of exponential distribution with , and the fractional cumulative residual MRE of .
Figure 4.
The fractional cumulative residual MRE of exponential distribution with , and the fractional cumulative residual MRE of .
Figure 5.
The fractional cumulative residual MRE of exponential distribution with , , and q-order cumulative MRE.
Figure 5.
The fractional cumulative residual MRE of exponential distribution with , , and q-order cumulative MRE.
Figure 6.
The fractional cumulative residual MRE of EPR, N1 time series, and q-order of fractional cumulative residual MRE.
Figure 6.
The fractional cumulative residual MRE of EPR, N1 time series, and q-order of fractional cumulative residual MRE.
Figure 7.
The fractional cumulative residual MRE of exponential distribution with , , and the values of .
Figure 7.
The fractional cumulative residual MRE of exponential distribution with , , and the values of .
Figure 8.
The fractional cumulative residual MRE of EPR, N1 time series, and the values of .
Figure 8.
The fractional cumulative residual MRE of EPR, N1 time series, and the values of .
Figure 9.
The fractional cumulative residual MRE of aeroengine gas path date N1, WF, EPR, and EGT time series.
Figure 9.
The fractional cumulative residual MRE of aeroengine gas path date N1, WF, EPR, and EGT time series.
Figure 10.
The fractional cumulative residual MRE of EPR, WF, EPR + WF, and N1 time series.
Figure 10.
The fractional cumulative residual MRE of EPR, WF, EPR + WF, and N1 time series.
Table 1.
Aeroengine data.
Table 1.
Aeroengine data.
EPR | N1 | WF | EGT |
---|
engine pressure ratio | high-pressure rotor speed | fuel flow | exhaust gas temperature |
Table 2.
Advantages of FCMRE compared with MRE.
Table 2.
Advantages of FCMRE compared with MRE.
MRE | FCMRE |
---|
It cannot be calculated for data without a distribution function | The FCMRE for arbitrary data can be approximated by empirical entropy |
Information differences can be measured | The difference between information can be measured when q is different |
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).