- freely available
- re-usable

*Entropy*
**2014**,
*16*(6),
3009-3025;
doi:10.3390/e16063009

Published: 27 May 2014

## Abstract

**:**As a novel data mining approach, a wavelet entropy algorithm is used to perform entropy statistics on wavelet coefficients (or reconstructed signals) at various wavelet scales on the basis of wavelet decomposition and entropy statistic theory. Shannon wavelet energy entropy, one kind of wavelet entropy algorithm, has been taken into consideration and utilized in many areas since it came into being. However, as there is wavelet aliasing after the wavelet decomposition, and the information set of different-scale wavelet decomposition coefficients (or reconstructed signals) is non-additive to a certain extent, Shannon entropy, which is more adaptable to extensive systems, couldn’t do accurate uncertainty statistics on the wavelet decomposition results. Therefore, the transient signal features are extracted incorrectly by using Shannon wavelet energy entropy. From the two aspects, the theoretical limitations and negative effects of wavelet aliasing on extraction accuracy, the problems which exist in the feature extraction process of transient signals by Shannon wavelet energy entropy, are discussed in depth. Considering the defects of Shannon wavelet energy entropy, a novel wavelet entropy named Tsallis wavelet energy entropy is proposed by using Tsallis entropy instead of Shannon entropy, and it is applied to the feature extraction of transient signals in power systems. Theoretical derivation and experimental result prove that compared with Shannon wavelet energy entropy, Tsallis wavelet energy entropy could reduce the negative effects of wavelet aliasing on accuracy of feature extraction and extract transient signal feature of power system accurately.

## 1. Introduction

Transient feature extraction is an important part of signal analysis, and as a new feature extraction algorithm, wavelet entropy has attracted the attention of experts and scholars all over the world. Wavelet entropy is a combination of wavelet decomposition and entropy statistics theories, and it has the advantages of multi-resolution analysis and complexity evaluation for time-varying signals, which means that the macro and micro aspects of some special signals could be researched in the time-frequency domain. For the above reasons, wavelet entropy has gradually been used in engineering signal surveys such as electroencephalography (EEG) testing, machinery vibration detection, power system fault diagnosis and so on [1,2,3]. Various studies have also manifested the better performance of wavelet entropy in analyzing the variability and complexity of climate processes compared with traditional methods [4,5,6]. For estimating the cognitive workload of human brains, wavelet entropy is used to extract the variation features from EEG signals, and for a subject-independent multi-channel classification scheme, the entropy features achieved high accuracy, up to 98% for channels from the frontal lobes, in the delta frequency band [7]. Bing et al. [8] presented a sensor fault detection method based on wavelet entropy and applied it to fault diagnosis of micro-gas turbine engine sensors. Remarkably, Zhenyou et al. [9,10] introduced wavelet entropy to the signal analysis of power systems, which makes it possible for some special power signals, including voltage dips, voltage interruptions, voltage flicker, voltage pulse and so on, to be analyzed more extensively and deeply than ever.

However, most research on wavelet entropy involves engineering applications of Shannon wavelet energy entropy (SWEE), but its physical meanings, working mechanism, and application principle have not been well discussed yet. In fact, Shannon wavelet energy entropy has some disadvantages in processing non-stationary signals, which could result in inaccurate or wrong results.

In view of the abovementioned facts, in this paper, the theoretical basis of SWEE has been analyzed, and the emphasis of research was placed on the working mechanism for feature extraction of transient signals. Wavelet aliasing’s negative effect on feature extraction, which uses Shannon wavelet entropy, is discussed to find the primary reason for the problem.

Because Tsallis entropy is good at expressing the uncertainty of generalized systems, with a combination of Tsallis entropy and wavelet decomposition, a wavelet entropy-Tsallis wavelet energy entropy (TWEE) is constructed and the physical meaning of its algorithm is also explained. Starting with an analysis of wavelet aliasing and feature extraction effects, this paper studied the working mechanism and the suitable scope of TWEE, analyzes the relationship and difference between TWEE and SWEE and provides the initial principle of non-extensivity index selection. Finally, for analyzing transient voltage fluctuations caused by non-faulty indirect lightning strikes, TWEE is applied to transient feature extraction of lightning strikes. The corresponding voltage data when a non-faulty indirect lightning strike took place on 110 kV overhead distribution line in Guangdong, China, is collected and analyzed using TWEE, and the comparison of feature extraction, based upon TWEE and SWEE, has been carried out. The experimental results show that TWEE is better than SWEE in analyzing transient signals.

## 2. SWEE and Its Theoretical Defects

#### 2.1. The Definition of SWEE

Shannon wavelet entropy (SWE) is the cooperative product of wavelet decomposition and Shannon entropy theory. By changing the segment of wavelet coefficients at the corresponding scales, and for different statistic objects using Shannon entropy, several derived algorithms are proposed based on SWE theory. Among these, SWEE is one of the most widely used algorithms in all fields [10]. The definition of SWEE is given as follows: first, a sliding window (width w ∈ N) is defined upon wavelet coefficients D = {d_{j} (k), k = 1,…, L; j = 1,…, S}. Then, suppose that:

#### 2.2. The Theoretical Defects of SWEE

In the macro-perspective, SWEE is the data mining of wavelet coefficients (or reconstructed signals) base on Shannon entropy statistics, so it is inevitable that Shannon WE inherits the statistic properties of Shannon entropy. As well known, Shannon entropy is the extended application for B-G entropy in information theory. As an important index in thermodynamics, B-G entropy represents the measurement for the uncertainty of N-energy-level system, and is defined as:

Shannon entropy is represented as:

From Equations (2) and (3), it is found that there is a strong connection between Shannon entropy and B-G entropy. In some ways, Shannon entropy naturally inherits the statistic property of B-G entropy such as convex linearity, extensivity, continuity, and so on, so Shannon entropy also satisfies the following Equation (4):

It is well known that Boltzmann-Gibbs statistics (BGS) are inadequate for treating some complex systems [14]. These are systems with complex or long term interactions and correlations, systems showing often distribution laws are different from the usual ones (Gauss, Poisson), systems in chaotic or fractal states and often related to nonextensive phenomena in energy, entropy, heat, and other quantities [15]. In the process of wavelet decomposition, energy leakage exists at each scale, and the leakage of energy invades neighboring scales. At the same time, the extent of interaction, among neighbor scales, will change with a change in the component of signal. Obviously, it belongs to the complex system mentioned above. In other words, Equation (4) is not true when Shannon entropy is used to analyze wavelet decomposition statistically, which means SWEE is not entirely applicable to some transient signals with components change in the time-frequency domain, so it is necessary to discuss this issue in detail and solve it.

## 3. Research on Issue in SWEE

#### 3.1. Energy Leakage and Wavelet Aliasing

The analyzed signal result using SWEE is undoubtedly connected to the consistence between the wavelet decomposition result and the corresponding component in the signal, so it is a key point for SWEE accuracy that the proper orthogonal wavelet basis be selected to obtain accurate coefficients or reconstructed signals. However, for most mother wavelets, wavelet aliasing still exists more or less at the neighbor scales, which means that the information and energy of signal are incomplete at the decomposition scales, so we can say that wavelet aliasing is the main reason for energy leakage. To better understand energy leakage, the mechanism of wavelet aliasing is researched as follows: wavelet aliasing is when frequency bands, corresponding with wavelet scales, overlap each other after wavelet decomposition. Here, the definition of continuous wavelet transform is given as below. For $\forall x(t)\in {L}^{2}(R)$, the continuous wavelet transform function is defined as:

From Equation (5), the derived frequency-domain expression of the correlation function is obtained as:

As the Db4 wavelet is taken as $\psi (t)$, according to Equation (6), the wavelet function curve, in the frequency domain, is drawn as in Figure 2. From Figure 2 and Equation (7), it is known that ${\omega}_{a}$ changes with the scale a and wavelet coefficient ${W}_{Tx}(a,b)$ reaches its maximum value when the frequency of the signal is equal to ${\omega}_{a}$. However, the curve of ${\widehat{\psi}}_{a}(\omega )$ is not the rectangle, and even if Mallat algorithm is used in discrete wavelet decomposition, this means that the frequency overlap of ${\widehat{\psi}}_{a}(\omega )$ still exists at the neighbor scales, so the energy of a signal in the frequency-domain window, corresponding to one scale, would leak into the neighbor scales. In conclusion, wavelet aliasing is the main reason of energy leakage, which is almost inevitable for most mother wavelets.

**Figure 1.**DB4 wavelet function curves with different scales in frequency domain.

#### 3.2. The Negative Effect of Wavelet Aliasing on SWEE

Based on the above analysis, wavelet aliasing can have negative effects on the accuracy of SWEE, therefore we further research the relationship between SWEE and wavelet aliasing.

First, the time sequence of signal x(t) is decomposed into N scales on a orthogonal wavelet, suppose that no wavelet aliasing takes place at neighbor scales, the mathematical expressions of expectation SWEE is derived as:

Similarly, suppose that wavelet aliasing takes place at neighbor scales k and k + 1, and ε represents the energy loss from k to k + 1, mathematical expressions of SWEE are derived as:

Obviously, H(P) is the Shannon information entropy of a two-channel information source with the prior probability $(\frac{{p}_{k}}{C},\frac{{p}_{k+1}}{C})$. According to the concavity of Shannon information entropy function, H(P) meets the following conditions:

According to Equations (11) and (12), it can be deduced as:

According to the above analysis, the curve of $f({p}_{k})$ is drawn as shown in Figure 2.

Considering that wavelet aliasing and energy leakage exist at the neighbor scales, on the basic premise of $0<{p}_{\epsilon}<{p}_{k}$, $\theta (\epsilon )$ is discussed as following:

- (1)
As $0<{p}_{k}\le {p}_{k+1}$,

The curve of $f({p}_{k})$ corresponds to $\stackrel{\ufe35}{AM}$ in Figure 2. With increasing value of ${p}_{\epsilon}$, it is found that $\theta (\epsilon )<0$, $f({p}_{k})$ is greater than $f({p}_{k}-{p}_{\epsilon})$, which means that ${W}_{EE}^{\prime}$ is less than ${W}_{EE}$.

- (2)
As $0<{p}_{k+1}<{p}_{k}$,

- (a)
if $0<{p}_{\epsilon}<{p}_{k}-{p}_{k+1}$, with increasing value of ${p}_{\epsilon}$, it is found that $\theta (\epsilon )>0$, $f({p}_{k}-{p}_{\epsilon})$ is greater than $f({p}_{k})$, which means that ${W}_{EE}^{\prime}$ is greater than ${W}_{EE}$.

- (b)
if ${p}_{k}-{p}_{k+1}<{p}_{\epsilon}$, with increasing value of ${p}_{\epsilon}$, it is found that $\theta (\epsilon )<0$, $f({p}_{k}-{p}_{\epsilon})$ is greater than $f({p}_{k})$, which means that ${W}_{EE}^{\prime}$ is less than ${W}_{EE}$.

From the above, it is only when p_{ε} = 0 or p_{k} − p_{k+1} that
$\theta (\epsilon )=0$, which means
${W}_{EE}^{\prime}$ is equal to
${W}_{EE}$ without wavelet aliasing at the neighbor scales. However, according to the analysis in Section 3.1, wave aliasing inevitably exists to varying degrees, so energy leakage caused by wavelet aliasing has have detrimental effects on the accuracy of SWEE. Without considering the signal characteristics, using the SWEE algorithm directly, the accuracy of feature extraction is sometimes unbelievable. To prove the actual existence of a negative effect of wavelet aliasing on SWEE, we propose a numerical example as follows. For above all, a signal is constructed as:

_{1}= 200 Hz, f

_{2}= 390 Hz, f

_{3}= 450 Hz, f

_{4}= 625 Hz, f

_{5}= 1,260 Hz. In addition, the sampling frequency f

_{s}= 5,000 Hz, and W

_{n}represents white noise. Then, four-level wavelet decomposition on the basis of the Db4 wavelet is used for the signal x(t). When w = 30 and $\delta =1$, according to Equation (1), SWEE of x(t) is obtained and the curve is drawn as in Figure 3.

**Figure 3.**SWEE under wavelet aliasing.

According to the definition of SWEE, the value of SWEE has to increase with the complexity of the signal in the time-frequency domain. For this reason, with the frequency component increasing, the value of SWEE, from 0 s to 0.4 s, should be minimum from 0 s to 1 s, and the value of 0.4–0.8 s should be less than that of 0.8–1 s. However, from Figure 3, SWEE never represents the complexity change of the signal accurately, and the ripple disturbance is also serious, which does not correspond with the fact, so it is true that wavelet aliasing has a negative effect on the accuracy of SWEE, and this also verifies the correctness of the above theoretical deduction.

## 4. Tsallis Wavelet Energy Entropy—The Development of SWEE

#### 4.1. Tsallis Entropy

Nonextensive statistical mechanics, pioneered by Tsallis, offer a consistent theoretical framework for the studies of complex systems with long-range interactions, long-time memories, multifractal and self-similar structures, or anomalous diffusion phenomena. As a nonextensive entropy, Tsallis entropy is the extension and development of extensive entropy (B-G entropy) in statistical physics [16]. It can explain some abnormal experimental phenomena, such as the complexity of non-additive systems, which cannot be explained by the theory of extensive entropy. Tsallis entropy in a discrete expression is defined as follows:

Different from extensive entropy, q is introduced as a parameter, so it is called nonextensivity index, and it represents the extent of nonextensivity of the system in Tsallis entropy. The non-additivity of Tsallis entropy of a complex system, composed of A and B subsystems, is defined as follows:

Note that q < 1 and q > 1 correspond respectively to the ultra-extensity and sub-extensity of the system. For researching the mathematical concave and convex nature of Tsallis entropy, let P and Q be probability variables, and satisfy:

Suppose that 0 ≤ α ≤ 1, when S_{q} satisfies:

From Equation (15), Tsallis entropy function represents the concave nature. At the same time, Tsallis entropy function has a definite concavity for q values (S_{q} is concave for q > 0 and convex for q < 0). Furthermore, we consider Tsallis entropy statistical characteristics as two independent subsystems. According to Equation (13), we have a curve trace like Figure 4 which indicates the change law of Tsallis entropy with the probability distribution under different q values.

From Figure 4, the variation of q values has considerable effects on the statistical characteristics of Tsallis entropy. When q > 0 and q ≠ 1, the function curves take on the appearance of concavity, and it exists corresponding maximum values for all. When q → 1, Tsallis entropy tends to B-G entropy and it can describe the complexity of additive systems when considering probabilistically independent events [17,18]. Moreover, when q < 0, the function curves are contrary to the former for q > 0, and there exists the corresponding minimum values. Based on the above analysis, Tsallis entropy with appropriate q value is not only more flexible in information measurement but more widespread in the statistical range of entropy.

**Figure 4.**Relations between entropy with q and probability distribution under two-level system.

#### 4.2. The Definition of TWEE

A sliding-window (w in width) is defined upon wavelet coefficients D = {d_{j}(k), k = 1, …, L} with wavelet decomposition at j scales like Figure 5, here the sliding factor is
$\delta \in N$. Then a sliding-window is defined as:

**Figure 5.**Wavelet coefficients matrix in the sliding-window.

At the moment of $w/2+n\delta $:

Suppose that ${p}_{n}(j)={E}_{n}(j)/E(n)$ and:

As the sliding-window slides, the trendline that TWEE changes with time can be obtained and plotted. In the above expression, the scale space corresponds to the frequency space, and TWEE indicates the energy distribution of signals. A wavelet functions does not have pulse selectivity in either the frequency or time domain, whereas it has a support region, so the partition of the original signal at scale space also indicates the signals’ energy distribution at time-frequency domains. The more complex the analyzed signal is, more modes the energy congregates to and the larger the TWEE is. Therefore the TWEE is an index to evaluate the signal complexity or uncertainty.

#### 4.3. The Inhibition on Wavelet Aliasing Using TWEE

According to Section 3, wavelet aliasing causes the energy loss at the neighbor scales, and the information set of wavelet coefficients (or reconstructed signals) at different frequency bands, is nonadditive, so SWEE is unsuited for extracting the features of complex signals, especially for transient signals. On the contrary, as the theoretical foundation of TWEE, Tsallis entropy is good at characterizing the uncertainty of nonextensive system because of its nonadditivity. The extent of energy loss has a close correlation with the degree of nonadditivity of information set, for this, if the appropriate value of q is chosen, using TWEE should be able to extract the features accurately without the wavelet aliasing effect.

To prove the above point, the example proposed in Section 3.2 is taken again as an analysis case. For the time sequence of x(t), as q = 0.1, the feature of frequency complexity is extracted using TWEE as Figure 6.

Comparing Figure 6 with Figure 3, it is found that the value of TWEE from 0 s to 0.4 s, is less than that from 0.4 s to 0.8 s, and the value of 0.4–0.8 s is less than 0.8–1 s, which is coherent with the frequency variance of x(t). The simulation results show TWEE can inhibit the negative effect of wavelet aliasing and extract the feature of transient signal accurately at the appropriate value of q.

Apart from the statistical characteristics of entropy, we drive the root reason why TWEE are not affected by wavelet aliasing and could characterize the accurate features of transient signals. Taking a three-level system as the analysis object, according to Equation (13), the statistical results are calculated, and the corresponding relation between Tsallis entropy with q and probability distribution are shown in Figure 7.

**Figure 6.**TWEE under wavelet aliasing.

**Figure 7.**Relation between Tsallis entropy with (

**a**) q = 0.1, (

**b**) q = 0.5, (

**c**) q = 0.9, (

**d**) q = 2 and probability distribution under three-level system.

From Figure 7, as q > 0 and q ≠ 1, for the small probability event, Tsallis entropy represents the concave nature, and extends the statistical range, but reduces sensitivity with the increasing value of q. On the contrary, with the decreasing value of q, the statistical range is narrowed down and sensitivity is enhanced. Therefore, when the energy loss at the neighbor scales is too serious to be ignored, it is suggested to reduce the value of q for TWEE, which could enhance the inhibition capability on wavelet aliasing. Of course, with the value of q decreasing, TWEE has less resolution to distinguish the changes of time-frequency complexity for the big probability event in transient signals.

## 5. TWEE’s Application in Signal Analysis in Power System

#### 5.1. Original Transient Power Signal

In the field of power quality, the research on transient signals such as transient overvoltages, voltage dips, voltage interruptions, voltage flicker and voltage pulses, have attracted great attention. As one of transient signals in power systems, the voltage fluctuations caused by indirect lightning strikes often happen in the overhead distribution lines in South China, where most indirect lightning strikes are low-energy and unlikely to cause faults directly. However, the safety of power transmission and the quality of power supply could be suffered from negative effects because of high incidence rate of lightning strikes. In addition to electromagnetic interference in PT secondary circuits, the short duration and low energy level of non-faulty indirect lightning strikes are another major reason which could result in failure in extracting the features of this kind of lightning strike, so the phase voltages, caused by indirect lightning strikes, are taken as analysis objects, and TWEE and SWEE are separately used to extract the transient features of lightning strikes. A set of phase voltage signals of a 110 kV transmission line, with lightning strike interference, are obtained from the Guangzhou EPRI (Figure 8). The details are as follows:

Fault line: the 110 kV Jiaji transmission line in Guangdong.

Fault type: lightning strike interference.

Recording site: the 110 kV side of main transformer of 220 kV Jiahe substation.

Occurrence time: 15:57:7:480, 23 March 2010.

**Figure 8.**Waveform of three-phase voltages with lightning strike.

On the afternoon of 3 March 2010, lightning strikes sequentially occurred on the 110 kV Jiaji transmission line of Guangdong Province. The relay protection didn’t operate because of the low lightning energy. The recorder located in the 220 kV Jiahe substation recorded the phase voltage waveforms when the lightning struck. Here is a set of collected lightning strike voltage waveforms is shown in Figure 8. In Figure 8, lightning strike interference occurs to the line’s three phases in a relative time of 38 ms. Because the traveling waves reciprocate in the transmission line, over-voltage and high-frequency components exist in the three-phase voltage signals. Meanwhile, with the energy loss in the transmission line, the high-frequency component of the voltage decreases and it almost disappears after about 50 ms. Finally, the three-phase voltage returns to normality.

#### 5.2. The Results of Feature Extraction Using Two WEE Algorithm

An experimental platform for signal processing is built to verify the practicality and validity of the proposed algorithm. Based on this platform, the historical data of three-phase voltages, which was recorded when a non-faulty lightning strike occurs, is converted to general data according to IEEE Std C37.111-1999. The transient lightning voltage signal is reconstructed by utilizing a programmable function generator, and then the operation module with DSP collects the real-time voltage signals from the programmable function generator through A/D channel. Furthermore, the TWEE algorithm and SWEE algorithm fixed in DSP are utilized separately to extract the transient features of the lightning signal. Finally, the results are output through the D/A channel and shown in the scope. The experiment flow is as shown as Figure 9.

**Figure 9.**Experiment flow chart.

Set sample frequency of A/D ${f}_{s}=6400\text{Hz}$, width of sliding data window $w=40$, noextensivity index $q=0.7$. The results of feature extraction based on TWEE and SWEE are shown in Figure 10. In Figure 10, the feature curve, obtained by TWEE, not only has less fluctuation, but also indicates the occurrence time of the lightning strike and the exponential decay trend of the lightning energy accurately. It was introduced in [19] that lightning energy decays exponentially Meanwhile, it shows that the peak of TWEE of C-phase is slightly larger than the A-phase and B-phase, which shows that the lightning interference of the C-phase is more serious than the other two phases, which accords with the facts. Although the lightning-strike feature curve, obtained by SWEE, shows the occurrence time of the lightning strike, there are false features in the B-phase and C-phase. The result proves that SWEE has poor anti-interference performance, and its feature curve has a significant fluctuation. Hence, it couldn’t indicate the decay trend of the lightning energy accurately.

**Figure 10.**(

**a**) Lightning-strike feature curves of A phase based on two WEE algorithm; (

**b**) Lightning-strike feature curves of B phase based on two WEE algorithm; (

**c**) Lightning-strike feature curves of C phase based on two WEE algorithm.

## 6. Conclusions

Entropy statistics theories and wavelet decomposition algorithmw are the academic foundation of wavelet entropy, therefore the choice of entropy has a direct impact on the signal analysis results. Because wavelet aliasing exists at neighbor scales, the information set of wavelet coefficients (or reconstructed signals) is endowed with additivity property, and the extent of nonadditivity is much higher for information sets corresponding to transient signals with varying frequency. At this point, using the SWEE algorithm would result in a false analysis result against the fact. In this paper, considering the advantage of Tsallis entropy in measuring the uncertainty of generalized system, TWEE is proposed to extract the features of transient signals with changing the value of q. We apply the TWEE algorithm to the analysis of transient power signals and find that it can not only indicate the occurrence time of lightning strikes, but also characterize the decay trend of lightning energy without the effect of wavelet aliasing, which means the TWEE algorithm has stronger ability to analyze transient signals than SWEE. Of course, do more in-depth research on the wavelet entropy’s theoretical basis and application range is needed, especially in the choice of nonextensivity index for the different signals.

## Acknowledgments

The financial support received from the National Natural Science Foundation of China (Grant No. 51377016), 2012 Science and Technology Project of State Grid Corporation of China (Grant No.2012515) and the Scientific Research Fundation for the Doctoral Program of Northeast Dianli University (Grant No.BSJXM201335) is gratefully acknowledged.

## Author Contributions

Jikai Chen mainly contributed to the Section 2, Section 3 and Section 4. Guoqing Li contributed to the Section 5.

## Conflicts of Interest

The authors declare no conflict of interest.

## References

- Quiroga, R.Q.; Rosso, O.A.; Basar, E.; Schürmann, M. Wavelet entropy in event-related potentials: A new method shows ordering of EEG oscillations. Biol. Cybern.
**2001**, 84, 291–299, doi:10.1007/s004220000212. - AlNashash, H.A.; Thakor, N.V. Monitoring of global cerebral ischemia using wavelet entropy rate of change. IEEE Trans. Biomed. Eng.
**2005**, 52, 2119–2122, doi:10.1109/TBME.2005.857634. - Cek, M.E.; Ozgoren, M.; Savaci, F.A. Continuous time wavelet entropy of auditory evoked potentials. Comput. Biol. Med.
**2010**, 40, 90–96, doi:10.1016/j.compbiomed.2009.11.005. - Brunsell, N.A. A multiscale information theory approach to assess spatial-temporal variability of daily precipitation. J. Hydrol.
**2010**, 385, 165–172, doi:10.1016/j.jhydrol.2010.02.016. - Chou, C.M. Wavelet-based multi-scale entropy analysis of complex rainfall time series. Entropy
**2011**, 13, 241–253, doi:10.3390/e13010241. - Sang, Y.-F.; Wang, Z.-G.; Li, Z.-L. Discrete wavelet entropy aided detection of abrupt change: A case study in the Haihe river basin, China. Entropy
**2012**, 14, 1274–1284, doi:10.3390/e14071274. - Zarjam, P.; Epps, J.; Chen, F.; Lovell, N.H. Estimating cognitive workload using wavelet entropy-based features during an arithmetic task. Comput. Biol. Med.
**2013**, 43, 2186–2195, doi:10.1016/j.compbiomed.2013.08.021. - Yu, B.; Liu, D.; Zhang, T. Fault diagnosis for micro-Gas turbine engine sensors via wavelet entropy. Sensors
**2011**, 11, 9928–9941, doi:10.3390/s111009928. - He, Z.Y.; Cai, Y.M.; Qian, Q.Q. A study of wavelet entropy theory and its application in electric power system fault detection. Proc. Chin. Soc. Electr. Eng.
**2005**, 25, 38–43. - He, Z.Y.; Chen, X.Q.; Luo, G.M. Wavelet entropy measure definition and its application for transmission line fault detection and identification. In Proceedings of the International Conference on Power System Technology, Chongqing, China, 22–26 October 2006.
- Hnizdo, V.; Gilson, M.K. Thermodynamic and differential entropy under a change of variables. Entropy
**2010**, 12, 578–590, doi:10.3390/e12030578. - Baez, J.C.; Fritz, T.; Leinster, T. A characterization of entropy in terms of information loss. Entropy
**2011**, 13, 1945–1957, doi:10.3390/e13111945. - Wang, A. Nonextensive Statistics and Incomplete Information. Eur. Phys. J. B
**2002**, 26, 357–368. - Tsallis, C. Possible Generalization of Boltzmann-Gibbs Statistics. J. Stat. Phys.
**1988**, 52, 479–487, doi:10.1007/BF01016429. - Huang, Z.F.; Ou, C.J.; Chen, J.C. Incomplete nonextensive statistics and the zeroth law of thermodynamics. Chin. Phys. B
**2013**, 22, 040501:1–040501:4. - Tsallis, C.; Mendes, R.S.; Plastino, A.R. The role of constraints within generalized nonextensive statistics. Phys. A
**1998**, 261, 534–554, doi:10.1016/S0378-4371(98)00437-3. - Martins, A.; Aguiar, P.; Figueiredo, M. Tsallis Kernels on Measures. In Proceedings of the IEEE Information Theory Workshop, Porto, Portugal, 5–9 May 2008.
- Sneddon, R. The Tsallis entropy of natural information. Phys. A
**2007**, 386, 101–118, doi:10.1016/j.physa.2007.05.065. - Heidler, F.; Cvetic, J.M.; Stanic, B.V. Calculation of lightning current parameters. IEEE Trans. Power Deliv.
**1999**, 14, 399–404, doi:10.1109/61.754080.

© 2014 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).