Next Article in Journal
Validation and Improvement of COCTS/HY-1C Sea Surface Temperature Products
Next Article in Special Issue
2D-DOA Estimation in Switching UCA Using Deep Learning-Based Covariance Matrix Completion
Previous Article in Journal
Controllable Music Playlist Generation Based on Knowledge Graph and Reinforcement Learning
Previous Article in Special Issue
Gridless Underdetermined Direction of Arrival Estimation in Sparse Circular Array Using Inverse Beamspace Transformation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Entropy-Based Concentration and Instantaneous Frequency of TFDs from Cohen’s, Affine, and Reassigned Classes

by
David Bačnar
1,
Nicoletta Saulig
2,
Irena Petrijevčanin Vuksanović
3,4 and
Jonatan Lerga
1,*
1
Department of Computer Engineering, Faculty of Engineering, University of Rijeka, Vukovarska 58, 51000 Rijeka, Croatia
2
Department of Engineering, Juraj Dobrila University of Pula, Zagrebačka 30, 52100 Pula, Croatia
3
Ministry of the Interior, 10000 Zagreb, Croatia
4
Center for Artificial Intelligence and Cybersecurity, University of Rijeka, Radmile Matejčić 2, 51000 Rijeka, Croatia
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(10), 3727; https://doi.org/10.3390/s22103727
Submission received: 6 April 2022 / Revised: 5 May 2022 / Accepted: 10 May 2022 / Published: 13 May 2022
(This article belongs to the Special Issue Signal Processing and Machine Learning for Smart Sensing Applications)

Abstract

:
This paper explores three groups of time–frequency distributions: the Cohen’s, affine, and reassigned classes of time–frequency representations (TFRs). This study provides detailed insight into the theory behind the selected TFRs belonging to these classes. Extensive numerical simulations were performed with examples that illustrate the behavior of the analyzed TFR classes in the joint time–frequency domain. The methods were applied both on synthetic and real-life non-stationary signals. The obtained results were assessed with respect to time–frequency concentration (measured by the Rényi entropy), instantaneous frequency (IF) estimation accuracy, cross-term presence in the TFRs, and the computational cost of the TFRs. This study gives valuable insight into the advantages and limitations of the analyzed TFRs and assists in selecting the proper distribution when analyzing given non-stationary signals in the time–frequency domain.

1. Introduction

1.1. Time–Frequency Analysis

The most common way to represent a signal is as a time-dependent function, which can be viewed as a representation with perfect time resolution but no direct information about the signal’s frequency content. The other standard signal representation is a frequency spectrum, which is obtained by the Fourier transform of the signal and shows the magnitudes of the individual frequencies that make up the original signal but does not provide any information about the time instants when they occurred in the signal [1,2]. The Fourier transform definition states that the entire signal should be known up to infinity; furthermore, for their exhaustive representation, signals should be stationary, meaning that the frequency content should not change over time.
The definitions of the Fourier transform and its inverse are, respectively:
X ( f ) = F { x ( t ) } = x ( t ) e j 2 π f t d t ,
x ( t ) = F 1 { X ( f ) } = X ( f ) e j 2 π f t d f ,
where x ( t ) is the original time signal, X ( f ) is the Fourier transform of the original time-domain signal, t is the time variable, f is the frequency variable, and F and F 1 are the Fourier transform and its inverse, respectively.
On the other hand, time–frequency (TF) analysis is an approach to signal analysis that is used to simultaneously track the signal’s time and frequency changes. The problem caused by finite signal duration and non-stationarity can be overcome by considering the original signal inside finite time windows, i.e., calculating the basic time–frequency representation known as the short-time Fourier transform (STFT). The time window acts as a filter that gives, as a result, the Fourier transform for the signal centered around the output time, which is the Fourier transform for that specific windowed data segment. The signal in each time window is assumed to be stationary and of known duration. Thus, the STFT may be defined as the Fourier transform of the original signal multiplied by a sliding window:
F x ( t , f ) = x ( τ ) h ( τ t ) e j 2 π f τ d τ ,
where F x ( t , f ) is the STFT and h ( t ) is the moving time window.
Insight into both time and frequency domains is provided by TF signal analysis. The original signal, acquired in the time domain, is now represented in order to show time instants of frequency changes, allowing appropriate monitoring and tracking of non-stationary signals. An example of a signal in the time, frequency, and TF domains is shown in Figure 1.
There are several groups of time–frequency representations or distributions (TFRs/TFDs) to choose from, depending on the application. The three most important are the Cohen’s, affine, and reassigned classes. Cohen’s class is covariant to time and frequency shifts of the signal [3], while the affine class is covariant to time shifts or translations and the scaling or, rather, the dilatations of the signal, also called timescale representations or distributions. The most common example of the affine representation is the continuous wavelet transform, which linearly expands the signal to a set of analysis functions called wavelets that can be time-shifted or scaled appropriately to fit the original signal [4]. In some cases, a TFR can belong to both the Cohen’s and affine classes, depending on the properties it satisfies. Both contain bilinear or quadratic representations, and this non-linearity results in the presence of interference terms, also called cross-terms, in the TF plane [5]. These phenomena can complicate the interpretation of the representation by overlapping the signal components or producing negative values. Nevertheless, bilinear representations generally exhibit improved TF concentration and resolution, despite the interference terms being introduced [6,7].

1.1.1. Analytic Form of a Signal

The first step in reducing the number of interferences in the TFR is using the analytic form of the signal. The analytic form gives a complex signal containing only the positive frequencies present in the original real-valued signal (the real part of the analytical signal corresponds to the original signal, while the imaginary part corresponds to the Hilbert transform of the original signal [8]). The spectrum of the analytic signal for f < 0 is zero, while for f > 0 , the spectrum coincides with that of the real-valued signal [9]. By using an analytic function instead of a real-valued one, there is no loss of information caused by the removal of negative frequencies, since the Fourier transform satisfies Hermitian symmetry X * ( f ) = X ( f ) . Hence, most TFRs should be used with analytic forms of signals, constructed as in [10]:
x a ( t ) = x ( t ) + j H { x ( t ) } ,
X a ( f ) = 2 U ( f ) X ( f ) = X ( f ) + s g n ( f ) X ( f ) ,
where x a ( t ) is the analytic form of the signal, x ( t ) is the real-valued signal, H is the Hilbert transform, X a ( f ) is the spectrum of the analytic signal, X ( f ) is the original spectrum of the real-valued signal, and U ( f ) is the unit step function.

1.1.2. Instantaneous Frequency

The purpose of the instantaneous frequency (IF) estimation is to capture and trace the frequency content resulting from non-stationarities of the real-valued signal or its analytic associate. It is defined as the derivative of the signal phase showing the time change in the dominant frequency [10,11,12]. Thus, for mono-component signals, the IF describes the behavior of the signal frequency over time [9,13].
For an analytic signal defined with time-varying amplitude a ( t ) and phase φ ( t ) defined as x a ( t ) = a ( t ) e j φ ( t ) , the IF can then be calculated as follows:
f x ( t ) = 1 2 π d φ ( t ) d t ,
where φ ( t ) is the phase of the signal and f x ( t ) is the IF of the signal.
This IF estimation approach is not suitable for multi-component signals, especially when spectral components overlap in the TF plane.

1.1.3. Group Delay

The group delay is dual to the IF and is defined as the derivative of the phase spectrum or frequency behavior as a function of time. It is also usually described as the time delay or average arrival time for a given frequency [9,13].
The Fourier transform for an analytic signal is defined with spectral amplitude B ( f ) and spectral phase ψ ( f ) as X ( f ) = B ( f ) e j ψ ( f ) , and the group delay can be defined as:
t x ( f ) = 1 2 π d ψ ( f ) d f ,
where ψ ( f ) is the spectral phase of the Fourier transform of the signal and t x ( f ) is the group delay (which is a function of frequency).
The IF and group delay can be approximated as inverse functions of each other for signals with large time-bandwidth products ( T × B ) [1,9].
This paper aims to provide comprehensive insight, including a comparison between TFRs from three classes: the Cohen’s, affine, and reassigned classes. We provide the theoretical basis behind the analyzed TFRs and showcase their behavior in noisy environments for different noise levels. We analyze TFRs’ performance for synthetic signals, both mono-component and multi-component, as well as on real-life examples. To the best of our knowledge, based on an extensive literature review, there exist no similar studies investigating the given TFRs from these three classes based on the accuracy in IF estimation, distribution concentration measured by entropy measures, and computational cost. The study provides support in selecting the appropriate TFR for a signal of interest being analyzed.
The rest of the paper is structured as follows. Section 2 provides insight into the theory behind the TFRs, explains the different types of TFRs, and lists some of the previous work in the field. Section 3 presents the application of TFRs to synthetic and real-life signals and compares TFR performance, providing details on the computational cost of the tested TFRs. Next, the obtained results are discussed and elaborated on in Section 4. Finally, the paper closes with concluding remarks in Section 5.

2. Affine TFRs with Respect to Cohen’s and Reassigned TFRs

The affine class is a group of bilinear representations that are covariant to time shifts and dilations of the signal [14] and it is the counterpart to Cohen’s class or the Weyl-Heisenberg group [14,15,16]. The time shift is the same covariance as in Cohen’s class, and the scale or dilation of the signal is defined as the reciprocal of the frequency [17,18]. Affine representations are suited for applications such as radar, sonar, self-similar signal analysis, multi-resolution signal analysis, and adaptive signal analysis [10,19,20]. Bilinearity results from the way the observed signal is compared to a delayed copy of itself by multiplication (autocorrelation) [9].
The general bilinear energy representation can be written as:
Ω x ( t , f ) = K ( t 1 , t 2 ; t , f ) x ( t 1 ) x * ( t 2 ) d t 1 d t 2 ,
where Ω x is the TFR, ν is the frequency shift, also known as Doppler, τ is the time delay or time shift, the product x ( t 1 ) x * ( t 2 ) is the useful part of the signal, and K ( t 1 , t 2 ; t , f ) is the arbitrarily parametrized 4-dimensional kernel characterizing the TFR properties. The kernel K ( t 1 , t 2 ; t , f ) can also assume a different form, where t 1 and t 2 are replaced by f 1 and f 2 , respectively (the different form is obtained by applying a double Fourier transform to the original kernel).
According to the energy conservation constraint, the parametrization kernel K must satisfy the following condition:
K ( t 1 , t 2 ; t , f ) d t d f = δ ( t 1 t 2 ) ,
where δ is the Dirac function.
The general affine representation form, which can be adapted to several representations by changing the index k, is defined as follows:
Ω x ( k ) ( t , f ) = f μ k ( u ) X ( λ k ( u ) f ) X * ( λ k ( u ) f ) e j 2 π f t ζ k ( u ) d u ,
λ k ( u ) = k e u 1 e k u 1 1 k 1 , k 0 , 1 ,
λ k = 0 ( u ) = u 1 e u , λ k = 1 ( u ) = e x p 1 + u e u e u 1 ,
ζ k ( u ) = λ k ( u ) λ k ( u ) ,
where μ k ( u ) is a continuous positive weighting function, the choice of which affects the representation properties, and u is the dual variable of the product t × f .
Changing the value of the index k in λ k ( u ) and ζ k ( u ) , we obtain various representations [19,20], including the following:
  • k = 2 —Affine Wigner representation (extended covariance along straight line paths);
  • k = 1 / 2 —D-Flandrin representation (extended covariance along square-root-hyperbolic paths);
  • k = 0 —Bertrand representation (extended covariance along hyperbolic paths);
  • k = 1 —Unterberger representation;
  • k = ± 5 —Approximate affine Wigner representations (unsmoothed);
  • k = ± —Margenau–Hill representation.
Choosing different μ k ( u ) functions for one specific value of k makes it possible to obtain different variations of the selected distribution, one of the examples being the active or passive Unterberger representation.

2.1. Kernels

Kernels result from arbitrary parametrizations that characterize the properties of the representation. By restricting the structure of the parametrized kernel and specifying certain analytic properties that the representations must satisfy, subclasses can be identified [17].
Kernels can be defined as:
ϕ t f ( t , f ) = δ ( t ) δ ( f f 0 ) ,
ϕ d D ( τ , ν ) = ϕ t d ( t , τ ) e j 2 π ν t d t = ϕ t f ( t , f ) e j 2 π ( f τ ν t ) d f d t ,
ϕ f D ( f , ν ) = ϕ t f ( t , f ) e j 2 π ν t d t = ϕ t d ( t , τ ) e j 2 π ( t ν + τ f ) d t d τ ,
ϕ t f ( t , f ) = ϕ t d ( t , τ ) e j 2 π f τ d τ = ϕ f D ( f , ν ) e j 2 π ν t d ν ,
where ϕ t d ( t , τ ) is the time–delay kernel, ϕ d D ( τ , ν ) is the delay–Doppler kernel, ϕ f D ( f , ν ) is the frequency–Doppler kernel, ϕ t f ( t , f ) is the time–frequency kernel, δ is the Dirac delta function, and f 0 is the arbitrary non-zero frequency (usually set to 1 Hz).
The different kernels, representing the various domains, and the Fourier transform between them can be represented using a commutative diagram (10) that explains their mutual relationships [9,21,22]:
Sensors 22 03727 i001
The time–delay kernel is mainly used to compute the representation from the time-domain signal, while the delay–Doppler kernel allows a simpler analysis of the characteristics of the representation. The time–frequency kernel can be interpreted as a two-dimensional low-pass filter that smooths the signal in both the time and frequency domains [8,23].
Depending on the number of variables, the kernel can be one-dimensional, two-dimensional, or higher-dimensional. The kernel can be independent on a specific variable. Separable kernels consist of the product of two windowing functions, each one for its respective domain. These windowing functions can be set independently of each other, offering the possibility of independent resolution tuning for each domain [9,10,21,24].
An example of a separable kernel consisting of two windowing functions may be defined as follows:
ϕ ( ν , τ ) = G 1 ( ν ) g 2 ( τ ) .
This form of the kernel can be transferred to the time–frequency domain by applying the inverse and forward Fourier transforms to the two windows, respectively Φ ( t , f ) = F 1 { G 1 ( ν ) } F { g 2 ( τ ) } = g 1 ( t ) G 2 ( f ) .
The separable kernel ensures that the 2D convolution is replaced by two 1D convolutions, and so the representation becomes:
Ω x Φ ( t , f ) = g 1 ( t ) Ω x ( t , f ) G 2 ( f ) .
If G 1 ( ν ) = 1 , then ϕ ( ν , τ ) = g 2 ( τ ) , and the kernel becomes Doppler independent with smoothing only in the frequency direction. In that case, the resulting representation is:
Ω x Φ ( t , f ) = Ω x ( t , f ) G 2 ( f ) .
If g 2 ( τ ) = 1 , then ϕ ( ν , τ ) = G 1 ( ν ) , and the kernel becomes delay independent with smoothing only in the time direction, and the representation becomes:
Ω x Φ ( t , f ) = g 1 ( t ) Ω x ( t , f ) .

2.2. Energy

The signal energy is defined as the squared modulus of the signal in the time or frequency domain, according to Parseval’s theorem [8,25]:
E x = | x ( t ) | 2 d t = | X ( f ) | 2 d f ,
where the energy representations are the instantaneous power | x ( t ) | 2 for the time domain and the spectral energy density | X ( f ) | 2 for the frequency domain.
The bilinear TFRs are signal energy distributions defined as the combination of the instantaneous power and the spectral energy density [17]:
E x = Ω x ( t , f ) d t d f .
The representation Ω x satisfies the energy conservation property for the affine class only if the kernel satisfies the following condition:
ϕ f D ( 0 , ν ) | ν | d ν = 1 .
By integrating the representation over the timescale plane, one obtains the signal energy [17].

2.3. Covariance

The covariance of the affine class refers to time shifts and scale changes in the signal. This property is based on the parameters of the affine group denoted by A ( α , τ ) , where α stands for the analyzing scale parameter and τ for the time shift. For the one-dimensional affine group, the group operation is ( A , B ) ( α , τ ) = ( A α , B + A τ ) , corresponding to a time axis clock change with t α t + τ , where the scale parameter α is defined as α = f 0 f , and the arbitrary non-zero frequency is f 0 = 1 Hz [17].
Thus, the affine group of transforms for a time signal and its frequency counterpart are:
x ( t ) x α , τ ( t ) = 1 α x t τ α ,
X ( f ) X α , τ ( f ) = α e j 2 π f τ X ( α f ) .
The covariance requirement for the representation can then be expressed as:
Ω x α , τ ( t , f ) = Ω x t τ α , α f .
An example of an affine covariant representation is the convolution of the Wigner representation, which is a member of the Cohen class, with a signal-independent kernel as:
Ω x ( t , f ) = Π t τ α , α f W x ( t , f ) d t d f ,
where W x ( t , f ) is the Wigner representation and Π ( t , f ) is the signal-independent kernel, which can also be denoted as the time–frequency kernel ϕ t f ( t , f ) . The representation is called the pseudo-affine Wigner representation.

2.4. Marginals

The marginal constraint states that the integral of the representation over one variable gives the energy corresponding to the other variable [25,26].
The marginal properties are defined for both the time and frequency domains as follows:
Ω x ( t , f ) d t = | X ( f ) | 2 ,
Ω x ( t , f ) d f = | x ( t ) | 2 .
Satisfying the marginal conditions implies the conservation of the representation energy [27,28]. Structural constraints on the kernel assure that the marginals are satisfied [17]. These constraints refer to the marginal distribution in frequency (energy spectrum density) and to the marginal distribution in time (instantaneous power), as follows:
ϕ f D ( 0 , ν ) = δ ( ν 1 ) ,
ϕ d D f τ , ν f d f = δ ( τ ) .

2.5. Interference Terms

Due to the bilinear nature of the affine class, the interference terms (also known as cross-terms) are generated at midway points between the original components (also called auto-terms) or between other interference terms [29]. Thus, due to the bilinearity, the sum of the representations of two signals is not equal to the representation of their sum [2,9,30].
Interference terms oscillate with a frequency that is proportional to the distance of the auto-terms [13,21]. The direction of the oscillations is orthogonal to the line connecting the auto-terms; the number of interference terms is n ( n 1 ) / 2 , where n is the number of signal components.
For a two component signal, x ( t ) = x 1 ( t ) + x 2 ( t ) , the bilinear representation is:
Ω x ( t , f ) = Ω x 1 ( t , f ) + Ω x 2 ( t , f ) + 2 [ Ω x 1 , 2 ( t , f ) ] ,
2 [ Ω x 1 , 2 ( t , f ) ] = 2 [ Ω x 1 , x 2 ( t , f ) + Ω x 2 , x 1 ( t , f ) ] = 2 [ F ( x 1 ( t ) x 2 * ( t ) ) ] ,
where Ω x 1 ( t , f ) and Ω x 2 ( t , f ) are the auto-terms, and 2 [ Ω x 1 , 2 ( t , f ) ] is the cross-term.
Figure 2 reports a TFR with cross-terms being generated between the three auto-terms. The location of the cross-term for a two-component signal is defined at the midpoint between the auto-terms, or at the midpoints of the two coordinates defined as the mean values t i = t 1 + t 2 2 and f i = f 1 + f 2 2 , for time and frequency coordinates, respectively [31].

2.6. Reassignment Method

This approach was first defined by Kodera et al. [32,33] under the name “Modified Moving Window Method”. Although the method was defined only for spectrograms and could not be applied to discrete signals because of the partial derivatives used in the method’s definition, it was later extended by Auger and Flandrin [34,35], generalized for any bilinear time–frequency or timescale representation, and called the “reassignment method”.
The reassignment method is a local post-processing method for signal representations, with the goal of improving the signal sharpness or concentration, and it refocuses the energy distribution for better readability in the TF plane. It is applicable to the Cohen’s and affine classes as well as any bilinear representation, improving the TF localization of the signal and suppressing the interference of the cross-terms. This method is the second of a two-step procedure in which a 2D low-pass filter smoothing kernel is first applied to reduce the interfering components, which also leads to the smearing of the signal in the tf plane. The second step is squeezing, which refocuses the signal terms after the smoothing procedure [36].
Different researchers introduced this method under different names, such as the method of reassignment, remapping, tf reassignment, and the modified moving window method. Unlike other representations, this method requires the signal phase and not just the magnitude. The phase is needed because the components that represent the maximum contribution to the signal, also known as the center of gravity, have a slow phase variation through time and can thus be detected by observing the rate of variation of the phase [37].
The reassignment is performed by mapping the computed data points from their initial coordinates ( t , f ) closer to the true region of the signal support [14,38], also known as the centroid or center of gravity ( t ^ ( t , f ) , f ^ ( t , f ) ) . The center of gravity is calculated for each windowed region of the tf plane, and then the calculated energy values for that region are placed at that point, rather than at the geometric center [13,39]. By averaging the energy given by the windowed region and then assigning the value to the center of gravity for that region, the energy is focused where it actually occurred [33].
The definitions of the center of gravity coordinates for time and frequency, respectively, are:
t ^ ( t , f ) = t 1 2 π φ ( t , f ) f = 1 2 π φ ( t , f ) f ,
f ^ ( t , f ) = 1 2 π φ ( t , f ) t = f + 1 2 π φ ( t , f ) t ,
where t ^ ( t , f ) and f ^ ( t , f ) are the coordinates of the center of gravity and can also be referred to as local group delay and local IF, respectively, while φ ( t , f ) is the phase of the signal. The generalized form for Cohen’s class is then defined as:
t ^ ( t , f ) = t τ ϕ ( τ , ν ) W x ( t τ , f ν ) d τ d ν ϕ ( τ , ν ) W x ( t τ , f ν ) d τ d ν ,
f ^ ( t , f ) = f ν ϕ ( τ , ν ) W x ( t τ , f ν ) d τ d ν ϕ ( τ , ν ) W x ( t τ , f ν ) d τ d ν ,
R C ( t , f ) = T F R ( t , f ) δ [ t t ^ ( t , f ) ] δ [ f f ^ ( t , f ) ] d t d f ,
where ϕ ( τ , ν ) is the kernel, W x is the Wigner–Ville representation, T F R ( t , f ) is the representation being reassigned, δ is the Dirac delta function, R C ( t , f ) is the reassigned representation, and t and f are the coordinates of any point.
The generalized form for the affine class is then defined as:
t ^ ( t , α ) = t τ ϕ ( τ α , f 0 α ν ) W x ( t τ , ν ) d τ d ν ϕ ( τ α , f 0 α ν ) W x ( t τ , ν ) d τ d ν ,
f ^ ( t , α ) = f 0 α ^ ( t , α ) = ν ϕ ( τ α , f 0 α ν ) W x ( t τ , ν ) d τ d ν ϕ ( τ α , f 0 α ν ) W x ( t τ , ν ) d τ d ν ,
R A ( t , α ) = ( α ) 2 T S R ( t , α ) δ [ t t ^ ( t , α ) ] δ [ α α ^ ( t , α ) ] d t d α α 2 ,
where α = f 0 f , f 0 = 1 Hz , T S R ( t , α ) is any timescale representation, R A ( t , α ) is the reassigned representation, and α is the scale coordinate for any point.
However, the reassignment method encounters performance difficulties for noisy signals with a low signal-to-noise ratio (SNR), where the computed centers of gravity might not actually be part of the signal due to the noise; they could instead be assigned to random noise pattern locations [40]. In addition, representations resulting from the application of the reassignment method may not satisfy some properties that the original representations satisfied, e.g., energy preservation. The reassigned distribution is time and frequency shift covariant, although it is no longer a bilinear representation [34,41].

2.7. Affine Class Distributions

Several TFR methods can be assigned to the affine class. In the following subsections, we list some of them with their definitions and main characteristics.

2.7.1. Scalogram

The scalogram is obtained by squaring the continuous wavelet transform magnitudes or, rather, by frequency-dependent affine smoothing of the Wigner representation with the analyzing wavelet. It provides the energy distribution of the signal in the TF plane and represents the simplest affine TFR. Its main drawback is poor resolution due to the proportional-bandwidth time–frequency trade-off that is controlled by the analyzing wavelet. Thus, it is impossible to control the time and frequency resolutions independently. However, it has fewer of the interference terms that disturb other affine class TFRs because they are restricted only to those regions where the signal terms overlap [13,20].
The scalogram can be defined as a 2D kernel windowing of the Wigner representation [16] or as the absolute square of the continuous wavelet transform:
S x ( t , f ) = W x ( τ , ν ) ϕ t f τ t α , α ν d τ d ν
       = C W T x ( t , f ) 2 = 1 | α | x ( t ) h t τ α d t 2 ,
where S x ( t , f ) is the scalogram, W x ( t , f ) is the Wigner representation, ϕ t f ( t , f ) is the time–frequency kernel, C W T x ( t , f ) is the continuous wavelet transform, α = f 0 f , and h ( t ) is the mother wavelet.

2.7.2. Smoothed Pseudo-Affine Wigner Distribution

The smoothed pseudo-affine Wigner distribution (SPAWD) shows similar properties to the wavelet transform; however, it exhibits a higher time–frequency resolution. Its two main drawbacks are the interference terms generated due to bilinearity and the difficult application to long-duration time signals. The method uses a short-time window that controls the trade-off between interference attenuation and resolution, making its computation more efficient. A transition from the high-resolution affine Wigner representation with interferences to the interference-free scalogram is possible. The SPAWD is defined as a self-correlation of the wavelet transform across frequency; the time windowing suppresses the interference components in the frequency direction, while the time direction smoothing is implemented by convolving the time windowing with a low-pass function [42]. Windowing must be frequency-dependent so that the resulting TFD remains affine covariant; thus, the smoothing in the frequency direction exhibits proportional bandwidth rather than constant bandwidth [13].
The enhancement in resolution is the result of the self-correlation that is applied to the signal, behaving like match-filtering, rather than just squaring the signal as for the wavelet transform [13,20]. The algorithm for the SPAWD is defined by computing the wavelet transform of the signal and by performing the generalized frequency correlation for each time point, which is efficiently implemented by the Mellin transform.
The general representation and the representation parameters (that for k = 2 result in the affine smoothed pseudo-Wigner–Ville representation) are defined as follows:
P ˜ x k ( t , f ) = f μ k ( u ) X f λ k ( u ) X * f λ k ( u ) e j 2 π f t ζ k ( u ) d u
               = f G ( u ) μ k ( u ) λ k ( u ) λ k ( u ) C W T x ψ ˜ ( t , λ k ( u ) f ) [ C W T x ψ ˜ ( t , λ k ( u ) f ) ] * d u ,
λ k = 2 ( u ) = 1 + tanh u 2 ,
μ k = 2 ( u ) = 1 tanh 2 u 2 ,
ζ k = 2 ( u ) = 2 tanh u 2 ,
where P ˜ x k ( t , f ) is the smoothed pseudo-affine Wigner representation, C W T x ψ ˜ ( t , f ) is the time–frequency version of the continuous wavelet transform with a band pass wavelet function ψ ( τ ) = h ( τ ) e j 2 π τ , λ k ( u ) as per Equation (8b), and G ( u ) is the dual variable window or low-pass function that smooths the TFR by proportional-bandwidth time smoothing.

2.7.3. Unitary Bertrand Distribution

Also known as the P 0 representation, obtained for k = 0 , its role in the affine class is comparable to the Wigner representation for the Cohen class. The unitary Bertrand representation localizes hyperbolas on the time–frequency plane, making it a hyperbolic representation [17,22,31].
The Bertrand representation B x ( t , f ) and its parameters for k = 0 are defined as follows:
B x ( t , f ) = f μ k ( u ) X f λ k ( u ) X * f λ k ( u ) e j 2 π f t u d u ,
λ k = 0 ( u ) = e u 2 u 2 sinh ( u 2 ) ,
μ k = 0 ( u ) = λ k = 0 ( u ) λ k = 0 ( u ) = u 2 sinh ( u 2 ) .
By using μ k = 0 ( u ) = 1 , we obtain the nonunitary Bertrand representation instead.

2.7.4. Unterberger Distribution

The Unterberger representation, preserving the scaling properties across frequency while smoothing the interferences in the frequency direction, is defined for the value k = 1 in Equation (8). There are two forms of the representation: the active form that exhibits the localization property and the passive form that does not [20]. The active Unterberger representation localizes on squared hyperbolas.
The representation and its parameters are defined as follows:
U x a ( t , f ) = f μ k ( u ) X f λ k ( u ) X * f λ k ( u ) e j 2 π f t ζ k ( u ) d u
= f 1 + 1 γ 2 X f γ X * f γ e j 2 π f t γ 1 γ d γ ,
μ k = 1 ( u ) = cosh u 2 ,
U x p ( t , f ) = f μ k ( u ) X f λ k ( u ) X * f λ k ( u ) e j 2 π f t ζ k ( u ) d u
= 2 γ X f γ X * f γ e j 2 π f t γ 1 γ d γ ,
λ k = 1 ( u ) = γ ( u ) = e u 2 ,
ζ k = 1 ( u ) = 2 sinh u 2 ,
where U x a ( t , f ) is the active Unterberger representation and U x p ( t , f ) is the passive Unterberger representation, with their respective parameters. The active Unterberger representation is achieved by using a specific μ k = 1 ( u ) function, as defined in Equation (31c), opposed to the passive Unterberger representation.

2.7.5. D-Flandrin Distribution

The D-Flandrin representation perfectly localizes signals on square root hyperbolas [31], and it is obtained for the value k = 1 2 .
The D-Flandrin representation D x ( t , f ) and its parameters are defined as follows:
D x ( t , f ) = f μ k ( u ) X f λ k ( u ) X * f λ k ( u ) e j 2 π f t ζ k ( u ) d u
         = f 1 γ 4 2 X f 1 + γ 4 2 X * f 1 γ 4 2 e j 2 π f t γ d γ ,
λ k = 1 2 ( u ) = 1 + tanh u 4 2 ,
ζ k = 1 2 ( u ) = γ ( u ) = 4 tanh u 4 ,
μ k = 1 2 ( u ) = 1 tanh 2 u 4 .
By using a function different from μ k = 1 2 ( u ) as defined in Equation (33e), the D-Distribution representation is obtained instead [20,31].

2.7.6. Other Affine Class Distributions

Affine time–frequency representations were first introduced in 1985 by Pierre Bertrand and Jaqueline Bertrand [43,44] soon after the wavelet theory was developed by Grossmann and Morlet [17]. The first affine representation was later referred to as the unitary affine Bertrand representation. In 1990, Flandrin and Rioul applied affine smoothing on some representations from Cohen’s class [45].
Flandrin and Rioul [45] provided a description of the affine smoothing of the Wigner–Ville distribution that resulted in a new class of representations with scale-dependent smoothing. The choices for the smoothing function and the properties were discussed. In addition, smoothing using separable Gaussian kernels was discussed, with the possibility of obtaining a continuous transition from spectrograms to scalograms, with the Wigner–Ville distribution between them. It was shown that it was possible to set up a specific requirement for a given application and then build a subset of timescale energy representations that meet those requirements by setting the values of selected parameters.
In [46], Ovarlez et al. provided more efficient algorithms based on the fast discrete Mellin transform for easier computation of the affine-group-affiliated TF distributions. Since the affine distributions use stretched forms of signals, they are normally more challenging to compute compared to standard techniques. Using only fast Fourier transform (FFT) routines made the algorithm very fast, allowing it to be employed as a practical tool for broad-band signal study.
Shenoy and Parks [47] proposed a symmetrized version of the wideband ambiguity function; by taking its 2D Fourier transform, the resulting function had properties similar to the Wigner distribution and was called the affine Wigner distribution. The introduced affine Wigner distribution was compared to the Wigner and the Q-distributions, and its properties were described. The method was based on group theory, and the results were compared to those presented in previous research by Altes, Rioul, and Flandrin.
Flandrin and Gonçalvès considered the geometry of bilinear affine distributions in the time–frequency plane [48]. The localization properties and generalized means of interference terms, as well as the generalized construction means, were established. It was stated that for frequency modulated (FM) signals, the defined general construction rules could be refined using the study of a critical manifold and stationary phase-type approximation in the case of point-wise application of those rules.
Flandrin and Gonçalvès [31] also considered the geometry of Bertrand’s bilinear affine distributions in the time–frequency plane. The localization properties, symmetry, and generalized means of interference terms, as well as the generalized construction means, were established. The reported theoretical results were corroborated by analytical and numerical examples. The affine distributions were shown to obey most of the same construction rules that the Wigner–Ville distribution obeys. A way of predicting interference diagrams was given for the affine class distributions and compared to the real interference patterns, showing good agreement between numerical computation and the real distribution generated for the signal.
Gonçalvès and Baraniuk suggested a set of pseudo- and smoothed pseudo-affine Wigner distributions [19]. When a short time window that controls the trade-off between localization and interference attenuation was applied to a pseudo-Wigner TFD, the pseudo-affine Wigner distribution was obtained. The applied windowing was frequency-dependent, so that the TFD remained affine, and as a result, the bandwidth was proportional rather than constant as in the pseudo-Wigner distribution. The pseudo-affine Wigner distribution was defined as the convolution of the wavelet transform with itself. The pseudo-affine version of the Wigner distribution can be interpreted as the sliding version of the affine Wigner distribution, thus making it suitable for long-duration signals and online real-time operation with the same computation cost as for the continuous wavelet transform. The introduced time windowing acted like a proportional bandwidth filter for the interference components oscillating in the frequency direction. A second window was utilized for smoothing in the time direction and for the suppression of interference components. This method provided continuous transition ability in smoothing from the affine Wigner distribution to the scalogram.
In [49], Murray et al. proposed a new higher-order Bertram distribution ( H O P 0 D ) as an extension of the Bertram distribution ( P 0 ); it preserved scale changes as well as constant and hyperbolic time shifts of the signal. A class of smoothed higher-order Bertram distributions was also derived; a formulation was proposed for the higher-order extension of the quadratic class that preserved scale changes and constant time shifts. A novel higher-order distribution was proposed as an extension of the second-order Bertrand distribution, and a high-order affine class for multidimensional smoothing of a higher-order Wigner distribution was introduced.
In [20], Gonçalvès and Baraniuk introduced pseudo-affine Wigner distributions as tools for time-varying spectral analysis. These new timescale distributions demanded reduced computational power, allowing online operation with resource costs similar to the continuous wavelet transform. The distributions offered the suppression of interference terms because of proportional bandwidth smoothing due to the short-time window that controls localization. The wavelet-based structure for these distributions allowed continuous transition in smoothing from the scalogram to the affine Wigner distribution. An alternative set of generators was introduced for this class that simplified the kernel formulation and helped design new distributions for specific signal classes.
Iribarren et al. dealt in [50] with monitoring conditions and the prediction of faults on rotating machines by implementing the processing of non-stationary vibrations using the affine Wigner distribution. The synthesis of the technique was briefly explained. Tests were performed on synthetic and real-life signals and showed promising results in detecting non-stationary events. The diagnosed faults were characterized by complex spectrum changes, and weak vibration non-stationarities were used for the condition monitoring of rotating machines. The tested distributions were the STFT, the smoothed pseudo-Wigner–Ville distribution, and the SPAWD. It was concluded that the affine distribution presented higher resolution but often exhibited undesired interference. Nevertheless, the SPAWD presented fewer interference cross-terms than the SPWVD.
Murray et al. proposed in [18] a new higher-order affine TF representation called HO-TFR. The new class was of a higher order than quadratic TFRs ( N > 2 ). Five alternative formulations were provided that defined multidimensional smoothing kernels. The new higher class preserved the time shift, frequency shift, and signal scale. Another subclass was defined that crossed with Cohen’s class and satisfied three covariances with a higher-order affine–Cohen intersection. Simplified formulations for each member of the new higher-order affine–Cohen subclass were provided as one-dimensional functions.
Next, in [51], Gosme et al. proposed a method for adaptive and iterative smoothing of bilinear affine representations while preserving the covariance properties with a diffusion-based technique. This provided locally adapted smoothing to the representation with the application of a conductance function that locally controls the amount of diffusion, thus allowing the adaptation of the kernel’s width with regard to the analysis scale. Depending on the area being processed, the technique was able to discriminate high-value structured signal terms from lower-value weakly structured noise or likely interference term signals (and thus could identify and protect the structured components while smoothing the weakly structured ones, or rather, the interference terms). Thus, the readability of the representations was improved while the interference terms were mostly removed.
Gosme and Richard [40] proposed adaptive diffusion processes that locally controlled the amount of smoothing and the orientation of the applied smoothing, while enhancing localization and preserving covariance. This approach provided a framework not only for the Cohen class but also for the affine class. Variable conductance, general scheme, local smoothing strength, and orientation of the diffusion were suggested both for the Cohen and affine classes. A selective forward-and-backward diffusion process was investigated to prove its ability to remove cross-terms while reaching a high concentration for the signal components.
Gang and Xiao-niu introduced the pseudo-affine spectral correlation analysis, an alternative set of spectral correlation and class kernels [52]. The newly proposed pseudo-affine spectral correlation functions overcame issues of interference terms; the flexibility of the wavelet-based structure allowed continuous smoothing transition from the affine to the wavelet spectral correlation functions. In addition, the proposed spectral correlation functions were efficient for online computation and had the same resource requirements as the continuous wavelet transform. They suppressed interference terms by using a sliding structure that acted as proportional bandwidth smoothing. The introduced generators simplified the kernel formulations and helped the design of new affine distributions.
Gonçalvès et al. discussed approaches for constructing the affine class and the tools associated with it [17]. The advantages and shortcomings were discussed, different covariant classes were compared, and their interference terms and kernels were elaborated. In addition, new classes of time–frequency covariant distributions were introduced.
Gavrovska et al., in [53], proposed an algorithm for the detection of fundamental heart sounds S1 and S2 from phonocardiograms without the use of an ECG as a reference signal. The algorithm was based on a joint TFR from the pseudo-affine Wigner–Ville distribution, the Haar wavelet lifting scheme, the normalized average Shannon energy, and autocorrelation. The results were obtained using the algorithm on real-life healthy and pathological pediatric phonocardiogram signals, achieving a relatively high success rate without using a reference ECG signal.
Finally, Berge et al. [16] examined the Wigner distribution through a quantization perspective emphasizing the group structure. One of the main results was to express the scalogram as a convolution of affine distributions. In addition, the literature on affine Wigner distributions was reviewed, and a connection was made to the Mellin transform, with the affine ambiguity function presented; several applications were given.
Next, we provide a numerical analysis of the above-described TFRs in terms of their tf concentration and IF estimation.

3. Examples and Simulation Results

Here we present simulation results and a comparison of three classes of TFRs: Cohen’s (classic), affine, and reassigned. The simulated examples were implemented in MATLAB with the help of the TFTB toolbox [13]. We also point interested readers to other freely available time–frequency signal analysis and processing toolboxes, such as the TFSAP [54,55] and the LTFAT [56]. The methods were tested on mono-component and multi-component synthetic signals (having constant, linear, parabolic, and sinusoidal FM) with additive noise, as well as on real-world signals. The evaluation Was performed in terms of IF estimation accuracy and TFR concentration. The estimated IF was calculated as in [10,11,12,57]:
f e ( t ) = a r g { max f TFR ( t , f ) } ,
where f e ( t ) is the estimated IF. Estimation of the IF from TFR maxima cannot be applied directly in the case of multi-component signals with components being present instantaneously; hence, component extraction was the step performed preceding IF estimation.
The analyzed TFRs from Cohen’s class include the spectrogram (SP), smoothed pseudo-Wigner–Ville (SPWV), and Wigner–Ville (WV) distributions. The tested affine class TFRs include affine Morlet wavelet scalogram (AMWSC), affine smoothed pseudo-Wigner–Ville (ASPWV), affine unitary Bertrand (AUB), affine active Unterberger (AAU), and affine D-Flandrin (ADF) representations, while from the reassigned class we considered the reassigned spectrogram (RSP), reassigned Gabor spectrogram (RGSP), reassigned Morlet scalogram (RMSC), reassigned pseudo-Wigner–Ville (RPWV), and reassigned smoothed pseudo-Wigner–Ville (RSPWV). The IF accuracy was assessed in terms of mean squared error (MSE), and the Rényi entropy was calculated for each TFR to evaluate the energy distribution concentration [58] and signal complexity [57] in the time–frequency domain as:
R x α = 1 1 α log 2 ( TFR x α ( t , f ) d t d f ) ,
where R x α is the α order Rényi entropy for the normalized TFR x (in order to annul interferences contributions, we have set the Rényi entropy order to α = 3 , as in [22,59]).

3.1. Examples of Synthetic Signals

The methods from the three considered classes of TFRs were tested on synthetic, noisy (corrupted by additive white Gaussian noise with SNR ranging from 10 dB to −5 dB) signals having constant FM, linear FM (chirp), parabolic FM, and sinusoidal FM. The TFR plots for noisy signals are shown for a single noise realization, while the reported tables contain the MSE values and the Rényi entropies averaged over 100 iterations of random noise applied to the signal.

3.1.1. Examples of Mono-Component Noisy Signals

First, we compare the Cohen’s, affine, and reassigned classes of TFRs in the time–frequency domain for a mono-component noisy chirp signal.
Figure 3 shows the TFRs and IFs for the noisy chirp signal for 5 dB SNR. When considering classic TFRs, as expected, the best resolution was achieved by WV. On the other hand, the TFR from Cohen’s class that provided a balance between cross-terms and TF resolutions was SPWV. At the same time, SP resulted in the poorest TF resolution and reduced cross-terms. Next, the affine class TFRs are given in the second column of Figure 3, where AMWSC offered similar performance to the SP, achieving poorer resolution with significantly reduced cross-terms. Visual inspection shows a similar performance for AUB, AAU, and ADF, with numerous cross-terms and relatively good resolution. ASPWV offered an acceptable trade-off between cross-terms and resolution compared to the aforementioned affine TFRs for the noisy chirp signal. The reassigned TFRs are listed in the third column of Figure 3. RSP, RGSP, and RMSC acted rather similarly, exhibiting slightly poorer resolution with different levels of noise (while RPWV and RSPWV presented better resolution with residual cross-terms, with the latter performing significantly better with regard to cross-terms).
The IF estimation results, in terms of the MSE, for the noisy chirp signal are provided in Table 1. In general, as shown in the provided numerical results, Cohen’s class offered the most accurate IF estimation for the noisy, linear FM signal (with WV performing the best for intensive noise of −5 dB SNR, and SPWV performing the best for other tested SNRs). On the other hand, the worst performance for this type of signal for all SNRs was the affine class (to be more specific, ASPWV, except for the noise case of −5 dB SNR where AAU yielded the highest IF estimation MSE). From the reassigned class, RSPWV was the best for all SNRs, except for −5 dB, where it was outperformed by RPWV. The classic and reassigned TFRs exhibited better performance in comparison with affine TFRs for the tested signal.
In addition to the IF estimation accuracy, interesting conclusions can be drawn from Table 2 showing the Rényi entropy values for each tested TFR of the noisy chirp signal. Here we find that TFRs belonging to Cohen’s class resulted in the highest Rényi entropy values compared to the affine and reassigned classes for all tested SNRs. The best-performing TFRs, in terms of distribution concentrations, from the reassigned class were RSPWV for higher SNRs (10 dB and 5 dB) and RMSC for lower SNRs (0 dB and −5 dB). Interestingly, RSPWV reduced the Rényi entropy by 19.61% and 17.60% compared to the worst-performing TFRs from Cohen’s class, respectively (SP for 10 dB, WV for 5 dB). On the other hand, RMSC reduced the Rényi entropy by 17.09% and 15.43% compared to the worst-performing WV for 0 dB and −5 dB, respectively. As another example of a mono-component signal, let us consider a noisy, sinusoidal FM signal.
TFRs from the Cohen’s, affine, and reassigned classes for the mono-component, noisy, sinusoidal FM signal are found in Figure 4. Here, when considering the classic TFR class, the WV showed severe cross-terms and better resolution than SP and SPWV (SPWV outperformed SP in terms of resolution, having cross-terms suppressed). Next, considering the affine TFRs, ADF, AAU, and AUB offered comparable performance, with relatively poor resolution and partly reduced cross-terms. AMWSC visually performed similarly to SP, with less resolution in the low-frequency range. ASPWV offered reduced cross-terms with rather reasonable resolution. Lastly, from the reassigned TFRs, RGSP, RSP, and RMSC had slightly poorer resolutions (the last being the worst-performing).
On the other hand, the RPWV and RSPWV had better resolution, with the first having faint cross-terms.
A quantitative comparison of the analyzed TFRs for this example is given in Table 3, which presents the estimated IF MSE from each tested TFR for a noisy, mono-component sinusoidal FM signal. Unlike in the case of the linear FM signal, here we find a significant difference between low and intensive noise. Namely, for low noise, the reassigned class significantly outperformed Cohen’s class, decreasing the MSE from 3.09   ×   10 4 and 6.05   ×   10 4 (achieved for the best-performing SPWV from Cohen’s class) to 1.06   ×   10 4 and 8.08   ×   10 4 (achieved for the best-performing RGSP and RSPWV from the reassigned class) for SNRs of 10 dB and 5 dB, respectively. The affine class performed poorly, with only WV providing a higher IF estimation MSE. On the other hand, in the case of intensive noise (0 dB and −5 dB SNR), SP and SPWV from Cohen’s class performed better than the reassigned and affine class TFRs (except for RPWV for −5 dB SNR, which decreased MSE in comparison to SPWV from 1.51   ×   10 2 to 1.46   ×   10 2 ). Again, the affine class performed poorly, with AAU resulting in the largest MSE.
The consistency in terms of TFR entropy-based concentration for different SNRs for this example can be observed in Table 4, which reports the Rényi entropy values for each tested TFR. Simulation results show the reassigned TFRs achieved the lowest Rényi entropy values (with negligible differences inside the class), and classic TFRs had the highest entropy values (with WV yielding the highest entropy for all SNRs). For the 10 dB, 5 dB, and 0 dB scenarios, the Rényi entropy ranged from 13.368 , 13.587 , and 13.923 (for RGSP) to 16.118 , 16.355 , and 16.569 (for WV). Finally, for −5 dB, the TFR entropy ranged from 14.207 for RMSC to 16.690 for WV. The affine TFRs demonstrated medium performance compared to the other two classes. On average, the best-performing TFRs for each class, in the case of the sinusoidal mono-component FM signal, were RGSP, AMWSC, and SPWV.

3.1.2. Example of a Multi-Component Noisy Successive Signal

We continue our analysis for multi-component noisy synthetic signals. The first one to be analyzed is a signal consisting of multiple successive parabolic and linear FM components corrupted by additive white Gaussian noise.
Examples of TFRs of a noisy multi-component signal with successive parabolic and linear FM components for the SNR of 5 dB are given in Figure 5. Cohen’s class of TFRs (the first column in Figure 5) varied from good resolution and numerous cross-terms for WV to poor resolution with reduced cross-terms for SP, while SPWV provided a reasonable trade-off between the two. Next, the second column of Figure 5 presents the TFRs from the affine class, with AMWSC having similar performance to SP with visually somewhat better resolution (with a loss of resolution for the lower frequencies). AUB, AAU, and ADF visually performed similarly, with relatively good resolution, but they were also affected by cross-terms. ASPWV had moderately poorer resolution with a decrease in cross-term content. Lastly, from the reassigned TFRs, RSP, RGSP, and RMSC visually performed similarly (having acceptable resolution). RPWV and RSPWV presented good resolution, but both suffered from cross-terms.
MSEs of the estimated IF of a noisy multi-component signal with successive parabolic and linear FM components are given in Table 5. In the case of higher SNRs (10 dB and 5 dB) the IF MSE for Cohen’s class was minimal for SPWV (67.96 × 10−5 and 136.99 × 10−5, respectively). From the affine class, AMWSC outperformed other TFRs achieving 76.27 × 10−5 and 113.90 × 10−5 for SNRs of 10 dB and 5 dB, respectively. From the reassigned class, for SNRs of 10 dB and 5 dB, RGSP outperformed other reassigned TFRs, achieving MSEs of 9.39 × 10−5 and 39.73 × 10−5, respectively. When considering intensive noise scenarios, it is interesting to note that SP offers the most accurate IF estimation for 0 dB (304.51 × 10−5). For comparison, the worst-performing was AAU from the affine class, resulting in an IF estimation MSE of 1774.30 × 10−5. Similar conclusions can be drawn for the case of an SNR of −5 dB, where SP’s IF estimation MSE was 1125.70 × 10−5 (negligibly outperformed by RPWV with an MSE of 1082.20 × 10−5). Again, as for 0 dB, AAU was the worst-performing with 2321.90 × 10−5. Thus, Cohen’s class resulted in the most accurate IF estimation for intensive noise SNRs for the noisy multi-component signal with successive parabolic and linear FM components, while the reassigned class of TFRs was the best-performing for low-noise environments. In both cases, the affine class was outperformed by the other two classes.
Unlike in the case of IF estimation, where the reassigned class was the best choice for low-noise and Cohen’s class was best for high-noise scenarios, when considering the Rényi entropy of TFRs of a signal with successive parabolic and linear FM components, the best-performing class for all tested SNRs was the reassigned class. This is evident from Table 6, where it can also be seen that Cohen’s class performs, in general, worse than the affine class for this type of signal in terms of TFR concentration. The worst-performing of all tested SNRs was WV, while the worst-performing affine class TFR, AAU, outperformed WV by up to 2.76%, 1.99%, 1.27%, and 0.77% for SNRs 10, 5, 0, and −5 dB, respectively. On the other hand, the best-performing reassigned TFR for all tested SNRs was RMSC, which reduced the Rényi entropy by 19.86%, 18.77%, 16.55%, and 14.42% for SNRs 10, 5, 0, and −5 dB, respectively, compared to WV.

3.1.3. Example of a Multi-Component Noisy Concurrent Signal

We conclude the simulation analysis of synthetic signals with another example of a multi-component noisy signal consisting of concurrent constant and parabolic FM components with additive Gaussian noise. For visual assessment, in Figure 6, we present TFRs from the Cohen’s, affine, and reassigned classes for 5 dB SNR. Again, from the classic TFRs, SP had poor resolution, SPWV achieved better resolution with some cross-terms, and WV had even better resolution (however, with severe interference terms). From the affine class, AUB, AAU, and ADF visually had similar performance (balancing time–frequency resolution and amount of cross-terms). ASPWV had poorer resolution; however, it also had weaker cross-terms. AMWSC displayed a similar performance to the scalogram. Lastly, from the reassigned TFRs, RSP, RGSP, and RMSC visually performed similarly with somewhat poorer resolution.
RPWV and RSPWV presented satisfactory resolution, with the former exhibiting more accentuated cross-terms.
MSE for the estimated IF for the noisy multi-component signal with concurrent constant and parabolic FM components is given in Table 7. It can be seen that the affine class AMWSC provided, in general, the lowest MSE for all tested SNRs. For the 10 dB SNR, AMWSC reduced the MSE by 75.48 % and 29.84% compared to the second-best (SP) for the first and second components, respectively. For 5 dB, AMWSC reduced MSE by 90.41% compared to the second-best (SP) for the first component, while for the second component, SP performed somewhat better, resulting in an MSE of 8.04   ×   10 6 (AMWSC obtained 8.81   ×   10 6 ). For 0 dB, AMWSC performed well, with MSE being reduced by 93.19% for the first component compared to the second-best (SPWV), and MSE increased by 5.01% compared to SPWV for the second component. Moreover, for an intensive noise environment (−5 dB SNR), AMWSC reduced the MSE by 61.68% and 0.32% (for the first and second components, respectively) compared to the second-best from the affine class (AAU).
TFR Rényi entropy results for a noisy multi-component signal with concurrent constant and parabolic FM components for different SNR levels (averaged over 100 noise realizations) is presented in Table 8. The Rényi entropy ranged from 13.308 to 16.698, with those values belonging to RSPWV for the SNR of 10 dB and WV for an SNR of −5 dB, respectively. The results show that the reassigned TFRs outperformed those from the Cohen’s and affine classes by up to 18.01%, 16.28%, 15.34%, and 14.33% for 10 dB, 5 dB, 0 dB, and −5 dB SNRs, respectively (with Cohen’s class WV resulting in the highest Rényi entropy for all SNRs). In terms of the entropy-based distribution concentration for this multi-component signal, the best-performing TFRs on average per class were AMWSC for the affine class, SPWV for Cohen’s class, and RSPWV and RMSC from the reassigned class for low and intense noise, respectively.

3.2. Examples of Real-World Signals

Next, we present the performance of the Cohen’s, affine, and reassigned class distributions using two real-world examples, in terms of concentration, measured by the Rényi entropy.
First, we analyzed the concentrations of TFRs for an electroencephalogram (EEG) signal, observing the P300 response to external stimuli. The multichannel dataset consisted of 32 channels measuring brain activity and 942 trials with and without external stimuli [60]. We focused on the Cz electrode and analyzed the signal obtained from the average of 466 responses to stimuli.
Figure 7 gives the time–frequency distributions of the analyzed EEG P300 signal. A simple visual inspection of the classic group of TFRs shows that SP had poor concentration, which was improved on by SPWV at the cost of interference terms. WV showed even better resolution, but was more deteriorated by cross-terms. From the affine TFRs, the poorest concentration was found for AMWSC, followed by AUB, AAU, and ADF. ASPWV had moderately poorer concentration; however, it had fewer cross-terms. From the reassigned TFRs, RSP and RGSP achieved similar performance. RMSC performed with acceptable concentration, and RPWV and RSPWV visually performed similarly, showing satisfactory concentration, with the latter performing better regarding cross-terms.
A numerical comparison of the analyzed TFRs’ for the EEG P300 signal is given in Table 9. The Rényi entropy ranges between 10.178 and 16.689. The classic TFRs exhibited average performance, the affine TFRs had poorer performance, and the reassigned TFRs achieved the best performance. The best- and worst-performing TFRs, when considering all TFR classes, were RMSC and AMWSC. The best reassigned TFR (RMSC) outperformed the worst TFR (affine AMWSC) by 39.01%, and the best TFR from the Cohen class (SPWV), by 20.6%.
Finally, we provide the comparison of the analyzed TFRs on another real-world example: the vertical acceleration seismogram of the earthquake that hit Zagreb, the capital of Croatia, on 22 March 2020. The magnitude was 5.5 according to the Richter scale, making it the strongest earthquake to hit Zagreb since the 1880s. The depth at the epicenter was 10 km, and the epicenter was 7 km north of the city center. The closest surveying station was located in the city and municipality of Lobor, some 38 km from the capital of Zagreb, where the seismogram signal was recorded.
TFRs of the earthquake signals are shown in Figure 8. As in the previous real-life example, for Cohen’s class, WV achieved the best resolution and most cross-terms, as opposed to SP for both features. SPWV, as already seen, provided a trade-off between resolution and cross-terms. When considering the affine group of TFRs, AMWSC presented poor concentration according to visual inspection, while AUB, AAU, and ADF had intense cross-terms compared to ASPWV. From the reassigned TFRs, RSP and RGSP showed relatively good concentration, which was further improved for RMSC. RPWV visually showed good concentration, but it was also affected by cross-terms, whilst RSPWV showed acceptable performance with regard to concentration, with the presence of few cross-terms.
Table 10 shows the TFR Rényi entropies for the Zagreb 2020 earthquake vertical acceleration seismogram signal. The Rényi entropy values ranged from a minimum of 15.084 to a maximum of 21.526. It can be seen that the TFRs assume class-related behavior, with the affine group performing the worst, the classic group of TFRs performing in the middle, and the reassigned group of TFRs being the best in terms of the Rényi entropy concentration. The best-performing TFRs per group were SPWV for the classic group, ASPWV for the affine class, and RMSC for the reassigned class. The best-performing TFR for this example, in general, was RMSC, and the worst-performing was AMWSC. The best reassigned TFR (RMSC) outperformed the worst TFR (affine AMWSC) by 29.92%, and the best TFR from the Cohen class (SPWV) by 12.33%, in the case of the tested earthquake signal.

3.3. Computational Cost of Analyzed TFRs

Here we present the computational cost of the tested methods. The TFRs were computed in MATLAB (R2016a version) on Windows 10. As a computational platform, we used an HP Pavilion Power 17-ab307nm 2ZJ05EA with an Intel® Core™ i5-7300HQ CPU @ 2.50 GHz, 16 GB of DDR4-2666 (1333 MHz) RAM, and NVIDIA GeForce GTX 1050 Ti GPU with 4 GB GDDR5 of dedicated memory (7.9 GB total shared memory).
Table 11 shows the computational cost of calculating each of the analyzed TFRs from previous examples (averaged over 100 calculations for an SNR of 5 dB). The fastest TFRs, in terms of computational CPU time, were WV, SP, RPWV, and AMWSC, while the computationally most demanding was AAU. As for groups, Cohen’s class performed the fastest, followed by the reassigned class and then the affine class. In addition, as expected, signal length significantly affected the computational cost. To study the effect of signal length on computational cost in more depth, we present CPU computational times in Table 12 for an example of a linear FM signal with a different number of signal samples (averaged over 100 calculations).
Next, in terms of the computational cost of analyzed TFDs, we present the computational time for calculating TFRs of different dimensions, as shown in Table 13. As shown, in general, a larger TFR dimension led to slower TFR calculations, especially for the affine and reassigned classes, while for Cohen’s class, this increase in CPU computational time was not so significant.
In conclusion, this paper provides novel insight into the performances of TFRs from the Cohen’s, affine, and reassigned classes in terms of their IF estimation, entropy-based concentration, and computational cost. To the best of our knowledge, based on an extensive literature review, there exists no similar study comparing SP, WV, SPWV, AMWSC, ASPWV, AUB, AAU, ADF, RSP, RGSP, RMSC, RPWV, and RSPWV on both synthetic (mono-component and multi-component) signals in noise for different SNR levels, as well as on the real-time examples. The conclusions and findings of this research may help in selecting the appropriate TFR for a signal of interest to be analyzed.
As a direction for future work, we suggest designing new adaptive TFRs optimized for IF estimation and/or distribution concentration and applying these in fields where classical TFRs are often used, with the expectation of outperforming them.

4. Results Discussion

This study compares the performance of three classes of TFRs (Cohen’s, affine, and reassigned) for different signals (mono-component and multi-component), different frequency modulations, and different noise levels. As it is visible from the reported results, there is no one particular TFR that performs best in all tested scenarios in terms of IF estimation accuracy and TF concentration.
In general, when considering Cohen’s class, consistently good performance was achieved by SPWV. From the affine class, MWSC and ASPWV performed relatively well, while RSPWV and RPWV should be considered as the first choices from the reassigned class.
When comparing TFR classes, the classic Cohen’s and reassigned TFRs outperformed the affine group with regard to the MSE of the estimated IF. In general, for most cases and SNRs, the Cohen’s class resulted in improved IF estimation, while the reassigned class resulted in somewhat higher MSE values. Finally, the affine class saw the least change in performance with the change in the SNR, having mostly medium to poor performance, with a few exceptions.
According to the Rényi entropy measure, the reassigned class performed best, followed by the affine and Cohen’s class TFRs, with some exceptions. The best-performing TFRs per group were AMWSC, SPWV, and RMSC. Over most of the examples and SNRs, the best-performing TFRs were RMSC, RSP, RSPWV, and RGSP.
The best-performing TFR of the reassigned class was RSPWV, followed by RMS and RGS (which was more sensitive to noise). The best-performing TFRs from Cohen’s class were SP (having relatively poor concentration) and SPWV (presenting some residual cross-terms). Finally, in the case of the affine class, visually, the best-performing was AMWSC, with a poorer concentration in the lower frequency bands, and ASPWV, which suffered from cross-terms.
According to the Rényi entropy, used as a measure of TFR concentration, the affine class of TFRs achieved poor performance, Cohen’s class TFRs displayed medium performance, while the reassigned class resulted in the best performance for the real-life examples. In general, the best TFR, according to the Rényi entropy, for tested real-world examples was RSPWV, followed by RMS, RGS, and RS. When compared at a class level, the best TFRs for analyzed real-life examples were ASPWV in the affine class, SPWV in Cohen’s class, and RMS in the reassigned class.

5. Conclusions

This paper elaborated on and compares three classes of TFRs: Cohen’s, affine, and reassigned, including the theoretical background of the selected TFRs belonging to these classes. Next, we performed extensive numerical simulations on non-stationary signals, including both synthetic and real-life examples. TFD quality was evaluated with respect to IF estimation accuracy, TF concentration measured by the Rényi entropy, and the presence of cross-terms.
As shown, the performances of the TFRs were highly affected by the type of signal being analyzed. When choosing the proper TFR, one should consider the following parameters: the presence and intensity of noise, the number of signal components, and the FM of signal components, to name a few.
The best-performing TFRs in terms of the IF estimation MSE, for the mono-component linear FM signal, were WV and SPWV, depending on the noise intensity. In the case of the mono-component sinusoidal FM signal, the best-performing TFRs were RGSP, RSPWV, SP, and SPWV as the noise intensified. On the other hand, the best for the successive multi-component signal was RGSP for lower noise and SP for high noise. For the concurrent multi-component signal, AMWSC outperformed all other TFRs.
Regarding the Rényi entropy, the best-performing TFRs for mono-component linear FM signals were RSPWV for lower noise and RMSC for higher noise. For the sinusoidal signal, the best TFRs were RGSP and RMSC, depending on the noise level. For the successive multi-component FM signal, RMSC outperformed other TFRS for all tested SNRs, while for the concurrent multi-component signal, the best TFRs were RSPWV for the higher SNRs and RMSC for the lower SNRs.
Furthermore, there is often a compromise between the time–frequency resolution and cross-term presence, as well as between the accuracy of the IF estimation and TF concentration.
Since there is no one particular TFR with superior performance for all cases, this study presents extensive insight into the behavior of the analyzed TFRs classes, providing help in choosing the appropriate representation for the analyzed signal of a non-stationary phenomenon.
In future work, we plan to focus on developing new adaptive TFRs (from the Cohen’s, affine, and reassigned classes) optimized for IF estimation and/or entropy-based concentration and to apply these adaptive TFRs in cases where classical TFRs are often utilized, with the expectation of improving upon their performance.

Author Contributions

Conceptualization, J.L. and N.S.; methodology, D.B., J.L. and I.P.V.; software, D.B. and J.L.; validation, J.L., N.S. and I.P.V.; formal analysis, D.B., J.L. and N.S.; investigation, D.B., J.L., N.S. and I.P.V.; resources, D.B. and J.L.; data curation, D.B. and J.L.; writing—original draft preparation, D.B. and J.L.; writing—review and editing, D.B., J.L., N.S. and I.P.V.; visualization, D.B. and J.L.; supervision, J.L.; project administration, J.L. and I.P.V.; funding acquisition, J.L., N.S. and I.P.V. All authors have read and agreed to the published version of the manuscript.

Funding

This work was fully supported by the Croatian Science Foundation under the project IP-2018-01-3739, EU Horizon 2020 project “National Competence Centres in the Framework of EuroHPC (EUROCC)”, IRI2 project “ABsistemDCiCloud” (KK.01.2.1.02.0179), the University of Rijeka projects uniri-tehnic-18-17 and uniri-tehnic-18-15, Croatian–Slovenian bilateral project BI-HR/20-21-043, and the Juraj Dobrila University of Pula.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The Zagreb earthquake seismogram data were provided by the University of Zagreb, Faculty of Science, Department of Geophysics.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
TFRTime–Frequency Representation
TFDTime–Frequency Distribution
TSRTime–Scale Representation
TSDTime–Scale Distribution
IFInstantaneous Frequency
TFTime–Frequency
FMFrequency Modulation
SNRSignal-to-Noise Ratio
MSEMean Squared Error
SPSpectrogram
SPAWDSmoothed Pseudo-Affine Wigner Distribution
SPWVSmoothed Pseudo-Wigner–Ville
WVWigner–Ville
AMWSCAffine Morlet Wavelet Scalogram
ASPWVAffine Smoothed Pseudo-Wigner–Ville
AUBAffine Unitary Bertrand
AUUAffine Active Unterberger
ADFAffine D-Flandrin
RSPReassigned Spectrogram
RGSPReassigned Gabor Spectrogram
RMSCReassigned Morlet Scalogram
RPWVReassigned Pseudo-Wigner–Ville
RSPWVReassigned Smoothed Pseudo-Wigner–Ville

References

  1. Gonçalves, P.; Flandrin, P.; Chassande-Mottin, E. Time-Frequency Methods in Time Series Data Analysis. In Proceedings of the Gravitational Wave Data Analysis Workshop 2, Orsay, France, 13–15 November 1997; Davier, M., Hello, P., Eds.; Editions Frontières: Paris, France, 1997; pp. 35–46. [Google Scholar]
  2. Gröchenig, K. Foundations of Time-Frequency Analysis; Birkhauser Verlag: Basel, Switzerland, 2013. [Google Scholar] [CrossRef]
  3. Orini, M.; Laguna, P.; Mainardi, L.T.; Bailón, R. Time-Frequency Analysis of Cardiovascular Signals and Their Dynamic Interactions. In Complexity and Nonlinearity in Cardiovascular Signals; Springer International Publishing: Berlin/Heidelberg, Germany, 2017; pp. 257–287. [Google Scholar] [CrossRef] [Green Version]
  4. Zeevi, Y.; Coifmana, R. (Eds.) Wavelet Analysis and its Applications. In Signal and Image Representation in Combined Spaces; Academic Press: Cambridge, MA, USA, 1998; Volume 7, p. 585. [Google Scholar] [CrossRef]
  5. Volpato, A.; Collini, E. Time-frequency methods for coherent spectroscopy. Opt. Express 2015, 23, 20040–20050. [Google Scholar] [CrossRef]
  6. Hurley, C.; Mclean, J. Wavelet: Analysis and Methods; Wavelet, ETP: Selangor Darul Ehsan, Malaysia, 2018. [Google Scholar]
  7. Volpato, A.; Collini, E. Optimization and selection of time-frequency transforms for wave-packet analysis in ultrafast spectroscopy. Opt. Express 2019, 27, 2975–2987. [Google Scholar] [CrossRef] [PubMed]
  8. Auger, F.; Chassande-Mottin, E. Quadratic Time-Frequency Analysis I: Cohen’s Class; ISTE: Vallsta, Sweden, 2008; pp. 131–163. [Google Scholar] [CrossRef]
  9. Boashash, B. Time Frequency Signal Analysis and Processing: A Comprehensive Reference; Academic Press: Amsterdam, The Netherlands; Boston, MA, USA, 2016. [Google Scholar]
  10. Lerga, J.; Member, V.S.E.; Boashash, B. An Efficient Algorithm for Instantaneous Frequency Estimation of Nonstationary Multicomponent Signals in Low SNR. EURASIP J. Adv. Signal Process. 2011, 2011, 725189. [Google Scholar] [CrossRef] [Green Version]
  11. Lerga, J.; Sucic, V. Nonlinear IF Estimation Based on the Pseudo WVD Adapted Using the Improved Sliding Pairwise ICI Rule. IEEE Signal Process. Lett. 2009, 16, 953–956. [Google Scholar] [CrossRef]
  12. Lerga, J.; Sucic, V. An instantaneous frequency estimation method based on the improved sliding pair-wise ICI rule. In Proceedings of the 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010), Kuala Lumpur, Malaysia, 10–13 May 2010; pp. 161–164. [Google Scholar] [CrossRef]
  13. Auger, F.; Flandrin, P.; Gonçalves, P.; Lemoine, O. Time-Frequency Toolbox Tutorial; CNRS: Paris, France; Rice University: Houston, TX, USA, 2005. [Google Scholar]
  14. Mika, D.; Józwik, J. Advanced Time-Frequency Representation in Voice Signal Analysis. Adv. Sci. Technol. Res. J. 2018, 12, 251–259. [Google Scholar] [CrossRef]
  15. Feichtinger, H.G.; Strohmer, T. (Eds.) Gabor Analysis and Algorithms Theory and Applications; Birkhäuser: Boston, MA, USA, 1998. [Google Scholar] [CrossRef]
  16. Berge, E.; Berge, S.M.; Luef, F. The Affine Wigner Distribution. arXiv 2019, arXiv:1912.03112. [Google Scholar] [CrossRef]
  17. Gonçalvés, P.; Ovarlez, J.P.; Baraniuk, R. Quadratic Time-Frequency Analysis III: The Affine Class and Other Covariant Classes. In Time-Frequency Analysis; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2008; Chapter 7; pp. 193–226. [Google Scholar] [CrossRef]
  18. Murray, R.; Papandreou-Suppappola, A.; Boudreaux-Bartels, G. A new class of affine higher order time-frequency representations. In Proceedings of the 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP99 (Cat. No.99CH36258), Phoenix, AZ, USA, 15–19 March 1999; Volume 3, pp. 1613–1616. [Google Scholar] [CrossRef]
  19. Goncalves, P.; Baraniuk, R. Pseudo affine Wigner distributions. In Proceedings of the 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, Atlanta, GA, USA, 9 May 1996; Volume 3, pp. 1423–1426. [Google Scholar] [CrossRef] [Green Version]
  20. Goncalves, P.; Baraniuk, R. Pseudo affine Wigner distributions: Definition and kernel formulation. IEEE Trans. Signal Process. 1998, 46, 1505–1516. [Google Scholar] [CrossRef] [Green Version]
  21. Sandsten, M. Time-Frequency Analysis of Time-Varying Signals and Non-Stationary Processes; Lund University: Lund, Sweden, 2016. [Google Scholar]
  22. Boashash, B.; Ouelha, S. Designing high-resolution time–frequency and time–scale distributions for the analysis and classification of non-stationary signals: A tutorial review with a comparison of features performance. Digit. Signal Process. 2018, 77, 120–152. [Google Scholar] [CrossRef]
  23. Pikula, S.; Beneš, P. A New Method for Interference Reduction in the Smoothed Pseudo Wigner-Ville Distribution. Int. J. Smart Sens. Intell. Syst. 2014, 7, 1–5. [Google Scholar] [CrossRef] [Green Version]
  24. Rioul, O.; Flandrin, P. Time-scale energy distributions: A general class extending wavelet transforms. IEEE Trans. Signal Process. 1992, 40, 1746–1757. [Google Scholar] [CrossRef]
  25. Hlawatsch, F.; Kozek, W. The Wigner distribution of a linear signal space. IEEE Trans. Signal Process. 1993, 41, 1248–1258. [Google Scholar] [CrossRef]
  26. Stankovic, L. An analysis of some time-frequency and time-scale distributions. Ann. Telecommun. 1994, 49, 505–517. [Google Scholar] [CrossRef]
  27. Matz, G.; Hlawatsch, F. Wigner distributions (nearly) everywhere: Time–frequency analysis of signals, systems, random processes, signal spaces, and frames. Signal Process. 2003, 83, 1355–1378. [Google Scholar] [CrossRef]
  28. Hlawatsch, F.; Mecklenbräuker, W. (Eds.) The Wigner Distribution; Elsevier: Amsterdam, The Netherlands, 1997; p. 459. [Google Scholar]
  29. Szmajda, M.; Górecki, K.; Mroczka, J. Gabor Transform, Gabor-Wigner Transform and SPWVD as a time-frequency analysis of power quality. In Proceedings of the 14th International Conference on Harmonics and Quality of Power—ICHQP 2010, Bergamo, Italy, 26–29 September 2010; pp. 1–8. [Google Scholar] [CrossRef]
  30. Boudreaux-Bartels, G. Mixed Time- Frequency Signal Transformations. In Electrical Engineering Handbook; CRC Press LLC: Boca Raton, FL, USA, 2000; Chapter 12; p. 76. [Google Scholar] [CrossRef]
  31. Flandrin, P.; Goncalvès, P. Geometry of Affine Time–Frequency Distributions. Appl. Comput. Harmon. Anal. 1996, 3, 10–39. [Google Scholar] [CrossRef] [Green Version]
  32. Kodera, K.; De Villedary, C.; Gendrin, R. A new method for the numerical analysis of non-stationary signals. Phys. Earth Planet. Inter. 1976, 12, 142–150. [Google Scholar] [CrossRef]
  33. Kodera, K.; Gendrin, R.; Villedary, C. Analysis of time-varying signals with small BT values. IEEE Trans. Acoust. Speech Signal Process. 1978, 26, 64–76. [Google Scholar] [CrossRef]
  34. Auger, F.; Flandrin, P. Generalization of the reassignment method to all bilineartime-frequency and time-scale representations. In Proceedings of the ICASSP’94—IEEE International Conference on Acoustics, Speech and Signal Processing, Adelaide, Australia, 19–22 April 1994; Volume 4, pp. IV/317–IV/320. [Google Scholar] [CrossRef]
  35. Auger, F.; Flandrin, P. Improving the readability of time-frequency and time-scale representations by the reassignment method. IEEE Trans. Signal Process. 1995, 43, 1068–1089. [Google Scholar] [CrossRef] [Green Version]
  36. Flandrin, P.; Auger, F.; Chassande-Mottin, E. Time-Frequency reassignment: From principles to algorithms. In Applications in Time-Frequency Signal Processing; Electrical Engineering & Applied Signal Processing Series; Papandreou-Suppappola, A., Ed.; CRC Press: Boca Raton, FL, USA, 2002; Volume 10, pp. 179–204. [Google Scholar] [CrossRef] [Green Version]
  37. Shafi, I.; Jamil, A.; Shah, S.; Kashif, F. Techniques to Obtain Good Resolution and Concentrated Time-Frequency Distributions: A Review. EURASIP J. Adv. Signal Process. 2009, 2009, 43. [Google Scholar] [CrossRef] [Green Version]
  38. Abratkiewicz, K.; Samczyński, P.; Fourer, D. A Comparison of the Recursive and FFT-based Reassignment Methods in Micro-Doppler Analysis. In Proceedings of the 2020 IEEE Radar Conference (RadarConf20), Florence, Italy, 21–25 September 2020; pp. 1–6. [Google Scholar] [CrossRef]
  39. Fitz, K.R.; Fulop, S.A. A Unified Theory of Time-Frequency Reassignment. arXiv 2009, arXiv:0903.3080. [Google Scholar]
  40. Gosme, J.; Richard, C.; Goncalves, P. Adaptive diffusion as a versatile tool for time-frequency and time-scale representations processing: A review. IEEE Trans. Signal Process. 2005, 53, 4136–4146. [Google Scholar] [CrossRef] [Green Version]
  41. Hainsworth, S.; Macleod, M. Time Frequency Reassignment: A Review and Analysis; Technical Report; Cambridge University Engineering Department: Cambridge, UK, 2003. [Google Scholar]
  42. Yang, H.; Sun, M.; Xu, Y. A New Time-frequency Combined Method Based on Improved Complete Ensemble EMD and Affine Smooth Pseudo Wigner-Ville Distribution. UPB Sci. Bull. Ser. D Mech. Eng. 2021, 83, 71–86. [Google Scholar]
  43. Bertrand, J.; Bertrand, P. A class of affine Wigner functions with extended covariance properties. J. Math. Phys. 1992, 33, 2515–2527. [Google Scholar] [CrossRef]
  44. Bertrand, J.; Bertrand, P. Affine time-frequency distributions. In Time-Frequency Signal Analysis—Methods and Applications; Longman-Cheshire: Melbourne, Australia, 1992; Volume 1992, pp. 118–140. [Google Scholar]
  45. Flandrin, P.; Rioul, O. Affine smoothing of the Wigner-Ville distribution. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, NM, USA, 3–6 April 1990; Volume 5, pp. 2455–2458. [Google Scholar] [CrossRef]
  46. Ovarlez, J.; Bertrand, J.; Bertrand, P. Computation of affine time-frequency distributions using the fast Mellin transform. In Proceedings of the ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, San Francisco, CA, USA, 23–25 March 1992; Volume 5, pp. 117–120. [Google Scholar] [CrossRef]
  47. Shenoy, R.; Parks, T. Affine Wigner distributions. In Proceedings of the ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, San Francisco, CA, USA, 23–25 March 1992; Volume 5, pp. 185–188. [Google Scholar] [CrossRef]
  48. Flandrin, P.; Goncalves, P. Geometry of affine distributions. In Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis, Pittsburgh, PA, USA, 9 October 1994; pp. 80–83. [Google Scholar] [CrossRef]
  49. Murray, R.; Papandreou-Suppappola, A.; Boudreaux-Bartels, G. New higher order affine time-frequency representations. In Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis (Cat. No.98TH8380), Victoria, BC, Canada, 4–6 October 1998; pp. 325–328. [Google Scholar] [CrossRef]
  50. Iribarren, M.; Martin, C.S.; Saavedra, P. Processing of Non-Stationary Vibrations Using the Affine Wigner Distribution. In Signal Analysis and Prediction; Birkhäuser: Boston, MA, USA, 1998; pp. 89–101. [Google Scholar] [CrossRef]
  51. Gosme, J.; Richard, C.; Goncalves, P. Diffusion equations for adaptive affine distributions. In Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, Montreal, QC, USA, 17–21 May 2004; Volume 2, p. ii-629. [Google Scholar] [CrossRef]
  52. Xiong, G.; Yang, X.N. Pseudo Affine Spectral Correlation Analysis: Definition and the Kernel Formulation. In Proceedings of the 2006 International Conference on Communications, Circuits and Systems, Guilin, China, 25–28 June 2006; Volume 1, pp. 248–252. [Google Scholar] [CrossRef]
  53. Gavrovska, A.; Bogdanović, V.; Reljin, I.; Reljin, B. Automatic heart sound detection in pediatric patients without electrocardiogram reference via pseudo-affine Wigner–Ville distribution and Haar wavelet lifting. Comput. Methods Programs Biomed. 2014, 113, 515–528. [Google Scholar] [CrossRef] [PubMed]
  54. Chapter 17—Getting Started with a Practical and Efficient Time-Frequency Toolbox TFSAP-7.000. In Time-Frequency Signal Analysis and Processing, 2nd ed.; Boashash, B. (Ed.) Academic Press: Oxford, UK, 2016; pp. 967–988. [Google Scholar] [CrossRef]
  55. Boashash, B.; Ouelha, S. Efficient software platform TFSAP 7.1 and Matlab package to compute Time–Frequency Distributions and related Time-Scale methods with extraction of signal characteristics. Softwarex 2018, 8, 48–52. [Google Scholar] [CrossRef]
  56. Pruša, Z.; Søndergaard, P.; Balazs, P.; Holighaus, N. LTFAT: A Matlab/Octave toolbox for sound processing. In Proceedings of the 10th International Symposium on Computer Music Multidisciplinary Research (CMMR), Marseille, France, 15–18 October 2013; pp. 299–314. [Google Scholar]
  57. Lerga, J.; Saulig, N.; Stanković, L.; Seršić, D. Rule-Based EEG Classifier Utilizing Local Entropy of Time–Frequency Distributions. Mathematics 2021, 9, 451. [Google Scholar] [CrossRef]
  58. Sandsten, M.; Brynolfsson, J.; Reinhold, I. The Matched Window Reassignment. In Proceedings of the 2018 26th European Signal Processing Conference (EUSIPCO), Rome, Italy, 3–7 September 2018; pp. 2340–2344. [Google Scholar] [CrossRef]
  59. Baraniuk, R.; Flandrin, P.; Janssen, A.; Michel, O. Measuring time-frequency information content using the Renyi entropies. IEEE Trans. Inf. Theory 2001, 47, 1391–1409. [Google Scholar] [CrossRef] [Green Version]
  60. Hoffmann, U.; Garcia, G.; Vesin, J.M.; Diserens, K.; Ebrahimi, T. A Boosting Approach to P300 Detection with Application to Brain-Computer Interfaces. In Proceedings of the 2nd International IEEE EMBS Conference on Neural Engineering, Arlington, VA, USA, 16–19 March 2005; pp. 97–100. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Example of three elementary Gaussian atoms in time, frequency, and TF domains (spectrogram).
Figure 1. Example of three elementary Gaussian atoms in time, frequency, and TF domains (spectrogram).
Sensors 22 03727 g001
Figure 2. Example of three elementary Gaussian atoms in the time, frequency, and TF domains (Wigner-Ville) with interferences.
Figure 2. Example of three elementary Gaussian atoms in the time, frequency, and TF domains (Wigner-Ville) with interferences.
Sensors 22 03727 g002
Figure 3. TFRs of a noisy mono-component linear FM signal with estimated IFs for the SNR of 5 dB.
Figure 3. TFRs of a noisy mono-component linear FM signal with estimated IFs for the SNR of 5 dB.
Sensors 22 03727 g003
Figure 4. TFRs of a noisy mono-component sinusoidal FM signal with estimated IFs for the SNR of 5 dB.
Figure 4. TFRs of a noisy mono-component sinusoidal FM signal with estimated IFs for the SNR of 5 dB.
Sensors 22 03727 g004
Figure 5. TFRs of a noisy multi-component signal with successive parabolic and linear FM components for 5 dB SNR.
Figure 5. TFRs of a noisy multi-component signal with successive parabolic and linear FM components for 5 dB SNR.
Sensors 22 03727 g005
Figure 6. TFRs of a noisy multi-component signal with concurrent constant and parabolic FM components for the 5 dB SNR.
Figure 6. TFRs of a noisy multi-component signal with concurrent constant and parabolic FM components for the 5 dB SNR.
Sensors 22 03727 g006
Figure 7. TFRs of an EEG P300 signal for the Cz electrode (averaged over 466 trials).
Figure 7. TFRs of an EEG P300 signal for the Cz electrode (averaged over 466 trials).
Sensors 22 03727 g007
Figure 8. TFRs of the 2020 Zagreb earthquake vertical acceleration seismogram.
Figure 8. TFRs of the 2020 Zagreb earthquake vertical acceleration seismogram.
Sensors 22 03727 g008
Table 1. Estimation of IF MSE for noisy mono-component linear FM signal for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively, 1 × 10−5.
Table 1. Estimation of IF MSE for noisy mono-component linear FM signal for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively, 1 × 10−5.
10 dB5 dB0 dB−5 dB
SP1.582.39111.241329.70
SPWV1.462.0898.581320.50
WV1.6434.19231.40918.89
AMWSC4.619.40326.762039.20
ASPWV277.84695.281789.102647.70
AUB31.8877.71706.322093.80
AAU75.61280.151667.803137.30
ADF66.84181.901249.602381.20
RSP11.8835.94497.451811.40
RGSP13.0937.51683.302014.00
RMSC22.8588.66609.311662.40
RPWV42.16100.22521.221288.90
RSPWV1.5711.84343.781645.20
Table 2. TFR Rényi entropy for a noisy mono-component linear FM signal for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively.
Table 2. TFR Rényi entropy for a noisy mono-component linear FM signal for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively.
10 dB5 dB0 dB−5 dB
SP15.22515.47515.95916.480
SPWV14.90515.29315.89816.437
WV14.86415.60016.34316.641
AMWSC13.77414.24215.07815.883
ASPWV15.02215.55115.93316.134
AUB14.19315.07115.87516.232
AAU14.69015.51616.19916.446
ADF14.30115.19115.91916.187
RSP12.88613.17213.68214.187
RGSP12.97213.27213.74214.174
RMSC12.79613.06113.55014.074
RPWV13.10714.01814.80015.104
RSPWV12.24012.85413.68414.349
Table 3. Estimation of IF MSE for a noisy mono-component sinusoidal FM signal for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively, 1 × 10−4.
Table 3. Estimation of IF MSE for a noisy mono-component sinusoidal FM signal for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively, 1 × 10−4.
10 dB5 dB0 dB−5 dB
SP6.8810.2836.54153.62
SPWV3.096.0534.86150.80
WV187.12153.31135.68150.92
AMWSC17.8326.0873.08197.99
ASPWV11.5860.63150.89217.60
AUB92.3097.53134.96185.98
AAU99.05127.31187.59245.58
ADF100.06109.18152.09200.02
RSP1.798.5472.52197.80
RGSP1.068.6890.08217.16
RMSC7.9324.1989.60197.35
RPWV2.7413.0965.93145.99
RSPWV1.928.0862.18184.94
Table 4. TFR Rényi entropy for a noisy mono-component sinusoidal FM signal for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively.
Table 4. TFR Rényi entropy for a noisy mono-component sinusoidal FM signal for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively.
10 dB5 dB0 dB−5 dB
SP15.94716.11116.38116.654
SPWV15.72415.97316.30016.586
WV16.11816.35516.56916.690
AMWSC15.17015.40215.78816.227
ASPWV15.41615.87116.08016.240
AUB15.48215.76516.07016.340
AAU15.61515.95616.26916.518
ADF15.49915.78316.05816.278
RSP13.57913.72013.99814.301
RGSP13.36813.58713.92314.249
RMSC13.52213.68913.99114.207
RPWV14.12714.57814.95715.152
RSPWV13.65013.89014.22614.488
Table 5. Estimation of IF MSE for a noisy multi-component signal with successive parabolic and linear FM components for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively, 1 × 10−5.
Table 5. Estimation of IF MSE for a noisy multi-component signal with successive parabolic and linear FM components for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively, 1 × 10−5.
10 dB5 dB0 dB−5 dB
SP127.12166.02304.511125.70
SPWV67.96136.99325.231160.50
WV1046.801068.601061.801206.60
AMWSC76.27113.90350.101501.40
ASPWV148.89480.461292.901816.10
AUB684.84783.601122.601546.40
AAU848.841098.201774.302321.90
ADF750.37910.031337.601707.00
RSP22.1865.28456.631509.00
RGSP9.3939.73445.731513.40
RMSC9.8445.78524.601540.00
RPWV61.37179.57514.641082.20
RSPWV42.95107.03475.401487.00
Table 6. TFR Rényi entropy for a noisy multi-component signal with successive parabolic and linear FM components for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively.
Table 6. TFR Rényi entropy for a noisy multi-component signal with successive parabolic and linear FM components for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively.
10 dB5 dB0 dB−5 dB
SP19.61619.83920.22620.519
SPWV19.37919.69620.14820.449
WV20.09720.30120.51820.637
AMWSC18.57718.93219.56120.118
ASPWV19.18419.74220.10820.267
AUB19.32519.65820.02720.274
AAU19.54219.89720.25820.478
ADF19.35819.69220.02720.231
RSP16.63916.91617.41617.869
RGSP16.28816.64217.21717.733
RMSC16.10516.49117.12217.662
RPWV17.57518.25618.85819.097
RSPWV16.83717.24417.75318.032
Table 7. Estimation of IF MSE for a noisy multi-component signal with concurrent constant and parabolic FM components for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively.
Table 7. Estimation of IF MSE for a noisy multi-component signal with concurrent constant and parabolic FM components for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively.
10 dB5 dB0 dB−5 dB
(a) First Component, 1 × 10−6
SP5.2622.411032.103914.40
SPWV7.6038.281017.903877.20
WV4375.406032.508521.809397.40
AMWSC1.292.1569.361446.50
ASPWV3384.303910.004724.805443.90
AUB4108.604593.105160.005430.20
AAU3774.003900.404066.403775.90
ADF4872.905352.505718.605528.00
RSP227.92404.641942.204245.50
RGSP168.09487.042243.004406.40
RMSC508.97930.731785.504107.40
RPWV3648.104975.406204.106903.10
RSPWV45.29193.541584.203961.30
(b) Second Component, 1 × 10−5
SP5.168.04136.88483.73
SPWV5.768.80133.50467.15
WV237.34306.31405.21447.99
AMWSC3.628.81140.19444.00
ASPWV400.74450.91557.93635.48
AUB49.2094.12298.59452.07
AAU19.2954.31273.11445.41
ADF137.54216.86408.39508.84
RSP26.1752.88288.19624.45
RGSP26.7373.70371.42700.51
RMSC20.9039.49277.02600.25
RPWV183.87251.13398.47508.63
RSPWV5.2321.09222.81544.92
Table 8. TFR Rényi entropy for a noisy multi-component signal with concurrent linear and parabolic FM components for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively.
Table 8. TFR Rényi entropy for a noisy multi-component signal with concurrent linear and parabolic FM components for different SNR levels (averaged over 100 noise realizations), cell colors range from green for the best result to red for the worst result in each column respectively.
10 dB5 dB0 dB−5 dB
SP16.03316.16816.39216.643
SPWV15.79416.05116.34516.598
WV16.23216.45316.62516.698
AMWSC14.64214.94615.49716.125
ASPWV15.73816.03016.18216.271
AUB15.23915.71016.12216.342
AAU15.26015.78016.23316.483
ADF15.28715.73716.09916.277
RSP13.82013.97114.16214.372
RGSP13.91314.03814.18214.315
RMSC13.61513.78614.07514.305
RPWV14.24514.74115.06315.181
RSPWV13.30813.77414.22314.506
Table 9. TFR Rényi entropy of an EEG P300 signal for the Cz electrode (averaged over 466 trials), cell colors range from green for the best result to red for the worst result in each column respectively.
Table 9. TFR Rényi entropy of an EEG P300 signal for the Cz electrode (averaged over 466 trials), cell colors range from green for the best result to red for the worst result in each column respectively.
Rényi Entropy
SP13.414
SPWV13.251
WV14.366
AMWSC16.689
ASPWV16.200
AUB16.300
AAU16.302
ADF16.325
RSP11.369
RGSP11.230
RMSC10.178
RPWV11.782
RSPWV11.439
Table 10. Rényi entropy for the 2020 Zagreb earthquake vertical acceleration seismogram, cell colors range from green for the best result to red for the worst result in each column respectively.
Table 10. Rényi entropy for the 2020 Zagreb earthquake vertical acceleration seismogram, cell colors range from green for the best result to red for the worst result in each column respectively.
Rényi Entropy
SP18.961
SPWV18.871
WV19.831
AMWSC21.526
ASPWV20.956
AUB21.350
AAU21.417
ADF21.260
RSP16.636
RGSP16.441
RMSC15.084
RPWV18.099
RSPWV16.494
Table 11. Computation time for analyzed TFRs for tested signals (averaged over 100 realizations, SNR of 5 dB), cell colors range from green for the best result to red for the worst result in each column respectively, in milliseconds (ms).
Table 11. Computation time for analyzed TFRs for tested signals (averaged over 100 realizations, SNR of 5 dB), cell colors range from green for the best result to red for the worst result in each column respectively, in milliseconds (ms).
Noisy Mono Component Linear FMNoisy Mono Component Sinusoidal FMNoisy Three Component Parabolic and Linear FMNoisy Two Component Constant and Parabolic FMEEG P300 Signal2020 Zagreb Earthquake
Number of time samples128128384128128487
SP1.64481.49877.49161.41241.410512.69
SPWV23.02422.617225.1222.42822.569369.87
WV1.46771.33497.70091.29961.290513.539
AMWSC10.2199.779560.6759.78019.664393.012
ASPWV188.92188.193122.5188.94189.514356.7
AUB284.9283.592110.8281.91283.342476.9
AAU749.92751.216348.1745.8748.247999
ADF250.64250.611797.7250.28251.042049.4
RSP19.52619.205170.0819.44919.203275.86
RGSP16.12615.649145.1115.56513.163192.5
RMSC275.13268.872650.7273.55269.984207.1
RPWV3.77733.425324.5593.41973.494841.555
RSPWV44.62343.586412.5143.93143.878670.22
Table 12. Computational time for analyzed TFRs for a noisy linear FM signal with different numbers of time samples (averaged over 100 realizations, SNR of 5 dB), cell colors range from green for the best result to red for the worst result in each column respectively, in milliseconds (ms).
Table 12. Computational time for analyzed TFRs for a noisy linear FM signal with different numbers of time samples (averaged over 100 realizations, SNR of 5 dB), cell colors range from green for the best result to red for the worst result in each column respectively, in milliseconds (ms).
Number of Time Samples3264128256512
SP0.533080.689261.41453.658310.057
SPWV1.95745.762522.27592.194407.77
WV0.381850.474531.27142.96811.174
AMWSC2.27363.94729.608728.944104.7
ASPWV26.75547.652179.4455.844537.6
AUB46.272103.14278.18791.832500.5
AAU99.284257.56727.062276.18031.1
ADF42.6894.99244.54681.512116.2
RSP2.21715.466818.8770.807285.32
RGSP2.11115.028315.1851.456201.96
RMSC16.18863.321261.11110.34600.3
RPWV0.996581.37833.35179.950938.288
RSPWV3.866211.51442.46169.57721.94
Table 13. Computational time for analyzed TFRs of different dimensions for a noisy linear FM signal with 128 time samples (averaged over 100 realizations, SNR of 5 dB), cell colors range from green for the best result to red for the worst result in each column respectively, in milliseconds (ms).
Table 13. Computational time for analyzed TFRs of different dimensions for a noisy linear FM signal with 128 time samples (averaged over 100 realizations, SNR of 5 dB), cell colors range from green for the best result to red for the worst result in each column respectively, in milliseconds (ms).
TFR Dimension 32 × 32 64 × 64 128 × 128 256 × 256 512 × 512
SP1.41931.34271.44661.64532.0954
SPWV23.05822.47822.41523.57624.098
WV1.09211.09861.27741.36181.5274
AMWSC3.18865.05769.60719.04837.809
ASPWV34.87957.728101.24188.75447.8
AUB19.10128.75748.06188.604160.95
AAU26.89141.81173.08126.18228.62
ADF16.00125.11343.07975.009141.32
RSP7.223611.60419.85138.00772.442
RGSP5.82199.131916.08529.33857.913
RMSC66.308134.42269.46542.51129.2
RPWV2.83392.77273.35494.84378.2814
RSPWV30.52934.78242.88859.8695.77
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Bačnar, D.; Saulig, N.; Petrijevčanin Vuksanović, I.; Lerga, J. Entropy-Based Concentration and Instantaneous Frequency of TFDs from Cohen’s, Affine, and Reassigned Classes. Sensors 2022, 22, 3727. https://doi.org/10.3390/s22103727

AMA Style

Bačnar D, Saulig N, Petrijevčanin Vuksanović I, Lerga J. Entropy-Based Concentration and Instantaneous Frequency of TFDs from Cohen’s, Affine, and Reassigned Classes. Sensors. 2022; 22(10):3727. https://doi.org/10.3390/s22103727

Chicago/Turabian Style

Bačnar, David, Nicoletta Saulig, Irena Petrijevčanin Vuksanović, and Jonatan Lerga. 2022. "Entropy-Based Concentration and Instantaneous Frequency of TFDs from Cohen’s, Affine, and Reassigned Classes" Sensors 22, no. 10: 3727. https://doi.org/10.3390/s22103727

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop