Coarse-Graining Approaches in Univariate Multiscale Sample and Dispersion Entropy

The evaluation of complexity in univariate signals has attracted considerable attention in recent years. This is often done using the framework of Multiscale Entropy, which entails two basic steps: coarse-graining to consider multiple temporal scales, and evaluation of irregularity for each of those scales with entropy estimators. Recent developments in the field have proposed modifications to this approach to facilitate the analysis of short-time series. However, the role of the downsampling in the classical coarse-graining process and its relationships with alternative filtering techniques has not been systematically explored yet. Here, we assess the impact of coarse-graining in multiscale entropy estimations based on both Sample Entropy and Dispersion Entropy. We compare the classical moving average approach with low-pass Butterworth filtering, both with and without downsampling, and empirical mode decomposition in Intrinsic Multiscale Entropy, in selected synthetic data and two real physiological datasets. The results show that when the sampling frequency is low or high, downsampling respectively decreases or increases the entropy values. Our results suggest that, when dealing with long signals and relatively low levels of noise, the refine composite method makes little difference in the quality of the entropy estimation at the expense of considerable additional computational cost. It is also found that downsampling within the coarse-graining procedure may not be required to quantify the complexity of signals, especially for short ones. Overall, we expect these results to contribute to the ongoing discussion about the development of stable, fast and robust-to-noise multiscale entropy techniques suited for either short or long recordings.


Introduction
A system is complex when it entails a number of components intricately entwined altogether (e.g., the subway network of the New York City) [1]. Following Costa's framework [2,3], the complexity in univariate signals denotes "meaningful structural richness", which may be in contrast with regularity measures defined from entropy metrics such as sample entropy (SampEn), permutation entropy, (PerEn), and dispersion entropy (DispEn) [3][4][5][6]. In fact, these entropy techniques assess repetitive patterns and return maximum values for completely random processes [3,5,7]. However, a completely ordered signal with a small entropy value or a completely disordered signal with maximum entropy value is the least complex [3,5,8]. For instance, white noise is more irregular than 1/ f noise (pink noise), although the latter is more complex because 1/ f noise contains long-range correlations and its 1/ f decay produces a fractal structure in time [3,5,8].
From the perspective of physiology, some diseased individuals' recordings, when compared with those for healthy subjects, are associated with the emergence of more regular behavior, thus leading to lower entropy values [3,9]. In contrast, certain pathologies, such as cardiac arrhythmias, are associated with highly erratic fluctuations with statistical characteristics resembling uncorrelated noise. The entropy values of these noisy signals are higher than those of healthy individuals, even though the healthy individuals' time series show more physiologically complex adaptive behavior [3,10].
In brief, the concept of complexity for univariate physiological signals builds on the following three hypotheses [3,5]: • The complexity of a biological or physiological time series indicates its ability to adapt and function in an ever-changing environment. • A biological time series requires operating across multiple temporal and spatial scales and so its complexity is similarly multiscaled and hierarchical. • A wide class of disease states, in addition to ageing, which decrease the adaptive capacity of the individual, appear to degrade the information carried by output variables.
Therefore, the multiscale-based methods focus on quantifying the information expressed by the physiological dynamics over multiple temporal scales.
To provide a unified framework for the evaluation of impact of diseases in physiological signals, multiscale SampEn (MSE) [3] was proposed to quantify the complexity of signals over multiple temporal scales. The MSE algorithm includes two main steps: (1) coarse-graining technique-i.e., combination of moving average (MA) filter and downsampling (DS) process-; and (2) calculation of SampEn of the coarse-grained signals at each scale factor τ [3]. A low-pass Butterworth (BW) filter was used as an alternative to MA to limit aliasing and avoid ripples [11]. To differentiate it from the original MSE, we call this method MSE BW herein.
Since their introduction, MSE and MSE BW have been widely used to characterize physiological and non-physiological signals [12]. However, they have several main shortcomings [12][13][14]. First, the coarse-graining process causes the length of a signal to be shortened by the scale factor τ as a consequence of the downsampling in the process. Therefore, when the scale factor increases, the number of samples in the coarse-grained sequence decreases considerably [14]. This may yield an unstable estimation of entropy. Second, SampEn is either undefined or unreliable for short coarse-grained time series [13,14].
To alleviate the first problem of MSE, intrinsic mode SampEn (InMSE) [15] and refined composite MSE (RCMSE) [14] were developed [15]. The coarse-graining technique is substituted by an approach based on empirical mode decomposition (EMD) in InMSE. The length of coarse-grained series obtained by InMSE is equal to that of the original signal, leading to more stable entropy values. Nevertheless, EMD-based approaches have certain limitations such as sensitivity to noise and sampling [16]. At the scale factor τ, RCMSE considers τ different coarse-grained signals, corresponding to different starting points of the coarse-graining process [14]. Therefore, RCMSE yields more stable results in comparison with MSE. Nevertheless, both InMSE and RCMSE may lead to undefined values for short signals as a consequence of using SampEn in the second step of their algorithms [13]. Additionally, the SampEn-based approaches may not be fast enough for some real-time applications.
To deal with these deficiencies, multiscale DispEn (MDE) based on our introduced DispEn was developed [13]. Refined composite MDE (RCMDE) was then proposed to improve the stability of the MDE-based values [13]. It was found that MDE and RCMDE have the following advantages over MSE and RCMSE: (1) they are noticeably faster as a consequence of using DispEn with computational cost of O(N)-where N is the signal length-, compared with the O(N 2 ) for SampEn; (2) they result in more stable profiles for synthetic and real signals; (3) MDE and RCMDE discriminate different kinds of physiological time series better than MSE and RCMSE; and (4) they do not yield undefined values [13].
The aim of this research is to contribute to the understanding of different alternatives to coarse-graining in complexity approaches. To this end, we first revise the frequency responses for the three main filtering processes (i.e., MA, BW, and EMD) used in such methods. The role of downsampling in the classical coarse-graining process, which has not been systematically explored yet, is then investigated in the article. We assess the impact of coarse-graining in multiscale entropy estimations based on both SampEn and DispEn. To compare these methods, several synthetic data and two real physiological datasets are employed. For the sake of clarity, a flowchart of the alternatives to the coarse-graining method in addition to the datasets used in this article is shown in Figure 1.

Multiscale Entropy-Based Approaches
The MSE-and MDE-based methods include two main steps: (1) coarse-graining process; and (2) calculation of SampEn and DispEn at each scale τ. For simplicity, we detail only the DispEn-based complexity algorithms. Likewise, the SampEn-based algorithms are defined.

Coarse-Graining Approaches
A coarse-graining technique with DS denotes a decimation by scale factor τ. Decimation is defined as two steps [17,18]: (1) reducing high-frequency time series components with a digital low-pass filter; and (2) DS the filtered time series by τ; that is, keep only one every τ sample points.
Assume that we have a univariate signal of length L: u = {u 1 , u 2 , . . . , u i , . . . , u L }. In the coarse-graining process, the original signal u is first filtered by an MA-a low-pass finite-impulse response (FIR) filter-as follows: The frequency response of the MA filter is as follows [19]: where f denotes the normalized frequency ranging from 0 to 0.5 cycles per sample (normalized Nyquist frequency). The frequency response of the MA filter has several shortcomings: (1) a slow roll-off of the main lobe; (2) large transition band; (3) and important side lobes in the stop-band. To alleviate these problems, a low-pass BW filter was proposed [11]. This filter provides a maximally flat (no ripples) response [19]. The squared magnitude of the frequency of BW filter is defined as follows: H e j2π f 2 where f c and n denote the normalized cut-off frequency and filter order, respectively [11,19]. Herein, n = 6 and f c = 0.5 τ [11]. The original signal u is filtered by BW filter. In fact, the low-pass filters eliminate the fast temporal scales (higher frequency components) to take into account progressively slower time scales (lower frequency components).
Next, the time series filtered by either MA or BW is downsampled by the scale factor τ. Assume the . In this study, we consider the coarse-graining process with and without DS. MSE and MDE with MA filter and without DS are respectively named MSE MA and MDE MA . MSE MA and MDE MA with DS are termed MSE and MDE herein.

Calculation of DispEn or SampEn at Every Scale Factor
The DispEn or SampEn value is calculated for each coarse-grained signal x (τ) = {x (τ) j }. It is worth noting that MDE is more than the combination of the coarse-graining [3] with DispEn: the mapping based on the normal cumulative distribution function (NCDF) used in the calculation of DispEn [6] for the first temporal scale is maintained across all scales. That is, in MDE and RCMDE, µ and σ of NCDF are respectively set at the average and standard deviation (SD) of the original signal and they remain constant for all scale factors. This approach is similar to keeping the threshold r constant fixed (usually 0.15 of the SD of the original signal) in the MSE-based algorithms [3]. In a number of studies (e.g., [3,20]), it was found that keeping r constant is preferable to recalculating the threshold r at each scale factor separately.

Refined Composite Multiscale Dispersion Entropy (RCMDE)
At scale factor τ, RCMDE considers τ different coarse-grained signals, corresponding to different starting points of the coarse-graining process. Then, for each of these shifted series, the relative frequency of each dispersion pattern is calculated. Finally, the RCMDE value is defined as the Shannon entropy value of the averages of the rates of appearance of dispersion patterns of those shifted sequences [13]. The MA filter used in RCMDE and RCMSE may be substituted by the BW filter, respectively called RCMDE BW and RCMSE BW here.

Intrinsic Mode Dispersion Entropy (InMDE)
Due to the advantages of DispEn over SampEn for short signals, intrinsic mode DispEn (InMDE) based on the algorithm of InMSE is proposed herein. The algorithm of InMDE includes the following two key steps: 1. Calculation of the sum of the intrinsic mode functions (IMFs) obtained by EMD: In this step, the original signal u is decomposed to I MF α (1 ≤ α ≤ τ max − 1) and a residual signal It is worth noting that the first IMF, I MF 1 , shows the highest frequency component in a signal, while the last IMF, I MF τ max , reflects the trend of the time series. Next, the cumulative sums of IMFs (CSI) for each scale factor τ are defined as follows [15]: where I MF λ denotes the λ th IMF obtained by EMD. Thus, CSI (1) is equal to the original signal u.
2. Calculation of DispEn of CSI (τ) (x) at each scale factor: The DispEn value is calculated at each scale factor. Like MDE and RCMDE, µ and σ of NCDF are respectively set at the average and SD of the original signal and they remain constant for all scale factors in InMDE.
It is worth noting that InMSE and InMDE do not downsample the filtered signals. That is, the number of samples for each CSI (τ) (x) is equal to that for the original signal, leading to more reliable results for higher scale factors. The complexity metrics for univariate signals and their characteristics are summarized in Table 1. The Matlab codes used in this study are described in Appendix.

Parameters of the Multiscale Entropy Approaches
For all the SampEn-based methods, we set d = 1, m = 2, and r = 0.15 of the SD of the original signal [3]. For all the DispEn-based approaches, we set d = 1 and c = 6. For more information about c and d, please refer to [6,13].
For the DispEn-based complexity measures without DS, as the length of coarse-grained signals is equal to that of the original signal, it is advisable to follow c m < L. For the SampEn-based complexity approaches without DS, it is recommended to have at least 10 m (or preferably 20 m ) sample points for the embedding dimension m [21,22].
For the DispEn-based multiscale approaches with DS, since the decimation process causes the length of a signal decreases to L τ max , c m < L τ max is recommended. Similarly, for the SampEn-based complexity techniques with DS, 10 m < L τ max [3] is recommended. On the other hand, in RCDME, we consider τ coarse-grained time series with length L τ max . Therefore, the total sample points calculated in RCMDE is τ × L τ max ≈ L. Thus, RCMDE follows c m < L, leading to more reliable results, especially for short signals. Likewise, it is advisable to have at least 10 m (or preferably 20 m ) sample points for RCMSE with embedding dimension m.

Evaluation Signals
In this section, the synthetic and real signals used in this study to evaluate the behaviour of the multiscale entropy approaches are described.

Synthetic Signals
White noise is more irregular than pink noise (1/ f noise), although the latter is more complex because pink noise contains long-range correlations and its 1/ f decay produces a fractal structure in time [3,5,8]. Therefore, white and pink noise are two important signals to evaluate the multiscale entropy techniques [3,5,8,[23][24][25].
In order to investigate the change in the behavior of a nonlinear system, the Lorenz attractor is used. Further details can be found in [26,27]. To evaluate the effect of filtering and downsampling processes on different frequency components of time series, multi-harmonic signals are employed [16]. Finally, to inspect the effect of noise on multiscale approaches, white noise was added to the Lorenz and multi-harmonic time series.

Real Biomedical Datasets
Multiscale entropy techniques are broadly used to characterize physiological recordings [2,3,12,25]. To this end, electroencephalograms (EEGs) [28] and stride internal fluctuations [29] are used to distinguish different kinds of dynamics of time series.

Dataset of Focal and Non-Focal Brain Activity
The ability of complexity measures to discriminate focal from non-focal signals is evaluated by the use of an EEG dataset (publicly-available at [30]) [28]. The dataset includes five patients and, for each patient, there are 750 focal and 750 non-focal bivariate time series. The length of each signal was 20 s with sampling frequency of 512 Hz (10,240 samples). For more information, please, refer to [28]. All subjects gave written informed consent that their signals from long-term EEG might be used for research purposes [28]. Before computing the entropies, the EEG signals were digitally band-pass filtered between 0.5 Hz and 150 Hz using a fourth-order Butterworth filter.

Dataset of Stride Internal Fluctuations
To compare multiscale entropy methods, stride interval recordings are used [29,31]. The time series were recorded from five young, healthy men (23-29 years old) and five healthy old adults (71-77 years old). All the individuals walked continuously on level ground around an obstacle-free path for 15 min. The stride interval was measured by the use of ultra-thin, force sensitive resistors placed inside the shoe. For more information, please refer to [29]. To investigate the frequency responses of MA, BW, and CSI, we used 200 realizations of white noise with length 512 sample points following [32,33]. The average Fourier spectra obtained by MA, BW, and CSI at different scale factors (i.e., 2, 4, 6, 8, and 10) are depicted in Figure 2. The results show that BW, MA, and CSI can be considered as low-pass filters with different cut-off frequencies.

Synthetic Signals
The results for MA and BW filters are in agreement with their theoretical frequency responses shown in Equations (2) and (3), respectively. The results for CSI are also in agreement with the fact that I MF 1 corresponds to a half-band high-pass filter and I MF λ (λ ≥ 2) can be considered as a filter bank of overlapping bandpass filters [33].
The magnitude of the frequency response for BW, compared with MA, is flatter in the passband, side lobes in its stopband are not present, and the roll-off is faster. Therefore, the filter's frequency response leads to a more accurate elimination of the components with frequency above cut-off frequencies. This fact reduces aliasing while the filtered signals are downsampled. The behavior of the frequency response for CSI is similar to that for BW. However, the cut-off frequencies obtained by CSI are considerably smaller than those for BW. This fact results in very low entropy values at high scale factors.

Effect of Different Low-Pass Filters on Multi-Harmonic and Lorenz Series
To understand the effect of MA, BW, and CSI on multi-harmonic signals, we use b i = cos(2π10i) + cos(2π20i) + cos(2π50i) with sampling frequency 200 Hz and length 20 s. The first second of the signal b is depicted in Figure 3. To show the frequency components of b and their amplitude values, we used the combination of Hilbert transform and recently introduced variational mode decomposition (VMD). VMD is a generalization of the classic Wiener filter into adaptive, multiple bands [16]. After decomposing the original signals into its IMFs using VMD, we employ the Hilbert transform to find the instantaneous frequency of each IMF [16,34].
The frequency components of b and their corresponding amplitudes are depicted in Figure 3a. The Hilbert transform of b filtered by 4-sample MA (Figure 3b) illustrates that the harmonic cos(2π50i) is completely eliminated, in agreement with the fact that MA is a low-pass filter with cut-off frequency   Figure 4). In fact, the largest changes in entropy values are related to the main frequency components of a multi-harmonic time series. To investigate the effect of noise on MDE values, we created g i = b i + η, where η denotes a uniform random variable between 0 to 1. The MDE values for g, plotted in Figure 4b, illustrate a decrease at temporal scales from 1 to 19 and then the entropy values become approximately constant. This is in agreement with the fact that the smaller scale factors correspond to higher frequency components, whereas smaller scales correspond to lower frequencies [35]. Comparing Figures 4a  and b shows that after filtering the effect of white noise by MA, the profiles for b and g are very close (temporal scales 19 to 25). This suggests that white noise affects lower temporal scales. It is worth noting that the behavior of MDE BW and that of MDE MA is similar.  However, the effect of CSI at scale 2 on b is shown in Figure 5. The results, compared with those for MA (see Figure 4b), illustrate similar behavior of CSI at scale 2 and MA at scale 4 in terms of the elimination of the highest frequency component of b. This is in agreement with the fact that, at a specific scale factor, the cut-off frequency for CSI is considerably lower than that for MA or BW (see Figure 2). We also generated the Lorenz signal o with length 10,000 sample points and sampling frequency ( f s ) 300 Hz. To have a nonlinear behavior, λ = 10, β = 8 3 , and ρ = 99.96 were set [26,27]. The signal o and o filtered by MA at scale 10 are shown in Figure 6.   To inspect the effect of additive noise on MDE values, we created q i = o i + η, where η is a random variable between 0 to 1. The MDE values for q, plotted in Figure 7b, illustrate a decrease at low temporal scale and then an increase at high time scale factors. It is also found that the MDE values of o and q are approximately equal at scales between 18 to 25. This is also consistent with the fact that lower scale factors correspond to higher frequency components, whereas larger scales correspond to lower frequencies [35].

Effect of Downsampling and Sampling Frequency on Multiscale Entropy Methods
To investigate the effect of downsampling (without low-pass filtering) on multiscale entropy approaches, we created the signal s i = cos(2πi) with length 300 sample points and sampling frequency 10 Hz, and (b) w i = cos(2πi) with length 300 sample points and sampling frequency 100 Hz. The signals and their downsampled series by a factor of 12 are depicted in Figure 8.
When the sampling frequency of a time series is close to its main frequency components (see s- Figure

Multiscale Entropy Methods vs. Noise
All of the complexity methods are used to distinguish the dynamics of white from pink noise. The mean and SD of results for the signals with length 8000 (long series) and 400 (short series) sample points are respectively depicted in Figures 9 and 10. The results obtained by the complexity techniques with DS show that the entropy values decrease monotonically with scale factor τ for white noise. However, for pink noise, the entropy values become approximately constant over larger-scale factors. These are in agreement with the fact that, unlike white noise, 1/ f noise has structure across temporal scale factors [3,5]. The profiles for MDE MA   To compare the results obtained by the complexity algorithms, we used the coefficient of variation (CV) defined as the SD divided by the mean. We use such a metric as the SDs of signals may increase or decrease proportionally to the mean. The CV values at scale 10, as a trade-off between low and high scale factors, for noise signals with length 8000 and 400 sample points are respectively illustrated in Tables 2 and 3. Of note is that we consider scale 25 and 5 for the MSE MA and MDE MA , and InMSE and InMDE profiles, respectively. The refined composite technique decreases the CVs for all the MSE-and MDE-based algorithms, showing its advantage to improve the stability of results for short and long noise. The smallest CVs for long pink and white noise are our developed MDE BW without DS and RCMDE BW methods, respectively. The smallest CVs for short pink and white noise are achieved by RCMDE BW and RCMDE, respectively. Overall, the smallest CVs are obtained by the DispEn-based complexity measures.  To understand the effect of the refined composite technique on nonlinear signals without noise, we created 40 realizations of two Lorenz signals with lengths of 450 and 4500 sample points and sampling frequency ( f s ) 150 Hz. To have a nonlinear behavior, the values of λ = 10, β = 8 3 , and ρ = 28 were used in the Lorenz system [26,27]. The results obtained by MSE, MDE, RCMSE, and RCMDE are depicted in Figure 11 and are in agreement with [25,27]. Of note is that the entropy values for RCMSE BW and RCMDE BW are similar to those for RCMSE and RCMDE, respectively. Thus, these results are not shown herein.
To investigate the effect of the refined composite technique on the stability of results, the CVs for the multiscale approaches at scale 5 are calculated. The smallest CVs, illustrated in Table 4 are obtained by MDE and RCMDE approaches. The results also suggest that the refined composite does not improve the stability of profiles for the signal with length 4500 samples (long signals). For the Lorenz series with length 450 sample points, RCMSE and RCMDE lead to smaller CV values in comparison with MSE and MDE, in that order, showing the importance of the refined composite method to characterize small time series.   Figure 12, show that the non-focal signals are more complex than the focal ones. This fact is in agreement with previous studies [28,36].
The results for RCMSE BW and RCMDE BW were respectively similar to those for MSE BW and MDE BW . Thus, they are not shown herein. Note that, for MDE and RCMDE, τ max and m, respectively, were 30 and 3. It should also be mentioned that the average entropy values over two channels for these bivariate EEG signals are reported for the univariate complexity techniques.
To compare the results, the CV values obtained by the univariate multiscale approaches, except InMSE and InMDE, are calculated at scale factor 15. These are shown in Table 5. The CV values for MDE, RCMDE, MSE, and RCMSE illustrate that the refined composite approach does not enhance the stability of the MDE and MSE profiles. Overall, the smallest CV values are achieved by DispEn-based complexity methods.  For each scale factor, the average of entropy values for elderly subjects is smaller than that for young ones, in agreement with those obtained by the other entropy-based methods [37] and the fact that recordings from healthy young subjects correspond to more complex states because of their ability to adapt to adverse conditions, whereas aged individuals' signals present complexity loss [3,5,38]. The results also suggest that, when dealing with short signals, the complexity measures without downsampling (i.e., MSE MA , MDE MA , and MSE BW and MDE BW without DS) are appropriate to distinguish different kinds of dynamics of real signals.
The CV values at those scales whose profiles do not have an overlap are illustrated in Table 6. It is found that MDE BW without DS leads to the smallest CV values.  Figure 13. Mean value and SD of results obtained by the complexity measures computed from the young and old subjects' stride interval recordings.

Time Delay, Downsampling, and Nyquist Frequency
According the previous complexity-based approaches [2,3,13,15], the time delay was equal to 1 in this study. Nevertheless, if the sampling frequency is considerably larger than the highest frequency component of a signal, the first minimum or zero crossing of the autocorrelation function or mutual information can be used for the selection of an appropriate time delay [39].
Alternatively, a signal may be downsampled before calculating the complexity-based entropy approaches to adjust its highest frequency component to its Nyquist frequency ( f s /2) [40]. Accordingly, when the coarse-graining process starts, the low-pass filtering will affect the highest frequency component of the signal at low temporal scale factors. It is worth noting that if the main frequency components of the signal are considerably lower than its highest frequency component (e.g., the signal o -please see Figure 7), the filtering process may make only a little change in the amplitude values of the signal at even large scales.

Future Work
Wavelet transform, which is a powerful filter bank broadly used for analysis of non-stationary recordings, can be employed to decompose a signal to several series with specific frequency bands [41]. Accordingly, the wavelet-based filter bank could be used as a complexity approach. VMD can also be used as an alternative to EMD in InMSE and InMDE. VMD, unlike EMD, provides a solution to the decomposition problem that is theoretically well founded and more robust to noise than EMD [16]. A recent development in the field has tried to generalize multivariate and univariate multiscale algorithms to a family of statistics by using different moments (e.g., variance, skewness, and kurtosis) in the univariate and multivariate coarse-graining process [25,[42][43][44]. It is recommended to compare these techniques in the context of signal processing and to investigate their interpretations. As the existing univariate and even multivariate coarse-graining processes filter only series in each channel separately [38,43,45], there is a need to propose new multivariate filters dealing with the spatial and time domains at the same time.

Conclusions
In summary, we have compared existing and newly proposed coarse-graining approaches for univariate multiscale entropy estimation. Our results indicate that, as expected due to the filter bank properties of the EMD [33] in comparison with moving average and Butterworth filtering, the cut-off frequencies at each temporal scale τ of the former are considerably smaller than those for the latter. Therefore, InMSE and our developed InMDE have entropy values very close to 0 for relatively low values of temporal scales due to the exponential, rather than linear, dependency of the bandwidth at each scale. We also inspected the effect of the downsampling in the coarse-graining process in the entropy values, showing that it may lead to increased or decreased values of entropy depending on the sampling frequency of the time series.
Our results confirmed previous reports indicating that, when dealing with short or noisy signals, the refined composite approach [14,25] may improve the stability of entropy results. On the other hand, for long signals with relatively low levels of noise, the refined composite method makes little difference in the quality of the entropy estimation at the expense of a considerable additional computational cost. In any case, the use of dispersion entropy over sample entropy in the estimations led to more stable results based on CV values and ensured that the entropy values were defined at all temporal scales.
Finally, the profiles obtained by the multiscale techniques with and without downsampling led to similar findings (e.g., pink noise is more complex than white noise based on all the complexity methods) although the specific values of entropy may differ depending on the coarse-graining used. This suggests that downsampling within the coarse-graining procedure may not be needed to quantify the complexity of signals, especially for short ones. In fact, these kinds of techniques still eliminate the fast temporal scales to deal with progressively slower time scales as τ increases and take into account multiple time scales inherent in time series.
On the whole, it is expected that these findings contribute to the ongoing discussion regarding the development of stable, fast, and less sensitive-to-noise complexity approaches appropriate for either short or long time series. We recommend that future studies explicitly justify their choices for coarse-graining procedure in the light of the characteristics of the signals under analysis and the hypothesis of the study, and that they discuss their findings on the light of the behaviour of the selected entropy metric and coarse-graining procedure.
Author Contributions: Hamed Azami and Javier Escudero conceived and designed the methodology. Hamed Azami was responsible for analysing and writing the paper. Both the authors contributed critically to revise the results and discussed them and have read and approved the final manuscript.

Conflicts of Interest:
The authors declare no conflict of interest.