Noise Cancellation Method Based on TVF-EMD with Bayesian Parameter Optimization

Yu, Miaomiao; Yuan, Hongyong; Li, Kaiyuan; Deng, Lizheng

doi:10.3390/a16060296


Institute of Public Safety Research, Department of Engineering Physics, Tsinghua University, Beijing 100084, China

* Author to whom correspondence should be addressed.

Algorithms 2023, 16(6), 296; https://doi.org/10.3390/a16060296

Submission received: 18 April 2023 / Revised: 3 June 2023 / Accepted: 7 June 2023 / Published: 12 June 2023


Abstract

To separate noise from the important features of the indoor carbon dioxide (CO₂) concentration signal, we propose a noise cancellation method based on time-varying filtering-based empirical mode decomposition (TVF-EMD) with Bayesian optimization (BO). The adaptive parameters of TVF-EMD, namely the bandwidth threshold ξ and the B-spline order n, were determined by the BO algorithm, with the correlation coefficient for the kurtosis index (CCKur) as the objective function. First, the objective function CCKur was introduced to systematically identify anomalous signals while preserving the signal features shared by the modes and the input signal. Subsequently, the proposed noise cancellation model based on TVF-EMD and the BO algorithm was employed, together with the Hurst exponent, to extract the sensitive modes. An examination of the optimization indices of the decomposed intrinsic mode functions (IMFs), namely CC, Kur, MI, EE, EEMI, and CCKur, showed that the synthetic measurement index CCKur and the objective function fitness were reasonable and effective. The proposed method exhibited better noise cancellation performance than TVF-EMD with default values, EMD, the moving average method, and the exponential smoothing method.

Keywords:

CO₂ concentration signal; signal noise cancellation; time-varying filtering-based empirical mode decomposition (TVF-EMD); Bayesian parameter optimization (BO); correlation coefficient for kurtosis index (CCKur)

1. Introduction

Signals contain rich characteristic information; thus, signal processing analysis is crucial in the development of natural sciences, particularly in environmental monitoring [1]. There is a relationship between the indoor carbon dioxide concentration and respiratory mucosal symptoms [2,3]; hence, it is essential to monitor and study carbon dioxide concentrations in various functional building spaces in real time. However, when there are sudden changes in background concentrations of CO₂ or equipment sampling failures, the CO₂-monitoring signal becomes noisy, making it difficult to distinguish important signal features from noise. Therefore, the noise cancellation of the CO₂-monitoring signal is important.

There are various methods of signal denoising; the most commonly used include smoothing, the Fourier transform [4], wavelet theory [5], and the Hilbert–Huang transform [6,7]. Smoothing is simple and convenient but denoises the signal at the expense of temporal resolution [8,9]. In Fourier and wavelet transforms, the choice of the a priori basis function directly affects the noise reduction result [9,10], which is a major limitation of these methods. The core of the Hilbert–Huang transform is empirical mode decomposition (EMD). EMD identifies the intrinsic oscillatory modes of a signal from its local characteristic timescale and accordingly decomposes the raw signal into several intrinsic mode functions (IMFs) without requiring prior knowledge of the signal [11]. EMD is therefore commonly adopted for the adaptive analysis of nonlinear and non-stationary signals, such as indoor CO₂ concentration signals. However, EMD-based noise reduction suffers from endpoint effects and modal aliasing, the latter known as the mode-mixing issue [12]; in other words, EMD is sensitive to noise. Improved methods built on the EMD framework, such as ensemble EMD [13] and noise-assisted MEMD [14], have been proposed to solve the intermittence problem of EMD. However, these methods still have shortcomings, such as noise parameters that are difficult to select and failure to separate modes.

Time-varying filtering-based empirical mode decomposition (TVF-EMD) was developed to address the shortcomings of EMD, with the sifting process carried out by a B-spline approximation filter [12]. Compared with most existing methods, TVF-EMD has three main features: (1) it can simultaneously address the separation and intermittence issues [15]; (2) with a B-spline approximation filter integrated into the sifting process, it mitigates mode mixing and preserves time-varying features; and (3) its enhanced stopping criterion improves performance at low sampling rates. However, two parameters, the bandwidth threshold ξ and the B-spline order n, directly affect the separation performance and the filtering performance of the time-varying filter, respectively [12]. Selecting a reasonable combination of the two parameters in advance enables TVF-EMD to resolve the mode-mixing problem and thereby achieve optimal noise reduction. Hence, the choice of TVF-EMD parameters is important.

Many optimization methods are used for hyperparameter estimation. Among the most widely used are grid search [16,17], random search [18], the genetic algorithm [19], and Bayesian optimization (BO) [20,21]. Unlike grid and random searches, BO leverages information from existing data: the current search for optimal values is based on previous search results [22]. In addition, swarm intelligence optimization techniques imitate the behavior of various organisms, such as ants [23], particle swarms [24], fish schools [25], glowworm swarms [26], and grey wolves [27], to achieve optimal parameter estimation. Zhou et al. [28] proposed a parameter-adaptive TVF-EMD feature extraction method based on an improved GOA for mechanical fault diagnosis. However, the genetic algorithm and swarm intelligence algorithms are population-based optimization algorithms, which are not particularly suitable for model hyperparameter tuning because they require a sufficient number of initial sample points and are not particularly efficient. To accelerate computation, Bayesian hyperparameter optimization based on surrogate models is widely used. The tree-structured Parzen estimator (TPE) is one of the most notable hyperparameter optimization methods [29,30]. The Parzen-based estimator naturally handles complex search spaces and can be extended to dozens of variables, with at least a thousand observations [31]. Therefore, a BO algorithm based on the tree-structured Parzen estimator (BO-TPE) is a reasonable and effective choice for tuning the TVF-EMD parameters.

Among the aforementioned denoising approaches, smoothing reduces the temporal resolution of the signal. The indoor CO₂ concentration in buildings is a typical non-stationary signal. Signal-denoising methods such as the Fourier transform, EMD, ensemble EMD, and noise-assisted MEMD are not well suited to processing such signals. The wavelet transform is suitable for non-stationary, time-varying signals, but selecting the wavelet basis is difficult. The TVF-EMD method can overcome these shortcomings. Combined with a hyperparameter optimization algorithm, TVF-EMD can be matched with appropriate parameters more quickly to achieve accurate noise reduction of the indoor CO₂ concentration signal.

Considering this background, this study proposes an optimized TVF-EMD method, based on the BO-TPE algorithm, for noise cancellation. First, the correlation coefficient for the kurtosis index (CCKur) was defined as a comprehensive index that serves as the objective function for TVF-EMD parameter optimization. In addition, the correlation coefficient was used as the mode selection criterion for the target IMF. Second, the BO-TPE optimization algorithm was used to search for the optimal combination of TVF-EMD parameters to match the input signal. The proposed method is suited to denoising non-stationary signals, especially those whose noise has large amplitudes but a dispersed distribution. The noise reduction results for CO₂ concentration signals from two different functional building spaces demonstrated that the method successfully cancels noise and could be effective in CO₂-monitoring engineering applications.

2. Preliminaries

2.1. Time-Varying Filter Empirical Mode Decomposition

EMD decomposes a given signal x(t) into a set of component signals known as IMFs and a residual r(t), as shown in the following equation:

x(t) = \sum_{i=1}^{N} \mathrm{imf}_{i}(t) + r(t)

(1)

where imf_i(t) represents the i-th IMF. The sifting process of EMD comprises two steps: (1) estimation of the “local mean” m(t) and (2) recursive subtraction of m(t) from the input signal until the resulting signal meets the stopping criterion.
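The decomposition in Equation (1) can be illustrated with a deliberately simplified sifting loop. The sketch below is not the implementation used in this study: it assumes piecewise-linear envelopes instead of the cubic splines of standard EMD and a fixed number of sifting passes instead of a stopping criterion, and the function names (`local_extrema`, `envelope`, `emd`) are illustrative.

```python
import math


def local_extrema(x):
    """Return the indices of interior local maxima and minima of x."""
    maxima, minima = [], []
    for i in range(1, len(x) - 1):
        if x[i - 1] < x[i] >= x[i + 1]:
            maxima.append(i)
        elif x[i - 1] > x[i] <= x[i + 1]:
            minima.append(i)
    return maxima, minima


def envelope(x, idx):
    """Piecewise-linear envelope through x[idx], held flat at both ends."""
    knots = [0] + idx + [len(x) - 1]
    values = [x[idx[0]]] + [x[i] for i in idx] + [x[idx[-1]]]
    env = [0.0] * len(x)
    for j in range(len(knots) - 1):
        k0, k1 = knots[j], knots[j + 1]
        v0, v1 = values[j], values[j + 1]
        for t in range(k0, k1 + 1):
            env[t] = v0 + (v1 - v0) * (t - k0) / (k1 - k0)
    return env


def emd(x, max_imfs=4, n_sift=10):
    """Split x into IMFs and a residual so that x = sum(IMFs) + residual."""
    residual = list(x)
    imfs = []
    for _ in range(max_imfs):
        maxima, minima = local_extrema(residual)
        if len(maxima) < 2 or len(minima) < 2:
            break  # too few oscillations left to extract another mode
        h = list(residual)
        for _ in range(n_sift):  # fixed pass count (simplifying assumption)
            maxima, minima = local_extrema(h)
            if not maxima or not minima:
                break
            upper = envelope(h, maxima)
            lower = envelope(h, minima)
            # Step (1): local mean m(t); step (2): subtract it from the signal.
            h = [v - 0.5 * (u + l) for v, u, l in zip(h, upper, lower)]
        imfs.append(h)
        residual = [r - v for r, v in zip(residual, h)]
    return imfs, residual


# Synthetic test signal: an oscillation riding on a slow linear trend.
signal = [math.sin(2 * math.pi * t / 10) + 0.01 * t for t in range(200)]
imfs, residual = emd(signal)
rebuilt = [sum(m[t] for m in imfs) + residual[t] for t in range(len(signal))]
```

By construction, summing the extracted modes and the residual recovers the input signal, which is the property stated in Equation (1).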

In TVF-EMD, the mono-components are replaced by local narrow-band signals to improve the performance of EMD. These signals have properties similar to those of IMFs but provide a meaningful Hilbert spectrum. Local narrow-band signals are defined through the local instantaneous bandwidth, which must be below a given threshold value. The method first determines the local cut-off frequency and then applies a time-varying filter. The TVF-EMD sifting process is performed using this time-varying filter and primarily involves three main steps [12]:

1. Local cutoff frequency rearrangement

Step 1: Locate the time instants of the local maxima of x(t), denoted as u_i, i = 1, 2, 3, …

Step 2: Determine all intermittences, denoted as e_j, j = 1, 2, 3, …, by setting a threshold on the rate of change within a given time span. These intermittences should satisfy the following condition:

\frac{\max(\phi'_{bis}(u_{i} : u_{i+1})) - \min(\phi'_{bis}(u_{i} : u_{i+1}))}{\min(\phi'_{bis}(u_{i} : u_{i+1}))} > \rho

(2)

where ρ = 0.25, and ϕ′_bis is the bisection frequency.

Step 3: For a rising edge, that is, ϕ′_bis(u_{i+1}) − ϕ′_bis(u_i) > 0, the segment ϕ′_bis(e_j : e_{j+1}) is considered the floor. Similarly, for a falling edge, that is, ϕ′_bis(u_{i+1}) − ϕ′_bis(u_i) < 0, the segment ϕ′_bis(e_{j+1} : e_j) is considered the floor. The rest of ϕ′_bis is regarded as the peak.

Step 4: By interpolating between the peaks, the final local cutoff frequency can be obtained. During the local cutoff frequency rearrangement phase, the TVF-EMD addresses the separation and intermittence problems.
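The condition in Equation (2) can be checked with a few lines of code. The following minimal sketch assumes that the bisection frequency is available as a sampled sequence `phi` and that the maxima positions u_i are given as sample indices; the function name `find_intermittences` and the choice to report the left index u_i of each flagged interval are assumptions made for illustration only.

```python
def find_intermittences(phi, maxima_idx, rho=0.25):
    """Flag intervals [u_i, u_{i+1}] whose relative variation of phi exceeds rho."""
    flagged = []
    for u_left, u_right in zip(maxima_idx, maxima_idx[1:]):
        segment = phi[u_left:u_right + 1]  # inclusive of both maxima
        lowest, highest = min(segment), max(segment)
        # Guard against a non-positive minimum (assumption: frequencies are > 0).
        if lowest > 0 and (highest - lowest) / lowest > rho:
            flagged.append(u_left)
    return flagged


# Toy frequency track with one abrupt jump between samples 3 and 4.
phi = [1.0, 1.0, 1.0, 1.0, 2.0, 2.0, 2.0]
intermittences = find_intermittences(phi, [0, 3, 4, 6])
```

Only the interval that spans the jump is flagged; the flat intervals on either side have zero relative variation and are left untouched.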

2. Sifting process for TVF-EMD

Step 1: Estimate the local cut-off frequency of the signal x(t).

Step 2: Filter the input signal x(t) using the time-varying filter (i.e., the B-spline approximation filter) to obtain the local mean. The bandwidth threshold ξ determines the separation effect and whether the input signal must be filtered. The B-spline order n, which is independent of the cut-off frequency estimate, determines the attenuation and filtering effect of the TVF.

Step 3: Verify that the residual signal satisfies the stopping criterion indicated below:

\theta(t) = \frac{B_{Loughlin}(t)}{\varphi_{avg}(t)}

(3)

where B_Loughlin(t) is the Loughlin instantaneous bandwidth, and φ_avg(t) is the weighted average instantaneous frequency. The detailed calculation process of TVF-EMD was discussed in previous studies [12,32].

2.2. Bayesian Optimization

BO-TPE works by assuming that the black-box function is sampled from a Gaussian mixture model and maintains a posterior distribution of that function as observations are made. The posterior probabilities are updated with the new sample points at each iteration. To prevent the search from becoming trapped in local optima, the BO algorithm adds a stochastic component that balances random exploration against exploitation of the posterior distribution.

The probabilistic surrogate model and acquisition function are the core components of the BO method, and the TPE is the probabilistic surrogate model of the study. The BO algorithm is highly efficient for hyperparameter estimation, as it exhibits excellent convergence.

The objective of Bayesian optimization is to determine the global maximum or minimum of the objective function in the hyperparameter space. This study aimed to determine the maximum of the true fitness objective function on a bounded set χ within a limited number of iterations. The mathematical model of the BO algorithm is as follows:

x^{*} = \arg\max_{x \in \chi} f(x)

(4)

where the point x* that maximizes the surrogate function is the proposed point for evaluating the objective function f(x).

Suppose that the observations are of the form D_{1:t} = {x_i, y_i}_{i=1}^{t}, where y_i is the generalization accuracy of the algorithm under x_i. Each y_i can be regarded as a random observation y = f(x) + ε, where the observation noise ε is assumed to satisfy ε ~ N(0, σ_ε²). To search for x* in the hyperparameter space, an acquisition function α_t is constructed from the observations, and the next point to evaluate is the one that maximizes it:

x_{t+1} = \arg\max_{x} \alpha_{t}(x; D_{1:t})

(5)

In this study, the bandwidth threshold ξ and the B-spline order n were the independent variables x of the BO model, and the Bayesian optimization procedure is outlined in the flowchart in Figure 1. First, we checked whether the model parameters ξ and n had been initialized; if not, the initial parameters were generated randomly. Once initialized, the values were passed to the tree-structured Parzen estimator surrogate model. Subsequently, it was judged whether the acquisition function reached its maximum value for the given ξ and n. If the maximum value of the target function was attained, the values of the two parameters were output. If not, the parameters of the surrogate model were updated until the requirement was met.
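The loop described above can be sketched in plain Python. The code below is a heavily simplified TPE-style optimizer written for illustration, not the BO-TPE implementation used in this study and not the API of any existing library: it assumes independent one-dimensional Gaussian Parzen densities with a fixed bandwidth, uses the log density ratio l(x)/g(x) as the acquisition function, and maximizes a hypothetical stand-in objective (`toy_objective`) in place of the real fitness.

```python
import math
import random


def parzen_density(x, centers, bandwidth):
    """Average of Gaussian kernels centred on previously observed values."""
    if not centers:
        return 1.0  # no observations in this group: flat density
    norm = len(centers) * bandwidth * math.sqrt(2 * math.pi)
    return sum(math.exp(-0.5 * ((x - c) / bandwidth) ** 2) for c in centers) / norm


def tpe_maximize(objective, bounds, n_init=10, n_iter=30,
                 gamma=0.25, n_candidates=24, seed=0):
    """Maximise objective(params) over box bounds with a simplified TPE loop."""
    rng = random.Random(seed)
    names = list(bounds)
    history = []
    for _ in range(n_init):  # random initialisation of the parameters
        params = {k: rng.uniform(*bounds[k]) for k in names}
        history.append((params, objective(params)))
    for _ in range(n_iter):
        # Split past observations into a "good" and a "bad" group.
        ranked = sorted(history, key=lambda item: item[1], reverse=True)
        n_good = max(1, int(gamma * len(ranked)))
        good = [p for p, _ in ranked[:n_good]]
        bad = [p for p, _ in ranked[n_good:]]
        best_candidate, best_score = None, -math.inf
        for _ in range(n_candidates):
            base = rng.choice(good)  # sample candidates near good points
            candidate, score = {}, 0.0
            for k in names:
                low, high = bounds[k]
                bandwidth = (high - low) / 5.0
                value = min(high, max(low, rng.gauss(base[k], bandwidth)))
                candidate[k] = value
                l_x = parzen_density(value, [p[k] for p in good], bandwidth)
                g_x = parzen_density(value, [p[k] for p in bad], bandwidth)
                # Acquisition: log of the density ratio l(x) / g(x).
                score += math.log(l_x + 1e-12) - math.log(g_x + 1e-12)
            if score > best_score:
                best_candidate, best_score = candidate, score
        history.append((best_candidate, objective(best_candidate)))
    return max(history, key=lambda item: item[1])


def toy_objective(params):
    """Hypothetical stand-in for the fitness; peaks at xi = 0.2, n = 5."""
    return -(params["xi"] - 0.2) ** 2 - 0.001 * (params["n"] - 5) ** 2


best_params, best_value = tpe_maximize(
    toy_objective, {"xi": (0.0, 0.4), "n": (1.0, 16.0)}
)
```

In practice the B-spline order n would be rounded to an integer inside the objective, and the stand-in objective would be replaced by a function that runs TVF-EMD and scores the resulting modes.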

3. Adopted Methodology

The proposed optimized TVF-EMD method is based on the BO-TPE algorithm. It searches for the optimal combination of the bandwidth threshold ξ and the B-spline order n using an objective function that determines the merits of the decomposition results. The kurtosis index Kur depends on the distribution density of the signal and is highly sensitive to large amplitudes with dispersed distributions. A kurtosis index below 3 indicates that the central peak of the signal is lower and broader than that of the normal distribution, for which Kur ≈ 3. In contrast, Kur > 3 indicates that the central peak of the signal is higher and sharper. Thus, a smaller kurtosis index is required for a more sensitive identification of outliers. However, to avoid excessive noise cancellation, the correlation coefficient (CC) is used to characterize the similarity between the original signal and the decomposed modes. Therefore, a synthetic measurement index combining the kurtosis index and the CC was developed as the objective function for TVF-EMD parameter optimization. This index, the correlation coefficient for the kurtosis index (CCKur), was calculated as follows:

CCKur = \frac{|CC|}{Kur}

(6)

CC: \rho_{x,imf} = \frac{\sum_{i=1}^{N} (x_{i} - \bar{x})(imf_{i} - \overline{imf})}{\sqrt{\sum_{i=1}^{N} (x_{i} - \bar{x})^{2} \sum_{i=1}^{N} (imf_{i} - \overline{imf})^{2}}}

(7)

Kur = \frac{\frac{1}{M} \sum_{m=0}^{M-1} x^{4}(m)}{\left(\frac{1}{M} \sum_{m=0}^{M-1} x^{2}(m)\right)^{2}}

(8)
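Equations (6)–(8) translate directly into code. The sketch below is a minimal pure-Python illustration; it assumes that the kurtosis in Equation (6) is evaluated on the mode rather than on the original signal, and that the moments in Equation (8) are taken without removing the mean. The function names are illustrative.

```python
import math


def correlation_coefficient(x, imf):
    """Pearson correlation between the signal and one mode, Equation (7)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_m = sum(imf) / n
    cov = sum((a - mean_x) * (b - mean_m) for a, b in zip(x, imf))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_m = sum((b - mean_m) ** 2 for b in imf)
    return cov / math.sqrt(var_x * var_m)


def kurtosis_index(values):
    """Fourth moment over squared second moment, Equation (8)."""
    m = len(values)
    fourth = sum(v ** 4 for v in values) / m
    second = sum(v ** 2 for v in values) / m
    return fourth / second ** 2


def cckur(x, imf):
    """Synthetic index of Equation (6); Kur is taken on the mode (assumption)."""
    return abs(correlation_coefficient(x, imf)) / kurtosis_index(imf)


square_wave = [1.0, -1.0, 1.0, -1.0]  # flat amplitude distribution
spiky = [0.0, 0.0, 0.0, 4.0]          # a single large outlier
```

For the square wave the kurtosis index is 1, whereas the sequence with a single outlier gives 4; dividing |CC| by Kur therefore penalizes modes dominated by isolated large amplitudes. Constant inputs are not handled here and would cause a division by zero.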

As the original BO algorithm was developed to determine a maximum, the method used the maximum CCKur between the original signal and the modes obtained by TVF-EMD as the fitness. The optimization problem was therefore the maximization of CCKur, expressed as follows:

\left\{ \begin{matrix} fitness = \max_{\gamma = \{\xi, n\}} \{CCKur\} \\ \xi \in (0, 0.4] \\ n \in [1, 16] \end{matrix} \right.

(9)

where fitness is the objective function, and γ = {ξ, n} represents the parameter combination of the TVF-EMD method to be optimized. The CC between the original signal x(t) and a mode imf(t), both of length N, is described in Equation (7).

To ensure reliable parameter optimization, the number of modes obtained from the CO₂ signal decomposition must be at least two. After several attempts, it was found that 0 < ξ ≤ 0.4 and 1 ≤ n ≤ 16 satisfied this requirement. The same number of modes was obtained as with the commonly used standard EMD, and the maximum number of modes for TVF-EMD was set to K = log₂ N [4].
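The constrained objective of Equation (9), together with the requirement of at least two modes, can be wrapped in a single function that an optimizer calls for each candidate (ξ, n). The sketch below is an assumption-laden illustration: `decompose` and `score` are caller-supplied callables standing in for TVF-EMD and CCKur, and the two `fake_*` helpers are hypothetical placeholders used only to make the example runnable.

```python
import math


def fitness(params, signal, decompose, score):
    """Largest score between the signal and any mode, or -inf if infeasible."""
    xi = params["xi"]
    n = int(round(params["n"]))  # the B-spline order is an integer
    if not (0.0 < xi <= 0.4 and 1 <= n <= 16):
        return -math.inf  # outside the bounds of Equation (9)
    modes = decompose(signal, xi, n)
    if len(modes) < 2:
        return -math.inf  # fewer than two modes: decomposition rejected
    return max(score(signal, mode) for mode in modes)


def fake_decompose(signal, xi, n):
    """Hypothetical stand-in for TVF-EMD: two scaled copies of the signal."""
    return [[xi * v for v in signal], [(1.0 - xi) * v for v in signal]]


def fake_score(signal, mode):
    """Hypothetical stand-in for CCKur: mean absolute value of the mode."""
    return sum(abs(v) for v in mode) / len(mode)


value = fitness({"xi": 0.25, "n": 7.2}, [1.0, -1.0, 1.0, -1.0],
                fake_decompose, fake_score)
rejected = fitness({"xi": 0.5, "n": 7}, [1.0, -1.0, 1.0, -1.0],
                   fake_decompose, fake_score)
```

Returning negative infinity for infeasible candidates keeps them out of the optimum without requiring the optimizer itself to know the constraints.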

The autocorrelation properties of the signal, characterized by the Hurst exponent H, were evaluated to distinguish noise from the most relevant modes. When 0 ≤ H < 0.5, the signal is anti-correlated. In contrast, H = 0.5 indicates white noise, and H > 0.5 represents a positive correlation. In this study, the threshold for the Hurst exponent was set to 0.5 (H_thr = 0.5). The detailed steps of the noise cancellation methodology were as follows:

Step 1: Input the CO₂ concentration signal x(t) and set the parameter population X for the TVF-EMD model. Concurrently, initialize the parameters of the BO algorithm and the population X, which includes the bandwidth threshold ξ and the B-spline order n.

Step 2: Decompose the signal x(t) using the TVF-EMD model with the parameter combination ξ and n, and then compute the objective function fitness from the resulting IMFs. The best fitness of each iteration of the BO algorithm is stored.

Step 3: If the stored fitness satisfies the threshold, save the optimal parameters ξ and n. Otherwise, set l = l + 1 and return to Step 2 to update ξ and n until the maximized CCKur meets the requirement.

Step 4: Obtain and save the maximum CCKur and the corresponding parameter combination of the TVF-EMD.

Step 5: Update the population X with the best parameter combination.

Step 6: Use the optimized TVF-EMD with the parameter combination ξ and n to decompose the original CO₂ signal.

Step 7: Calculate the Hurst exponent H of each IMF. If H is greater than H_thr, save the IMF as a sensitive IMF.

Step 8: Sum these sensitive IMFs together to obtain the reconstructed signals. The other insensitive IMFs are considered to be noise.
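Steps 7 and 8 can be sketched as follows. The source does not state which Hurst estimator was used, so the rescaled-range (R/S) estimator below is an assumption, as are the function names and the minimum window size. The selection routine keeps the IMFs whose Hurst exponent exceeds the threshold and sums them.

```python
import math


def hurst_rs(series, min_window=8):
    """Estimate the Hurst exponent by rescaled-range (R/S) analysis."""
    log_sizes, log_rs = [], []
    window = min_window
    while window <= len(series) // 2:
        ratios = []
        for start in range(0, len(series) - window + 1, window):
            chunk = series[start:start + window]
            mean = sum(chunk) / window
            deviations = [v - mean for v in chunk]
            running, cumulative = 0.0, []
            for d in deviations:
                running += d
                cumulative.append(running)
            spread = max(cumulative) - min(cumulative)
            std = math.sqrt(sum(d * d for d in deviations) / window)
            if std > 0 and spread > 0:
                ratios.append(spread / std)
        if ratios:
            log_sizes.append(math.log(window))
            log_rs.append(math.log(sum(ratios) / len(ratios)))
        window *= 2
    if len(log_sizes) < 2:
        return float("nan")  # not enough scales to fit a slope
    mean_s = sum(log_sizes) / len(log_sizes)
    mean_r = sum(log_rs) / len(log_rs)
    num = sum((s - mean_s) * (r - mean_r) for s, r in zip(log_sizes, log_rs))
    den = sum((s - mean_s) ** 2 for s in log_sizes)
    return num / den  # slope of log(R/S) against log(window size)


def reconstruct(imfs, hurst_values, threshold=0.5):
    """Sum the IMFs whose Hurst exponent exceeds the threshold (Steps 7 and 8)."""
    # A NaN exponent never satisfies h > threshold, so such IMFs are dropped.
    kept = [i for i, h in enumerate(hurst_values) if h > threshold]
    denoised = [sum(imfs[i][t] for i in kept) for t in range(len(imfs[0]))]
    return denoised, kept


# Hurst values reported in Section 4.2.1, applied to placeholder IMFs.
reported_h = [float("nan"), 0.37, 0.32, 0.58, 0.72, 0.79, 1.0]
toy_imfs = [[float(i)] * 3 for i in range(7)]
denoised, kept = reconstruct(toy_imfs, reported_h)
ramp_h = hurst_rs([float(i) for i in range(128)])
```

With the Hurst values reported in Section 4.2.1, the fourth through seventh IMFs (zero-based indices 3 to 6) are retained. A pure ramp yields an estimate close to 1, in line with the strongly persistent behaviour expected of a trend-like component.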

A flowchart of the proposed BO-based TVF-EMD method for signal noise cancellation (SNC) is shown in Figure 2.

4. Results and Discussion

4.1. Data Acquisition

The CO₂ concentration was analyzed after denoising the CO₂ signals, which were obtained using pump-activated CO₂ concentration detectors installed in offices and university classrooms. The CO₂ signals were sampled at 5 s intervals and monitored continuously for 24 h. Figure 3 illustrates the CO₂ signal from a regular-sized office of 8 m × 8 m × 3 m accommodating approximately six to ten people. The detector measured the CO₂ concentration at a height of 2.8 m in the room. The specifications used are listed in Table 1.

4.2. Analysis of the Simulated Signal

4.2.1. Simulation and Comparison

The result of the signal decomposition using the EMD method is shown in Figure 4a. Signal x(t) was decomposed into nine IMFs. Therefore, the maximum number of modes for TVF-EMD was set to three. The original CO₂ signal was then processed by the proposed TVF-EMD model based on the BO algorithm. The bandwidth threshold, the B-spline order, and the maximum CCKur found by the BO algorithm were ξ = 0.31, n = 7, and CCKur_max = 0.49, respectively. Figure 4b shows the signal decomposition results obtained using the optimized TVF-EMD model.

The Hurst exponent was computed for each IMF. As shown in Figure 5, the different values of the Hurst exponent for the seven consecutive IMFs were NaN, 0.37, 0.32, 0.58, 0.72, 0.79, and 1.0, respectively. The first IMF, second IMF, and third IMF had a Hurst exponent value of less than the threshold of 0.5, indicating the weak long-range autocorrelation of residual signals. The fourth through seventh IMFs were reserved for further processing and summed to obtain the noise-reduced signal. To better illustrate the effects of signal noise cancellation by the proposed method, a quantitative comparison of the three denoising methods was conducted, as shown in Figure 6. Conventional noise reduction methods, such as exponential smoothing and the moving average method, were not highly effective in eliminating the impact of shock signals. This comparison verified the superiority of the TVF-EMD method in signal noise cancellation.
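The two conventional baselines can be written in a few lines, which also shows why they struggle with shock signals. The sketch below is illustrative only: the window length, the smoothing factor, and the edge handling are assumptions, since the settings used in the comparison are not stated here, and the sample values are invented.

```python
def moving_average(signal, window=5):
    """Centred moving average; the window shrinks near the edges."""
    half = window // 2
    smoothed = []
    for t in range(len(signal)):
        lo, hi = max(0, t - half), min(len(signal), t + half + 1)
        smoothed.append(sum(signal[lo:hi]) / (hi - lo))
    return smoothed


def exponential_smoothing(signal, alpha=0.3):
    """Simple exponential smoothing: s_t = alpha * x_t + (1 - alpha) * s_{t-1}."""
    smoothed = [signal[0]]
    for value in signal[1:]:
        smoothed.append(alpha * value + (1 - alpha) * smoothed[-1])
    return smoothed


# Invented example: a flat level with a single shock sample.
spike = [400.0, 400.0, 1000.0, 400.0, 400.0]
ma = moving_average(spike, window=3)
es = exponential_smoothing(spike, alpha=0.5)
```

In both cases the single shock sample is spread over its neighbours rather than removed: the moving average raises three samples to 600, and exponential smoothing leaves a decaying tail after the shock.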

4.2.2. The Effects of TVF-EMD Parameters

In addition to CC and Kur, other indices, including mutual information (MI) [33], energy entropy (EE) [34], and energy entropy mutual information (EEMI) [28], are commonly used to evaluate the degree of optimization in signal processing. As shown in Figure 7, the selection of the optimal parameters ξ and n did not significantly change the MI, EE, and EEMI indices of IMF1 to IMF7, indicating that these indices were insensitive to the parameters. In contrast, CC, Kur, and CCKur were sensitive to the parameters.

Different parameters affected the decomposition results to different degrees. The decomposition of the CO₂ concentration signal x(t) and the effect of TVF-EMD parameters on the decomposition results are shown in Figure 8. The bandwidth threshold ξ reached 0.95 in the decomposition results, indicating that the selection of the optimization index should be sensitive to the change in ξ. Therefore, we analyzed the results of CC, Kur, MI, EE, EEMI, and CCKur under varying ξ. As shown in Figure 9, when ξ assumed values of 0.1, 0.2, and 0.31 (the optimal parameter), eight, nine, and seven IMFs were obtained by decomposition, respectively. When ξ was at least four, two IMFs were obtained, indicating that mode aliasing and under-decomposition occurred. When ξ increased from 0.1 to 0.31, the MI, EE, and EEMI indices of the IMFs did not change significantly, suggesting that these indices were insensitive to the variation in ξ. The CC, Kur, and CCKur indices were sensitive to ξ; hence, they can be selected as indices for the best parameters. When ξ = 0.31, the sum of the CCKur indices of IMF1 to IMF9 was the largest, which indicates that CCKur can be regarded as the best optimization model. The results showed that (1) ξ had to be at least 4 for mode aliasing and under-decomposition to occur; (2) the CC, Kur, and CCKur indices were sensitive to ξ, whereas the MI, EE, and EEMI indices were not; (3) the proposed CCKur index and the objective function max_{γ = {ξ, n}} {CCKur} were effective; and (4) the value of 0.31 for ξ was reasonable and correct.

4.3. Validation of the Proposed Method

To study the denoised signal performance of the proposed signal cancellation method, the signal-to-noise ratio (SNR), mean squared error (MSE), relative squared error (RSE), and normalized root mean squared error (NRMSE) were used to evaluate the quality of the reconstructed signals. The SNR is the ratio of the original signal power to noise signal power. The MSE calculates the error by squaring the difference between the reconstructed and original signals and averaging all values. The RSE is the ratio of the MSE to the square of the difference between the original signal and the mean of the original signal. The smaller the values of MSE, RSE, and NRMSE, the better the noise reduction effect. In contrast, a smaller SNR implies a poor noise reduction effect. The following equations were used for the comparisons (Equations (10)–(13)).

SNR(dB) = 10 \log_{10}\left(\frac{P_{s}}{P_{n}}\right)

(10)

MSE = \frac{1}{N} \sum_{n=1}^{N} (x(n) - \hat{x}(n))^{2}

(11)

RSE = \frac{\sum_{n=1}^{N} (\hat{x}(n) - x(n))^{2}}{\sum_{n=1}^{N} (\bar{x}(n) - x(n))^{2}}

(12)

NRMSE = \frac{\sqrt{\frac{1}{N} \sum_{n=1}^{N} (x(n) - \hat{x}(n))^{2}}}{x_{\max} - x_{\min}}

(13)

where P_s and P_n represent the effective powers of the signal and the noise, respectively. Here, x(n), x̂(n), and x̄(n) denote the original signal, the reconstructed signal, and the average of the original signal, respectively.
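Equations (10)–(13) can be computed as shown in the minimal sketch below. One point is an assumption rather than something stated in the text: the noise is taken to be the difference between the original and reconstructed signals, so that P_s is the mean power of x(n) and P_n equals the MSE. The sample values are invented for illustration.

```python
import math


def mse(x, x_hat):
    """Mean squared error, Equation (11)."""
    return sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)


def rse(x, x_hat):
    """Relative squared error, Equation (12)."""
    mean_x = sum(x) / len(x)
    num = sum((b - a) ** 2 for a, b in zip(x, x_hat))
    den = sum((mean_x - a) ** 2 for a in x)
    return num / den


def nrmse(x, x_hat):
    """Root mean squared error normalised by the signal range, Equation (13)."""
    return math.sqrt(mse(x, x_hat)) / (max(x) - min(x))


def snr_db(x, x_hat):
    """Equation (10) with P_s = mean power of x and P_n = MSE (assumption)."""
    p_signal = sum(a ** 2 for a in x) / len(x)
    p_noise = mse(x, x_hat)
    return 10 * math.log10(p_signal / p_noise)


original = [1.0, 2.0, 3.0, 4.0]
reconstructed = [1.0, 2.0, 3.0, 5.0]
```

For these values the MSE is 0.25, the RSE is 0.2, the NRMSE is about 0.167, and the SNR is about 14.77 dB. A perfect reconstruction would give a zero noise power, which this sketch does not guard against.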

A detailed study was conducted, and the noise cancellation performance of different denoising methods was evaluated in terms of SNR, MSE, RSE, and NRMSE. The CO₂ signals studied were from a university classroom with a capacity of 100 students. As shown in Table 2, the proposed noise cancellation model yielded the largest absolute SNR value. The MSE, RSE, and NRMSE of the proposed method were all smaller than those obtained by the TVF-EMD, EMD, moving average, and exponential smoothing methods, indicating that the proposed noise cancellation model had superior noise reduction quality for the CO₂ concentration signal.

5. Conclusions

An optimized TVF-EMD model based on a Bayesian algorithm was adopted to develop a noise cancellation model for denoising the CO₂ concentration signal of a building. The Bayesian algorithm was used to optimally estimate the TVF-EMD parameters, namely the bandwidth threshold ξ and the B-spline order n, and to adaptively match them to the given CO₂ concentration signal. The main conclusions can be summarized as follows:

In parameter optimization, the synthetic measurement index CCKur was used as the objective function of TVF-EMD. This function could identify anomalous signals while preserving the signal profile and avoiding excessive noise reduction. In the proposed noise cancellation model, a threshold H_thr = 0.5, based on the Hurst exponent, was introduced as a measurement index for selecting the relevant IMFs for signal reconstruction.

The hyperparameter ξ was more important for the decomposition results. The efficacy of the synthetic measurement index was verified against five optimization indices computed on the decomposed IMFs: CC, Kur, MI, EE, and EEMI. The results demonstrated that the proposed CCKur index was sensitive to ξ, and that the selection of CCKur as the synthetic measurement index and max{CCKur} as the objective function was reasonable and effective.

The noise reduction effect between different signal-denoising models, that is, TVF-EMD with default values, EMD, moving average method, and exponential smoothing method, was compared in terms of SNR, MSE, RSE, and NRMSE. The proposed noise cancellation model yielded the largest absolute value of SNR and the smallest MSE, RSE, and NRMSE, demonstrating the high noise reduction capability of the proposed model for CO₂ concentration signals.

Author Contributions

Data curation, M.Y. and K.L.; Formal analysis, M.Y.; Funding acquisition, H.Y.; Investigation, H.Y.; Methodology, M.Y. and H.Y.; Project administration, H.Y.; Resources, H.Y.; Software, M.Y. and L.D.; Validation, L.D.; Visualization, H.Y.; Writing—original draft, M.Y.; Writing—review & editing, K.L. and L.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the China Postdoctoral Science Foundation, grant number 2022M721845.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Longo, A.; Bianchi, S.; Plastino, W. tvf-EMD based time series analysis of 7Be sampled at the CTBTO-IMS network. Phys. A Stat. Mech. Its Appl. 2019, 523, 908–914.
2. Apte, M.G.; Fisk, W.J.; Daisey, J.M. Associations between indoor CO₂ concentrations and sick building syndrome symptoms in US office buildings: An analysis of the 1994-1996 BASE study data. Indoor Air-Int. J. Indoor Air Qual. Clim. 2000, 10, 246–257.
3. Muscatiello, N.; McCarthy, A.; Kielb, C.; Hsu, W.H.; Hwang, S.A.; Lin, S. Classroom conditions and CO₂ concentrations and teacher health symptom reporting in 10 New York State Schools. Indoor Air 2015, 25, 157–167.
4. Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.L.C.; Shih, H.H.; Zheng, Q.N.; Yen, N.C.; Tung, C.C.; Liu, H.H. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 1998, 454, 903–995.
5. Wang, W.J.; McFadden, P.D. Application of wavelets to gearbox vibration signals for fault detection. J. Sound Vib. 1996, 192, 927–939.
6. Huang, N.E.; Shen, Z.; Long, S.R. A new view of nonlinear water waves: The Hilbert Spectrum. Annu. Rev. Fluid Mech. 1999, 31, 417–457.
7. Montesinos, M.E.; Muñoz-Cobo, J.L.; Pérez, C. Hilbert–Huang analysis of BWR neutron detector signals: Application to DR calculation and to corrupted signal analysis. Ann. Nucl. Energy 2003, 30, 715–727.
8. Tian, P.; Cao, X.; Liang, J.; Zhang, L.; Yi, N.; Wang, L.; Cheng, X. Improved empirical mode decomposition based denoising method for lidar signals. Opt. Commun. 2014, 325, 54–59.
9. Wu, S.; Liu, Z.; Liu, B. Enhancement of lidar backscatters signal-to-noise ratio using empirical mode decomposition method. Opt. Commun. 2006, 267, 137–144.
10. Torrence, C.; Compo, G.P. A practical guide to wavelet analysis. Bull. Am. Meteorol. Soc. 1998, 79, 61–78.
11. Huang, N.E.; Wu, Z.; Long, S.R. Hilbert-Huang transform. Scholarpedia 2008, 3, 2544.
12. Li, H.; Li, Z.; Mo, W. A time varying filter approach for empirical mode decomposition. Signal Process. 2017, 138, 146–158.
13. Wu, Z.; Huang, N.E. Ensemble empirical mode decomposition: A noise-assisted data analysis method. Adv. Adapt. Data Anal. 2009, 1, 1–41.
14. Ur Rehman, N.; Mandic, D.P. Filter bank property of multivariate empirical mode decomposition. IEEE Trans. Signal Process. 2011, 59, 2421–2426.
15. Rehman, N.; Mandic, D.P. Multivariate empirical mode decomposition. Proc. R. Soc. A Math. Phys. Eng. Sci. 2010, 466, 1291–1302.
16. Stoica, P.; Gershman, A.B. Maximum-likelihood DOA estimation by data-supported grid search. IEEE Signal Process. Lett. 1999, 6, 273–275.
17. Bellman, R.E.; Dreyfus, S.E. Applied Dynamic Programming; Princeton University Press: Princeton, NJ, USA, 2015; Volume 2050.
18. Bergstra, J.; Bengio, Y. Random search for hyper-parameter optimization. J. Mach. Learn. Res. 2012, 13, 281–305.
19. Yang, X.-S. (Ed.) Chapter 6—Genetic Algorithms. In Nature-Inspired Optimization Algorithms, 2nd ed.; Academic Press: Cambridge, MA, USA, 2021; pp. 91–100.
20. Frazier, P.I. A tutorial on Bayesian optimization. arXiv 2018, arXiv:1807.02811.
21. Shahriari, B.; Swersky, K.; Wang, Z.; Adams, R.P.; De Freitas, N. Taking the human out of the loop: A review of Bayesian optimization. Proc. IEEE 2015, 104, 148–175.
22. Jones, D.R.; Schonlau, M.; Welch, W.J. Efficient global optimization of expensive black-box functions. J. Glob. Optim. 1998, 13, 455–492.
23. Mirjalili, S. The ant lion optimizer. Adv. Eng. Softw. 2015, 83, 80–98.
24. Marini, F.; Walczak, B. Particle swarm optimization (PSO). A tutorial. Chemom. Intell. Lab. Syst. 2015, 149, 153–165.
25. Mirjalili, S.; Lewis, A. The whale optimization algorithm. Adv. Eng. Softw. 2016, 95, 51–67.
26. Krishnanand, K.N.; Ghose, D. Glowworm swarm optimization for simultaneous capture of multiple local optima of multimodal functions. Swarm Intell. 2009, 3, 87–124.
27. Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey wolf optimizer. Adv. Eng. Softw. 2014, 69, 46–61.
28. Zhou, C.; Xiong, Z.; Bai, H.; Xing, L.; Jia, Y.; Yuan, X. Parameter-Adaptive TVF-EMD Feature Extraction Method Based on Improved GOA. Sensors 2022, 22, 7195.
29. Bergstra, J.; Bardenet, R.; Bengio, Y.; Kégl, B. Algorithms for hyper-parameter optimization. Adv. Neural Inf. Process. Syst. 2011, 24, 2546–2554.
30. Bergstra, J.; Yamins, D.; Cox, D. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. In Proceedings of the 30th International Conference on Machine Learning, Atlanta, GA, USA, 17–19 June 2013; pp. 115–123.
31. Ozaki, Y.; Tanigaki, Y.; Watanabe, S.; Nomura, M.; Onishi, M. Multiobjective Tree-Structured Parzen Estimator. J. Artif. Intell. Res. 2022, 73, 1209–1250.
32. Zhang, S.; Xu, F.; Hu, M.; Zhang, L.; Liu, H.; Li, M. A novel denoising algorithm based on TVF-EMD and its application in fault classification of rotating machinery. Measurement 2021, 179, 109337.
Wang, J.; Liu, P.; Lu, S.; Zhou, M.; Chen, X. Decentralized plant-wide monitoring based on mutual information-Louvain decomposition and support vector data description diagnosis. ISA Trans. 2023, 133, 42–52. [Google Scholar] [CrossRef]
Gao, S.; Ren, Y.; Zhang, Y.; Li, T. Fault diagnosis of rolling bearings based on improved energy entropy and fault location of triangulation of amplitude attenuation outer raceway. Measurement 2021, 185, 109974. [Google Scholar] [CrossRef]

Figure 1. Flowchart of Bayesian optimization.

Figure 2. Flowchart for the noise-cancellation model.

Figure 3. Office CO₂ concentrations throughout the day.

Figure 4. Decomposition results of the CO₂ signal obtained by (a) EMD and (b) TVF-EMD.

Figure 5. Hurst exponent values in different modes.

Figure 6. Comparison of signal noise cancellation and the simulated CO₂-concentration signal.

Figure 7. Relationship between IMF and the optimization index.

Figure 8. Hyperparameter importance in the decomposition result.

Figure 9. Relationship between IMF indices and the bandwidth threshold.

Table 1. Specifications considered for processing CO₂ signals.

Parameters	Value
Sampling interval	5 s
Measurement accuracy	1 ppm
Error range	5%
Time sample length	24 h

Table 2. Comparison of the noise reduction effect under different signal-denoising models.

Index	Proposed TVF-EMD $(ξ = 0.015, n = 4)$	TVF-EMD with Default Values $(ξ = 0.1, n = 26)$	EMD	Moving Average (Window = 40)	Exponential Smoothing $(α = 0.5)$
SNRabs	71.376	71.296	71.141	71.251	71.239
MSE	7.877	95.976	3217.423	154.281	21.473
RSE	0.000276	0.00336	0.112	0.00540	0.000752
NRMSE	0.0166	0.0581	0.3614	0.0736	0.0274
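
The two simple baselines and the error indices in Table 2 can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not the authors' code: the function names are hypothetical, the moving average is centered with reflected edges, RSE is taken as the relative squared error (residual sum of squares divided by the total sum of squares of the reference), and NRMSE is taken as the square root of RSE, which most columns of Table 2 suggest (for example, 0.0166² ≈ 0.000276). SNRabs is omitted because its definition is not given in this section. The test signal is synthetic, not the measured CO₂ data.

```python
import numpy as np


def moving_average(x, window=40):
    """Centered moving average; edges are handled by reflecting the signal."""
    x = np.asarray(x, dtype=float)
    pad = window // 2
    padded = np.pad(x, (pad, window - 1 - pad), mode="reflect")
    kernel = np.ones(window) / window
    # 'valid' convolution on the padded signal returns len(x) samples.
    return np.convolve(padded, kernel, mode="valid")


def exponential_smoothing(x, alpha=0.5):
    """Simple exponential smoothing: s[i] = alpha*x[i] + (1-alpha)*s[i-1]."""
    x = np.asarray(x, dtype=float)
    out = np.empty(len(x), dtype=float)
    out[0] = x[0]
    for i in range(1, len(x)):
        out[i] = alpha * x[i] + (1.0 - alpha) * out[i - 1]
    return out


def error_metrics(reference, estimate):
    """MSE, RSE and NRMSE under the assumed definitions described above."""
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    residual = reference - estimate
    mse = np.mean(residual ** 2)
    # Assumed definition: residual energy relative to reference variability.
    rse = np.sum(residual ** 2) / np.sum((reference - reference.mean()) ** 2)
    return {"MSE": mse, "RSE": rse, "NRMSE": np.sqrt(rse)}


# Synthetic stand-in signal: 24 h sampled every 5 s (17,280 samples).
rng = np.random.default_rng(0)
t = np.arange(24 * 3600 // 5) * 5.0
clean = 600.0 + 150.0 * np.sin(2.0 * np.pi * t / 86400.0)
noisy = clean + rng.normal(0.0, 5.0, size=t.size)

for name, denoised in [
    ("moving average (window = 40)", moving_average(noisy, 40)),
    ("exponential smoothing (alpha = 0.5)", exponential_smoothing(noisy, 0.5)),
]:
    print(name, error_metrics(clean, denoised))
```

Because the indices are computed against a known clean signal, this kind of comparison only applies to simulated data such as that in Section 4.2; the printed numbers will not reproduce Table 2.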

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
