Next Article in Journal
Novel Higher Order Technologies, Based on Spectral Moduli, for Condition Monitoring of Rotating Machinery
Previous Article in Journal
PDSRS-LD: Personalized Deep Learning-Based Sleep Recommendation System Using Lifelog Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New Hybrid Adaptive Self-Loading Filter and GRU-Net for Active Noise Control in a Right-Angle Bending Pipe of an Air Conditioner

1
Key Laboratory of Intelligent Control and Neural Information Processing, Education Ministry of China, Yanshan University, Qinhuangdao 066004, China
2
School of Electrical Engineering, Yanshan University, Qinhuangdao 066004, China
3
Key Laboratory of Optoelectronic Technology and Systems, Education Ministry of China, Chongqing University, Chongqing 400044, China
4
Department of Electronic and Communication Engineering, North China Electric Power University, Baoding 071003, China
*
Author to whom correspondence should be addressed.
Sensors 2025, 25(20), 6293; https://doi.org/10.3390/s25206293
Submission received: 27 August 2025 / Revised: 30 September 2025 / Accepted: 9 October 2025 / Published: 10 October 2025
(This article belongs to the Section Sensor Networks)

Abstract

The air-conditioner noise in a rehabilitation room can seriously affect the mental state of patients. However, the existing single-layer active noise control (ANC) filters may fail to attenuate the complicated harmonic noise, and the deep recursive ANC method may fail to work in real time. To solve the problem, in a bending-pipe model, a new hybrid adaptive self-loading filtered-x least-mean-square (ASL-FxLMS) and convolutional neural network-gate recurrent unit (CNN-GRU) network is proposed. At first, based on the recursive GRU translation core, an improved CNN-GRU network with multi-head attention layers is proposed. Especially for complicated harmonic noises with more or fewer frequencies than harmonic models, the attenuation performance will be improved. In addition, its structure is optimized to decrease the computing load. In addition, an improved time-delay estimator is applied to improve the real-time ANC performance of CNN-GRU. Meanwhile, an adaptive self-loading FxLMS algorithm has been developed to deal with the uncertain components of complicated harmonic noise. Moreover, to achieve balance attenuation, robustness, and tracking performance, the ASL-FxLMS and CNN-GRU are connected by a convex combination structure. Furthermore, theoretical analysis and simulations are also conducted to show the effectiveness of the proposed method.

1. Introduction

People’s emotions are affected in environments with full-power air conditioners, especially patients in recovery [1]. An active noise control (ANC) system is an effective option to attenuate an air-conditioner noise [2]. Due to increasing noise attenuation needs in various settings, ANC technologies are continuously advancing, particularly concerning adaptive algorithms and network structures [3,4]. Presently, the ability to suppress complex noise has become a primary focus in enhancing ANC system performance [5].
Achieving a high attenuation level for certain complex noise remains an urgent ANC problem [6], especially for the Gaussian noise with multi-sinusoidal components. Due to the superior performance of Filtered-x Least-Mean-Square (FxLMS) methods, the Volterra expansions of FxLMS [7] are first used to reduce Gaussian noise with nonlinear sinusoidal components. Similarly, the Volterra Filtered-x Recursive Least Square (FxRLS) is applied to address weak non-linearity and modeling issues [8]. However, using too many parallel adaptive filters can cause interference in error signals, degrading the robustness of the ANC system. Therefore, an improved functional link neural network (FLANN) based on an artificial neural network (ANN) is proposed to attenuate each frequency independently [9]. Additionally, to enhance both attenuation and robustness [10], a feedback Volterra filter is employed to improve the accuracy of nonlinear path estimation [11].
However, real nonlinear noise always contains multiple frequencies, which are hard to predict with harmonic models. Therefore, the ANC system has introduced a neural network capable of handling random nonlinear problems [12]. Then, a Fuzzy neural network is used to attenuate the random single-frequency noise in ANC research [13]. Additionally, considering the correlations between previous and immediate inputs, a recurrent neural network is also employed in the ANC system [14]. However, the attenuation performance of these methods is lower than that of Volterra-based ANC methods.
Recently, a deep learning ANC (DANC) approach [15] has achieved better attenuation of certain noise than the traditional methods. It utilizes a convolutional layer and a Long Short-Term Memory (LSTM) network to classify and attenuate different kinds of noise. Significantly, the DANC can attenuate particular noises, e.g., babble and tire noises, which cannot be canceled well previously. Based on deep networks, feedback deep ANC structure [16], ANC with deep secondary path estimation [17], and ANC with deep weight vector estimation [18] are developed. However, considering the real-time performance, these deep network-based ANC methods need further improvements to decrease the computational load in one iteration of the ANC process. Although the LSTM network exhibits remarkable predictive capabilities in ANC application, the gate recurrent unit (GRU) performs a lower computational load [19]. Hence, the GRU-Net also has excellent potential in ANC applications. Additionally, real-world noise varies over time and includes known and unknown components [20]. The proportion of these components is not fixed, which can reduce the effectiveness of deep ANC methods. These methods require sufficient noise data to train their weight vectors for optimal performance. When a certain noise occurs briefly or has very low power, it becomes difficult to attenuate complex harmonic noises that contain multiple components [21]. In other words, the DANC method delivers exceptional attenuation mainly for the noise it has been trained on.
Actually, it can be seen that feedforward adaptive filters and deep neural networks have advantages and disadvantages. Hence, hybrid feedforward adaptive filters and deep neural networks ANC method are developed to balance the attenuation and generalization of ANC performance [22,23]. In particular, adaptive filters can handle complicated harmonic noise with uncertain components, while deep networks are suitable for periodic signals with certain components. Moreover, the convex combination is a good choice because the signal-to-noise ratio (SNR) of both certain and uncertain noise is difficult to estimate [24]. On the other hand, based on [18], a selective hybrid adaptive filter and deep network database algorithm are developed in [25], which provides another hybrid control structure to balance the attenuation and generalization performance. However, compared with the convex combined methods, the hybrid structure in [25] only has one part to control the weight vector. When certain and uncertain noise occur at nearly the same level, the method in [25] needs to select one controller from the adaptive filter and deep network database, which may fail to achieve good generalization performance.
As discussed above, to improve the attenuation and generalization performance simultaneously in different environments, a new hybrid adaptive self-loading FxLMS (ASL-FxLMS) and CNN-GRU structure is proposed for complicated harmonic noise control. Firstly, the CNN and GRU nets are serial-connected to achieve a good attenuation level and competitively low computing load for training certain complicated harmonic noise [15]. Then, the multi-head attention layer [26] is introduced to improve the tracking performance for complicated harmonic single frequencies. Moreover, an improved phase estimation is applied to estimate the time delay of the proposed CNN-GRU. On the other hand, inspired by the structure in [27] and the diagonal loading concept in [28], an ASL-FxLMS algorithm is developed for uncertain complicated harmonic noise. Furthermore, the ASL-FxLMS and CNN-GRU are connected by a convex combination structure [29]. This design allows both subsystems to produce outputs simultaneously. If one subsystem’s tracking performance and accuracy are insufficient, the other can compensate, leveraging the strengths of both subsystems. The contributions of each subsystem’s output are set based on amplitude, sustained amplitude, and SNR of the periodic and complex harmonic noise. This arrangement enables the combined system to balance attenuation, robustness, and generalization in tracking performance, potentially overcoming the synchronization output issues of the two subsystems [25]. Additionally, both certain and uncertain noises are used to evaluate the performance of the proposed method. To show a clear presentation of the proposed hybrid net, the novelty works are shown as follows:
  • Inspired by the diagonal loading concept in [28], the adaptive self-loading FxLMS (ASL-FxLMS) is a newly proposed method.
  • The CNN-GRU with attention mechanism is a new deep recursive neural network to be first applied in the ANC system, and a time-delay estimator is improved based on the reference [30].
  • The framework combines ASL-FxLMS and CNN-GRU for complex noise environments, including both stable and complicated harmonic noise, has been developed.
Based on the references and previous discussions, the rest of the paper is organized as follows. At first, the hybrid structure of ASL-FxLMS and CNN-GRU-Net is illustrated in Section 2. Then, the CNN-GRU subsystem with multi-head attention and time-delay estimator is given in Section 3. In Section 4, the details and derivations of the ASL-FxLMS subsystem are illustrated. Moreover, in Section 5, numerical simulations and recording noise experiments are carried out to verify the superior performance of the proposed method. Finally, conclusions are given in Section 6.

2. Bending-Pipe Model of an Air Conditioner and the New Hybrid Active Noise Control Structure

2.1. Bending-Pipe Model of an Air Conditioner

In a rehabilitation room, when the air conditioner operates in variable frequency mode or at maximum power, the noise generated by the air conditioner is non-negligible. In addition, the brief diagram of the air-conditioner active noise control (ANC) system is shown in Figure 1. This noise contains complex harmonic frequencies and broadband time-varying noise, making it difficult for traditional noise reduction methods to address. Hence, based on [31], the Helmholtz equation is used to derive the propagation characteristics of sound waves in a curved pipe. Considering the mass conservation equation as
ρ 0 + ρ ˜ t + ρ 0 + ρ ˜ ν = 0 ,
where ρ ˜ is the perturbation of fluid density, ρ 0 is the constant fluid density, t is the time, and ν is the fluid velocity vector. Based on [31], the simplified Euler equations is given as
ρ 0 v t = p ˜ .
where p ˜ is the perturbation of sound pressure. Substituting Equation (2) into (1), we have
2 p ˜ ρ 0 ρ ˜ 2 p ˜ t 2 = 0 .
further reform the p ˜ as the sum of the gradients of a scalar potential ϕ and a vector potential ψ as
p ˜ = ϕ + ψ .
transforming the ϕ and ψ into harmonic forms, we have
ϕ = ϕ r e j ω t ,
and
ψ = ψ r e j ω t .
re-substituting Equations (5) and (6) into Equation (4), the Helmholtz equation can be represented as
2 ϕ r + ψ r + k 2 ϕ r + ψ r = 0 ,
and
k = ω c .
where ω is the angular frequency, k is the wave number, and c is the speed of sound. Using MATLAB to solve Equation (7), the sound pressure distribution of the sound field in the curved pipe can be determined, and the propagation characteristics of the sound wave within the curved pipe can be analyzed. As in ref. [30], after determining the temperature and air density, the ρ 0 is calculated. Then, these values are entered into Equation (1), along with the propagation length r, to obtain Equation (7) and find the solution. Repeat these steps until the secondary path model is established. Input white noise to train the secondary path model and obtain the secondary path parameters for the bent pipe model.

2.2. Hybrid of ASL-FxLMS and CNN-GRU

As mentioned in ref. [15], the deep CNN-LSTM network has great attenuation potential for certain noise that can be recorded. However, uncertain noise is a kind of interference for the deep CNN-LSTM, which may deteriorate its robustness. To solve the problem, a hybrid structure with different types of adaptive networks is proposed as shown in Figure 2. The main purpose of the proposed method is to integrate the complex memory capabilities of deep learning networks with the flexibility of adaptive filters to enhance real-time ANC systems, thereby overcoming issues with mixture noise of stable noise and unstable noise. That is, the real-time single-layer adaptive filter and deep learning network are connected through a convex combination structure. The new system will have the advantages of two subsystems. Moreover, the proposed method integrates embedded systems and computer systems to adapt to practical applications. As shown in Figure 2, the first part is the ASL-FxLMS (Adaptive Self-Loading Filtered-x Least-Mean-Square) subsystem in a blue dashed line. The second part is the CNN-GRU (Convolutional Neural Network–Gated Recurrent Unit) subsystem in a red dashed line. The two subsystems are illustrated in detail in Section 3 and Section 4. In this hybrid structure, ASL-FxLMS is designed to attenuate uncertain noise with strong harmonic characteristics. The CNN-GRU is applied to attenuate certain noise in the time domain with periodical characteristics. Specifically, the hybrid convex combination structure is applied to balance robustness and attenuation performance, especially in a time-varying environment.
For the ASL-FxLMS in the Figure 2, the output y A n of a single-layer adaptive weight vector is represented as Θ n . The updating equation is shown as
Θ n + 1 = Θ n + μ A x f n e A n ,
where μ A is the step size, e A n is the error signal of ASL-FxLMS subsystem, given by
e A n = d ^ n y ^ Af n ,
where y ^ Af n is the filtered signal which is the result of y Af n filtered by the estimated secondary path S ^ z , d ^ n is the estimated reference signal.
As for the CNN-GRU-Net in the red-line part in the Figure 2, the y CG is the output of the CNN-GRU-Net. Moreover, the output after time-delay compensation is y ¯ CG n . Furthermore, after the y ¯ CG n is filtered by the estimated secondary path S ^ z , the filtered signal y ^ CGf n is also calculated. Hence, the error e CG of CNN-GRU for fine-tune process of CNN-GRU is represented as
e CG n = d ^ n y ^ CGf n ,
where the fine-tune uses an adaptive strategy, the additional layer is set before the final full-connection layers.
Then, the output y Hy n of the hybrid structure is represented as
y Hy n = λ n y A n + 1 λ n y ¯ CG n .
where the λ n is shown as
λ n = 1 1 + e σ n ,
and σ n is updated as the same way in ref. [24] as
σ n + 1 = σ n + μ σ 2 e n 1 + e 2 n 2 y ¯ CG n y A n λ n 1 λ n ,
where μ σ is the step size.
Subsequently, as shown in the green-line part in Figure 2, the output y Hy n of the secondary loud-speaker is sent into the real acoustic path. After filtering by the real secondary acoustic path s z , the filtered signal y Hyf n is subtracted by the real reference signal d n at the error sensor. Hence, the real error e n is received by the error sensor.
Meanwhile, to estimate the reference signal and error signal for both ASL-FxLMS and CNN-GRU-Net, the output y Hy n is filtered by the estimated secondary path S ^ z . In addition, the y ^ Hyf n is the filtered signal. Hence, the estimated reference signal d ^ n at the error sensor is represented as
d ^ n = e n + y ^ Hyf n .
Further details of the CNN-GRU subsystem and ASL-FxLMS subsystem are given in the following sections. The main notations of the proposed hybrid ASL-FxLMS and CNN-GRU are shown in Table 1.

3. A CNN-GRU Network with Multi-Head Attention Mechanism

Benefiting from the powerful nonlinear mapping capabilities of neural networks [32,33,34], it has been extensively used in ANC systems to enhance noise reduction performance [35,36]. Nevertheless, real-world challenges such as real-time constraints, acoustic path variations, and shifts like background noise have imposed limitations on its effectiveness [37]. As a result, there remains a critical need to strike a balance between the noise reduction capability and real-time performance within the ANC framework using deep networks. In the proposed work, nonlinear noise means [2], where, compared with the harmonic frequency model, some frequencies are missing, resulting in noise that cannot be accurately modeled by a linear model for all frequencies.

3.1. Pre-Train: The CNN-GRU-Net with Multi-Head Attention

To solve the issue of denoising irregular harmonic frequency noise in practical applications, an improved CNN-GRU was proposed, as shown in Figure 3, to handle a type of noise that contains multiple single-frequency noises and has unclear harmonic frequency components [2]. In Figure 3, the complex structure of the CNN-GRU network is depicted through a series of carefully coordinated steps.
(I) Input data: Divide the time series into multiple groups. Apply the Fourier transform to each group. Extract the necessary frequency information from the noise.
(II) Data compression: Use a CNN network to classify different frequency groups, and apply network parameter settings to filter out infrequent frequency information.
(III) Encoding-GRU-decoding: The main structure of the GRU network encodes essential frequency group information and makes predictions. Compared to the LSTM in [15], it has a simpler design and lower computational requirements. After prediction, the result is decoded to extract the frequency group information that needs to be tracked.
(IV) Single-frequency noise in real-world applications can result from factors like mechanical vibration, material rigidity, and different populations. As a result, the single-frequency component in actual scenarios will be much higher than in theoretical analysis and simulations. Based on the multi-head attention mechanism in [26], the proposed CNN-GRU will reduce the number of frequencies needing attention and enhance the noise reduction capacity of the main frequency group. More specifically, the detailed processes of the multi-head attention layer are shown in Figure 4, where the K, Q, and V represent Key, Query, and Value, respectively. In the step 1, the similarity of K and Q is calculated as
Sum = Q · K .
then, the vector Sum is normalized by the SoftMax function as
W = Softmax Sum ,
where
Sum = S u m 1 , S u m 2 , , S u m i , , S u m I ,
and
W = w 1 , w 2 , , w i , , w I ,
and I represents the total number of frequencies. While the output of the multi-head attention layer can be represented as
Attention Output = W · V .
(V) Decompressing data and reconstructing time series: In the final stage, decompression and output data processing help convert the signal from the abstract frequency domain back to the concrete time domain. Here, fine-tuning methods are introduced in the final fully connected layer to improve the system’s responsiveness and adaptability in real-time scenarios.

3.2. Real-Time Control: Improved Time-Delay Estimator

The proposed CNN-GRU needs noise data and enough training time to achieve noise tracking. However, a phase error still exists [15], which deteriorates the ANC robustness. More exactly, in the real acoustic environment, the time delay will change with time. Hence, an improved time-delay estimator is applied based on the time-delay estimator in ref. [30]. And the CNN-GRU network is pre-trained, which means it may fail to track the complicated harmonic delay. Thus, based on this characteristic, an auxiliary pre-Fourier analyzer and an Error-Fourier analyzer are combined with the phase tracking filter as in Figure 5. The three parts of the additional phase tracking filter are illustrated as follows.

3.2.1. Pre-Fourier Analyzer

The Pre-Fourier analyzer is shown in green parts in Figure 5. It is used to extract the time-domain signal with the strongest single-frequency noise over a period of time. At first, the x n , which is from the noise source, is input as a reference signal for the Pre-Fourier analyzer. Simultaneously, the x n is put into a time sequence, which is named as Buffer in the Figure 5. After 256 point FFT (Fast Fourier Transform), the single-frequency noise with max-power is signed to provide reference ω i . More exactly, the reference ω will change with time, and several single-frequency noises are possible to be set as the reference frequency. Thus, the reference ω i is used rather than ω to show that the reference frequency will change with time. Based on the above, the error e PFa n of Pre-Fourier analyzer is given as
e PFa n = x n a ^ c , i n x a i n + b ^ c , i n x b i n ,
where x a i n represents for cos ω i n and x b i n represents for sin ω i n . Moreover, using the LMS (Least-Mean-Square) algorithm to update the weight vector a ^ c , i and b ^ c , i as
a ^ c , i n + 1 = a ^ c , i n + μ PFa x a i n e PFa n
and
b ^ c , i n + 1 = b ^ c , i n + μ PFa x b i n e PFa n ,
respectively. μ PFa is the step size. Furthermore, the output y PFa n of Pre-Fourier analyzer is given as
y PFa n = a ^ c , i n x a i n + b ^ c , i n x b i n .

3.2.2. Error-Fourier Analyzer

In the Error-Fourier analyzer process, which is shown as the red part in Figure 5, the reference ω i is provided by the Pre-Fourier analyzer process. Moreover, the calculation formulas for the Error-Fourier and Pre-Fourier analyzer are similar in principle. Hence, the output and updating equations of the Error-Fourier analyzer process are as follows. At first, the error of e EFa n of Error-Fourier analyzer is given as
e EFa n = e CG n a ^ e , i n x a i n + b ^ e , i n x b i n .
and the updating equations is given as
a ^ e , i n + 1 = a ^ e , i n + μ EFa x a i n e EFa n ,
b ^ e , i n + 1 = b ^ e , i n + μ EFa x b i n e EFa n ,
where μ EFa is the step size in Error-Fourier analyzer. In addition, the output y EFa n of Error-Fourier analyzer is given as
y EFa n = a ^ e , i n x a i n + b ^ e , i n x b i n .

3.2.3. Phase Tracking Filter

Although the deep network can learn more noise characteristics than single adaptive filters using the reference and error signal, it takes a longer time to converge, which leads to phase error. To solve this problem, a repetitive control algorithm is introduced to achieve optimal phase tracking as the blue part in Figure 5. The output y PFa n of the Pre-Fourier analyzer is input into a buffer and changes as y PFa n . Similarly, the y EFa n is input into a buffer and changes as y EFa n . Then the error e Ptf n of phase tracking filter is given as
e Ptf n = y PFa n y EFa n .
To estimate the time delay of the secondary path, an N m order FIR (Finite Impulse Response) filter is applied. The output is given as
y EFa n = q y EFa n p m = 1 N m l n y IL n m + 1 ,
where q is the amplitude compensation coefficient and p is the time-delay compensation factor, l n is an FIR filter to adjust the time delay, given by
l n = e j 2 π f n ,
where f represents the frequency vector. y IL n is defined as
y IL n = m = 1 N m e Ptf n s ^ n N m + 1 ,
where s ^ n represents the n-th coefficient of the estimated secondary path, and N m represents half-length of the estimated secondary path. Moreover, the y IL n is the vector form of y IL n .

3.3. Convergence Condition Analysis of CNN-GRU Subsystem in ANC Process

Although it is impossible to analyze the steady-state and convergence condition of CNN-GRU step by step in the pre-training process, the convergence condition of the ANC process can be analyzed. As shown in Figure 2, the CNN-GRU has an adaptive fine-tune filter updated by the error signal e CG n . The detailed connection of CNN-GRU and fine-tune filter is shown in Figure 6. As the same function is illustrated in [15], the fine-tune filter is set to modify the output of the deep network. In the paper, the 256-length reference signal is an input block. The CNN-GRU network is trained offline with 80 types of noise. Moreover, a 256-length adaptive filter is introduced as the fine-tune weight vector to adjust the output of CNN-GRU to fit the acoustic environment changes.
In the ANC process, the weight vector Θ CG n of CNN-GRU is fixed. An updating equation of the 256 length fine-tune adaptive filter is given as
Θ Ft n + 1 = Θ Ft n + μ Ft e CG n x n .
where μ Ft is the step size of fine-tune filter, e CG n is the error signal, Θ Ft n is the weight vector of fine-tune filter, and x n represent the vector of x n . Considering the time-delay estimator as shown in the yellow block in Figure 2, the output of GNN-GRU is
d ^ n Θ CG T + Θ Ft T n x f n .
and define the error weight vector Θ ˜ Ft n of fine-tune filter as
Θ ˜ Ft n = Θ CG + Θ Ft _ Opt Θ CG + Θ Ft n .
thus, the error signal e CG n is shown as
e CG n = d ^ n y ¯ CG n = d ^ n y CG n e 2 j π f T Θ ˜ Ft T n x n .
where f is the frequency, T is the time delay. To obtain further steady-state equations [27], taking the Euclidean norm on both sides of Equation (33), we have
E Θ ˜ Ft 2 n + 1 = E Θ ˜ Ft 2 n + μ Ft 2 E e CG 2 n x 2 n + 2 μ Ft E Θ ˜ Ft T n x n e CG n .
To keep robust, limited by the following equations,
E Θ ˜ Ft 2 n + 1 E Θ ˜ Ft 2 n .
further considering the real noise power is not normalized, hence the step size μ Ft should be limited by
0 < μ Ft < 2 e 4 j π f T λ x 2 ,
where λ x 2 is the power of x n . In addition, further limited by
e 4 j π f T 1 .
Equation (39) can be rewritten as
0 < μ Ft < min 2 e 4 j π f T λ x 2 = 2 λ x 2 .

4. Adaptive Self-Loading FxLMS Subsystem

To deal with air-conditioner noise with power changes, a clear method is to set another parallel adaptive filter [38]. However, the CNN-GRU has superior attenuation performance for harmonic and periodical noise, which may lead to lower energy of stable harmonic signals in error signals. This will influence the robustness of the parallel adaptive filter. Hence, the parallel filter should have good robustness even if there is no periodic noise in the error signal. Based on all mentioned, as shown in Figure 7, an adaptive self-loading filtered-x least-mean-square (ASL-FxLMS) algorithm is proposed in this section for another parallel adaptive filter.

4.1. The New Adaptive Self-Loading FxLMS with Online Secondary Path Estimation Strategy

Considering the parallel CNN-GRU mentioned in Section 3, sometimes there will be no periodic components in the error signal, which will deteriorate the tracking performance of the FxLMS. As mentioned above, based on the online estimation structure in [27] and the diagonal loading method in [28], a self-loading filter is introduced to improve the robustness of the ANC system.
As shown in the dark green part in Figure 7, the proposed adaptive self-loading filter uses the online estimation structure to inject a known sinusoidal wave into the reference signal x n as the input for the weight vector Θ n . In this way, the self-loading filter stabilizes the tracking performance of the filter by forcibly inputting maximum sinusoidal periodical signals. In the initiate state, the reference signal x n and output y Slf n of the adaptive self-loading filter is added as input time-series in the reference input array x n as
x n = x n + y Slf n , x n 1 + y Slf n 1 , , x n m x + 1 + y Slf n m x + 1 ,
where m x is the length of input array x n . Then, filtered by the single-layer adaptive filter Θ n , the output y A n of Θ n is given as
y A n = Θ T n x n .
moreover, filtered by the secondary path s n , the y ^ Af n is represented as
y ^ Af n = m = 1 m S y A n y Slf n N OS n s n m ,
where
N OS n = N n e A n 1 ,
and s n is the coefficient of secondary path FIR filter. In addition, m S is the length of the secondary path FIR filter. Then, the error e A of single-layer filter is
e A n = d ^ n y ^ Af n .
Based on the above equations and steady-state assumption in [27], using the FxLMS method to update the weight vectors, the updating equations of the noise control filter Θ n is given as
Θ n + 1 = Θ n + μ A e A n x f n ,
where μ A is the step size of the noise control filter.
Moreover, for the self-loading filter, the output y Slf n can be represented as
y Slf n = a ^ s , i n x a i n + b ^ s , i n x b i n ,
and hence the error e Slf n is represented as
e Slf n = x n y Slf n .
meanwhile, the updating equation of the self-loading filter is represented as
a ^ s , i n + 1 = a ^ s , i n + μ Slf x a i n e Slf n
and
b ^ s , i n + 1 = b ^ s , i n + μ Slf x b i n e Slf n .
Furthermore, for the filter of online secondary path estimation, the updating equations of the weight vector are represented as
S ^ n + 1 = S ^ n + μ OS e OS n N OS n ,
where
e OS n = e A n m = 1 m S N OS n s ^ n m ,
and s ^ n is the coefficient of weight vector S ^ n .

4.2. Convergence Condition Analysis

To ensure the robustness of the ASL-FxLMS algorithm, a steady-state analysis must be performed that provides proper step-size ranges. Assuming that the Θ Opt is the optimal weight vector in steady states. In addition, the approximation estimation of d ^ n is given as follows:
d ^ n Θ Opt T x f n .
then the error weight vector Θ ˜ n between optimal weight vector Θ Opt and real-time weight vector Θ n is represented as
Θ ˜ n = Θ Opt Θ n .
and as shown in Equation (46), the error signal e A n of the noise control filter of ASL-FxLMS can also be estimated by
e A n = d ^ n y ^ Af n Θ ˜ T n x f n .
by subtracting the optimal weight vector Θ Opt on both side of Equation (47), we obtain
Θ ˜ n + 1 = Θ ˜ n + μ A e A n x f n .
To obtain further steady-state equations, taking the Euclidean norm on both sides of Equation (57), we have
Θ ˜ 2 n + 1 = Θ ˜ 2 n + μ A 2 e A 2 n x f 2 n + 2 μ A Θ ˜ n e A n x f n .
Assuming that the weight vector changes slowly, meanwhile, taking expectations on both sides of Equation (58), we obtain
E Θ ˜ 2 n + 1 = E Θ ˜ 2 n + μ A 2 E e A 2 n x f 2 n + 2 μ A E Θ ˜ T n x f n e A n .
To keep robust, limited by the following equations,
E Θ ˜ 2 n + 1 E Θ ˜ 2 n .
the step size μ A should be limited by
0 < μ A 2 .
Considering that the real noise power is not normalized. Hence Equation (61) should be rewrite as
0 < μ A min 2 , 2 λ x 2 ,
where λ x 2 is the power of input reference signal x n .
Moreover, by similar calculations, the range of step size μ Slf is represented as
0 < μ Slf 2 λ x , i 2 .
where the λ x , i 2 is the power of the maximum power frequency noise. Furthermore, the range of step size μ OS is represented as
0 < μ OS 2 λ N 2 λ x 2 ,
λ N 2 is the power of injecting Gaussian white noise.

5. Simulations

Several simulations are conducted in this section to verify the attenuation performance for the complicated harmonic noise. In addition, standard FxLMS [2], FLANN [9], and DANC [15] algorithms are introduced as comparisons of the proposed hybrid ASL-FxLMS and CNN-GRU. Moreover, the step size of FxLMS, FLANN, and ASL-FxLMS is set as 0.002 . More exactly, the structure of CNN-GRU with multi-head attention is shown in Figure 8. In the same way as the parameters shown in [15], the input size and output size are defined as F e a t u r e M a p s × T i m e S t e p s × F r e q u e n c y N u m b e r s , the hyperparameter is defined as K e r n e l S i z e × S r i d e s × O u t p u t C h a n n e l . In addition, the parameters of the proposed CNN-GRU are shown in Table 2. The computational cost of each mentioned algorithm in one step is given in Table 3. The main operation of the proposed hybrid CNN-GRU and ASL-FxLMS is shown in Table 4.
The noise-92 datasets are used to train the CNN-GRU model, like the operation in [15]. The following simulations involve a changing noise environment and the changing acoustic paths as shown in Figure 9 and Figure 10. The hybrid embedded and computer ANC system framework is tested here as shown in Figure 11. More precisely, the computation of a convex combination is set in a Field Programmable Gate Array (FPGA). Meanwhile, the FPGA updates the adaptive weight vectors. The computing of CNN-GRU is set in the CPU and GPU of a computer. Moreover, the PCI-E (Peripheral Component Interconnect Express) connects the embedded and computer systems. It should also be noted that the definition of the SNR (Signal-to-Noise Ratio) is used in the simulations.

5.1. Performance of CNN-GRU

In this subsection, to test the performance in a complicated harmonic noise environment, the CNN-GRU is tested with the noise of different SNRs. To provide a fair comparison, with the same noise source and acoustic environment, average noise reduction (ANR) curves are used to evaluate the performance of the mentioned FxLMS, FLANN, DANC, and proposed CNN-GRU. Moreover, for the DANC and CNN-GRU, the target noise has three parts: the record training signal, the complicated harmonic signal, and Gaussian noise. Specifically, there is an unknown complicated harmonic noise that needs to be predicted first by the DANC and CNN-GRU-Net.
The averaged noise reduction (ANR) is used to assess the performance of different methods, and error frequency analysis is also provided. The ANR is calculated as
ANR n = 20 log A e n A d n ,
where
A e n = γ A e n 1 + 1 γ e n ,
and
A d n = γ A d n 1 + 1 γ d n .
and γ = 0.999 .
As shown in the Figure 12, the SNR is set as 25 dB. The FxLMS achieves 4.6 dB noise attenuation at 10 s. In addition, the FLANN achieves 11.3 dB noise attenuation at 10 s. As for the deep learning method DANC and CNN-GRU, the DANC achieves 36.7 dB and the CNN-GRU 31.7 dB attenuation level at 10 s. More exactly, as shown in the spectrum analysis in Figure 12, the FxLMS could not track complicated noise well. Compared with the FxLMS, the FLANN achieves a better attenuation level, especially for single-frequency noise. In the same simulation environment, although the DANC and CNN-GRU achieve a steady state at 0.13 s in the initial tracking, the DANC has a better attenuation level and smaller steady-state error. Because the CNN-GRU has fewer computing parameters than the DANC, if the CNN-GRU has more GRU layers, it could achieve better attenuation and robust performance.
When SNR decreases to 15 dB, as shown in Figure 13, the FxLMS, FLANN, DANC, and proposed CNN-GRU achieve 7.2 dB, 14.7 dB, 27.4 dB, and 31.3 dB noise attenuation at 10 s, respectively. With the decrease of SNR, the power of the training signal also decreases. Hence, the attenuation level of DANC decreased. However, for the FxLMS and FLANN, the power of complicated harmonic noise is decreasing, and the attenuation performance becomes better. The proposed CNN-GRU focuses on all the main components of the noise. Thus, the CNN-GRU still maintains a good attenuation performance. Meanwhile, the robustness of the steady state still needs to be improved because the multi-head attention filter has some noise characteristics. Moreover, as for spectrum performance in Figure 13, the FxLMS could not deal with single frequencies well. Meanwhile, the FLANN has better attenuation performance for single-frequency noise because it has multi-layer Fourier analyzers. However, as the proportion of training noise energy in the target noise decreases, the single-frequency attenuation performance of DANC and CNN-GRU decreases.
When SNR decreases to 5 dB, as shown in Figure 14, the FxLMS, FLANN, DANC, and proposed CNN-GRU achieve 9.4 dB, 17.3 dB, 11.9 dB, and 16.7 dB noise attenuation at 10 s, respectively. Because the power of complicated harmonic noise decreases, the FLANN has the best attenuation performance. In addition, compared spectrum performance in Figure 14, the FLANN still has the best single-frequency attenuation performance. However, the frequency information must be known for the FLANN; otherwise, frequency shift and frequency estimation errors will lead the FLANN algorithm to diverge. Compared the performance of FxLMS, FLANN, DANC, and CNN-GRU in Figure 12, Figure 13 and Figure 14, the proposed CNN-GRU has the best performance in when SNR > 5 dB.
As shown in Figure 12, Figure 13 and Figure 14, residual harmonic noise persists after denoising using DANC and CNN-GRU, and this phenomenon becomes more significant as the signal-to-noise ratio decreases. This occurs because both DANC and CNN-GRU are pre-trained with noisy recordings and do not consider different signal-to-noise ratios. Consequently, as the signal-to-noise ratio drops, the noise frequency components added in the simulation show more randomness than the noisy recordings, which relates to the added Gaussian white noise.

5.2. Performance of ASL-FxLMS

To test the ASL-FxLMS (Adaptive Self-Loading FxLMS) algorithm separately, the FxLMS [27] and FxNLMS [38] are set as comparisons. To provide a fair comparison, the FxLMS, FxNLMS, and ASL-FxLMS are set with different step sizes to achieve nearly the same attenuation performance in steady state. The engine and babble noise are combined as the signal; meanwhile, 15 dB Gaussian noise is also added. As shown in Figure 15, the FxLMS, FxNLMS, and ASL-FxLMS achieve nearly 15 dB attenuation level in steady state. The FxLMS achieves the best initial tracking rate of the three methods; it reaches steady state at 2.3 s. In particular, the FxNLMS often has a better initial tracking rate than the FxLMS. However, due to the presence of strong complicated harmonic noise in the reference noise, its power fluctuates. Hence, its step size changes in a big range, which decreases the tracking rate, which leads the FxNLMS to reach a steady state at 3.2 s. While the FxLMS uses a competitive big step size, which is set by repeated experiments. That is why the FxLMS has better initial tracking than the FxNLMS in this simulation. As for the proposed ASL-FxLMS, it reaches steady state at 7.8 s. The adaptive self-loading filter decreases the tracking performance because a serial Fourier analyzer is introduced to improve its robustness in a complicated harmonic noise environment.
On the other hand, the robustness performance is very important for the ASL-FxLMS. It needs to work with reference and error interference noise. More exactly, the reference interference noise usually occurs and disappears very fast in the reference signal, which is part of the reference noise and is hard to track. In addition, error interference is usually combined with random noise and burst noise in the error signal. Especially when ASL-FxLMS works in parallel with the CNN-GRU-Net, the periodical component may be almost attenuated by the CNN-GRU. Then, the Gaussian noise will become fatal to the robustness of the traditional adaptive filter. Hence, the FxLMS, FxNLMS, and the proposed ASL-FxLMS need to be tested in the acoustic environment with reference and error interference to verify their robust performance.
The reference interference, which includes whistling and Gaussian noise, is introduced from 8.6 s to 10.5 s as shown in Figure 16. The FxLMS diverges at 8.6 s because it cannot deal with impulsive interference. The FxNLMS achieves robust tracking when the reference interference occurs and when it disappears. As for the ASL-FxLMS, it remains robust from 8.6 s to 12 s.
Moreover, as shown in Figure 17, the error interference will also decrease the performance of ANC algorithms. Thus, the whistling and Gaussian noise interference are added to the error signal from 8.6 s to 10.5 s. Meanwhile, the FxLMS diverges at 8.6 s. Although the FxNLMS achieves robust tracking, it fails to reach the best attenuation level from 10.5 s to 12 s. In comparison, the proposed ASL-FxLMS still achieves robust tracking even when the interference is added, which gives further proof that the ASL-FxLMS has better robust performance than the FxLMS and FxNLMS in the simulations.

5.3. Performance of Hybrid ASL-FxLMS and CNN-GRU

To balance the real-time, robustness, and attenuation performance, the CNN-GRU and ASL-FxLMS are combined in parallel. Moreover, the hybrid of CNN-GRU and ASL-FxLMS net is briefly renamed as the Hybrid net in simulations in this subsection. The reference and error interference are introduced to test the performance of the Hybrid net. Furthermore, to provide a fair comparison, the parameters of ASL-FxLMS, CNN-GRU, and the Hybrid net are selected by multiple experiments.
As shown in Figure 18, the reference interference is introduced from 8.6 s to 10 s as the same simulation results in Section 5.2. The ASL-FxLMS needs two adaptive self-loading sine periods to recover the normal tracking rate. The CNN-GRU and the Hybrid net perform similarly in steady-state attenuation level, achieving 26.3 dB and 27.2 dB, respectively. However, compared with the Hybrid net, the CNN-GRU has a bigger misleading peak during the occurrence of reference interference. More exactly, the average ANR of CNN-GRU is 3.0 dB from 8.6 s to 10 s, while the average ANR of the Hybrid net is 21.4 dB from 8.6 s to 10 s. Actually, this advantage benefits from the convex combination function λ n as shown in Equation (13), which can effectively integrate the advantages of ASL-FxLMS and CNN-GRU.
When the error interference is introduced as shown in Figure 19, during the occurrence of error interference, the ASL-FxLMS, CNN-GRU, and Hybrid net have 10.6 dB, 3.4 dB, and 20.3 dB average misleading peak. Moreover, the Hybrid net has the best 27.4 dB attenuation level, followed by CNN-GRU with 25.8 dB. The mentioned also proves the Hybrid net has a balanced performance of competitive good tracking, high attenuation level, and robust steady state.

5.4. Time-Series and Frequency ANC Analysis for Various Types of Noise

This section evaluates the proposed algorithm’s noise reduction performance across different noise types. Specifically, speech noise, truck noise, and building noise are used as examples to assess its capabilities. Additionally, we analyze the average noise reduction results for these three noise categories.

5.4.1. Adaptability of the Proposed Algorithm to Various Types of Noise

To verify the adaptability of the proposed algorithm to various noises, noise clips of speech, truck, and building noise were selected. The analysis in both the time and frequency domains before and after ANC noise reduction was displayed. As shown in Figure 20, Figure 21 and Figure 22, speech, truck, and building noise are used to test the attenuation capability of the proposed method. The original time-series and spectrum are shown with a blue line, and the time-series and spectrum after the ANC process are represented by a red line.
In Figure 20a, the maximum normalized amplitude is 0.52 , while the maximum normalized amplitude after the ANC process is 0.18 . As shown in Figure 20b, the noise within 1500 Hz has an average noise reduction of 22.83 dB relative to the initial speech noise. However, the noise reduction effect near 420 Hz on the spectrum is poor, and the noise level increases by 6.73 dB compared to the original speech noise.
In Figure 21a, the maximum normalized amplitude is 0.41 , while the maximum normalized amplitude after the ANC process drops to 0.19 . As shown in Figure 21b, the noise within 1500 Hz is reduced by an average of 14.59 dB compared to the truck noise. However, the noise reduction near 870 Hz on the spectrum is ineffective, and the noise level increases by 13.22 dB compared to the original truck noise.
In Figure 22a, the maximum normalized amplitude is 0.47 , while after the ANC process, it decreases to 0.21 . As shown in Figure 22b, the noise below 1500 Hz has an average noise reduction of 7.33 dB compared to the initial building noise. Meanwhile, the power around 710 Hz in the spectrum is 13.22 dB higher than the original building noise.
The above simulation results show that although the noise reduction ability of the proposed method varies with different noises, the effective noise reduction capability of the algorithm has been confirmed in both the time domain and the frequency domain.

5.4.2. Average Noise Reduction Analysis

To further verify the algorithm’s noise reduction ability for different noises and compare multiple types of ANC algorithms under the same conditions, the simulations shown in Figure 23, Figure 24 and Figure 25 are added to demonstrate the advantages of the proposed method. To ensure a fair comparison, all ANC algorithms are adjusted to their optimal parameters under non-misadjustment conditions to achieve the maximum noise reduction level.
As shown in Figure 23, speech noise is used as the target noise, and FxLMS [39], FxRLS [40], FxRMC [41], FxGMN [38], GFANC-Kalman [42], DANC [15], and the proposed method are employed as ANC noise reduction algorithms. The FxLMS reaches steady state at 3.3 s and achieves a maximum attenuation level of 5.2 dB. The FxRLS reaches steady state at 2.7 s and achieves a maximum attenuation level of 10.6 dB. The FxRMC reaches steady state at 3.5 s and achieves a maximum attenuation level of 13.7 dB. The FxGMN reaches steady state at 8 s and achieves a maximum attenuation level of 10.5 dB. The GFANC-Kalman reaches steady state at 1.4 s and achieves a maximum attenuation level of 16.1 dB. The DANC reaches steady state at 0.17 s and achieves a maximum attenuation level of 22.9 dB. The proposed method reaches steady state at 0.17 s and achieves a maximum attenuation level of 23.3 dB.
As shown in Figure 24, for the truck noise, the FxLMS reaches steady state at 2.2 s and attains a maximum attenuation of 3.1 dB. The FxRLS reaches steady state at 3.5 s with a maximum attenuation of 8.2 dB. The FxRMC reaches steady state at 3.8 s and achieves 12.4 dB maximum attenuation. The FxGMN reaches steady state at 7 s with a maximum attenuation of 8.1 dB. The GFANC-Kalman reaches steady state at 0.8 s with a maximum attenuation of 12.1 dB. The DANC reaches steady state at 0.9 s and achieves 18.1 dB maximum attenuation. However, its robust performance still needs improvement. The proposed method reaches steady state at 0.7 s with a maximum attenuation of 18.7 dB.
As shown in Figure 25, for building noise, the FxLMS reaches steady state at 2.1 s and achieves a maximum attenuation of 1.3 dB. The FxRLS reaches steady state at 4 s with a maximum attenuation of 4.2 dB. The FxRMC reaches steady state at 4 s with a maximum attenuation of 8.7 dB. The FxGMN reaches steady state at 9 s and achieves a maximum attenuation of 3.2 dB. The GFANC-Kalman reaches steady state at 4 s with a maximum attenuation of 8.8 dB. The DANC reaches steady state at 0.13 s with a maximum attenuation of 10.1 dB. The proposed method reaches steady state at 0.13 s and achieves a maximum attenuation of 13.3 dB. Under the same conditions, the time costs of each mentioned algorithm are listed in Table 5.
In summary, the simulation comparison results show that the FxLMS algorithm [39] has the lowest computational load, but its noise reduction effect on various types of noise is not very effective. Compared with the FxLMS algorithm, the RLS algorithm [40,41] adds a Gaussian kernel function that offers better stability and increases the computational load to improve noise reduction. However, it is necessary to test multiple sets of different parameters to adapt to various noise environments and ensure system stability. The hybrid structure algorithm [38] combines the advantages of the two substructures, but selecting the optimal parameters when dealing with different noise environments remains challenging. The deep network algorithm [15,42] generally achieves higher noise reduction through pre-training. By improving the network structure and integrating methods such as Kalman adaptive filters, noise reduction performance and stability can be significantly enhanced. The simulation results indicate that the proposed algorithm has a computational load similar to that of the deep network method but offers better stability and noise reduction performance. It also demonstrates the superiority of the hybrid filter structure in the ANC system. However, the deep network algorithm still needs to reduce computational load further to meet practical application requirements.

6. Conclusions

Based on the bending-pipe model of an air conditioner, a hybrid CNN-GRU and adaptive self-loading FxLMS (ASL-FxLMS) structure is proposed to achieve more efficient complex harmonic noise attenuation with an ANC system. The CNN-GRU network provides a high attenuation level, especially for various types of noise that can be trained. While the CNN-GRU network attains significant noise reduction, it may leave residual harmonic noise when the signal-to-noise ratio is low. Simultaneously, the parallel setup of ASL-FxLMS can robustly track uncertain, complex harmonic noise. However, the noise reduction performance of the ASL-FxLMS is limited for signals with many single-frequency and uncertain harmonic frequency components. Moreover, the time-domain and frequency-domain simulation analysis results have demonstrated that the proposed hybrid network balances tracking, noise reduction, and stability performance. Furthermore, in future practical applications, such as when air conditioning noise mixes with mechanical noise, building noise, and outdoor speech noise, the proposed hybrid ASL-FxLMS and CNN-GRU network could deliver even better noise reduction performance.

Author Contributions

Conceptualization, W.Z., X.C. and L.L.; methodology, W.Z., P.X. and L.L.; software, W.Z. and Z.G.; validation, W.Z., Z.G. and L.L.; formal analysis, W.Z. and Z.G.; investigation, W.Z. and Z.G.; resources, Z.B. and L.L.; data curation, W.Z., Z.G. and L.L.; writing—original draft preparation, W.Z., Z.G. and L.L.; funding acquisition, W.Z., L.L. and Z.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grants 12304534, 12474471, 12004058, and 12404545, the China Postdoctoral Science Foundation under Grant 2020M673128, Chongqing Natural Science Foundation General Project: CSTB2023NSCQ-MSX0939, and Hebei Natural Science Foundation under Grant A2024203012.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Chan, S.; Zennaro, M.; Pietrosemoli, E.; Rainone, M.; Oktavianti, I.; Nopphawan, P. The Blindspot of Electromagnetic Interference Affecting Sensitive Medical Devices at Hospitals and Healthcare Facilities. In Proceedings of the 2020 10th Annual Computing and Communication Workshop and Conference (CCWC), Las Vegas, NV, USA, 6–8 January 2020; pp. 449–457. [Google Scholar]
  2. Lu, L.; Yin, K.-L.; de Lamare, R.C.; Zheng, Z.; Yu, Y.; Yang, X.; Chen, B. A survey on active noise control in the past decade—Part I: Linear systems. Signal Process. 2021, 183, 108039. [Google Scholar] [CrossRef]
  3. Kajikawa, Y.; Gan, W.S.; Kuo, S.M. Recent advances on active noise control: Open issues and innovative applications. APSIPA Trans. Signal Inf. Process. 2012, 1, e3. [Google Scholar] [CrossRef]
  4. Lam, B.; Gan, W.S.; Shi, D.; Nishimura, M.; Elliott, S. Ten questions concerning active noise control in the built environment. Build. Environ. 2021, 200, 107928. [Google Scholar] [CrossRef]
  5. Jiang, J.; Li, Y. Review of active noise control techniques with emphasis on sound quality enhancement. Appl. Acoust. 2018, 136, 139–148. [Google Scholar] [CrossRef]
  6. Luo, L.; Sun, J.; Huang, B.; Quoc, D.D. Efficient combination of feedforward and feedback structures for nonlinear narrowband active noise control. Signal Process. 2016, 128, 494–503. [Google Scholar] [CrossRef]
  7. George, N.V.; Panda, G. Advances in active noise control: A survey, with emphasis on recent nonlinear techniques. Signal Process. 2013, 93, 363–377. [Google Scholar] [CrossRef]
  8. Tan, L.; Jiang, J. Adaptive Volterra filters for active control of nonlinear noise processes. IEEE Trans. Signal Process. 2001, 49, 1667–1676. [Google Scholar] [CrossRef]
  9. Luo, L.; Bai, Z.; Zhu, W.; Sun, J. Improved functional link artificial neural network filters for nonlinear active noise control. Appl. Acoust. 2018, 135, 111–123. [Google Scholar] [CrossRef]
  10. Zhang, B.; Ming, Z.M.; Huang, Y.L.; Xiang, Y. Research on cabin noise prediction and experimental verification of an oil tanker within full frequency. Appl. Acoust. 2022, 192, 108746. [Google Scholar] [CrossRef]
  11. Luo, L.; Zhu, W.; Xie, A. A novel acoustic feedback compensation filter for nonlinear active noise control system. Mech. Syst. Signal Process. 2021, 158, 107675. [Google Scholar] [CrossRef]
  12. Sicuranza, G.L.; Carini, A. A generalized FLANN filter for nonlinear active noise control. IEEE Trans. Audio Speech Lang. Process. 2011, 19, 2412–2417. [Google Scholar] [CrossRef]
  13. Zhang, Q.Z.; Gan, W.S.; Zhou, Y.L. Adaptive recurrent fuzzy neural networks for active noise control. J. Sound Vib. 2006, 296, 935–948. [Google Scholar] [CrossRef]
  14. Bambang, R.T. Adjoint EKF learning in recurrent neural networks for nonlinear active noise control. Appl. Soft Comput. 2008, 8, 1498–1504. [Google Scholar] [CrossRef]
  15. Zhang, H.; Wang, D. Deep ANC: A deep learning approach to active noise control. Neural Netw. 2021, 141, 1–10. [Google Scholar] [CrossRef]
  16. Cha, Y.J.; Mostafavi, A.; Benipal, S.S. DNoiseNet: Deep learning-based feedback active noise control in various noisy environments. Eng. Appl. Artif. Intell. 2023, 121, 105971. [Google Scholar] [CrossRef]
  17. Chen, D.; Cheng, L.; Yao, D.; Li, J.; Yan, Y. A secondary path-decoupled active noise control algorithm based on deep learning. IEEE Signal Process. Lett. 2021, 29, 234–238. [Google Scholar] [CrossRef]
  18. Shi, D.; Lam, B.; Ooi, K.; Shen, X.; Gan, W.S. Selective fixed-filter active noise control based on convolutional neural network. Signal Process. 2022, 190, 108317. [Google Scholar] [CrossRef]
  19. Ahmed, M.R.; Islam, S.; Islam, A.M.; Shatabda, S. An ensemble 1D-CNN-LSTM-GRU model with data augmentation for speech emotion recognition. Expert Syst. Appl. 2023, 218, 119633. [Google Scholar] [CrossRef]
  20. Lu, L.; Yin, K.L.; de Lamare, R.C.; Zheng, Z.; Yu, Y.; Yang, X.; Chen, B. A survey on active noise control in the past decade—Part II: Nonlinear systems. Signal Process. 2021, 181, 107929. [Google Scholar] [CrossRef]
  21. Zhong, Y.; Chen, X.; Jiang, J.; Ren, F. A cascade reconstruction model with generalization ability evaluation for anomaly detection in videos. Pattern Recognit. 2022, 122, 108336. [Google Scholar] [CrossRef]
  22. Zhu, W.; Luo, L.; Christensen, M.G.; Sun, J. A new feedforward hybrid active control system for attenuating multi-frequency noise with bursty interference. Mech. Syst. Signal Process. 2020, 144, 106859. [Google Scholar] [CrossRef]
  23. Luo, L.; Zhu, W. Fast-convergence hybrid functional link artificial neural network for active noise control with a mixture of tonal and chaotic noise. Digit. Signal Process. 2020, 106, 102846. [Google Scholar] [CrossRef]
  24. Wang, L.; Chen, K.; Xu, J. Convex combination of the FxAPV algorithm for active impulsive noise control. Mech. Syst. Signal Process. 2022, 181, 109443. [Google Scholar] [CrossRef]
  25. Luo, Z.; Shi, D.; Gan, W.S. A hybrid sfanc-fxnlms algorithm for active noise control based on deep learning. IEEE Signal Process. Lett. 2022, 29, 1102–1106. [Google Scholar] [CrossRef]
  26. Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30. Available online: https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf (accessed on 8 October 2025).
  27. Liu, J.; Xiao, Y.; Sun, J.; Xu, L. Analysis of online secondary-path modeling with auxiliary noise scaled by residual noise signal. IEEE Trans. Audio Speech Lang. Process. 2010, 18, 1978–1993. [Google Scholar]
  28. Wang, X.; Jiang, X.; Chen, Y. An Improved Reduced-Dimension Robust Capon Beamforming Method Using Krylov Subspace Techniques. Sensors 2024, 24, 7152. [Google Scholar] [CrossRef]
  29. Yin, K.L.; Pu, Y.F.; Lu, L. Combination of fractional FLANN filters for solving the Van der Pol-Duffing oscillator. Neurocomputing 2020, 399, 183–192. [Google Scholar] [CrossRef]
  30. Zhou, Y.L.; Yin, Y.X.; Zhang, Q.Z. An optimal repetitive control algorithm for periodic impulsive noise attenuation in a non-minimum phase ANC system. Appl. Acoust. 2013, 74, 1175–1181. [Google Scholar] [CrossRef]
  31. Han, T.; Wang, L.; Cen, K.; Song, B.; Shen, R.; Liu, H.; Wang, Q. Flow-induced noise analysis for natural gas manifolds using LES and FW-H hybrid method. Appl. Acoust. 2020, 159, 107101. [Google Scholar] [CrossRef]
  32. Yang, L.; Dai, X.; Wu, L.; Xu, T.; Chen, Z.; Min, Z.; Liao, Y.; Ni, L.; Yao, Y.; Yi, G.; et al. Stress hormone biosynthesis-based genes and lifestyle moderated the association of noise exposure with blood pressure in a cohort of Chinese tobacco factory workers: A cross-sectional analysis. Int. J. Hyg. Environ. Health 2022, 239, 113868. [Google Scholar] [CrossRef] [PubMed]
  33. Khan, D.; Burdzik, R. Measurement and analysis of transport noise and vibration: A review of techniques, case studies, and future directions. Measurement 2023, 220, 113354. [Google Scholar] [CrossRef]
  34. Patel, V.; Cheer, J. A hybrid multi-reference subband control strategy for active noise control headphones. Appl. Acoust. 2022, 197, 108932. [Google Scholar] [CrossRef]
  35. Shi, D.; Gan, W.S.; Shen, X.; Luo, Z.; Ji, J. What is behind the meta-learning initialization of adaptive filter?—A naive method for accelerating convergence of adaptive multichannel active noise control. Neural Netw. 2024, 172, 106145. [Google Scholar] [CrossRef]
  36. Oh, J.Y.; Jung, H.W.; Lee, M.H.; Lee, K.H.; Kang, Y.J. Enhancing active noise control of road noise using deep neural network to update secondary path estimate in real time. Mech. Syst. Signal Process. 2024, 206, 110940. [Google Scholar] [CrossRef]
  37. Tan, K.; Wang, D. Complex spectral mapping with a convolutional recurrent network for monaural speech enhancement. In Proceedings of the ICASSP 2019–2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 12–17 May 2019; pp. 6865–6869. [Google Scholar]
  38. Song, P.; Zhao, H. Filtered-x generalized mixed norm (FXGMN) algorithm for active noise control. Mech. Syst. Signal Process. 2018, 107, 93–104. [Google Scholar] [CrossRef]
  39. Ardekani, I.T.; Abdulla, W.H. Theoretical convergence analysis of FxLMS algorithm. Signal Process. 2010, 90, 3046–3055. [Google Scholar] [CrossRef]
  40. Zeb, A.; Mirza, A.; Khan, Q.U.; Sheikh, S.A. Improving performance of FxRLS algorithm for active noise control of impulsive noise. Appl. Acoust. 2017, 116, 364–374. [Google Scholar] [CrossRef]
  41. Lu, L.; Zhao, H. Active impulsive noise control using maximum correntropy with adaptive kernel size. Mech. Syst. Signal Process. 2017, 87, 180–191. [Google Scholar] [CrossRef]
  42. Luo, Z.; Shi, D.; Shen, X.; Ji, J.; Gan, W.S. Gfanc-kalman: Generative fixed-filter active noise control with cnn-kalman filtering. IEEE Signal Process. Lett. 2023, 31, 276–280. [Google Scholar] [CrossRef]
Figure 1. Brief diagram of the air-conditioner ANC system.
Figure 1. Brief diagram of the air-conditioner ANC system.
Sensors 25 06293 g001
Figure 2. Brief diagram of a hybrid adaptive filter and deep network.
Figure 2. Brief diagram of a hybrid adaptive filter and deep network.
Sensors 25 06293 g002
Figure 3. Structure of the CNN-GRU.
Figure 3. Structure of the CNN-GRU.
Sensors 25 06293 g003
Figure 4. Process of multi-head attention layer.
Figure 4. Process of multi-head attention layer.
Sensors 25 06293 g004
Figure 5. Brief diagram of phase tracking process.
Figure 5. Brief diagram of phase tracking process.
Sensors 25 06293 g005
Figure 6. Connections of CNN-GRU and fine-tune.
Figure 6. Connections of CNN-GRU and fine-tune.
Sensors 25 06293 g006
Figure 7. Brief diagram of adaptive self-loading FxLMS (ASL-FxLMS).
Figure 7. Brief diagram of adaptive self-loading FxLMS (ASL-FxLMS).
Sensors 25 06293 g007
Figure 8. Layers connection of CNN-GRU with multi-head-attention.
Figure 8. Layers connection of CNN-GRU with multi-head-attention.
Sensors 25 06293 g008
Figure 9. Frequency responses of the primary and secondary paths before changing.
Figure 9. Frequency responses of the primary and secondary paths before changing.
Sensors 25 06293 g009
Figure 10. Frequency responses of the primary and secondary paths after changing.
Figure 10. Frequency responses of the primary and secondary paths after changing.
Sensors 25 06293 g010
Figure 11. Block diagram of hybrid embedded and computer ANC system.
Figure 11. Block diagram of hybrid embedded and computer ANC system.
Sensors 25 06293 g011
Figure 12. ANR and spectrum performance of the mentioned methods with SNR 25 dB.
Figure 12. ANR and spectrum performance of the mentioned methods with SNR 25 dB.
Sensors 25 06293 g012
Figure 13. ANR and spectrum performance of the mentioned methods with SNR 15 dB.
Figure 13. ANR and spectrum performance of the mentioned methods with SNR 15 dB.
Sensors 25 06293 g013
Figure 14. ANR and spectrum performance of the mentioned methods with SNR 5 dB.
Figure 14. ANR and spectrum performance of the mentioned methods with SNR 5 dB.
Sensors 25 06293 g014
Figure 15. ANR performance comparison for FxLMS, FxNLMS, and ASL-FxLMS.
Figure 15. ANR performance comparison for FxLMS, FxNLMS, and ASL-FxLMS.
Sensors 25 06293 g015
Figure 16. ANR performance comparison with reference interference for FxLMS, FxNLMS, and ASL-FxLMS.
Figure 16. ANR performance comparison with reference interference for FxLMS, FxNLMS, and ASL-FxLMS.
Sensors 25 06293 g016
Figure 17. ANR performance comparison with error interference for FxLMS, FxNLMS, and ASL-FxLMS.
Figure 17. ANR performance comparison with error interference for FxLMS, FxNLMS, and ASL-FxLMS.
Sensors 25 06293 g017
Figure 18. ANR performance comparison with reference interference for ASL-FxLMS, CNN-GRU, and hybrid net of ASL-FxLMS-CNN-GRU.
Figure 18. ANR performance comparison with reference interference for ASL-FxLMS, CNN-GRU, and hybrid net of ASL-FxLMS-CNN-GRU.
Sensors 25 06293 g018
Figure 19. ANR performance comparison with error interference for ASL-FxLMS, CNN-GRU, and hybrid net of ASL-FxLMS-CNN-GRU.
Figure 19. ANR performance comparison with error interference for ASL-FxLMS, CNN-GRU, and hybrid net of ASL-FxLMS-CNN-GRU.
Sensors 25 06293 g019
Figure 20. Speech noise analysis with the proposed ANC method turned off and on, (a) in the time domain, (b) in the frequency domain.
Figure 20. Speech noise analysis with the proposed ANC method turned off and on, (a) in the time domain, (b) in the frequency domain.
Sensors 25 06293 g020
Figure 21. Truck noise analysis with the proposed ANC method turned off and on, (a) in the time domain, (b) in the frequency domain.
Figure 21. Truck noise analysis with the proposed ANC method turned off and on, (a) in the time domain, (b) in the frequency domain.
Sensors 25 06293 g021
Figure 22. Building noise analysis with the proposed ANC method turned off and on, (a) in the time domain, (b) in the frequency domain.
Figure 22. Building noise analysis with the proposed ANC method turned off and on, (a) in the time domain, (b) in the frequency domain.
Sensors 25 06293 g022
Figure 23. Comparison of ANR (Average Noise Reduction) curves for various ANC algorithms, for combined speech and Gaussian noise with an SNR (Signal-to-Noise Ratio) of 15 dB.
Figure 23. Comparison of ANR (Average Noise Reduction) curves for various ANC algorithms, for combined speech and Gaussian noise with an SNR (Signal-to-Noise Ratio) of 15 dB.
Sensors 25 06293 g023
Figure 24. Comparison of ANR curves for various ANC algorithms, for combined truck and Gaussian noise with an SNR of 15 dB.
Figure 24. Comparison of ANR curves for various ANC algorithms, for combined truck and Gaussian noise with an SNR of 15 dB.
Sensors 25 06293 g024
Figure 25. Comparison of ANR curves for various ANC algorithms, for combined building and Gaussian noise with an SNR of 15 dB.
Figure 25. Comparison of ANR curves for various ANC algorithms, for combined building and Gaussian noise with an SNR of 15 dB.
Sensors 25 06293 g025
Table 1. Notation and Definitions.
Table 1. Notation and Definitions.
NotationDefinitions
d n Reference signal at error sensor
d ^ n Estimated reference signal at error sensor
λ n Output coefficient of convex combination
e n Real error signal
y ^ Hyf n Hybrid net output filtered by estimated secondary path
x n Reference input signal vector
e CG n Error signal of CNN-GRU
y ^ CGf n CNN-GRU Output filtered by estimated secondary path
x f n Reference signal filtered by estimated secondary path
Θ n Weight vector of ASL-FxLMS
a ^ s , j n Weight vector of adaptive self-loading filter of ASL-FxLMS
b ^ s , j n Weight vector of adaptive self-loading filter of ASL-FxLMS
e A n Error signal of ASL-FxLMS
Table 2. Input size, output size, and hyperparameters of each layer.
Table 2. Input size, output size, and hyperparameters of each layer.
Layer NameInput SizeHyperparametersOutput Size
Sequence 1 × T × 256 256 × T × 1
Conv1 256 × T × 1 3 × 1 , 1 , 4 , 64 64 × T × 4
Relu1 64 × T × 4 64 × T × 4
Seqfold 64 × T × 4 Mini-Batch-Size 64 × T × 4
Conv2 64 × T × 4 3 × 1 , 1 , 4 , 16 16 × T × 16
Relu2 16 × T × 16 16 × T × 16
SeqUnfold 16 × T × 16 Mini-Batch-Size 16 × T × 16
Flatten 1 × T × 256 1 × T × 256
GRU1 1 × T × 256 256 1 × T × 256
GRU2 1 × T × 256 256 1 × T × 256
Reshape 1 × T × 256 16 × T × 16
Conv3 16 × T × 16 3 × 1 , 1 , 4 , 64 64 × T × 4
Full-Connection 64 × T × 4 256 × T × 1
Multi-head Attention 256 × T × 1 256 × T × 1
Softmax 256 × T × 1 256 × T × 1
Output 256 × T × 1 1 × T × 256
Table 3. The computational cost of each mentioned algorithm in one step.
Table 3. The computational cost of each mentioned algorithm in one step.
NameAdditionMultiplicationExponent
FxLMS512641No
FxNLMS768899No
ASL-FxLMS516645No
FLANN67827385No
DANC21,24910,977No
CNN-GRU15,32868363
Table 4. Main operation of proposed hybrid CNN-GRU and ASL-FxLMS.
Table 4. Main operation of proposed hybrid CNN-GRU and ASL-FxLMS.
Parameters and Preparation.
Train CNN-GRU Θ CG n with noise of multi-sources.
Step size μ A , μ PFa , μ EFa , μ Slf , μ σ , and μ OS .
Initiate the ANC system by estimating the estimated secondary path S ^ z
and max-power frequency of the adaptive self-loading structure.
Main Operations.
(ASL-FxLMS)
1. Reference signal:
x n = x n + y Slf n , x n 1 + y Slf n 1 , , x n m x + 1 + y Slf n m x + 1
2. ASL-FxLMS output: y A n = Θ T n x n
3. Error signal: e A n = d ^ n y AF n
4. Update weight vector: Θ n + 1 = Θ n + μ A e A n x f n
5. Update adaptive self-loading filters: a ^ s , i n + 1 = a ^ s , i n + μ Slf x a i n e Slf n
b ^ s , i n + 1 = b ^ s , i n + μ Slf x b i n e Slf n
6. Output of adaptive self-loading filters: y slf n = a ^ s , j n x a i n + b ^ s , j n x b i n
(CNN-GRU)
7. Data input: Put x n , x ( n 1 ) , , x n T + 1 into CNN-GRU-Net.
8. Estimate time-delay: Put y CG n into Time-delay estimator and obtain y ¯ CG n .
9. Get error signal for fine-tune: e CG n = d ^ n y ^ CGf n
(Convex Combination)
9. Calculate output: y Hy n = λ n y A n + 1 λ n y ¯ CG n
10. Update partition coefficient: λ n = 1 1 + e σ n
11. Estimate power: σ n + 1 = σ n + μ σ 2 e n 1 + e 2 n 2 y ¯ CG n y A n λ n 1 λ n
Table 5. Time cost of training and tracking with MATLAB 2025, NVIDIA 4060 (Santa Clara, CA, USA), and Intel I7 core CPU (Santa Clara, CA, USA).
Table 5. Time cost of training and tracking with MATLAB 2025, NVIDIA 4060 (Santa Clara, CA, USA), and Intel I7 core CPU (Santa Clara, CA, USA).
Training Cost (Hours)Cost of ANC Process for 200 s Noise (Seconds)
FxLMS0312
FxRLS0940
FxRMC01711
FxGMN02691
GFANC-Kalman142067
DANC223987
Propose method162670
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhu, W.; Gu, Z.; Chen, X.; Xie, P.; Luo, L.; Bai, Z. A New Hybrid Adaptive Self-Loading Filter and GRU-Net for Active Noise Control in a Right-Angle Bending Pipe of an Air Conditioner. Sensors 2025, 25, 6293. https://doi.org/10.3390/s25206293

AMA Style

Zhu W, Gu Z, Chen X, Xie P, Luo L, Bai Z. A New Hybrid Adaptive Self-Loading Filter and GRU-Net for Active Noise Control in a Right-Angle Bending Pipe of an Air Conditioner. Sensors. 2025; 25(20):6293. https://doi.org/10.3390/s25206293

Chicago/Turabian Style

Zhu, Wenzhao, Zezheng Gu, Xiaoling Chen, Ping Xie, Lei Luo, and Zonglong Bai. 2025. "A New Hybrid Adaptive Self-Loading Filter and GRU-Net for Active Noise Control in a Right-Angle Bending Pipe of an Air Conditioner" Sensors 25, no. 20: 6293. https://doi.org/10.3390/s25206293

APA Style

Zhu, W., Gu, Z., Chen, X., Xie, P., Luo, L., & Bai, Z. (2025). A New Hybrid Adaptive Self-Loading Filter and GRU-Net for Active Noise Control in a Right-Angle Bending Pipe of an Air Conditioner. Sensors, 25(20), 6293. https://doi.org/10.3390/s25206293

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop