Adaptive Wavelet Threshold Denoising Method for Machinery Sound Based on Improved Fruit Fly Optimization Algorithm

: As the sound signal of a machine contains abundant information and is easy to measure, acoustic-based monitoring or diagnosis systems exhibit obvious superiority, especially in some extreme conditions. However, the sound directly collected from industrial ﬁeld is always polluted. In order to eliminate noise components from machinery sound, a wavelet threshold denoising method optimized by an improved fruit ﬂy optimization algorithm (WTD-IFOA) is proposed in this paper. The sound is ﬁrstly decomposed by wavelet transform (WT) to obtain coefﬁcients of each level. As the wavelet threshold functions proposed by Donoho were discontinuous, many modiﬁed functions with continuous ﬁrst and second order derivative were presented to realize adaptively denoising. However, the function-based denoising process is time-consuming and it is difﬁcult to ﬁnd optimal thresholds. To overcome these problems, fruit ﬂy optimization algorithm (FOA) was introduced to the process. Moreover, to avoid falling into local extremes, an improved ﬂy distance range obeying normal distribution was proposed on the basis of original FOA. Then, sound signal of a motor was recorded in a soundproof laboratory, and Gauss white noise was added into the signal. The simulation results illustrated the effectiveness and superiority of the proposed approach by a comprehensive comparison among ﬁve typical methods. Finally, an industrial application on a shearer in coal mining working face was performed to demonstrate the practical effect.


Introduction
Generally, the vibration and strain signals of a machine are mostly applied to provide dynamic information of the machine's working condition [1,2], even though they have some common disadvantages, such as contact measurement, limited detecting positions and difficult to maintain detectors in some severe situations.Therefore, vibration and strain measuring is inappropriate or sometimes even impossible in these cases.On the other hand, the sound signal of a machine can be a significant criterion for state recognition or fault diagnosis because it is convenient to collectand does not affect the machine [3].Thus, acoustic-based diagnosis (ABS) has received much attention in recent years [4].One of the most important preconditions for ABS is eliminating noise from the initial sound signal, and the performance of denoising directly influences the effect of subsequent processing [5,6].
Preliminary analysis of machinery sound shows that significant details distribute in both time and frequency domains, which implies that noise elimination methods considering both scales would perform better than those only focusing one.Throughout the development of signal processing, the most influential time-frequency joint analysis approaches are Fast Fourier Transform (FFT) and Wavelet Transform (WT) [7].WT was firstly proposed by Mallat in 1989 [8].As the window function of FFT is fixed, WT is obviously superior to FFT for non-stationary signal as the property of characterizing local features in both domains.Six years later, Donoho proposed hard-threshold and soft-threshold denoising solutions based on WT.The corresponding threshold value was selected by combining WT and Stein's unbiased risk estimate (SURE) [9].However, since the derivative of standard threshold function is not continuous and lacks adaptability, many improved wavelet noise reduction methods have been proposed [10][11][12].With the extensive application of artificial intelligence in recent years, adaptive threshold selecting approaches based on intelligent optimization algorithms, such as the particle swarm optimization (PSO), genetic algorithm (GA) and ant colony optimization (ACO) have been adopted gradually [13][14][15].
Fruit fly optimization algorithm (FOA) was proposed by Pan in 2012 [16][17][18].As a meta-heuristic method, FOA simulates the intelligent foraging behavior of fruit fly group in food finding process [19].The fruit fly is superior to other species with regard to its senses of osphresis and vision, it can even a smell food source from 40 km away and locate other flocks by its sensitive vision [20].The FOA has many advantages compared with the above optimization algorithms, such as simple structure, immediately accessible for practical applications, ease of implementation and rapid convergence rate.Since the FOA was proposed, it has been widely applied in financial parameter optimization [21], forecasting [22], scheduling [23], etc.However, like other optimization algorithms, the basic FOA also has the possibility of falling into local extremes due to its fixed fly distance range [24].
Bearing the above observations in mind, an adaptive wavelet threshold denoising method for machinery sound based on an improved FOA (WTD-IFOA) is proposed.The rest of this paper is organized as follows.In Section 2, some related works are outlined based on the literature.In Section 3, the basic wavelet noise elimination method and optimization process of FOA are presented.In Section 4, an improved fly distance range, obeying normal distribution, is performed, and the denoising solution based on WTD-IFOA is elaborated.In Section 5, Gauss white noise is added into the motor sound signal to verify the effectiveness and superiority of the proposed method, and an industrial application is performed.Some conclusions and outlooks are summarized in Section 6.

Literature Review
Recent publications relevant to this paper are mainly concerned with two research streams: wavelet threshold denoising and fruit fly optimization algorithm.In this section, we try to summarize the relevant literature.

Wavelet Threshold Denoising
Traditional denoising methods, such as low-pass filter, Kalman filter and median filter, aim either at the time domain or the frequency domain.However, single-scale representations of signals are often inadequate when attempting to separate signals from noisy data.By combining the two scales, wavelet threshold denoising presents obvious superiority.According to the wavelet threshold denoising theory proposed by Donoho, the optimal threshold should diminish the noise but preserve the signal as much as possible [25].The traditional hard-threshold function exhibits some discontinuities and may be unstable or more sensitive to small changes in the data, while in soft thresholding the wavelet coefficients are reduced by a quantity equal to the threshold value, which will induce the deviation when the filtered wavelet coefficients is reconstructed [26].Moreover, the threshold is fixed once determined and adaptability is weak during the denoising process.In order to overcome the disadvantages of the original threshold functions proposed by Donoho, many adaptive denoising approaches have been elaborated by researchers.Improved solutions can be divided into two streams: the first on the improvements of threshold function and the other focus on searching optimal threshold through intelligent algorithms.The threshold function-based methods aim at establishing appropriate function with continuous derivative and selecting thresholds based on gradient descent algorithm.In [10], a new adaptive denoising function with continuous first and second order derivative is presented based on SURE model.In [27], an adaptive logarithmic wavelet threshold denoising function was proposed to select optimal threshold for each decomposition level.Relative to hard and soft functions, the proposed approach increased the signal-to-noise ratio by 44.2% and 27.9%, and decreased processing time by 37.6% and 38.5%, respectively.With the rapid development of artificial intelligent optimization algorithm, intelligent searching-based wavelet denoising approaches have been widely applied in recent years.In [14], a PSO-based image denoising method was proposed for learning the parameters of the adaptive thresholding function required for optimum performance.Li et al. adopted an adaptive denoising solution for partial discharge signals based on threshold function and genetic algorithm (GA), and the result presents significantly smaller waveform distortion and magnitude errors than the Donoho's soft threshold estimation [15].In order to eliminate noise components of satellite images, some stochastic global optimization techniques such as Cuckoo Search (CS) algorithm, artificial bee colony (ABC), and PSO as well as their different variants have been exploited for learning the parameters of adaptive thresholding function in [28].

Fruit Fly Optimization Algorithm
Although it is not long since FOA was put forward, it has aroused much attention and scored great academic achievements.In [21], FOA was adopted to optimize general regression neural network, and the simulation result showed the superiority compared with other intelligent optimization algorithms.In [29], an annual electric load forecasting method was proposed by the least squares support vector machine (LSSVM) model.The FOA was used to determine appropriate parameters of the model, and an experiment, with the mean absolute percentage error of 1.305%, proved the validity of the approach.Although the FOA has an extensive application in many fields, there still exists the possibility of getting into the local extreme [24].The main reason lies in the fruit fly individuals move toward fixed fly distance range in the iteration of optimization.Once the fruit fly group fall into the local extreme and the fly distance range is not big enough, the optimization process is prone to fail [30].On the other hand, excessive fly distance range may lead to slow convergence rate of the iteration process.In [23], an improved FOA was presented to solve the joint replenishment problems.In order to avoid local optimal solution, swarm collaboration and random perturbation were added into original FOA.Pan et al. presented a changeable fly distance range in FOA to eliminate the drawbacks lies with fixed values of search radius, and 29 benchmark functions were carried out to make a comparison with basic FOA [24].Yuan et al. proposed a multi-swarm FOA, where several sub-swarms moving independently in the search space with the aim of simultaneously exploring global optimal and local behavior between sub-swarms is also considered [31].In [32], an improved FOA, called linear generation mechanism of candidate solution fruit fly optimization algorithm (LGMS-FOA), was introduced for solving optimization problems.Four disadvantages of the original FOA were listed and some improvements were operated, and the simulation result showed local extreme could be avoided efficiently.

Discussion
Many valuable wavelet denoising methods have been proposed and applied by researchers in recent decades, which greatly pushes forward the development of this field.However, there are still some shortcomings including the following.Firstly, the disadvantage of weak adaptability seriously restricted the development of Donoho's wavelet threshold approaches.Secondly, adaptive noise elimination methods based on gradient descent algorithm were also limited because of the great amount of calculation.The above approaches were gradually replaced by intelligent optimization-based algorithms.Thirdly, the iterative process of the common optimization solution has the problems of slow convergence rate and high complexity of coding.In [15], the proposed method based on GA adaptive threshold cost more than 38.66 times the calculation time compared to soft threshold approach.Moreover, the FOA has great advantages in iteration rate and encoding efficiency, but still has the probability of falling into local extreme.Many improvements have been elaborated by past scholars, but few researchers could balance both local extreme and iterative rate.
Therefore, a novel wavelet threshold denoising method optimized by an improved FOA is proposed in this paper.The fly distance range obeying uniform distribution in the basic FOA is replaced by the following normal distribution.Both local extreme and iterative rate are taken into consideration.A series of simulations and an industrial application prove the effectiveness and superiority of the proposed method.

Wavelet Threshold Denoising
Fundamental theory of wavelet denoising can be concluded as follows: wavelet decomposition is firstly conducted on the noisy signal, then wavelet coefficients that belong to useful signal are kept and others are eliminated, and finally inverse wavelet transform is operated to reconstruct the remainder coefficients.
Assume that the noisy signal series x = {x 1 , x 2 , x 3 , ..., x k } can be expressed as follows: where i = 1, 2, 3, ..., k, s = {s 1 , s 2 , s 3 , ..., s k } is the useful initial signal and n = {n 1 , n 2 , s 3 ,..., n k } is noise signal.Then x is decomposed by J levels WT and the i-th wavelet coefficient in j-th can be presented as d i,j , where j = 1, 2, 3, . . ., J. Since WT is a kind of linear transform, wavelet coefficients of x are consisted of ones decomposed by s, denoted as U i,j , and that of n, called V i,j .The purpose of wavelet denoising is to eliminate V i,j and obtain the estimate signal ŝ of the noisy signal.The ideal ŝ has a minimum mean square error with s under the premise of eliminating noise component furthest.The mean square error (MSE) ξ can be calculated as follows: The threshold during the denoising process is calculated according to SURE model, which can be obtained as follows: where λ j denotes the threshold of j-th level, MAD(¨) is a median value function and the value range of q is [0.4,1] in general.
There are two typical threshold functions during the wavelet denoising process, the first called hard-threshold: di,j " where di,j donates the wavelet coefficient of denoised signal.The other function is called soft-threshold: di,j " # sgnpd i,j qp ˇˇd i,j ˇˇ´λ j q, for ˇˇd i,j ˇˇě λ j 0, otherwise where sgn(¨) is sign function, which returns 1 if the element is greater than 0; 0 if it equals 0; and ´1 if it is less than 0. The estimated signal is reconstructed through inverse wavelet transform on di,j .It can be seen that the key point of the denoising process is selecting appropriate threshold to minimize Equation (2).

Fruit Fly Optimization Algorithm
FOA is a new interactive evolutionary computation method, which was proposed by Pan in 2012.By simulating the process of foraging behavior for fruit fly individuals and populations, global optimum can be obtained through appropriate iteration, as shown in Figure 1.Standard foraging process of FOA can be summarized as follows.
Appl.Sci.2016, 6, 199 5 of 16 The estimated signal is reconstructed through inverse wavelet transform on , ˆi j d .It can be seen that the key point of the denoising process is selecting appropriate threshold to minimize Equation (2).

Fruit Fly Optimization Algorithm
FOA is a new interactive evolutionary computation method, which was proposed by Pan in 2012.By simulating the process of foraging behavior for fruit fly individuals and populations, global optimum can be obtained through appropriate iteration, as shown in Figure 1.Standard foraging process of FOA can be summarized as follows.Step 1.1:The key initial parameters of the FOA are the population amount (PA), the fruit fly group location range (LR), the maximum iteration number (INmax) and the random fly distance range (FR).The initial location of the fruit fly group can be presented as follows: Step 1.2:The random direction and distance for the search of food using osphresis by an individual fruit fly is given as follows: Step 1.3: Since the food location cannot be known, the distance to the origin of coordinates (Disti) and the smell concentration judgment value (Si) are calculated as follows: The fruit fly with maximal smell concentration among the fruit fly group can be searched according to the smell concentration judgment function (or called Fitness function), which can be presented as follows: where bestsmell donates the maximal smell concentration, bestindex is the corresponding fruit fly number and smell is the smell concentration set of the group.Step 1.1:The key initial parameters of the FOA are the population amount (PA), the fruit fly group location range (LR), the maximum iteration number (IN max ) and the random fly distance range (FR).The initial location of the fruit fly group can be presented as follows:

#
X_axis " rand pLRq Y_axis " rand pLRq Step 1.2:The random direction and distance for the search of food using osphresis by an individual fruit fly is given as follows: Step 1.3: Since the food location cannot be known, the distance to the origin of coordinates (Dist i ) and the smell concentration judgment value (S i ) are calculated as follows: The fruit fly with maximal smell concentration among the fruit fly group can be searched according to the smell concentration judgment function (or called Fitness function), which can be presented as follows: smell i " functionpS i q, rbestsmell bestindexs " maxpsmellq (9) where bestsmell donates the maximal smell concentration, bestindex is the corresponding fruit fly number and smell is the smell concentration set of the group.
Step 1.4:The smell concentration is compared with that of the former iteration.If it is inferior to the last generation, Steps 1.2 to 1.3 are repeated; else the best location and smell concentration can be presented as follows: smellbest " bestsmell # X _axis " Xpbestindexq Y _axis " Ypbestindexq Step 1.5:When the smell concentration reaches the preset precision value or the iteration number reaches the maximal IN, the circulation stops.Otherwise, Steps 1.2 to 1.4 are repeated.

The Proposed Method
In this section, the improved fruit fly optimization algorithm is proposed to enhance the capacity of global and local research.Then the flowchart of the improved method is designed and the process of the denoising approach based on WTD-IFOA is presented.

Improvement of FOA
The fly distance range (FR) of FOA is a random value in the range of [´L, L], which can be presented as FR ~U(´L, L), where L named as step size.The value of FR is distributed evenly among the value range.If the step size is big enough, the global search capability will be improved remarkably, while the convergence speed will be decreased obviously.Otherwise, the FOA easily gets stuck in local optimal, while has a high convergence speed.
In order to balance the global search ability and convergence rate, the distribution function of FR is modified in this paper.The value of FR follows normal distribution, FR ~N(0, L 2 ).According to the characteristic of normal distribution, the probability of FR P r´L, Ls is about 68.27%, the probability of FR P r´2L, 2Ls is about 95.45% and the probability of FR P r´3L, 3Ls is about 99.73%.The probability density distribution of original fly distance range and the proposed one are presented in Figure 2. Most individuals fly towards the present best location, while more than 30% fruit flies continuing searching at a larger scale.Moreover, individuals flying towards the present optimum tend to a more concentrated region according to the probability distribution condition.Thus, the capacity of global searching and partial location are both enhanced.The flowchart of the IFOA is shown in Figure 3.
Appl.Sci.2016, 6, 199 6 of 16 Step 1.4:The smell concentration is compared with that of the former iteration.If it is inferior to the last generation, Steps 1.2 to 1.3 are repeated; else the best location and smell concentration can be presented as follows: Step 1.5:When the smell concentration reaches the preset precision value or the iteration number reaches the maximal IN, the circulation stops.Otherwise, Steps 1.2 to 1.4 are repeated.

The Proposed Method
In this section, the improved fruit fly optimization algorithm is proposed to enhance the capacity of global and local research.Then the flowchart of the improved method is designed and the process of the denoising approach based on WTD-IFOA is presented.

Improvement of FOA
The fly distance range (FR) of FOA is a random value in the range of [−L, L], which can be presented as FR ~ U(−L, L), where L named as step size.The value of FR is distributed evenly among the value range.If the step size is big enough, the global search capability will be improved remarkably, while the convergence speed will be decreased obviously.Otherwise, the FOA easily gets stuck in local optimal, while has a high convergence speed.
In order to balance the global search ability and convergence rate, the distribution function of FR is modified in this paper.The value of FR follows normal distribution, FR ~ N(0, L 2 ).According to the characteristic of normal distribution, the probability of ∈ , is about 68.27%, the probability of ∈ 2 , 2 is about 95.45% and the probability of

Flow of the Proposed Denosing Method
The adaptive threshold denoising method based on WTD-IFOA can be summarized as follows: Step 2.1: For the sake of convenient calculation, the sound signal is first quantized into a certain range.Then, the initial signal is decomposed by a J-level wavelet transform.i-th coefficient in j-th level can be presented as d i,j , where i = 1, 2, 3, . . ., m, j = 1, 2, 3, . . ., J, m is the length of the sound signal.
Step 2.2: The parameters of IFOA are initialized, such as LR, IN max and FR, where FR ~N(0, L 2 ).For J-level WT, each level has an optimal threshold.So there are J groups of fruit flies, each group contains PA individuals.The initial location of the fruit fly group is obtained by Equation (6).

Flow of the Proposed Denosing Method
The adaptive threshold denoising method based on WTD-IFOA can be summarized as follows: Step 2.1: For the sake of convenient calculation, the sound signal is first quantized into a certain range.Then, the initial signal is decomposed by a J-level wavelet transform.i-th coefficient in j-th level can be presented as di,j, where i = 1, 2, 3, …, m, j = 1, 2, 3, …, J, m is the length of the sound signal.
Step 2.2: The parameters of IFOA are initialized, such as LR, INmax and FR, where FR ~ N(0, L 2 ).For J-level WT, each level has an optimal threshold.So there are J groups of fruit flies, each group contains PA individuals.The initial location of the fruit fly group is obtained by Equation ( 6).
Step 2.3: The location of each individual is gained through the fly group and FR.The distance and smell concentration of each fly are calculated according to Equation (8).Each smell concentration judgment is regarded as a potential threshold.Then, the useful signal and the noise component are separated according to the soft-threshold function.In order to judge the denoising performance of each fruit fly individual, the fitness function f is calculated as follows: where r22 is an autocorrelation of noise.As noise is yielded randomly, it has neither high autocorrelation nor zero autocorrelation.However, an ascending value implies that more original signal adheres to noise, thus the restructured signal will not be a good recovery.hr21 is a high-order cross-correlation between useful signal and the noise.If these coefficients are descending, then it implies that both signals become more independent to each other.Thus, the original signal and noise are toward separation gradually.r11 is an autocorrelation of useful signal.An ascending value Step 2.3: The location of each individual is gained through the fly group and FR.The distance and smell concentration of each fly are calculated according to Equation (8).Each smell concentration judgment is regarded as a potential threshold.Then, the useful signal and the noise component are separated according to the soft-threshold function.In order to judge the denoising performance of each fruit fly individual, the fitness function f is calculated as follows: g " rr 22 ˆhr 21 s 2 r 2 11 (12) where r 22 is an autocorrelation of noise.As noise is yielded randomly, it has neither high autocorrelation nor zero autocorrelation.However, an ascending value implies that more original signal adheres to noise, thus the restructured signal will not be a good recovery.hr 21 is a high-order cross-correlation between useful signal and the noise.If these coefficients are descending, then it implies that both signals become more independent to each other.Thus, the original signal and noise are toward separation gradually.r 11 is an autocorrelation of useful signal.An ascending value implies that its own component is more than component of noise.Hence, the restructured signal has a good recovery [33].r 22 , hr 21 and r 11 are defined as follows: where s 1 is the useful signal, s 2 is the noise component, cov[s i , ) is mathematical expectation of s i and ϕ(s i ) = s i 2 + s i 3 .r 22 , hr 21 tend to minimum and r 11 tends to the maximum when the location of each fruit fly group is placed in the best.Then g is the minimal and f is the maximal [34].
Step 2.4: Fruit fly with maximal fitness is selected as bestsmell and the corresponding fly number is named as bestindex.If the present bestsmell is bigger than that of the former, smellbest, the corresponding coordinates are updated.Otherwise, smellbest, X _axis and Y _axis are reserved.
Step 2.5: If the ending conditions are researched, smellbest, X _axis and Y _axis are treated as the optimum.Otherwise, Steps 2.3 and 2.4 are repeated.
Step 2.6: The wavelet coefficients are adjusted according to the soft-threshold function and inverse wavelet transform is conducted consequently to obtain denoised signal.The flowchart of the process is shown in Figure 4.
where s1 is the useful signal, s2 is the noise component 2 }, E(si) is mathematical expectation of si and φ(si) = si 2 + si 3 .r22, hr21 tend to minimum and r11 tends to the maximum when the location of each fruit fly group is placed in the best.Then g is the minimal and f is the maximal [34].
Step 2.4: Fruit fly with maximal fitness is selected as bestsmell and the corresponding fly number is named as bestindex.If the present bestsmell is bigger than that of the former, smellbest, the corresponding coordinates are updated.Otherwise, smellbest, X_axis and Y_axis are reserved.
Step 2.5: If the ending conditions are researched, smellbest, X_axis and Y_axis are treated as the optimum.Otherwise, Steps 2.3 and 2.4 are repeated.
Step 2.6: The wavelet coefficients are adjusted according to the soft-threshold function and inverse wavelet transform is conducted consequently to obtain denoised signal.The flowchart of the process is shown in Figure 4.

Simulation and Industrial Application
In order to validate the effectiveness and superiority of the proposed method, a piece of pure sound signal of a motor was recorded.Then, Gauss white noise with different signal to noise ratio were added into the pure signal.The mean square error (MSE), peak value error (PVE) and the computation time (CT) were regarded as evaluation criteria of the noise elimination solutions.The denoising performance of standard soft threshold denoising method (SST), a threshold function-based Appl.Sci.2016, 6, 199 9 of 16 noise elimination solution proposed in [10] (TFB), wavelet threshold denoising optimized by genetic algorithm (WTD-GA), wavelet threshold denoising optimized by fruit fly optimization algorithm (WTD-FOA) and the proposed WTD-IFOA were compared subsequently.Finally, an industrial application for the shearer of coal mining working face is exhibited.All calculations in this section were conducted on a workstation configured as shown in Table 1.

Signal Acquisition
To test the performance of the denoising methods, a pure sound signal sequence was needed first.Because there was much background noise in industrial field, it was extremely difficult to collect the pure sound.Moreover, sound signal of a machine consisted of many frequency components, so it was also hard to synthesize a representative series with practical meaning artificially.In this paper, the sound of a motor working in a soundproof room was recorded as the original signal.Concretely, the sound signal was acquired from soundproof testing branch, Jiangsu Key Laboratory of Mining Mechanical and Electrical Equipment.The walls of the testing room were constructed of a special acoustic insulating material and echo cancellation was designed in the testing process.An AC servomotor with the rated out power of 1 kW and the corresponding electrical system were installed in the laboratory.The schematic of the testing room and the experiment site are shown in Figure 5.
In order to validate the effectiveness and superiority of the proposed method, a piece of pure sound signal of a motor was recorded.Then, Gauss white noise with different signal to noise ratio were added into the pure signal.The mean square error (MSE), peak value error (PVE) and the computation time (CT) were regarded as evaluation criteria of the noise elimination solutions.The denoising performance of standard soft threshold denoising method (SST), a threshold function-based noise elimination solution proposed in [10] (TFB), wavelet threshold denoising optimized by genetic algorithm (WTD-GA), wavelet threshold denoising optimized by fruit fly optimization algorithm (WTD-FOA) and the proposed WTD-IFOA were compared subsequently.Finally, an industrial application for the shearer of coal mining working face is exhibited.All calculations in this section were conducted on a workstation configured as shown in Table 1.

Signal Acquisition
To test the performance of the denoising methods, a pure sound signal sequence was needed first.Because there was much background noise in industrial field, it was extremely difficult to collect the pure sound.Moreover, sound signal of a machine consisted of many frequency components, so it was also hard to synthesize a representative series with practical meaning artificially.In this paper, the sound of a motor working in a soundproof room was recorded as the original signal.Concretely, the sound signal was acquired from soundproof testing branch, Jiangsu Key Laboratory of Mining Mechanical and Electrical Equipment.The walls of the testing room were constructed of a special acoustic insulating material and echo cancellation was designed in the testing process.An AC servomotor with the rated out power of 1 kW and the corresponding electrical system were installed in the laboratory.The schematic of the testing room and the experiment site are shown in Figure 5.  Echo cancellation material was installed on the inner wall and acoustic insulating equipment was placed on the external wall.The motor, microphone and computer were fixed in the room with the length of 6 m, width of 5 m and height of 4 m.Operators controlled the motor outside the room.The sound signal was recorded by the microphone and then transmitted to the computer.Sampling frequency of the sound signal was 44.1 kHz.The experiment was conducted as follows: start the motor remotely and keep it in no-load operation, then stop the motor after 10 min, pretreat and save the sound in the computer in wav format.Quantization was subsequently operated to convert the sound amplitude into the scope of [-1, 1].Finally, a piece of relatively stable sound was extracted with the duration of 0.5 s, as shown in Figure 6.
Echo cancellation material was installed on the inner wall and acoustic insulating equipment was placed on the external wall.The motor, microphone and computer were fixed in the room with the length of 6 m, width of 5 m and height of 4 m.Operators controlled the motor outside the room.The sound signal was recorded by the microphone and then transmitted to the computer.Sampling frequency of the sound signal was 44.1 kHz.The experiment was conducted as follows: start the motor remotely and keep it in no-load operation, then stop the motor after 10 min, pretreat and save the sound in the computer in wav format.Quantization was subsequently operated to convert the sound amplitude into the scope of [-1, 1].Finally, a piece of relatively stable sound was extracted with the duration of 0.5 s, as shown in Figure 6.

Signal Denoising
In this paper, the MSE ξ, the PVE η and the CT t were selected as evaluation indexes of the denoising methods.MSE and the PVE are calculated as follows: 100% where ds is denoised signal, s is original signal, k is the length of the signal, and Po and Pd are, respectively, the peak value of the original signal and denoised signal.To test the performance of SST, TFB, WTD-GA, WTD-FOA and the proposed WTD-IFAO, Gauss white noise was added into the original sound.The signal to noise ratio (SNR) ζ (dB) was introduced to measure the degree of noise, and SNR was defined as follows: where si was the amplitude of the original signal and ni was that of the added noise.
The denoising process was conducted as follows: (1) Add Gaussian white noise into the original sound and conduct wavelet decomposition.A noisy signal with ζ = 5 dB was firstly analyzed and the synthesis was finished in Matlab 8.0 (MathWorks Inc., Natick, MA, USA, 2012).Then the synthetic signal was decomposed by wavelet decomposition with db2 wavelet at 5 levels [6].The decomposition result is shown in Figure 7.

Signal Denoising
In this paper, the MSE ξ, the PVE η and the CT t were selected as evaluation indexes of the denoising methods.MSE and the PVE are calculated as follows: where ds is denoised signal, s is original signal, k is the length of the signal, and P o and P d are, respectively, the peak value of the original signal and denoised signal.To test the performance of SST, TFB, WTD-GA, WTD-FOA and the proposed WTD-IFAO, Gauss white noise was added into the original sound.The signal to noise ratio (SNR) ζ (dB) was introduced to measure the degree of noise, and SNR was defined as follows: where s i was the amplitude of the original signal and n i was that of the added noise.
The denoising process was conducted as follows: (1) Add Gaussian white noise into the original sound and conduct wavelet decomposition.A noisy signal with ζ = 5 dB was firstly analyzed and the synthesis was finished in Matlab 8.0 (MathWorks Inc., Natick, MA, USA, 2012).Then the synthetic signal was decomposed by wavelet decomposition with db2 wavelet at 5 levels [6].The decomposition result is shown in Figure 7. (2) Denoise the noisy signal by SST.The value range of the wavelet threshold was firstly calculated according to Equation (3).The recommended threshold λ rec was adopted according to Donoho, where q = 0.6745.The wavelet coefficients of each level were shrunk according to Equation (5).
Then the signal was reconstructed by inverse WT.
(3) Denoise the noisy signal by TFB.An improved threshold function with continuous first and second order derivative was introduced in [10], and is presented as follows: di,j " where λ is determined according to Equation (3) and k = 3. (4) Denoise the noisy signal by WTD-GA.The maximum λ max appeared at q = 1 and λ min obtained when q = 0.4, so λ P rλ min , λ max s.The population size was 100, each chromosome was a five-dimensional vector, the crossover probability was 0.7, the mutation probability was 0.01 and the most iteration generation was 100, as recommended by ref. [15].( 5) Denoise the noisy signal by WTD-FOA.The parameters were set as follows: λ P rλ min , λ max s.
λ min and λ max were calculated as previously.The fly distance obeys uniform distribution, FR ~U(´0.2,0.2).The population number was 5, each group contained 20 individuals and the iteration number was 100.The fitness of each fruit fly was calculated according to Equation ( 11).( 6) Denoise the noisy signal by WTD-IFOA.The parameters were set as follows: λ P rλ min , λ max s, the fly distance obeys normal distribution, FR ~N(0, 0.2 2 ).The population number was 5, each group contained 20 individuals and the iteration number was 100.The fitness of each fruit fly was calculated according to Equation (11).
Subsequently, a comprehensive comparison was made for the five methods.The ξ, η and t at ζ = 5 dB of the average value of the five simulation results are presented in Table 2.And the denoised signals of the 5 solutions were shown in Figure 8.It can be seen in the table that adaptive denoising methods based on intelligent optimization had a better comprehensive performance than SST and TFB.MSE of the WTD-FOA was smaller than that of WTD-GA while the PVE was contrary, which indicated the two methods had no optimal solution, both in global and local.The WTD-IFOA overcame the disadvantage and contributed a superior scheme.The MSE of WTD-IFOA was decreased about 35.36% compared with SST, and the PVE decreased about 9.40%.As optimal threshold was obtained through the iterative process, the last four methods were much more time-consuming.Among these approaches, TFB cost the most time due to its complex calculation.The optimization process of WTD-GA was much more complex than the other intelligent solutions from the table.Moreover, the WTD-IFOA, respectively, saved 26.96% and 12.47% time compared with WTD-GA and WTD-FOA because of its stronger addressing ability.In order to research the denoising performance at different SNR, a further comparison was made and the average value of five simulation results are shown in Figure 9. Four noisy signal with SNR = 5 dB, 10 dB, 15 dB and 20 dB were synthetized and handled.Then the MSE, PVE and CT of the denoising processes were presented in the figure.It can be roughly obtained that the three evaluation parameters decreased with the SNR.As the threshold value of each level and wavelet coefficients were determined directly by Equations ( 3) and (4) in SST, the denoising process could be finished in a short time, while the TFB based on gradient descent algorithm was time-consuming in different SNR as its complex calculation process.Denoising methods using optimization algorithm had a comprehensive denoising performance.The computation time was decreased sharply compared with TFB while it still cost more time than SST.In detail, the WTD-GA did not obviously vary from WTD-FOA in PVE, while it had a distinct weakness in MSE and CT compared with FOA-based methods.It could be seen from the simulation results that the TFB, WTD-GA and WTD-FOA fell into local extreme during the parameters optimization.Moreover, the WTD-IFOA exhibited obvious superiority in the denoising effect, which revealed its terrific global and local ability compared to the other methods in different SNR.In order to research the denoising performance at different SNR, a further comparison was made and the average value of five simulation results are shown in Figure 9. Four noisy signal with SNR = 5 dB, 10 dB, 15 dB and 20 dB were synthetized and handled.Then the MSE, PVE and CT of the denoising processes were presented in the figure.It can be roughly obtained that the three evaluation parameters decreased with the SNR.As the threshold value of each level and wavelet coefficients were determined directly by Equations ( 3) and ( 4) in SST, the denoising process could be finished in a short time, while the TFB based on gradient descent algorithm was time-consuming in different SNR as its complex calculation process.Denoising methods using optimization algorithm had a comprehensive denoising performance.The computation time was decreased   In order to research the denoising performance at different SNR, a further comparison was made and the average value of five simulation results are shown in Figure 9. Four noisy signal with SNR = 5 dB, 10 dB, 15 dB and 20 dB were synthetized and handled.Then the MSE, PVE and CT of the denoising processes were presented in the figure.It can be roughly obtained that the three evaluation parameters decreased with the SNR.As the threshold value of each level and wavelet coefficients were determined directly by Equations ( 3) and ( 4) in SST, the denoising process could be finished in a short time, while the TFB based on gradient descent algorithm was time-consuming in different SNR as its complex calculation process.Denoising methods using optimization exhibited obvious superiority in the denoising effect, which revealed its terrific global and local ability compared to the other methods in different SNR.

Application
In order to test practical effect of the proposed adaptive denoising method for the machinery sound signal based on WTD-IFOA, an industrial application was operated in a fully-mechanized coal mining working face.The shearer is an important machine in automatic coal mining, and working condition monitoring for the shearer is of great necessity.Traditional monitoring methods are mainly based on the vibration signal [35], even though the working life of the vibration sensors are very short due to the bad working condition and contact-measurement.Coal output is seriously restricted by the frequent maintenance.In the August 2015, an online monitoring system through the shearer cutting sound signal was built in the 71,507 coal mining face in the NO.2 Mine of Yangquan Coal Industry Group Corporation.However, there existed a large number of noise signal among the initial signal because of the harsh working environment.To eliminate the background noise from the sound, an industrial microphone was installed and WTD-IFOA was applied.The three-dimensional model of the coal mining shearer and field sound collection is presented in Figure 10.To illustrate the effectiveness of the proposed method, a piece of field sound signal with the length of 0.5 s was extracted and denoised.The original sound and the denoised one are shown in Figure 11.Then, 4096 points FFT was conducted to analyze the frequency components of the two signals, as presented in Figure 12.It can be seen in Figure 12 that the processed signal was sharp

Application
In order to test practical effect of the proposed adaptive denoising method for the machinery sound signal based on WTD-IFOA, an industrial application was operated in a fully-mechanized coal mining working face.The shearer is an important machine in automatic coal mining, and working condition monitoring for the shearer is of great necessity.Traditional monitoring methods are mainly based on the vibration signal [35], even though the working life of the vibration sensors are very short due to the bad working condition and contact-measurement.Coal output is seriously restricted by the frequent maintenance.In the August 2015, an online monitoring system through the shearer cutting sound signal was built in the 71,507 coal mining face in the NO.2 Mine of Yangquan Coal Industry Group Corporation.However, there existed a large number of noise signal among the initial signal because of the harsh working environment.To eliminate the background noise from the sound, an industrial microphone was installed and WTD-IFOA was applied.The three-dimensional model of the coal mining shearer and field sound collection is presented in Figure 10.

Application
In order to test practical effect of the proposed adaptive denoising method for the machinery sound signal based on WTD-IFOA, an industrial application was operated in a fully-mechanized coal mining working face.The shearer is an important machine in automatic coal mining, and working condition monitoring for the shearer is of great necessity.Traditional monitoring methods are mainly based on the vibration signal [35], even though the working life of the vibration sensors are very short due to the bad working condition and contact-measurement.Coal output is seriously restricted by the frequent maintenance.In the August 2015, an online monitoring system through the shearer cutting sound signal was built in the 71,507 coal mining face in the NO.2 Mine of Yangquan Coal Industry Group Corporation.However, there existed a large number of noise signal among the initial signal because of the harsh working environment.To eliminate the background noise from the sound, an industrial microphone was installed and WTD-IFOA was applied.The three-dimensional model of the coal mining shearer and field sound collection is presented in Figure 10.To illustrate the effectiveness of the proposed method, a piece of field sound signal with the length of 0.5 s was extracted and denoised.The original sound and the denoised one are shown in Figure 11.Then, 4096 points FFT was conducted to analyze the frequency components of the two signals, as presented in Figure 12.It can be seen in Figure 12 that the processed signal was sharp To illustrate the effectiveness of the proposed method, a piece of field sound signal with the length of 0.5 s was extracted and denoised.The original sound and the denoised one are shown in Figure 11.Then, 4096 points FFT was conducted to analyze the frequency components of the two signals, as presented in Figure 12.It can be seen in Figure 12 that the processed signal was sharp decreased in amplitude compared with the original field signal.The reason lies in that the scope saltation caused by the noise component was removed.Moreover, frequency components of original signal shown in Figure 12 had a disordered distribution, and it was difficult to identify the working state of the shearer.On the contrary, it was regular for the denoised signal.Some wave peaks appeared in the spectrogram, and other areas were stable.Different spectra of the collected signal reflected different working conditions of the shearer.Thus, the working state could be identified according to the wave peaks.decreased in amplitude compared with the original field signal.The reason lies in that the scope saltation caused by the noise component was removed.Moreover, frequency components of original signal shown in Figure 12 had a disordered distribution, and it was difficult to identify the working state of the shearer.On the contrary, it was regular for the denoised signal.Some wave peaks appeared in the spectrogram, and other areas were stable.Different spectra of the collected signal reflected different working conditions of the shearer.Thus, the working state could be identified according to the wave peaks.

Conclusions and Future Work
In order to eliminate noise components from the sound signal of a working machine, this paper proposes a novel approach based on wavelet threshold denoising and an improved FOA.Improved strategy on the basis of fly distance range obeying normal distribution was applied in the denoising process.To verify the feasibility and superiority of the proposed WTD-IFOA, a simulation example was provided and some comparisons were conducted.The simulation example and comparison results showed that the adaptive denoising method could effectively eliminate noise components and the proposed approach outperformed the others.Finally, an industrial application was performed on the shearer in a fully-mechanized coal mining face to test practical effect.
However, there are also some deficiencies and shortcomings in this method including the following.On the one hand, although the proposed WTD-IFOA is much more timesaving than other adaptive denoising approaches, the calculation duration is still a serious problem that cannot be neglected.On the other hand, parameters during the optimization process are determined according to past scholars and a large number of simulation experiments, while strict mathematical deduction are lacking.In future studies, the authors plan to investigate some improvements to the proposed  decreased in amplitude compared with the original field signal.The reason lies in that the scope saltation caused by the noise component was removed.Moreover, frequency components of original signal shown in Figure 12 had a disordered distribution, and it was difficult to identify the working state of the shearer.On the contrary, it was regular for the denoised signal.Some wave peaks appeared in the spectrogram, and other areas were stable.Different spectra of the collected signal reflected different working conditions of the shearer.Thus, the working state could be identified according to the wave peaks.

Conclusions and Future Work
In order to eliminate noise components from the sound signal of a working machine, this paper proposes a novel approach based on wavelet threshold denoising and an improved FOA.Improved strategy on the basis of fly distance range obeying normal distribution was applied in the denoising process.To verify the feasibility and superiority of the proposed WTD-IFOA, a simulation example was provided and some comparisons were conducted.The simulation example and comparison results showed that the adaptive denoising method could effectively eliminate noise components and the proposed approach outperformed the others.Finally, an industrial application was performed on the shearer in a fully-mechanized coal mining face to test practical effect.
However, there are also some deficiencies and shortcomings in this method including the following.On the one hand, although the proposed WTD-IFOA is much more timesaving than other adaptive denoising approaches, the calculation duration is still a serious problem that cannot be neglected.On the other hand, parameters during the optimization process are determined according to past scholars and a large number of simulation experiments, while strict mathematical deduction are lacking.In future studies, the authors plan to investigate some improvements to the proposed

Conclusions and Future Work
In order to eliminate noise components from the sound signal of a working machine, this paper proposes a novel approach based on wavelet threshold denoising and an improved FOA.Improved strategy on the basis of fly distance range obeying normal distribution was applied in the denoising process.To verify the feasibility and superiority of the proposed WTD-IFOA, a simulation example was provided and some comparisons were conducted.The simulation example and comparison results showed that the adaptive denoising method could effectively eliminate noise components and the proposed approach outperformed the others.Finally, an industrial application was performed on the shearer in a fully-mechanized coal mining face to test practical effect.
However, there are also some deficiencies and shortcomings in this method including the following.On the one hand, although the proposed WTD-IFOA is much more timesaving than other adaptive denoising approaches, the calculation duration is still a serious problem that cannot be neglected.On the other hand, parameters during the optimization process are determined according to past scholars and a large number of simulation experiments, while strict mathematical deduction are lacking.In future studies, the authors plan to investigate some improvements to the proposed approach.These may include an improved algorithm code with higher execution efficiency and appropriate scheme for determining optimization parameters.

Figure 1 .
Figure 1.Process of foraging behavior for fruit fly group.

Figure 1 .
Figure 1.Process of foraging behavior for fruit fly group.

∈ 3 , 3
is about 99.73%.The probability density distribution of original fly distance range and the proposed one are presented in Figure2.Most individuals fly towards the present best location, while more than 30% fruit flies continuing searching at a larger scale.Moreover, individuals flying towards the present optimum tend to a more concentrated region according to the probability distribution condition.Thus, the capacity of global searching and partial location are both enhanced.The flowchart of the IFOA is shown in Figure3.

Figure 2 .
Figure 2. Probability density functions of basic fly distance range and the improved.Figure 2. Probability density functions of basic fly distance range and the improved.

Figure 2 .
Figure 2. Probability density functions of basic fly distance range and the improved.Figure 2. Probability density functions of basic fly distance range and the improved.

Figure 3 .
Figure 3. Flowchart of the improved fruit fly optimization algorithm.

Figure 3 .
Figure 3. Flowchart of the improved fruit fly optimization algorithm.

Figure 4 .
Figure 4. Process of the proposed denoising method.Figure 4. Process of the proposed denoising method.

Figure 4 .
Figure 4. Process of the proposed denoising method.Figure 4. Process of the proposed denoising method.

Figure 5 .
Figure 5. (a) Schematic of the soundproof testing branch; and (b) The sound recorded in site.Figure 5. (a) Schematic of the soundproof testing branch; and (b) The sound recorded in site.

Figure 5 .
Figure 5. (a) Schematic of the soundproof testing branch; and (b) The sound recorded in site.Figure 5. (a) Schematic of the soundproof testing branch; and (b) The sound recorded in site.

Figure 6 .
Figure 6.The extracted sound signal of the motor.

Figure 6 .
Figure 6.The extracted sound signal of the motor.

Figure 9 .
Figure 9. Comprehensive comparison of the five methods: (a) Mean square error of the five methods; (b) Peak value error of the five methods; and (c) Computation time of the five methods.

Figure 10 .
Figure 10.(a) Three-dimensional model of the shearer; and (b) collection of field cutting sound.

Figure 9 .
Figure 9. Comprehensive comparison of the five methods: (a) Mean square error of the five methods; (b) Peak value error of the five methods; and (c) Computation time of the five methods.
Appl.Sci.2016, 6, 199   13 of 16exhibited obvious superiority in the denoising effect, which revealed its terrific global and local ability compared to the other methods in different SNR.

Figure 9 .
Figure 9. Comprehensive comparison of the five methods: (a) Mean square error of the five methods; (b) Peak value error of the five methods; and (c) Computation time of the five methods.

Figure 10 .
Figure 10.(a) Three-dimensional model of the shearer; and (b) collection of field cutting sound.

Figure 10 .
Figure 10.(a) Three-dimensional model of the shearer; and (b) collection of field cutting sound.

Figure 11 .
Figure 11.(a) The orginal sound signal; and (b) the denoised sound signal.

Figure 12 .
Figure 12.(a) Frequency components of the original signal; and (b) frequency components of the denoised signal.

Figure 11 .
Figure 11.(a) The orginal sound signal; and (b) the denoised sound signal.

Figure 11 .
Figure 11.(a) The orginal sound signal; and (b) the denoised sound signal.

Figure 12 .
Figure 12.(a) Frequency components of the original signal; and (b) frequency components of the denoised signal.

Figure 12 .
Figure 12.(a) Frequency components of the original signal; and (b) frequency components of the denoised signal.

Table 1 .
Configuration of the workstation.

Table 1 .
Configuration of the workstation.