UVicSPACE: Research & Learning Repository

: Nonlinear diffusion has been proved to be an indispensable approach for the removal of noise in image processing. In this paper, we employ nonlinear diffusion for the purpose of denoising audio signals in order to have this approach also recognized as a powerful tool for audio signal processing. We apply nonlinear diffusion to wavelet coefﬁcients obtained from different ﬁlters associated with orthogonal and biorthogonal wavelets. We use wavelet decomposition to keep signal components well-localized in time. We compare denoising results using nonlinear diffusion with wavelet shrinkage for different wavelet ﬁlters. Our experiments and results show that the denoising is much improved by using the nonlinear diffusion process.


Introduction
The removal of noise in the area of signal processing has been a center of creative study among researchers due to the universal fact that signals are degraded by noise.Diffusion has been proved as one of the most important denoising techniques for the removal of noise for image processing problems [1][2][3][4][5].Linear diffusion removes noise very effectively, but unsystematically blurs all image information and, thus, removes or degrades important necessary image features.
In the recent past, nonlinear diffusion techniques, which are isotropic and anisotropic, have been used very successfully for the removal of noise in image processing problems by enabling the design of image filters.Using these techniques, important image features such as edges are also preserved or enhanced.
In this work, we apply nonlinear diffusion in order to investigate this approach as a powerful tool for denoising digital audio signals as well.Audio signals are one class of signals, which have similar importance to images.Since audio signals are onedimensional, linear and isotropic, nonlinear diffusion techniques can be adapted to the denoising problems of audio signal processing.On the other hand, well-localized features of music samples are also protected using nonlinear diffusion.
In audio signals, the most important features in sampled audio signals are oscillations and there is a need to identify between features such as noise and useful information.During the diffusion process of sampled music signals, the useful information should contain periodic oscillations and noise is to be made up of irregular oscillations.Thus, it is necessary to separate useful signal components from noise using some frequency analysis method.Here, we use wavelet decomposition to fulfill the need of keeping signal components well-localized in time, as Fourier analysis methods cannot do the same [6,7].
Wavelet shrinkage is an established denoising technique in wavelet representation [8,9].We will compare the results of our diffusion-based method with those of wavelet shrinkage working on the same wavelet representation.In this analysis, wavelet filters are used.A multilevel wavelet decomposition structure gives approximate and detailed wavelet coefficients for different levels.We apply nonlinear diffusion techniques to the selected detailed wavelet coefficients for the removal of noise.For the reconstruction of signals, we use approximation wavelet coefficients of the obtained decomposition levels and diffuse detail wavelet coefficients of different decomposition levels with different parameters of diffusion filters.
Readers may consult the works of [10][11][12][13][14] for further details on wavelet decomposition and related topics.Readers may also consult the most recent works of [15][16][17][18] for further details on the direction of present work.
Further organization of the present work is as follows: in Section 2, we give definitions and concepts related to the work of the paper.Section 3 presents the experiments and results of denoising, while the conclusion is given in Section 4.

Preliminaries
In this section, we present the necessary definitions and concepts related to the work of this paper.

Digital Signal Processing Using Discrete Wavelet Transform
Digital signal processing [7,19] is concerned with specialized digital signal processors to perform a wide variety of signal analyses and processing via computer algorithms.The wavelet transform provides a time-frequency [20][21][22] representation of a signal by decomposing it into different scales at different resolution levels.In particular, discrete wavelet transform [6,9,23,24] means a numeric algorithm associated with the decomposition and reconstruction of the signals.DWT provides a multiresolution analysis of the signal extracting details and features with the help of scaling and wavelet functions.The fundamentals of DWT originate from subband coding [25].The wavelet method in digital signal processing depends on the representation of a signal f (x) with respect to a basis consisting of scaling functions φ j,k (x) and wavelet functions ψ j,k (x).
If the wavelet functions ψ j,k (x) and the scaling functions φ j,k (x) form an orthogonal basis, then In the above terms, a n,k and b n,k are defined as

Familiy of Wavelet Filters
The properties of the family of wavelets [6,26,27] depend upon the mother wavelet features.The wavelet coefficients obtained from different wavelet filters represent how well the signal is similar to the scaled and translated mother wavelet.The wavelet and scaling functions associated with different wavelets differentiate among them.

Haar Wavelet
If φ(x) = χ [0,1) denotes a scaling function, then a Haar wavelet [26] is given by ψ , where χ is a characteristic function.Two scale relations are given by A Haar wavelet comprises a hierarchical subdivision of the domain of a signal.Thus, it has a disadvantage of lacking translational invariance.

Daubechies Wavelets
Daubechies wavelets [6,23,28] are a family of compactly supported orthogonal wavelets characterized by a maximal number of vanishing moments for some given support.These wavelet filters can provide a perfect reconstruction and their wavelet and scaling functions satisfy recursive relationships.

Symlets and Coiflets Wavelets
Coiflets wavelets [6] are near-symmetric.Their wavelet functions have N  3 and scaling functions have ( N 3 − 1) vanishing moments.Symlets wavelets are a modified version of Daubechies [6,23,27] wavelets with increased symmetry.For efficient implementation, Coiflets filters are compactly supported corresponding to finite impulse response filters.

Biorthogonal Wavelets
Biorthogonal wavelet bases provide us with compactly supported symmetric wavelets [23].Biorthogonal wavelets are formed by a pair of families of dual wavelets, ψ j,k (x) and ψj,k (x), derived from two mother wavelets ψ(x) and ψ(x), respectively, and such that any function f ∈ L 2 (R) can be defined by Hence, f is decomposed by one family and reconstructed by the other.

Wavelet Shrinkage
Wavelet shrinkage is a procedure of denoising by shrinking (i.e., nonlinear soft thresholding) in the wavelet transform domain.Because of the nonlinear shrinking of coefficients in the transform domain, this procedure is different from those denoising methods that are entirely linear.This denoising method is known as the non-parametric method.Thus, it is different from the parametric method, in which parameters must be estimated for a particular method that must be assumed a priori.

Nonlinear Diffusion
Nonlinear diffusion is given by equation [29]: with the initial condition u(x, 0) = s(x) gives 1-D diffusion process, which embeds s(x) into u(x, t), where u(x, t) is a family of signals.It can be noted that u(x, t) has a diffusion time parameter t ∈ (0, +∞), which provides smoothed versions of signal s.The diffusivity g(|∇u|) depends explicitly on |∇u|.This type of diffusivity helps to diffuse u(x, t) least at locations where |∇u| is the largest, since g(s) is small for large s values and at most locations where |∇u| is the smallest, since g(0) = 1.This means fine details, which correspond to small gradients, are removed via diffusion processes, whereas important details such as the edge, which corresponds to large gradients, are preserved.Considering the work in the field of image processing on the theory of edge detection [3,4], the family of diffusivities [30] of an analytic function is defined as follows: where α > 0 and λ > 0.
For the family of filters, the diffusivity g decreases monotonically for s ≥ 0. This property ensures a lower diffusion at the edges.The diffusivity g depends on two parameters λ and α.The parameter λ plays the role of a threshold for the gradients.Gradients that are smaller than λ are smoothed, whereas gradients that are larger than λ are sharpened.The parameter α controls the decay of the diffusivity g.
Using a finite difference scheme [29], we obtain the following difference equation: Rearranging the terms, we obtain the difference Equation ( 4) as The discrete bounday conditions are u n 0 = u n 1 and u n N+1 = u n N , with ∆x and ∆t satisfying the positivity condition ∆t/(∆x) 2 ≤ 1/2.In view of the fact that, from one level to another coarser level, step size is doubled, the explicit discretization [30] of the wavelet coefficient becomes with step size τ for diffusion time.

Experiments and Results
We need some computational data to establish the effectiveness of wavelet denoising.The DWT approach gives the frequency spectrum of a signal in the form of different frequency bands, and we remove the small detail components assumed as noise using nonlinear diffusion.We verify our theoretical approach by denoising three music samples by adding white Gaussian noise with a certain noise variance percentage of the signal variance.Digital technologies are implemented using a MATLAB computing environment.We use wavelet shrinkage [8] and nonlinear diffusion techniques for different wavelet families.The Universal threshold [8] for the 1D signal when data is not normalized with regard to noise-standard deviation is given by Here, in our case, n is the number of samples taken for a music signal and noise level is given by where ∇D j represents the gradient of detail coefficients of the chosen level of decomposition.
The SNR is computed using the formula SNR = 10 log 10 var(x) var(x − y) dB, where x represents the original signal taken for analysis, y represents the denoised signal obtained from the noisy signal (original signal with added noise) and 'var' denotes variance.
As wavelet decomposition influences the frequency bands, thus, the SNR values after denoising are determined for different levels of the wavelet decomposition.To avoid the effect on the main frequency components, a suitable choice of decomposition level is a must.The highest SNR value gives the desired level of wavelet decomposition.Other parameters required for further computation of denoising results are the value of α, positivity condition (mesh ratio), threshold value and wavelet type.
The diffusivity function g(s) is monotonically decreasing for s ≥ 0 and g(s) has rapid decay as s tends to infinity with g(0) = 1.This ensures a lower diffusion at the edges.Values of diffusitivity function g(s) for different values of α are given in Figure 1.Note that we need to diffuse the wavelet coefficients least at the locations where s is large and most at the locations where s is small.Keeping this view point, we take α > 1 so that g(s) decreases with a less rapid rate up to a certain threshold for α > 1.In contrast, g(s) decreases rapidly for 0 < α < 1, i.e., for smaller values of α.

Denoising of First Music Signal
In the first music signal (Note: Haviitz_Get_Out D rumRimShotStraight_4bar 8 4bpm), we add white Gaussian noise, which is 5% of the signal variance.The following table shows a comparision of SNR values by applying wavelet shrinkage and non-linear diffusion techniques of denoising for different wavelet filters.
Below, Table 1 was developed after calculating SNR values using wavelet shrinkage and nonlinear diffusion techniques for the wavelet filters, namely Haar (Daubechies-1), Symlet-8, Daubechies-10, biorthogonal-6.8and coiflet-5 for each wavelet level up to 10 and, out of those, only the highest SNR values for these filters in both techniques are considered for the purpose of study.We note that these highest SNR values are achived at wavelet decomposition level six in each case.Now, a comparision between the SNR values obtained from the two different denoising methods for the above-mentioned filters is made, and this shows that the nonlinear diffusion method performs better as compared with the wavelet shrinkage method for the Daubechies-10 wavelet filter.
Figure 2 shows the original signal (first music signal) and Figure 3 shows the noisy signal after adding noise of 5% to the original signal.Figure 4 shows the denoised signal using nonlinear diffusion for the Daubechies-10 wavelet filter at wavelet decomposition level six.For this denoised signal, we obtain SNR 19.56 db, which is the highest among other wavelet filters used for denoising.We obtain SNR 19.37 db using wavelet shrinkage for the Daubechies-10 wavelet filter at wavelet decomposition level six.Thus, a higher SNR value is achieved using the nonlinear diffusion method, which gives better denoising results as compared with the wavelet shrinkage method.We also note that in the first music signal, the nonlinear diffusion method also has an advantage over wavelet shrinkage for other wavelet filters.Further, Figure 5 shows level six detail coefficients and Figure 6 shows level six approximation coefficients.A comparison of Figures 2, 4 and 6 shows the effectiveness of the nonlinear diffusion technique.

Denoising of Second Music Signal
In the second music signal (Note: 110 D iscoDrums_033_172_SPT2), we add white Gaussian noise, which is 5% of the signal variance.The following table shows a comparison of SNR values by applying wavelet shrinkage and non-linear diffusion techniques of denoising for different wavelet filters.
Table 2 was developed after calculating SNR values using wavelet shrinkage and nonlinear diffusion techniques for the wavelet filters, namely Haar (Daubechies-1), symlet-8, Daubechies-10, biorthogonal-6.8and coiflet-5, for each wavelet level up to 10, and, out of those, only the highest SNR values for these filters in both techniques are considered for the purpose of study.We note that these highest SNR values are achived at wavelet decomposition level six in each case.Now, a comparison between the SNR values obtained from the two different denoising methods for the above-mentioned filters is made, and this shows that the nonlinear diffusion method performs better as compared with the wavelet shrinkage method for the Daubechies-10 wavelet filter.This validates the result of our first music signal.
Figure 7 shows the original signal (second music signal) and Figure 8 shows the noisy signal after adding noise of 5% to the original signal.Figure 9 shows the denoised signal using nonlinear diffusion for the Daubechies-10 wavelet filter at wavelet decomposition level six.For this denoised signal, we obtain SNR 19.65 db, which is the highest among other wavelet filters used for denoising.We obtain SNR 19.56 db using wavelet shrinkage for the Daubechies-10 wavelet filter at wavelet decomposition level six.Thus, a higher SNR value is achieved using the nonlinear diffusion method, which gives better denoising results as compared with the wavelet shrinkage method.We also note that in the second music signal, the nonlinear diffusion method also has an advantage over wavelet shrinkage for other wavelet filters.Further, Figure 10 shows level six detail coefficients and Figure 11 shows level six approximation coefficients.A comparison of Figures 7,9

Denoising of Third Music Signal
In the third music signal (Note: Alesis − Fusion − Acoustic − Bass − C2), we add white Gaussian noise, which is 5% of the signal variance.The following table shows a comparison of SNR values by applying wavelet shrinkage and non-linear diffusion techniques of denoising for different wavelet filters.
Table 3 was developed after calculating SNR values using wavelet shrinkage and nonlinear diffusion techniques for the wavelet filters, namely Haar (Daubechies-1), symlet-8, Daubechies-10, biorthogonal-6.8and coiflet-5, for each wavelet level up to 10, and, out of those, only the highest SNR values for these filters in both techniques are considered for the purpose of study.We note that these highest SNR values are achieved at wavelet decomposition level six in each case.Now, a comparison between the SNR values obtained from the two different denoising methods for the above-mentioned filters is made, and this shows that the nonlinear diffusion method performs better as compared with the wavelet shrinkage method for the Daubechies-10 wavelet filter.This also validates the result of our first music signal.
Figure 12 shows the original signal (third music signal) and Figure 13 shows the noisy signal after adding noise of 5% to the original signal.Figure 14 shows the denoised signal using nonlinear diffusion for the Coiflet-5 wavelet filter at wavelet decomposition level six.For this denoised signal, we obtain SNR 22.69 db, which is the highest among other wavelet filters used for denoising.We obtain SNR 22.55 db using wavelet shrinkage for the Coiflet-5 wavelet filter at wavelet decomposition level six.Thus, a higher SNR value is achieved using the nonlinear diffusion method, which gives better denoising results as compared with the wavelet shrinkage method.We also note that in the third music signal, the nonlinear diffusion method also has an advantage over wavelet shrinkage for other wavelet filters.Further, Figure 15 shows level six detail coefficients and Figure 16 shows level six approximation coefficients.A comparison of Figures 12,14

Discussion
Although all three music samples which were chosen for analysis are quite different (in terms of instruments producing sound), our denoising results obtained for the first music samples are validated by the similar denoising results obtained from the second and third music samples using similar wavelet representations.We see that the Daubechies-10 wavelet for the first and second music samples and Coiflets-5 for the third music sample work well for both the denoising methods.However, the nonlinear diffusion technique leads to better results than the wavelet shrinkage technique.
The number of wavelet levels which are used in our wavelet diffusion process is most important.The value of α determines the rate of diffusion of coefficients, as we use a global value of α for the nonlinear diffusion process and the threshold λ determines the detail coefficients, which are needed for reconstruction after nonlinear diffusion.Both α and λ are interlinked with the coefficient diffusion process because of the choice of the family of diffusivity.Furthermore, the choice of an appropriate mesh ratio helps in improving the quality of denoising.

Conclusions
Wavelet denoising shows qualitative results for the the denoising of audio signals via the diffusive wavelet decomposition method.Thus, we can say that the estimation of the correct wavelet decomposition level is most important.Wavelet type comes into play when wavelets with a larger number of oscillations are used.Considering frequency bands and sampling frequency of the signal under consideration, different parameters linked to diffusivity are to be considered.The results obtained in this work indicate that our proposed technique (nonlinear diffusion), adopted in the paper, gives better desired results as compared with wavelet shrinkage for audio signals as well.

Figure 5 .
Figure 5. Level six detail coefficients for first music signal.

Figure 6 .
Figure 6.Level six approximation coefficients for first music signal.
and 11  shows the effectiveness of the nonlinear diffusion technique.

Figure 10 .
Figure 10.Level six detail coefficients for second music signal.

Figure 11 .
Figure 11.Level six approximation coefficients for second music signal.
and 16  shows the effectiveness of the nonlinear diffusion technique.

Figure 15 .
Figure 15.Level six detail coefficients for third music signal.

Figure 16 .
Figure 16.Level six approximation coefficients for third music signal.

Table 1 .
SNR values for first music signal using wavelet shrinkage and nonlinear diffusion techniques of denoising for different wavelet filters.

Table 2 .
SNR values for second music signal using wavelet shrinkage and nonlinear diffusion techniques of denoising for different wavelet filters.

Table 3 .
SNR values for third music signal.