A Non-Reference Image Denoising Method for Infrared Thermal Image Based on Enhanced Dual-Tree Complex Wavelet Optimized by Fruit Fly Algorithm and Bilateral Filter

To eliminate noise from infrared thermal images without a reference image or a noise model, an improved dual-tree complex wavelet transform (DTCWT), optimized by an improved fruit-fly optimization algorithm (IFOA) and combined with a bilateral filter (BF), is proposed in this paper. First, the noisy image is transformed by DTCWT, and the noise variance threshold is optimized by the IFOA, which is enhanced through a fly step range with inertia weight. Then, the denoised image is re-processed with a bilateral filter to improve the denoising performance and enhance the edge information. In the experiments, the proposed method is applied to remove both additive noise and multiplicative noise, and the denoising results are compared with other representative methods, such as DTCWT, block-matching and 3D filtering (BM3D), median filter, Wiener filter, wavelet decomposition filter (WDF) and bilateral filter. Moreover, the proposed method is applied as a pre-processing step for infrared thermal images in a coal mining working face.


Introduction
Image denoising is a very important part of image processing, and it is the basis of all subsequent processing, such as image recognition [1] and object detection and tracking [2]. Denoising methods are mainly divided into spatial domain and transform domain filters. Spatial domain filters, such as the median filter [3], Wiener filter [4] and bilateral filter (BF) [5], have high processing speed since they operate directly on the pixels of the original image. These algorithms can remove noise quickly, but their denoising abilities are mediocre. Transform domain filters are more robust since the images are transformed into another domain. Two major transform domain filters are the Fourier transform (FT) [6] and the wavelet transform (WT) [7]. WT overcomes the problem of FT that the window size is fixed and does not change with frequency. Thus, WT achieves better results than FT. The discrete wavelet transform (DWT), proposed by Shensa [8], is widely applied in the image processing field. The dual-tree complex wavelet transform (DTCWT), proposed by N. Kingsbury in 1998 [9], improves on the DWT in several aspects. It has three additional properties: approximate shift invariance, good directional selectivity, and limited data redundancy [10]. Recently, DTCWT has been used to eliminate image noise, and the method has been improved by many researchers. Shahdoosti et al. [11] proposed a denoising method using the dual contourlet transform (DCT), which combined the directional filter bank (DFB) and DTCWT. The bivariate shrinkage function (BSF) with local variance estimation was adopted in [12] to process the wavelet coefficients of DTCWT, and the denoised images were reconstructed from the shrunk wavelet coefficients. By changing the window size and shape of DTCWT, Liu et al. [13] acquired denoised images of good quality. However, these methods cannot reach the best denoising performance since their parameters are fixed. Therefore, intelligent methods that search for the best parameters through an optimization algorithm have been proposed by some researchers.
In recent years, optimization problems have become more and more widespread, and many meta-heuristic algorithms have been proposed, such as the genetic algorithm [14], ant colony algorithm [15], fish swarm algorithm [16], particle swarm algorithm [17], virus colony search (VCS) algorithm [18], firefly algorithm [19], and so on. Among these algorithms, the fruit-fly optimization algorithm (FOA), proposed by Professor Pan Wenchao in 2011 [20], has the advantages of simple structure, easy parameter setting, and fast convergence rate. Thus, FOA is widely used in economic management [21], forecasting [22], planning [23], etc. Nevertheless, similar to other evolutionary optimization algorithms, the original FOA may sink into local optima due to its fixed fly step size. Meanwhile, the fitness function is a key factor that affects the denoising performance. Some researchers have put forward fitness functions, but most of them are full-reference. In practice, reference images cannot be acquired, so these methods are not suitable for practical use.
Moreover, the original thermal images cannot be used directly for analysis because of their low sharpness, noise and inhomogeneity. Therefore, it is a significant task to create a method for thermal images that is capable of achieving the best denoising performance without reference images. In this work, a non-reference image denoising method based on enhanced dual-tree complex wavelet optimized by fruit fly algorithm and bilateral filter (IFOA-DTCWT-BF) is proposed, and the method is compared with DTCWT and other denoising algorithms. Some common objective indexes are used to analyze the performance of these algorithms.
The rest of this paper is organized as follows. In Section 2, some related works are outlined based on the literature. In Section 3, the basic theories of DTCWT, FOA and the bilateral filter are presented. In Section 4, the image denoising method based on IFOA-DTCWT-BF is elaborated, and a fly step range improved by inertia weight is introduced. In Section 5, experiments are conducted, the proposed denoising method is tested, and an application is presented. Some conclusions and outlooks are summarized in Section 6.

Literature Review
This paper is mainly concerned with three research streams: image denoising methods, the dual-tree complex wavelet transform, and the fruit fly optimization algorithm. In this section, we summarize the relevant literature.

Image Denoising Method
In recent years, image denoising has become an urgent need in the image processing field, and many creative methods have been put forward to remove image noise. H. R. Shahdoosti and S. M. Hazavei [11] applied the hidden Markov tree (HMT) with mixtures of one-sided exponential densities to denoise images. To exploit the spatial information in hyperspectral images, a novel denoising method that integrates superpixel segmentation (SS) into low-rank representation (LRR) is presented in [24]. Because of its advantage in preserving image texture, total variation (TV) is widely applied to image denoising problems [25]. Jia Zhi-Gang et al. [26] proposed a new TV-Stokes model for image deblurring with a good geometric explanation. In reality, the noise models of noisy images are unknown beforehand. Zhu et al. [27] considered this problem and proposed a novel blind image denoising algorithm to recover the clean image from a noisy one with an unknown noise model. The kernel greedy algorithm for robust denoising (KGARD), put forward in [28], employs sparse modeling arguments to explicitly model and estimate the outliers, adopting a greedy approach. Recently, intelligent algorithms have been applied in the image denoising field. Zhang Lin et al. [29] proposed a non-model dual-tree wavelet thresholding method for image denoising that uses the improved chaotic drosophila optimization algorithm (CDOA) to estimate the noise variance.

Dual-Tree Complex Wavelet Transform
DTCWT is suitable for image processing applications [9]. In [30], DTCWT is used to create an effective contrast enhancement method that achieves good performance in low-illumination image enhancement. A new method for the interpolation of a full high-definition (HD) image, which applied DTCWT to decompose the low-resolution image into different sub-bands, was presented in [31]. To acquire good performance, the Wiener filter is used to remove the noise component of the DTCWT coefficients [32]. In [33], the DTCWT coefficients are preprocessed by median filtering. Zhao et al. [34] presented a novel biometric image encryption algorithm based on DTCWT and compressed sensing. To produce better fusion effects, Q-shift DTCWT was proposed to simplify the construction of filters in DTCWT [35].

Fruit Fly Optimization Algorithm
FOA is easy to understand and has low computational complexity. That is why FOA has been successfully applied in many areas, such as financial distress model solving [20], annual power load forecasting [36], multidimensional knapsack problem solving [22], PID controller tuning [37], neural network training [38] and so on. In [39], W. Lijuan et al. proposed a novel approach, namely SFOASVR, which hybridizes the SVR model with FOA and seasonal index adjustment to forecast monthly tourist flow. Niu Jinwei et al. [40] applied FOA to optimize the operation of the Texaco gasification process. However, the original FOA may fall into local extrema due to its fixed fly range and limited search space. To overcome these problems, many improved methods have been proposed. Niu et al. [41] divided the fruit fly group of FOA into two parts and used a Cauchy mutation process to mutate fruit fly individuals. Xu et al. [42] improved the fly distance range (FR) of the FOA by drawing it from a normal distribution, $FR \sim N(0, L^2)$. A novel multi-scale cooperative mutation FOA was proposed by Zhang et al. [43] to tackle the limitation of local optima. In [22], parallel search is employed to balance the exploitation and exploration of FOA. Wang Lin et al. [44] applied a level probability policy and a new mutation parameter to balance the population diversity and stability of FOA.

Discussion
Although the above research works are valuable, some drawbacks remain. First, denoising methods improved by optimization algorithms have achieved better performance in recent decades, but at the same time they have been dragged down by the heavy computation and slow convergence rate of the optimization algorithms. Second, improved denoising algorithms usually need reference images to perform well, while, in practice, reference images do not exist. Third, despite its fine iteration rate and easy encoding, the original FOA is still likely to fall into local extrema due to its fixed parameters. Moreover, images processed by DTCWT often contain small speckles, which reduce the quality of the denoised images.
Therefore, an adaptive dual-tree complex wavelet threshold denoising method for non-reference image denoising, based on an improved fruit fly algorithm and a bilateral filter, is proposed in this paper, and a series of simulations and applications demonstrate the effectiveness of the method.

Dual-Tree Complex Wavelet Transform Based on Bivariate Shrinkage Function
When processing an image with DTCWT, the image is decomposed to a desired level by two separable 2D DWT branches, branch a (real part) and branch b (imaginary part), whose filters are explicitly designed to meet the Hilbert pair requirement.
As shown in Figure 1, $h_0(n)$ and $h_1(n)$ are the low-pass and high-pass filters of the upper filter bank, which produce the scale coefficients and wavelet coefficients of the real part, while $g_0(n)$ and $g_1(n)$ are the low-pass and high-pass filters of the lower filter bank, which produce the scale coefficients and wavelet coefficients of the imaginary part. In the first stage, the image undergoes a row transformation through the filters $\{g_0(n), g_1(n)\}$ and then a column transformation through the filters $\{h_0(n), h_1(n)\}$. In the second stage, the image undergoes a row transformation through the filters $\{h_0(n), h_1(n)\}$ and then a column transformation through the filters $\{g_0(n), g_1(n)\}$. The decomposition at each stage produces one low-pass sub-band LL and three high-pass sub-bands HL, LH, and HH. The sum or difference of each pair of sub-bands forms the low-frequency wavelet coefficients and six wavelet transform coefficients in six directions. Then, the bivariate shrinkage function is applied to process the wavelet coefficients.
Appl. Sci. 2017, 7, 1190
The bivariate shrinkage function was proposed by Sendur and Selesnick [45] in 2002. The DTCWT with bivariate shrinkage function can be described as follows.
Assuming that the image is contaminated by noise, the noisy image can be denoted as:
$$f = x \circ n \tag{1}$$
where $f$ is the noisy image; $\circ$ is the operator corresponding to the noise model, e.g., $+$ for additive noise and $\times$ for multiplicative noise; $x$ is the original image; and $n$ is the noise. It is transformed into the dual-tree complex wavelet domain:
$$y = \omega + n \tag{2}$$
where $y$ stands for the wavelet coefficient of the observed image, $\omega$ represents the wavelet coefficient of the original image (to be estimated), and $n$ is the noise wavelet coefficient, which is independent and identically distributed and obeys $N(0, \sigma^2)$. Using the maximum a posteriori (MAP) estimate, $\omega$ is estimated from the noisy observation $y$:
$$\hat{\omega}(y) = \arg\max_{\omega} \, p_{\omega|y}(\omega|y) \tag{3}$$
where $p_{\omega|y}(\omega|y)$ is the posterior probability function.
According to Bayesian estimation theory, Formula (3) can be written:
$$\hat{\omega}(y) = \arg\max_{\omega} \left[ p_{y|\omega}(y|\omega)\, p_{\omega}(\omega) \right] = \arg\max_{\omega} \left[ p_{n}(y - \omega)\, p_{\omega}(\omega) \right]$$
$p_{\omega}(\omega)$ is the bivariate probability distribution function proposed in [45]:
$$p_{\omega}(\omega) = \frac{3}{2\pi\sigma_{\omega}^2} \exp\left( -\frac{\sqrt{3}}{\sigma_{\omega}} \sqrt{\omega_1^2 + \omega_2^2} \right)$$
where $\omega_1$ is the wavelet coefficient, $\omega_2$ is the coefficient of the parent node at the next level of $\omega_1$, and $\sigma_{\omega}^2$ is the variance of the signal, which is related to the location of the wavelet coefficient. The maximum a posteriori estimate of $\omega_1$ can be obtained:
$$\hat{\omega}_1 = \frac{\left( \sqrt{y_1^2 + y_2^2} - \dfrac{\sqrt{3}\,\sigma^2}{\sigma_{\omega}} \right)_+}{\sqrt{y_1^2 + y_2^2}} \; y_1$$
where $y_1$ and $y_2$ are the observed coefficients corresponding to $\omega_1$ and $\omega_2$, and $(u)_+ = \max(u, 0)$. This is the bivariate shrinkage function.
The rectangular window region $N(k)$ centered on the $k$-th wavelet coefficient is defined. The signal variance $\sigma_{\omega}^2$ of this region is estimated from the wavelet coefficients in the $N(k)$ region, and the noise variance $\sigma^2$ is estimated by the median method:
$$\hat{\sigma}^2 = \left( \frac{\mathrm{median}(|y_i|)}{0.6745} \right)^2, \quad y_i \in \text{subband } HH$$
where subband HH is the set of wavelet coefficients in the $N(k)$ region.
From the observation model, we can obtain:
$$\sigma_y^2 = \sigma_{\omega}^2 + \sigma^2, \qquad \hat{\sigma}_y^2 = \frac{1}{M} \sum_{y_i \in N(k)} y_i^2$$
where $M$ is the size of the rectangular region $N(k)$, and $\sigma_{\omega}$ is estimated to be:
$$\hat{\sigma}_{\omega} = \sqrt{\left( \hat{\sigma}_y^2 - \hat{\sigma}^2 \right)_+}$$
After the noisy image is processed by DTCWT, the wavelet coefficients are handled with the locally adaptive bivariate shrinkage model with local variance estimation, and then the denoised image is obtained via the inverse DTCWT of the new wavelet coefficients.
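The shrinkage rule with local variance estimation can be illustrated with a short NumPy sketch. This is only a minimal illustration under stated assumptions, not the authors' implementation: the function names `local_mean`, `bivariate_shrink` and `noise_sigma_from_hh` are hypothetical, the child and parent coefficient arrays are assumed to already have the same shape, and the DTCWT itself is not included.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def local_mean(a, win):
    """Mean of `a` over a win x win window centred on each element (reflect padding)."""
    pad = win // 2
    padded = np.pad(a, pad, mode="reflect")
    return sliding_window_view(padded, (win, win)).mean(axis=(-2, -1))


def noise_sigma_from_hh(hh):
    """Robust median estimate of the noise standard deviation from an HH sub-band."""
    return np.median(np.abs(hh)) / 0.6745


def bivariate_shrink(y1, y2, sigma_n, win=5):
    """Shrink child coefficients y1 using parent coefficients y2 (same shape).

    sigma_n is the noise standard deviation. The signal deviation sigma_w is
    estimated locally as sqrt(max(sigma_y^2 - sigma_n^2, 0)).
    """
    sigma_y2 = local_mean(np.abs(y1) ** 2, win)
    sigma_w = np.sqrt(np.maximum(sigma_y2 - sigma_n ** 2, 0.0))
    mag = np.sqrt(np.abs(y1) ** 2 + np.abs(y2) ** 2)
    # Threshold sqrt(3) * sigma_n^2 / sigma_w; where sigma_w is 0 the coefficient is zeroed.
    with np.errstate(divide="ignore", invalid="ignore"):
        thresh = np.where(sigma_w > 0, np.sqrt(3.0) * sigma_n ** 2 / sigma_w, np.inf)
    gain = np.maximum(mag - thresh, 0.0) / np.maximum(mag, 1e-12)
    return gain * y1
```

The 5 × 5 default window mirrors the window size quoted later in the experiments. Setting `sigma_n` to zero leaves the coefficients unchanged, which is a quick sanity check on the rule.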

Fruit Fly Optimization Algorithm
The fruit fly optimization algorithm (FOA), similar to other biomimetic evolutionary computation algorithms, simulates the foraging process of fruit flies. The procedure of the basic FOA can be summarized as follows, and the schematic diagram is shown in Figure 2.
Step 1.1: Initialize the parameters: population size PS, maximum iteration number $IN_{max}$, location range LR and fly distance range FR. The initial position of the fruit fly group is represented as follows:
$$X\_axis = \mathrm{rand}(LR), \quad Y\_axis = \mathrm{rand}(LR)$$
Step 1.2: An individual fruit fly uses its olfactory sense to find food; the random searching direction and distance are given as follows:
$$X_i = X\_axis + \mathrm{rand}(FR), \quad Y_i = Y\_axis + \mathrm{rand}(FR)$$
Step 1.3: Because the location of the food is unknown, the distance to the origin of coordinates ($Dist_i$) and the judgment value of smell concentration ($S_i$) are calculated as follows:
$$Dist_i = \sqrt{X_i^2 + Y_i^2}, \quad S_i = 1 / Dist_i$$
Step 1.4: The fruit fly with the maximum smell concentration among the fruit fly group is found according to the smell concentration judgment function (also called the fitness function), which is represented as follows:
$$Smell_i = \mathrm{Function}(S_i), \quad [bestsmell, bestindex] = \max(Smell)$$
where bestsmell denotes the maximal smell concentration, bestindex is the corresponding fruit fly number and Smell is the smell concentration of the group.
Step 1.5: The new smell concentration is compared with the previous best smell concentration; if it is better than that of the last generation, the current best location and smell concentration are updated as follows:
$$smellbest = bestsmell, \quad X\_axis = X_{bestindex}, \quad Y\_axis = Y_{bestindex}$$
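Steps 1.1 to 1.5 can be sketched in a few lines of Python. The function name `foa_maximize`, its default parameter values and the guard against a zero distance are assumptions made for illustration; the sketch treats LR and FR as symmetric intervals around zero.

```python
import math
import random


def foa_maximize(fitness, pop_size=20, max_iter=90, loc_range=1.0, fly_range=2.0, seed=0):
    """Basic FOA: maximise fitness(S), where S = 1 / Dist is the smell judgment value."""
    rng = random.Random(seed)
    # Step 1.1: random initial position of the fruit fly group.
    x_axis = rng.uniform(-loc_range, loc_range)
    y_axis = rng.uniform(-loc_range, loc_range)
    smell_best = -math.inf
    for _ in range(max_iter):
        best_smell, best_xy = -math.inf, (x_axis, y_axis)
        for _ in range(pop_size):
            # Step 1.2: random direction and distance around the group position.
            x = x_axis + rng.uniform(-fly_range, fly_range)
            y = y_axis + rng.uniform(-fly_range, fly_range)
            # Step 1.3: distance to the origin and smell judgment value.
            dist = math.hypot(x, y)
            s = 1.0 / max(dist, 1e-12)
            # Step 1.4: evaluate the smell concentration and track the best fly.
            smell = fitness(s)
            if smell > best_smell:
                best_smell, best_xy = smell, (x, y)
        # Step 1.5: keep the new location only if it improves on the previous best.
        if best_smell > smell_best:
            smell_best = best_smell
            x_axis, y_axis = best_xy
    best_s = 1.0 / max(math.hypot(x_axis, y_axis), 1e-12)
    return smell_best, best_s
```

For example, `foa_maximize(lambda s: -(s - 2.0) ** 2)` searches for the judgment value S closest to 2. Because S is the reciprocal of a distance, the basic scheme can only propose positive candidates.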

Bilateral Filter Algorithm
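As a nonlinear spatial filter, BF is able to achieve uniform spatial filtering while preserving the edge information of an image. BF considers both the grey value of each point in the neighbourhood and its geometric distance from the center point. Thus, the grey value after filtering is given by:
$$h(x) = k^{-1}(x) \iint f(\xi)\, c(\xi, x)\, s(f(\xi), f(x))\, d\xi$$
where $k$ is the normalization coefficient:
$$k(x) = \iint c(\xi, x)\, s(f(\xi), f(x))\, d\xi$$
in which $f(\cdot)$ is the grey value of the corresponding point before filtering, and $h(x)$ is the value after filtering. $c(\cdot)$ is the spatial similarity of the center point with its neighbourhood points, and $s(\cdot)$ is the grey-level similarity of the center point with its neighbourhood points.
The functions $c(\cdot)$ and $s(\cdot)$ can be implemented using Gaussian functions, which can be defined as follows:
$$c(\xi, x) = \exp\left( -\frac{1}{2} \left( \frac{\lVert \xi - x \rVert}{\sigma_d} \right)^2 \right), \quad s(f(\xi), f(x)) = \exp\left( -\frac{1}{2} \left( \frac{|f(\xi) - f(x)|}{\sigma_r} \right)^2 \right)$$
where $\sigma_d$ and $\sigma_r$ are the spreads of the spatial and grey-level Gaussians, respectively.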
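A discrete version of the bilateral filter with Gaussian similarity functions can be sketched as follows. This is a brute-force NumPy illustration rather than an optimized or authoritative implementation; the function name and the parameter names `radius`, `sigma_d` and `sigma_r` are assumptions, and the image is assumed to be a 2-D floating-point array.

```python
import numpy as np


def bilateral_filter(img, radius=2, sigma_d=2.0, sigma_r=0.1):
    """Brute-force bilateral filter for a 2-D image (reflect padding at the borders)."""
    img = np.asarray(img, dtype=float)
    padded = np.pad(img, radius, mode="reflect")
    num = np.zeros_like(img)
    den = np.zeros_like(img)
    h, w = img.shape
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            # Neighbour image shifted by (dy, dx).
            nb = padded[radius + dy:radius + dy + h, radius + dx:radius + dx + w]
            c = np.exp(-0.5 * (dy * dy + dx * dx) / sigma_d ** 2)  # spatial similarity
            s = np.exp(-0.5 * ((nb - img) / sigma_r) ** 2)         # grey-level similarity
            num += c * s * nb
            den += c * s
    # den >= 1 everywhere because the centre pixel always contributes weight 1.
    return num / den
```

With a small `sigma_r`, pixels on the other side of a strong edge receive almost no weight, which is why edges survive the smoothing.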

The Proposed Method
In this section, the architecture of the proposed method is presented, and the flow of the proposed method is described as well. To enhance the threshold optimization ability of the proposed method, an improved FOA is proposed.

Architecture of the Proposed Method
This algorithm is a new method that deals with noisy images without reference images. The architecture of the algorithm can be divided into two parts. In the first part, DTCWT enhanced by IFOA (IFOA-DTCWT) is applied to remove noise. Since there are no reference images, the denoised images can only be compared with the noisy images to estimate the denoising performance. The IFOA searches for the noise variance randomly, and new wavelet coefficients are obtained through the bivariate shrinkage function with that noise variance. Then, the denoised image is obtained through the inverse DTCWT of the new wavelet coefficients. The process continues until the optimized denoised image is acquired. After denoising by IFOA-DTCWT, many square-shaped processed areas remain in the denoised image. In the second part, the bilateral filter is therefore applied to smooth these areas and enhance the edge information of the image at the same time. The combination of the two methods achieves good performance. The architecture is shown in Figure 3.

Improved FOA
The fly step range affects the convergence speed and search ability of the fruit fly optimization algorithm. The global search ability of the algorithm is obviously improved when the fly step range is large enough, but its speed slows down. With a small fly step range, the algorithm converges quickly but easily falls into a local optimum.
In this article, an inertia weight is applied to modify the fly step range in order to balance the global search ability and search speed of FOA. In the initial stage of the iteration, a large inertia weight is applied to give FOA a large step range. In the late stage of the iteration, the inertia weight diminishes and makes the step range small. In this way, FOA is intended to have better global search ability and be more precise in local development. Three inertia weight functions ω(g) are considered, where g is the current iteration number and g max is the maximum iteration number. These three inertia weights are applied to improve FOA, forming IFOA1, IFOA2 and IFOA3. The flowchart of the improved FOA is presented in Figure 4.
Figure 4. Flowchart of the improved fruit fly optimization algorithm.
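The effect of an inertia weight on the fly step range can be illustrated with a small sketch. The linearly decreasing weight used here is purely a hypothetical placeholder chosen for illustration and is not one of the paper's three inertia weight functions; the names `linear_inertia` and `weighted_step` and the start and end values are likewise assumptions.

```python
import random


def linear_inertia(g, g_max, w_start=1.0, w_end=0.1):
    """Hypothetical inertia weight that decreases linearly from w_start to w_end."""
    return w_start - (w_start - w_end) * g / g_max


def weighted_step(g, g_max, base_range, rng, weight=linear_inertia):
    """Draw one fly step from the scaled range weight(g) * [-base_range, base_range]."""
    w = weight(g, g_max)
    return rng.uniform(-w * base_range, w * base_range)
```

Early iterations draw steps from nearly the full range, which favours global search, while late iterations draw from a range roughly ten times narrower, which favours local refinement. Any other decreasing schedule can be passed through the `weight` argument.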

Flow of the Proposed Method
The proposed IFOA-DTCWT-BF can be summarized as follows:
Step 2.1: The noisy image is first converted into a gray image. The parameters of IFOA are initialized, such as population size PS, maximum iteration number $IN_{max}$, location range LR and fly step range FR.
Step 2.2: The location of each fly individual is obtained from the fly group position and FR. The distance and smell concentration of each fly individual are calculated according to Equation (14). Each smell concentration is considered as a potential threshold, and each new threshold is a new candidate noise variance. Then, the noisy image is denoised through DTCWT and the bivariate shrinkage function with the new variance. To evaluate the performance, the fitness function f is calculated from the MSE, the mean square error between the noisy image and the denoised image of size m × n:
$$MSE = \frac{1}{mn} \sum_{x=1}^{m} \sum_{y=1}^{n} \left( f(x, y) - \hat{f}(x, y) \right)^2$$
where $f(x, y)$ is the noisy image before processing and $\hat{f}(x, y)$ is the denoised image obtained using the proposed IFOA-DTCWT. The fitness function f reaches its maximum when the MSE is at its maximum.
Step 2.3: The fruit fly with maximal fitness is selected as bestsmell. If the present bestsmell is bigger than the former value, smellbest, the corresponding coordinates are updated. Otherwise, smellbest, X_axis and Y_axis are retained.
Step 2.4: If the ending conditions are reached, smellbest, X_axis and Y_axis are treated as the optimum. Otherwise, Steps 2.2 and 2.3 are repeated.
Step 2.5: The optimal threshold is regarded as the noise variance of the noisy image, and the noisy image is denoised by the DTCWT based on the bivariate shrinkage function.
Step 2.6: After being denoised by the DTCWT, the image is processed with the bilateral filter. Then the final denoised image is obtained.
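The non-reference evaluation in Step 2.2 compares each candidate result only with the noisy input. The sketch below shows that comparison in isolation, under clearly hypothetical assumptions: `denoise` stands in for the DTCWT plus bivariate shrinkage stage and is supplied by the caller, `score_candidates` is an illustrative helper name, and the exact mapping from MSE to the fitness value is not reproduced here.

```python
import numpy as np


def mse(noisy, denoised):
    """Mean square error between the noisy input and a denoised result."""
    noisy = np.asarray(noisy, dtype=float)
    denoised = np.asarray(denoised, dtype=float)
    return float(np.mean((noisy - denoised) ** 2))


def score_candidates(noisy, candidates, denoise):
    """Return (candidate variance, MSE) pairs; no reference image is required.

    `denoise(noisy, variance)` is a caller-supplied placeholder for the
    DTCWT + bivariate shrinkage stage.
    """
    return [(v, mse(noisy, denoise(noisy, v))) for v in candidates]
```

In the full method these scores would be produced for the candidates proposed by the IFOA in each generation rather than for a fixed list.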

Experiment and Application
In this part, the improved FOA was tested, and the proposed method IFOA-DTCWT-BF was used to remove noise from images polluted by Gaussian noise and speckle noise. Moreover, the denoising method was applied to denoise real images from the surface experiment. To test the performance of the improved fruit fly algorithm, a Schaffer function that has only one minimum point, min f(0, 0) = −1, was used to test the search ability of the algorithms. The Schaffer function is often used to test algorithm performance, since there are many local extremum points around the minimum point, forming two loop ditches, and optimization algorithms easily fall into these local minima. In addition, typical image denoising methods, e.g., DTCWT, block-matching and 3D filtering (BM3D), median filtering, Wiener filtering, wavelet decomposition filtering (WDF) and bilateral filtering (BF), were compared with the proposed method. Experiments and algorithm implementation were performed using Matlab; the experiment configuration is shown in Table 1, and the test gray images are shown in Figure 5.

Algorithm Performance Testing
To compare the improved FOA with the original FOA and to test the performance of the three different inertia weights, the four algorithms were tested by searching for the minimum point of the Schaffer function. The Schaffer function has only one minimum point, min f(0, 0) = −1, and many local extremum points around the minimum point form two loop ditches. The function values there are 0.990283 and 0.962776, respectively, so it is easy for an algorithm to fall into these local extrema. The FOA parameters were: maximum number of generations 90, population size 20, initial location range [−1.0, 1.0], and fly step range [−2.0, 2.0]. The three IFOAs changed the original fly step range with different inertia weights. Because of the inertia weight, the fly step range of the improved FOA could be set two or more times larger than the normal step. The improved fly range can be expressed as ω(g) [−5.0, 5.0]. In Figure 6, IFOA was faster in the early stage of the global search. Then, in the later local optimization, IFOA searched with higher precision and found the extremum more accurately.
Figure 5 demonstrates that the improved FOAs were faster than the original FOA in searching for the extremum at the beginning. Later in the process, the local search became more detailed in order to find the accurate extremum. Moreover, different inertia weights led to different performance. IFOA1 and IFOA2 searched quickly, while IFOA3 became faster and faster and surpassed the other two algorithms at the fifth generation. In Figures 6-9, the search route of IFOA was much wider than that of the original FOA. IFOA2 and IFOA3 had almost the same search route. Overall, the fly step improved by inertia weights performed well. Some key parameters of IFOA are initialized: maximum number of generations is 30, population size is 20, location range is [−10, 10], etc.
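The test function itself is not written out in the text, so the following Python sketch uses one common form of the Schaffer function as an assumption. This form has its global minimum f(0, 0) = −1, and its first two ring-shaped ditches lie near radii π and 2π with values of magnitude close to the 0.990283 and 0.962776 quoted above. The function name `schaffer` is illustrative.

```python
import math


def schaffer(x, y):
    """Schaffer-type test function (assumed form) with global minimum f(0, 0) = -1."""
    r2 = x * x + y * y
    return (math.sin(math.sqrt(r2)) ** 2 - 0.5) / (1.0 + 0.001 * r2) ** 2 - 0.5
```

The ditches are only slightly shallower than the global minimum, which is why an optimizer with a small fixed step can settle in one of them instead of reaching the origin.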

Then, the FOA and IFOAs were applied in the denoising algorithm. First, Gaussian noise was added to the original image to form the noisy image, which can be described as g(x, y) = f(x, y) + v(x, y), where f is the original image, v is the Gaussian noise and g is the noisy image.
To simulate Gaussian noise on the original image, the test image Lena.bmp (size: 512 × 512) was polluted with zero-mean Gaussian noise with noise variance σ = 0.1. To compare the optimization ability of the FOA and IFOAs, the four algorithms were applied to optimize the threshold of DTCWT to remove Gaussian noise from the Lena image. The window size of the DTCWT was 5 × 5, which made it convenient for the bilateral filter to remove the small noise in the small areas. The FOA parameters were: maximum number of generations 30, population size 20, initial location range of the flies [−10.0, 10.0], and fly step range [−1.0, 1.0]. For the IFOAs, the fly step range was ω(g) [−2.0, 2.0], and the other parameters were the same as for FOA.
In Figure 11, the IFOAs converged faster than FOA and searched more meticulously. This shows that IFOA possesses the ability of fast global search and precise local search. The IFOA2, with the inertia weight of Formula (22), had the best performance in searching for the optimal threshold of the denoising method. As a result, we used IFOA2 in the proposed method.
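The additive noise model g = f + v, and a multiplicative counterpart for speckle, can be simulated as in the sketch below. It assumes images scaled to [0, 1] and clips the result to that range. The function names are illustrative, and the speckle convention g = f · (1 + v) with Gaussian v is one common choice assumed here, not necessarily the one used in the experiments.

```python
import numpy as np


def add_gaussian_noise(img, variance, rng):
    """Additive model g = f + v with v ~ N(0, variance); img assumed scaled to [0, 1]."""
    img = np.asarray(img, dtype=float)
    noise = rng.normal(0.0, np.sqrt(variance), size=img.shape)
    return np.clip(img + noise, 0.0, 1.0)


def add_speckle_noise(img, variance, rng):
    """Multiplicative model g = f * (1 + v) with v ~ N(0, variance) (assumed convention)."""
    img = np.asarray(img, dtype=float)
    noise = rng.normal(0.0, np.sqrt(variance), size=img.shape)
    return np.clip(img * (1.0 + noise), 0.0, 1.0)
```

A generator created with `np.random.default_rng(seed)` makes the simulated noise reproducible across runs, which is convenient when comparing several denoising methods on the same noisy image.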

Experiment with Gaussian Noise
To compare denoising performance, Gaussian noise with different noise variances was added to different original images, and other denoising methods, such as DTCWT, median filter, Wiener filter, bilateral filter, and wavelet decomposition filter, were used to remove the noise from the images. The image quality metrics were the peak signal-to-noise ratio (PSNR) and the two-dimensional correlation coefficient r of the image matrices. Both PSNR and r were calculated between the denoised images and the original images. Denoising results are listed in Tables 2-4.
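The two quality metrics can be sketched as follows. This is a minimal illustration that assumes the standard definitions: PSNR = 10 log10(peak² / MSE), and r as the usual two-dimensional correlation coefficient between mean-centered image matrices. The function names, the nested-list image representation and the default peak value of 255 are assumptions made for the example.

```python
import math

def _flatten(image):
    """Flatten a nested-list image into a single list of pixel values."""
    return [p for row in image for p in row]

def psnr(original, denoised, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized images."""
    a, b = _flatten(original), _flatten(denoised)
    mse = sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak ** 2 / mse)

def corr2(original, denoised):
    """Two-dimensional correlation coefficient r (undefined for constant images)."""
    a, b = _flatten(original), _flatten(denoised)
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a) *
                    sum((y - mean_b) ** 2 for y in b))
    return num / den

# Hypothetical 2 x 2 images used only to exercise the functions.
original = [[10, 20], [30, 40]]
denoised = [[12, 18], [33, 41]]
print(psnr(original, denoised), corr2(original, denoised))
```

A higher PSNR and an r closer to 1 both indicate that the denoised image is closer to the original.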

The denoising results showed that IFOA-DTCWT-BF performed well on Gaussian noise. Table 3 shows that although DTCWT achieved good PSNR values, its r was not always better than that of the proposed method. Moreover, the proposed method preserved clearer edge information, as can be seen in Figure 13, which shows the edge information of the denoised Lena images processed by DTCWT and by the proposed method. The median filter, Wiener filter and WDF gave average performance. BM3D could not handle heavy noise. The bilateral filter alone was poor at removing Gaussian noise and gave the worst results, whereas within the proposed method it played an important role, as shown in Figure 12. The denoised images are shown in Figures 14-16.
However, infrared thermal images have some unavoidable weaknesses. Low sharpness, noise and inhomogeneity are their three major problems. Because of the low detectivity and spatial resolution of the thermal imaging system, thermal images appear blurred. Random disturbances from the external environment, such as lamp light and dust, cause the noise problem, and the inconsistent response characteristics of the individual pixels of the infrared detector cause the inhomogeneity. Before an infrared thermal image can be used to track the coal mining machine, it needs to be denoised. In this work, the original infrared thermal images were denoised by the proposed method and by other methods, and the denoised images are shown in Figure 21.
Because the images were denoised, the processed images would contain less information than before. Therefore, information entropy was used to represent the amount of information contained in an image: the greater the information entropy, the more information the image contains. Because the noise was removed by the denoising methods, the denoised images were also smoother than before. The information entropy (Ie) and smoothness (Smo) were therefore introduced to evaluate the images in Figure 21. In their definitions, K was the number of gray levels in an image, H_k was the proportion of pixels with gray value k, and S_best(i, j) was the average gray value of the pixels in the best active window. The information entropy and smoothness of the original and processed images are plotted in Figures 22 and 23. As shown in Figure 22, the information entropies of the images processed by the different methods were almost the same. This showed that the denoised images retained good quality and the major information of the images. In Figures 21 and 23, the images processed by the proposed method had the best smoothness. The smoothness of the images processed by IFOA-DTCWT-BF was 24.07, 24.28 and 23.41, values that the other methods did not reach. This was beneficial for the subsequent cutting-unit tracking, as there were fewer unnecessary textures. The analysis indicated that the images denoised by IFOA-DTCWT-BF had better image texture and quality than those from the other methods.
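The information entropy can be illustrated with a short sketch. It assumes the standard Shannon form Ie = −Σ H_k log2 H_k over the gray levels that occur in the image; the base-2 logarithm, the function name and the nested-list image representation are assumptions of this example. The smoothness measure is not sketched here because it depends on the definition of the best active window.

```python
import math
from collections import Counter

def information_entropy(image):
    """Shannon entropy (in bits) of the gray-level distribution of an image."""
    pixels = [p for row in image for p in row]
    counts = Counter(pixels)   # number of pixels per gray value k
    total = len(pixels)
    entropy = 0.0
    for count in counts.values():
        h_k = count / total    # proportion of pixels with gray value k
        entropy -= h_k * math.log2(h_k)
    return entropy

# Hypothetical 2 x 2 image with four distinct gray values.
print(information_entropy([[0, 1], [2, 3]]))
```

Gray levels that do not occur contribute nothing to the sum, so iterating only over observed values avoids evaluating log2(0).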

Conclusions and Future Work
To eliminate noise from a noisy image without a reference image while obtaining good performance, this paper proposed a non-reference image denoising method based on an enhanced dual-tree complex wavelet transform optimized by a fruit fly algorithm and a bilateral filter. In the absence of a reference image, it was hypothesized that the MSE between the denoised image and the noisy image should increase as the noise is removed. The IFOA searched for the optimal threshold to obtain the best denoising performance. The bilateral filter was applied in the final stage, which significantly improved image quality and enhanced the edge information. To verify the performance of the proposed IFOA, it was compared with the original FOA. Moreover, different inertia weights were tested to find the best one. For comparison, six representative methods, DTCWT, BM3D, median filter, Wiener filter, WDF and BF, were implemented. The results showed that the proposed denoising method outperformed the other methods. Finally, the proposed IFOA-DTCWT-BF was applied to infrared thermal images of a roller in operation to test its practical effect.
This work is valuable for image processing because it provides a non-reference denoising method. However, it also has some shortcomings. On the one hand, the computation time is still a problem, although the method is faster than other adaptive methods. On the other hand, the proposed algorithm adopts the MSE as the denoising evaluation index, which may be imperfect. To overcome these shortcomings, the authors intend to improve the method with more effective intelligent algorithms and to form an evaluation function that combines several evaluation indexes to assess the quality of the denoised image. In addition, the authors plan to combine other efficient denoising methods with the proposed method and to apply the result to industrial vision in the future.

Otherwise, repeat Step 1.2 to Step 1.4 to find better smell concentrations.
Step 1.6: When the smell concentration reaches the preset precision value or the iteration number reaches the maximum IN, the iteration stops. Otherwise, repeat the process from Step 1.2 to Step 1.5.
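The iterative search described in these steps can be sketched as below. This is a hedged illustration of a fruit fly optimization loop with an inertia-weighted fly step, not the authors' implementation: the linearly decreasing weight `w` is a hypothetical stand-in for the paper's inertia weight formulas, the function name `ifoa_minimize` is invented, and the sketch assumes the fitness is to be minimized and stops only on the generation limit.

```python
import math
import random

def ifoa_minimize(fitness, pop_size=20, max_gen=30, loc_range=10.0,
                  step_range=2.0, seed=None):
    """Search for the smell judgment value s that minimizes fitness(s)."""
    rng = random.Random(seed)
    # Random initial swarm location.
    x_axis = rng.uniform(-loc_range, loc_range)
    y_axis = rng.uniform(-loc_range, loc_range)
    best_smell, best_s = math.inf, None
    for g in range(max_gen):
        # Hypothetical linearly decreasing inertia weight (an assumption).
        w = 1.0 - g / max_gen
        gen_best = None
        for _ in range(pop_size):
            # Each fly takes a random step scaled by the inertia weight.
            x = x_axis + w * rng.uniform(-step_range, step_range)
            y = y_axis + w * rng.uniform(-step_range, step_range)
            dist = math.hypot(x, y)
            if dist == 0.0:
                continue
            s = 1.0 / dist  # smell concentration judgment value
            smell = fitness(s)
            if gen_best is None or smell < gen_best[0]:
                gen_best = (smell, s, x, y)
        # Move the swarm only when this generation improved on the best so far.
        if gen_best is not None and gen_best[0] < best_smell:
            best_smell, best_s, x_axis, y_axis = gen_best
    return best_s, best_smell

# Hypothetical objective standing in for the denoising fitness.
s, value = ifoa_minimize(lambda t: (t - 0.5) ** 2, seed=42)
```

Large steps early on favor global exploration, while the shrinking weight narrows the search around the current best location in later generations.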

Figure 4. Flowchart of the improved fruit fly optimization algorithm.

Algorithm 1: IFOA-DTCWT-BF
Inputs: Noisy image y
Outputs: Denoised image y*
% Initialization
Set the parameters of the IFOA: population size PS, maximum iteration number IN_max, location range LR, fly step range FR and inertia weight ω(g).

Figure 6. Optimization result of FOA and IFOAs with different inertia weights.

Figure 7. (A) Optimization process of FOA; and (B) fruit fly flying route of FOA.

Figure 6 demonstrates that the improved FOAs were faster than the original FOA in searching for the extremum at the beginning. Later in the process, the local search became finer in order to locate the extremum accurately. Moreover, different inertia weights performed differently. IFOA1 and IFOA2 searched quickly, while IFOA3 accelerated steadily and surpassed the other two algorithms at the fifth generation. In Figures 7-10, the search route of the IFOAs was much wider than that of the original FOA. IFOA2 and IFOA3 had almost the same search route. Overall, the fly step improved by inertia weights performed well.

Figure 10. (A) Optimization process of IFOA3; and (B) fruit fly flying route of IFOA3.

Figure 11. Optimization result using FOA and IFOAs to remove Gaussian noise from noisy image (Lena).

Figure 12. Effect of the bilateral filter: left, the method without BF; right, the proposed method. (A) Lena with noise σ = 0.01; (B) Lena with noise σ = 0.05; (C) Lena with noise σ = 0.1; (D) Lena with noise σ = 0.2.

Figure 20. Original images from the surface experiment. (A) Image from surface mining engineering; (B) Cutting unit of mining machine; (C) Infrared thermal image of cutting unit.

Figure 22. Information entropy of images using different methods.

Figure 23. Smoothness of images using different methods.

Some key parameters of IFOA are initialized: the maximum number of generations is 30, the population size is 20, the location range is [−10, 10], etc.

Table 2. Comparison of the five methods (Lena with Gaussian noise). WDF: wavelet decomposition filter; BM3D: block-matching and 3D filtering; DTCWT: dual-tree complex wavelet transform; IFOA-DTCWT-BF: enhanced dual-tree complex wavelet optimized by fruit fly algorithm and bilateral filter.

Table 3. Comparison of the five methods (Columbia with Gaussian noise).

Table 4. Comparison of the five methods (Peppers with Gaussian noise).

Table 5. Comparison of the six methods (Lena with Speckle noise).