Abstract
With the current state-of-the-art computer aided manufacturing tools, the spatial resolution of hyperspectral sensors is becoming increasingly higher thus making it easy to obtain much more detailed information of the scene captured. However, the improvement of the spatial resolution also brings new challenging problems to address with signal dependent photon noise being one of them. Unlike the signal independent thermal noise, the variance of photon noise is dependent on the signal, therefore many denoising methods developed for the stationary noise cannot be applied directly to the photon noise. To make things worse, both photon and thermal noise coexist in the captured hyperspectral image (HSI), thus making it more difficult to whiten noise. In this paper, we propose a new denoising framework to cope with signal dependent nonwhite noise (SDNW), Pre-estimate—Whitening—Post-estimate (PWP) loop, to reduce both photon and thermal noise in HSI. Previously, we proposed a method based on multidimensional wavelet packet transform and multi-way Wiener filter which performs both white noise and spectral dimensionality reduction, referred to as MWPT-MWF, which was restricted to white noise. We get inspired from this MWPT-MWF to develop a new iterative method for reducing photon and thermal noise. Firstly, the hyperspectral noise parameters estimation (HYNPE) algorithm is used to estimate the noise parameters, the SD noise is converted to an additive white Gaussian noise by pre-whitening procedure and then the whitened HSI is denoised by the proposed method SDNW-MWPT-MWF. As comparative experiments, the Multiple Linear Regression (MLR) based denoising method and tensor-based Multiway Wiener Filter (MWF) are also used in the denoising framework. An HSI captured by Reflective Optics System Imaging Spectrometer (ROSIS) is used in the experiments and the denoising performances are assessed from various aspects: the noise whitening performance, the Signal-to-Noise Ratio (SNR), and the classification performance. The results on the real-world airborne hyperspectral image HYDICE (Hyperspectral Digital Imagery Collection Experiment) are also presented and analyzed. These experiments show that it is worth taking into account noise signal-dependency hypothesis for processing HYDICE and ROSIS HSIs.
1. Introduction
Hyperspectral images consist of a considerably large number of narrow spectral bands which are uniformly distributed over a wide spectral range. For each pixel, the almost continuous spectral signature adds the third orthogonal spectral dimension to the two-dimensional spatial domain and constitutes the well-known 3D data cube. Thus, hyperspectral signatures offer the capability to identify and discriminate ground cover types. To benefit from the additional spectral dimension, specific processing methods of hyperspectral images, for instance spectral signatures unmixing, target detection and classification, etc., have been developed. For these methods to operate well, the reduction of the noise affecting hyperspectral images (HSIs) should be incorporated into them.
The noise in HSIs can be distinguished into two classes [1]: random noise and fixed pattern noise. Photon and thermal noise are examples of random noise in HSIs, while striping, periodic and interference noise are examples of fixed pattern noise, which is generated by errors in the calibration process and can be removed from the HSI by suitable procedures [2]. However, random noise, due to its stochastic nature, cannot be removed by those procedures and influences the performance of the algorithms adopted in hyperspectral data exploitation. Note that, in this paper, we only focus on the random noise and will later refer to it as noise. For sensors used in hyperspectral imagery, the theory predicts that random noise mainly comes from two aspects: signal-independent (SI) electronic noise and signal-dependent (SD) photon noise [3]. The widely accepted SI noise model is white Gaussian one [1,3]. In [4,5], it was shown that the SI noise in some HSIs is colored, i.e., spectrally non-white. With the improvement of the sensitivity in the electronic components [6], the resolution of the charged coupled device (CCD) camera has improved significantly, so that the photon noise has become as dominant as the signal-independent electronic noise in HSI data collected by new-generation hyperspectral sensors [7,8,9,10]. In this case, the assumption of additive and stationary noise model is not appropriate although this hypothesis is plausible for HSIs where the SI noise is dominant while SD noise, which depends on the useful signal level, is negligible. Therefore, in this paper, we use the widely accepted noise model in [7,8,9,10] including both signal-dependent and signal-independent noise. There are few denoising algorithms based on the photon noise model. The tensor-based denoising methods were proposed only for the white noise situation. According to the different statistical properties of SI and SD noise, in this paper, we propose a method to remove the SD photon noise as well as the SI thermal noise in the HSI.
In the literature, two widely used models for the random noise in HSIs are the additive white noise along both spectral and spatial dimensions [8,11,12,13,14], and the Additive White Gaussian Noise (AWGN) along the spatial dimensions but non-stationary in the spectral dimension [15,16]; these are pretty reasonable processes when the thermal noise is dominant [8]. However, when the SD photon noise is taken into account, these two noise models become appropriate. The SD photon noise in digital images was discussed in [17,18], where the noise parameters were estimated by utilizing a scatter plot-based estimation procedure. Nonetheless, the discussion of these two papers was only limited to the pure SD noise, which is not suitable for the noise in HSIs. A more accurate generalized SD noise model for digital images was introduced in [19], where the noise parameters were estimated by the Linear Minimum Mean Square Error (LMMSE) in the wavelet domain. In addition, the same noise model was employed in [20] under the name Poissonian-Gaussian noise model, where the noise in HSIs was modeled as two parts: the Poissonian part for modeling the photon noise and the Gaussian part for modeling the remaining stationary distribution in the output data. Images were firstly transformed into the wavelet domain; then, the expectation/standard-deviation pairs were estimated by employing a local estimation method. Finally, the Maximum Likelihood (ML) method was applied to the locally estimated expectation/standard-deviation pairs to estimate the global parameters. Based on the generalized SD noise model, a scatter plot-based estimation method was proposed in [7] to estimate the parameters of the noise generated by the new-generation imaging spectrometers.
The generalized SD noise model was then introduced in the HSI framework in [3]. In this paper, the 2D fractal Brownian motion (fBm) model was employed as a statistical model for intra-band image texture correlation, which has made it possible to estimate additive noise variance locally both from homogeneous and textural images Scanning Widow (SW). Then, the noise parameters were estimated by a linear fit in each band. The spectral textural correlation was also considered, and each SW in a Multicomponent Scanning Window (MSW) was treated as a mixture of fBm-samples and noise. Finally, the global noise parameters were estimated by ML estimation.
The generalized SD noise model was also used in [8], where the HYperspectral Noise Parameter Estimation (HYNPE) algorithm was proposed. Unlike [3], HYNPE utilized the assumption that the noise in each band after being whitened followed a standard Gaussian distribution. The joint probability density function (PDF) of the whitened noise values was used to form the ML criterion. Nonetheless, the signal and noise values were assumed to be known in the ML criterion, which was not true in practical situations. Hence, the MLR-theory based approach was exploited to estimate them. However, MLR calculates the parameters by minimizing the Least Square Error (LSE), which is a biased estimator when the noise is not white. Thus, the estimates of MLR are not accurate in HYNPE, leading to the inaccuracy of the final parameter estimates. In this paper, we investigate the relevance of a new denoising framework for reducing simultaneously the SD photon noise and the SI thermal noise in HSIs on classification [21]. Considering that the noise variance is entangled with the signal, a denoising loop is proposed to remove the noise from HSIs. Each iteration of the loop consists three steps: firstly, we do the pre-estimate, i.e., use MWPT-MWF to directly denoise the HSI containing the photon and thermal noise. Secondly, use the pre-estimate to estimate the noise parameters by employing the ML criterion, and then whiten the noise. Finally, apply MWPT-MWF to the whitened HSIs to obtain the post-estimate. Then, in the next iteration, use the post-estimate of the last iteration as the pre-estimate of current iteration. After several iterations, the HSIs will be well denoised and the resulting classification results are improved.
The remainder of this paper is organized as follows: Section 2 gives the signal model used in this paper. Appendix A presents the Multilinear Algebra Tools. Appendix B overviews the HYNPE algorithm. Appendix C presents the MWPT-MWF method. Section 3 presents the proposed iterative denoising method: considering that the noise variance is dependent on the signal, a denoising loop is proposed to remove the noise from HSIs. Each iteration of the loop consists of three steps: (i) we perform a pre-estimation, i.e., use MWPT-MWF [22,23] to directly denoise the HSI containing the photon and thermal noise, (ii) we use the pre-estimation process to estimate the noise parameters by employing the ML criterion, and then whiten the noise and (iii) we apply MWPT-MWF to the whitened HSIs to obtain the post-estimate results. Then in the next iteration, we use the post-estimate of the last iteration as the pre-estimate of current iteration. After several iterations, the HSIs will be well denoised. Section 4 gives some comparative experimental results. The real-world HSI reflective optics system imaging spectrometer (ROSIS) is used in the experiments to evaluate the performances of denoising and classification processes. Finally, Section 6 provides details of the conclusions of the research undertaken.
In this paper, we denote by
| a scalar | |
| a vector | |
| a matrix | |
| a N-order tensor | |
| n-mode unfolding matrix | |
| a tensor obtained by calculating the square root of each element of | |
| n-mode dimension | |
| the Frobenius norm of | |
| ∘ | the vector outer product |
| ⊙ | the Khatri-Rao product |
| ⊛ | the Hadamard product |
| the inner product between and | |
| the mathematical expectation | |
| the transposition. |
2. Signal Modeling with Thermal and Photon Noise
An HSI is a three-dimensional data cube and can be modeled as a tensor , with the first two dimensions being the spatial domain and with the third dimension being the spectral domain. In fact, a hyperspectral sensor captures a HSI as a series of two-dimensional images of the spatial domain by a sensor array. By extending the data model in [7] to the 3D representation, a noisy HSI can be expressed as a third order tensor composed of a multidimensional signal impaired by an additive random noise :
where accounts for both thermal and photon noise and its variance depends on the pixel in the useful signal . The photon noise is caused by the random fluctuation of photon flux arriving at the CCD sensor, and it follows a Poisson model [24]. As the pixel size becomes smaller in the new-generation hyperspectral sensor, the number of photons that reach a pixel per unit time becomes smaller as well. Hence, the photon noise cannot be neglected anymore [20]. For a given entry of the pure signal HSI tensor , the corresponding photon noise element of tensor can be expressed as [25]:
where is a stationary, zero-mean uncorrelated random process independent on with variance . The thermal noise component in each sensor is electronics noise, denoted by which can be modeled as an additive zero-mean white Gaussian noise in each band with variance , while the noise variance changes from sensor to sensor due to different states of the electronic components in the sensors. Elementwise, the data model is [7]:
Then, we can define , and Equation (1) can be correspondingly rewritten as
The unfolding matrix of the HSI data tensor (with ) can be expressed as:
where is the mode-3 unfolding matrix of the multidimensional signal tensor and
with and being the mode-3 unfolding matrices of and , respectively.
3. Proposed Method
The aim of this paper is to obtain the pure signal estimate , which is necessary to determine the noise variance. Nonetheless, since the noise is signal-dependent, the noise variances of the entries of are different from each other and are related to the signal entries . It is worth noting that, for a given entry , the noise is a summation of two Gaussian-distributed variables and . Hence, is a conditional zero-mean Gaussian-distributed random variable, i.e.,
where denotes the normal distribution, and is the noise variance, which can be expressed as [3,7,8]:
This is to say that a precise noise variance estimate needs the precise signal estimate . Hence, the signal estimate and noise variance estimate problems are inter-related, thus making the signal estimate problem difficult to solve. In HYNPE, the noise parameters are estimated by using the signal estimate generated by the MLR theory based method, which estimates the signal by minimizing LSE. However, the LSE estimator requires that the signal and noise should be statistically independent, which is not satisfied in the SD photon noise situation. Thus, the estimates of the signal and noise are not precise, which makes the parameter estimate result unreliable as well. Moreover, since there is only one step for estimating the signal, the imprecise estimate degrades the performances of noise parameter estimation.
To use the classical parameter estimation algorithms, such as LSE and LMMSE, it is necessary to make the noise “independent of” the signal.
From Equation (8), it is evident that the noise variance is dependent on signal . To cut off this relation, we need to whiten the noise:
where the underlined is used to distinguish the whitened data from the original data. After the whitening operation, we can consider that the noise is independent from the whitened signal .
It is worth noting that, in the likelihood function Equation (A13), the signal value is assumed to be known. However, we cannot get this prior information in realistic situations; therefore, the signal value should be replaced by its estimate , as is presented in Appendix B.
Referring to [22], the wavelet-tensor-based algorithm MWPT-MWF yields the most accurate signal estimate . In addition, the well-known HYNPE algorithm permits obtaining the ML estimates of and . Hence, in this paper, we choose to combine these two methods to get more accurate estimation of noise variance for each element of , which can be calculated by:
When the noise variance of each entry of is obtained, the noise can be whitened by:
and the whitened hyperspectral image can be written as
However, MWPT-MWF was proposed for the white noise situation, therefore, when we use it to estimate directly without noise whitening, the estimate result is not accurate. To distinguish it with the signal estimate after noise whitening, is named as pre-estimate. Correspondingly, the estimate obtained after the noise whitening procedure is more accurate than , so we name it as post-estimate. The PWP process needs to be repeated several times to improve the performance of estimation. In fact, HYNPE only takes the first pre-estimation step in the PWP procedure, which degrades its performance when estimating the parameters. However, we utilize the more accurate post-estimate of current PWP iteration as the pre-estimate of the next PWP iteration. Therefore, the estimate accuracy can be improved in the PWP loop.
To adaptively stop the PWP loop according to the processed HSI, we need to find a stop criterion. The RMSE between the pre-estimate and the post-estimate is given as follows:
where and are the tensor forms of the pre-estimate and post-estimate, respectively. With the iteration times increasing, the becomes asymptotically stable. Hence, we can use the relative error of between two adjacent iterations as the stop criterion:
where is the RMSE of last iteration. If e is less than a given value , the loop should be terminated. This newly proposed method is called Signal-Dependent-Noise-Whitening MWPT-MWF(SDNW-MWPT-MWF), and, to make it easy to understand, its pseudo-code and flowchart are also supplied in Algorithm 1 and Figure 1, respectively.
| Algorithm 1: SDNW-MWPT-MWF algorithm |
|
Figure 1.
Flowchart of the SDNW-MWPT-MWF algorithm.
4. Experimental Results
The data set used in the experiments is the HSI captured by the ROSIS during a flight campaign over Pavia University, Northern Italy.
The ROSIS owns 103 spectral bands and pixels with the geometric resolution being 1.3 m. In this paper, only a part of pixels of this image is used. Hence, it is modeled as a tensor in the experiments. The SD photon noise and the SI thermal noise are both taken into account. In order to reproduce different noise scenarios, the SNR ranged from 20 dB to 40 dB with a step of 5 dB. As the power of the SD photon noise and that of the SI thermal noise are of the same level, in this paper, only the case is taken into account. The random noise is generated with a variance depending on the value of the useful signal according to Equation (8) and added into the signal as Equation (3) to create the noisy HSI data . The raw HSI has SNR between 35 and 40 dB [26,27]. This high-SNR HSI could be viewed as a noise-free data cube, so, in this experiment, the raw image can be taken as a reference data cube .
The RGB composites of and are shown in Figure 2. Correspondingly, Figure 3 presents the curve of the mean noise variance versus the band number.
Figure 2.
RGB composites of and (band 20, 35 and 45 for red, green and blue): (a) RGB composites of ; (b) RGB composites of .
Figure 3.
Mean noise variance in each band ( dB).
In the experiments, the wavelet db3 and transform level [1 1 0] are employed in the SDNW-MWPT-MWF. It is worth noting that various denoising methods can be used to replace the MWPT-MWF in the proposed SDNW-MWPT-MWF method. MWF is a classical tensor-based denoising method and MLR was used in the HYNPE method. Therefore, MWF and MLR have been considered to replace the MWPT-MWF as comparative experiments and are named as SDNW-MWF and SDNW-MLR, respectively. In this paper, the value of is set to for these three methods.
4.1. Noise-Whitening Performance Evaluation and Comparison
According to the relationship given in Equation (10), the noise variance estimate relies on the SD noise variance estimate and the SI noise variance estimate . Hence, the performance of estimating and influences directly the noise variance estimation result. Thus, we firstly consider the evolution of and in the estimation loop. The RMSE is employed to analyze the accuracy of and . The RMSE of the SD photon noise variance and the SI thermal noise variance are calculated by
Notice that low values for and denote good estimation accuracy.
A comparative analysis of SDNW-MWF, SDNW-MLR and SDNW-MWPT-MWF have been carried out by analyzing and with the maximum iteration times being set as 10. Figure 4 and Figure 5 present the evolution of and (in logarithmic scale) with the iteration times. From these two figures, it can be seen that, in the case where dB, SDNW-MLR performs better than SDNW-MWF in estimating both and . However, in the case where dB, SDNW-MWF outperforms SDNW-MLR. Nonetheless, in both cases, the proposed SDNW-MWPT-MWF can improve the estimation performance significantly according to the lowest and it obtains. Moreover, and are greater than the initial error in SDNW-MLR and SDNW-MWF, whereas and are well constrained in SDNW-MWPT-MWF.
Figure 4.
Evolution of with iteration times according to different values of : (a) 20 dB; (b) 40 dB.
Figure 5.
Evolution of with iteration times according to different values of : (a) 20 dB; (b) 40 dB.
Apart from and , the estimate of the signal also influences the accuracy of the noise variance estimate of a pixel (see Equation (10)). Since the post-estimate of current PWP iteration is used as the pre-estimate of the next PWP iteration, we only analyze the estimation performance of the post-estimate . To assess the performance of the signal estimator , we resort to the , which will be presented in Section 4.2.
To intuitively present the noise whitening results, Figure 6 shows the mean noise variance of each band after the noise whitening operation. It is evident that the mean noise variance generated by SDNW-MWPT-MWF changes slightly around 1 and is quite constant with respect to the band number. However, the mean noise variance generated by SDNW-MLR and SDNW-MWF is not very satisfactory. In Figure 6a, it can be seen that the trend of the mean noise variances in the lower bands is similar to that in Figure 3 thus implying that the noise in the lower bands (from 1 to 20) is not well whitened. On the other hand, in the bands from 20 to 100, the noise variances are relatively constant, the mean noise variance value is not 1. In Figure 6b, the noise whitening results by SDNW-MLR and SDNW-MWF are worse than that in Figure 6a. Hence, the whitening results in Figure 6 are strongly in favor of the proposed method SDNW-MWPT-MWF.
Figure 6.
Curves of noise variance versus band number with : (a) 20 dB; (b) 40 dB.
The normal probability plot is usually used to visually ascertain whether or not a dataset is approximately normally distributed, which means that the values in the dataset have the same noise variance. Hence, we supply the normal probability plots of the noise before and after whitening in Figure 7. It is obvious that the noise before whitening (Figure 7a) is not normally distributed. After whitening by SDNW-MWF (Figure 7b), the noise approaches towards the normal distribution though there still remain some values not well whitened as can be observed in the interval [−5, 0]. Nonetheless, the noise values after whitening by SDNW-MLR (Figure 7c) and SDNW-MWPT-MWF (Figure 7d) form a straight line and as such they can be considered as normally distributed. Correspondingly, Figure 8a presents the noise distribution situation in the noise environment where dB. It can be seen that the noise values after whitening by SDNW-MLR (Figure 8b) and SDNW-MWF (Figure 8c) are still not normally distributed, while the noise values after whitening by SDNW-MWPT-MWF (Figure 8d) are well normally distributed. The results in Figure 7 and Figure 8 validate once again that the proposed SDNW-MWPT-MWF performs well in both lower ( dB) and higher ( dB) SNR noise environments.

Figure 7.
Normal probability plot for the noise: (a) before whitening, and whitening after (b) SDNW-MWF; (c) SDNW-MLR; (d) SDNW-MWPT-MWF, with dB.
Figure 8.
Normal probability plot for the noise: (a) before whitening, and after whitening using (b) SDNW-MLR; (c) SDNW-MWF; (d) SDNW-MWPT-MWF, with dB.
4.2. Denoising Performance
In Section 4.1, we have discussed in detail the SD and SI noise variance estimates and . In this subsection, we show some results about the denoising performance. Figure 9 presents the noise removed in band 10 from the ROSIS HSI by SDNW-MLR, SDNW-MWF and SDNW-MWPT-MWF in the noise environment where dB. It is obvious that the removed noise is SD noise by comparing visually Figure 9 with the original image Figure 2a. The experimental results in Figure 9 imply that the proposed denoising framework is efficient in removing the SD photon noise in the HSI.
Figure 9.
Noise removal using: (a) SDNW-MLR; (b) SDNW-MWF; (c) SDNW-MWPT-MWF, band 10, dB.
Additionally, we have assessed the denoising performance of various methods by analyzing the criterion . Figure 10 presents the evolution of the in various noisy environments. It is obvious that the generated by SDNW-MWPT-MWF reaches a higher stable value after several iterations. Conversely, the of SDNW-MLR and SDNW-MWF are relatively lower than that of SDNW-MWPT-MWF due to the highest estimation errors of and as shown in Figure 4 and Figure 5, respectively. When dB, the generated by SDNW-MWF is only improved marginally compared to the . On the other hand, when dB SDNW-MWF improves the SNR significantly but the is not stable in the evolution when the iterations are increased. The SDNW-MLR performs well when dB, but when dB the denoising performance of SDNW-MLR is not good compared to that of SDNW-MWF and SDNW-MWPT-MWF. This trend becomes worse as the of SDNW-MLR is lower than the . Moreover, Figure 11 compares the of each method from 20 dB to 40 dB with a step of 5 dB. It shows that SDNW-MLR can improve the SNR from 20 to 30 dB, while degrading the SNR from 35 to 40 dB. The SDNW-MWF is even worse as there is only a marginal improvement of the SNR in 20 dB. From 25 to 40 dB, it degrades the SNR. Nonetheless, from the results presented in Figure 10 and Figure 11, we can conclude that SDNW-MWPT-MWF is a stable and reliable denoising method in various noisy environments.
Figure 10.
Evolution of with iteration times according to different values of : (a) 20 dB; (b) 40 dB.
Figure 11.
Comparison of denoising results based on versus .
4.3. Classification after Denoising
In Section 4.2, we have mainly compared various methods with the capability of improving the SNR. However, some methods might also modify the useful signal severely in the denoising process and which cannot be reflected by the SNR. The classification is employed to distinguish different materials in HSIs [21], and it is sensitive to the signal distortion. Hence, in this paper, we take into account the classification improvement ability of the various methods considered.
Two real-world images are considered for this investigation. The first one, referred to as ROSIS HSI, is described in the beginning of this section.
In the ROSIS HSI, there are nine classes: bitumen, self-blocking bricks, trees, shadows, gravel, bare soil, asphalt, painted metal sheets, and meadows, which are shown in Figure 2b and in the ground truth in Figure 12 with different colors. A proportion of of the reference data of each class is randomly selected as the training samples. The numbers of training and testing samples are shown in Table 1. The SVM classifier is employed to do the classification and its kernel function is RBF with and the penalty parameter .
Figure 12.
Classification reference data:(a) ground truth of the area; (b) nine classes in ROSIS HSI.
Table 1.
Training and testing samples used in the classification.
The second one, referred to as HYDICE HSI, was acquired by the HYperspectral Digital Imagery Collection Experiment (HYDICE) and has 148 spectral bands (from 435 to 2326 nm), 310 rows, and 220 columns. The scene is shown in Figure 13a. This HSI is modeled as a tensor and its ground truth is shown in Figure 13b. According to the ground truth, there are seven land cover classes in HYDICE HSI: field, trees, road, shadow and three different targets. A proportion of of the reference data of each class is randomly selected as the training samples. The numbers of training and testing samples are shown in Table 2.
Figure 13.
HSI images: (a) ground truth; (b) classes in HYDICE HSI.
Table 2.
Training and testing samples used in the classification.
The classification is applied to the denoised HSIs using various methods in order to compare their abilities of improving the classification performance. Figure 14 presents the classification results obtained in the noisy environment where dB. It can be seen from the “bare soil” class that there are less misclassified pixels in the results obtained by SDNW-MWPT-MWF compared to the other results. To make it easy to compare, we give the Overall Accuracy (OA) and Kappa coefficient (K) results for the various methods used in Table 3. It can be seen that if the classification is applied directly to the noisy HSI, the OA is only and K = . After denoising by SDNW-MLR, there is a marginal improvement of OA resulting in and K = . SDNW-MWF performs better than SDNW-MLR, and its OA is increased to and K to . The proposed SDNW-MWPT-MWF makes the most significant improvement among the denoising methods. Indeed, its OA is and its K is . The classification result clearly shows that the proposed SDNW-MWT-MWF is efficient in improving the classification performance.
Figure 14.
Classification results of the ROSIS HSI after denoising by: (a) SDNW-MLR; (b) SDNW-MWF; and (c) SDNW-MWPT-MWF. The classification result without denoising (d) is supplied as a benchmark. ( dB).
Table 3.
OA (%) and Kappa of the classification of the denoised ROSIS HSI, dB.
To investigate how the number of training samples affects the performance of our method and other comparison methods, we considered a proportion of of the reference data of each class is randomly selected as the training samples. Table 4 shows the OA and Kappa coefficient results. As shown from Table 3 and Table 4, our proposed method outperforms all other methods. In particular, with the number of training samples reducing (), our method could obtain bigger accuracy gains than other comparison methods.
Table 4.
OA (%) and Kappa of the classification of the denoised ROSIS HSI, dB.
Figure 15 shows the OA values obtained from the denoised HYDICE HSI and shows that the SDNW-MWPT-MWF method permits reduction of the noise, which is of great interest for SVM classifier. The comparison of the OA and Kappa values (see Table 5) calculated for each preprocessing of denoising shows that the multilinear algebra-based method SDNW-MWPT-MWF leads to better classification results than the considered methods in this experiment.
Figure 15.
Classification results of the HYDICE HSI after denoising by: (a) SDNW-MLR; (b) SDNW-MWF; and (c) SDNW-MWPT-MWF. The classification result without denoising (d) is supplied as a benchmark. ( dB).
Table 5.
OA (%) and Kappa of the classification of the denoised HYDICE HSI, dB.
To make the experimental results more convincing, we have compared the classification improvement performance of SDNW-MLR, SDNW-MWF and SDNW-MWPT-MWF in varying noisy environments, i.e., varies from 20 dB to 40 dB with a step of 5 dB. The classification result of the noisy HSI without denoising is also supplied as a benchmark. Figure 16 presents the curves of OA versus . In the cases where the HSI is impaired severely ( dB), the classification result can be improved significantly after denoising, which implies that the denoising is a necessary preprocessing procedure prior to the classification. Above 30 dB, SDNW-MLR can only improve the classification result marginally and the same is valid for SDNW-MWF with more than 35 dB. Nonetheless, SDNW-MWPT-MWF performs better than SDNW-MLR and SDNW-MWF and it improves the OA significantly for values from 20 dB to 40 dB.
Figure 16.
Classification OA with respect to .
5. Discussion
Hyperspectral sensors collect data in hundreds of narrow contiguous spectral bands, providing a powerful means to discriminate different materials by their spectral features. In the last decade, for improving the classification, several denoising algorithms for hyperspectral images were proposed. Most were derived assuming the spatial stationarity of the noise that affects hyperspectral images, meaning that the noise characteristics are assumed to be the same in each hyperspectral image region. The existing algorithms are proposed to remove the signal independent noise for enhancing hyperspectral image applications. In this study, we have proved that assumption is not valid for new-generation hyperspectral sensors, where photon noise, which depends on the spatially varying signal level, is not negligible. We are thus studying the possible impacts of signal-dependent noise on the performance of existing classification algorithms. By assuming a signal-dependent noise model, we have proposed a novel multilinear algebra-based algorithm to remove simultaneously the signal-independent noise and signal-dependent noise. This result has inspired a new iterative procedure noise-whitening (SDNW-MWPT-MWF) which improves the SVM’s robustness in the presence of signal-dependent noise. The noise-whitening is achieved by exploiting estimates of the noise variance for each band and for each pixel. The performance of the proposed SDNW-MWPT-MWF method are validated on the simulated HSIs disturbed by both SD and SI noise and on the real-world ROSIS and HYDICE HSIs. From the analysis and the comparative study against other similar methods in the experiments, it can be concluded that SDNW-MWPT-MWF method can effectively reduce both SD and SI noise from HSIs. It is also necessary to take into account the signal-dependent noise in the denoising when dealing with HSIs that were collected by a new-generation airborne hyperspectral sensors. Indeed, this study demonstrated that the signal-dependent noise may affect the properties of existing classification algorithms, thus encouraging future work about the impact of SD noise. Our ongoing activity is aimed at demonstrating the benefits arising from using the SDNW-MWPT-MWF algorithm in mitigating the impact of the SD noise in different algorithms for hyperspectral data exploitation.
Since in this study the optimal parameter combination is found by time-consuming brute force searching, future works will be focused on the reduction of the computational load. A heuristic algorithm can be used to search for the optimal (sub-optimal) parameter combination such as a genetic algorithm.
6. Conclusions
In this paper, a denoising method SDNW-MWPT-MWF has been proposed under the PWP denoising framework for the purpose of reducing the SD photon and SI thermal noise in HSIs. Two other adapted denoising methods SDNW-MLR, SDNW-MWF are also used as comparative methods. The performances of these methods are compared in the case of photon noise whitening, denoising and improvement of classification. The photon noise whitening experiment, the denoising experiment and the classification experiment are designed to assess the parameter estimation performance, the improvement of the SNR and the distortion of the spectra in the HSI, respectively. From the experimental results obtained, it can be concluded that the SDNW-MLR performs well in removing the noise, though it also changes the signal severely in the denoising process as shown from the lower classification result OA. In contrast, SDNW-MWF performs well in preserving the signal though it cannot remove the noise well from the images. However, the proposed SDNW-MWPT-MWF is able to obtain high as well as the high OA, thus implying that it is able to remove noise well while still preserving the signal.
These promising results encourage us to extend our experiments on other hyperspectral data such as Indian Pines and Salinas HSIs.
In this study, we employed the HYNPE algorithm to estimate the noise parameters, and then filtered noise by the proposed method SDNW-MWPT-MWF. Nonetheless, SDNW-MWPT-MWF converts SD noise to additive white Gaussian noise by a pre-whitening procedure and then applied MWPT-MWF. Due to the parameter estimate errors, the whitened noise is only approximately white, therefore the performance of filters developed for the white noise might degrade. Thus, it is interesting to develop a filter that can directly process the SD noise without the noise whitening procedure. For example, the tensor-based filtering method using a PARAFAC tensor decomposition could be carried out in the future.
Author Contributions
All authors contributed to the conception and design of the methodology.
Funding
This research received no external funding.
Acknowledgments
The authors would like to thank the reviewers for their careful reading and helpful comments, which improved the quality of this paper.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Multilinear Algebra Tools
Since multilinear algebra is used in this paper, we present some basic definitions in this section to make it easy to understand the discussions in the following sections. More details about multilinear algebra can be found in [28,29,30,31].
Appendix A.1. Definition of a Tensor
An N-th order tensor is an N-dimensional array, , in which indicates the real manifold, and N is the number of dimensions. The elements in this tensor can be expressed as , with . The nth dimension of this tensor is called mode-n.
Appendix A.2. Unfolding
Unfolding is also known as matricization or flattening. It can convert a tensor into its matrix form for the conveniency of using matrix-based data analyzing methods. A tensor can be unfolded in different ways according to the mode in which the unfolding is performed. The unfolding in mode-n is called mode-n unfolding.
Let denote the mode-n unfolding matrix of a tensor , where . The columns of are the -dimensional vectors obtained from by varying index while keeping the other indices fixed.
Appendix A.3. Mode-n Tensor Product × n
The mode-n product is defined as the product between a data tensor and a matrix in mode n. This mode-n product is denoted by , whose entries are given by
where .
Appendix A.4. Element Extraction
For a given tensor , we define the following element extraction operation:
Appendix A.5. Hadamard Product ⊛
The Hadamard product is defined as the product of two equal-size tensors and . The entries of the Hadamard product can be computed as:
where .
Appendix A.6. Mode-n Rank of a Tensor
The term mode-n rank of a tensor , denoted by , is the dimension of the column space of the mode-n unfolding matrix [29], i.e.,
The term mode-n rank is another way to extend the notion of the matrix rank to the tensor case. The mode-n rank is actually a rank of a matrix, therefore it can be analyzed by using matrix based techniques. Nevertheless, for a matrix , the ranks of its column space and row space are similar, , but, for the tensor , the different mode-n ranks are not necessarily the same. This means that the mode-n rank of tensor is not a unique value but a set of N values .
Appendix B. Overview of HYNPE
HYNPE was proposed in [8] to estimate the parameters of the photon noise which has aroused recently much research interest in the new generation hyperspectral sensors. HYNPE assumes that the noise sources are independent from one another and spectrally uncorrelated, which is the same noise model as the one introduced in Section 2. There are mainly two steps of HYNPE for estimating the noise parameters.
The first step is to estimate the noise and signal by employing the MLR theory. For a given noisy HSI , we consider its mode-3 unfolding matrix , which consists of row vectors (see Appendix A):
Denote the estimate of pure HSI by and noise by . Then, the mode-3 unfolding matrix of and are and , respectively, and their row-vector forms can be expressed as:
The MLR theory estimates the signal by exploiting the strong spectral correlation of the useful signal and the weak between-band correlation of the random noise in the HSI. It assumes that the signal estimate row vector in band can be expressed as a linear combination of the noisy data row vectors () in the other bands:
where and is the combination weight vector. Then, the optimal weight vector can be estimated by minimizing the LSE:
The solution of the LSE problem is well known and can be expressed as:
and the corresponding signal estimate and noise estimate can be computed by:
After the estimation of the signal and noise, the noise variances and are estimated by maximizing the likelihood function [8]:
with
Since and are unknown in a realistic scenario, they are replaced by the estimates computed in Equations (A10) and (A11), respectively.
Appendix C. Multiway Wiener Filter in Multidimensional Wavelet Packet Domain
In this section, we remind the main principles of the multidimensional wavelet packet transform (MWPT) and of the MWF in multidimensional wavelet packet domain [22].
Appendix C.1. MWPT
MWPT can be computed by performing 1D wavelet packet transform in each mode [32]. Therefore, the wavelet packet coefficient tensor can be computed as
and the reconstruction can be written as
where , and . In particular, when , MWPT indicates the 3D wavelet packet transform. denotes the level wavelet packet transform to kth mode of . is defined as the coefficient subtensor of , where is the index vector, and , . Then, for each element of , we can define:
where
and
The notation indicates the element of tensor in position as defined in Equation (A2). From the properties of the wavelet packet transform, we know that indicates the “frequency” of mode n. Thus, is the frequency index of coefficient block . For convenience, a component tensor of is referred to as in this paper.
Appendix C.2. Multiway Wiener Filter in Multidimensional Wavelet Packet Domain
In the existing MWF algorithm, the filter is applied to the whole hyperspectral image . As the calculation of the filters needs the estimation of the signal subspace or rank in each mode for suppressing the smallest eigenvalues [33], some weak signal might be removed in this procedure. Therefore, the SNR is an important factor influencing the rank. When SNR is higher, the rank estimated is greater, therefore more signal is preserved in the filtering process. In the contrast condition, more signal is lost. When the noise is white, the power of noise in each component is the same, whereas the signal concentrates in the lower frequency component. That is to say, in different components, the SNR is different. When MWF is applied to each component, more signal can be preserved. Performing MWPT to tensor , and in Equation (1), we obtain:
The coefficient tensor of each part
and the coefficient tensor of the estimate :
Extracting the components of each frequency , and from , and respectively by using Equation (A16), we obtain:
From Parseval’s theorem, the following expression can be obtained:
which means that minimizing the MSE between and its estimate is equivalent to minimizing the MSE between and for each . If is estimated by Tucker3 decomposition of :
then are the mode-n filters of the multiway Wiener filter [33]. After estimating for each , we obtain by concatenating . Furthermore, the estimate can be obtained by inverse MWPT, i.e.,
Appendix C.3. Best Transform Level and Basis Selection
In MWPT-MWF algorithm, several parameters should be determined:
- Level of transform: the performance of the algorithm is affected by the level of transform, which depends on the size of tensor . The maximum level can be calculated by:where rounds a number upward to its nearest integer, and the constant 5 is reduced from to make sure there are enough elements in each mode so that the transform is meaningful. Then, the set of possible transform levels can be expressed as:where denotes a set.
- Basis of transform: there are many wavelet bases designed for different cases. For the simplicity of expression, we define:to denote the set of possible wavelet bases, where is the number of wavelets in this set.
The best transform level and basis should minimize the MSE or risk [34], whose equivalent form using the coefficients can be expressed as
Then, the best transform level and basis can be selected by
As the selection of the optimal depends on which is generally unknown, to overcome this drawback, an alternative solution should be found. Denoting by the estimate of at the dth ALS loop and noticing that when is minimized, is the optimal estimate of obtained by MWF, and at the same time is minimized. Therefore, Equation (A32) can be replaced by:
where
Appendix C.4. Summary of the MWPT-MWF Method
The algorithm MWPT−MWF, can be summarized as presented here.
- Find the optimal and . Loop and w:
- (a)
- Decompose the data by MWPT: .
- (b)
- Extract component from using Equation (A16), for , where , .
- (c)
- Filter component by MWF: .
- (d)
- Calculate the risk using Equation (A34). If reaches a fixed threshold, return the optimal and .
- Concatenate to obtain and perform inverse MWPT: .
References
- Acito, N.; Diani, M.; Corsini, G. Signal-dependent noise modeling and model parameter estimation in hyperspectral images. IEEE Trans. Geosci. Remote Sens. 2011, 8, 2957–2971. [Google Scholar] [CrossRef]
- Rakwatin, P.; Takeuchi, W.; Yasuoka, Y. Stripe noise reduction in modis data by combining histogram matching with facet filter. IEEE Trans. Geosci. Remote Sens. 2007, 45, 1844–1856. [Google Scholar] [CrossRef]
- Uss, M.L.; Vozel, B.; Lukin, V.V.; Chehdi, K. Local signal-dependent noise variance estimation from hyperspectral textural images. IEEE J. Sel. Top. Signal Process. 2011, 3, 469–486. [Google Scholar] [CrossRef]
- Roger, R.E. Principal components transform with simple, automatic noise ajustment. INT J. Remote Sens. 1996, 17, 2719–2727. [Google Scholar] [CrossRef]
- Chang, C.-I.; Du, Q. Interference and noise adjusted principal components analysis. IEEE Trans. Geosci. Remote Sens. 1999, 37, 2387–2396. [Google Scholar] [CrossRef]
- Rhodes, H.; Agranov, G.; Hong, C.; Boettiger, U.; Mauritzson, R.; Ladd, J.; Karasev, I.; McKee, J.; Jenkins, E.; Quinlin, W.; et al. Cmos imager technology shrinks and image performance. In Proceedings of the 2004 IEEE Workshop on Microelectronics and Electron Devices, Boise, ID, USA, 16 April 2004; pp. 7–18. [Google Scholar]
- Alparone, L.; Selva, M.; Aiazzi, B.; Baronti, S.; Butera, F.; Chiarantini, L. Signal-dependent noise modelling and estimation of new-generation imaging spectrometers. In Proceedings of the Workshop on Hyperspectral Image and Signal Processing (WHISPERS), Grenoble, France, 26–28 August 2009; pp. 1–4. [Google Scholar]
- Liu, X.; Bourennane, S.; Fossati, C. Denoising of hyperspectral images using the parafac model and statistical performance analysis. IEEE Trans. Geosci. Remote Sens. 2012, 50, 3717–3724. [Google Scholar] [CrossRef]
- Gao, L.; Yao, D.; Li, Q.; Zhuang, L.; Zhang, B.; Bioucas-Dias, J.M. A New Low-Rank Representation based hyperspectral image denoising method for mineral mapping. Remote Sens. 2017, 9, 1145. [Google Scholar] [CrossRef]
- Fan, Y.R.; Huang, T.Z.; Zhao, X.L.; Deng, L.J.; Fan, S. Multispectral Image Denoising via Nonlocal Multitask Sparse Learning. Remote Sens. 2018, 10, 116. [Google Scholar] [CrossRef]
- Atkinson, I.; Kamalabadi, F.; Mohan, S.; Jones, D. Wavelet-based 2D multichannel signal estimation. In Proceedings of the IEEE International Conference on Image Processing, Barcelona, Spain, 14–17 September 2003; pp. 743–745. [Google Scholar]
- Pizurica, A.; Philips, W. Estimating the probability of the presence of a signal of interest in multiresolution single-and multiband image denoising. IEEE Trans. Image Process. 2006, 3, 654–665. [Google Scholar] [CrossRef]
- Letexier, D.; Bourennane, S. Noise removal from hyperspectral images by multidimensional filtering. IEEE Trans. Geosci. Remote Sens. 2008, 7, 2061–2069. [Google Scholar] [CrossRef]
- Renard, N.; Bourennane, S. Improvement of target detection methods by multiway filtering. IEEE Trans. Geosci. Remote Sens. 2008, 8, 2407–2417. [Google Scholar] [CrossRef]
- Aiazzi, B.; Alparone, L.; Barducci, A.; Baronti, S.; Marcoinni, P.; Pippi, I.; Selva, M. Noise modelling and estimation of hyperspectral data from airborne imaging spectrometers. Ann. Geophys. 2006, 1, 1–9. [Google Scholar]
- Gao, L.R.; Zhang, B.; Zhang, X.; Zhang, W.-J.; Tong, Q.-X. A new operational method for estimating noise in hyperspectral images. IEEE Geosci. Remote Sens. Lett. 2008, 1, 83–87. [Google Scholar] [CrossRef]
- Aiazzi, B.; Alparone, L.; Baronti, S. A robust method for parameter estimation of signal-dependent noise models in digital images. In Proceedings of the IEEE International Conference on Digital Signal Processin, Santorini, Greece, 2–4 July 1997; pp. 601–604. [Google Scholar]
- Aiazzi, B.; Alparone, L.; Baronti, S.; Garzelli, A. Coherence estimation from multilook incoherent sar imagery. IEEE Trans. Geosci. Remote Sens. 2003, 11, 2531–2539. [Google Scholar] [CrossRef]
- Argenti, F.; Torricelli, G.; Alparone, L. Mmse filtering of generalised signal-dependent noise in spatial and shift-invariant wavelet domains. Signal Process. 2006, 8, 2056–2066. [Google Scholar] [CrossRef]
- Foi, A.; Trimeche, M.; Katkovnik, V.; Egiazarian, K. Practical poissonian-Gaussian noise modeling and fitting for single-image raw-data. IEEE Trans. Image Process. 2008, 10, 1737–1754. [Google Scholar] [CrossRef] [PubMed]
- Cheng, G.; Li, Z.; Han, J.; Yao, X.; Guo, L. Exploring Hierarchical Convolutional Features for Hyperspectral Image Classification. IEEE Trans. Geosci. Remote Sens. 2018, 99, 1–11. [Google Scholar] [CrossRef]
- Lin, T.; Bourennane, S. Hyperspectral image processing by jointly filtering wavelet component tensor. IEEE Trans. Geosci. Remote Sens. 2013, 6, 3529–3541. [Google Scholar] [CrossRef]
- Lin, T.; Bourennane, S. Hyperspectral image denoising with rare signal preserving by jointly filtering image component. In Proceedings of the 2013 4th European Workshop on Visual Information Processing (EUVIP), Paris, France, 10–12 June 2013; pp. 265–269. [Google Scholar]
- Faraji, H.; MacLean, W.J. CCD noise removal in digital images. IEEE Trans. Image Process. 2006, 9, 2676–2685. [Google Scholar] [CrossRef]
- Aiazzi, B.; Alparone, L.; Baronti, S.; Butera, F.; Chiarantini, L.; Selva, M. Benefits of signal-dependent noise reduction for spectral analysis of data from advanced imaging spectrometers. In Proceedings of the Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing (WHISPERS), Lisbon, Portugal, 6–9 June 2011; pp. 1–4. [Google Scholar]
- Othman, H.; Qian, S.-E. Noise reduction of hyperspectral imagery using hybrid spatial-spectral derivative-domain wavelet shrinkage. IEEE Trans. Geosci. Remote Sens. 2006, 2, 397–408. [Google Scholar] [CrossRef]
- Chen, G.; Qian, S.-E. Denoising of hyperspectral imagery using principal component analysis and wavelet shrinkage. IEEE Trans. Geosci. Remote Sens. 2011, 3, 973–980. [Google Scholar] [CrossRef]
- Kolda, T.G.; Bader, B.W. Tensor decompositions and applications. SIAM Rev. 2009, 3, 455–500. [Google Scholar] [CrossRef]
- De Lathauwer, L.; De Moor, B.; Vandewalle, J. A multilinear singular value decomposition. SIAM J. Matrix Anal. Appl. 2000, 4, 1253–1278. [Google Scholar] [CrossRef]
- De Lathauwer, L.; De Moor, B.; Vandewalle, J. On the best rank-1 and rank-(r1, r2, ..., rn) approximation of higher-order tensors. SIAM J. Matrix Anal. Appl. 2000, 4, 1324–1342. [Google Scholar] [CrossRef]
- Cichocki, A.; Zdunek, R.; Phan, A.; Amari, S. Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-Way Data Analysis and Blind Source Separation; Wiley: Hoboken, NJ, USA, 2009. [Google Scholar]
- Mallat, S. A theory for multiresolution signal decomposition: The wavelet representation. IEEE Trans. Pattern Anal. Mach. Intell. 1989, 7, 674–693. [Google Scholar] [CrossRef]
- Muti, D.; Bourennane, S. Multidimensional filtering based on a tensor approach. Signal Process. 2005, 12, 2338–2353. [Google Scholar] [CrossRef]
- Donoho, D.; Johnstone, I. Ideal denoising in an orthonormal basis chosen from a library of bases. Comptes Rendus de l’Academie des Sciences-Serie I-Mathematique 1994, 12, 1317–1322. [Google Scholar]
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).