1. Introduction
It is well known that polarimetric synthetic aperture radar (PolSAR) images are inherently affected by speckle noise, which is due to the coherent interference of radar signals reflected from many tiny scatterers in a resolution unit. Speckle noise has long been recognized as one of the most crucial problems of SAR data, and its presence degrades the appearance of images and has a great impact on the performance of land-use classification and scene analysis tasks [
1].
One of the earliest methods for SAR image despeckling was presented by Novak and Burl [
2], in which the polarimetric whitening filter (PWF) was utilized to produce a single speckle-reduced intensity image by optimally combining all the elements of the polarimetric covariance matrix. Motivated by this method, Lopes and Sery [
3] and Liu et al. [
4] introduced a PWF for multi-look processed data (MPWF). The above filters exploit the statistical correlations between the different polarization channels, leading to the problem that the polarimetric properties are not carefully preserved. An important branch of filtering methods for PolSAR data is based on the linear minimum mean-squared error (LMMSE) estimator. Lee et al. expanded their local LMMSE (LLMMSE) method in [
5] to filter PolSAR data by developing the edge-aligned window technique [
1]. In [
6], a novel method was further proposed to select neighboring pixels based on the same scattering characteristics, and to use the selected pixels to filter the processed pixel based on the LLMMSE filter. More LLMMSE algorithms can also be found in [
7,
8,
9].
In the traditional methods based on the LLMMSE filter, to obtain more precise filter parameters in the LMMSE estimator, a group of homogeneous image pixels are first selected in a local window. The LMMSE estimator is then generated from the values of the selected pixels, and is saved as the filtered value of the pixel being processed. These methods assume that all of the selected pixels are highly homogeneous pixels with respect to the processed pixel, which is not reliable enough. Besides, most of these methods have the shortcomings of the limited selection range of pixels, and comparing pixels only by their own characteristics, which may produce a biased or inferior estimation of the filter parameters in the estimator.
In addition to the methods based on LMMSE, some other filtering algorithms, such as partial differential equation (PDE)-based filters [
10,
11], variational-based filters [
12,
13,
14], simultaneous sparse coding (SSC)-based filters [
15], spherically invariant random vector-based filters [
16], and Bayesian theory-based filters [
17] have also been developed in recent years. What must be mentioned is that nonlocal means (NL-means) [
18], which were first used in the processing of optical images, have recently been extended for SAR despeckling and have achieved very positive performances. In [
19,
20], the NL-means were limited in processing single-polarization amplitude images. In [
21], the methodology of NL-means was innovatively applied to PolSAR speckle reduction by Chen et al., and they successfully made use of the full polarimetric information. In [
13], the ideas of total variation (TV) and NL-means were combined to tackle the issue of PolSAR image despeckling. In the NL-means method, the restored value of each pixel is obtained by the weighted average of all the pixels in a large searching window. Each weight is proportional to the equality between the local neighborhood of the pixel being processed and the neighborhood of the other pixels. The basic idea is that images contain repeated structures, and averaging them will reduce the random noise. This provides us with a new thought to solve the problems that the conventional LLMMSE filters encountered in filtering PolSAR images: it is better to assign a weight or reliability to each pixel in the LMMSE estimator based on its equality of the full polarimetric information with the processed pixel, rather than regard it as an absolutely homogeneous pixel. By doing so, the selection window of pixels can also be expanded, which can ensure that enough similar samples are included in the filter; in addition, the weight can be obtained by comparing the equality of the pixels’ neighborhoods, rather than only comparing the value in a single point.
Based on the above viewpoints, a novel nonlocal weighted linear minimum mean-squared error (NWLMMSE) filter is investigated in this present research, aiming at solving the problems of the conventional PolSAR LLMMSE filters and taking advantage of the theory of NL-means. First, an effective preliminary step is taken to preclassify the pixels. Then, strong point targets detected in the above step are kept unfiltered; for distributed target pixels, only the similar pixels in the searching window are selected as samples, and the idea of NL-means is employed to calculate the weights of the selected pixels. Finally, the weighted filter parameters evaluated by these pixels are applied to the LMMSE estimator, and the NWLMMSE filter is derived to filter the processed pixel.
The remainder of this paper is organized as follows. First, some basic concepts of the local linear minimum mean-squared error filter and NL-means are introduced in
Section 2, and the proposed NWLMMSE method is described in
Section 3. Then, in
Section 4, the experimental results and analyses of a simulated and two real-world PolSAR images are given. Finally, the conclusions are drawn in
Section 5.
3. The Proposed Nonlocal Weighted LMMSE Filter
The basic idea of our proposed nonlocal weighted LMMSE filter is that, to obtain a precise estimation of the filter parameters of the LMMSE estimator, enough statistical samples should be selected without considering whether they are absolutely homogeneous pixels or not (i.e., loosening the constraint of selecting samples), and a weight or reliability should be assigned to each sample. This can be described as the following statistical model: suppose we have
samples. Their covariance matrices are
, respectively, and their weights are
, respectively. The span values are then, respectively,
. Assume that the estimated value is a linear combination of a priori mean
and
, and following the process of the LMMSE estimator, the filtered covariance matrix of the processed pixel is estimated by the NWLMMSE filter as
where
is the weighted mean value of the samples’ covariance matrices:
The span image is used in the computation of the weight
:
where the mean value
and variance
are also obtained by considering the weight of each sample as the following Equation:
In NL-means, the estimated value is considered as the weighted mean value of samples: ; in the NWLMMSE filter, it is assumed to be a linear combination of the weighted mean value and the processed pixel value : . The parameter works as a weight between the weighted local mean value and the original pixel value. We can see that the formulation of NL-means is just a special case of the NWLMMSE estimator when . This indicates that the NWLMMSE filter is adaptive, and not only employs the idea of NL-means, but also takes account of the property of the multiplicative noise.
The main question with regard to the NWLMMSE filter is how to define the weight of each pixel. Inspired by the NL-means proposed in [
21], the weight in our study is obtained by comparing the equality between the corresponding neighborhoods of the sample pixels and the processed pixel, which can yield a more robust comparison.
3.1. Calculation of Weights
In NL-means, the equality of two pixels is measured by the similarity of their neighborhoods. In [
21], the authors employ the likelihood-ratio test of equality between two
laws with the same number of looks under the complex Wishart distribution, as presented by Conradsen et al. [
29], to calculate the equality between image patches.
By letting the independent Hermitian positive definite matrices X and Y be complex Wishart distributed, and considering the null hypothesis H
0:
=
, which states that two matrices are equal, against the alternative hypothesis H
1:
, the likelihood-ratio test statistic can be derived as follows:
where
L is the number of looks. The value of
Q is nonpositive, and equals to 0 when two matrices are the same. For two image patches that each have
pixels, their equality
is regarded as the sum of the equalities of the corresponding pixels within the patch:
When filtering the pixel
, the equality
can be converted into the initial weight assigned to the sample
by
where the parameter
acts as the degree of filtering, and it controls the decay of the exponential function. In general, a low value of parameter
would lower the denoising performance of the NL-means and yield a noisy result. On the other hand, a high value of
would result in a burring denoised image.
Figure 1 shows the profile of Equation (16) for a one-look image under a given
. We assume that, for an unknown single-look image, the equality
between two arbitrary patches randomly distributes between 0 and −100 (for
values of less than −100, the corresponding weights are much closer to zero, so they can be neglected). In NL-means, homogeneous pixels should occupy high weights when filtering a certain pixel. Therefore, a reasonable principle for choosing
is to ensure that the sum of the weights of the homogeneous pixels has a high proportion, such as 90% in this paper (black area of
Figure 1). Based on the above assumption, the steps of estimating parameter
in this paper are summarized as follows: first, select the homogeneous areas in a single-look image, calculate the
between any pair of pixels, and regard the mean value
of all the
values as the boundary between the homogeneous pixels and heterogeneous pixels; then, choose a value of
to meet the condition that the integral from
(
is the number of pixels in a patch) to 0 is equal to 90% of the integral from −100 to 0. By doing so, we obtain the approximate value of
as
. Since
Q (Equation (13)) is proportional to the number of looks, the above theoretical value can be generalized to
look images as follows:
In this paper, the size of the neighboring patch is set as 3 × 3 pixels, thus
. Suppose we have
samples, then the normalized weight of a certain sample
is computed as follows:
.
3.2. The Preliminary Step of Selecting Samples
Another question for our proposed NWLMMSE filter is how to select the samples. A common approach to NL-means is selecting all the pixels in a large square searching window, which leads to a high computational complexity, and makes it unpractical. Furthermore, strong point targets cannot be well retained by doing so, and the polarimetric scattering mechanism is not considered in the above equality calculation. The basic idea of the step proposed in this part is to preclassify the pixels in the searching window and to reject unrelated pixels according to their characteristics, with the aim of solving the above problems. This is performed as a preliminary step, and only those pixels that have similar characteristics to the processed pixel are selected as samples. This step is much faster than the weight calculation step.
In this study, two characteristics are considered to determine whether a pixel can be selected as a sample: scene heterogeneity (i.e., located in homogeneous scenes or heterogeneous scenes) and the polarimetric scattering mechanism.
To quickly obtain the scene heterogeneity of a pixel, we undertake a simple step to test the textural information of its neighboring patch, which is inspired by [
30]. Three classes of scene heterogeneity are presented: homogeneous class, heterogeneous class, and isolated point target. The definition of the scene heterogeneity is derived from the coefficient of variation (
) which is computed over the pixel’s 3 × 3 neighboring patch from the square roots of span. For a pixel, it is within a homogeneous area if
; it is within a heterogeneous area if
; and it is an isolated point target if
.
and
are the theoretical thresholds provided and validated in [
27], where
is the number of looks.
As mentioned in
Section 2, apart from the statistical property, the polarimetric scattering mechanism should also be considered to measure the similarity of two pixels. To characterize the scattering mechanism of a pixel, we chose the scattering model-based decomposition by Freeman and Durden [
31], which is based on realistic scattering models, for its effectiveness in providing scattering powers for each scattering component. In this polarimetric decomposition method, the pixels are categorized into three basic types: surface scattering dominated pixels, double-bounce scattering dominated pixels, and volume scattering dominated pixels. In this research, two pixels are considered to be similar in the polarimetric scattering mechanism, if they have the same dominant scattering mechanism.
In this study, isolated point targets are detected in the preliminary step and kept unfiltered. For a distributed target pixel being processed, only those pixels which have the same scene heterogeneity and dominant scattering mechanism in a square searching window are selected as the samples in the filter. For pixels located in homogeneous patches, the mean values of their neighboring patches can effectively represent these patches, so the equality of their neighborhoods can be obtained by comparing the equality of their mean values, to ease the computational load.
The main purpose of this preliminary step is to preserve the strong scattering point targets and to consider the similarity of the scattering mechanisms between pixels, and in the meanwhile accelerate the algorithm. In fact, for a pair of pixels in a searching window, the similarity between them is calculated once when filtering each of them, which is repetitive and could be avoided by certain means in practice. A further discussion about the computational complexity of the algorithm is provided in
Section 5. What must be made clear here is that the process of selecting pixels in this paper has a looser constraint than the conventional LLMMSE methods, which ensures that enough similar samples can be included in the filter.
4. Experiment Results of Polarimetric Speckle Filtering
In this section, to illustrate the performance of the NWLMMSE filtering method presented in
Section 3, the results obtained with a simulated PolSAR image and two real-world polarimetric SAR images are reported. The proposed method is compared with the refined Lee filter with a
edge-aligned window, the NL-means method presented in [
21], and the PolSAR nonlocal total variation-based filter (PolSAR NLTV) presented in [
13]. In this paper, the searching windows of both NL-means and the NWLMMSE filter are set to be
pixels, and the size of the neighboring patches is
pixels.
4.1. Simulated Image
The use of simulated PolSAR data allows for an objective assessment of the performance of the proposed filter.
Figure 2a shows the noise-free Pauli RGB image of the simulated data, and
Figure 2b is the noisy generated image. The images not only contain linear and curvilinear edges but also contain some strong point targets. As we can see, the refined Lee filter reduces the speckle to some degree; however, the edges seem to be slightly blurred and the point targets are badly smeared. NL-means and NWLMMSE have comparable performances in both speckle reduction and edge preservation, but most point targets are lost in the image processed by NL-means. The reason for this is that in NL-means, when there are not enough pixels which are similar to the processed point target, the target will be blurred by unrelated pixels. The PolSAR NLTV filter shows better performance in retaining point targets than NL-means, but the boundaries of the objects seem to be slightly over-smoothed. The NWLMMSE filter can effectively preserve these strong point targets, since we have successfully detected them and retained their original values.
To further compare the performance in speckle reduction of each method, two quantitative indexes are introduced: the coefficient of variation (CV) and the equivalent number of looks (ENL). CV and ENL are two indexes that measure the smoothness of an image in the homogeneous areas. A low value of CV or a high value of ENL represents a low degree of speckle. Furthermore, to measure the reconstruction of edges, we also utilize the edge reconstruction error (ERR), as proposed in [
32].
Table 1 lists the CV and the ENL values of each method, and it can be seen that that NL-means, NLTV, and NWLMMSE filters have much better performances in speckle reduction in homogeneous areas. For the ERR, the proposed method achieves the lowest value, since it preserves the details well and has a low error rate in reconstructing edges.
4.2. Real-World PolSAR Images
In this part, two real-world data sets, one from Flevoland in the Netherlands and the other from San Francisco, are used for demonstrating the performances of the proposed filter. Both of them are four-look.
Figure 3 displays the Pauli RGB image of the original data and the speckle filtering results of the different methods. The refined-Lee-filtered result (
Figure 3b) reveals positive filtering characteristics; however, the scalloped appearance due to the edge-aligned windows in this filter may be undesirable for some applications, as pointed out in [
8]. Compared with the refined Lee filter, NL-means and PolSAR NLTV (
Figure 3c,d) show better performances in edge preservation, and the speckle is remarkably reduced in the processed images; however, the images of both filters seem to be somewhat oversmoothed, which leads to the problem that some details are slightly blurred. Furthermore, some isolated point targets are smeared as a kind of bright rectangle in the NL-means filtered image (
Figure 3h). The result of the NWLMMSE filter preserves edges and point targets better, and the speckle is also reduced to a great extent.
An edge detector is often used as a tool to obtain the edge information of an image [
33,
34]. In this part, to qualitatively assess the edge preservation capability of the filters, the PolSAR constant false-alarm rate (CFAR) edge detector is used to detect the edge information of the images, since it is based on a test for equality of the covariance matrices and thus can utilize the full polarimetric information well [
35]. The probability of false alarms for each image is set as −1.0, and the edge detection results are presented in
Figure 4. As we can see, the CFAR detector has successfully detected most edges of the original speckled image in
Figure 4a, but many “false edges” exist in homogeneous areas due to the influence of the speckles. The refined Lee filter (
Figure 4b) has removed these false edges in its suppression of speckle; however, strong point targets and some fine edges are blurred and have not been detected by the detector (marked by the red ellipse). The NL-means and PolSAR NLTV filters (
Figure 4c,d) have better performances than the refined Lee filter in the preservation of edges, but some point targets can still not be detected. The NWLMMSE filter reveals the best performance in edge preservation, and most of the point target signatures have also been retained.
To visualize the speckle reduction efficiency of the filters, we plot the scattergrams of the original and the filtered elements of the covariance matrix (C22 and the phase part of C13) for four land-cover types of the Flevoland image (
Figure 5). The distributions of C22 (i.e., the intensity of the HV polarization) and the phase part of C13 (i.e., the HH and VV phase difference) become more concentrated for the filtered data than for the original data, and these four classes are much more separable in the data filtered by the NL-means, PolSAR NLTV, and NWLMMSE filters.
A good PolSAR speckle filtering method should possess filtering traits that not only reduce speckle and preserve edges, but also preserve the polarimetric scattering mechanisms of the image, because scattering mechanisms have always been considered as a class of important polarimetric information, and they are useful in land-use classification and scene interpretation. However, in the refined Lee filter and NL-means, the polarimetric scattering mechanism has not been taken into account in selecting homogeneous pixels or calculating the equality of two pixels. In our method, the scattering mechanism is chosen as one of the characteristics to select the samples. Two pixels are considered to be similar in the domain of a polarimetric scattering mechanism if they have the same dominant scattering mechanism. To validate the preservation of the scattering mechanism of the new method, we applied Freeman and Durden decomposition to a subset of the San Francisco image filtered by the different methods (
Figure 6). This data set contains several classes of typical land objects that have different scattering mechanisms (for example, water pixels dominated by surface scattering, man-made structure pixels dominated by double-bounce scattering, and tree pixels dominated by volume scattering). The result of the 7 × 7 refined Lee filter (
Figure 6b) shows the problem of a block effect, and some details are overly enhanced. NL-means (
Figure 6c) has a better performance in speckle reduction and edge preservation than the refined Lee filter, as expected. However, both of the above two filters have smeared some strong signals, which returns from double-bounce dominated building pixels in the forest area and from the ships on the water. Visually, PolSAR NLTV performs a little better in retaining point targets than NL-means. For the NL-means and PolSAR NLTV filtered images, the speckle has been greatly reduced in the middle part of the urban area; however, some polarimetric information of the plants in this area has been lost. This is because the polarimetric scattering mechanism has not been taken into account by the two filters, and some piecemeal volume scattering pixels (i.e., plants) are filtered by too many double-bounce scattering pixels (i.e., buildings) in the large searching window. The proposed method reveals good filtering characteristics of preserving the dominant scattering properties, reducing speckle, and retaining the spatial resolution (
Figure 6e).
4.3. Computational Complexity
Although NL-means has shown its outstanding performance in filtering, the high computational complexity still makes this method quite slow in practice. The high computational load is due to the cost of the equality calculation for all the pixels in a large window. Assuming the equality calculation between two pixels to be one step, and that the processed image has the size of M × N pixels, then the entire calculation scale of NL-means is , which is obviously a heavy load. For the PolSAR NLTV filter, the processing time is not only dependent on the cost of the equality calculation between patches, but also dependent on the calculation procedure of minimizing the variational function. Therefore, the PolSAR NLTV filter is generally quite time consuming. In this paper, the processing time of the NWLMMSE filter is also mainly dependent on the cost of the equality calculation. As we take a preliminary step, which is much simpler and faster than the weight calculation, to select similar pixels and reject unrelated pixels in the searching window, our method has been accelerated greatly. In addition, for pixels located in homogeneous patches, the equality between their neighboring patches is obtained by calculating the equality of their mean values, which can speed up the weight calculation of pixels in homogeneous scenes. Furthermore, as we also make an effort to merge some of the repetitive computations, the computational load of our method is further eased.
Table 2 lists the processing times of each method. It can be seen that NL-means and PolSAR NLTV are quite time consuming, and the proposed method outperforms the above two filters to a large degree. In fact, we found that our method was at least twice as fast as NL-means in most cases, and the more complicated the image was, the faster the NWLMMSE filter was when compared to the NL-means. This is because, for images that have a lot of complicated scenes, the preliminary step can effectively reject unrelated pixels and discard the useless weight calculation. However, it should also be pointed out that the processing time of the proposed method is still much longer than the refined Lee filter, which is due to the inherent property of the NL-means-based methods. More research needs be done in our future study to further reduce the processing time.
5. Conclusions
The conventional LLMMSE filters have the shortcomings of the limited selection range of pixels, comparing pixels only by their own characteristics, and the assumption of absolute homogeneity of the pixels. In this paper, a nonlocal weighted LMMSE filter was proposed for the speckle reduction of PolSAR data. For the processed pixel, an effective preliminary step was developed to select similar samples, and the idea of NL-means was employed to calculate the weight of each sample. A weighted linear minimum mean-squared error estimator is was derived to filter the pixels. Experiments were conducted on a single-look simulated image and two four-look AIRSAR images, revealing the good filtering performance of our method in reducing speckle, retaining edges and targets, and preserving the polarimetric scattering mechanism. Moreover, the proposed method has relatively high computational efficiency with regard to most nonlocal filters, which makes it more applicable in practice.
It should be noted that, as for the PolSAR imaging problem in practical application, motion compensation is another important issue. The motion errors will make the imaging results blurred. As pointed out in [
36,
37], speckle is independent of either well-focused stationary scenes or blurred moving targets. In theory, the motion errors can degrade the performance of the proposed despeckling method, which, however, has not been deeply investigated in this study. As far as we know, up to now, very few studies have investigated the issue of suppressing the speckle noise and compensating the moving errors in fully polarimetric SAR images at the same time, which will be a meaningful work in our future study.