Multilook SAR Image Segmentation with an Unknown Number of Clusters Using a Gamma Mixture Model and Hierarchical Clustering

This paper presents a novel multilook SAR image segmentation algorithm with an unknown number of clusters. Firstly, the marginal probability distribution for a given SAR image is defined by a Gamma mixture model (GaMM), in which the number of components corresponds to the number of homogeneous regions needed to segment and the spatial relationship among neighboring pixels is characterized by a Markov Random Field (MRF) defined by the weighting coefficients of components in GaMM. During the algorithm iteration procedure, the number of clusters is gradually reduced by merging two components until they are equal to one. For each fixed number of clusters, the parameters of GaMM are estimated and the optimal segmentation result corresponding to the number is obtained by maximizing the marginal probability. Finally, the number of clusters with minimum global energy defined as the negative logarithm of marginal probability is indicated as the expected number of clusters with the homogeneous regions needed to be segmented, and the corresponding segmentation result is considered as the final optimal one. The experimental results from the proposed and comparing algorithms for simulated and real multilook SAR images show that the proposed algorithm can find the real number of clusters and obtain more accurate segmentation results simultaneously.


Introduction
Synthetic Aperture Radar (SAR) is widely used in civil and military fields because of its characteristics such as high resolution, strong penetration, full time and full weather [1]. However, its specific imaging mechanism leads to speckle noises in SAR images. Although the multilook technique can partly reduce the type of noises, there are still obvious noises remaining in SAR images [2]. Image segmentation, as a crucial step for image analysis and interpretation, involves separating pixels into clusters, so that the pixels in the same cluster are homogeneous and the pixels in different clusters are heterogenous [3]. The main tasks for image segmentation are to determine a suitable number of clusters and obtain an accurate segmentation result [4]. Due to the inherent speckle noise existing in SAR images, these tasks are difficult to fulfill. Therefore, the objective of the proposed algorithm is to obtain the real number of clusters and the accurate segmentation result simultaneously for speckled SAR images. In this paper, the real number is regarded as the number selected by human visual judgment.
Many approaches have been studied to determine the number of clusters, such as clustering methods [5][6][7][8][9] and statistic methods [10][11][12][13][14]. The most popular clustering method is Iterative Self-Organizing Data Analysis Technique Algorithm (ISODATA) [5], in which the number of clusters is changed by alternately splitting and merging clusters. However, it requires a lot of parameters given by humans, and the K-means dominating optimization in ISODATA algorithm is not suitable for segmenting SAR images because of the statistical characteristics of the speckle noises. In order to reduce the human influence, another clustering method, the hierarchical clustering included agglomerative procedure (bottom-up) and divisive procedure (top-down) [7], was considered. The agglomerative produce is the most commonly used one and the representative algorithm is Agglomerative Nesting (AGNES) [8], which considers each element as a cluster and the two clusters which are closest to each other with minimum distance are merged at each step. This procedure continues until only one cluster exists. In order to overcome the sensitivity of noises to Euclidean distance, Legendre et al. [9] used the average distance to measure the closeness between two clusters, but the smaller cluster is easily ignored. Generally, statistical methods are considered to be a more appropriate strategy in SAR image segmentation [10,11]. Green [12] proposed the Reversible Jump Markov Chain Monte Carlo (RJMCMC) algorithm which has been successfully applied to image segmentation with an unknown number of clusters. Based on the work of Green, Kato [13] combined a mixture model and RJMCMC to complete the segmentation. It regards the number of clusters as a random variable contained in model parameters. The number jumps by sampling operation such as birth or death class, and the optimal number is obtained by maximizing or minimizing the objective function. However, the statistical methods for image segmentation with unknown number of clusters are too cumbersome because the dimension of parameters always changes along with the changing of the number of clusters, and it easily falls into the local optimum. To sum up, hierarchical clustering is convenient for obtaining the number of clusters, and the statistical model can describe images well [14], so hierarchical clustering combining a statistical model is regarded as a more appropriate approach in SAR image segmentation in this paper.
There are many approaches of hierarchical theory with statistical models. Mignotte et al. [15] proposed an unsupervised segmentation method for sonar images based on a hierarchical Markov Random Field (MRF) model; it defines a causal energy function with a Gibbs distribution form. Although it can avoid the ignoring of clusters with a few pixels, it cannot be applied to SAR image segmentation in the absence of the capability to manage the statistical property of the speckle noises. Upgrading the MRF model, Wang et al. [16] proposed a new unsupervised method based on a hierarchical Triplet Markov Field (TMF) model with Gaussian distribution to segment SAR images. It combines the advantages of both hierarchical and TMF in the discrete wavelet domain to deal with the problem of insufficient local statistical information. However, the unimodal characteristic model is not as flexible as a mixture model [17]. Garcia et al. [18] proposed a hierarchical Gaussian Mixture Model (GMM) algorithm, which is able to automatically learn the optimal number of components for the simplified GMM and successfully segment natural images, but the GMM cannot describe the data with non-Gaussian statistical character [19], such as SAR images. Oliver et al. [20] has proved that the intensities of pixels satisfy a Gamma distribution according to the speckle noise existing in SAR images, Dong et al. [21] fully validated that the Gamma distribution with MRF is more suitable for SAR image segmentation than a Gaussian distribution with MRF, so the hierarchical clustering based on a Gamma mixture model (GaMM) is proposed for SAR image segmentation.
In this paper, an over-segmentation result with the number of clusters which is much larger than the real number is given as the initial status. The number of clusters is reduced gradually by a cluster merging operation until the optimal number of clusters is obtained. Firstly, the original image is divided into many subsets in gray level by a span parameter. Each subset is regarded as a cluster, and the two subsets with minimum global energy are merged until the number of clusters is equal to one. In addition, the segmentation model with current number of clusters is established by marginal probability distribution based on the GaMM. The negative logarithm of marginal probability is described as the global energy function, and the smaller the value is, the better the segmentation is. Furthermore, in order to consider the influence of neighboring pixels, the prior probability is defined by the conditional probability of the membership based on MRF. Then, the segmentation results with the current number of clusters can be obtained by posterior probability according to the Bayesian theory [22], the number of clusters reduces by one after each iteration, and a segmentation result and a global energy are obtained under the current number of clusters. Finally, the optimal number of clusters is the one corresponding to the minimum global energy, and the result corresponding to the optimal number is the final optimal segmentation result.

Description of the Proposed Algorithm
Given a multilook SAR image x = {x i , i = 1, ..., n}, where i is the index of pixels, n is the number of pixels, x i is the intensity of pixel i. Assume that the image has m clusters, in order to represent the image segmentation, membership matrix z = [z ij ] n×m is used, where j is the index of clusters, z ij ∈ {0, 1}, m ∑ j=1 z ij = 1, z ij = 1 indicates the ith pixel belongs to jth cluster. Furthermore, z can be re-expressed as z = (z i ·, i = 1, . . . , n) T = (z· j , j = 1, ..., m), where z i · = (z ij , j = 1, ..., m) and z· j = (z ij , i = 1, ..., n) T , where z i · and z· j are row and column vectors of membership matrix z, respectively. In the image segmentation, x is observable data, z can be considered as unobservable data, {x, z} is the complete data set. The image segmentation is the process of estimating unobservable data z given x.

Segmentation Process
Assume that the conditional distribution of ith pixel belonging to jth cluster is Gamma distribution and its conditional probability density function (pdf) is: where Ga(·) is the pdf of Gamma distribution, θ j = (α j , β j ) is distribution parameter vector of jth cluster, α j and β j are shape and scale parameters of Gamma distribution, respectively. According to the Bayesian theory, the joint pdf of x i and z ij is: The marginal probability of x i is: where θ = {θ j , j = 1, . . . , m}.
Assume that the marginal distribution of intensity in the SAR image z is independent of each other, then the marginal probability of image x is: The prior probability for z ij = 1 is denoted p(z ij = 1) = π ij , where π ij ∈ [0, 1] and satisfies m ∑ j=1 π ij = 1.
In order to consider the spatial interaction from neighborhood pixels, the prior probability is defined by a conditional distribution: where N i is the neighborhood pixels set of ith pixel, η is the coefficient indicates the influence of intensities of neighbor pixels, the higher the coefficient η is, the stronger the neighbor influence is. Combining Equations (4) and (5), the marginal probability can be reformed as: As described in Equation (6), the marginal distribution of observable data x is GaMM, and the prior probability π ij is the model weight. The number of clusters is equal to the number of GaMM components.
Equation (6) is regarded as the segmentation model expressed by GaMM. The higher the marginal probability is, the better the segmentation result is. There are many criteria to determine the optimal segmentation. Among them, the maximum posterior probability is one of the most commonly used ones. According to the Bayesian theory, the posterior probability of ith pixel belonging to jth cluster is: Finally, the optimal segmentation corresponds to the resolution of the maximum posterior probability and can be expressed as: From Equation (8) one can see that the segmentation is the problem of solving the model parameters, θ j = {α j , β j }. For multilook SAR images, the shape parameter α j is usually equal to the number of looks. For scale parameter β j , it is difficult to solve by Equation (6), thus, the global energy function L defined as the negative logarithm of marginal probability of x is used instead of Equation (6) to evaluate the quality of results. The smaller the function value is, the bigger the marginal probability is, and the better the segmentation result is: Because the scale parameter β j is differentiable function of global energy L, the derivative of L for β j is: Let Equation (10) be equal to 0, its solution is equivalent to the solution of Equation (11): Sensors 2017, 17, 1114 It is difficult to solve Equation (11), because β j exists in both of the first and second term. In order to obtain the best approximate solution, let β j in the first term equal to the value of the last iteration expressed as t, Equation (11) can be rewritten as: Then the scale parameter β (t+1) j can be solved as: The optimization process of the segmentation results under fixed m will be stopped when t reaches the maximum value of T. According to our experiments, it is found that taking T = 20 is suitable for all testing images, so in the proposed algorithm, set T = 20 as the iterative stopping condition.
In summary, if the initial segmentation z ij (0) under the current number of clusters m is known, the segmentation process regarded as an inner loop under fixed number of clusters is described as follows: 1.
Calculating the scale parameter β j (t) by Equation (13), and calculating conditional probability by Calculating the posterior probability p(z ij = 1|x i ; θ j ) (t) by Equation (7);

Repeating
Step 1-3 until t is equal to T; 5.
The optimal segmentation result z ij (T) under fixed number of clusters m is obtained by Equation (8).

Hierarchical Clustering Process
In the above Subsection, the optimal segmentation can be obtained under a fixed number of clusters. In order to obtain the optimal result with a real number of clusters, the number of clusters m is viewed as a variable in the proposed hierarchical clustering algorithm. The number m gradually decreases from an initial number which is much greater than the real number of clusters to one by a cluster merging operation. Thus, one can get the optimal results under each number of clusters by inner loop. At last, the optimal segmentation is determined from the series of optimal results by minimizing global energy defined by Equation (9).
After merging, z (τ+1, T) can be obtained by inner loop and the global energy function value L (τ+1) is recorded by new parameters π ij (τ+1, T) , β j (τ+1, T) at each iteration, then the real number of clustersm can be estimated by: The final optimal segmentation is:ẑ In summary, an initial segmentation must be given before the process of both segmentation and hierarchical clustering. The initial segmentation z (τ=0, t=0) is given by dividing the image x into many sub-classes using span parameter d. Then the ith pixel in jth cluster can be expressed as: where · expresses for rounding up to an integer. The jth cluster will be deleted if there are no pixels in it until all clusters are assigned a number of pixels. Then, m (0, 0) is equal to the column number of z (0, 0) , and the initial scale parameter β j (τ=0, t=0) is calculated by: where n j is the number of pixels in jth cluster.

General Procedure of the Proposed Algorithm
The proposed algorithm uses the outer loop with hierarchical clustering to merge clusters in order to find the optimal number of clusters and the inner loop with the GaMM as the segmentation model to obtain the optimal segmentation under current number of clusters. The outer loop stops until the class number is reduced to one. The optimal segmentation under the estimated real number of clusters can be obtained by minimizing the global energy defined in Equation (9).
The process of the proposed algorithm can be summarized as follows: 1.
Initialization. The iterative indicator of inner loop t = 0 and outer loop τ = 0, the maximum number of inner loop T = 20 according to the experimental experience, shape parameter α which is equal to the looks of the SAR images, intensity of neighbor influence η within the range of 0~1; 2.
The real number of clustersm and final optimal segmentation resultẑ ij is found by Equations (16) and (17).

Experimental Results and Discussion
The proposed algorithm is tested with a simulated SAR image and four real SAR images. In addition, the comparing algorithms, including ISODATA [5], improved AGNES [8], HMRF-FCM [6] and Gamma-MRF [21] algorithms are also used to evaluate the proposed algorithm.

Simulated SAR Image Segmentation
Figure 1a is a template with size of 128 × 128 pixels for generating a simulated SAR image which covers four homogeneous regions labeled I-IV, respectively. Figure 1b is the simulated image, in which the intensities of pixels in each homogeneous region are drawn from a Gamma distribution with the shape and scale parameters listed in Table 1. Equations (16) and (17).

Experimental Results and Discussion
The proposed algorithm is tested with a simulated SAR image and four real SAR images. In addition, the comparing algorithms, including ISODATA [5], improved AGNES [8], HMRF-FCM [6] and Gamma-MRF [21] algorithms are also used to evaluate the proposed algorithm.

Simulated SAR Image Segmentation
Figure 1a is a template with size of 128 × 128 pixels for generating a simulated SAR image which covers four homogeneous regions labeled I-IV, respectively. Figure 1b is the simulated image, in which the intensities of pixels in each homogeneous region are drawn from a Gamma distribution with the shape and scale parameters listed in Table 1.  In order to generate initial segmentation of image shown in Figure 1b, we set span parameter d = 30. Then the initial number of clusters m is ⌈255/30⌉ which is equal to 9. In this experiment, α = 4 as the looks of SAR image. The coefficient η, which expresses the influence of neighbor pixels, is always taken from 0 to 1. If η is too small, the results will exist many misclassification pixels although the number of clusters is correct, contrarily, it will be over segmented. According to experiments, η = 0.8 which is suitable for the simulated images. Figure 2a is the initial segmentation with m = 9.  Figure 3. Figure 3a,b are the full view and local amplification image. It can be seen that the minimum global energy function value is 8.15465 × 10 4 corresponding to the iteration 6, and the corresponding number of clusters is 4. Thus, the final optimal segmentation result is the one shown in Figure 2f.  In order to generate initial segmentation of image shown in Figure 1b, we set span parameter d = 30. Then the initial number of clusters m is 255/30 which is equal to 9. In this experiment, α = 4 as the looks of SAR image. The coefficient η, which expresses the influence of neighbor pixels, is always taken from 0 to 1. If η is too small, the results will exist many misclassification pixels although the number of clusters is correct, contrarily, it will be over segmented. According to experiments, η = 0.8 which is suitable for the simulated images. Figure 2a is the initial segmentation with m = 9. Figure 2b Figure 3. Figure 3a,b are the full view and local amplification image. It can be seen that the minimum global energy function value is 8.15465 × 10 4 corresponding to the iteration 6, and the corresponding number of clusters is 4. Thus, the final optimal segmentation result is the one shown in Figure 2f.   The final optimal segmentation results by the proposed algorithm are displayed in Figure 4. Figure 4a is the final optimal segmentation, Figure 4b is the outline of the homogeneous regions, and Figure 4c shows the result of outlines overlaid on the original image.  The final optimal segmentation results by the proposed algorithm are displayed in Figure 4. Figure 4a is the final optimal segmentation, Figure 4b is the outline of the homogeneous regions, and Figure 4c shows the result of outlines overlaid on the original image. The final optimal segmentation results by the proposed algorithm are displayed in Figure 4. Figure 4a is the final optimal segmentation, Figure 4b is the outline of the homogeneous regions, and Figure 4c shows the result of outlines overlaid on the original image.  The final optimal segmentation results by the proposed algorithm are displayed in Figure 4. Figure 4a is the final optimal segmentation, Figure 4b is the outline of the homogeneous regions, and Figure 4c shows the result of outlines overlaid on the original image.   One can see from Figure 4 that the proposed algorithm can segment each homogeneous region well. There are almost no misclassified pixels, and the boundary is smooth and accurate. For qualitative analysis of the proposed algorithm, the segmentation results of the comparing algorithms are displayed in Figure 5, where Figure 5a-d are the segmentation results of ISODATA, AGNES, HMRF-FCM and Gamma-MRF, respectively. It shows that the ISODATA and AGNES algorithms can find the right number of clusters but ISODATA need more prior condition to limit the range of results and the result is pretty bad. Although AGNES avoids the help of prior knowledge, its segmentation effect is equivalent to ISODATA, there are many misclassified pixels in Figure 5a,b. The HMRF-FCM algorithm can reduce the influence of noise by introducing MRF to model the relation between neighbor pixels. It still cannot distinguish the homogeneous region well. For the Gamma-MRF algorithm, the Gamma distribution combining MRF theory can accurately describe the distribution of pixel intensities, but there are also many pixel errors. From the visual viewpoint, the proposed algorithm is much better than the compared ones. One can see from Figure 4 that the proposed algorithm can segment each homogeneous region well. There are almost no misclassified pixels, and the boundary is smooth and accurate. For qualitative analysis of the proposed algorithm, the segmentation results of the comparing algorithms are displayed in Figure 5, where Figure 5a-d are the segmentation results of ISODATA, AGNES, HMRF-FCM and Gamma-MRF, respectively. It shows that the ISODATA and AGNES algorithms can find the right number of clusters but ISODATA need more prior condition to limit the range of results and the result is pretty bad. Although AGNES avoids the help of prior knowledge, its segmentation effect is equivalent to ISODATA, there are many misclassified pixels in Figure 5a,b. The HMRF-FCM algorithm can reduce the influence of noise by introducing MRF to model the relation between neighbor pixels. It still cannot distinguish the homogeneous region well. For the Gamma-MRF algorithm, the Gamma distribution combining MRF theory can accurately describe the distribution of pixel intensities, but there are also many pixel errors. From the visual viewpoint, the proposed algorithm is much better than the compared ones. In order to demonstrate the superiority of GaMM, the histogram for simulated image and the estimated curve with Gaussian distribution, Gamma distribution and GaMM, respectively, are shown in Figure 6. From Figure 6, one can see that the GaMM with different weights can fit the histogram of the image more accurately. In order to demonstrate the superiority of GaMM, the histogram for simulated image and the estimated curve with Gaussian distribution, Gamma distribution and GaMM, respectively, are shown in Figure 6. From Figure 6, one can see that the GaMM with different weights can fit the histogram of the image more accurately.
To evaluate the proposed and comparing algorithms in quantitative, the main parameters, running time, accuracy and Kappa coefficient are shown in Table 2. For the proposed algorithm, the running time is 1139.19 s. The user's accuracy and producer's accuracy of each homogeneous region are more than 98%, and the overall accuracy and kappa coefficient are 99.34% and 0.99, respectively. For the comparing algorithms, the producer's accuracy of region III is only 36.16% and 40.28 % in ISODATA and AGNES, and the overall accuracy and kappa coefficient are about 60% and 0.5, respectively. HMRF-FCM and Gamma-MRF are better than the two algorithms discussed above, but the accuracy is still lower than that for the proposed algorithm. Generally, the running time of statistical model is usually much higher than clustering methods, but compared with Gamma-MRF, the proposed algorithm greatly reduces the time. Thus, Table 2 precisely describes the effectiveness and robustness of the proposed algorithm.
(a) (b) (c) (d) In order to demonstrate the superiority of GaMM, the histogram for simulated image and the estimated curve with Gaussian distribution, Gamma distribution and GaMM, respectively, are shown in Figure 6. From Figure 6, one can see that the GaMM with different weights can fit the histogram of the image more accurately.  Table 2. For the proposed algorithm, the  Furthermore, the mean and variance listed in Table 3 are also used to evaluate the quality of the proposed algorithm. One can see that the deviation rates between the estimated and actual values for all homogeneous regions are very small. Table 3 illustrates the accuracy of the algorithm. In order to analyze the robustness of the method and the segmentation accuracy affected by different span parameter d and neighborhood coefficient η, the relationship between the number of clusters, overall accuracy, running time and span parameter d, neighborhood coefficient η are given in Table 4. When the neighborhood coefficient is small, the number of clusters obtained are less than the real one. On the contrary, it is larger than the real one. In this part, we only consider the overall accuracy under the correct number of clusters. We can see that the accuracy of d = 30 is almost higher than others under the same neighborhood coefficient. In Table 4, when d = 30, η = 0.8, the overall accuracy is the highest. In order to prove that T = 20 is the optimal result, the change of scale parameter β with iteration is shown in Figure 7. From Figure 7 one can see that scale parameter β for each cluster converges to its stable value after 20 iterations. Consequently, 20 is regard as T's optimal choice, because it not only ensures the accuracy of segmentation but also saves time. Furthermore, the sensitivity of the value of T is also analyzed through the segmentation accuracy with different T under d = 30, η = 0.8, the results are listed in Table 5. If the value of T is too small, the segmentation result is inaccurate, if the value of T is too large, the running time will be increased. its stable value after 20 iterations. Consequently, 20 is regard as T's optimal choice, because it not only ensures the accuracy of segmentation but also saves time. Furthermore, the sensitivity of the value of T is also analyzed through the segmentation accuracy with different T under d = 30, η = 0.8, the results are listed in Table 5. If the value of T is too small, the segmentation result is inaccurate, if the value of T is too large, the running time will be increased.   Figure 8 shows the real SAR images with size of 128 × 128 pixel and the real number of clusters are 2, 3, 3 and 4, respectively. Figure 8a is a RADARSAT-I intensity image with HH polarization and the spatial resolution of 30 m, Figure 8b-c are RADARSAT-II images with VV polarization and the spatial resolution of 25 m, Figure 8d is a RADARSAT-I image with HH polarization and the spatial resolution of 25 m. All of them are sea ice images in different periods, from light to dark, representing sea ice with density from high to low. The image in Figure 8a with two clusters is easy  Figure 8 shows the real SAR images with size of 128 × 128 pixel and the real number of clusters are 2, 3, 3 and 4, respectively. Figure 8a is a RADARSAT-I intensity image with HH polarization and the spatial resolution of 30 m, Figure 8b,c are RADARSAT-II images with VV polarization and the spatial resolution of 25 m, Figure 8d is a RADARSAT-I image with HH polarization and the spatial resolution of 25 m. All of them are sea ice images in different periods, from light to dark, representing sea ice with density from high to low. The image in Figure 8a with two clusters is easy to segment for all algorithms. For the images in Figure 8b,c with three clusters, it is difficult to segment the different types of sea ices. For the image in Figure 8d with four clusters, the difficulty is that the water is easy to ignore because its area is too small compared with the background of low density sea ice areas. to segment for all algorithms. For the images in Figure 8b,c with three clusters, it is difficult to segment the different types of sea ices. For the image in Figure 8d with four clusters, the difficulty is that the water is easy to ignore because its area is too small compared with the background of low density sea ice areas.  Figure 9 is the segmentation results with the proposed algorithm and the comparing algorithms of ISODATA, AGNES, HMRF-FCM and Gamma-MRF. Figure 9(a1-d1) are the results of the ISODATA algorithm, Figure 9(a2-d2) are the results of the AGNES algorithm, Figure 9(a3-d3) are the results of the HMRF-FCM algorithm, Figure 9(a4-d4) are the results of the Gamma-MRF algorithm and Figure 9(a5-d5) are the results of the proposed algorithm.

Real SAR Images Segmentation
It can be seen that the ISODATA algorithm cannot always determine the right number of clusters, and as shown as Figure 9(c1), the real number of the original image is 3 while the estimated number is 2, which is a fatal error for image segmentation.
Although the AGNES algorithm can find the right number of clusters, it cannot overcome the noise effect well and ignores the existence of water class, see Figure 9(d2).
Although the HMRF-FCM algorithm combines a fuzzy cluster based on Gaussian distribution and MRF, it also cannot obtain a satisfactory segmentation result.
The Gamma-MRF algorithm adopts the Gamma distribution to describe the data, and its results are not good enough either. Besides, both HMRF-FCM and Gamma-MRF lack a mechanism for changing the number of clusters, which makes the algorithm not flexible enough in applications.   Figure 9(a1-d1) are the results of the ISODATA algorithm, Figure 9(a2-d2) are the results of the AGNES algorithm, Figure 9(a3-d3) are the results of the HMRF-FCM algorithm, Figure 9(a4-d4) are the results of the Gamma-MRF algorithm and Figure 9(a5-d5) are the results of the proposed algorithm.
It can be seen that the ISODATA algorithm cannot always determine the right number of clusters, and as shown as Figure 9(c1), the real number of the original image is 3 while the estimated number is 2, which is a fatal error for image segmentation.
Although the AGNES algorithm can find the right number of clusters, it cannot overcome the noise effect well and ignores the existence of water class, see Figure 9(d2).
Although the HMRF-FCM algorithm combines a fuzzy cluster based on Gaussian distribution and MRF, it also cannot obtain a satisfactory segmentation result.
The Gamma-MRF algorithm adopts the Gamma distribution to describe the data, and its results are not good enough either. Besides, both HMRF-FCM and Gamma-MRF lack a mechanism for changing the number of clusters, which makes the algorithm not flexible enough in applications.
As shown in Figure 9(a5-d5), the segmentation results from the proposed algorithm are all the optimal segmentation results under a real number of clusters. One can see that the sea ice images during different periods are evidently distinguished all the time by the proposed algorithm, and there are almost no mis-segmentations in the homogeneous regions.        The values of estimated parameters and running time for real images segmentation results are shown in Table 6. Combining with the results in Figure 9 and the running time in Table 6, it can be seen that the running time of the proposed algorithm has a high cost performance.  The values of estimated parameters and running time for real images segmentation results are shown in Table 6. Combining with the results in Figure 9 and the running time in Table 6, it can be seen that the running time of the proposed algorithm has a high cost performance. In addition, segmentation experiments for SAR images covering urban and agricultural areas were also carried out. Figure 11(a1,b1) show the HH RADARSAT-I/II intensity images for an urban area with two looks and an agricultural area with three looks, respectively. The scene shown in Figure 11(a1) contains three objects, including water, streets and buildings, and the scene in Figure 11(b1) contains water and two kinds of farmland. Figure 11(a2,b2) are the corresponding segmentation results from the proposed algorithm with d = 20, η = 0.3, α = 2 and d = 20, η = 0.35, α = 3, respectively. One can see that the proposed algorithm can separate the street from water on the basis of segmenting buildings and effectively distinguish different kinds of farmland. The water and streets shown in Figure 11(a2) are not clearly displayed, just because their grayscales are similar. Figure 11 illustrates that the proposed algorithm can segment not only simple SAR images well, but also complex images with different noise levels. shown in Figure 11(a2) are not clearly displayed, just because their grayscales are similar. Figure 11 illustrates that the proposed algorithm can segment not only simple SAR images well, but also complex images with different noise levels.

Conclusions
In this paper, a novel hierarchical clustering segmentation algorithm for multilook SAR image segmentation is proposed. The GaMM can describe the statistical characteristics of the image more accurately than unimodal character model. The hierarchical clustering with global energy merging criterion can determine the real number of clusters and obtain the accurate segmentation results

Conclusions
In this paper, a novel hierarchical clustering segmentation algorithm for multilook SAR image segmentation is proposed. The GaMM can describe the statistical characteristics of the image more accurately than unimodal character model. The hierarchical clustering with global energy merging criterion can determine the real number of clusters and obtain the accurate segmentation results simultaneously. The qualitative and quantitative analyses of the simulated and real images with the proposed and comparing algorithms fully demonstrate the superiority of the proposed algorithm in multilook SAR image segmentation. Some improvements of this research are still needed. On the one hand, the coarse segmentation method could be replaced with some more efficient approaches, such as, Fuzzy C-means (FCM). On the other hand, the image segmentation model can be improved by considering texture features.