Ocean Oil Spill Classification with RADARSAT-2 SAR Based on an Optimized Wavelet Neural Network

Oil spills from ships or oil platforms damage marine and coastal environments and ecosystems. To monitor such spill events from space, fully polarimetric synthetic aperture radar (Pol-SAR) has been widely used to improve oil spill observation. To improve ocean oil spill classification accuracy, we developed a new oil spill identification method that combines multiple fully Pol-SAR features with an optimized wavelet neural network (WNN) classifier. Two sets of RADARSAT-2 fully polarimetric SAR data are used to test the validity of the developed method. The experimental results show that: (1) the optimized WNN converges better and improves the overall classification accuracy of ocean oil spills compared with a common un-optimized WNN classifier; and (2) the joint use of multiple fully Pol-SAR features as classifier inputs achieves better classification results than a single fully Pol-SAR feature. The developed method improves classification accuracy by 4.96% and 7.75% compared with the classification results of the un-optimized WNN and of a single fully Pol-SAR feature, respectively. The overall classification accuracy of the proposed approach reaches 97.67%. The experimental results show that the proposed approach is effective and applicable for classifying ocean oil spills.


Introduction
Oil spills occur often in the world's oceans due to ship or oil platform accidents. They damage coastal environments and marine ecosystems [1][2][3][4]. Satellite remote sensing has been widely used in oil spill observation because of its frequent, large-area coverage and relatively low cost [5][6][7][8][9][10][11]. Among remote sensing sensors, synthetic aperture radar (SAR) can provide valuable synoptic information about the position and size of an oil spill under moderate wind speeds (4-12 m/s), day and night [12]. However, it is challenging to distinguish oil spills from lookalike natural phenomena (biogenic slicks, upwelling, low wind areas, rain cells, shear zones, internal waves, etc.) in SAR images [13]. Considerable effort has been devoted to improving oil spill detection and classification. There is now a general consensus that the extra information provided by polarimetric SAR (Pol-SAR) data enhances the capability to identify and classify oil spills.
Over the past several decades, artificial neural network algorithms have been widely used in remote sensing image classification [27][28][29] because of their good self-organization [30][31][32], self-learning [33,34], and self-adaptive abilities [35]. Among these, the back propagation (BP) neural network was one of the first widely used network models because it is simple and easy to train. Its structure maps sample inputs to outputs through a nonlinear relationship. In general, through iteration with a gradient descent algorithm, the network error gradually decreases to a tolerable range. However, the BP algorithm also has shortcomings. It is a local optimization search method, and the computation often fails to converge or converges to a local minimum. In addition, its slow convergence makes it useful only for small-scale problems. To overcome the deficiencies of the BP network, the wavelet neural network (WNN) was proposed [36]. The WNN takes the topology of the BP neural network as its basis and replaces the nonlinear function of the BP network with a wavelet basis function. It combines the time-frequency domain features of the wavelet transform with the self-learning ability of the BP network. It effectively addresses the problems of the BP neural network and has strong learning and generalization abilities. These advantages have made the WNN a widely applied method in remote sensing image classification [37][38][39]. The initial values of a WNN play a key role in the training process and affect the convergence behavior and classification capability of the network. In this study, we develop a method for setting better initial values for the WNN to improve SAR image classification performance for locating oil slicks.
Remote Sens. 2017, 9, 799
The main objectives of this study are to: (1) demonstrate the capability and superiority of combining multiple fully Pol-SAR features as the inputs of a classifier for ocean oil spill classification; and (2) show the superiority and effectiveness of an optimized WNN classifier in improving the classification performance for ocean oil spills, compared with an un-optimized WNN.
The remainder of this paper is organized as follows: Section 2 describes the RADARSAT-2 Pol-SAR data and the methodology used in this study; Section 3 presents the experimental results. The discussion and the conclusions are given in Sections 4 and 5, respectively.

Remote Sensing Data
We used two sets of RADARSAT-2 fully Pol-SAR data. Tables 2 and 3 present the information on the two RADARSAT-2 scenes, and Figure 1 shows the images of the two datasets used in this study. The product type is SLC (single look complex). Both images were acquired in fine quad-polarization imaging mode in the Gulf of Mexico. This mode provides single-look complex data in the HH, VV, HV, and VH channels with a low noise floor of −35 dB. The spatial resolution in both the azimuth and range directions is about 5 m. The incidence angle ranges from 26.093° to 29.395° for Image 1 and from 43.631° to 44.954° for Image 2. The study area in Dataset 1 is southwest of New Orleans in the northern Gulf of Mexico, near the mouth of the Mississippi River. The Mississippi River Delta is clearly visible in the image.
Table 1 is an excerpt from Reference [26] that summarizes the behavior of fully Pol-SAR features for seawater, lookalikes, and oil spills. According to the table, the pixel values of α and H are high for oil spills, so their image tone should be brighter than that of areas covered by water or lookalikes. In contrast, the values of ν, µ, p and M33 are low for oil spills, so their image tone should be darker than that of areas covered by water or lookalikes. Based on this criterion [26], that is, the behavior of fully Pol-SAR features for oil spills versus water and lookalikes, we determine that the long strip objects in Dataset 1 and Dataset 2 in Figure 2 are oil spills rather than lookalikes.

Introduction of Method Frame for Classifying the Ocean Oil Spills from Water
Figure 3 shows the scheme of the proposed method, which has two main parts. The first part is the selection of fully Pol-SAR features, extracted from RADARSAT-2 SAR data, based on the J-M distance index. The selected features are used as the inputs of an optimized classifier. This process is explained in detail in Section 2.2.2. The second part is the optimized wavelet neural network (WNN) classifier. Its optimization is described in Section 2.2.3. Oil spill classification results are generated by the optimized WNN classifier with the multiple selected fully Pol-SAR features. The classification accuracy is assessed and compared with that obtained with: (1) an un-optimized WNN classifier; and (2) a single fully Pol-SAR feature. The effectiveness of the proposed method is evaluated in Section 4.

The Combination of Multiple Fully Pol-SAR Features for Improving Ocean Oil Spill Identification
Ground object classification in remote sensing imagery is based on differences in feature values. Generally, the feature values of pixels belonging to the same object are similar, whereas different objects have different feature values. Therefore, the pixels of the same object cluster together and cover a certain region, while different objects occupy different regions of the feature space (Figure 4). The more the feature values of the objects differ, the higher the probability that each object is classified accurately. Although the feature values of most pixels of different objects are separated in feature space, the feature values of a few pixels of different objects always mix. As shown in Figure 4, the feature values of most oil and water pixels are separate, but there are still a few cross-mixed regions of pixels, marked by the red border.
Figure 5 shows the probability density distribution of pixel feature values for ocean oil spill and water in a region of interest (ROI) in a one-dimensional feature space. The overlapping area shows the cross-mixing. The X-axis is the feature value, and the Y-axis is the number of pixels with the corresponding feature value. Figure 5a,b shows span and H, respectively, derived from RADARSAT-2 Dataset 1. The probability density peaks of oil spill and water are entirely separated, which illustrates that span and H are effective features. However, the intersection of the curves within the red border shows the cross-mixing of pixel feature values between oil spills and water in a one-dimensional feature space. This indicates that with any single fully Pol-SAR feature, some oil spill and water pixels whose feature values fall in the overlapping area will inevitably be misclassified. Therefore, threshold segmentation based on a single feature value will almost surely cause segmentation errors when identifying oil spills from water. The probability of misclassification is related to the degree of mixing of oil spills and water in the feature space.
However, when more Pol-SAR parameters are added to the classification process, the number of pixels in the mixed cluster decreases once a suitable segmentation surface is found, so these pixels have a better chance of being assigned to the correct object type. This is because a higher-dimensional feature space offers greater separability than a one-dimensional one, according to the knowledge of feature classification [40][41][42]. For example, in sea ice classification with Pol-SAR data [41,42], multiple features are often employed rather than a single feature. Therefore, when classifying ocean oil spills with fully Pol-SAR data, multiple features should be used jointly to enhance classification performance, given that many types of fully Pol-SAR features of oil spills have been proposed by different scholars. For example, Figure 6 shows how the ability to distinguish oil spills from other objects differs among several Pol-SAR features. In Figure 6a, the objects in region "a" are drilling platforms. They have an image tone similar to that of the oil spill, so it is difficult to separate them. However, in Figure 6b,c, it is easy to distinguish the oil spill from the drilling platforms shown in areas "b" and "c", since their image tone is opposite to that of the oil spill. Similarly, the object in areas "d", "e", and "f" is the wake of a boat. In Figure 6a,c, the oil spill and the boat wake have a similar tone, so it is hard to classify them correctly, whereas in Figure 6b the boat wake has a different image tone from the oil spill and the problem is easily solved. As a whole, Figure 6a-c visualizes the behavior of oil spills and water/lookalikes in different fully Pol-SAR features. By analyzing the characteristics of different objects in different fully Pol-SAR feature images, we conclude that the joint use of multiple fully Pol-SAR features can improve the ability to distinguish oil spills from water, from lookalikes such as boat wakes, and from other objects such as drilling platforms. Figure 6 thus supports the need for the combined use of multiple fully Pol-SAR features for oil spill classification.
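The separability argument above can be illustrated with a small numerical sketch. It does not use the RADARSAT-2 data: the two classes are synthetic Gaussian clusters, and the class means, the sample size, and the helper name `threshold_error` are all assumptions made for illustration. The sketch compares the error of thresholding one feature with the error of thresholding a combination of two features.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5000

# Hypothetical feature values (not real Pol-SAR data): two features per pixel,
# each class drawn from a unit-variance Gaussian cluster.
oil = rng.normal(loc=[0.0, 0.0], scale=1.0, size=(n, 2))
water = rng.normal(loc=[2.0, 2.0], scale=1.0, size=(n, 2))


def threshold_error(oil_scores, water_scores, threshold):
    """Fraction of pixels misclassified when scores below the threshold are labelled oil."""
    oil_wrong = np.sum(oil_scores >= threshold)
    water_wrong = np.sum(water_scores < threshold)
    return (oil_wrong + water_wrong) / (oil_scores.size + water_scores.size)


# One feature: threshold halfway between the two class means of feature 0.
err_1d = threshold_error(oil[:, 0], water[:, 0], threshold=1.0)

# Two features: project onto the direction joining the class means,
# then threshold halfway between the projected means (0 and 2*sqrt(2)).
direction = np.array([1.0, 1.0]) / np.sqrt(2.0)
err_2d = threshold_error(oil @ direction, water @ direction, threshold=np.sqrt(2.0))

print(f"single-feature error: {err_1d:.3f}")
print(f"two-feature error:    {err_2d:.3f}")
```

In this toy setting the two-feature error is roughly half the single-feature error, because the class means are farther apart along the combined direction while the spread stays the same.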

Selection Criteria of Fully Pol-SAR Features Based on J-M Distance Method
To fully exploit the advantages of jointly using multiple fully Pol-SAR features for oil spill classification, we first need to determine the optimal combination of features. It is not necessary to use every Pol-SAR feature in the classification process. The fully Pol-SAR features commonly used for oil spill detection include span [8], ρ [7], H [1], A [25], α [25], µ [15], and P [22]. Each of these features has a different ability to distinguish ocean oil spills from water. To use fully Pol-SAR features jointly, we employ a feature selection method to determine their combination. The Jeffreys-Matusita (J-M) distance is an index that is widely used to select useful features [43]. The chosen features serve as the inputs of the WNN in this study. The index is simple to calculate and generally applicable [43]. The J-M distance between two different objects (oil and water) for a given feature is calculated with Equations (1) and (2), where J is the J-M distance index for a given feature, such as span or µ in this study; m1 and m2 are the means of the feature values of the two ground objects, here oil spill and water; and δ1 and δ2 are the deviations of the feature values of oil spill and water, respectively. J ranges from 0 to 2. When J is 0-1, the two objects have weak separability for the feature. When J is 1-1.9, the two objects have some separability. When J > 1.9, the two objects have strong separability [44].
The J-M index calculated from the feature values shows that span, H, µ, α, and P all have high separability for distinguishing ocean oil spills from water, because their J values are greater than 1.5. Therefore, these features are used in the classification experiments. The calculations of these features are summarized in Table 4. The J-M distance indexes are shown in Figure 7. Four fully Pol-SAR features (span, H, µ, and P) are selected for Dataset 1 (solid line), while another four (α, H, µ, and P) are selected for Dataset 2 (dashed line), as their J-M distance indexes are close to or greater than 1.5. Therefore, the five selected features (span, H, µ, P, and α) have some ability to separate oil spills from water.
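The J-M selection step can be sketched in a few lines. Equations (1) and (2) are not reproduced in this text, so the sketch assumes the widely used univariate Gaussian form, J = 2(1 − exp(−B)), where B is the Bhattacharyya distance computed from the class means and standard deviations; the paper's exact equations may differ. The function name `jm_distance`, the synthetic samples, and the feature names are hypothetical.

```python
import numpy as np


def jm_distance(oil_values, water_values):
    """J-M distance between two classes for one feature (assumed Gaussian form).

    B = (m1 - m2)^2 / (4 (s1^2 + s2^2)) + 0.5 * ln((s1^2 + s2^2) / (2 s1 s2))
    J = 2 (1 - exp(-B)), so J lies between 0 and 2.
    """
    m1, m2 = np.mean(oil_values), np.mean(water_values)
    v1, v2 = np.var(oil_values), np.var(water_values)
    if v1 <= 0.0 or v2 <= 0.0:
        raise ValueError("both classes need a non-zero variance")
    b = 0.25 * (m1 - m2) ** 2 / (v1 + v2)
    b += 0.5 * np.log((v1 + v2) / (2.0 * np.sqrt(v1 * v2)))
    return 2.0 * (1.0 - np.exp(-b))


# Synthetic (oil, water) samples for two made-up features.
rng = np.random.default_rng(1)
features = {
    "well_separated": (rng.normal(0.0, 1.0, 2000), rng.normal(6.0, 1.0, 2000)),
    "overlapping": (rng.normal(0.0, 1.0, 2000), rng.normal(0.5, 1.0, 2000)),
}

scores = {name: jm_distance(oil, water) for name, (oil, water) in features.items()}
# Keep features whose J-M distance reaches the 1.5 level used in the text.
selected = [name for name, j in scores.items() if j >= 1.5]

for name, j in scores.items():
    print(f"{name}: J = {j:.3f}")
print("selected:", selected)
```

Only the feature whose class means are far apart relative to their spread passes the 1.5 level, which mirrors how the text retains features with J close to or above 1.5.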
span: backscattered energy [8]; ρ: co-polarized complex correlation [7]; H: polarization entropy [1]; A: anisotropy coefficient [25]; α: mean scattering angle [25]; µ: conformity coefficient [15]; P: degree of polarization [22].
Table 4. Introduction of selected fully Pol-SAR features used in this study.
Figure 2 shows the images of the chosen fully Pol-SAR features. The difference in image tone between oil spill and water is visually apparent, which suggests that the chosen features have good classification ability. However, the edges of the oil spills, such as the bottom pixels of the oil spills in Dataset 1, show weak contrast with water. Classification that depends on only one or two fully Pol-SAR feature images will inevitably cause segmentation errors. The combined use of multiple fully Pol-SAR feature images is necessary to improve the classification performance for oil spills and water.

Overall, the chosen fully Pol-SAR features are effective for oil spill classification in terms of the contrast in the feature images (Figure 2). All selected features are used as the inputs of the optimized WNN classifier to train the neural network.
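As an illustration of how such features are typically derived, the sketch below computes span, polarization entropy H, and anisotropy A from a single 3 × 3 coherency matrix. The calculations of Table 4 are not reproduced in this text, so the sketch assumes the standard eigenvalue-based (Cloude-Pottier) definitions and takes span as the trace of the matrix; the function name and the example matrix are hypothetical, and the paper's exact formulas may differ.

```python
import numpy as np


def span_entropy_anisotropy(T):
    """Span, entropy H, and anisotropy A of a 3x3 Hermitian coherency matrix.

    Assumed definitions: span = trace(T); p_i = lambda_i / sum(lambda);
    H = -sum(p_i * log3(p_i)); A = (lambda_2 - lambda_3) / (lambda_2 + lambda_3).
    """
    T = np.asarray(T, dtype=complex)
    # eigvalsh returns real eigenvalues in ascending order; reverse to descending.
    eigvals = np.clip(np.linalg.eigvalsh(T)[::-1], 0.0, None)
    total = eigvals.sum()
    if total <= 0.0:
        raise ValueError("coherency matrix has no power")
    span = float(np.trace(T).real)
    p = eigvals / total
    nonzero = p[p > 0.0]
    entropy = float(-np.sum(nonzero * np.log(nonzero)) / np.log(3.0))
    denom = eigvals[1] + eigvals[2]
    anisotropy = float((eigvals[1] - eigvals[2]) / denom) if denom > 1e-12 * total else 0.0
    return span, entropy, anisotropy


# Made-up coherency matrix with eigenvalues 4, 2, and 1.
T_example = np.diag([4.0, 2.0, 1.0])
span, H, A = span_entropy_anisotropy(T_example)
print(f"span = {span:.3f}, H = {H:.3f}, A = {A:.3f}")
```

An entropy close to 1 indicates that several scattering mechanisms contribute with similar strength, while an entropy close to 0 indicates a single dominant mechanism.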

Optimization Strategy of the Initial Value of Wavelet Neural Network
Section 2.2.2 explained the selection of fully Pol-SAR features based on the J-M distance index. These features are span, H, µ, and P for the classification experiment with Dataset 1, and α, H, µ, and P for the classification experiment with Dataset 2. The selected features serve as the inputs of the optimized WNN classifier and guide the training of the neural network. This section (Section 2.2.3) introduces the optimization strategy of the WNN. Section "The Architecture of the Wavelet Neural Network" first presents the architecture of the WNN, and Section "Optimization Method of the WNN" then presents the optimization process.

The Architecture of the Wavelet Neural Network
The WNN applied in this study consists of an input layer, a hidden layer, and an output layer. One weight matrix connects the input layer to the hidden layer, and another connects the hidden layer to the output layer. The WNN architecture with a single hidden layer is shown in Figure 8.
The wavelet neural network model is constructed by Equations (4) to (7).
The meaning of the parameters is summarized in Table 5.
Parameter: Definition
M: number of nodes in the input layer
n: number of nodes in the hidden layer
N: number of nodes in the output layer
[w_jk]_{n×M}: n × M weight matrix from the input layer to the hidden layer, where w_jk is the weight connecting node j of the hidden layer with node k of the input layer (the initial value is a random value between −1 and 1)
[w_ij]_{N×n}: N × n weight matrix from the hidden layer to the output layer, where w_ij is the weight connecting node i of the output layer with node j of the hidden layer (the initial value is a random value between −1 and 1)
Input sample: the kth input of the pth sample in the input layer

Optimization Method of the WNN
The initial values of a WNN play an important role in training the network. Good initial values can improve the convergence speed and classification accuracy of a network. The optimization method proposed in this paper obtains a set of optimal initial values for the classifier by exploiting the relationship between the initial values (W_jk, a_i, b_i) of the WNN and the ROI training samples. The detailed steps are as follows:
Step 1 Set the initial value of W_jk. First, a random number between −1 and 1 is generated as the initial value of W_jk. Then, W_jk is normalized according to Equation (8). Next, W_jk is calculated by Equations (9) and (10), where M is the number of input layer nodes, n is the number of hidden layer nodes, C is a constant, and x_kmax and x_kmin are the maximum and minimum values of the input samples of the kth neuron node of the input layer.
Step 2 Calculate the initial value of a_i (scaling parameter of the jth node of the hidden layer) by Equation (11), where ∆x_0i is the width of the window, and x_jmax and x_jmin are the maximum and minimum values of the input samples of the jth neuron node of the input layer.
Step 3 Calculate the initial value of b_i (translation parameter of the jth node of the hidden layer) by Equation (12), where x_jmax, x_jmin, and ∆x_0i have the same meanings as above.
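The idea of tying the initial values to the range of the training samples can be sketched as follows. Equations (4) to (12) are not reproduced in this text, so this is only an illustrative heuristic, not the paper's formulas: it assumes a Morlet-type mother wavelet, a linear output layer, a translation placed at the centre of each hidden node's input window, and a scale equal to the half-width of that window. All function and variable names are hypothetical.

```python
import numpy as np


def morlet(t):
    """Morlet-type mother wavelet (an assumed choice; the text does not name one)."""
    return np.cos(1.75 * t) * np.exp(-0.5 * t ** 2)


def init_wnn(X, n_hidden, n_out, rng):
    """Range-based initial values (illustrative heuristic, not Equations (8)-(12))."""
    n_in = X.shape[1]
    W_in = rng.uniform(-1.0, 1.0, size=(n_hidden, n_in))    # [w_jk], n x M
    W_out = rng.uniform(-1.0, 1.0, size=(n_out, n_hidden))  # [w_ij], N x n
    x_min, x_max = X.min(axis=0), X.max(axis=0)
    # Smallest and largest weighted input each hidden node can receive
    # when every input stays inside its sample range.
    lo = np.minimum(W_in * x_min, W_in * x_max).sum(axis=1)
    hi = np.maximum(W_in * x_min, W_in * x_max).sum(axis=1)
    b = 0.5 * (hi + lo)                    # translation: centre of the window
    a = np.maximum(0.5 * (hi - lo), 1e-8)  # scale: half-width of the window
    return W_in, W_out, a, b


def forward(X, W_in, W_out, a, b):
    """Forward pass: wavelet hidden layer followed by a linear output layer."""
    net = X @ W_in.T                 # shape (samples, n_hidden)
    hidden = morlet((net - b) / a)   # scaled and translated wavelet response
    return hidden @ W_out.T          # shape (samples, n_out)


rng = np.random.default_rng(0)
X = rng.uniform(0.0, 1.0, size=(8, 4))  # 8 made-up pixels with 4 features each
W_in, W_out, a, b = init_wnn(X, n_hidden=6, n_out=2, rng=rng)
Y = forward(X, W_in, W_out, a, b)
print(Y.shape)
```

With this choice, every training sample falls inside the effective support of each wavelet at the start of training, which is one plausible reason why data-dependent initial values can help convergence.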
The classification process of the WNN in this experiment is shown in Figure 9. The process is explained in detail as follows: Step

Classification Result and Accuracy Analysis
The ocean oil spill classification results of the Dataset 1 and Dataset 2 experiments are summarized in Tables 6 and 7, respectively. Table 6 shows the classification results for Dataset 1 with different Pol-SAR features based on the optimized and un-optimized WNN. On the one hand, for the same classifier, the combined use of four features (µ, P, H, and span) gives the highest classification accuracy compared with any single Pol-SAR feature. The classification accuracy of the optimized WNN with four features as the network input reaches 96.55%, with a Kappa coefficient of 0.936. With the optimized classifier, the accuracy obtained with four Pol-SAR features is 7.75%, 5.79%, 3.7%, and 2.13% higher than that obtained using only µ, P, H, or span as the input, respectively. With the un-optimized classifier, the corresponding improvements are 3.65%, 3.12%, 3.91%, and 0.65%, respectively. This result shows that using the four Pol-SAR features as the network input effectively improves the classification ability. On the other hand, for the same input feature, the optimized WNN classifier always achieves higher classification accuracy than the un-optimized classifier, regardless of which feature is used as the input. The classification accuracy of the optimized classifier with the four features combined and with µ, P, H, and span individually is improved by 4.65%, 0.55%, 1.98%, 4.86%, and 3.17%, respectively, compared with that of the un-optimized classifier. This indicates that the optimized WNN improves the classification accuracy. The convergence times also show the superiority of the optimized classifier over the un-optimized classifier.

Table 7 shows the classification results of Dataset 2. It also shows that the optimized wavelet neural network enhances the classification accuracy. As a whole, the classification results of Dataset 1 and Dataset 2 show that the combined use of multiple fully Pol-SAR features and the optimized classifier can greatly improve the classification accuracy of ocean oil spill.

Figures 12 and 13 show the experimental results of Dataset 2. Figure 12 also indicates that the optimized classifier has higher average classification accuracy. However, the classification accuracy of the optimized method fluctuates three times when the number of hidden layer nodes is 35. For the other numbers of hidden layer nodes, the OA of the optimized network is always better than that of the un-optimized network. In short, the experimental results of Dataset 2 also show that the optimized WNN classifier has better and more stable classification ability.
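The two metrics reported in Tables 6 and 7, overall accuracy (OA) and the Kappa coefficient, can both be computed from a confusion matrix. The following is a minimal sketch using the standard definitions; the function name `overall_accuracy_and_kappa` and the example matrix are illustrative assumptions and are not values from the experiments.

```python
import numpy as np

def overall_accuracy_and_kappa(confusion):
    """Return (OA, kappa) for a square confusion matrix.

    Rows are reference classes, columns are predicted classes.
    """
    cm = np.asarray(confusion, dtype=float)
    total = cm.sum()
    # OA: fraction of samples on the diagonal (correctly classified)
    observed = np.trace(cm) / total
    # Agreement expected by chance, from the row and column marginals
    expected = (cm.sum(axis=0) * cm.sum(axis=1)).sum() / total ** 2
    kappa = (observed - expected) / (1.0 - expected)
    return observed, kappa

# Made-up two-class example (oil spill, water); not data from the paper.
example = [[45, 5],
           [10, 40]]
oa, kappa = overall_accuracy_and_kappa(example)
print(f"OA = {oa:.3f}, kappa = {kappa:.3f}")
```

Kappa discounts the agreement that would occur by chance, which is why it is reported alongside OA when the classes (here oil spill and water) may be unbalanced.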

Combination Pattern of Pol-SAR Features for Oil Spill Classification Should Be Taken into Consideration
Quad-polarization SAR measures the scattering matrix and provides full amplitude and phase information for each image pixel [7]. In land-use classification, previous studies show that quad-polarization SAR data yield more useful information than conventional single-polarization SAR [7,8]. RADARSAT-2 quad-polarization (HH, HV, VH, and VV) SAR measures the scattering matrix of each SAR image pixel. The measurement retains all the information describing the polarimetric properties of the observed scene in the scattered field. Due to the vector nature of the scattered field, quad-polarization measurements can be used to classify SAR imagery with different scattering mechanisms [8,15]. However, most previous research on identifying ocean oil spills was based on only a single fully Pol-SAR feature, which inevitably limits the improvement of classification accuracy. Therefore, this study proposes the combined use of multiple fully Pol-SAR features for oil spill classification and shows that it improves the classification ability compared with a single feature.

The combination pattern of the jointly used fully Pol-SAR features should be investigated carefully in further research. In this study, with two sets of RADARSAT-2 data, five fully Pol-SAR features were selected by the J-M distance method and proved effective in improving the classification performance of ocean oil spill. However, when other fully Pol-SAR datasets are used, or when other lookalikes are present in the SAR imagery, the effective combination pattern of fully Pol-SAR features might vary, because the type of oil spill and the variable ocean environment, such as changes in wind speed, may change the effective combination pattern. Consequently, different feature selection methods should be tested to determine the effective combination pattern of fully Pol-SAR features for ocean oil spill classification. Besides the J-M distance method used in this study, feature and band selection methods have been developed in related research fields, such as hyperspectral band selection [43][44][45][46]. Especially when there are several complex lookalikes in the images, more effective feature selection methods are particularly important for enhancing classification accuracy. It is noteworthy that the main contribution of this paper is to propose, for the first time, a scheme for the joint use of multiple fully Pol-SAR features for oil spill classification. Generally, in further oil spill classification experiments, the specific combination pattern of Pol-SAR features to be employed will be determined by testing.

From the pattern recognition perspective, the selection or extraction of representative features of an oil spill image is important for its classification. However, it is also a bottleneck to improving accuracy, due to the variation of the ocean environment when an oil spill occurs. Therefore, learning features automatically from a remote sensing data set rather than using manually designed features, and then performing classification on the learned features, is an effective way to improve classification accuracy [47]. Deep learning theory was explicitly proposed by Hinton et al. [48] in 2006. It is a branch of machine learning based on a set of algorithms that attempt to model high-level abstractions in data [49]. Compared with traditional machine learning theories, the most significant difference of deep learning is its emphasis on automatic feature learning from a huge data set through the organization of multi-layer neurons. In recent years, various deep learning architectures such as Deep Belief Networks (DBN) [50], Convolutional Neural Networks (CNN) [47], and Recurrent Neural Networks (RNN) [51] have been proposed and applied in speech, vision, and image recognition and classification [52]. They have been shown to produce state-of-the-art results in these domains; nevertheless, deep learning usually requires big data [53]. Among deep learning techniques, CNN has achieved remarkable results in image classification, recognition, and other vision tasks [54][55][56][57]. Therefore, we believe the CNN classification model will have wide application in ocean oil spill image recognition and classification, since the growing number of Pol-SAR sources, such as RADARSAT-2, ENVISAT-ASAR, SIR-C/X SAR, and ALOS-PALSAR, allows a large amount of Pol-SAR data on ocean oil spills to be archived. In future research on ocean oil spill classification, we will work on developing a CNN to further enhance classification accuracy.

Conclusions
Oil spill pollution arising from ship or oil platform accidents represents a serious threat to the marine and coastal environment and ecosystems. Remote sensing observations are key to identifying oil spills. To monitor such spill events from space, fully polarimetric synthetic aperture radar data have been increasingly employed to improve oil spill classification. In this study, the combined use of multiple fully Pol-SAR features is exploited to classify sea oil spill in SAR imagery. Experiments undertaken using two sets of RADARSAT-2 fully Pol-SAR data confirm the effectiveness of the proposed approach. The main novelties of this study can be summarized as follows.

• The J-M distance index method is useful for selecting the fully Pol-SAR features. µ, H, span, P, and α are selected in this study. The strong gray-level contrast between oil spill and water pixels illustrates that the selected features separate oil spill and seawater well.

• Jointly using multiple fully Pol-SAR features gives better classification performance for oil spill and seawater than using only a single fully Pol-SAR feature.

• We build a more robust WNN classifier for oil spill classification by setting optimal initial values of the network. The experimental results demonstrate that the optimized WNN classifier considerably improves the classification performance compared with an un-optimized WNN classifier.

• Since both the combined use of fully Pol-SAR features and an optimized WNN classifier improve classification performance, the proposed method is shown to be effective and applicable for ocean oil spill classification.

Figure 1. The ocean oil spill images of RADARSAT-2 in fine quad-polarization imaging mode used in this study: (a) oil spill image of RADARSAT-2 SAR (Dataset 1); and (b) oil spill image of RADARSAT-2 SAR (Dataset 2). The image of Dataset 1 contains oil spill, water, an island, and an oil platform; the image of Dataset 2 contains oil spill, water, and an oil platform.

Figure 2. Presentation of several fully Pol-SAR features of Dataset 1 and Dataset 2.

Figure 3. Frame chart of experimental process for ocean oil spill classification.

Figure 4. Clusters of ocean oil spill and water in the region of interest in two- and three-dimensional feature space: (a) span-α; (b) span-H; and (c) α-span-H.

Figure 5. Probability density of ocean oil spill and water in the region of interest in one-dimensional feature space: (a) span; and (b) H.

Figure 6 effectively demonstrates the necessity of the combined use of multiple fully Pol-SAR features for oil spill classification.

Remote Sens. 2017, 9, 799

Figure 6. The difference in the ability to distinguish oil spill from the boat wake and drilling platforms based on several fully Pol-SAR features extracted from Dataset 1: (a) image of fully Pol-SAR feature H (polarization entropy); (b) image of fully Pol-SAR feature α (mean scattering angle); and (c) image of fully Pol-SAR feature span (backscattered energy).
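Three of the features shown in Figure 6 (H, α, and span) can be derived from the eigen-decomposition of the 3 × 3 polarimetric coherency matrix of a pixel. The sketch below assumes the standard Cloude-Pottier definitions; whether the processing chain used in this study matches these definitions exactly is an assumption, and the function name `cloude_pottier_features` is hypothetical.

```python
import numpy as np

def cloude_pottier_features(T):
    """Return (H, alpha_deg, span) for one 3x3 Hermitian coherency matrix T."""
    T = np.asarray(T, dtype=complex)
    # eigh returns real eigenvalues in ascending order and unit eigenvectors as columns
    eigvals, eigvecs = np.linalg.eigh(T)
    eigvals = np.clip(eigvals, 0.0, None)      # guard against tiny negative round-off
    span = float(np.trace(T).real)             # total backscattered power
    p = eigvals / eigvals.sum()                # pseudo-probabilities of the three mechanisms
    nz = p > 0
    H = float(-np.sum(p[nz] * np.log(p[nz])) / np.log(3))   # entropy with log base 3
    # Alpha angle of each mechanism from the first component of its eigenvector
    alpha_i = np.arccos(np.clip(np.abs(eigvecs[0, :]), 0.0, 1.0))
    alpha_deg = float(np.degrees(np.sum(p * alpha_i)))      # mean scattering angle
    return H, alpha_deg, span

# Illustrative input only: a single dominant scattering mechanism.
H, alpha, span = cloude_pottier_features(np.diag([1.0, 0.0, 0.0]))
print(H, alpha, span)
```

In practice the coherency matrix is averaged over a local window before the decomposition, so that the entropy reflects the mix of scattering mechanisms within that window.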

Figure 7. J-M distance index of different fully Pol-SAR features of Dataset 1 and Dataset 2. span:
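The J-M (Jeffries-Matusita) distance used to rank the features can be illustrated for a single feature and two classes. The sketch below assumes each class is approximately Gaussian in that feature and uses the common 0 to 2 scaling; the exact formulation used for Figure 7 is not reproduced here, and the function name and sample values are hypothetical.

```python
import numpy as np

def jm_distance_1d(a, b):
    """Jeffries-Matusita distance between two 1-D samples, assuming each is Gaussian."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    m1, m2 = a.mean(), b.mean()
    v1, v2 = a.var(ddof=1), b.var(ddof=1)
    # Bhattacharyya distance between two univariate Gaussians
    bhat = (0.25 * (m1 - m2) ** 2 / (v1 + v2)
            + 0.5 * np.log((v1 + v2) / (2.0 * np.sqrt(v1 * v2))))
    # J-M distance saturates at 2 for fully separable classes
    return 2.0 * (1.0 - np.exp(-bhat))

# Synthetic feature values for two classes; not data from the paper.
rng = np.random.default_rng(0)
oil = rng.normal(loc=0.2, scale=0.05, size=500)
water = rng.normal(loc=0.6, scale=0.08, size=500)
print(f"J-M distance: {jm_distance_1d(oil, water):.3f}")
```

Values close to 2 indicate that the feature separates the two classes well, while values close to 0 indicate heavily overlapping distributions.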

Figure 8. Wavelet neural network architecture with a single hidden layer. The excitation function of the hidden layer of the WNN uses the Morlet wavelet function (see Formula (3)).
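The architecture in Figure 8 can be sketched as a forward pass for one sample. Several details are assumptions made for illustration: the Morlet wavelet is taken in the form cos(1.75t)·exp(−t²/2) that is common in the WNN literature (Formula (3) is not reproduced here), the output threshold is subtracted, and a sigmoid is applied at the output layer. The function and variable names are hypothetical.

```python
import numpy as np

def morlet(t):
    """Real Morlet mother wavelet in a form common in WNN literature (assumed)."""
    return np.cos(1.75 * t) * np.exp(-0.5 * t ** 2)

def wnn_forward(x, w_hidden, a, b, w_out, beta):
    """Forward pass of a single-hidden-layer WNN for one input sample.

    x: (M,) inputs; w_hidden: (n, M); a, b: (n,) scaling and translation;
    w_out: (N, n); beta: (N,) output thresholds.
    """
    net_hidden = w_hidden @ x                     # input of each hidden node
    hidden_out = morlet((net_hidden - b) / a)     # wavelet activation, shifted and scaled
    net_out = w_out @ hidden_out - beta           # threshold subtracted (assumed sign)
    return 1.0 / (1.0 + np.exp(-net_out))         # sigmoid output (assumed)

# Sizes follow the text: 4 input features, 15 hidden nodes, 2 classes.
rng = np.random.default_rng(0)
M, n, N = 4, 15, 2
x = rng.uniform(0.0, 1.0, M)
w_hidden = rng.uniform(-1.0, 1.0, (n, M))   # random initial weights in [-1, 1]
w_out = rng.uniform(-1.0, 1.0, (N, n))
beta = rng.uniform(-1.0, 1.0, N)
a = np.ones(n)                              # placeholder scaling parameters
b = np.zeros(n)                             # placeholder translation parameters
y = wnn_forward(x, w_hidden, a, b, w_out, beta)
print(y)
```

The placeholder values of `a` and `b` stand in for the initial values that the optimization method would compute from the training samples.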
net_j^p: input of the jth node in the hidden layer of the pth sample
net_i^p: input of the ith node in the output layer of the pth sample
a_j and b_j: scaling parameter and translation parameter of the jth node of the hidden layer, respectively
ψ_{a,b}(net_j^p): output of the jth node of the hidden layer of the pth sample
β_i: threshold value at the ith node of the output layer (the initial value is a random value in [−1, 1])
y_i^p: the ith actual output in the output layer of the pth sample

Optimization Method of the WNN

Figure 9. Flow chart for the training process of the wavelet neural network.

Figures 10 and 11 show the classification results of Dataset 1. Figure 10 shows the mean overall accuracy (OA) and Kappa coefficient over 30 tests of Dataset 1 with the optimized and un-optimized WNN. The optimized WNN is clearly superior to the un-optimized WNN. Regardless of the number of hidden layer nodes, the optimized method has higher average classification accuracy. Meanwhile, the un-optimized WNN shows obvious fluctuation in classification accuracy, especially when the number of hidden layer nodes is 15, 20, 25, and 30. Regarding the convergence of the two classifiers, all 30 classification tests of the optimized classifier converged, whereas the un-optimized WNN failed to converge twice when the number of hidden layer nodes was 25 and 30. The experimental results of Dataset 1 show that the optimized classifier has better and more stable classification ability.

Figure 10. Classification overall accuracy of optimized and un-optimized WNN with different hidden layer nodes of Dataset 1.

Figure 12. Classification overall accuracy of optimized and un-optimized WNN with different hidden layer nodes of Dataset 2.

4.2. More Advanced Classifier Should Be Introduced or Developed for Further Promoting Ocean Oil Spill Classification Performance

Table 2. Information on the image of RADARSAT-2 used in this study (Dataset 1).

Table 3. Information on the image of RADARSAT-2 used in this study (Dataset 2).

Step 1 Implement the preprocessing of the original image. The Pol-SAR features are extracted, and the expert interpretation map is determined. The region of interest is selected.

Step 2 Build the WNN model. The number of nodes in each layer is determined. The number of input layer nodes equals the number of selected features (in the Dataset 1 experiment, the span, H, µ, and P fully Pol-SAR features are selected, and in the Dataset 2 experiment, the α, H, µ, and P features are selected by the J-M index). The number of hidden layer nodes is determined by testing. In this study, we set the number of hidden layer nodes to 15, 20, 25, 30, 35, and 40 to evaluate the convergence and classification performance under different neural network structures. The number of output layer nodes equals the number of classes. In this study, the classes are oil spill and water, so the number of output layer nodes is 2.

Step 3 Train the wavelet neural network. The pixel values of the selected fully Pol-SAR features (span, H, µ, and P of Dataset 1; α, H, µ, and P of Dataset 2) in the region of interest are used as the input of the WNN to train the network. The number of iterations is set to 100. The minimum output error E_min of the network is set to 1 × 10^−5. If the output error E < E_min, the training iteration ends. If E > E_min, the training iteration continues.
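The stopping rule in Step 3 can be sketched as a small loop. This is only an illustration of the control flow under the stated settings (100 iterations, E_min = 1 × 10^−5); the `step` callable, which stands for one training iteration that returns the current output error, and the toy `HalvingError` class are hypothetical and do not represent the network update itself.

```python
def train_until_converged(step, max_iter=100, e_min=1e-5):
    """Call step() until the returned error drops below e_min or max_iter is reached.

    Returns (iterations_run, final_error, converged).
    """
    error = float("inf")
    for iteration in range(1, max_iter + 1):
        error = step()              # one training iteration, returns output error E
        if error < e_min:           # E < E_min: training ends
            return iteration, error, True
    return max_iter, error, False   # iteration budget exhausted without convergence


class HalvingError:
    """Toy stand-in for a training step: the error halves on every call."""

    def __init__(self):
        self.error = 1.0

    def __call__(self):
        self.error *= 0.5
        return self.error


iterations, final_error, converged = train_until_converged(HalvingError())
print(iterations, final_error, converged)
```

Returning a convergence flag makes it straightforward to count how many of the repeated runs converged for each hidden layer size.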

Table 6. The experimental results of Dataset 1.
Note: OA is the overall accuracy of classification; Kappa is the Kappa coefficient.

Table 7. The experimental results of Dataset 2.