<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Sensors</journal-id>
<journal-title>Sensors</journal-title>
<issn pub-type="epub">1424-8220</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/s90907516</article-id>
<article-id pub-id-type="publisher-id">sensors-09-07516</article-id>
<article-categories>
<subj-group>
<subject>Article</subject></subj-group></article-categories>
<title-group>
<article-title>Remote-Sensing Image Classification Based on an Improved Probabilistic Neural Network</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Zhang</surname><given-names>Yudong</given-names></name><xref ref-type="aff" rid="af1-sensors-09-07516"><sup>1</sup></xref><xref ref-type="corresp" rid="c1-sensors-09-07516"><sup>*</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Wu</surname><given-names>Lenan</given-names></name><xref ref-type="aff" rid="af1-sensors-09-07516"><sup>1</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Neggaz</surname><given-names>Nabil</given-names></name><xref ref-type="aff" rid="af2-sensors-09-07516"><sup>2</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Wang</surname><given-names>Shuihua</given-names></name><xref ref-type="aff" rid="af1-sensors-09-07516"><sup>1</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Wei</surname><given-names>Geng</given-names></name><xref ref-type="aff" rid="af1-sensors-09-07516"><sup>1</sup></xref></contrib></contrib-group>
<aff id="af1-sensors-09-07516">
<label>1</label> School of Information Science and Engineering, Southeast University, Nanjing 210009, China; E-Mails: <email>shuihuaw2007@gmail.com</email> (S.W.); <email>wei_geng@163.com</email> (G.W.)</aff>
<aff id="af2-sensors-09-07516">
<label>2</label> Signal-Image-Parole Laboratory, Department of Computer Science, University of Science and Technology – Oran, Oran, Algeria; E-Mail: <email>neggaz_nabil@yahoo.fr</email></aff>
<author-notes>
<corresp id="c1-sensors-09-07516">
<label>*</label>Author to whom correspondence should be addressed; E-Mail: <email>zhangyudongnuaa@gmail.com</email>.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2009</year></pub-date>
<pub-date pub-type="epub">
<day>23</day>
<month>9</month>
<year>2009</year></pub-date>
<volume>9</volume>
<issue>9</issue>
<fpage>7516</fpage>
<lpage>7539</lpage>
<history>
<date date-type="received">
<day>12</day>
<month>6</month>
<year>2009</year></date>
<date date-type="rev-recd">
<day>2</day>
<month>9</month>
<year>2009</year></date>
<date date-type="accepted">
<day>16</day>
<month>9</month>
<year>2009</year></date></history>
<permissions>
<copyright-statement>© 2009 by the authors; licensee MDPI, Basel, Switzerland</copyright-statement>
<copyright-year>2009</copyright-year>
<license>
<p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>This paper proposes a hybrid classifier for polarimetric SAR images. The feature sets consist of span image, the H/A/α decomposition, and the GLCM-based texture features. Then, a probabilistic neural network (PNN) was adopted for classification, and a novel algorithm proposed to enhance its performance. Principle component analysis (PCA) was chosen to reduce feature dimensions, random division to reduce the number of neurons, and Brent’s search (BS) to find the optimal bias values. The results on San Francisco and Flevoland sites are compared to that using a 3-layer BPNN to demonstrate the validity of our algorithm in terms of confusion matrix and overall accuracy. In addition, the importance of each improvement of the algorithm was proven.</p></abstract>
<kwd-group>
<kwd>polarimetric SAR</kwd>
<kwd>Probabilistic neural network</kwd>
<kwd>gray-level co-occurrence matrix</kwd>
<kwd>principle component analysis</kwd>
<kwd>Brent’s Search</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>The classification of different objects, as well as different terrain characteristics, with single channel monopolarisation SAR images can carry a significant amount of error, even when operating after multilooking [<xref ref-type="bibr" rid="b1-sensors-09-07516">1</xref>]. One of the most challenging applications of polarimetry in remote sensing is landcover classification using fully polarimetric SAR (PolSAR) images.</p>
<p>The Wishart maximum likelihood (WML) method has often been used for PolSAR classification [<xref ref-type="bibr" rid="b2-sensors-09-07516">2</xref>]. This method uses the amplitudes of the elements in the covariance or coherency matrices. However, it does not explicitly take into consideration the phase information within polarimetric data, which plays a direct role in the characterization of a broad range of scattering processes. Furthermore, the covariance or coherency matrices are determined after spatial averaging and therefore can describe only stochastic scattering processes, while certain objects, such as man-made objects, are better characterized at a pixel-level [<xref ref-type="bibr" rid="b3-sensors-09-07516">3</xref>].</p>
<p>To overcome above shortcomings, polarimetric decompositions were introduced with an aim to establish a correspondence between the physical characteristics of the considered areas and the observed scattering mechanisms. There are seven famous decomposition methods: Pauli [<xref ref-type="bibr" rid="b4-sensors-09-07516">4</xref>], Krogager [<xref ref-type="bibr" rid="b5-sensors-09-07516">5</xref>], Freeman [<xref ref-type="bibr" rid="b6-sensors-09-07516">6</xref>], Huynen [<xref ref-type="bibr" rid="b7-sensors-09-07516">7</xref>], Barnes [<xref ref-type="bibr" rid="b8-sensors-09-07516">8</xref>], Cloude [<xref ref-type="bibr" rid="b9-sensors-09-07516">9</xref>] and Holm [<xref ref-type="bibr" rid="b8-sensors-09-07516">8</xref>]. The most effective method among these is the Cloude decomposition, also known as the H/A/α method.</p>
<p>Recently, texture information has been extracted and used as a parameter to enhance the classification results. The texture parameters can be defined as many types, such as entropy [<xref ref-type="bibr" rid="b10-sensors-09-07516">10</xref>], fractal dimension [<xref ref-type="bibr" rid="b11-sensors-09-07516">11</xref>], lacunarity [<xref ref-type="bibr" rid="b12-sensors-09-07516">12</xref>], wavelet energy [<xref ref-type="bibr" rid="b13-sensors-09-07516">13</xref>], semivariograms [<xref ref-type="bibr" rid="b14-sensors-09-07516">14</xref>], and gray-level co-occurrence matrix [<xref ref-type="bibr" rid="b15-sensors-09-07516">15</xref>]. Particularly, the gray-level co-occurrence matrices (GLCM) were already successfully applied to classification problems.</p>
<p>Thus, we chose the combination of H/A/α and GLCM as the parameter set of our method. The next problem is how to choose the best classifier. In the past, standard multi-layered feed-forward NNs with a back propagation (BP) algorithm have been applied for SAR image classification [<xref ref-type="bibr" rid="b16-sensors-09-07516">16</xref>]. BPs are effective methods since they do not involve complex models and equations as compared to traditional regression analysis. In addition, they can easily adapt to new data through a re-training process.</p>
<p>However, BP needs much effort to determine the architecture of networks and more computations for training. Moreover, BP yields deterministic but not probabilistic results. This makes it technically impractical in classifications. Probabilistic neural networks (PNNs), therefore, are effective alternatives that are faster in determining the network architecture and in training. Moreover, PNNs provide probabilistic viewpoints and deterministic classification results [<xref ref-type="bibr" rid="b17-sensors-09-07516">17</xref>].</p>
<p>The input weights and layer weights of PNN can be set directly from the available data, while the bias traditionally is difficult to determine, so it is usually obtained manually either by iterative experiments or by an exhaustive algorithm [<xref ref-type="bibr" rid="b18-sensors-09-07516">18</xref>]. In this paper we propose a novel weights/biases setting method. Available input/target pairs are divided into training and validation subsets to reduce the number of neurons, and Brent’s method [<xref ref-type="bibr" rid="b19-sensors-09-07516">19</xref>] is adopted to find the optimal biases values since the problem can be regarded as a 1-D interval location problem. In addition, Principal Component Analysis (PCA) is employed [<xref ref-type="bibr" rid="b20-sensors-09-07516">20</xref>] in order to reduce the feature dimensions and computation time.</p>
<p>The structure of this paper is as follows: In the next section, we introduce the concept of Pauli decomposition. Section 3 presents the feature set, namely, the span image, the H/A/α decomposition, and the feature derived from GLCM. In section 4, the mechanism, structure and shortcomings of PNNs are introduced. Section 5 proposes our method and expatiates on the three important improvements: PCA, random division and optimization by Brent’s Search. Section 6 applied our method to terrain classification on San Francisco site, and find that our method performs better than 3-layer BPNN method. Section 7 applied our method to crop classification on Flevoland site. Section 8 discusses the significances of combined feature sets, random division, and PCA. Finally, Section 9 concludes this paper.</p></sec>
<sec>
<label>2.</label>
<title>Pauli Decomposition</title>
<sec sec-type="intro">
<label>2.1.</label>
<title>Basic Introduction</title>
<p>The features are derived from the multilook coherence matrix of the polarimetric SAR data. Suppose <italic>S</italic> stands for the measured scattering matrix:
<disp-formula id="FD1">
<label>(1)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">hh</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">hv</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">vh</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">vv</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">hh</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">hv</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">hv</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">vv</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>where <italic>S<sub>qp</sub></italic> represents the scattering coefficients of the targets, <italic>p</italic> the polarization of the incident field, <italic>q</italic> the polarization of the scattered field. <italic>S<sub>hv</sub></italic> equals to <italic>S<sub>vh</sub></italic> since reciprocity applies in a monostatic system configuration.</p>
<p>The Pauli decomposition expresses the scattering matrix <italic>S</italic> in the so-called Pauli basis, which is given by the following three 2×2 matrices:
<disp-formula id="FD2">
<label>(2)</label>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>a</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msqrt>
<mml:mn>2</mml:mn></mml:msqrt></mml:mrow></mml:mfrac>
<mml:mo> </mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mn>1</mml:mn></mml:mtd>
<mml:mtd columnalign="left">
<mml:mn>0</mml:mn></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd columnalign="left">
<mml:mn>1</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>b</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msqrt>
<mml:mn>2</mml:mn></mml:msqrt></mml:mrow></mml:mfrac>
<mml:mo> </mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mn>1</mml:mn></mml:mtd>
<mml:mtd columnalign="left">
<mml:mn>0</mml:mn></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>c</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msqrt>
<mml:mn>2</mml:mn></mml:msqrt></mml:mrow></mml:mfrac>
<mml:mo> </mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd columnalign="left">
<mml:mn>1</mml:mn></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mn>1</mml:mn></mml:mtd>
<mml:mtd columnalign="left">
<mml:mn>0</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula></p>
<p>Thus, <italic>S</italic> can be expressed as:
<disp-formula id="FD3">
<label>(3)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi mathvariant="italic">aS</mml:mi>
<mml:mi>a</mml:mi></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi mathvariant="italic">bS</mml:mi>
<mml:mi>b</mml:mi></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi mathvariant="italic">cS</mml:mi>
<mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></disp-formula>where:
<disp-formula id="FD4">
<label>(4)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">hh</mml:mi></mml:mrow></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">vv</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mrow>
<mml:msqrt>
<mml:mn>2</mml:mn></mml:msqrt></mml:mrow></mml:mfrac>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>b</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">hh</mml:mi></mml:mrow></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">vv</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mrow>
<mml:msqrt>
<mml:mn>2</mml:mn></mml:msqrt></mml:mrow></mml:mfrac>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>=</mml:mo>
<mml:msqrt>
<mml:mn>2</mml:mn></mml:msqrt>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">hv</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></disp-formula></p>
<p>An RGB image could be formed with the intensities |<italic>a</italic>|<sup>2</sup>, |<italic>b</italic>|<sup>2</sup>, |<italic>c</italic>|<sup>2</sup>. The meanings of <italic>S<sub>a</sub></italic>, <italic>S<sub>b</sub></italic>, and <italic>S<sub>c</sub></italic> are listed in <xref ref-type="table" rid="t1-sensors-09-07516">Table 1</xref>.</p></sec>
<sec>
<label>2.2.</label>
<title>Coherence Matrix</title>
<p>The coherence matrix is obtained as:
<disp-formula id="FD5">
<label>(5)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo stretchy="false">[</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>b</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo stretchy="false">]</mml:mo>
<mml:mo> </mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>b</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mo stretchy="false">]</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mi>T</mml:mi></mml:msup>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>11</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>13</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow>
<mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>22</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>23</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>13</mml:mn></mml:mrow>
<mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>23</mml:mn></mml:mrow>
<mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>33</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula></p>
<p>The average of multiple single-look coherence matrices is the multi-look coherence matrix. (<italic>T</italic><sub>11</sub>,<italic>T</italic><sub>22</sub>,<italic>T</italic><sub>33</sub>) usually are regarded the channels of the polarimetric SAR images.</p></sec></sec>
<sec>
<label>3.</label>
<title>Feature Extraction</title>
<p>The proposed features can be divided into three types, which are explained below.</p>
<sec>
<label>3.1.</label>
<title>Span</title>
<p>The span or total scattered power indicates the received power by a fully polarimetric system and is given by:
<disp-formula id="FD6">
<label>(6)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">hh</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:mo>+</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">vv</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:mo>+</mml:mo>
<mml:mn>2</mml:mn>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">hv</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:math></disp-formula></p></sec>
<sec>
<label>3.2.</label>
<title>H/A/Alpha Decomposition</title>
<p>Cloude and Potter [<xref ref-type="bibr" rid="b9-sensors-09-07516">9</xref>] proposed an algorithm to identify in an unsupervised way polarimetric scattering mechanisms in the <italic>H</italic>-<italic>α</italic> plane. The method extends the two assumptions of traditional ways: 1) azimuthally symmetric targets; 2) equal minor eigenvalues <italic>λ</italic><sub>2</sub> and <italic>λ</italic><sub>3</sub>.</p>
<p><italic>T</italic> can be rewritten as:
<disp-formula id="FD7">
<label>(7)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>λ</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>λ</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>λ</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow>
<mml:mo> </mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mn>3</mml:mn>
<mml:mi>H</mml:mi></mml:msubsup></mml:mrow></mml:math></disp-formula>
<disp-formula id="FD8">
<label>(8)</label>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtext>cos</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>cos</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>cos</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>cos</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>β</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mtext>exp</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>δ</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>cos</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>β</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>δ</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>cos</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>β</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>δ</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>β</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>γ</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>β</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>γ</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>β</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>γ</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula></p>
<p>Then, the pseudo-probabilities of the <italic>T</italic> matrix expansion elements are defined as:
<disp-formula id="FD9">
<label>(9)</label>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>λ</mml:mi></mml:mrow>
<mml:mi>j</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>3</mml:mn></mml:msubsup>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>λ</mml:mi></mml:mrow>
<mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula></p>
<p>The entropy indicates the degree of statistical disorder of the scattering phenomenon. It can be defined as:
<disp-formula id="FD10">
<label>(10)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>3</mml:mn></mml:munderover>
<mml:mo> </mml:mo>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mtext>log</mml:mtext>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mo> </mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>≤</mml:mo>
<mml:mi>H</mml:mi>
<mml:mo>≤</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:math></disp-formula></p>
<p>For high entropy values, a complementary parameter (anisotropy) is necessary to fully characterize the set of probabilities. The anisotropy is defined as the relative importance of the second scattering mechanisms [<xref ref-type="bibr" rid="b21-sensors-09-07516">21</xref>]:
<disp-formula id="FD11">
<label>(11)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub></mml:mrow></mml:mfrac>
<mml:mo> </mml:mo>
<mml:mo> </mml:mo>
<mml:mo> </mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>≤</mml:mo>
<mml:mi>A</mml:mi>
<mml:mo>≤</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:math></disp-formula></p>
<p>The four estimates of the angles are easily evaluated as:
<disp-formula id="FD12">
<label>(12)</label>
<mml:math display="block">
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mover accent="true">
<mml:mi>α</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mover accent="true">
<mml:mi>β</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mover accent="true">
<mml:mi>δ</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mover accent="true">
<mml:mi>γ</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo stretchy="false">]</mml:mo>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>3</mml:mn></mml:munderover>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">[</mml:mo>
<mml:mi>α</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>β</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>δ</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>γ</mml:mi>
<mml:mo stretchy="false">]</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>Thus, vectors from coherence matrix can be represented as (<italic>H</italic>, <italic>A</italic>, <italic>ᾱ</italic>, <italic>β̄</italic>, <italic>δ̄</italic>, <italic>γ̄</italic>).</p></sec>
<sec>
<label>3.3.</label>
<title>Texture Features</title>
<p>The Gray level co-occurrence matrix (GLCM) is a text descriptor which takes into account the specific position of a pixel relative to another. The GLCM is a matrix whose elements correspond to the relative frequency of occurrence of pairs of gray level values of pixels separated by a certain distance in a given direction [<xref ref-type="bibr" rid="b22-sensors-09-07516">22</xref>]. Formally, the elements of a GLCM <italic>G</italic>(<italic>i</italic>,<italic>j</italic>) for a displacement vector (<italic>a</italic>,<italic>b</italic>) is defined as
<disp-formula id="FD13">
<label>(13)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>G</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>j</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo stretchy="false">|</mml:mo>
<mml:mo stretchy="false">{</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>y</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>v</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>:</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>&amp;</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>v</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo stretchy="false">}</mml:mo>
<mml:mo stretchy="false">|</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>Where (<italic>t</italic>,<italic>v</italic>) = (<italic>x</italic>+<italic>a</italic>, <italic>y</italic>+<italic>b</italic>), and |•| is the cardinality of a set. The displacement vector (<italic>a</italic>,<italic>b</italic>) can be rewritten as (<italic>d</italic>, <italic>θ</italic>) in polar coordinates.</p>
<p>GLCMs are suggested to calculate from four displacement vectors with <italic>d</italic> = 1 and <italic>θ</italic> = 0°, 45°, 90°, and 135° respectively. In this study, the (<italic>a</italic>,<italic>b</italic>) are chosen as (0,1), (−1,1), (−1,0), and (−1,−1) respectively, and the corresponding GLCMs are averaged.</p>
<p>The four features are extracted from normalized GLCMs, the sum of which is equal to 1. Suppose the normalized GLCM value at (<italic>i</italic>,<italic>j</italic>) is <italic>p</italic>(<italic>i</italic>,<italic>j</italic>), and their detailed definition are listed in <xref ref-type="table" rid="t2-sensors-09-07516">Table 2</xref>.</p></sec>
<sec>
<label>3.4.</label>
<title>Total Features</title>
<p>The texture features consist of 4 GLCM-based features, which should be multiplied by 3 since there exist three channels (<italic>T</italic><sub>11</sub>,<italic>T</italic><sub>22</sub>,<italic>T</italic><sub>33</sub>). In addition, there are one span feature, and six <italic>H</italic>/<italic>α</italic> parameters. In all, the total features are 1 + 6 + 4 × 3 = 19.</p></sec></sec>
<sec>
<label>4.</label>
<title>Probabilistic NN</title>
<sec>
<label>4.1.</label>
<title>Mechanism of PNN</title>
<p>Neural networks are widely used in pattern classification since they do not need any information about the probability distribution and the <italic>a priori</italic> probabilities of different classes. PNNs are basically pattern classifiers. They combine the well known Bayes decision strategy with the Parzen non-parametric estimator of the probability density functions (PDF) of different classes. PNNs have been of interest because they yield a probabilistic output and are easy to implement.</p>
<p>Taking a two categories situation as an example, we should decide the known state of nature <italic>θ</italic> to be either <italic>θ</italic><sub>A</sub> or <italic>θ</italic><sub>B</sub>. Suppose a set of measurements is obtained as <italic>p</italic>-dimensional vector <italic>x</italic> = [<italic>x</italic><sub>1</sub>, …, <italic>x<sub>p</sub></italic>], the Bayes decision rule becomes:
<disp-formula id="FD14">
<label>(14)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi mathvariant="normal">d</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>θ</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">A</mml:mi></mml:msub></mml:mrow></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mtext>if</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">A</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>l</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">A</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">A</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>&gt;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">B</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>l</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">B</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">B</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>θ</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">B</mml:mi></mml:msub></mml:mrow></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mtext>if</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">A</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>l</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">A</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">A</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>&lt;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">B</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>l</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">B</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">B</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:math></disp-formula></p>
<p>Here, <italic>f</italic><sub>A</sub>(<italic>x</italic>) and <italic>f</italic><sub>B</sub>(<italic>x</italic>) are the PDF for categories A and B, respectively. <italic>l</italic><sub>A</sub> is the loss function associated with the wrong decision d(<italic>x</italic>) = <italic>θ</italic><sub>B</sub> when <italic>θ</italic> = <italic>θ</italic><sub>A</sub>, <italic>l</italic><sub>B</sub> is the loss function associated with the wrong decision d(<italic>x</italic>) = <italic>θ</italic><sub>A</sub> when <italic>θ</italic> = <italic>θ</italic><sub>B</sub>, and the losses associated with correct decisions are taken to be zero. <italic>h</italic><sub>A</sub> and <italic>h</italic><sub>B</sub> are the <italic>a priori</italic> probability of occurrence of patters from category A and B, respectively.</p>
<p>In a simple case that assumes the loss function and <italic>a priori</italic> probability are equal, the Bayes rule classifies an input pattern to the class with higher PDF. Therefore, the accuracy of the decision boundaries depends on what the underlying PDFs are estimated. Parzen’s results can be extended to estimate in the special case where the multivariate kernel is a product of univariate kernels. In the particular case of the Gaussian kernel, the multivariate estimates can be expressed as:
<disp-formula id="FD15">
<label>(15)</label>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">A</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi>π</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn></mml:mrow></mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi>σ</mml:mi></mml:mrow>
<mml:mi>p</mml:mi></mml:msup></mml:mrow></mml:mfrac>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>m</mml:mi></mml:mfrac>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>m</mml:mi></mml:munderover>
<mml:mrow>
<mml:mtext>exp</mml:mtext>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="normal">A</mml:mi>
<mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mi mathvariant="normal">T</mml:mi></mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="normal">A</mml:mi>
<mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>σ</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:math></disp-formula></p>
<p>Here, <italic>m</italic> is the number of training vectors in category A, p is the dimensionality of the training vectors, <italic>x</italic><sub>A<italic>i</italic></sub> is the <italic>i</italic>th training vector for category A, and <italic>σ</italic> is the smoothing parameter. It should be noted that <italic>f</italic><sub>A</sub>(<italic>x</italic>) is the sum of small multivariate Gaussian distributions centered at each training sample, but the sum is not limited to being Gaussian.</p></sec>
<sec>
<label>4.2.</label>
<title>PNN Structure</title>
<p><xref ref-type="fig" rid="f1-sensors-09-07516">Figure 1</xref> shows the outline of PNN. When an input is presented, the first layer computes distances from the input vector to the input weights (IW), and produces a vector whose elements indicate how close the input is to the IW. The second layer sums these contributions for each class of inputs to produce as its net output a vector of probabilities. Finally, a <italic>compet</italic> transfer function on the output of the second layer picks the maximum of these probabilities, and produces a 1 for that class and a 0 for other classes.</p>
<p>The mathematical expression of PNN can be expressed as:
<disp-formula id="FD16">
<label>(16)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mo>=</mml:mo>
<mml:mtext>radbas</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">‖</mml:mo>
<mml:mrow>
<mml:mtext>IW</mml:mtext>
<mml:mo>−</mml:mo>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mo stretchy="false">‖</mml:mo></mml:mrow>
<mml:mo>•</mml:mo>
<mml:mi>b</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></disp-formula>
<disp-formula id="FD17">
<label>(17)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mo>=</mml:mo>
<mml:mtext>compet</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mtext>LW</mml:mtext>
<mml:mo>•</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>In this paper, the <italic>radbas</italic> is selected as:
<disp-formula id="FD18">
<label>(18)</label>
<mml:math display="block">
<mml:mrow>
<mml:mtext>radbas</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mi>n</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>The <italic>compet</italic> function is defined as:
<disp-formula id="FD19">
<label>(19)</label>
<mml:math display="block">
<mml:mrow>
<mml:mtext>compet</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo> </mml:mo>
<mml:mn>0</mml:mn>
<mml:mo> </mml:mo>
<mml:mo>⋯</mml:mo>
<mml:mo> </mml:mo>
<mml:mo> </mml:mo>
<mml:mn>0</mml:mn>
<mml:mo> </mml:mo>
<mml:munder>
<mml:mn>1</mml:mn>
<mml:mi>i</mml:mi></mml:munder>
<mml:mo> </mml:mo>
<mml:mn>0</mml:mn>
<mml:mo> </mml:mo>
<mml:mo> </mml:mo>
<mml:mo>⋯</mml:mo>
<mml:mo> </mml:mo>
<mml:mn>0</mml:mn></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mo> </mml:mo>
<mml:mi>n</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mtext>max</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>This type of setting can produce a network with zero errors on training vectors, and obviously it does not need any training.</p></sec>
<sec>
<label>4.3.</label>
<title>Shortcomings of Traditional PNN</title>
<p>Suppose <italic>P</italic> and <italic>T</italic> denote the set of training vector <italic>x</italic> and corresponding target vector <italic>y</italic>, namely, <italic>P</italic> = [<italic>x</italic><sub>1</sub>, <italic>x</italic><sub>2</sub>, …<italic>x<sub>Q</sub></italic>], and <italic>T</italic> = [<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>, <italic>y<sub>Q</sub></italic>]. IW and LW are set traditionally as follows:
<disp-formula id="FD20">
<label>(20)</label>
<mml:math display="block">
<mml:mrow>
<mml:mtext>IW</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mi>P</mml:mi></mml:mrow></mml:math></disp-formula>
<disp-formula id="FD21">
<label>(21)</label>
<mml:math display="block">
<mml:mrow>
<mml:mtext>LW</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mi>T</mml:mi></mml:mrow></mml:math></disp-formula></p>
<p>However, it is obvious that <italic>Q</italic> is usually very large, and then the net will be too big and consume too much computation time. On the other hand, to simplify the setting of bias <italic>b</italic>, all of its components are considered as equal [<xref ref-type="bibr" rid="b23-sensors-09-07516">23</xref>]. Even so, the setting of <italic>b</italic> is still a challenge. Although errors on training vectors are always zero, the errors on test vectors are greatly dependent with the value of <italic>b</italic>.</p>
<p>If it is too small, the spread of each radial basis layer function becomes too large, and the network will take too many nearby design vectors into account, moreover, the radial basis neurons will output large values (near 1) for all the inputs used to design the network. If it is too larger, the spread becomes near zero, and the network will degrades as a nearest neighbor classifier.</p></sec></sec>
<sec>
<label>5.</label>
<title>A Novel Method of Weights/Biases Setting</title>
<p>Here we propose a novel method to solve the above two problems. The main idea is shown in <xref ref-type="fig" rid="f2-sensors-09-07516">Figure 2</xref>. Our improvement lies in the PCA, the random division, and the single variable optimization.</p>
<sec>
<label>5.1.</label>
<title>Feature Reduction</title>
<p>Excessive features increase computation times and storage memory. Furthermore, they sometimes make classification more complicated, which is called the curse of dimensionality. It is necessary to reduce the number of features.</p>
<p>Principal component analysis (PCA) is an efficient tool to reduce the dimensionality of a data set consisting of a large number of interrelated variables, while retaining most of the variations. It is achieved by transforming the data set to a new set of ordered variables. This technique has three effects: it orthogonalizes the components of the input vectors so that uncorrelated with each other, it orders the resulting orthogonal components so that those with the largest variation come first, and eliminates those components contributing the least to the variation in the data set.</p>
<p>It should be noted that the input vectors should be normalized to have zero mean and unity variance before performing PCA, which is shown in <xref ref-type="fig" rid="f3-sensors-09-07516">Figure 3</xref>.</p>
<p>The normalization is a standard procedure. Details about PCA can be found in Ref. [<xref ref-type="bibr" rid="b24-sensors-09-07516">24</xref>].</p></sec>
<sec>
<label>5.2.</label>
<title>Random Division</title>
<p>Realistic sample numbers <italic>Q</italic> are generally very large, which leads to a quite large PNN. Thus, we divide the available data into two subsets: training subset and validation subset. The ratio of each is called <italic>trainRatio</italic> and <italic>validRatio</italic> respectively. In order to save the storage room of the net and to fasten the computation, the <italic>trainRatio</italic> is set as small as possible, and meanwhile it should not affect the accuracy of the NN.</p></sec>
<sec>
<label>5.3.</label>
<title>Optimization by Brent’s Search</title>
<p>The goal of finding optimal <italic>b</italic> can be obtained by solving this problem: find the minimum MSE on validation subset of the corresponding <italic>b</italic>. This can be depicted as a single-variable optimization problem in the dash-line rectangle in <xref ref-type="fig" rid="f2-sensors-09-07516">Figure 2</xref>. Brent’s Search (BS) method is adopted to solve this optimization problem.</p>
<p>BS is a linear search, a hybrid of the golden section search and a quadratic interpolation. Golden section search has a first-order rate of convergence, while polynomial interpolations have an asymptotic rate faster than super-linear. On the other hand, the rate of convergence for the golden section search starts when the algorithm is initialized, whereas the asymptotic behavior for the polynomial interpolations can take many iterations to become apparent. BS attempts to combine the best features of both approaches. BS has the advantage that it does not require computation of the derivative, which greatly fits the optimization problem.</p></sec></sec>
<sec>
<label>6.</label>
<title>Terrain Classification</title>
<p>The NASA/JPL AirSAR L-band data for the San Francisco (California, USA) area was used for the experiments. Its size is 1,024 × 900. In order to reduce the computations, the sub-area with size 600 × 600 was extracted from the left-upper point of original image. The ground truth of the test site can be found at Ref. [<xref ref-type="bibr" rid="b2-sensors-09-07516">2</xref>].</p>
<p>Quantitative information about the experiment is described as follows, where ‘•’ denotes parameters known before simulation and ‘♦’ denotes the parameters obtained at the initial stage of the experiment.</p>
<list list-type="simple">
<list-item>
<p>•Number of features: 19</p></list-item>
<list-item>
<p>♦ Number of reduced features by PCA: 11 (obtained by performing PCA on total available pairs)</p></list-item>
<list-item>
<p>•Location of Sub San Francisco Area:
<list list-type="bullet">
<list-item>
<p>X-range: 1–600</p></list-item>
<list-item>
<p>Y-range: 1–600</p></list-item></list></p></list-item>
<list-item>
<p>•Location of Training/Test Rectangular Area (the first and second pixels denote the coordinate of the left-upper point of the rectangle, the third and forth pixels denote the width and length of the rectangle)
<list list-type="bullet">
<list-item>
<p>Sea:
<list list-type="bullet">
<list-item>
<p>Training Area1 [100 500 60 60]</p></list-item>
<list-item>
<p>Training Area2 [300 200 60 60]</p></list-item>
<list-item>
<p>Test Area [500 50 60 60]</p></list-item></list></p></list-item>
<list-item>
<p>Urban:
<list list-type="bullet">
<list-item>
<p>Training Area1 [450 400 60 60]</p></list-item>
<list-item>
<p>Training Area2 [500 250 60 60]</p></list-item>
<list-item>
<p>Test Area [500 530 60 60]</p></list-item></list></p></list-item>
<list-item>
<p>Vegetated
<list list-type="bullet">
<list-item>
<p>Training Area1 [50 50 60 60]</p></list-item>
<list-item>
<p>Training Area2 [50 250 60 60]</p></list-item>
<list-item>
<p>Test Area [320 450 60 60]</p></list-item></list></p></list-item></list></p></list-item>
<list-item>
<p>•Parameters of GLCM
<list list-type="bullet">
<list-item>
<p>local area: 5 × 5 (pixels)</p></list-item>
<list-item>
<p>Number of gray levels: 8</p></list-item>
<list-item>
<p>Offset: [0 1]</p></list-item></list></p></list-item>
<list-item>
<p>•Properties of available training/target pairs
<list list-type="bullet">
<list-item>
<p>Pairs = 21,600</p></list-item>
<list-item>
<p>R = 11</p></list-item>
<list-item>
<p>K = 3</p></list-item>
<list-item>
<p>P (size 11 × 21,600)</p></list-item>
<list-item>
<p>T (size 3 × 21,600)
<list list-type="simple">
<list-item>
<p>♦ Training Ratio: 0.01 (obtained by simple iterative tests)</p></list-item>
<list-item>
<p>•Validation Ratio: 0.99</p></list-item></list></p></list-item></list></p></list-item>
<list-item>
<p>•Properties of NN optimized by our approach
<list list-type="simple">
<list-item>
<p>•Q = Pairs × trainRatio = 216</p></list-item>
<list-item>
<p>♦ b = 4.73(obtained by BS method)</p></list-item>
<list-item>
<p>•IW = P (size: 216 × 11)</p></list-item>
<list-item>
<p>•LW = T (size: 3 × 216)</p></list-item></list></p></list-item>
<list-item>
<p>•Properties of BS Method
<list list-type="bullet">
<list-item>
<p>Tolerance X Value: 1e–3</p></list-item>
<list-item>
<p>Tolerance Function Value: 1e–5</p></list-item>
<list-item>
<p>Maximum Iterative Steps: 30</p></list-item></list></p></list-item>
<list-item>
<p>•Hardware: Pentium 4 CPU 1.66 GHz, 512 MB of RAM</p></list-item>
<list-item>
<p>•Software: PolSARpro v4.0, Neural Network Toolbox of Matlab 7.8(R2009)</p></list-item></list>
<sec>
<label>6.1.</label>
<title>Denoising by Lee Filter</title>
<p>The sub-area (600 × 600) is shown in <xref ref-type="fig" rid="f4-sensors-09-07516">Figure 4(a)</xref>. The Refined Lee filter (Window size = 7) is used to reduce the speckle noise and the results are shown in <xref ref-type="fig" rid="f4-sensors-09-07516">Figure 4(b)</xref>. The Lee filter adapts the amount of filtering to the local statistics. Homogeneous areas are filtered with the maximum strength where point scatterers are let unfiltered. The refined filter could use directional windows to preserve edges and heterogeneous features [<xref ref-type="bibr" rid="b25-sensors-09-07516">25</xref>].</p></sec>
<sec>
<label>6.2.</label>
<title>Full Features Set</title>
<p>Then, the basic span image and three channels (<italic>T</italic><sub>11</sub>,<italic>T</italic><sub>22</sub>,<italic>T</italic><sub>33</sub>) are easily obtained and shown in <xref ref-type="fig" rid="f5-sensors-09-07516">Figure 5</xref>. The parameters of H/A/Alpha decomposition are shown in <xref ref-type="fig" rid="f6-sensors-09-07516">Figure 6</xref>. The GLCM-based parameters of <italic>T</italic><sub>11</sub>, <italic>T</italic><sub>22</sub>, <italic>T</italic><sub>33</sub> are shown in <xref ref-type="fig" rid="f7-sensors-09-07516">Figures 7</xref>–<xref ref-type="fig" rid="f9-sensors-09-07516">9</xref>.</p></sec>
<sec>
<label>6.3.</label>
<title>Feature Reduction by PCA</title>
<p>The curve of cumulative sum of variance with dimensions of reduced vectors via PCA is shown in <xref ref-type="fig" rid="f10-sensors-09-07516">Figure 10</xref>. The detailed data are listed in <xref ref-type="table" rid="t3-sensors-09-07516">Table 3</xref>. It shows that only 11 features, half the original features only, could preserve 96.36% of variance.</p>
<p>Thus, 11 new features obtained via PCA are input to the NN for classification training.</p></sec>
<sec>
<label>6.4.</label>
<title>Training Preparation</title>
<p>The classification is run over three classes, the sea, the urban areas and the vegetated zones. The training and testing areas are selected manually shown in <xref ref-type="fig" rid="f11-sensors-09-07516">Figures 11(a)–(b)</xref>, respectively. Each square has a size of 60×60. In total, there are 21,600 pixels for training, and 10,800 pixels for testing. In this experiment, <italic>trainRatio</italic> is adjusted finally as 0.01, namely, the <italic>validRatio</italic> equals 0.99. In this way, the network only has 1% neurons of that constructed by traditional approach. The training subset and validation subset of the training area are divided randomly.</p></sec>
<sec>
<label>6.5.</label>
<title>Weights/Biases Setting</title>
<p>The IW and LW are easily set according to our novel approach, and the number of neurons decreases from 21,600 to only 216. The <italic>b</italic> is estimated by BS method. Its initial range is set as [0.01, 20], which is large enough to contain the optimal point. The curve of classification error versus the steps is shown in <xref ref-type="fig" rid="f12-sensors-09-07516">Figure 12</xref>. It is evident that the classification error converges at only three steps shown in the red dot. However, BS will continue to search the best <italic>b</italic> value since the tolerance of <italic>b</italic> is set as small as 1e–3. The whole process of the change of b is shown in <xref ref-type="fig" rid="f13-sensors-09-07516">Figure 13</xref>.</p>
<p>The optimal <italic>b</italic> is found as 4.73, with the smallest error 1.557%, namely, the highest classification accuracy 98.44%.</p></sec>
<sec>
<label>6.6.</label>
<title>Application to the Whole Image</title>
<p>We use the trained PNN to classify the whole image, and the results are shown in <xref ref-type="fig" rid="f14-sensors-09-07516">Figure 14</xref>. The brims of length 3 are not calculated considering the local area of GLCM, so the size here is only 594 × 594.</p>
<p>From <xref ref-type="fig" rid="f14-sensors-09-07516">Figure 14</xref> it makes clear that the sea can be classified perfectly, while the vegetated and urban areas are easily inter-confused. The next section will calculate the confusion matrix which reflects the degree of confusion between the three classes.</p></sec>
<sec>
<label>6.7.</label>
<title>Comparison with Other Approaches</title>
<p>Finally, our method is compared to the 3-layer BPNN [<xref ref-type="bibr" rid="b16-sensors-09-07516">16</xref>]. The confusion matrices (CM) by each methods on training area and testing area are listed in <xref ref-type="table" rid="t4-sensors-09-07516">Table 4</xref>. The element of <italic>i</italic>th row and <italic>j</italic>th column in the 3 × 3 matrix represents the amount of pixels belonging to class <italic>j</italic> as user defined are assigned to class <italic>i</italic> after the supervised classification.</p>
<p>It is obvious that the classification accuracies of our proposed method in training area are all higher than 32.5% (33.3% denotes the perfect classification). For the testing area, classification accuracies are all higher than 30.1%. The main drawback is around 3.3% of vegetated zones are misclassified as urban area.</p>
<p>The overall accuracies are calculated as CM<sub>11</sub> + CM<sub>22</sub> + CM<sub>33</sub> and listed in <xref ref-type="table" rid="t5-sensors-09-07516">Table 5</xref>, that demonstrates our method has a higher overall accuracy in both training area and testing area than those of 3-layer BPNN. The reason our method outperforms the 3-layer BPNN lies in not only the fact that PNN is adept at predicting the probabilistic results, but also the selected features sets are more discernable.</p></sec></sec>
<sec>
<label>7.</label>
<title>Crop Classification</title>
<p>Flevoland, an agricultural area in The Netherlands, was chosen as another example. The site is composed of strips of rectangular agricultural fields. The scene is designated as a supersite for the earth observing system (EOS) program, and is continuously surveyed by the authorities. The ground truth of the test site can be seen in Ref [<xref ref-type="bibr" rid="b26-sensors-09-07516">26</xref>].</p>
<list list-type="simple">
<list-item>
<p>•Number of features: 19</p></list-item>
<list-item>
<p>♦ Number of reduced features by PCA: 13 (obtained by performing PCA on total available pairs)</p></list-item>
<list-item>
<p>•Location of Train/Test Rectangular Area
<list list-type="bullet">
<list-item>
<p>Bare Soil 1:
<list list-type="bullet">
<list-item>
<p>Train Area [240 300 20 20]</p></list-item>
<list-item>
<p>Test Area [770 490 20 20]</p></list-item></list></p></list-item>
<list-item>
<p>Bare Soil 2
<list list-type="bullet">
<list-item>
<p>Train Area [335 440 20 20]</p></list-item>
<list-item>
<p>Test Area [420 425 20 20]</p></list-item></list></p></list-item>
<list-item>
<p>Barley
<list list-type="bullet">
<list-item>
<p>Train Area [285 500 20 20]</p></list-item>
<list-item>
<p>Test Area [765 425 20 20]</p></list-item></list></p></list-item>
<list-item>
<p>Forest
<list list-type="bullet">
<list-item>
<p>Train Area [959 155 20 20]</p></list-item>
<list-item>
<p>Test Area [900 490 20 20]</p></list-item></list></p></list-item>
<list-item>
<p>Grass
<list list-type="bullet">
<list-item>
<p>Train Area [535 240 20 20]</p></list-item>
<list-item>
<p>Test Area [500 303 20 20]</p></list-item></list></p></list-item>
<list-item>
<p>Lucerne
<list list-type="bullet">
<list-item>
<p>Train Area [550 495 20 20]</p></list-item>
<list-item>
<p>Test Area [505 550 20 20]</p></list-item></list></p></list-item>
<list-item>
<p>Peas
<list list-type="bullet">
<list-item>
<p>Train Area [523 330 20 20]</p></list-item>
<list-item>
<p>Test Area [436 200 20 20]</p></list-item></list></p></list-item>
<list-item>
<p>Potatoes
<list list-type="bullet">
<list-item>
<p>Train Area [32 40 20 20]</p></list-item>
<list-item>
<p>Test Area [655 307 20 20]</p></list-item></list></p></list-item>
<list-item>
<p>Rapeseed
<list list-type="bullet">
<list-item>
<p>Train Area [188 200 20 20]</p></list-item>
<list-item>
<p>Test Area [280 250 20 20]</p></list-item></list></p></list-item>
<list-item>
<p>Stem Beans
<list list-type="bullet">
<list-item>
<p>Train Area [800 350 20 20]</p></list-item>
<list-item>
<p>Test Area [777 384 20 20]</p></list-item></list></p></list-item>
<list-item>
<p>Sugar beet
<list list-type="bullet">
<list-item>
<p>Train Area [877 444 20 20]</p></list-item>
<list-item>
<p>Test Area [650 225 20 20]</p></list-item></list></p></list-item>
<list-item>
<p>Water
<list list-type="bullet">
<list-item>
<p>Train Area [965 50 20 20]</p></list-item>
<list-item>
<p>Test Area [961 201 20 20]</p></list-item></list></p></list-item>
<list-item>
<p>Wheat
<list list-type="bullet">
<list-item>
<p>Train Area [780 710 20 20]</p></list-item>
<list-item>
<p>Test Area [700 520 20 20]</p></list-item></list></p></list-item></list></p></list-item>
<list-item>
<p>•Parameters of GLCM
<list list-type="bullet">
<list-item>
<p>local area: 5×5 (pixels)</p></list-item>
<list-item>
<p>Number of gray levels: 8</p></list-item>
<list-item>
<p>Offset: [0 1]</p></list-item></list></p></list-item>
<list-item>
<p>•Properties of available training/target pairs
<list list-type="bullet">
<list-item>
<p>Pairs = 5200</p></list-item>
<list-item>
<p>R = 13</p></list-item>
<list-item>
<p>K = 13</p></list-item>
<list-item>
<p>P (size 13 × 5200)</p></list-item>
<list-item>
<p>T (size 13 × 5200)
<list list-type="simple">
<list-item>
<p>♦ Training Ratio: 0.2 (obtained by simple iterative tests)</p></list-item>
<list-item>
<p>•Validation Ratio: 0.8</p></list-item></list></p></list-item></list></p></list-item>
<list-item>
<p>•Properties of NN optimized by our approach
<list list-type="simple">
<list-item>
<p>•Q = Pairs × trainRatio = 1040</p></list-item>
<list-item>
<p>♦ b = 1.0827(obtained by BS method)</p></list-item>
<list-item>
<p>•IW = P (size: 13 × 1040)</p></list-item>
<list-item>
<p>•LW = T (size: 13 × 1040)</p></list-item></list></p></list-item>
<list-item>
<p>•Properties of BS Method
<list list-type="bullet">
<list-item>
<p>Tolerance X Value: 1e–3</p></list-item>
<list-item>
<p>Tolerance Function Value: 1e–5</p></list-item>
<list-item>
<p>Maximum Iterative Steps: 30</p></list-item></list></p></list-item>
<list-item>
<p>•Hardware: Pentium 4 CPU 1.66 GHz, 512 MB of RAM</p></list-item>
<list-item>
<p>•Software: PolSARpro v4.0, Neural Network Toolbox of Matlab 7.8(R2009)</p></list-item></list>
<sec>
<label>7.1.</label>
<title>Refine Lee Filter</title>
<p>The Pauli image of Flevoland is shown in <xref ref-type="fig" rid="f15-sensors-09-07516">Figure 15(a)</xref>, and the refined Lee filtered image (Window Size = 7) is shown in <xref ref-type="fig" rid="f15-sensors-09-07516">Figure 15(b)</xref>.</p></sec>
<sec>
<label>7.2.</label>
<title>Full Features</title>
<p>The basic span image and three channels (<italic>T</italic><sub>11</sub>,<italic>T</italic><sub>22</sub>,<italic>T</italic><sub>33</sub>) are easily obtained and shown in <xref ref-type="fig" rid="f16-sensors-09-07516">Figure 16</xref>. The parameters of H/A/Alpha decomposition are shown in <xref ref-type="fig" rid="f17-sensors-09-07516">Figure 17</xref>. The GLCM-based parameters of <italic>T</italic><sub>11</sub>, <italic>T</italic><sub>22</sub>, <italic>T</italic><sub>33</sub> are shown in <xref ref-type="fig" rid="f18-sensors-09-07516">Figures 18</xref>–<xref ref-type="fig" rid="f20-sensors-09-07516">20</xref>.</p></sec>
<sec>
<label>7.3.</label>
<title>Feature Reduction</title>
<p>The curve of cumulative sum of variance with dimensions of reduced vectors via PCA is shown in <xref ref-type="fig" rid="f21-sensors-09-07516">Figure 21</xref>. The detailed data are listed in <xref ref-type="table" rid="t6-sensors-09-07516">Table 6</xref>. It shows that only 13 features, which are only half the original features, could preserve 98.06% of variance.</p></sec>
<sec>
<label>7.4.</label>
<title>Training Preparation</title>
<p>The classification is run over 11 classes, bare soil 1, bare soil 2, barley, forest, grass, Lucerne, peas, potatoes, rapeseed, stem beans, and sugar beet. They are selected manually according to the ground truth [<xref ref-type="bibr" rid="b26-sensors-09-07516">26</xref>]. The training set and testing set are shown in <xref ref-type="fig" rid="f22-sensors-09-07516">Figure 22</xref>. Each square has a size of 20 × 20. In total, there are 5,200 pixels for training, and 5,200 pixels for testing.</p>
<p>Since the types of classes increase (3 to 13) while the available data decrease (21,800 to 5,200), thus, we should dispose more data to the training subset. Finally, <italic>trainRatio</italic> is adjusted as 0.2, while <italic>validRatio</italic> is set as 0.8. In this way, the network only has 1/5 neurons of that constructed by traditional approach. The training subset and validation subset of the training area are divided randomly.</p></sec>
<sec>
<label>7.5.</label>
<title>Weights/Biases Setting</title>
<p>The IW and LW are easily set according to our novel approach, and the number of neurons decreases from 5200 to only 1040. The <italic>b</italic> is estimated by BS method. Its initial range is set as before. The curve of classification error versus the steps is shown in <xref ref-type="fig" rid="f23-sensors-09-07516">Figure 23</xref>. It is evident that the classification error reaches the minimum at 17th steps shown in the red dot. The whole process of the change of <italic>b</italic> is shown in <xref ref-type="fig" rid="f24-sensors-09-07516">Figure 24</xref>.</p>
<p>The optimal <italic>b</italic> is found as 1.0827, with the smallest error 7.8%, namely, the highest classification accuracy on the validation subset of training area is 98.44%.</p></sec>
<sec sec-type="results">
<label>7.6.</label>
<title>Classification Results</title>
<p>The confusion matrices on training area and testing area are calculated and listed in <xref ref-type="fig" rid="f25-sensors-09-07516">Figure 25</xref> and <xref ref-type="fig" rid="f26-sensors-09-07516">Figure 26</xref>. The overall accuracy of our method on training area and test area are 93.71% and 86.2% respectively.</p>
<p>We apply our method on the whole image. The results are shown in <xref ref-type="fig" rid="f27-sensors-09-07516">Figure 27</xref>. From <xref ref-type="fig" rid="f27-sensors-09-07516">Figure 27</xref> it is clear that our method can classify most of areas correctly.</p></sec></sec>
<sec sec-type="discussion">
<label>8.</label>
<title>Discussion</title>
<p>The BS has an important effect on our algorithm as shown in <xref ref-type="fig" rid="f12-sensors-09-07516">Figure 12</xref> and <xref ref-type="fig" rid="f23-sensors-09-07516">Figure 23</xref>. It can guide users to find the optimal <italic>b</italic> value in quite short steps. Otherwise, the users will take a long time with the help of exhaustive search algorithm. The function of the combined feature, random division and PCA will be discussed in detail at following paragraphs.</p>
<sec>
<label>8.1.</label>
<title>Single Type of Feature Set versus Combined Feature Sets</title>
<p>The feature sets can be divided into two types. One is the polarimetric feature set, which contains the span, the six H/A/α parameters; the other is the texture feature set, which contains the properties extracted from the GLCM.</p>
<p><xref ref-type="table" rid="t7-sensors-09-07516">Table 7</xref> lists the classification accuracy of classifiers using polarimetric feature set, texture feature set, and combined feature set. It indicates that the polarimetric features contribute most to the classification while the texture feature contribute less. Then, we can find that the combined feature set performs better than each single. Thus, our classifier using combined feature set can be regarded as an feature fusion method.</p></sec>
<sec>
<label>8.2.</label>
<title>With and without Random Division</title>
<p>If we do not use the random division, the structure of PNN will increase 1/<italic>trainRatio</italic> times. Consequently, the computation will become a burden with very little improve on classification overall accuracy. Taking the San Francisco area as the example, four square areas of different size are picked out randomly from the image, and are classified by PNNs with and without random division. The computation time and overall accuracy of each are listed in <xref ref-type="table" rid="t8-sensors-09-07516">Table 8</xref>.</p>
<p><xref ref-type="table" rid="t8-sensors-09-07516">Table 8</xref> indicates that the computation time of traditional method is only 46 times of that of our method for 10 × 10 area, however, the ratio rockets to 516 for 40 × 40 area. Moreover, for a larger size area, such as 50 × 50, it cannot work because of the lack of memory.</p>
<p>From another point of view, the overall accuracy of traditional method was expected to be much higher than that of our method since it uses a great many neurons, whereas, in fact, they are nearly the same. The reason may consist of the optimization of <italic>b</italic> in our method. Accordingly, our method of weights/biases setting is valid and effective, and it is superior to traditional method in terms of computation time and storage room while it can maintain a high overall accuracy.</p></sec>
<sec>
<label>8.3.</label>
<title>With and without PCA</title>
<p>PNNs with and without PCA are investigated in the same manner as in Section 7.1. Their computation times are depicted in <xref ref-type="fig" rid="f28-sensors-09-07516">Figure 28</xref>, which indicates that PNN with PCA enjoys a less computation time than that of PNN without PCA. Their time differences are gradually becoming large as the width of the randomly selected area is increasing.</p>
<p>In addition, the overall accuracies of these two PNNs are observed. It should be noted that input data of the PNN without PCA still should be normalized although the PCA is omitted, otherwise the performance of PNN will decrease rapidly.</p>
<p>The overall accuracies obtained by the two PNNs are pictured in <xref ref-type="fig" rid="f29-sensors-09-07516">Figure 29</xref>. It demonstrates that the PNN with PCA outperforms PNN without PCA on the small test area (width &lt; 40). As the area becomes large (40 &lt; width &lt; 47), the PNN without PCA is better. Finally, as the area becomes large enough (width &gt; 47), these performances of the two PNNs are nearly equivalent. Therefore, our method embedding PCA can performs faster, and has no loss of overall accuracy.</p></sec></sec>
<sec sec-type="conclusions">
<label>9.</label>
<title>Conclusions</title>
<p>In this paper, a hybrid feature set has been introduced which is made up of the span image, the H/A/α decomposition, and the GLCM-based texture features. Then, a probabilistic neural network has been established. We proposed a novel weights/biases setting method based on Brent’s method and PCA. The method can decrease the feature set, reduce the number of neurons, and find optimal bias values.</p>
<p>Experiments of terrain classification on a San Francisco site and a crop classification on Flevoland show that our method can obtain good results which are more accurate than those of 3-layer BPNN. Afterwards, combined feature set, random division and PCA are assumed to be omitted in turn, and the results prove the indispensability of each improvement.</p></sec></body>
<back>
<ref-list>
<title>References and Notes</title>
<ref id="b1-sensors-09-07516"><label>1.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pellizzeri</surname><given-names>T.M.</given-names></name></person-group><article-title>Classification of polarimetric SAR images of suburban areas using joint annealed segmentation and “H/A/α” polarimetric decomposition</article-title><source>ISPRS J. Photogramm. Remote Sens</source><year>2003</year><volume>58</volume><fpage>55</fpage><lpage>70</lpage><pub-id pub-id-type="doi">10.1016/S0924-2716(03)00017-0</pub-id></citation></ref>
<ref id="b2-sensors-09-07516"><label>2.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname><given-names>J.S.</given-names></name><name><surname>Grunes</surname><given-names>M.R.</given-names></name><name><surname>Kwok</surname><given-names>R.</given-names></name></person-group><article-title>Classification of multi-look polarimetric SAR imagery based on complex Wishart distribution</article-title><source>Int. J. Remote Sens</source><year>1994</year><volume>15</volume><fpage>2299</fpage><lpage>2311</lpage><pub-id pub-id-type="doi">10.1080/01431169408954244</pub-id></citation></ref>
<ref id="b3-sensors-09-07516"><label>3.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shimoni</surname><given-names>M.</given-names></name><name><surname>Borghys</surname><given-names>D.</given-names></name><name><surname>Heremans</surname><given-names>R.</given-names></name><name><surname>Perneel</surname><given-names>C.</given-names></name><name><surname>Acheroy</surname><given-names>M.</given-names></name></person-group><article-title>Fusion of PolSAR and PolInSAR data for land cover classification</article-title><source>Int. J. Appl. Earth Observ. Geoinf</source><year>2009</year><volume>11</volume><fpage>169</fpage><lpage>180</lpage><pub-id pub-id-type="doi">10.1016/j.jag.2009.01.004</pub-id></citation></ref>
<ref id="b4-sensors-09-07516"><label>4.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cloude</surname><given-names>S.R.</given-names></name><name><surname>Pottier</surname><given-names>E.</given-names></name></person-group><article-title>A review of target decomposition theorems in radar polarimetry</article-title><source>IEEE Trans. Geosci. Remote Sens</source><year>1996</year><volume>34</volume><fpage>498</fpage><lpage>518</lpage><pub-id pub-id-type="doi">10.1109/36.485127</pub-id></citation></ref>
<ref id="b5-sensors-09-07516"><label>5.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krogager</surname><given-names>E.</given-names></name></person-group><article-title>New decomposition of the radar target scattering matrix</article-title><source>Electron. Lett</source><year>1990</year><volume>26</volume><fpage>1525</fpage><lpage>1527</lpage><pub-id pub-id-type="doi">10.1049/el:19900979</pub-id></citation></ref>
<ref id="b6-sensors-09-07516"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Freeman</surname><given-names>A.</given-names></name><name><surname>Durden</surname><given-names>S.L.</given-names></name></person-group><article-title>A three-component scattering model for polarimetric SAR data</article-title><source>IEEE Trans. Geosci. Rem. Sens</source><year>1998</year><volume>36</volume><fpage>963</fpage><lpage>973</lpage><pub-id pub-id-type="doi">10.1109/36.673687</pub-id></citation></ref>
<ref id="b7-sensors-09-07516"><label>7.</label><citation citation-type="other"><person-group person-group-type="author"><name><surname>Huynen</surname><given-names>J.R.</given-names></name></person-group><article-title>Phenomenological Theory of Radar Targets</article-title><comment>PhD dissertation.</comment><publisher-name>University of Technology</publisher-name><publisher-loc>Delft, The Netherlands</publisher-loc><year>1970</year></citation></ref>
<ref id="b8-sensors-09-07516"><label>8.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Holm</surname><given-names>W.A.</given-names></name><name><surname>Barnes</surname><given-names>R.M.</given-names></name></person-group><article-title>On radar polarization mixed target state decomposition techniques</article-title><conf-name>Proc. of IEEE Radar Conference</conf-name><conf-loc>Ann Arbor, MI, USA</conf-loc><conf-date>1988</conf-date><fpage>249</fpage><lpage>254</lpage></citation></ref>
<ref id="b9-sensors-09-07516"><label>9.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cloude</surname><given-names>S.R.</given-names></name><name><surname>Pottier</surname><given-names>E.</given-names></name></person-group><article-title>An entropy based classification scheme for land applications of polarimetric SAR</article-title><source>IEEE Trans. Geosci. Remote Sens</source><year>1997</year><volume>35</volume><fpage>549</fpage><lpage>557</lpage></citation></ref>
<ref id="b10-sensors-09-07516"><label>10.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname><given-names>C.J.</given-names></name><name><surname>Chung</surname><given-names>I.F.</given-names></name><name><surname>Chen</surname><given-names>C.H.</given-names></name></person-group><article-title>An entropy-based quantum neuro-fuzzy inference system for classification applications</article-title><source>Neurocomputing</source><year>2007</year><volume>70</volume><fpage>2502</fpage><lpage>2516</lpage><pub-id pub-id-type="doi">10.1016/j.neucom.2006.08.008</pub-id></citation></ref>
<ref id="b11-sensors-09-07516"><label>11.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kido</surname><given-names>S.</given-names></name><name><surname>Tamura</surname><given-names>S.</given-names></name></person-group><article-title>Computerized classification of interstitial lung abnormalities on chest radiographs with normalized radiographic index and normalized fractal dimension</article-title><source>Eur. J. Radiol</source><year>2001</year><volume>37</volume><fpage>184</fpage><lpage>189</lpage><pub-id pub-id-type="doi">10.1016/S0720-048X(00)00296-5</pub-id><pub-id pub-id-type="pmid">11274847</pub-id></citation></ref>
<ref id="b12-sensors-09-07516"><label>12.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ranson</surname><given-names>K.J.</given-names></name><name><surname>Sun</surname><given-names>G.Q.</given-names></name></person-group><article-title>An evaluation of AIRSAR and SIR-C/X-SAR images for mapping northern forest attributes in Maine, USA</article-title><source>Remote Sens. Environ</source><year>1997</year><volume>59</volume><fpage>203</fpage><lpage>222</lpage><pub-id pub-id-type="doi">10.1016/S0034-4257(96)00154-X</pub-id></citation></ref>
<ref id="b13-sensors-09-07516"><label>13.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Avci</surname><given-names>E.</given-names></name><name><surname>Turkoglu</surname><given-names>I.</given-names></name><name><surname>Poyraz</surname><given-names>M.</given-names></name></person-group><article-title>Intelligent target recognition based on wavelet packet neural network</article-title><source>Expert Syst. Appl</source><year>2005</year><volume>29</volume><fpage>175</fpage><lpage>182</lpage><pub-id pub-id-type="doi">10.1016/j.eswa.2005.01.016</pub-id></citation></ref>
<ref id="b14-sensors-09-07516"><label>14.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Acqua</surname><given-names>F.D.</given-names></name><name><surname>Gamba</surname><given-names>P.</given-names></name><name><surname>Trianni</surname><given-names>G.</given-names></name></person-group><article-title>Semi-automatic choice of scale-dependent features for satellite SAR image classification</article-title><source>Patt. Recogn. Lett</source><year>2006</year><volume>27</volume><fpage>244</fpage><lpage>251</lpage><pub-id pub-id-type="doi">10.1016/j.patrec.2005.08.005</pub-id></citation></ref>
<ref id="b15-sensors-09-07516"><label>15.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cooper</surname><given-names>G.R.J.</given-names></name><name><surname>Cowan</surname><given-names>D.R.</given-names></name></person-group><article-title>The use of textural analysis to locate features in geophysical data</article-title><source>Comput. Geosci</source><year>2005</year><volume>31</volume><fpage>882</fpage><lpage>890</lpage><pub-id pub-id-type="doi">10.1016/j.cageo.2005.02.001</pub-id></citation></ref>
<ref id="b16-sensors-09-07516"><label>16.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khan</surname><given-names>K.U.</given-names></name><name><surname>Yang</surname><given-names>J.</given-names></name></person-group><article-title>Polarimetric synthetic aperture radar image classification by a hybrid method</article-title><source>Tsinghua Sci. Technol</source><year>2007</year><volume>12</volume><fpage>97</fpage><lpage>104</lpage><pub-id pub-id-type="doi">10.1016/S1007-0214(07)70015-9</pub-id></citation></ref>
<ref id="b17-sensors-09-07516"><label>17.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname><given-names>J.J.</given-names></name><name><surname>Kim</surname><given-names>D.</given-names></name><name><surname>Chang</surname><given-names>S.K.</given-names></name><name><surname>Nocete</surname><given-names>C.F.M.</given-names></name></person-group><article-title>An improved application technique of the adaptive probabilistic neural network for predicting concrete strength</article-title><source>Comput. Mater. Sci</source><year>2009</year><volume>44</volume><fpage>988</fpage><lpage>998</lpage><pub-id pub-id-type="doi">10.1016/j.commatsci.2008.07.012</pub-id></citation></ref>
<ref id="b18-sensors-09-07516"><label>18.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mostafa</surname><given-names>M.M.</given-names></name></person-group><article-title>Modeling the competitive market efficiency of Egyptian companies: A probabilistic neural network analysis</article-title><source>Expert Syst. Appl</source><year>2009</year><volume>36</volume><fpage>8839</fpage><lpage>8848</lpage><pub-id pub-id-type="doi">10.1016/j.eswa.2008.11.024</pub-id></citation></ref>
<ref id="b19-sensors-09-07516"><label>19.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Armand</surname><given-names>P.</given-names></name><name><surname>Benoist</surname><given-names>J.</given-names></name><name><surname>Bousquet</surname><given-names>E.</given-names></name><name><surname>Delage</surname><given-names>L.</given-names></name><name><surname>Olivier</surname><given-names>S.</given-names></name><name><surname>Reynaud</surname><given-names>F.</given-names></name></person-group><article-title>Optimization of a one dimensional hypertelescope for a direct imaging in astronomy</article-title><source>Eur. J. Oper. Res</source><year>2009</year><volume>195</volume><fpage>519</fpage><lpage>527</lpage><pub-id pub-id-type="doi">10.1016/j.ejor.2008.02.002</pub-id></citation></ref>
<ref id="b20-sensors-09-07516"><label>20.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luukka</surname><given-names>P.</given-names></name></person-group><article-title>Classification based on fuzzy robust PCA algorithms and similarity classifier</article-title><source>Expert Syst. Appl.</source><year>2009</year><volume>36</volume><fpage>7463</fpage><lpage>7468</lpage><pub-id pub-id-type="doi">10.1016/j.eswa.2008.09.015</pub-id></citation></ref>
<ref id="b21-sensors-09-07516"><label>21.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Pottier</surname><given-names>E.</given-names></name><name><surname>Cloude</surname><given-names>S.R.</given-names></name></person-group><article-title>Application of the H/A/α polarimetric decomposition theorems for land classification</article-title><conf-name>Proc. SPIE Conference on Wideband Interferometric Sensing and Imaging Polarimetry</conf-name><conf-loc>San Diego, CA, USA</conf-loc><conf-date>1997</conf-date><fpage>132</fpage><lpage>143</lpage></citation></ref>
<ref id="b22-sensors-09-07516"><label>22.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tien</surname><given-names>C.L.</given-names></name><name><surname>Lyu</surname><given-names>Y.R.</given-names></name><name><surname>Jyu</surname><given-names>S.S.</given-names></name></person-group><article-title>Surface flatness of optical thin films evaluated by gray level co-occurrence matrix and entropy</article-title><source>Appl. Surf. Sci</source><year>2008</year><volume>254</volume><fpage>4762</fpage><lpage>4767</lpage><pub-id pub-id-type="doi">10.1016/j.apsusc.2008.01.088</pub-id></citation></ref>
<ref id="b23-sensors-09-07516"><label>23.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>C.H.</given-names></name><name><surname>Chu</surname><given-names>C.T.</given-names></name></person-group><article-title>High performance iris recognition based on 1-D circular feature extraction and PSO–PNN classifier</article-title><source>Expert Syst. Appl</source><year>2009</year><volume>36</volume><fpage>10351</fpage><lpage>10356</lpage><pub-id pub-id-type="doi">10.1016/j.eswa.2009.01.033</pub-id></citation></ref>
<ref id="b24-sensors-09-07516"><label>24.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luukka</surname><given-names>P.</given-names></name></person-group><article-title>Classification based on fuzzy robust PCA algorithms and similarity classifier</article-title><source>Expert Syst. Appl</source><year>2009</year><volume>36</volume><fpage>7463</fpage><lpage>7468</lpage><pub-id pub-id-type="doi">10.1016/j.eswa.2008.09.015</pub-id></citation></ref>
<ref id="b25-sensors-09-07516"><label>25.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gupta</surname><given-names>K.K.</given-names></name><name><surname>Gupta</surname><given-names>R.</given-names></name></person-group><article-title>Despeckle and geographical feature extraction in SAR images by wavelet transform</article-title><source>ISPRS J. Photogramm. Remote Sens</source><year>2007</year><volume>62</volume><fpage>473</fpage><lpage>484</lpage><pub-id pub-id-type="doi">10.1016/j.isprsjprs.2007.06.001</pub-id></citation></ref>
<ref id="b26-sensors-09-07516"><label>26.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>K.S.</given-names></name><name><surname>Huang</surname><given-names>W.P.</given-names></name><name><surname>Tsay</surname><given-names>D.H.</given-names></name><name><surname>Amar</surname><given-names>F.</given-names></name></person-group><article-title>Classification of multifrequency polarimetric SAR imagery using a dynamic learning neural network</article-title><source>IEEE Trans. Geosci. Remote Sens</source><year>1996</year><volume>34</volume><fpage>814</fpage><lpage>820</lpage><pub-id pub-id-type="doi">10.1109/36.499786</pub-id></citation></ref></ref-list>
<sec sec-type="display-objects">
<title>Figures and Tables</title>
<fig id="f1-sensors-09-07516" position="float">
<label>Figure 1.</label>
<caption>
<p>Outline of PNN (R, Q, and K represent number of elements in input vector, input/target pairs, and classes of input data, respectively. IW and LW represent input weight and layer weight, respectively).</p></caption>
<graphic xlink:href="sensors-09-07516f1.gif"/></fig>
<fig id="f2-sensors-09-07516" position="float">
<label>Figure 2.</label>
<caption>
<p>The outline of our method.</p></caption>
<graphic xlink:href="sensors-09-07516f2.gif"/></fig>
<fig id="f3-sensors-09-07516" position="float">
<label>Figure 3.</label>
<caption>
<p>Using normalization before PCA.</p></caption>
<graphic xlink:href="sensors-09-07516f3.gif"/></fig>
<fig id="f4-sensors-09-07516" position="float">
<label>Figure 4.</label>
<caption>
<p>Pauli image of sub-area of San Francisco.</p></caption>
<graphic xlink:href="sensors-09-07516f4.gif"/></fig>
<fig id="f5-sensors-09-07516" position="float">
<label>Figure 5.</label>
<caption>
<p>Basic span image and three channels image.</p></caption>
<graphic xlink:href="sensors-09-07516f5.gif"/></fig>
<fig id="f6-sensors-09-07516" position="float">
<label>Figure 6.</label>
<caption>
<p>Parameters of H/A/Alpha decomposition.</p></caption>
<graphic xlink:href="sensors-09-07516f6a.gif"/>
<graphic xlink:href="sensors-09-07516f6b.gif"/></fig>
<fig id="f7-sensors-09-07516" position="float">
<label>Figure 7.</label>
<caption>
<p>GLCM-based features of <italic>T</italic><sub>11.</sub></p></caption>
<graphic xlink:href="sensors-09-07516f7.gif"/></fig>
<fig id="f8-sensors-09-07516" position="float">
<label>Figure 8.</label>
<caption>
<p>GLCM-based features of <italic>T</italic><sub>22.</sub></p></caption>
<graphic xlink:href="sensors-09-07516f8.gif"/></fig>
<fig id="f9-sensors-09-07516" position="float">
<label>Figure 9.</label>
<caption>
<p>GLCM-based features of <italic>T</italic><sub>33.</sub></p></caption>
<graphic xlink:href="sensors-09-07516f9.gif"/></fig>
<fig id="f10-sensors-09-07516" position="float">
<label>Figure 10.</label>
<caption>
<p>The curve of cumulative sum of variance with dimensions.</p></caption>
<graphic xlink:href="sensors-09-07516f10.gif"/></fig>
<fig id="f11-sensors-09-07516" position="float">
<label>Figure 11.</label>
<caption>
<p>Sample data of San Francisco (Red denotes sea, green urban areas, blue vegetated zones).</p></caption>
<graphic xlink:href="sensors-09-07516f11.gif"/></fig>
<fig id="f12-sensors-09-07516" position="float">
<label>Figure 12.</label>
<caption>
<p>The curve of error versus step.</p></caption>
<graphic xlink:href="sensors-09-07516f12.gif"/></fig>
<fig id="f13-sensors-09-07516" position="float">
<label>Figure 13.</label>
<caption>
<p>The curve of b versus step.</p></caption>
<graphic xlink:href="sensors-09-07516f13.gif"/></fig>
<fig id="f14-sensors-09-07516" position="float">
<label>Figure 14.</label>
<caption>
<p>Classification results of the whole image.</p></caption>
<graphic xlink:href="sensors-09-07516f14.gif"/></fig>
<fig id="f15-sensors-09-07516" position="float">
<label>Figure 15.</label>
<caption>
<p>Pauli Image of Flevoland (1024 × 750).</p></caption>
<graphic xlink:href="sensors-09-07516f15.gif"/></fig>
<fig id="f16-sensors-09-07516" position="float">
<label>Figure 16.</label>
<caption>
<p>Basic span image and three channels image.</p></caption>
<graphic xlink:href="sensors-09-07516f16.gif"/></fig>
<fig id="f17-sensors-09-07516" position="float">
<label>Figure 17.</label>
<caption>
<p>Parameters of H/A/Alpha decomposition.</p></caption>
<graphic xlink:href="sensors-09-07516f17.gif"/></fig>
<fig id="f18-sensors-09-07516" position="float">
<label>Figure 18.</label>
<caption>
<p>GLCM-based features of T<sub>11</sub>.</p></caption>
<graphic xlink:href="sensors-09-07516f18.gif"/></fig>
<fig id="f19-sensors-09-07516" position="float">
<label>Figure 19.</label>
<caption>
<p>GLCM-based features of T<sub>22</sub>.</p></caption>
<graphic xlink:href="sensors-09-07516f19.gif"/></fig>
<fig id="f20-sensors-09-07516" position="float">
<label>Figure 20.</label>
<caption>
<p>GLCM-based features of T<sub>33</sub>.</p></caption>
<graphic xlink:href="sensors-09-07516f20.gif"/></fig>
<fig id="f21-sensors-09-07516" position="float">
<label>Figure 21.</label>
<caption>
<p>The curve of cumulative sum of variance with dimensions.</p></caption>
<graphic xlink:href="sensors-09-07516f21.gif"/></fig>
<fig id="f22-sensors-09-07516" position="float">
<label>Figure 22.</label>
<caption>
<p>Sample data areas of Flevoland.</p></caption>
<graphic xlink:href="sensors-09-07516f22.gif"/></fig>
<fig id="f23-sensors-09-07516" position="float">
<label>Figure 23.</label>
<caption>
<p>The curve of error versus step.</p></caption>
<graphic xlink:href="sensors-09-07516f23.gif"/></fig>
<fig id="f24-sensors-09-07516" position="float">
<label>Figure 24.</label>
<caption>
<p>The curve of b versus step.</p></caption>
<graphic xlink:href="sensors-09-07516f24.gif"/></fig>
<fig id="f25-sensors-09-07516" position="float">
<label>Figure 25.</label>
<caption>
<p>Confusion matrix comparison on train area (values are given in percent) The overall accuracy is 93.71%.</p></caption>
<graphic xlink:href="sensors-09-07516f25.gif"/></fig>
<fig id="f26-sensors-09-07516" position="float">
<label>Figure 26.</label>
<caption>
<p>Confusion matrix comparison on test area (values are given in percent) The overall vccuracy is 86.2%.</p></caption>
<graphic xlink:href="sensors-09-07516f26.gif"/></fig>
<fig id="f27-sensors-09-07516" position="float">
<label>Figure 27.</label>
<caption>
<p>Classification Map of our method.</p></caption>
<graphic xlink:href="sensors-09-07516f27.gif"/></fig>
<fig id="f28-sensors-09-07516" position="float">
<label>Figure 28.</label>
<caption>
<p>Computation time with square width.</p></caption>
<graphic xlink:href="sensors-09-07516f28.gif"/></fig>
<fig id="f29-sensors-09-07516" position="float">
<label>Figure 29.</label>
<caption>
<p>The overall accuracy versus square width.</p></caption>
<graphic xlink:href="sensors-09-07516f29.gif"/></fig>
<table-wrap id="t1-sensors-09-07516" position="float">
<label>Table 1.</label>
<caption>
<p>Pauli bases and their corresponding meanings.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom"><bold>Pauli Bases</bold></th>
<th align="center" valign="bottom"><bold>Meaning</bold></th></tr></thead>
<tbody>
<tr>
<td align="center" valign="middle">S<sub>a</sub></td>
<td align="center" valign="middle">Single- or odd-bounce scattering</td></tr>
<tr>
<td align="center" valign="middle">S<sub>b</sub></td>
<td align="center" valign="middle">Double- or even-bounce scattering</td></tr>
<tr>
<td align="center" valign="middle">S<sub>c</sub></td>
<td align="center" valign="middle">Those scatterers which are able to return the orthogonal polarization to the one of the incident wave (forest canopy)</td></tr></tbody></table></table-wrap>
<table-wrap id="t2-sensors-09-07516" position="float">
<label>Table 2.</label>
<caption>
<p>Properties of GLCM.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom"><bold>Property</bold></th>
<th align="center" valign="bottom"><bold>Description</bold></th>
<th align="center" valign="bottom"><bold>Formula</bold></th></tr></thead>
<tbody>
<tr>
<td align="center" valign="middle">Contrast</td>
<td align="center" valign="middle">Intensity contrast between a pixel and its neighbor</td>
<td align="center" valign="middle">
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi></mml:mrow></mml:munder>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>j</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:mi>p</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula></td></tr>
<tr>
<td align="center" valign="middle">Correlation</td>
<td align="center" valign="middle">Correlation between a pixel and its neighbor (μ denotes the expected value, and σ the standard variance)</td>
<td align="center" valign="middle">
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi></mml:mrow></mml:munder>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>μ</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>μ</mml:mi></mml:mrow>
<mml:mi>j</mml:mi></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mi>p</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>σ</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>σ</mml:mi></mml:mrow>
<mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:mrow></mml:math></disp-formula></td></tr>
<tr>
<td align="center" valign="middle">Energy</td>
<td align="center" valign="middle">Energy of the whole image</td>
<td align="center" valign="middle">
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi></mml:mrow></mml:munder>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:math></disp-formula></td></tr>
<tr>
<td align="center" valign="middle">Homogeneity</td>
<td align="center" valign="middle">Closeness of the distribution of GLCM to the diagonal</td>
<td align="center" valign="middle">
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi></mml:mrow></mml:munder>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo stretchy="false">|</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:mrow></mml:math></disp-formula></td></tr></tbody></table></table-wrap>
<table-wrap id="t3-sensors-09-07516" position="float">
<label>Table 3.</label>
<caption>
<p>Detailed data of PCA on 19 features.</p></caption>
<table frame="hsides" rules="groups">
<tbody>
<tr>
<td align="center" valign="top"><bold>Dimensions</bold></td>
<td align="center" valign="top">1</td>
<td align="center" valign="top">2</td>
<td align="center" valign="top">3</td>
<td align="center" valign="top">4</td>
<td align="center" valign="top">5</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">9</td></tr>
<tr>
<td align="center" valign="top"><bold>Variance (%)</bold></td>
<td align="center" valign="top">37.97</td>
<td align="center" valign="top">50.81</td>
<td align="center" valign="top">60.21</td>
<td align="center" valign="top">68.78</td>
<td align="center" valign="top">77.28</td>
<td align="center" valign="top">82.75</td>
<td align="center" valign="top">86.27</td>
<td align="center" valign="top">89.30</td>
<td align="center" valign="top">92.27</td></tr>
<tr>
<td align="left" valign="bottom" colspan="10">
<hr/></td></tr>
<tr>
<td align="center" valign="top"><bold>Dimensions</bold></td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">14</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">16</td>
<td align="center" valign="top">17</td>
<td align="center" valign="top">18</td></tr>
<tr>
<td align="center" valign="top"><bold>Variance (%)</bold></td>
<td align="center" valign="top">94.63</td>
<td align="center" valign="top">96.36</td>
<td align="center" valign="top">97.81</td>
<td align="center" valign="top">98.60</td>
<td align="center" valign="top">99.02</td>
<td align="center" valign="top">99.37</td>
<td align="center" valign="top">99.62</td>
<td align="center" valign="top">99.80</td>
<td align="center" valign="top">99.92</td></tr></tbody></table></table-wrap>
<table-wrap id="t4-sensors-09-07516" position="float">
<label>Table 4.</label>
<caption>
<p>Comparison of confusion matrix. (O denotes the output class, T denotes the target class).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th colspan="2" align="left" valign="top" rowspan="2"/>
<th colspan="3" align="center" valign="bottom"><bold>Training Area</bold></th>
<th colspan="3" align="center" valign="bottom"><bold>Testing Area</bold></th></tr>
<tr>
<th align="center" valign="bottom"><bold>Sea(T)</bold></th>
<th align="center" valign="bottom"><bold>Urb(T)</bold></th>
<th align="center" valign="bottom"><bold>Veg(T)</bold></th>
<th align="center" valign="bottom"><bold>Sea(T)</bold></th>
<th align="center" valign="bottom"><bold>Urb(T)</bold></th>
<th align="center" valign="bottom"><bold>Veg(T)</bold></th></tr>
<tr>
<th align="left" valign="bottom" colspan="8">
<hr/></th></tr></thead>
<tbody>
<tr>
<td align="center" valign="middle" rowspan="6"><bold>3-layer BPNN</bold></td>
<td align="center" valign="middle" rowspan="2"><bold>Sea(O)</bold></td>
<td align="center" valign="middle">7158</td>
<td align="center" valign="middle">4</td>
<td align="center" valign="middle">60</td>
<td align="center" valign="middle">3600</td>
<td align="center" valign="middle">42</td>
<td align="center" valign="middle">5</td></tr>
<tr>
<td align="center" valign="middle">33.1%</td>
<td align="center" valign="middle">0.0%</td>
<td align="center" valign="middle">0.3%</td>
<td align="center" valign="middle">33.3%</td>
<td align="center" valign="middle">0.4%</td>
<td align="center" valign="middle">0.0%</td></tr>
<tr>
<td align="center" valign="middle" rowspan="2"><bold>Urb(O)</bold></td>
<td align="center" valign="middle">0</td>
<td align="center" valign="middle">6882</td>
<td align="center" valign="middle">136</td>
<td align="center" valign="middle">0</td>
<td align="center" valign="middle">3429</td>
<td align="center" valign="middle">355</td></tr>
<tr>
<td align="center" valign="middle">0%</td>
<td align="center" valign="middle">31.9%</td>
<td align="center" valign="middle">0.6%</td>
<td align="center" valign="middle">0.0%</td>
<td align="center" valign="middle">31.7%</td>
<td align="center" valign="middle">3.3%</td></tr>
<tr>
<td align="center" valign="middle" rowspan="2"><bold>Veg(O)</bold></td>
<td align="center" valign="middle">42</td>
<td align="center" valign="middle">314</td>
<td align="center" valign="middle">7004</td>
<td align="center" valign="middle">0</td>
<td align="center" valign="middle">129</td>
<td align="center" valign="middle">3240</td></tr>
<tr>
<td align="center" valign="middle">0.2%</td>
<td align="center" valign="middle">1.4%</td>
<td align="center" valign="middle">32.4%</td>
<td align="center" valign="middle">0.0%</td>
<td align="center" valign="middle">1.2%</td>
<td align="center" valign="middle">30.0%</td></tr>
<tr>
<td colspan="8" align="left" valign="top">
<hr/></td></tr>
<tr>
<td align="center" valign="middle" rowspan="6"><bold>Our Method</bold></td>
<td align="center" valign="middle" rowspan="2"><bold>Sea(O)</bold></td>
<td align="center" valign="middle">7150</td>
<td align="center" valign="middle">0</td>
<td align="center" valign="middle">76</td>
<td align="center" valign="middle">3597</td>
<td align="center" valign="middle">33</td>
<td align="center" valign="middle">0</td></tr>
<tr>
<td align="center" valign="middle">33.1%</td>
<td align="center" valign="middle">0.0%</td>
<td align="center" valign="middle">0.4%</td>
<td align="center" valign="middle">33.3%</td>
<td align="center" valign="middle">0.3%</td>
<td align="center" valign="middle">0.0%</td></tr>
<tr>
<td align="center" valign="middle" rowspan="2"><bold>Urb(O)</bold></td>
<td align="center" valign="middle">2</td>
<td align="center" valign="middle">7074</td>
<td align="center" valign="middle">74</td>
<td align="center" valign="middle">0</td>
<td align="center" valign="middle">3445</td>
<td align="center" valign="middle">354</td></tr>
<tr>
<td align="center" valign="middle">0%</td>
<td align="center" valign="middle">32.8%</td>
<td align="center" valign="middle">0.3%</td>
<td align="center" valign="middle">0.0%</td>
<td align="center" valign="middle">31.9%</td>
<td align="center" valign="middle">3.3%</td></tr>
<tr>
<td align="center" valign="middle" rowspan="2"><bold>Veg(O)</bold></td>
<td align="center" valign="middle">48</td>
<td align="center" valign="middle">126</td>
<td align="center" valign="middle">7050</td>
<td align="center" valign="middle">3</td>
<td align="center" valign="middle">122</td>
<td align="center" valign="middle">3246</td></tr>
<tr>
<td align="center" valign="middle">0.2%</td>
<td align="center" valign="middle">0.6%</td>
<td align="center" valign="middle">32.6%</td>
<td align="center" valign="middle">0.0%</td>
<td align="center" valign="middle">1.1%</td>
<td align="center" valign="middle">30.1%</td></tr></tbody></table></table-wrap>
<table-wrap id="t5-sensors-09-07516" position="float">
<label>Table 5.</label>
<caption>
<p>Overall accuracies (values are given in percent).</p></caption>
<table frame="hsides" rules="rows">
<thead>
<tr>
<th align="left" valign="top"/>
<th align="center" valign="bottom"><bold>Training Area</bold></th>
<th align="center" valign="bottom"><bold>Testing Area</bold></th></tr></thead>
<tbody>
<tr>
<td align="center" valign="bottom"><bold>3-layer BPNN</bold></td>
<td align="center" valign="bottom">97.4%</td>
<td align="center" valign="bottom">95.1%</td></tr>
<tr>
<td align="center" valign="bottom"><bold>Our Method</bold></td>
<td align="center" valign="bottom">98.5%</td>
<td align="center" valign="bottom">95.3%</td></tr></tbody></table></table-wrap>
<table-wrap id="t6-sensors-09-07516" position="float">
<label>Table 6.</label>
<caption>
<p>Detailed data of PCA on 19 features.</p></caption>
<table frame="hsides" rules="groups">
<tbody>
<tr>
<td align="center" valign="top"><bold>Dimensions</bold></td>
<td align="center" valign="top">1</td>
<td align="center" valign="top">2</td>
<td align="center" valign="top">3</td>
<td align="center" valign="top">4</td>
<td align="center" valign="top">5</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">9</td></tr>
<tr>
<td align="center" valign="top"><bold>Variance (%)</bold></td>
<td align="center" valign="top">26.31</td>
<td align="center" valign="top">42.98</td>
<td align="center" valign="top">52.38</td>
<td align="center" valign="top">60.50</td>
<td align="center" valign="top">67.28</td>
<td align="center" valign="top">73.27</td>
<td align="center" valign="top">78.74</td>
<td align="center" valign="top">82.61</td>
<td align="center" valign="top">86.25</td></tr>
<tr>
<td align="left" valign="bottom" colspan="10">
<hr/></td></tr>
<tr>
<td align="center" valign="top"><bold>Dimensions</bold></td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top"><bold>13</bold></td>
<td align="center" valign="top">14</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">16</td>
<td align="center" valign="top">17</td>
<td align="center" valign="top">18</td></tr>
<tr>
<td align="center" valign="top"><bold>Variance (%)</bold></td>
<td align="center" valign="top">89.52</td>
<td align="center" valign="top">92.72</td>
<td align="center" valign="top">95.50</td>
<td align="center" valign="top"><bold>98.06</bold></td>
<td align="center" valign="top">98.79</td>
<td align="center" valign="top">99.24</td>
<td align="center" valign="top">99.63</td>
<td align="center" valign="top">99.94</td>
<td align="center" valign="top">99.97</td></tr></tbody></table></table-wrap>
<table-wrap id="t7-sensors-09-07516" position="float">
<label>Table 7.</label>
<caption>
<p>Comparison of PNNs using polarimetric feature set, texture feature set, and combined feature set (TR denotes Classification Accuracy of Total Random).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th colspan="2" align="center" valign="middle"><bold>Site</bold></th>
<th align="center" valign="middle"><bold>Polarimetric feature set</bold></th>
<th align="center" valign="middle"><bold>Texture feature set</bold></th>
<th align="center" valign="middle"><bold>Combined feature set</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="2"><bold><italic>San Francisco (TR=33.3%)</italic></bold></td>
<td align="center" valign="middle"><bold><italic>Training Area</italic></bold></td>
<td align="center" valign="middle"><italic>97.1%</italic></td>
<td align="center" valign="middle"><italic>59.9%</italic></td>
<td align="center" valign="middle"><italic>98.5%</italic></td></tr>
<tr>
<td align="center" valign="middle"><bold><italic>Test Area</italic></bold></td>
<td align="center" valign="middle"><italic>87.4%</italic></td>
<td align="center" valign="middle"><italic>45.9%</italic></td>
<td align="center" valign="middle"><italic>95.3%</italic></td></tr>
<tr>
<td align="left" valign="top" rowspan="2"><bold><italic>Flevoland (TR=7.69%)</italic></bold></td>
<td align="center" valign="middle"><bold><italic>Training Area</italic></bold></td>
<td align="center" valign="middle"><italic>92.2%</italic></td>
<td align="center" valign="middle"><italic>48.0%</italic></td>
<td align="center" valign="middle"><italic>93.7%</italic></td></tr>
<tr>
<td align="center" valign="middle"><bold><italic>Test Area</italic></bold></td>
<td align="center" valign="middle"><italic>72.2%</italic></td>
<td align="center" valign="middle"><italic>24.1%</italic></td>
<td align="center" valign="middle"><italic>86.2%</italic></td></tr></tbody></table></table-wrap>
<table-wrap id="t8-sensors-09-07516" position="float">
<label>Table 8.</label>
<caption>
<p>Comparison of PNN with and without our weights/biases setting (RD denotes Random Division).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="middle" rowspan="2"><bold>Area Size</bold></th>
<th colspan="3" align="center" valign="middle"><bold>Computation Time</bold></th>
<th colspan="2" align="center" valign="middle"><bold>Overall Accuracy</bold></th></tr>
<tr>
<th align="center" valign="middle"><bold>Without RD</bold></th>
<th align="center" valign="middle"><bold>With RD</bold></th>
<th align="center" valign="middle"><bold>Ratio</bold></th>
<th align="center" valign="middle"><bold>Without RD</bold></th>
<th align="center" valign="middle"><bold>With RD</bold></th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top"><bold>10 × 10</bold></td>
<td align="center" valign="top">1.0818</td>
<td align="center" valign="top">0.0231</td>
<td align="center" valign="top">46.8</td>
<td align="center" valign="top">94.8%</td>
<td align="center" valign="top">94.9%</td></tr>
<tr>
<td align="center" valign="top"><bold>20 × 20</bold></td>
<td align="center" valign="top">4.0803</td>
<td align="center" valign="top">0.0386</td>
<td align="center" valign="top">105.7</td>
<td align="center" valign="top">95.5%</td>
<td align="center" valign="top">95.5%</td></tr>
<tr>
<td align="center" valign="top"><bold>30 × 30</bold></td>
<td align="center" valign="top">22.4270</td>
<td align="center" valign="top">0.0751</td>
<td align="center" valign="top">298.6</td>
<td align="center" valign="top">96.3%</td>
<td align="center" valign="top">96.2%</td></tr>
<tr>
<td align="center" valign="top"><bold>40 × 40</bold></td>
<td align="center" valign="top">58.1409</td>
<td align="center" valign="top">0.1125</td>
<td align="center" valign="top">516.8</td>
<td align="center" valign="top">95.9%</td>
<td align="center" valign="top">95.4%</td></tr></tbody></table></table-wrap></sec></back></article>
