<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Sensors</journal-id>
<journal-title>Sensors</journal-title>
<issn pub-type="epub">1424-8220</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/s110504721</article-id>
<article-id pub-id-type="publisher-id">sensors-11-04721</article-id>
<article-categories>
<subj-group>
<subject>Article</subject></subj-group></article-categories>
<title-group>
<article-title>Crop Classification by Forward Neural Network with Adaptive Chaotic Particle Swarm Optimization</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Zhang</surname><given-names>Yudong</given-names></name></contrib>
<contrib contrib-type="author">
<name><surname>Wu</surname><given-names>Lenan</given-names></name><xref ref-type="corresp" rid="c1-sensors-11-04721"><sup>*</sup></xref></contrib>
<aff id="af1-sensors-11-04721">School of Information Science and Engineering, Southeast University, Nanjing 210096, China; E-Mail: <email>zhangyudongnuaa@gmail.com</email></aff></contrib-group>
<author-notes>
<corresp id="c1-sensors-11-04721">
<label>*</label>Author to whom correspondence should be addressed; E-Mail: <email>wuln@seu.cn</email>; Tel.: +86-1973-8667-581.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2011</year></pub-date>
<pub-date pub-type="epub">
<day>2</day>
<month>5</month>
<year>2011</year></pub-date>
<volume>11</volume>
<fpage>4721</fpage>
<lpage>4743</lpage>
<history>
<date date-type="received">
<day>10</day>
<month>3</month>
<year>2011</year></date>
<date date-type="rev-recd">
<day>24</day>
<month>4</month>
<year>2011</year></date>
<date date-type="accepted">
<day>26</day>
<month>4</month>
<year>2011</year></date></history>
<permissions>
<copyright-statement>© 2011 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
<copyright-year>2011</copyright-year>
<license>
<p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>This paper proposes a hybrid crop classifier for polarimetric synthetic aperture radar (SAR) images. The feature sets consisted of span image, the H/A/α decomposition, and the gray-level co-occurrence matrix (GLCM) based texture features. Then, the features were reduced by principle component analysis (PCA). Finally, a two-hidden-layer forward neural network (NN) was constructed and trained by adaptive chaotic particle swarm optimization (ACPSO). <italic>K</italic>-fold cross validation was employed to enhance generation. The experimental results on Flevoland sites demonstrate the superiority of ACPSO to back-propagation (BP), adaptive BP (ABP), momentum BP (MBP), Particle Swarm Optimization (PSO), and Resilient back-propagation (RPROP) methods. Moreover, the computation time for each pixel is only 1.08 × 10<sup>−7</sup> s.</p></abstract>
<kwd-group>
<kwd>artificial neural network</kwd>
<kwd>synthetic aperture radar</kwd>
<kwd>principle component analysis</kwd>
<kwd>particle swarm optimization</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>The classification of different objects, as well as different terrain characteristics, with single channel monopolarisation SAR images can carry a significant amount of error, even when operating after multilooking [<xref ref-type="bibr" rid="b1-sensors-11-04721">1</xref>]. One of the most challenging applications of polarimetry in remote sensing is landcover classification using fully polarimetric SAR (PolSAR) images [<xref ref-type="bibr" rid="b2-sensors-11-04721">2</xref>].</p>
<p>The Wishart maximum likelihood (WML) method has often been used for PolSAR classification [<xref ref-type="bibr" rid="b3-sensors-11-04721">3</xref>]. However, it does not take explicitly into consideration the phase information contained within polarimetric data, which plays a direct role in the characterization of a broad range of scattering processes. Furthermore, the covariance or coherency matrices are determined after spatial averaging and therefore can only describe stochastic scattering processes while certain objects, such as man-made objects, are better characterized at pixel-level [<xref ref-type="bibr" rid="b4-sensors-11-04721">4</xref>].</p>
<p>To overcome above shortcomings, polarimetric decompositions were introduced with an aim at establishing a correspondence between the physical characteristics of the considered areas and the observed scattering mechanisms. The most effective method is the Cloude decomposition, also known as H/A/α method [<xref ref-type="bibr" rid="b5-sensors-11-04721">5</xref>]. Recently, texture information has been extracted, and used as a parameter to enhance the classification results. The gray-level co-occurrence matrices (GLCM) were already successfully applied to classification problems [<xref ref-type="bibr" rid="b6-sensors-11-04721">6</xref>]. We choose the combination of H/A/α and GLCM as the parameter set of our study.</p>
<p>In order to reduce the feature vector dimensions obtained by H/A/α and GLCM, and to increase the discriminative power, the principal component analysis (PCA) method was employed. PCA is appealing since it effectively reduces the dimensionality of the feature and therefore reduces the computational cost.</p>
<p>The next problem is how to choose the best classifier. In the past years, standard multi-layered feed-forward neural networks (FNN) have been applied for SAR image classification [<xref ref-type="bibr" rid="b7-sensors-11-04721">7</xref>]. FNNs are effective classifiers since they do not involve complex models and equations as compared to traditional regression analysis. In addition, they can easily adapt to new data through a re-training process.</p>
<p>However, NNs suffer from converging too slowly and being easily trapped into local extrema if a back propagation (BP) algorithm is used for training [<xref ref-type="bibr" rid="b8-sensors-11-04721">8</xref>]. Genetic algorithm (GA) [<xref ref-type="bibr" rid="b9-sensors-11-04721">9</xref>] has shown promising results in searching optimal weights of NN. Besides GA, Tabu search (TS) [<xref ref-type="bibr" rid="b10-sensors-11-04721">10</xref>], Particle Swarm Optimization (PSO) [<xref ref-type="bibr" rid="b11-sensors-11-04721">11</xref>], and Bacterial Chemotaxis Optimization (BCO) [<xref ref-type="bibr" rid="b12-sensors-11-04721">12</xref>] have also been reported. However, GA, TS, and BCO have expensive computational demands, while PSO is well-known for its lower computation cost, and the most attractive feature of PSO is that it requires less computational bookkeeping and a few lines of implementation codes.</p>
<p>In order to improve the performance of PSO, an adaptive chaotic PSO (ACPSO) method was proposed. In order to prevent overfitting, cross-validation was employed, which is a technique for assessing how the results of a statistical analysis will generalize to an independent data set and is mainly used to estimate how accurately a predictive model will perform in practice [<xref ref-type="bibr" rid="b13-sensors-11-04721">13</xref>]. One round of cross-validation involves partitioning a sample of data into complementary subsets, performing the analysis on one subset (called the training set), and validating the analysis on the other subset (called the validation set) [<xref ref-type="bibr" rid="b14-sensors-11-04721">14</xref>]. To reduce variability, multiple rounds of cross-validation are performed using different partitions, and the validation results are averaged over the rounds [<xref ref-type="bibr" rid="b15-sensors-11-04721">15</xref>].</p>
<p>The structure of this paper is as follows: In the next Section 2 the concept of Pauli decomposition was introduced. Section 3 presents the span image, the H/A/α decomposition, the feature derived from GLCM, and the principle component analysis for feature reduction. Section 4 introduces the forward neural network, proposed the ACPSO for training, and discussed the importance of using <italic>k</italic>-fold cross validation. Section 5 uses the NASA/JPL AIRSAR image of Flevoland site to show our proposed ACPSO outperforms traditional BP, adaptive BP, BP with momentum, PSO, and RPROP algorithms. Final Section 6 is devoted to conclusion.</p></sec>
<sec>
<label>2.</label>
<title>Pauli Decomposition</title>
<sec sec-type="intro">
<label>2.1.</label>
<title>Basic Introduction</title>
<p>The features are derived from the multilook coherence matrix of the PolSAR data [<xref ref-type="bibr" rid="b5-sensors-11-04721">5</xref>]. Suppose:
<disp-formula id="FD1">
<label>(1)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>v</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mi>v</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>v</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>v</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mi>v</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>stands for the measured scattering matrix. Here <italic>S</italic><italic><sub>qp</sub></italic> represents the scattering coefficients of the targets, <italic>p</italic> the polarization of the incident field, <italic>q</italic> the polarization of the scattered field. <italic>S</italic><italic><sub>hv</sub></italic> equals to <italic>S</italic><italic><sub>vh</sub></italic> since reciprocity applies in a monostatic system configuration.</p>
<p>The Pauli decomposition expresses the scattering matrix <italic>S</italic> in the so-called Pauli basis, which is given by the following three 2 × 2 matrices:
<disp-formula id="FD2">
<label>(2)</label>
<mml:math display="block">
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>a</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msqrt>
<mml:mn>2</mml:mn></mml:msqrt></mml:mrow></mml:mfrac>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>b</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msqrt>
<mml:mn>2</mml:mn></mml:msqrt></mml:mrow></mml:mfrac>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>c</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msqrt>
<mml:mn>2</mml:mn></mml:msqrt></mml:mrow></mml:mfrac>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>Thus, <italic>S</italic> can be expressed as:
<disp-formula id="FD3">
<label>(3)</label>
<mml:math display="block">
<mml:mi>S</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>a</mml:mi>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>a</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>+</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>b</mml:mi>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>b</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>+</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>c</mml:mi>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>c</mml:mi></mml:msub></mml:math></disp-formula>where:
<disp-formula id="FD4">
<label>(4)</label>
<mml:math display="block">
<mml:mi>a</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>h</mml:mi></mml:mrow></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>+</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mi>v</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mrow>
<mml:msqrt>
<mml:mn>2</mml:mn></mml:msqrt></mml:mrow></mml:mfrac>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>b</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>h</mml:mi></mml:mrow></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>−</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mi>v</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mrow>
<mml:msqrt>
<mml:mn>2</mml:mn></mml:msqrt></mml:mrow></mml:mfrac>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>c</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:msqrt>
<mml:mn>2</mml:mn></mml:msqrt>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>v</mml:mi></mml:mrow></mml:msub></mml:math></disp-formula></p>
<p>An RGB image could be formed with the intensities |<italic>a</italic>|<sup>2</sup>, |<italic>b</italic>|<sup>2</sup>, |<italic>c</italic>|<sup>2</sup>. The meanings of <italic>S</italic><italic><sub>a</sub></italic>, <italic>S</italic><italic><sub>b</sub></italic>, and <italic>S</italic><italic><sub>c</sub></italic> are listed in <xref ref-type="table" rid="t1-sensors-11-04721">Table 1</xref>.</p></sec>
<sec>
<label>2.2.</label>
<title>Coherence Matrix</title>
<p>The coherence matrix is obtained as [<xref ref-type="bibr" rid="b16-sensors-11-04721">16</xref>]:
<disp-formula id="FD5">
<label>(5)</label>
<mml:math display="block">
<mml:mi>T</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">[</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>b</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>c</mml:mi>
<mml:mo stretchy="false">]</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>b</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>c</mml:mi>
<mml:mo stretchy="false">]</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow></mml:msup>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>11</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>13</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow>
<mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>22</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>23</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>13</mml:mn></mml:mrow>
<mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>23</mml:mn></mml:mrow>
<mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>33</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>The average of multiple single-look coherence matrices is the multi-look coherence matrix. (<italic>T</italic><sub>11</sub>, <italic>T</italic><sub>22</sub>, <italic>T</italic><sub>33</sub>) usually are regarded as the channels of the PolSAR images.</p></sec></sec>
<sec>
<label>3.</label>
<title>Feature Extraction and Reduction</title>
<p>The proposed features can be divided into three types, which are explained below.</p>
<sec>
<label>3.1.</label>
<title>Span</title>
<p>The span or total scattered power is given by:
<disp-formula id="FD6">
<label>(6)</label>
<mml:math display="block">
<mml:mi>M</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:mo> </mml:mo>
<mml:mo>+</mml:mo>
<mml:mo> </mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mi>v</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:mo> </mml:mo>
<mml:mo>+</mml:mo>
<mml:mo> </mml:mo>
<mml:mn>2</mml:mn>
<mml:msup>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>v</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:math></disp-formula>which indicates the power received by a fully polarimetric system.</p></sec>
<sec>
<label>3.2.</label>
<title>H/A/Alpha Decomposition</title>
<p>H/A/α decomposition is designed to identify in an unsupervised way polarimetric scattering mechanisms in the <italic>H</italic>-α plane [<xref ref-type="bibr" rid="b5-sensors-11-04721">5</xref>]. The method extends the two assumptions of traditional ways [<xref ref-type="bibr" rid="b17-sensors-11-04721">17</xref>]: (1) azimuthally symmetric targets; (2) equal minor eigenvalues <italic>λ</italic><sub>2</sub> and <italic>λ</italic><sub>3</sub>. <italic>T</italic> can be rewritten as:
<disp-formula id="FD7">
<label>(7)</label>
<mml:math display="block">
<mml:mi>T</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>U</mml:mi>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>λ</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>λ</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>λ</mml:mi>
<mml:mn>3</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow>
<mml:mo> </mml:mo>
<mml:msubsup>
<mml:mi>U</mml:mi>
<mml:mn>3</mml:mn>
<mml:mi>H</mml:mi></mml:msubsup></mml:math></disp-formula>where:
<disp-formula id="FD8">
<label>(8)</label>
<mml:math display="block">
<mml:msub>
<mml:mi>U</mml:mi>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtext>cos</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>cos</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>cos</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mn>3</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>cos</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>β</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mi>δ</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>cos</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>β</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mi>δ</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>cos</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>β</mml:mi>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mi>δ</mml:mi>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>β</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mi>γ</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>β</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mi>γ</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>sin</mml:mtext>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>β</mml:mi>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mi>γ</mml:mi>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>Then, the pseudo-probabilities of the <italic>T</italic> matrix expansion elements are defined as:
<disp-formula id="FD9">
<label>(9)</label>
<mml:math display="block">
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mfrac>
<mml:mi>λ</mml:mi>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>3</mml:mn></mml:msubsup>
<mml:mrow>
<mml:msub>
<mml:mi>λ</mml:mi>
<mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:mrow></mml:mfrac></mml:math></disp-formula></p>
<p>The entropy [<xref ref-type="bibr" rid="b18-sensors-11-04721">18</xref>] indicates the degree of statistical disorder of the scattering phenomenon. It can be defined as:
<disp-formula id="FD10">
<label>(10)</label>
<mml:math display="block">
<mml:mi>H</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>3</mml:mn></mml:munderover>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>log</mml:mtext></mml:mrow>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mn>0</mml:mn>
<mml:mo> </mml:mo>
<mml:mo>≤</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>H</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>≤</mml:mo>
<mml:mo> </mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:math></disp-formula></p>
<p>For high entropy values, a complementary parameter (anisotropy) [<xref ref-type="bibr" rid="b1-sensors-11-04721">1</xref>] is necessary to fully characterize the set of probabilities. The anisotropy is defined as the relative importance of the second scattering mechanisms [<xref ref-type="bibr" rid="b19-sensors-11-04721">19</xref>]:
<disp-formula id="FD11">
<label>(11)</label>
<mml:math display="block">
<mml:mi>A</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>−</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mn>3</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>+</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mn>3</mml:mn></mml:msub></mml:mrow></mml:mfrac>
<mml:mo> </mml:mo>
<mml:mn>0</mml:mn>
<mml:mo> </mml:mo>
<mml:mo>≤</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>A</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>≤</mml:mo>
<mml:mo> </mml:mo>
<mml:mn>1</mml:mn></mml:math></disp-formula></p>
<p>The four estimates of the angles are easily evaluated as:
<disp-formula id="FD12">
<label>(12)</label>
<mml:math display="block">
<mml:mo stretchy="false">[</mml:mo>
<mml:mover accent="true">
<mml:mi>α</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mover accent="true">
<mml:mi>β</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mover accent="true">
<mml:mi>δ</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mover accent="true">
<mml:mi>γ</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo stretchy="false">]</mml:mo>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>3</mml:mn></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">[</mml:mo>
<mml:mi>α</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>β</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>δ</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>γ</mml:mi>
<mml:mo stretchy="false">]</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>Thus, vectors from coherence matrix can be represented as (<italic>H</italic>, <italic>A</italic>, <italic>ᾱ</italic>, <italic>β̄</italic>, <italic>δ̄</italic>, <italic>γ̄</italic>).</p></sec>
<sec>
<label>3.3.</label>
<title>Texture Features</title>
<p>Gray level co-occurrence matrix (GLCM) is a text descriptor which takes into account the specific position of a pixel relative to another. The GLCM is a matrix whose elements correspond to the relative frequency of occurrence of pairs of gray level values of pixels separated by a certain distance in a given direction [<xref ref-type="bibr" rid="b20-sensors-11-04721">20</xref>]. Formally, the elements of a GLCM <italic>G</italic>(<italic>i</italic>,<italic>j</italic>) for a displacement vector (<italic>a</italic>,<italic>b</italic>) is defined as:
<disp-formula id="FD13">
<label>(13)</label>
<mml:math display="block">
<mml:mi>G</mml:mi>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>j</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">|</mml:mo>
<mml:mo stretchy="false">{</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>y</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>v</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo> </mml:mo>
<mml:mo>:</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>I</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>i</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>&amp;</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>I</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>v</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>j</mml:mi>
<mml:mo stretchy="false">}</mml:mo>
<mml:mo stretchy="false">|</mml:mo></mml:math></disp-formula>where (<italic>t</italic>,<italic>v</italic>) = (<italic>x</italic> + <italic>a</italic>, <italic>y</italic> + <italic>b</italic>), and |•| denotes the cardinality of a set. The displacement vector (<italic>a</italic>,<italic>b</italic>) can be rewritten as (<italic>d</italic>, <italic>θ</italic>) in polar coordinates.</p>
<p>GLCMs are suggested to be calculated from four displacement vectors with <italic>d</italic> = 1 and <italic>θ</italic> = 0°, 45°, 90°, and 135° respectively. In this study, the (<italic>a</italic>, <italic>b</italic>) are chosen as (0,1), (−1,1), (−1,0), and (−1,–1) respectively, and the corresponding GLCMs are averaged. The four features are extracted from normalized GLCMs, and their sum equals to 1. Suppose the normalized GLCM value at (<italic>i</italic>,<italic>j</italic>) is <italic>p</italic>(<italic>i</italic>,<italic>j</italic>), and their detailed definition are listed in <xref ref-type="table" rid="t2-sensors-11-04721">Table 2</xref>.</p></sec>
<sec>
<label>3.4.</label>
<title>Total Features</title>
<p>The texture features consist of 4 GLCM-based features, which should be multiplied by 3 since there are three channels (<italic>T</italic><sub>11</sub>, <italic>T</italic><sub>22</sub>, <italic>T</italic><sub>33</sub>). In addition, there are one span feature, and six <italic>H</italic>/α parameters. In all, the number of total features is 1 + 6 + 4 × 3 = 19.</p></sec>
<sec sec-type="methods">
<label>3.5.</label>
<title>Principal Component Analysis</title>
<p>PCA is an efficient tool to reduce the dimension of a data set consisting of a large number of interrelated variables while retaining most of the variations. It is achieved by transforming the data set to a new set of ordered variables according to their variances or importance. This technique has three effects: It orthogonalizes the components of the input vectors so that uncorrelated with each other, it orders the resulting orthogonal components so that those with the largest variation come first, and eliminates those components contributing the least to the variation in the data set [<xref ref-type="bibr" rid="b21-sensors-11-04721">21</xref>].</p>
<p>More specifically, for a given n-dimensional matrix <italic>n × m</italic>, where <italic>n</italic> and <italic>m</italic> are the number of variables and the number of temporal observations, respectively, the <italic>p</italic> principal axes (<italic>p ≪ n</italic>) are orthogonal axes, onto which the retained variance is maximal in the projected space. The PCA describes the space of the original data projecting onto the space in a base of eigenvectors. The corresponding eigenvalues account for the energy of the process in the eigenvector directions. It is assumed that most of the information in the observation vectors is contained in the subspace spanned by the first <italic>p</italic> principal components. Considering data projection restricted to <italic>p</italic> eigenvectors with the highest eigenvalues, an effective reduction in the input space dimensionality of the original data can be achieved with minimal information loss. Reducing the dimensionality of the <italic>n</italic> dimensional input space by projecting the input data onto the eigenvectors corresponding to the first <italic>p</italic> eigenvalues is an important step that facilitates subsequent neural network analysis [<xref ref-type="bibr" rid="b22-sensors-11-04721">22</xref>].</p>
<p>The detailed steps of PCA are as follows: (1) organize the dataset; (2) calculate the mean along each dimension; (3) calculate the deviation; (4) find the covariance matrix; (5) find the eigenvectors and eigenvalues of the covariance matrix; (6) sort the eigenvectors and eigenvalues; (7) compute the cumulative energy content for each eigenvector; (8) select a subset of the eigenvectors as the new basis vectors; (9) convert the source data to z-scores; (10) project the z-scores of the data onto the new basis. <xref ref-type="fig" rid="f1-sensors-11-04721">Figure 1</xref> shows a geometric illustration of PCA. Here the original basis is {<italic>x</italic><sub>1</sub>, <italic>x</italic><sub>2</sub>}, and the new basis is {<italic>F</italic><sub>1</sub>, <italic>F</italic><sub>2</sub>}. After the data was projecting onto the new basis, we can find that the data focused along the first dimension of the new basis.</p></sec></sec>
<sec sec-type="intro">
<label>4.</label>
<title>Forward Neural Network</title>
<p>Neural networks are widely used in pattern classification since they do not need any information about the probability distribution and the <italic>a priori</italic> probabilities of different classes. A two-hidden-layer backpropagation neural network is adopted with sigmoid neurons in the hidden layers and linear neuron in the output layer via the information entropy method [<xref ref-type="bibr" rid="b23-sensors-11-04721">23</xref>].</p>
<p>The training vectors are formed from the selected areas and normalized and presented to the NN which is trained in batch mode. The network configuration is <italic>N</italic><italic><sub>I</sub></italic> × <italic>N</italic><italic><sub>H</sub></italic><sub>1</sub> × <italic>N</italic><italic><sub>H</sub></italic><sub>2</sub> × <italic>N</italic><italic><sub>O</sub></italic>, <italic>i.e.</italic>, a three-layer network with <italic>N</italic><italic><sub>I</sub></italic> neurons in the input layer, <italic>N</italic><italic><sub>H</sub></italic><sub>1</sub> neurons in the first hidden layer, <italic>N</italic><italic><sub>H</sub></italic><sub>2</sub> neurons in the second hidden layer, and <italic>N</italic><italic><sub>O</sub></italic> neuron in the output layer (<xref ref-type="fig" rid="f2-sensors-11-04721">Figure 2</xref>). Their values vary with the remote-sensing area, and will be determined in the Experimental section.</p>
<sec sec-type="intro">
<label>4.1.</label>
<title>Introduction of PSO</title>
<p>The traditional NN training method can easily be trapped into the local minima, and the training procedures take a long time [<xref ref-type="bibr" rid="b24-sensors-11-04721">24</xref>]. In this study, PSO is chosen to find the optimal parameters of the neural network. PSO is a population based stochastic optimization technique, which is based on simulating the social behavior of swarm of bird flocking, bees, and fish schooling. By randomly initializing the algorithm with candidate solutions, the PSO successfully leads to a global optimum [<xref ref-type="bibr" rid="b25-sensors-11-04721">25</xref>]. This is achieved by an iterative procedure based on the processes of movement and intelligence in an evolutionary system. <xref ref-type="fig" rid="f3-sensors-11-04721">Figure 3</xref> shows the flow chart of a PSO algorithm.</p>
<p>In PSO, each potential solution is represented as a particle. Two properties (position <italic>x</italic> and velocity <italic>v</italic>) are associated with each particle. Suppose <italic>x</italic> and <italic>v</italic> of the ith particle are given as [<xref ref-type="bibr" rid="b26-sensors-11-04721">26</xref>]:
<disp-formula id="FD14">
<label>(14)</label>
<mml:math display="block">
<mml:mi>x</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mn>1</mml:mn></mml:mrow></mml:msub>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mn>2</mml:mn></mml:mrow></mml:msub>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mo>⋯</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>N</mml:mi></mml:mrow></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>
<disp-formula id="FD15">
<label>(15)</label>
<mml:math display="block">
<mml:mi>v</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>v</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mn>1</mml:mn></mml:mrow></mml:msub>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>v</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mn>2</mml:mn></mml:mrow></mml:msub>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mo>⋯</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>v</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>N</mml:mi></mml:mrow></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>where <italic>N</italic> stands for the dimensions of the problem. In each iteration, a fitness function is evaluated for all the particles in the swarm. The velocity of each particle is updated by keeping track of two best positions. One is the best position a particle has traversed so far. It is called “<italic>pBest</italic>”. The other is the best position that any neighbor of a particle has traversed so far. It is a neighborhood best and is called “<italic>nBest</italic>”. When a particle takes the whole population as its neighborhood, the neighborhood best becomes the global best and is accordingly called “<italic>gBest</italic>”. Hence, a particle’s velocity and position are updated as follows:
<disp-formula id="FD16">
<label>(16)</label>
<mml:math display="block">
<mml:mi>v</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>ω</mml:mi>
<mml:mo>⋅</mml:mo>
<mml:mi>v</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>+</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:msub>
<mml:mi>r</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mtext mathvariant="italic">Best</mml:mtext>
<mml:mo> </mml:mo>
<mml:mo>−</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo> </mml:mo>
<mml:mo>+</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:msub>
<mml:mi>r</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>n</mml:mi>
<mml:mtext mathvariant="italic">Best</mml:mtext>
<mml:mo> </mml:mo>
<mml:mo>−</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>
<disp-formula id="FD17">
<label>(17)</label>
<mml:math display="block">
<mml:mi>x</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>x</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>+</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>v</mml:mi>
<mml:mo>Δ</mml:mo>
<mml:mi>t</mml:mi></mml:math></disp-formula>where <italic>ω</italic> is called the “<italic>inertia weight</italic>” that controls the impact of the previous velocity of the particle on its current one. <italic>c</italic><sub>1</sub> and <italic>c</italic><sub>2</sub> are positive constants, called “<italic>acceleration coefficients</italic>”. <italic>r</italic><sub>1</sub> and <italic>r</italic><sub>2</sub> are random numbers that are uniformly distributed in the interval [0,1]. These random numbers are updated every time when they occur. Δ<italic>t</italic> stands for the given time-step and usually equals to 1.</p>
<p>The population of particles is then moved according to Equations (<xref ref-type="disp-formula" rid="FD16">16</xref>) and (<xref ref-type="disp-formula" rid="FD17">17</xref>), and tends to cluster together from different directions. However, a maximum velocity <italic>v</italic><sub>max</sub>, should not be exceeded by any particle to keep the search within a meaningful solution space. The PSO algorithm runs through these processes iteratively until the termination criterion is satisfied.</p>
<p>Let <italic>NP</italic> denotes the number of particles, each having a position <italic>x</italic><italic><sub>i</sub></italic> and a velocity <italic>v</italic><italic><sub>i</sub></italic>. Let <italic>p</italic><italic><sub>i</sub></italic> be the best known position of particle <italic>i</italic> and <italic>g</italic> be the best known position of the entire swarm. A basic PSO algorithm can be described as follows:
<list list-type="simple">
<list-item>
<p>Step 1 Initialize every particle’s position with a uniformly distributed random vector;</p></list-item>
<list-item>
<p>Step 2 Initialize every particle’s best known position to its initial position, <italic>viz.</italic>, <italic>p</italic><italic><sub>i</sub></italic> = <italic>x</italic><italic><sub>i</sub></italic>;</p></list-item>
<list-item>
<p>Step 3 If <italic>f</italic>(<italic>p</italic><italic><sub>i</sub></italic>) &lt; <italic>f</italic>(<italic>g</italic>), then update the swarm’s best known position, <italic>g</italic> = <italic>p</italic><italic><sub>i</sub></italic>;</p></list-item>
<list-item>
<p>Step 4 Repeat until certain termination criteria was met
<list list-type="simple">
<list-item>
<p>Step 4.1 Pick random numbers <italic>r</italic><sub>1</sub> &amp; <italic>r</italic><sub>2</sub>;</p></list-item>
<list-item>
<p>Step 4.2 Update every particle’s velocity according to formula (16);</p></list-item>
<list-item>
<p>Step 4.3 Update every particle’s position according to formula (17);</p></list-item>
<list-item>
<p>Step 4.4 If <italic>f</italic>(<italic>x</italic><italic><sub>i</sub></italic>) &lt; <italic>f</italic>(<italic>p</italic><italic><sub>i</sub></italic>), then update the particle’s best known position, <italic>p</italic><italic><sub>i</sub></italic> = <italic>x</italic><italic><sub>i</sub></italic>. If <italic>f</italic>(<italic>p</italic><italic><sub>i</sub></italic>) &lt; <italic>f</italic>(<italic>g</italic>), then update the swarm’s best known position, <italic>g</italic> = <italic>p</italic><italic><sub>i</sub></italic>.</p></list-item>
<list-item>
<p>Step 5 Output <italic>g</italic> which holds the best found solution.</p></list-item></list></p></list-item></list></p></sec>
<sec>
<label>4.2.</label>
<title>ACPSO</title>
<p>In order to enhance the performance of canonical PSO, two improvements are proposed as follows. The inertia weight <italic>ω</italic> in Equation (<xref ref-type="disp-formula" rid="FD16">16</xref>) affects the performance of the algorithm. A larger inertia weight pressures towards global exploration, while a smaller one pressures towards fine-tuning of current search area [<xref ref-type="bibr" rid="b27-sensors-11-04721">27</xref>]. Thus, proper control of <italic>ω</italic> is important to find the optimum solution accurately. To deal with this shortcoming, an “<italic>adaptive inertia weight factor</italic>” (AIWF) was employed as follow:
<disp-formula id="FD18">
<label>(18)</label>
<mml:math display="block">
<mml:mi>ω</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>ω</mml:mi>
<mml:mrow>
<mml:mtext>max</mml:mtext></mml:mrow></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>−</mml:mo>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>ω</mml:mi>
<mml:mrow>
<mml:mtext>max</mml:mtext></mml:mrow></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>−</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>ω</mml:mi>
<mml:mrow>
<mml:mtext>min</mml:mtext></mml:mrow></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mtext>max</mml:mtext></mml:mrow></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>×</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>k</mml:mi></mml:math></disp-formula></p>
<p>Here, <italic>ω</italic><sub>max</sub> denotes the maximum inertial weight, <italic>ω</italic><sub>min</sub> denotes the minimum inertial weight, <italic>k</italic><sub>max</sub> denotes the epoch when the inertial weight reaches the final minimum, and <italic>k</italic> denotes current epoch.</p>
<p>The parameters (<italic>r</italic><sub>1</sub>, <italic>r</italic><sub>2</sub>) were generated by pseudo-random number generators (RNG) in classical PSO. The RNG cannot ensure the optimization’s ergodicity in solution space because they are pseudo-random; therefore, we employed the Rossler chaotic operator [<xref ref-type="bibr" rid="b28-sensors-11-04721">28</xref>] to generate parameters (<italic>r</italic><sub>1</sub>, <italic>r</italic><sub>2</sub>). The Rossler equations are as follows:
<disp-formula id="FD19">
<label>(19)</label>
<mml:math display="block">
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>t</mml:mi></mml:mrow></mml:mfrac>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mo>−</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>y</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>+</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>z</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>y</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>t</mml:mi></mml:mrow></mml:mfrac>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>x</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>+</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>a</mml:mi>
<mml:mi>y</mml:mi></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>z</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>t</mml:mi></mml:mrow></mml:mfrac>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>b</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>+</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>x</mml:mi>
<mml:mi>z</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>−</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>c</mml:mi>
<mml:mi>z</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:math></disp-formula></p>
<p>Here <italic>a</italic>, <italic>b</italic>, and <italic>c</italic> are parameters. In this study, we chose <italic>a</italic> = 0.2, <italic>b</italic> = 0.4, and <italic>c</italic> = 5.7. The results are shown in <xref ref-type="fig" rid="f4-sensors-11-04721">Figure 4</xref>, where the line in the 3D space exhibits a strong chaotic property called “spiral chaos”.</p>
<p>The dynamic properties of <italic>x</italic>(<italic>t</italic>) and <italic>y</italic>(<italic>t</italic>) are shown in <xref ref-type="fig" rid="f5-sensors-11-04721">Figure 5</xref>, where <italic>x</italic>(<italic>t</italic>) and <italic>y</italic>(<italic>t</italic>) satisfy both ergodicity and randomness. Therefore, we let <italic>r</italic><sub>1</sub> = <italic>x</italic>(<italic>t</italic>) and <italic>r</italic><sub>2</sub> = <italic>y</italic>(<italic>t</italic>) to embed the chaotic operator into the canonical PSO method.</p>
<p>There are some other chaotic PSO methods proposed in the past. Wang <italic>et al.</italic> [<xref ref-type="bibr" rid="b29-sensors-11-04721">29</xref>] proposed a chaotic PSO to find the high precision prediction for the grey forecasting model. Chuang <italic>et al.</italic> [<xref ref-type="bibr" rid="b30-sensors-11-04721">30</xref>] proposed a chaotic catfish PSO for solving global numeric optimization problem. Araujo <italic>et al.</italic> [<xref ref-type="bibr" rid="b31-sensors-11-04721">31</xref>] intertwined PSO with Lozi map chaotic sequences to obtain Takagi-Sugeno fuzzy model for representing dynamic behaviors. Coelho [<xref ref-type="bibr" rid="b32-sensors-11-04721">32</xref>] presented an efficient PSO algorithm based on Gaussian distribution and chaotic sequence to solve the reliability–redundancy optimization problems. Coelho <italic>et al.</italic> [<xref ref-type="bibr" rid="b33-sensors-11-04721">33</xref>] presented a quantum-inspired version of the PSO using the harmonic oscillator well to solve the economic dispatch problem. Cai <italic>et al.</italic> [<xref ref-type="bibr" rid="b34-sensors-11-04721">34</xref>] developed a multi-objective chaotic PSO method to solve the environmental economic dispatch problems considering both economic and environmental issues. Coelho <italic>et al.</italic> [<xref ref-type="bibr" rid="b35-sensors-11-04721">35</xref>] proposed three differential evolution approaches based on chaotic sequences using logistic equation for image enhancement process. Sun <italic>et al.</italic> [<xref ref-type="bibr" rid="b36-sensors-11-04721">36</xref>] proposed a drift PSO and applied it in estimating the unknown parameters of chaotic dynamic system.</p>
<p>The main difference between our ACPSO and popular PSO lies in two points: (1) we introduced in the adaptive inertia weight factor strategy; (2) we used the Rossler attractor because of the following advantages [<xref ref-type="bibr" rid="b37-sensors-11-04721">37</xref>]: the Rossler is simpler, having only one manifold, and easier to analyze qualitatively. In total, the procedures of ACPSO are listed as follows:
<list list-type="simple">
<list-item>
<p>Step 1 Initialize every particle’s position with a uniformly distributed random vector;</p></list-item>
<list-item>
<p>Step 2 Initialize every particle’s best known position to its initial position, <italic>viz.</italic>, <italic>p</italic><italic><sub>i</sub></italic> = <italic>x</italic><italic><sub>i</sub></italic>;</p></list-item>
<list-item>
<p>Step 3 If <italic>f</italic>(<italic>p</italic><italic><sub>i</sub></italic>) &lt; <italic>f</italic>(<italic>g</italic>), then update the swarm’s best known position, <italic>g</italic> = <italic>p</italic><italic><sub>i</sub></italic>;</p></list-item>
<list-item>
<p>Step 4 Repeat until certain termination criteria was met:
<list list-type="simple">
<list-item>
<p>Step 4.1 Update the value of ω according to formula (18);</p></list-item>
<list-item>
<p>Step 4.2 Pick chaotic random numbers <italic>r</italic><sub>1</sub> &amp; <italic>r</italic><sub>2</sub> according to formula (19)</p></list-item>
<list-item>
<p>Step 4.3 Update every particle’s velocity according to formula (16);</p></list-item>
<list-item>
<p>Step 4.4 Update every particle’s position according to formula (17);</p></list-item>
<list-item>
<p>Step 4.5 If <italic>f</italic>(<italic>x</italic><italic><sub>i</sub></italic>) &lt; <italic>f</italic>(<italic>p</italic><italic><sub>i</sub></italic>), then update the particle’s best known position, <italic>p</italic><italic><sub>i</sub></italic> = <italic>x</italic><italic><sub>i</sub></italic>. If <italic>f</italic>(<italic>p</italic><italic><sub>i</sub></italic>) &lt; <italic>f</italic>(<italic>g</italic>), then update the swarm’s best known position, <italic>g</italic> = <italic>p</italic><italic><sub>i</sub></italic>.</p></list-item></list></p></list-item>
<list-item>
<p>Step 5 Output <italic>g</italic> which holds the best found solution.</p></list-item></list></p></sec>
<sec>
<label>4.3.</label>
<title>ACPSO-NN</title>
<p>Let <italic>ω</italic><sub>1</sub>, <italic>ω</italic><sub>2</sub>, <italic>ω</italic><sub>3</sub> represent the connection weight matrix between the input layer and the first hidden layer, between the first and the second hidden layer, and between the second hidden layer and the output layer, respectively. When the ACPSO is employed to train the multi-layer neural network, each particle is denoted by:
<disp-formula id="FD20">
<label>(20)</label>
<mml:math display="block">
<mml:mi>ω</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">[</mml:mo>
<mml:msub>
<mml:mi>ω</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>ω</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>ω</mml:mi>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo stretchy="false">]</mml:mo></mml:math></disp-formula></p>
<p>The outputs of all neurons in the first hidden layer are calculated by following steps:
<disp-formula id="FD21">
<label>(21)</label>
<mml:math display="block">
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi>j</mml:mi></mml:mrow></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>H</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>I</mml:mi></mml:msub></mml:mrow></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>ω</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>j</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo> </mml:mo>
<mml:mi>j</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mo>⋯</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></disp-formula></p>
<p>Here <italic>x</italic><italic><sub>i</sub></italic> denotes the <italic>i</italic>th input value, <italic>y</italic><sub>1</sub><italic><sub>j</sub></italic> denotes the <italic>j</italic>th output of the first hidden layer, and <italic>f</italic><italic><sub>H</sub></italic> is referred to as the activation function of hidden layer. The outputs of all neurons in the second hidden layer are calculated as:
<disp-formula id="FD22">
<label>(22)</label>
<mml:math display="block">
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>k</mml:mi></mml:mrow></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>H</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>ω</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>k</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo> </mml:mo>
<mml:mi>k</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mo>⋯</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></disp-formula>where <italic>y</italic><sub>2</sub><italic><sub>j</sub></italic> denotes the <italic>j</italic>th output of the second hidden layer.</p>
<p>The outputs of all neurons in the output layer are given as follows:
<disp-formula id="FD23">
<label>(23)</label>
<mml:math display="block">
<mml:msub>
<mml:mi>O</mml:mi>
<mml:mi>l</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>O</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>ω</mml:mi>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>k</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mi>l</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo> </mml:mo>
<mml:mi>l</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>O</mml:mi></mml:msub></mml:math></disp-formula></p>
<p>Here <italic>f</italic><italic><sub>O</sub></italic> denotes the activation function of output layer, usually a line function. All weights are assigned with random values initially, and are modified by the delta rule according to the learning samples traditionally.</p>
<p>The error of one sample is expressed as the MSE of the difference between its output and the corresponding target value:
<disp-formula id="FD24">
<label>(24)</label>
<mml:math display="block">
<mml:msub>
<mml:mi>E</mml:mi>
<mml:mi>m</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mtext>mse</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>O</mml:mi></mml:msub></mml:mrow></mml:munderover>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>O</mml:mi>
<mml:mi>l</mml:mi></mml:msub>
<mml:mo> </mml:mo>
<mml:mo>−</mml:mo>
<mml:mo> </mml:mo>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>l</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo> </mml:mo>
<mml:mi>m</mml:mi>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo> </mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>S</mml:mi></mml:msub></mml:math></disp-formula>where <italic>T</italic><italic><sub>k</sub></italic> represents the <italic>k</italic>th value of the authentic values which are already known to users, and <italic>N</italic><italic><sub>S</sub></italic> represents the number of samples. Suppose there are <italic>N</italic><italic><sub>S</sub></italic> samples, then the fitness value is written as:
<disp-formula id="FD25">
<label>(25)</label>
<mml:math display="block">
<mml:mi>F</mml:mi>
<mml:mo> </mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>ω</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo> </mml:mo>
<mml:mo>=</mml:mo>
<mml:mo> </mml:mo>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>E</mml:mi>
<mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:math></disp-formula>where <italic>ω</italic> represents the vectorization of the (<italic>ω</italic><sub>1</sub>, <italic>ω</italic><sub>2</sub>, <italic>ω</italic><sub>3</sub>). Our goal is to minimize this fitness function <italic>F</italic>(<italic>ω</italic>) by the proposed ACPSO method, viz., force the output values of each sample approximate to corresponding target values.</p></sec>
<sec>
<label>4.4.</label>
<title>Cross Validation</title>
<p>Cross validation methods consist of three types: Random subsampling, <italic>K</italic>-fold cross validation, and leave-one-out validation. The <italic>K</italic>-fold cross validation is applied due to its properties as simple, easy, and using all data for training and validation. The mechanism is to create a <italic>K</italic>-fold partition of the whole dataset, repeat <italic>K</italic> times to use <italic>K</italic>-1 folds for training and a left fold for validation, and finally average the error rates of <italic>K</italic> experiments. The schematic diagram of 5-fold cross validation is shown in <xref ref-type="fig" rid="f6-sensors-11-04721">Figure 6</xref>.</p>
<p>A challenge is to determine the number of folds. If <italic>K</italic> is set too large, the bias of the true error rate estimator will be small, however, the variance of the estimator will be large and the computation will be time-consuming. Alternatively, if <italic>K</italic> is set too small, the computation time will decrease, the variance of the estimator will be small, but the bias of the estimator will be large. The advantages and disadvantages of setting <italic>K</italic> large or small are listed in <xref ref-type="table" rid="t3-sensors-11-04721">Table 3</xref>. In this study, <italic>K</italic> is determined as 10 through trial-and-error method.</p>
<p>If the model selection and true error estimation are computed simultaneously, the data needs to be divided into three disjoint sets [<xref ref-type="bibr" rid="b38-sensors-11-04721">38</xref>]. In another word, the validation subset is used to tune the parameters of the neural network model, so another test subset is needed only to assess the performance of a trained neural network, <italic>viz.</italic>, the whole dataset is divided into three subsets with different purposes listed in <xref ref-type="table" rid="t4-sensors-11-04721">Table 4</xref>. The reason why the validation set and testing set cannot merge with each other lies in that the error rate estimation via the validation data will be biased (smaller than the true error rate) since the validation set is used to tune the model [<xref ref-type="bibr" rid="b39-sensors-11-04721">39</xref>].</p></sec></sec>
<sec>
<label>5.</label>
<title>Experiments</title>
<p>Flevoland, an agricultural area in The Netherlands, is chosen as the example. The site is composed of strips of rectangular agricultural fields. The scene is designated as a supersite for the earth observing system (EOS) program, and is continuously surveyed by the authorities.</p>
<sec>
<label>5.1.</label>
<title>Refine Lee Filter</title>
<p>The Pauli image of Flevoland is shown in <xref ref-type="fig" rid="f7-sensors-11-04721">Figure 7(a)</xref>, and the refine Lee filtered image (Window Size = 7) is shown in <xref ref-type="fig" rid="f7-sensors-11-04721">Figure 7(b)</xref>.</p></sec>
<sec>
<label>5.2.</label>
<title>Full Features</title>
<p>The basic span image and three channels (<italic>T</italic><sub>11</sub>, <italic>T</italic><sub>22</sub>, <italic>T</italic><sub>33</sub>) are easily obtained and shown in <xref ref-type="fig" rid="f8-sensors-11-04721">Figure 8</xref>. The parameters of H/A/Alpha decomposition are shown in <xref ref-type="fig" rid="f9-sensors-11-04721">Figure 9</xref>. The GLCM-based parameters of <italic>T</italic><sub>11</sub>, <italic>T</italic><sub>22</sub>, <italic>T</italic><sub>33</sub> are shown in <xref ref-type="fig" rid="f10-sensors-11-04721">Figures 10</xref>–<xref ref-type="fig" rid="f12-sensors-11-04721">12</xref>.</p></sec>
<sec>
<label>5.3.</label>
<title>PCA</title>
<p>The curve of cumulative sum of variance with dimensions of reduced vectors via PCA is shown in <xref ref-type="fig" rid="f13-sensors-11-04721">Figure 13</xref>. The detailed data are listed in <xref ref-type="table" rid="t5-sensors-11-04721">Table 5</xref>. It shows that only 13 features, which are only half the original features, could preserve 98.06% of variance.</p></sec>
<sec>
<label>5.4.</label>
<title>Area Selection</title>
<p>The classification is run over 13 classes, bare soil 1, bare soil 2, barley, forest, grass, lucerne, peas, potatoes, rapeseed, stem beans, sugar beet, water, and wheat. Our strategy is a semiautomatic method, viz. the training area was chosen and labeled manually. For each crop type, we choose a square of size 20 × 20, which is easy to perform since the training area size is 13 × 20 × 20 = 5,200 compared to the size of the whole image is 1,024 × 750 = 768,000. In order to reduce the complexity of experiment, the test areas are chosen randomly from rest areas [<xref ref-type="bibr" rid="b40-sensors-11-04721">40</xref>,<xref ref-type="bibr" rid="b41-sensors-11-04721">41</xref>], with the same square size as the training area.</p>
<p>The final manually selected training areas are shown in <xref ref-type="fig" rid="f14-sensors-11-04721">Figure 14(a)</xref>. Each square corresponds to a crop type with the size of 20 × 20. In total, there are 5,200 pixels for our training. The cross validation procedures loop 10 times, therefore, each loop we use 4,680 pixels for training and the left 520 pixels for validation. The final randomly selected test areas are shown <xref ref-type="fig" rid="f14-sensors-11-04721">Figure 14(b)</xref>. The samples numbers of training and test area are shown in <xref ref-type="table" rid="t6-sensors-11-04721">Table 6</xref>.</p></sec>
<sec>
<label>5.5.</label>
<title>Network Training</title>
<p><italic>N</italic><italic><sub>I</sub></italic> is determined as 13 due to the 13 features obtained by PCA. <italic>No</italic> is determined as 13 due to the 13 classes shown in <xref ref-type="fig" rid="f14-sensors-11-04721">Figure 14</xref>. Both <italic>N</italic><italic><sub>H</sub></italic><sub>1</sub> and <italic>N</italic><italic><sub>H</sub></italic><sub>2</sub> are set as 10 via the information entropy method [<xref ref-type="bibr" rid="b42-sensors-11-04721">42</xref>]. Therefore, the number of unknown variables of the neural network is 13 × 10 + 10 + 10 × 10 + 10 + 10 × 13 + 13 = 393.</p>
<p>The network was trained by the proposed ACPSO algorithm, of which the parameters are obtained via trial-and-error method and shown in <xref ref-type="table" rid="t7-sensors-11-04721">Table 7</xref>. Besides, BP algorithm [<xref ref-type="bibr" rid="b8-sensors-11-04721">8</xref>], BP with momentum (MBP) [<xref ref-type="bibr" rid="b43-sensors-11-04721">43</xref>], adaptive BP algorithm (ABP) [<xref ref-type="bibr" rid="b44-sensors-11-04721">44</xref>], and PSO [<xref ref-type="bibr" rid="b45-sensors-11-04721">45</xref>] are employed as comparative algorithms.</p>
<p>The curves of function fitness <italic>versus versus</italic> epoch of different algorithms are shown in <xref ref-type="fig" rid="f15-sensors-11-04721">Figure 15</xref>, indicating that the proposed ACPSO converges the most quickly and is capable of finding global minimum point.</p></sec>
<sec>
<label>5.6.</label>
<title>Classification Accuracy</title>
<p>The confusion matrices on training area of our method are calculated and shown in <xref ref-type="fig" rid="f16-sensors-11-04721">Figure 16</xref>. The overall accuracies of our method on the training area (combining training and validation subsets) and test area are 99.0% and 94.0%, respectively. The main drawbacks are around the following four misclassifications: (I) forest zones are easily misclassified as peas; (II) grasses are easily misclassified as barley and lucerne; (III) lucerne are easily misclassified as grasses; (IV) sugarbeets are easily misclassified as peas.</p>
<p>A typical classification accuracy of both training area and test area by BP, ABP, MBP, and PSO are listed in <xref ref-type="table" rid="t8-sensors-11-04721">Table 8</xref>, indicating that the proposed algorithm achieves the highest classification accuracy on both training (99.0%) and test area (94.0%). The random classifier disregards the information of the training data and returns random predictions, so it is usually employed to find the lowest classification rate.</p>
<p>Yudong also used Resilient back-propagation (RPROP) algorithm to train the neural network to classify the same Flevoland area [<xref ref-type="bibr" rid="b41-sensors-11-04721">41</xref>], and obtains 98.62% on training area and 92.87% on test area. The PSO ranks the third with 98.1% on training area and 88.7% on test area. The ABP ranks the fourth with 90.7% and 86.4% on both training and test area, respectively. The BP and MBP performs the worst with the classification accuracy only a bit higher than the random classifier of 1/131 = 37. 69%, indicating that only 2,000 iterations are not enough for these two training strategies. Besides, the classification accuracy of the proposed algorithm was extremely high on the test area due to the 10-fold cross validation.</p></sec>
<sec>
<label>5.7.</label>
<title>Robustness</title>
<p>In order to compare the robustness of each algorithm, we perform each algorithm 50 runs and calculated the minimum, the average, and the maximum of the classification rates. The results are listed in <xref ref-type="table" rid="t9-sensors-11-04721">Table 9</xref>. It indicates that the results of each algorithm changed at each run, but the variation is limited, so the rank of the performance of all algorithms is the same as that in <xref ref-type="table" rid="t8-sensors-11-04721">Table 8</xref>.</p></sec>
<sec sec-type="methods">
<label>5.8.</label>
<title>Time Analysis</title>
<p>Computation time is another important factor used to evaluate the classifier. The time for network training of our algorithm costs about 120 s, which can be ignored since the weights/biases of the NN remain fixed after training unless the property of images changes greatly. For example, the main crops in Flevoland are involved in the 13 types shown in <xref ref-type="fig" rid="f14-sensors-11-04721">Figure 14(c)</xref>, therefore, the classifier can be directly used to other remote-sensing images of Flevoland without retrain. It will cost about 0.131 + 30, 0.242 + 40, 0.232 + 30, 0.181 + 80, 0.048 = 0.83 s from the input of Flevoland images (size 1,024 × 750) to the output of final classification results as shown in <xref ref-type="table" rid="t10-sensors-11-04721">Table 10</xref>. For each pixel, it costs only 1.08 × 10<sup>−7</sup>s, which is fast enough for real time applications.</p></sec></sec>
<sec sec-type="conclusions">
<label>6.</label>
<title>Conclusions</title>
<p>In this study, a crop classification classifier was constructed by following stages. First, a hybrid feature set was introduced which was made up of the span image, the H/A/α decomposition, and the GLCM-based texture features. Afterwards, PCA was carried on to reduce the features. The principle components were sent to the two-hidden-layer neural network, which was trained by the proposed ACPSO method. 10-fold cross validation was employed to prevent overfitting. Experiments on Flevoland site show that the proposed ACPSO-NN obtains satisfying results. The ACPSO trains the neural network more efficiently and effectively than BP, ABP, MBP, PSO, and RPROP methods. More rigorous testing on more complex problems will be performed in future works.</p></sec></body>
<back>
<ack>
<p>The research is financed by following projects: (1) National Natural Science Foundation of China (#60872075); (2) National Technical Innovation Project Essential Project Cultivate Project (#706928) and (3) Nature Science Fund in Jiangsu Province (#BK2007103).</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-sensors-11-04721"><label>1.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ilyushin</surname><given-names>YA</given-names></name></person-group><article-title>Impact of the plasma fluctuations in the Martian ionosphere on the performance of the synthetic aperture ground-penetrating radar</article-title><source>Planet. Space Sci</source><year>2009</year><volume>57</volume><fpage>1458</fpage><lpage>1466</lpage><pub-id pub-id-type="doi">10.1016/j.pss.2009.07.005</pub-id></citation></ref>
<ref id="b2-sensors-11-04721"><label>2.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pal</surname><given-names>SK</given-names></name><name><surname>Majumdar</surname><given-names>TJ</given-names></name><name><surname>Bhattacharya</surname><given-names>AK</given-names></name></person-group><article-title>ERS-2 SAR and IRS-1C LISS III data fusion: A PCA approach to improve remote sensing based geological interpretation</article-title><source>ISPRS J. Photogramm. Remote Sens</source><year>2007</year><volume>61</volume><fpage>281</fpage><lpage>297</lpage><pub-id pub-id-type="doi">10.1016/j.isprsjprs.2006.10.001</pub-id></citation></ref>
<ref id="b3-sensors-11-04721"><label>3.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khan</surname><given-names>KU</given-names></name><name><surname>Yang</surname><given-names>J</given-names></name></person-group><article-title>Polarimetric synthetic aperture radar image classification by a hybrid method</article-title><source>Tsinghua Sci. Technol</source><year>2007</year><volume>12</volume><fpage>97</fpage><lpage>104</lpage><pub-id pub-id-type="doi">10.1016/S1007-0214(07)70015-9</pub-id></citation></ref>
<ref id="b4-sensors-11-04721"><label>4.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Simons</surname><given-names>M</given-names></name><name><surname>Rosen</surname><given-names>PA</given-names></name></person-group><article-title>Interferometric synthetic aperture radar geodesy</article-title><source>Treatise on Geophysics</source><person-group person-group-type="editor"><name><surname>Gerald</surname><given-names>S</given-names></name></person-group><publisher-name>Elsevier</publisher-name><publisher-loc>Amsterdam, The Netherlands</publisher-loc><year>2007</year><fpage>391</fpage><lpage>446</lpage></citation></ref>
<ref id="b5-sensors-11-04721"><label>5.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hellmann</surname><given-names>M</given-names></name><name><surname>Jäger</surname><given-names>G</given-names></name></person-group><article-title>Fuzzy rule based classification of polarimetric SAR data</article-title><source>Aerosp. Sci. Technol</source><year>2002</year><volume>6</volume><fpage>217</fpage><lpage>232</lpage><pub-id pub-id-type="doi">10.1016/S1270-9638(02)01152-5</pub-id></citation></ref>
<ref id="b6-sensors-11-04721"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gadelmawla</surname><given-names>ES</given-names></name></person-group><article-title>A vision system for surface roughness characterization using the gray level co-occurrence matrix</article-title><source>NDTE Int</source><year>2004</year><volume>37</volume><fpage>577</fpage><lpage>588</lpage><pub-id pub-id-type="doi">10.1016/j.ndteint.2004.03.004</pub-id></citation></ref>
<ref id="b7-sensors-11-04721"><label>7.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quan</surname><given-names>JJ</given-names></name><name><surname>Wen</surname><given-names>XB</given-names></name><name><surname>Xu</surname><given-names>XQ</given-names></name></person-group><article-title>Multiscale probabilistic neural network method for SAR image segmentation</article-title><source>Appl. Math. Comput</source><year>2008</year><volume>205</volume><fpage>578</fpage><lpage>583</lpage><pub-id pub-id-type="doi">10.1016/j.amc.2008.05.030</pub-id></citation></ref>
<ref id="b8-sensors-11-04721"><label>8.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fontenla-Romero</surname><given-names>O</given-names></name><name><surname>Guijarro-Berdiñas</surname><given-names>B</given-names></name><name><surname>Pérez-Sánchez</surname><given-names>B</given-names></name><name><surname>Alonso-Betanzos</surname><given-names>A</given-names></name></person-group><article-title>A new convex objective function for the supervised learning of single-layer neural networks</article-title><source>Patt. Recog</source><year>2010</year><volume>43</volume><fpage>1984</fpage><lpage>1992</lpage><pub-id pub-id-type="doi">10.1016/j.patcog.2009.11.024</pub-id></citation></ref>
<ref id="b9-sensors-11-04721"><label>9.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mansour</surname><given-names>MAA</given-names></name><name><surname>Dessouky</surname><given-names>MM</given-names></name></person-group><article-title>A genetic algorithm approach for solving the daily photograph selection problem of the SPOT5 satellite</article-title><source>Comput. Ind. Eng</source><year>2010</year><volume>58</volume><fpage>509</fpage><lpage>520</lpage><pub-id pub-id-type="doi">10.1016/j.cie.2009.11.012</pub-id></citation></ref>
<ref id="b10-sensors-11-04721"><label>10.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fernández</surname><given-names>A</given-names></name><name><surname>Gómez</surname><given-names>S</given-names></name></person-group><article-title>Portfolio selection using neural networks</article-title><source>Comput. Oper. Res</source><year>2007</year><volume>34</volume><fpage>1177</fpage><lpage>1191</lpage><pub-id pub-id-type="doi">10.1016/j.cor.2005.06.017</pub-id></citation></ref>
<ref id="b11-sensors-11-04721"><label>11.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>Y</given-names></name><name><surname>Yan</surname><given-names>J</given-names></name><name><surname>Wei</surname><given-names>G</given-names></name><name><surname>Wu</surname><given-names>L</given-names></name></person-group><article-title>Find multi-objective paths in stochastic networks via chaotic immune PSO</article-title><source>Expert Syst. Appl</source><year>2010</year><volume>37</volume><fpage>1911</fpage><lpage>1919</lpage><pub-id pub-id-type="doi">10.1016/j.eswa.2009.07.025</pub-id></citation></ref>
<ref id="b12-sensors-11-04721"><label>12.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>YD</given-names></name><name><surname>Wu</surname><given-names>L</given-names></name></person-group><article-title>Weights optimization of neural network via improved BCO approach</article-title><source>Prog. Electromagn. Res</source><year>2008</year><volume>83</volume><fpage>185</fpage><lpage>198</lpage><pub-id pub-id-type="doi">10.2528/PIER08051403</pub-id></citation></ref>
<ref id="b13-sensors-11-04721"><label>13.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Polat</surname><given-names>K</given-names></name><name><surname>Günes</surname><given-names>S</given-names></name></person-group><article-title>A new method to forecast of Escherichia coli promoter gene sequences: Integrating feature selection and Fuzzy-AIRS classifier system</article-title><source>Expert Syst. Appl</source><year>2009</year><volume>36</volume><fpage>57</fpage><lpage>64</lpage><pub-id pub-id-type="doi">10.1016/j.eswa.2007.09.010</pub-id></citation></ref>
<ref id="b14-sensors-11-04721"><label>14.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Delen</surname><given-names>D</given-names></name><name><surname>Walker</surname><given-names>G</given-names></name><name><surname>Kadam</surname><given-names>A</given-names></name></person-group><article-title>Predicting breast cancer survivability: A comparison of three data mining methods</article-title><source>Artif. Intell. Med</source><year>2005</year><volume>34</volume><fpage>113</fpage><lpage>127</lpage><pub-id pub-id-type="doi">10.1016/j.artmed.2004.07.002</pub-id><pub-id pub-id-type="pmid">15894176</pub-id></citation></ref>
<ref id="b15-sensors-11-04721"><label>15.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kertész-Farkas</surname><given-names>A</given-names></name><name><surname>Dhir</surname><given-names>S</given-names></name><name><surname>Sonego</surname><given-names>P</given-names></name><name><surname>Pacurar</surname><given-names>M</given-names></name><name><surname>Netoteia</surname><given-names>S</given-names></name><name><surname>Nijveen</surname><given-names>H</given-names></name><name><surname>Kuzniar</surname><given-names>A</given-names></name><name><surname>Leunissen</surname><given-names>JAM</given-names></name><name><surname>Kocsor</surname><given-names>A</given-names></name><name><surname>Pongor</surname><given-names>S</given-names></name></person-group><article-title>Benchmarking protein classification algorithms via supervised cross-validation</article-title><source>J. Biochem. Biophys. Meth</source><year>2008</year><volume>70</volume><fpage>1215</fpage><lpage>1223</lpage><pub-id pub-id-type="doi">10.1016/j.jbbm.2007.05.011</pub-id><pub-id pub-id-type="pmid">17604112</pub-id></citation></ref>
<ref id="b16-sensors-11-04721"><label>16.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jouan</surname><given-names>A</given-names></name><name><surname>Allard</surname><given-names>Y</given-names></name></person-group><article-title>Land use mapping with evidential fusion of features extracted from polarimetric synthetic aperture radar and hyperspectral imagery</article-title><source>Inform. Fusion</source><year>2004</year><volume>5</volume><fpage>251</fpage><lpage>267</lpage><pub-id pub-id-type="doi">10.1016/j.inffus.2003.10.005</pub-id></citation></ref>
<ref id="b17-sensors-11-04721"><label>17.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goodarzi</surname><given-names>M</given-names></name><name><surname>Freitas</surname><given-names>MP</given-names></name></person-group><article-title>MIA-QSAR coupled to principal component analysis-adaptive neuro-fuzzy inference systems (PCA-ANFIS) for the modeling of the anti-HIV reverse transcriptase activities of TIBO derivatives</article-title><source>Eur. J. Med. Chem</source><year>2010</year><volume>45</volume><fpage>1352</fpage><lpage>1358</lpage><pub-id pub-id-type="doi">10.1016/j.ejmech.2009.12.028</pub-id><pub-id pub-id-type="pmid">20060625</pub-id></citation></ref>
<ref id="b18-sensors-11-04721"><label>18.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>Y</given-names></name><name><surname>Wu</surname><given-names>L</given-names></name></person-group><article-title>Pattern recognition via PCNN and Tsallis entropy</article-title><source>Sensors</source><year>2008</year><volume>8</volume><fpage>7518</fpage><lpage>7529</lpage><pub-id pub-id-type="doi">10.3390/s8117518</pub-id></citation></ref>
<ref id="b19-sensors-11-04721"><label>19.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Celik</surname><given-names>T</given-names></name></person-group><article-title>A Bayesian approach to unsupervised multiscale change detection in synthetic aperture radar images</article-title><source>Signal Process</source><year>2010</year><volume>90</volume><fpage>1471</fpage><lpage>1485</lpage><pub-id pub-id-type="doi">10.1016/j.sigpro.2009.10.018</pub-id></citation></ref>
<ref id="b20-sensors-11-04721"><label>20.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alvarenga</surname><given-names>AV</given-names></name><name><surname>Teixeira</surname><given-names>CA</given-names></name><name><surname>Ruano</surname><given-names>MG</given-names></name><name><surname>Pereira</surname><given-names>WCA</given-names></name></person-group><article-title>Influence of temperature variations on the entropy and correlation of the Grey-Level Co-occurrence Matrix from B-Mode images</article-title><source>Ultrasonics</source><year>2010</year><volume>50</volume><fpage>290</fpage><lpage>293</lpage><pub-id pub-id-type="doi">10.1016/j.ultras.2009.09.002</pub-id><pub-id pub-id-type="pmid">19800646</pub-id></citation></ref>
<ref id="b21-sensors-11-04721"><label>21.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Werth</surname><given-names>MT</given-names></name><name><surname>Halouska</surname><given-names>S</given-names></name><name><surname>Shortridge</surname><given-names>MD</given-names></name><name><surname>Zhang</surname><given-names>B</given-names></name><name><surname>Powers</surname><given-names>R</given-names></name></person-group><article-title>Analysis of metabolomic PCA data using tree diagrams</article-title><source>Analytical Biochem</source><year>2010</year><volume>399</volume><fpage>58</fpage><lpage>63</lpage><pub-id pub-id-type="doi">10.1016/j.ab.2009.12.022</pub-id></citation></ref>
<ref id="b22-sensors-11-04721"><label>22.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Korte</surname><given-names>FS</given-names></name><name><surname>Razumova</surname><given-names>MV</given-names></name><name><surname>Feest</surname><given-names>ER</given-names></name><name><surname>Wang</surname><given-names>D</given-names></name><name><surname>Regnier</surname><given-names>M</given-names></name></person-group><article-title>PKA phosphorylation has no effect on the force-PCA relationship or length dependent activation following L48Q cTNC-Tn exchange in rat demembranated trabeculae</article-title><source>Biophys J</source><year>2010</year><volume>98</volume><issue>Supplement 1</issue><fpage>6a</fpage></citation></ref>
<ref id="b23-sensors-11-04721"><label>23.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>Y</given-names></name><name><surname>Wu</surname><given-names>L</given-names></name></person-group><article-title>Stock Market Prediction of S&amp;P 500 via combination of improved BCO approach and BP neural network</article-title><source>Expert Syst. Appl</source><year>2009</year><volume>36</volume><fpage>8849</fpage><lpage>8854</lpage></citation></ref>
<ref id="b24-sensors-11-04721"><label>24.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Radulovic</surname><given-names>J</given-names></name><name><surname>Rankovic</surname><given-names>V</given-names></name></person-group><article-title>Feedforward neural network and adaptive network-based fuzzy inference system in study of power lines</article-title><source>Expert Syst. Appl</source><year>2010</year><volume>37</volume><fpage>165</fpage><lpage>170</lpage></citation></ref>
<ref id="b25-sensors-11-04721"><label>25.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Carvalho</surname><given-names>AB</given-names></name><name><surname>Pozo</surname><given-names>A</given-names></name><name><surname>Vergilio</surname><given-names>SR</given-names></name></person-group><article-title>A symbolic fault-prediction model based on multiobjective particle swarm optimization</article-title><source>J. Syst. Softw</source><year>2010</year><volume>83</volume><fpage>868</fpage><lpage>882</lpage></citation></ref>
<ref id="b26-sensors-11-04721"><label>26.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Niknam</surname><given-names>T</given-names></name><name><surname>Mojarrad</surname><given-names>HD</given-names></name><name><surname>Nayeripour</surname><given-names>M</given-names></name></person-group><article-title>A new fuzzy adaptive particle swarm optimization for non-smooth economic dispatch</article-title><source>Energy</source><year>2010</year><volume>35</volume><fpage>1764</fpage><lpage>1778</lpage></citation></ref>
<ref id="b27-sensors-11-04721"><label>27.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bonilla-Petriciolet</surname><given-names>A</given-names></name><name><surname>Segovia-Hernández</surname><given-names>JG</given-names></name></person-group><article-title>A comparative study of particle swarm optimization and its variants for phase stability and equilibrium calculations in multicomponent reactive and non-reactive systems</article-title><source>Fluid Phase Equilibria</source><year>2010</year><volume>289</volume><fpage>110</fpage><lpage>121</lpage></citation></ref>
<ref id="b28-sensors-11-04721"><label>28.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>dos Santos Coelho</surname><given-names>L</given-names></name><name><surname>Coelho</surname><given-names>AAR</given-names></name></person-group><article-title>Model-free adaptive control optimization using a chaotic particle swarm approach</article-title><source>Chaos. Soliton. Fractal</source><year>2009</year><volume>41</volume><fpage>2001</fpage><lpage>2009</lpage></citation></ref>
<ref id="b29-sensors-11-04721"><label>29.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname><given-names>J</given-names></name><name><surname>Zhu</surname><given-names>S</given-names></name><name><surname>Zhao</surname><given-names>W</given-names></name><name><surname>Zhu</surname><given-names>W</given-names></name></person-group><article-title>Optimal parameters estimation and input subset for grey model based on chaotic particle swarm optimization algorithm</article-title><source>Expert Syst. Appl</source><year>2011</year><volume>38</volume><fpage>8151</fpage><lpage>8158</lpage></citation></ref>
<ref id="b30-sensors-11-04721"><label>30.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chuang</surname><given-names>LY</given-names></name><name><surname>Tsai</surname><given-names>SW</given-names></name><name><surname>Yang</surname><given-names>CH</given-names></name></person-group><article-title>Chaotic catfish particle swarm optimization for solving global numerical optimization problems</article-title><source>Appl. Math. Comput</source><year>2011</year><volume>217</volume><fpage>6900</fpage><lpage>6916</lpage></citation></ref>
<ref id="b31-sensors-11-04721"><label>31.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Araujo</surname><given-names>E</given-names></name><name><surname>Coelho</surname><given-names>LdS</given-names></name></person-group><article-title>Particle swarm approaches using Lozi map chaotic sequences to fuzzy modelling of an experimental thermal-vacuum system</article-title><source>Appl. Soft Comput</source><year>2008</year><volume>8</volume><fpage>1354</fpage><lpage>1364</lpage></citation></ref>
<ref id="b32-sensors-11-04721"><label>32.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Coelho</surname><given-names>LdS</given-names></name></person-group><article-title>An efficient particle swarm approach for mixed-integer programming in reliability-redundancy optimization applications</article-title><source>Reliab. Eng. Syst. Safety</source><year>2009</year><volume>94</volume><fpage>830</fpage><lpage>837</lpage></citation></ref>
<ref id="b33-sensors-11-04721"><label>33.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Coelho</surname><given-names>LdS</given-names></name><name><surname>Mariani</surname><given-names>VC</given-names></name></person-group><article-title>Particle swarm approach based on quantum mechanics and harmonic oscillator potential well for economic load dispatch with valve-point effects</article-title><source>Energ. Conv. Manage</source><year>2008</year><volume>49</volume><fpage>3080</fpage><lpage>3085</lpage></citation></ref>
<ref id="b34-sensors-11-04721"><label>34.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cai</surname><given-names>J</given-names></name><name><surname>Ma</surname><given-names>X</given-names></name><name><surname>Li</surname><given-names>Q</given-names></name><name><surname>Li</surname><given-names>L</given-names></name><name><surname>Peng</surname><given-names>H</given-names></name></person-group><article-title>A multi-objective chaotic particle swarm optimization for environmental/economic dispatch</article-title><source>Energ. Conv. Manage</source><year>2009</year><volume>50</volume><fpage>1318</fpage><lpage>1325</lpage></citation></ref>
<ref id="b35-sensors-11-04721"><label>35.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Coelho</surname><given-names>LdS</given-names></name><name><surname>Sauer</surname><given-names>JG</given-names></name><name><surname>Rudek</surname><given-names>M</given-names></name></person-group><article-title>Differential evolution optimization combined with chaotic sequences for image contrast enhancement</article-title><source>Chaos. Soliton. Fractal</source><year>2009</year><volume>42</volume><fpage>522</fpage><lpage>529</lpage></citation></ref>
<ref id="b36-sensors-11-04721"><label>36.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname><given-names>J</given-names></name><name><surname>Zhao</surname><given-names>J</given-names></name><name><surname>Wu</surname><given-names>X</given-names></name><name><surname>Fang</surname><given-names>W</given-names></name><name><surname>Cai</surname><given-names>Y</given-names></name><name><surname>Xu</surname><given-names>W</given-names></name></person-group><article-title>Parameter estimation for chaotic systems with a drift particle swarm optimization method</article-title><source>Phys. Lett. A</source><year>2010</year><volume>374</volume><fpage>2816</fpage><lpage>2822</lpage></citation></ref>
<ref id="b37-sensors-11-04721"><label>37.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>Y</given-names></name><name><surname>Wu</surname><given-names>L</given-names></name><name><surname>Wang</surname><given-names>S</given-names></name><name><surname>Huo</surname><given-names>Y</given-names></name></person-group><article-title>Chaotic Artificial bee colony used for cluster analysis</article-title><source>Commun. Comput. Inform. Sci</source><year>2011</year><volume>134</volume><fpage>205</fpage><lpage>211</lpage></citation></ref>
<ref id="b38-sensors-11-04721"><label>38.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Affleck</surname><given-names>RT</given-names></name><name><surname>Melloh</surname><given-names>RA</given-names></name><name><surname>Shoop</surname><given-names>SA</given-names></name></person-group><article-title>Cross-country mobility on various snow conditions for validation of a virtual terrain</article-title><source>J. Terramech</source><year>2009</year><volume>46</volume><fpage>203</fpage><lpage>210</lpage></citation></ref>
<ref id="b39-sensors-11-04721"><label>39.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pradhan</surname><given-names>B</given-names></name></person-group><article-title>Remote sensing and GIS-based landslide hazard analysis and cross-validation using multivariate logistic regression model on three test areas in Malaysia</article-title><source>Adv. Space Res</source><year>2010</year><volume>45</volume><fpage>1244</fpage><lpage>1256</lpage></citation></ref>
<ref id="b40-sensors-11-04721"><label>40.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kandrika</surname><given-names>S</given-names></name><name><surname>Roy</surname><given-names>PS</given-names></name></person-group><article-title>Land use land cover classification of Orissa using multi-temporal IRS-P6 awifs data: A decision tree approach</article-title><source>Int. J. Appl. Earth Obs. Geoinf</source><year>2008</year><volume>10</volume><fpage>186</fpage><lpage>193</lpage></citation></ref>
<ref id="b41-sensors-11-04721"><label>41.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>Y</given-names></name><name><surname>Wu</surname><given-names>L</given-names></name><name><surname>Wei</surname><given-names>G</given-names></name></person-group><article-title>A new classifier for polarimetric SAR images</article-title><source>Prog. Electromagn Res</source><year>2009</year><volume>94</volume><fpage>83</fpage><lpage>104</lpage></citation></ref>
<ref id="b42-sensors-11-04721"><label>42.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yuan</surname><given-names>HC</given-names></name><name><surname>Xiong</surname><given-names>FL</given-names></name><name><surname>Huai</surname><given-names>XY</given-names></name></person-group><article-title>A method for estimating the number of hidden neurons in feed-forward neural networks based on information entropy</article-title><source>Comput. Electron. Agric</source><year>2003</year><volume>40</volume><fpage>57</fpage><lpage>64</lpage></citation></ref>
<ref id="b43-sensors-11-04721"><label>43.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Majdi</surname><given-names>A</given-names></name><name><surname>Beiki</surname><given-names>M</given-names></name></person-group><article-title>Evolving neural network using a genetic algorithm for predicting the deformation modulus of rock masses</article-title><source>Int. J. Rock Mech. Mining Sci</source><year>2010</year><volume>47</volume><fpage>246</fpage><lpage>253</lpage></citation></ref>
<ref id="b44-sensors-11-04721"><label>44.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Toprak</surname><given-names>A</given-names></name><name><surname>Siraç Özerdem</surname><given-names>M</given-names></name><name><surname>Güler</surname><given-names>I</given-names></name></person-group><article-title>Suppression of impulse noise in MR images using artificial intelligent based neuro-fuzzy adaptive median filter</article-title><source>Digit. Signal Process</source><year>2008</year><volume>18</volume><fpage>391</fpage><lpage>405</lpage></citation></ref>
<ref id="b45-sensors-11-04721"><label>45.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hamidian</surname><given-names>D</given-names></name><name><surname>Seyedpoor</surname><given-names>SM</given-names></name></person-group><article-title>Shape optimal design of arch dams using an adaptive neuro-fuzzy inference system and improved particle swarm optimization</article-title><source>Appl. Math. Model</source><year>2010</year><volume>34</volume><fpage>1574</fpage><lpage>1585</lpage></citation></ref></ref-list>
<sec sec-type="display-objects">
<title>Figures and Tables</title>
<fig id="f1-sensors-11-04721" position="float">
<label>Figure 1.</label>
<caption>
<p>Geometric Illustration of PCA.</p></caption>
<graphic xlink:href="sensors-11-04721f1.gif"/></fig>
<fig id="f2-sensors-11-04721" position="float">
<label>Figure 2.</label>
<caption>
<p>A three-layer neural network.</p></caption>
<graphic xlink:href="sensors-11-04721f2.gif"/></fig>
<fig id="f3-sensors-11-04721" position="float">
<label>Figure 3.</label>
<caption>
<p>Flow chart of the PSO algorithm.</p></caption>
<graphic xlink:href="sensors-11-04721f3.gif"/></fig>
<fig id="f4-sensors-11-04721" position="float">
<label>Figure 4.</label>
<caption>
<p>A Rossler chaotic number generator with <italic>a</italic> = 0.2, <italic>b</italic> = 0.4, <italic>c</italic> = 5.7.</p></caption>
<graphic xlink:href="sensors-11-04721f4.gif"/></fig>
<fig id="f5-sensors-11-04721" position="float">
<label>Figure 5.</label>
<caption>
<p>Chaotic sequence of (<bold>a</bold>) <italic>x</italic>(<italic>t</italic>) and (<bold>b</bold>) <italic>y</italic>(<italic>t</italic>).</p></caption>
<graphic xlink:href="sensors-11-04721f5.gif"/></fig>
<fig id="f6-sensors-11-04721" position="float">
<label>Figure 6.</label>
<caption>
<p>A 5-fold cross validation.</p></caption>
<graphic xlink:href="sensors-11-04721f6.gif"/></fig>
<fig id="f7-sensors-11-04721" position="float">
<label>Figure 7.</label>
<caption>
<p>Pauli Image of Flevoland (1,024 × 750). (<bold>a</bold>) Pauli Image; (<bold>b</bold>) The refine Lee filtered images.</p></caption>
<graphic xlink:href="sensors-11-04721f7.gif"/></fig>
<fig id="f8-sensors-11-04721" position="float">
<label>Figure 8.</label>
<caption>
<p>Basic span image and three channels image. (<bold>a</bold>) Span (dB); (<bold>b</bold>) <italic>T</italic><sub>11</sub> (dB); (<bold>c</bold>) <italic>T</italic><sub>22</sub> (dB); (<bold>d</bold>) <italic>T</italic><sub>33</sub>(dB).</p></caption>
<graphic xlink:href="sensors-11-04721f8.gif"/></fig>
<fig id="f9-sensors-11-04721" position="float">
<label>Figure 9.</label>
<caption>
<p>Parameters of H/A/α decomposition. (<bold>a</bold>) <italic>H</italic>; (<bold>b</bold>) <italic>A</italic>; (<bold>c</bold>) <italic>ᾱ</italic>; (<bold>d</bold>) <italic>β̄</italic>; (<bold>e</bold>) <italic>δ̄</italic>; (<bold>f</bold>) <italic>γ̄</italic>.</p></caption>
<graphic xlink:href="sensors-11-04721f9.gif"/></fig>
<fig id="f10-sensors-11-04721" position="float">
<label>Figure 10.</label>
<caption>
<p>GLCM-based features of <italic>T</italic><sub>11</sub>. (<bold>a</bold>) Contrast. (<bold>b</bold>) Correlation. (<bold>c</bold>) Energy. (<bold>d</bold>) Homogeneity.</p></caption>
<graphic xlink:href="sensors-11-04721f10a.gif"/>
<graphic xlink:href="sensors-11-04721f10b.gif"/></fig>
<fig id="f11-sensors-11-04721" position="float">
<label>Figure 11.</label>
<caption>
<p>GLCM-based features of <italic>T</italic><sub>22</sub>. (<bold>a</bold>) Contrast; (<bold>b</bold>) Correlation; (<bold>c</bold>) Energy; (<bold>d</bold>) Homogeneity.</p></caption>
<graphic xlink:href="sensors-11-04721f11.gif"/></fig>
<fig id="f12-sensors-11-04721" position="float">
<label>Figure 12.</label>
<caption>
<p>GLCM-based features of <italic>T</italic><sub>33</sub>. (<bold>a</bold>) Contrast; (<bold>b</bold>) Correlation; (<bold>c</bold>) Energy; (<bold>d</bold>) Homogeneity.</p></caption>
<graphic xlink:href="sensors-11-04721f12a.gif"/>
<graphic xlink:href="sensors-11-04721f12b.gif"/></fig>
<fig id="f13-sensors-11-04721" position="float">
<label>Figure 13.</label>
<caption>
<p>Cumulative sum of variance <italic>versus</italic> principle components.</p></caption>
<graphic xlink:href="sensors-11-04721f13.gif"/></fig>
<fig id="f14-sensors-11-04721" position="float">
<label>Figure 14.</label>
<caption>
<p>Sample data areas of Flevoland. (<bold>a</bold>) Training Area; (<bold>b</bold>) Test Area; (<bold>c</bold>) Legend of Colors.</p></caption>
<graphic xlink:href="sensors-11-04721f14.gif"/></fig>
<fig id="f15-sensors-11-04721" position="float">
<label>Figure 15.</label>
<caption>
<p>The curve of fitness <italic>versus</italic> epoch.</p></caption>
<graphic xlink:href="sensors-11-04721f15.gif"/></fig>
<fig id="f16-sensors-11-04721" position="float">
<label>Figure 16.</label>
<caption>
<p>Confusion Matrixes of ACPSO-NN algorithm. (<bold>a</bold>) Training Area; (<bold>b</bold>) Test Area.</p></caption>
<graphic xlink:href="sensors-11-04721f16.gif"/></fig>
<table-wrap id="t1-sensors-11-04721" position="float">
<label>Table 1.</label>
<caption>
<p>Pauli bases and their corresponding meanings.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="middle"><bold>Pauli Basis</bold></th>
<th align="center" valign="middle"><bold>Meaning</bold></th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top"><italic>S<sub>a</sub></italic></td>
<td align="left" valign="top">Single- or odd-bounce scattering</td></tr>
<tr>
<td align="center" valign="top"><italic>S<sub>b</sub></italic></td>
<td align="left" valign="top">Double- or even-bounce scattering</td></tr>
<tr>
<td align="center" valign="top"><italic>S<sub>c</sub></italic></td>
<td align="left" valign="top">Those scatterers which are able to return the orthogonal polarization to the one of the incident wave (forest canopy)</td></tr></tbody></table></table-wrap>
<table-wrap id="t2-sensors-11-04721" position="float">
<label>Table 2.</label>
<caption>
<p>Properties of GLCM.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="middle"><bold>Property</bold></th>
<th align="center" valign="middle"><bold>Description</bold></th>
<th align="left" valign="middle"><bold>Formula</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">Contrast</td>
<td align="left" valign="top">Intensity contrast between a pixel and its neighbor</td>
<td align="left" valign="top">Σ|<italic>i</italic>–<italic>j</italic>|<sup>2</sup><italic>p</italic>(<italic>i</italic>,<italic>j</italic>)</td></tr>
<tr>
<td align="left" valign="top">Correlation</td>
<td align="left" valign="top">Correlation between a pixel and its neighbor (μ denotes the expected value, and <italic>σ</italic> the standard variance)</td>
<td align="left" valign="top">Σ [(<italic>i–μ</italic><italic><sub>i</sub></italic>)(<italic>j–μ</italic><italic><sub>j</sub></italic>)<italic>p</italic>(<italic>i</italic>,<italic>j</italic>)/(<italic>σ</italic><italic><sub>i</sub></italic><italic>σ</italic><italic><sub>j</sub></italic>)]</td></tr>
<tr>
<td align="left" valign="top">Energy</td>
<td align="left" valign="top">Energy of the whole image</td>
<td align="left" valign="top">Σ<italic>p</italic><sup>2</sup>(<italic>i</italic>,<italic>j</italic>)</td></tr>
<tr>
<td align="left" valign="top">Homogeneity</td>
<td align="left" valign="top">Closeness of the distribution of GLCM to the diagonal</td>
<td align="left" valign="top">Σ[<italic>p</italic>(<italic>i</italic>,<italic>j</italic>)/(1+|<italic>i</italic>–<italic>j</italic>|]</td></tr></tbody></table></table-wrap>
<table-wrap id="t3-sensors-11-04721" position="float">
<label>Table 3.</label>
<caption>
<p>Large <italic>K versus</italic> small <italic>K.</italic></p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="middle"><bold><italic>K</italic> value</bold></th>
<th align="center" valign="middle"><bold>Estimator Bias</bold></th>
<th align="center" valign="middle"><bold>Estimator Variance</bold></th>
<th align="center" valign="middle"><bold>Computation Time</bold></th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">Large</td>
<td align="center" valign="top">↓</td>
<td align="center" valign="top">↑</td>
<td align="center" valign="top">↑</td></tr>
<tr>
<td align="center" valign="top">small</td>
<td align="center" valign="top">↑</td>
<td align="center" valign="top">↓</td>
<td align="center" valign="top">↓</td></tr></tbody></table></table-wrap>
<table-wrap id="t4-sensors-11-04721" position="float">
<label>Table 4.</label>
<caption>
<p>Purposes of different subsets.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="middle"><bold>Subset</bold></th>
<th align="center" valign="middle"><bold>Intent</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">Training</td>
<td align="left" valign="top">Learning to fit the parameters of the classifier</td></tr>
<tr>
<td align="left" valign="top">Validation</td>
<td align="left" valign="top">Estimate the error rate to tune the parameters of the classifier</td></tr>
<tr>
<td align="left" valign="top">Testing</td>
<td align="left" valign="top">Estimate the true error rate to assess the classifier</td></tr></tbody></table></table-wrap>
<table-wrap id="t5-sensors-11-04721" position="float">
<label>Table 5.</label>
<caption>
<p>Detailed cumulative sum of variance.</p></caption>
<table frame="hsides" rules="groups">
<tbody>
<tr>
<td align="center" valign="middle"><bold>Dimensions</bold></td>
<td align="center" valign="middle">1</td>
<td align="center" valign="middle">2</td>
<td align="center" valign="middle">3</td>
<td align="center" valign="middle">4</td>
<td align="center" valign="middle">5</td>
<td align="center" valign="middle">6</td>
<td align="center" valign="middle">7</td>
<td align="center" valign="middle">8</td>
<td align="center" valign="middle">9</td></tr>
<tr>
<td align="center" valign="middle"><bold>Variance (%)</bold></td>
<td align="center" valign="middle">26.31</td>
<td align="center" valign="middle">42.98</td>
<td align="center" valign="middle">52.38</td>
<td align="center" valign="middle">60.50</td>
<td align="center" valign="middle">67.28</td>
<td align="center" valign="middle">73.27</td>
<td align="center" valign="middle">78.74</td>
<td align="center" valign="middle">82.61</td>
<td align="center" valign="middle">86.25</td></tr>
<tr>
<td colspan="10" align="left" valign="top">
<hr/></td></tr>
<tr>
<td align="center" valign="middle"><bold>Dimensions</bold></td>
<td align="center" valign="middle">10</td>
<td align="center" valign="middle">11</td>
<td align="center" valign="middle">12</td>
<td align="center" valign="middle">13</td>
<td align="center" valign="middle">14</td>
<td align="center" valign="middle">15</td>
<td align="center" valign="middle">16</td>
<td align="center" valign="middle">17</td>
<td align="center" valign="middle">18</td></tr>
<tr>
<td align="center" valign="middle"><bold>Variance (%)</bold></td>
<td align="center" valign="middle">89.52</td>
<td align="center" valign="middle">92.72</td>
<td align="center" valign="middle">95.50</td>
<td align="center" valign="middle">98.06</td>
<td align="center" valign="middle">98.79</td>
<td align="center" valign="middle">99.24</td>
<td align="center" valign="middle">99.63</td>
<td align="center" valign="middle">99.94</td>
<td align="center" valign="middle">99.97</td></tr></tbody></table></table-wrap>
<table-wrap id="t6-sensors-11-04721" position="float">
<label>Table 6.</label>
<caption>
<p>Sample numbers of training and test area.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="middle"><bold>Training Area</bold></th>
<th align="left" valign="middle"><bold>Test Area</bold></th>
<th align="left" valign="middle"><bold>Total</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">5,200 10 loops (4,680 for train and 520 for validation)</td>
<td align="left" valign="top">5,200</td>
<td align="left" valign="top">10,400</td></tr></tbody></table></table-wrap>
<table-wrap id="t7-sensors-11-04721" position="float">
<label>Table 7.</label>
<caption>
<p>Parameters of PSO &amp; ACPSO.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="top" rowspan="2"><bold>Parameters</bold></th>
<th colspan="2" align="center" valign="middle"><bold>Values</bold>
<hr/></th></tr>
<tr>
<th align="center" valign="middle"><bold>PSO</bold></th>
<th align="center" valign="middle"><bold>ACPSO</bold></th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">Dimensions</td>
<td align="center" valign="top">393</td>
<td align="center" valign="top">393</td></tr>
<tr>
<td align="center" valign="top"><italic>V</italic><sub>max</sub></td>
<td align="center" valign="top">0.04</td>
<td align="center" valign="top">0.04</td></tr>
<tr>
<td align="center" valign="top">Maximum Iterations</td>
<td align="center" valign="top">2,000</td>
<td align="center" valign="top">2,000</td></tr>
<tr>
<td align="center" valign="top"><italic>k</italic><sub>max</sub></td>
<td align="center" valign="top">1,500</td>
<td align="center" valign="top">1,500</td></tr>
<tr>
<td align="center" valign="top"><italic>NP</italic></td>
<td align="center" valign="top">24</td>
<td align="center" valign="top">24</td></tr>
<tr>
<td align="center" valign="top"><italic>c</italic><sub>1</sub></td>
<td align="center" valign="top">2</td>
<td align="center" valign="top">2</td></tr>
<tr>
<td align="center" valign="top"><italic>c</italic><sub>2</sub></td>
<td align="center" valign="top">2</td>
<td align="center" valign="top">2</td></tr>
<tr>
<td align="center" valign="top">Function tolerance</td>
<td align="center" valign="top">1e<sup>−6</sup></td>
<td align="center" valign="top">1e<sup>−6</sup></td></tr>
<tr>
<td align="center" valign="top">ω<sub>max</sub></td>
<td align="center" valign="top">-</td>
<td align="center" valign="top">0.9</td></tr>
<tr>
<td align="center" valign="top">ω<sub>min</sub></td>
<td align="center" valign="top">-</td>
<td align="center" valign="top">0.4</td></tr>
<tr>
<td align="center" valign="top"><italic>a</italic></td>
<td align="center" valign="top">-</td>
<td align="center" valign="top">0.2</td></tr>
<tr>
<td align="center" valign="top"><italic>b</italic></td>
<td align="center" valign="top">-</td>
<td align="center" valign="top">0.4</td></tr>
<tr>
<td align="center" valign="top"><italic>c</italic></td>
<td align="center" valign="top">-</td>
<td align="center" valign="top">5.7</td></tr></tbody></table></table-wrap>
<table-wrap id="t8-sensors-11-04721" position="float">
<label>Table 8.</label>
<caption>
<p>A typical classification accuracy of different algorithms (Maximum iterations = 2,000).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="middle"><bold>Algorithm</bold></th>
<th align="center" valign="middle"><bold>Training Area</bold></th>
<th align="center" valign="middle"><bold>Test Area</bold></th>
<th align="center" valign="middle"><bold>Rank</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">Random</td>
<td align="center" valign="top">7.69%</td>
<td align="center" valign="top">7.69%</td>
<td align="center" valign="top">7</td></tr>
<tr>
<td align="left" valign="top">MBP</td>
<td align="center" valign="top">8.8%</td>
<td align="center" valign="top">7.5%</td>
<td align="center" valign="top">6</td></tr>
<tr>
<td align="left" valign="top">BP</td>
<td align="center" valign="top">8.3%</td>
<td align="center" valign="top">8.2%</td>
<td align="center" valign="top">5</td></tr>
<tr>
<td align="left" valign="top">ABP</td>
<td align="center" valign="top">90.7%</td>
<td align="center" valign="top">86.4%</td>
<td align="center" valign="top">4</td></tr>
<tr>
<td align="left" valign="top">PSO</td>
<td align="center" valign="top">98.1%</td>
<td align="center" valign="top">88.7%</td>
<td align="center" valign="top">3</td></tr>
<tr>
<td align="left" valign="top">RPROP[<xref ref-type="bibr" rid="b41-sensors-11-04721">41</xref>]</td>
<td align="center" valign="top">98.62%</td>
<td align="center" valign="top">92.87%</td>
<td align="center" valign="top">2</td></tr>
<tr>
<td align="left" valign="top">ACPSO</td>
<td align="center" valign="top">99.0%</td>
<td align="center" valign="top">94.0</td>
<td align="center" valign="top">1</td></tr></tbody></table></table-wrap>
<table-wrap id="t9-sensors-11-04721" position="float">
<label>Table 9.</label>
<caption>
<p>Statistical results of different algorithms (Maximum iterations = 2,000).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="middle" rowspan="3"><bold>Algorithm</bold></th>
<th colspan="3" align="center" valign="middle"><bold>Training Area</bold></th>
<th colspan="3" align="center" valign="middle"><bold>Test Area</bold></th></tr>
<tr>
<th colspan="7" align="left" valign="top">
<hr/></th></tr>
<tr>
<th align="center" valign="middle"><bold>Min</bold></th>
<th align="center" valign="middle"><bold>Ave</bold></th>
<th align="center" valign="middle"><bold>Max</bold></th>
<th align="center" valign="middle"><bold>Min</bold></th>
<th align="center" valign="middle"><bold>Ave</bold></th>
<th align="center" valign="middle"><bold>Max</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">Random</td>
<td align="left" valign="top">7.58%</td>
<td align="left" valign="top">7.69%</td>
<td align="left" valign="top">7.83%</td>
<td align="left" valign="top">7.58%</td>
<td align="left" valign="top">7.69%</td>
<td align="left" valign="top">7.81%</td></tr>
<tr>
<td align="left" valign="top">MBP</td>
<td align="left" valign="top">8.52%</td>
<td align="left" valign="top">8.83%</td>
<td align="left" valign="top">9.08%</td>
<td align="left" valign="top">6.98%</td>
<td align="left" valign="top">7.44%</td>
<td align="left" valign="top">7.92%</td></tr>
<tr>
<td align="left" valign="top">BP</td>
<td align="left" valign="top">7.96%</td>
<td align="left" valign="top">8.33%</td>
<td align="left" valign="top">8.65%</td>
<td align="left" valign="top">7.90%</td>
<td align="left" valign="top">8.17%</td>
<td align="left" valign="top">8.35%</td></tr>
<tr>
<td align="left" valign="top">ABP</td>
<td align="left" valign="top">81.04%</td>
<td align="left" valign="top">87.18%</td>
<td align="left" valign="top">94.12%</td>
<td align="left" valign="top">76.60%</td>
<td align="left" valign="top">83.55%</td>
<td align="left" valign="top">89.83%</td></tr>
<tr>
<td align="left" valign="top">PSO</td>
<td align="left" valign="top">95.83%</td>
<td align="left" valign="top">97.68%</td>
<td align="left" valign="top">98.52%</td>
<td align="left" valign="top">83.15%</td>
<td align="left" valign="top">89.32%</td>
<td align="left" valign="top">91.54%</td></tr>
<tr>
<td align="left" valign="top">RPROP</td>
<td align="left" valign="top">97.63%</td>
<td align="left" valign="top">98.71%</td>
<td align="left" valign="top">98.90%</td>
<td align="left" valign="top">90.87%</td>
<td align="left" valign="top">92.65%</td>
<td align="left" valign="top">93.77%</td></tr>
<tr>
<td align="left" valign="top">ACPSO</td>
<td align="left" valign="top">98.15%</td>
<td align="left" valign="top">98.84%</td>
<td align="left" valign="top">99.13%</td>
<td align="left" valign="top">92.56%</td>
<td align="left" valign="top">93.80%</td>
<td align="left" valign="top">94.52%</td></tr></tbody></table></table-wrap>
<table-wrap id="t10-sensors-11-04721" position="float">
<label>Table 10.</label>
<caption>
<p>Computation Time of Flevoland image classification.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="middle"><bold>Stage</bold></th>
<th align="center" valign="middle"><bold>Time</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">Span</td>
<td align="left" valign="top">0.13 s</td></tr>
<tr>
<td align="left" valign="top">H/A/α decomposition</td>
<td align="left" valign="top">0.24 s</td></tr>
<tr>
<td align="left" valign="top">GLCM</td>
<td align="left" valign="top">0.23 s</td></tr>
<tr>
<td align="left" valign="top">PCA</td>
<td align="left" valign="top">0.18 s</td></tr>
<tr>
<td align="left" valign="top">NN Training<xref ref-type="table-fn" rid="tfn1-sensors-11-04721">*</xref></td>
<td align="left" valign="top">120 s</td></tr>
<tr>
<td align="left" valign="top">Classification</td>
<td align="left" valign="top">0.048 s</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn1-sensors-11-04721">
<p>(* denotes training time can be ignored)</p></fn></table-wrap-foot></table-wrap></sec></back></article>
