<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Sensors</journal-id>
<journal-title>Sensors</journal-title>
<issn pub-type="epub">1424-8220</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/s120404764</article-id>
<article-id pub-id-type="publisher-id">sensors-12-04764</article-id>
<article-categories>
<subj-group>
<subject>Review</subject></subj-group></article-categories>
<title-group>
<article-title>Multiple Classifier System for Remote Sensing Image Classification: A Review</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Du</surname><given-names>Peijun</given-names></name><xref ref-type="aff" rid="af1-sensors-12-04764"><sup>1</sup></xref><xref ref-type="aff" rid="af2-sensors-12-04764"><sup>2</sup></xref><xref ref-type="corresp" rid="c1-sensors-12-04764"><sup>*</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Xia</surname><given-names>Junshi</given-names></name><xref ref-type="aff" rid="af2-sensors-12-04764"><sup>2</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Zhang</surname><given-names>Wei</given-names></name><xref ref-type="aff" rid="af3-sensors-12-04764"><sup>3</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Tan</surname><given-names>Kun</given-names></name><xref ref-type="aff" rid="af2-sensors-12-04764"><sup>2</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Liu</surname><given-names>Yi</given-names></name><xref ref-type="aff" rid="af2-sensors-12-04764"><sup>2</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Liu</surname><given-names>Sicong</given-names></name><xref ref-type="aff" rid="af2-sensors-12-04764"><sup>2</sup></xref></contrib></contrib-group>
<aff id="af1-sensors-12-04764">
<label>1</label> Department of Geographical Information Science, Nanjing University, Nanjing 210093, China</aff>
<aff id="af2-sensors-12-04764">
<label>2</label> Key Laboratory for Land Environment and Disaster Monitoring of State Bureau of Surveying and Mapping of China, China University of Mining and Technology, Xuzhou 221116, China; E-Mails: <email>xiajunshi@126.com</email> (J.X.); <email>tankun@cumt.edu.cn</email> (K.T.); <email>rs_liuyi@163.com</email> (Y.L.); <email>allencong@163.com</email> (S.L.)</aff>
<aff id="af3-sensors-12-04764">
<label>3</label> Hebei Bureau of Surveying and Mapping, Shijiazhuang 050031, China; E-Mail: <email>cumtwzh@163.com</email></aff>
<author-notes>
<corresp id="c1-sensors-12-04764">
<label>*</label>Author to whom correspondence should be addressed; E-Mail: <email>dupjrs@126.com</email>; Tel.: +86-139-5219-6497; Fax: +86-25-8359-2686.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2012</year></pub-date>
<pub-date pub-type="epub">
<day>12</day>
<month>4</month>
<year>2012</year></pub-date>
<volume>12</volume>
<issue>4</issue>
<fpage>4764</fpage>
<lpage>4792</lpage>
<history>
<date date-type="received">
<day>22</day>
<month>2</month>
<year>2012</year></date>
<date date-type="rev-recd">
<day>22</day>
<month>3</month>
<year>2012</year></date>
<date date-type="accepted">
<day>6</day>
<month>4</month>
<year>2012</year></date></history>
<permissions>
<copyright-statement>© 2012 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
<copyright-year>2012</copyright-year>
<license>
<p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>Over the last two decades, multiple classifier system (MCS) or classifier ensemble has shown great potential to improve the accuracy and reliability of remote sensing image classification. Although there are lots of literatures covering the MCS approaches, there is a lack of a comprehensive literature review which presents an overall architecture of the basic principles and trends behind the design of remote sensing classifier ensemble. Therefore, in order to give a reference point for MCS approaches, this paper attempts to explicitly review the remote sensing implementations of MCS and proposes some modified approaches. The effectiveness of existing and improved algorithms are analyzed and evaluated by multi-source remotely sensed images, including high spatial resolution image (QuickBird), hyperspectral image (OMISII) and multi-spectral image (Landsat ETM+). Experimental results demonstrate that MCS can effectively improve the accuracy and stability of remote sensing image classification, and diversity measures play an active role for the combination of multiple classifiers. Furthermore, this survey provides a roadmap to guide future research, algorithm enhancement and facilitate knowledge accumulation of MCS in remote sensing community.</p></abstract>
<kwd-group>
<kwd>multiple classifier system</kwd>
<kwd>classifier ensemble</kwd>
<kwd>remote sensing</kwd>
<kwd>classification</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<title>1. Introduction</title>
<p>Remote sensing images are widely used for land cover classification, target identification and thematic mapping from local to global scales owing to its technical advantages such as multi-resolution, wide coverage, repeatable observation and multi/hyperspectral-spectral records [<xref ref-type="bibr" rid="b1-sensors-12-04764">1</xref>]. With the advancement of remote sensing data acquisition technology, remote sensing images can be acquired by various sensors, for example, hyperspectral imaging spectrometer, high resolution sensors, polarimetric synthetic aperture radar, <italic>etc.</italic> [<xref ref-type="bibr" rid="b2-sensors-12-04764">2</xref>]. Under this situation, image classification techniques are exposed with new challenges to process the multi-source data and serve to multidisciplinary applications [<xref ref-type="bibr" rid="b2-sensors-12-04764">2</xref>,<xref ref-type="bibr" rid="b3-sensors-12-04764">3</xref>]. One effective solution is to generate a classifier ensemble by combining some individual classifiers, which is named as multiple classifier system (MCS) or classifier ensemble [<xref ref-type="bibr" rid="b4-sensors-12-04764">4</xref>–<xref ref-type="bibr" rid="b10-sensors-12-04764">10</xref>]. In the past twenty years, MCS has developed rapidly and been widely used in various fields such as pattern recognition, image processing and target identification. Besides, MCS has become a hot topic in the attractive series international workshops, mainly because of its capability to improve accuracy and efficiency [<xref ref-type="bibr" rid="b11-sensors-12-04764">11</xref>–<xref ref-type="bibr" rid="b14-sensors-12-04764">14</xref>].</p>
<p>MCS, which has been utilized by the remote sensing society recently, is viewed as one effective way to improve classification performance of remotely sensed imagery [<xref ref-type="bibr" rid="b3-sensors-12-04764">3</xref>,<xref ref-type="bibr" rid="b4-sensors-12-04764">4</xref>]. Many researchers investigated the scheme of combining several classifiers to generate a single result. Both theoretical and empirical studies indicate that a good MCS is one where individual classifiers are accurate and at the same time disagree on some different parts of the input space [<xref ref-type="bibr" rid="b15-sensors-12-04764">15</xref>]. Two popular approaches for creating accurate ensemble are Bagging and Boosting [<xref ref-type="bibr" rid="b16-sensors-12-04764">16</xref>,<xref ref-type="bibr" rid="b17-sensors-12-04764">17</xref>]. Bagging uses bootstrap sampling to generate accurate ensemble [<xref ref-type="bibr" rid="b16-sensors-12-04764">16</xref>]. Boosting is a general method of producing a very accurate prediction rule by combining rough and moderately inaccurate learner [<xref ref-type="bibr" rid="b17-sensors-12-04764">17</xref>].</p>
<p>Some previous studies demonstrated that the simple majority vote of the classifier prediction is an effective strategy. Within this scheme, a pixel is classified as the class that was chosen by the majority of the individual classifiers. Foody et <italic>al</italic>. used simple majority vote rule to integrate multiple binary classifiers for the mapping of a specific class [<xref ref-type="bibr" rid="b7-sensors-12-04764">7</xref>]. Benediktsson and Swain proposed a consensus theoretic classification approach and designed a hybrid consensus theoretic classification scheme [<xref ref-type="bibr" rid="b18-sensors-12-04764">18</xref>,<xref ref-type="bibr" rid="b19-sensors-12-04764">19</xref>]. The experimental results using hybrid consensus theoretic classification scheme outperform other statistical/neural network and consensus theoretic algorithms. Doan and Foody investigated the combination of soft classification methods for remote sensing image and found that classification combination could improve the accuracy [<xref ref-type="bibr" rid="b8-sensors-12-04764">8</xref>]. Fauvel <italic>et al</italic> fused the classification results derived from a neural network and a fuzzy classifier using fuzzy sets theory, and the proposed method improved the classification accuracy compared to the separate use of different classifiers [<xref ref-type="bibr" rid="b20-sensors-12-04764">20</xref>]. Smits focused on the family of MCSs based on dynamic classifier selection (DCS) and proposed a modification to dynamic classifier selection by local accuracy (DCS-LA) [<xref ref-type="bibr" rid="b9-sensors-12-04764">9</xref>]. Experiments had shown that DCS-LA is a valid alternative method to majority vote.</p>
<p>In addition, MCS was also used for multi-source and hyperspectral remote sensing images. Briem <italic>et al</italic>. applied multiple classifiers (minimum Euclidean distance classifier, maximum likelihood classifier, conjugate-gradient back propagation, decision table, J48, 1R) to multisource remote sensing data (multispectral, synthetic aperture radar and topographical data sets) and demonstrated that Bagging, Boosting and consensus theory obtained higher overall user and producer accuracies than traditional classifiers [<xref ref-type="bibr" rid="b4-sensors-12-04764">4</xref>]. Debeir <italic>et al</italic>. integrated spectral features, spatial and contextual information and experimented Bagging and subset selection algorithms with high resolution remotely sensed images, and the accuracy was higher than neural network and decision tree classifier [<xref ref-type="bibr" rid="b21-sensors-12-04764">21</xref>]. Kumar <italic>et al</italic>. designed a hierarchical fusion strategy for classifying hyperspectral remote sensing image, and the classifier fusion method outperformed other traditional approaches, including maximum likelihood classifier (MLC) <italic>etc.</italic> [<xref ref-type="bibr" rid="b11-sensors-12-04764">11</xref>]. Waske <italic>et al</italic>. fused Support Vector Machines (SVM) for classifying SAR and multispectral imagery from agricultural areas[<xref ref-type="bibr" rid="b22-sensors-12-04764">22</xref>]. The presented approaches are superior to all other techniques, such as a standard SVM. Ceamanos <italic>et al</italic>. designed a classifier ensemble for classifying hyperspectral data [<xref ref-type="bibr" rid="b15-sensors-12-04764">15</xref>]. Firstly, hyperspectral image is divided into several parts according to the similarity of spectral bands. Then, each part is used as the input of SVM classifier. Finally, all outputs are applied to be combined together by an additional SVM classifier. The results clearly demonstrated that the proposed SVM fusion scheme outperforms a standard SVM classifier. Owing to its technical superiorities, MCS is also applied to change detection from multi-temporal remote sensing images [<xref ref-type="bibr" rid="b23-sensors-12-04764">23</xref>,<xref ref-type="bibr" rid="b24-sensors-12-04764">24</xref>].</p>
<p>As mentioned above, the classification results to be combined should be diverse. If the classification results are identical or similar, it would not improve the accuracy by combining them. Therefore, diversity is a vital requirement for the success of ensemble, demonstrated by theoretical and empirical studies [<xref ref-type="bibr" rid="b25-sensors-12-04764">25</xref>,<xref ref-type="bibr" rid="b26-sensors-12-04764">26</xref>]. Kuncheva and Whitaker summarized the diversity measures in classifier ensembles [<xref ref-type="bibr" rid="b27-sensors-12-04764">27</xref>]. A special issue called “Diversity Measure in Multiple Classifier System” published in Information Fusion journal indicates that diversity measure is an important research direction in MCS [<xref ref-type="bibr" rid="b28-sensors-12-04764">28</xref>]. Petrakos <italic>et al</italic>. applied agreement measure in decision fusion level combination [<xref ref-type="bibr" rid="b29-sensors-12-04764">29</xref>]. Foody compared the different classification results from three aspects (similarity, non-inferiority, difference) using hypothesis tests and confidence interval algorithms [<xref ref-type="bibr" rid="b30-sensors-12-04764">30</xref>]. Brown <italic>et al</italic>. indicated that the diversity for classification tasks is still an ill-defined concept, and defining an appropriate diversity measures for MCS is still an open question [<xref ref-type="bibr" rid="b28-sensors-12-04764">28</xref>]. So far, there are many existing diversity measures, but in the remote sensing literatures, they are rarely used and compared for remote sensing image classification by MCS. In conclusion, a successful MCS system depends to a large extent on the proper selection of diverse classifiers for incorporation [<xref ref-type="bibr" rid="b31-sensors-12-04764">31</xref>]. Kuncheva claims that the choice of an appropriate ensemble strategy can improve the performance of MCS [<xref ref-type="bibr" rid="b32-sensors-12-04764">32</xref>]. More details about diversity measures and classifier ensemble approaches can be seen in Section 2.</p>
<p>The objective of this paper is to give a review on the uses of MCS in remote sensing and the ideas of improving MCS approaches, including classifier ensemble approaches and diversity measures, and some modified algorithms. In order to demonstrate the effectiveness of MCS, the mentioned MCS approaches are experimented with three typical optical remotely sensed images: high spatial resolution image (QuickBird image), hyperspectral image (OMISII image) and multi-spectral image (Landsat ETM+ image).</p>
<p>The remainder of this paper is organized as follows. In Section 2, we introduce some common approaches of MCS, summarize the advances of MCS to remote sensing image classification, and propose some improved algorithms. Experimental results are presented in Section 3. Section 4 is the conclusion of this paper.</p></sec>
<sec>
<title>2. Multiple Classifier System for Remote Sensing Image Classification</title>
<sec>
<title>2.1. MCS: A Primer</title>
<p>For remote sensing image classification, Giacinto <italic>et al.</italic> compared the performance of remote sensing classification approaches in different applications and found that no classifier could always perform well [<xref ref-type="bibr" rid="b33-sensors-12-04764">33</xref>]. To alleviate this problem, MCS can provide the complementary information of the pattern classifiers and integrate the outputs of these pattern classifiers according to a certain combination approaches [<xref ref-type="bibr" rid="b5-sensors-12-04764">5</xref>,<xref ref-type="bibr" rid="b34-sensors-12-04764">34</xref>–<xref ref-type="bibr" rid="b36-sensors-12-04764">36</xref>]. Therefore, it can outperform any individual classifier. In addition to MCS in the domain of classification, some concepts are similar to MCS, including multiple classifier combination [<xref ref-type="bibr" rid="b37-sensors-12-04764">37</xref>,<xref ref-type="bibr" rid="b38-sensors-12-04764">38</xref>], classifier ensemble [<xref ref-type="bibr" rid="b10-sensors-12-04764">10</xref>,<xref ref-type="bibr" rid="b27-sensors-12-04764">27</xref>,<xref ref-type="bibr" rid="b39-sensors-12-04764">39</xref>], decision level combination [<xref ref-type="bibr" rid="b40-sensors-12-04764">40</xref>,<xref ref-type="bibr" rid="b41-sensors-12-04764">41</xref>], ensemble learning [<xref ref-type="bibr" rid="b42-sensors-12-04764">42</xref>,<xref ref-type="bibr" rid="b43-sensors-12-04764">43</xref>], <italic>etc.</italic></p>
<p>We assume <italic>M</italic> classes exist on a remote sensing image, represented by <italic>C</italic><sub>1</sub> ∪ <italic>C</italic><sub>2</sub>…<italic>C</italic><sub>i</sub> ∪ <italic>C<sub>M</sub>, i</italic> ∈ {1,2,…, <italic>M</italic>}. For hard classification, each pixel is allocated a specific class label <italic>J, J</italic> ∈ {1,2,…, <italic>M</italic>}. For soft classification, each pixel is depicted by a vector {<italic>P</italic>(1), <italic>P</italic>(2),…, <italic>P(M)</italic>}, where <italic>P</italic>(<italic>i</italic>) is the probability or membership of the pixel belonging to the <italic>i<sup>th</sup></italic> class, or the abundance of the <italic>i<sup>th</sup></italic> endmember within the pixel. When multiple classifiers are combined, both hard and soft classification results can be integrated.</p>
<p>In order to introduce the MCS approaches clearly, three categories are adopted to summarize the classifier ensemble approaches:
<list list-type="order">
<list-item>
<p>Algorithms based on training sample manipulation. The most popular methods are Bagging and Boosting [<xref ref-type="bibr" rid="b16-sensors-12-04764">16</xref>,<xref ref-type="bibr" rid="b17-sensors-12-04764">17</xref>].</p></list-item>
<list-item>
<p>Concatenation combination. In a concatenation combination, the classification result generated by a classifier is used as the input into the next classifier. The results achieved through each classifier are similarly passed onto the next classifier until a result is obtained through the final classifier in the chain [<xref ref-type="bibr" rid="b31-sensors-12-04764">31</xref>,<xref ref-type="bibr" rid="b44-sensors-12-04764">44</xref>,<xref ref-type="bibr" rid="b45-sensors-12-04764">45</xref>] (<xref ref-type="fig" rid="f1-sensors-12-04764">Figure 1(a)</xref>).</p></list-item>
<list-item>
<p>Parallel combination. In a parallel combination, multiple classifiers are designed independently without any mutual interaction and their outputs are combined according to certain strategies [<xref ref-type="bibr" rid="b44-sensors-12-04764">44</xref>,<xref ref-type="bibr" rid="b45-sensors-12-04764">45</xref>] (<xref ref-type="fig" rid="f1-sensors-12-04764">Figure 1(b)</xref>). The main design decision is the selection of a reprehensive ensemble approach. If the approach is well designed, the MCS can obtain the accurate result [<xref ref-type="bibr" rid="b31-sensors-12-04764">31</xref>]. Some of the popular and successful ensemble approaches are majority voting, fuzzy integral, D-S evidence theory, <italic>etc.</italic></p></list-item></list></p>
<p>Besides concatenation and parallel combination, hierarchical combination that combines both concatenation and parallel combination is also used [<xref ref-type="bibr" rid="b31-sensors-12-04764">31</xref>]. But in this paper, we just focus on the concatenation and parallel combination.</p>
<p>As mentioned in Section 1, diversity of the classifier outputs is a vital requirement for the success of the ensemble. Diversity measures are often divided into parts: pairwise and non-pairwise [<xref ref-type="bibr" rid="b27-sensors-12-04764">27</xref>]. Pairwise measures include kappa statistics, double fault, disagreement, <italic>etc.</italic>, and non-pairwise measures consist of entropy, weighted count of errors and correct results (WCEC), <italic>etc.</italic> [<xref ref-type="bibr" rid="b27-sensors-12-04764">27</xref>,<xref ref-type="bibr" rid="b46-sensors-12-04764">46</xref>]. When pairwise measures are used to calculate the diversity among more than two classification results, it adopts the average of the pairwise measures between each two classification results. The non-pairwise diversity measures consider all the classifiers together and calculate directly one diversity value for the ensemble [<xref ref-type="bibr" rid="b12-sensors-12-04764">12</xref>]. In this paper, both pairwise and non-pairwise diversity measures, including kappa statistics, double fault, entropy, WCEC and a novel measure are reviewed.</p>
<p>The following sections describe the classifier ensemble approaches and diversity measures. Section 2.2 introduces Bagging and AdaBoost. Section 2.3 lists out a collection of parallel combination methods. Section 2.4 gives the basic idea of concatenation combination schemes. We proposed three modified concatenation combination approaches in Section 2.5. Section 2.6 presents five diversity measures, three of them are pairwise and the rest are non-pairwise.</p></sec>
<sec>
<title>2.2. Bagging and AdaBoost</title>
<p>Unlike statistical voting theory which is based on the assumption of independent data sources and uses all training samples only one time, Boosting and Bagging are exerted by manipulating training samples [<xref ref-type="bibr" rid="b16-sensors-12-04764">16</xref>,<xref ref-type="bibr" rid="b17-sensors-12-04764">17</xref>].</p>
<p>Bagging is the abbreviation of bootstrap aggregating. In this algorithm, <italic>n</italic> samples are selected at random from a training set with <italic>k</italic> samples, and instructive iteration is exerted to create some different bags, and each bag is classified by vote to predict its class [<xref ref-type="bibr" rid="b16-sensors-12-04764">16</xref>]. Boosting can process data with weights, and the weights of misclassified samples are increased to concentrate the learning algorithm on specific samples. Bagging has been shown to reduce the variance of the classification, while Boosting reduces both the variance and the bias of the classification. So in most cases, Boosting can produce more accurate classification results than Bagging. However, the computation time of Boosting is more than Bagging, and Boosting is sensitive to noise [<xref ref-type="bibr" rid="b47-sensors-12-04764">47</xref>]. AdaBoost is the popular used approach of Boosting algorithms. The pseudo-code of Bagging and AdaBoost can be seen in <xref ref-type="fig" rid="f2-sensors-12-04764">Figure 2</xref> [<xref ref-type="bibr" rid="b16-sensors-12-04764">16</xref>,<xref ref-type="bibr" rid="b17-sensors-12-04764">17</xref>,<xref ref-type="bibr" rid="b47-sensors-12-04764">47</xref>]. In addition, there is a great variety of approaches drawn upon the basic idea of Bagging and Boosting. Detailed descriptions can be found from Wagging [<xref ref-type="bibr" rid="b48-sensors-12-04764">48</xref>], Random Forest [<xref ref-type="bibr" rid="b49-sensors-12-04764">49</xref>], Random Subspace [<xref ref-type="bibr" rid="b50-sensors-12-04764">50</xref>], Logistic Boosting [<xref ref-type="bibr" rid="b51-sensors-12-04764">51</xref>], MultiBoost [<xref ref-type="bibr" rid="b52-sensors-12-04764">52</xref>], Rotation Forest [<xref ref-type="bibr" rid="b39-sensors-12-04764">39</xref>], Rotboost [<xref ref-type="bibr" rid="b53-sensors-12-04764">53</xref>], <italic>etc.</italic></p></sec>
<sec>
<title>2.3. Parallel Combination</title>
<sec>
<title>Majority or weighted vote</title>
<p>The most popular ensemble approach is the majority vote by which each individual classifier “votes” for the specific class, and the class that collects the majority votes is predicted by the ensemble [<xref ref-type="bibr" rid="b12-sensors-12-04764">12</xref>]. By generalizing this approach, Xu <italic>et al.</italic> proposed a threshold plurality vote, which imposed a threshold on the number of votes to select the class [<xref ref-type="bibr" rid="b44-sensors-12-04764">44</xref>]. The major problem of majority vote is that all the classifiers have the same “authority” regardless of their respective abilities to classify properly [<xref ref-type="bibr" rid="b54-sensors-12-04764">54</xref>]. Therefore, the weighting methods are best suited for problems where the individual classifiers perform the same task. The simple one is that the decision of each classifier is weighted according to the estimated accuracy on the training set. Furthermore, Moreno-Seco <italic>et al.</italic> proposed three variant of weighted voting approaches: re-scaled weighted vote (RSMV), best-worst weighted vote (BWWV) and quadratic best-worst weighted vote (QBWWV) [<xref ref-type="bibr" rid="b54-sensors-12-04764">54</xref>]. Among all voting algorithm tested in [<xref ref-type="bibr" rid="b54-sensors-12-04764">54</xref>], each method has shown the different performance in different datasets. In this paper, we adopted the specific class accuracy generated by the individual classifier as the weights.</p></sec>
<sec>
<title>Bayesian average method</title>
<p>Bayesian combiners are used to linearly combine the probabilistic predictions of multiple classifiers by weighting their posterior probabilities [<xref ref-type="bibr" rid="b55-sensors-12-04764">55</xref>]. It can be defined as follows:
<disp-formula id="FD1">
<label>(1)</label>
<mml:math id="mm1" display="block">
<mml:semantics id="sm1">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo>∈</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>∣</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>k</mml:mi></mml:mfrac>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>N</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo>∈</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>∣</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>M</mml:mi></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>Then, the class label is decided:
<disp-formula id="FD2">
<label>(2)</label>
<mml:math id="mm2" display="block">
<mml:semantics id="sm2">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo>arg</mml:mo>
<mml:munder>
<mml:mrow>
<mml:mo>max</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>M</mml:mi></mml:mrow></mml:munder>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo>∈</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>∣</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>P<sub>k</sub> (X</italic> ε <italic>C<sub>i</sub></italic>∣<italic>X)</italic> represents the probability of class <italic>i</italic> of pixel <italic>X</italic> in <italic>k<sup>th</sup></italic> classifier, <italic>M</italic> is the total number of classes, <italic>N</italic> is the total number of classifiers. <italic>L(X)</italic> represents the class label of pixel <italic>X</italic>.</p>
<p>Other simple operators such as <italic>Maximum, Minimum</italic> and <italic>Product</italic> have been applied to combine the posterior probabilities [<xref ref-type="bibr" rid="b56-sensors-12-04764">56</xref>]. On the basis of Bayesian framework, Kittler <italic>et al.</italic> provided a theoretical underpinning of many existing combination schemes based on the product and sum rule, showing that the sum rule is less sensitive to the errors of subsets of base classifiers [<xref ref-type="bibr" rid="b57-sensors-12-04764">57</xref>].</p></sec>
<sec>
<title>Fuzzy Integral</title>
<p>Fuzzy integral is an effective information fusion algorithm based on the fuzzy measure [<xref ref-type="bibr" rid="b58-sensors-12-04764">58</xref>,<xref ref-type="bibr" rid="b59-sensors-12-04764">59</xref>]. Sugeno integral is a widely used fuzzy integral algorithm, in which fuzzy measure <italic>g</italic> is defined as a function in measurable space: <italic>g</italic>: Θ → [0,1], meeting the conditions as follows:
<list list-type="order">
<list-item>
<p><italic>g</italic>(Φ) = 0, <italic>g(S)</italic> = 1,</p></list-item>
<list-item>
<p>If <italic>A, B</italic> ∈ Θ, and A ⊆ B ⇒ <italic>g</italic> (<italic>A)</italic> ≤ <italic>g</italic>(<italic>B</italic>),</p></list-item>
<list-item>
<p>If 
<inline-formula>
<mml:math id="mm3" display="inline">
<mml:semantics id="sm3">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>}</mml:mo></mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>∞</mml:mo></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> is a incremental measurable set, then <italic>g</italic> (lim<italic><sub>i</sub></italic><sub>→∞</sub>) = lim<italic><sub>i</sub></italic><sub>→∞</sub> <italic>g</italic>(<italic>A<sub>i</sub></italic>).</p></list-item></list></p>
<p>Fuzzy measure <italic>g</italic>λ was introduced in Sugeno integral.</p>
<p>For any <italic>A, B</italic> ∈ Θ, <italic>g(A</italic> ∪ <italic>B)</italic> = <italic>g(A)</italic> + <italic>g(B)</italic> + λ<italic>g(A)g(B)</italic>, λ &gt; −1.</p>
<p>Suppose that (<italic>S, R, g</italic>) is the fuzzy measurement space, and f : <italic>S</italic> → [0,1] is a measurable function on <italic>R</italic>. When <italic>S</italic> = <italic>s</italic><sub>1</sub>, <italic>s</italic><sub>2</sub>, …, <italic>s<sub>N</sub></italic> is a finite set and meets the condition of 1 ≥ <italic>f</italic>(<italic>s</italic><sub>1</sub>) <italic>≥ f</italic>(<italic>s</italic><sub>2</sub>) … ≥ <italic>f</italic>(<italic>s<sub>n</sub></italic>) ≥ 0, the fuzzy measurement can be calculated as the follows:
<disp-formula id="FD3">
<label>(3)</label>
<mml:math id="mm4" display="block">
<mml:semantics id="sm4">
<mml:mrow>
<mml:mi>g</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi>g</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mo>{</mml:mo>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>}</mml:mo>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:semantics></mml:math></disp-formula>
<disp-formula id="FD4">
<label>(4)</label>
<mml:math id="mm5" display="block">
<mml:semantics id="sm5">
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mi>g</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi>g</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mo>{</mml:mo>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>}</mml:mo>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>+</mml:mo>
<mml:mi>g</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>+</mml:mo>
<mml:mi>λ</mml:mi>
<mml:mi>g</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mo>{</mml:mo>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>}</mml:mo>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>g</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&lt;</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi>N</mml:mi></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd/></mml:mtr></mml:mtable></mml:semantics></mml:math></disp-formula>where <italic>A<sub>i</sub></italic> = {<italic>s</italic><sub>1</sub>, <italic>s</italic><sub>2</sub>, …, <italic>s<sub>i</sub></italic>} is the classifier ensemble, and <italic>N</italic> is the amounts of classifiers.</p>
<p>Sugeno integral is calculated in the following:
<disp-formula id="FD5">
<label>(5)</label>
<mml:math id="mm6" display="block">
<mml:semantics id="sm6">
<mml:mrow>
<mml:msub>
<mml:mi>E</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>∫</mml:mi>
<mml:mi>A</mml:mi></mml:msub>
<mml:mrow>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>g</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo></mml:mrow></mml:mrow>
<mml:munder>
<mml:mrow>
<mml:mo>sup</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>∈</mml:mo>
<mml:mo stretchy="false">[</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>m</mml:mi>
<mml:mo stretchy="false">]</mml:mo></mml:mrow></mml:munder>
<mml:mtext mathvariant="italic">min</mml:mtext>
<mml:mo>{</mml:mo>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>g</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>}</mml:mo></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>When the member classifiers are more than two, the values of λ corresponding to each class is calculated as the following equation:
<disp-formula id="FD6">
<label>(6)</label>
<mml:math id="mm7" display="block">
<mml:semantics id="sm7">
<mml:mrow>
<mml:mi>λ</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>∏</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>N</mml:mi></mml:munderover>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>+</mml:mo>
<mml:mi>g</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mo>{</mml:mo>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>}</mml:mo>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>In multiple classifier combination, <italic>f<sub>j</sub></italic>(<italic>s<sub>i</sub></italic>) is defined as the probability of the <italic>i<sup>th</sup></italic> classifier to the <italic>j<sup>th</sup></italic> class in soft classification, or the accuracy of the <italic>i<sup>th</sup></italic> classifier to the <italic>j<sup>th</sup></italic> class in hard classification. We utilized the latter one, hard classification in this paper.</p>
<p>The final classification result is the class label corresponding to the maximum fuzzy integral value:
<disp-formula id="FD7">
<label>(7)</label>
<mml:math id="mm8" display="block">
<mml:semantics id="sm8">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo>arg</mml:mo>
<mml:munder>
<mml:mrow>
<mml:mo>max</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>M</mml:mi></mml:mrow></mml:munder>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>E</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>Moreover, Ben Adballah <italic>et al</italic>. proposed a local fuzzy integral approach, called context extraction for local fusion with fuzzy integral (CELF-FI) for fusing different classifiers outputs [<xref ref-type="bibr" rid="b60-sensors-12-04764">60</xref>]. This method based on a novel objective function that combines context identification and multi-algorithm fusion criteria into a joint objective function. This objective function consists of two terms: the first is designed to produce compact clusters, called contexts, and the second to produce Sugeno measures for fuzzy integral fusion for each context [<xref ref-type="bibr" rid="b60-sensors-12-04764">60</xref>].</p></sec>
<sec>
<title>Consensus Theory</title>
<p>Consensus theory is suitable for the combination of classifiers with probability to specific classes as their outputs [<xref ref-type="bibr" rid="b18-sensors-12-04764">18</xref>,<xref ref-type="bibr" rid="b19-sensors-12-04764">19</xref>]. Usually logarithmic and linear consensus models are used in the following two ways:
<disp-formula id="FD8">
<label>(8)</label>
<mml:math id="mm9" display="block">
<mml:semantics id="sm9">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>∏</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>N</mml:mi></mml:munderover>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo>∣</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>λ</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<disp-formula id="FD9">
<label>(9)</label>
<mml:math id="mm10" display="block">
<mml:semantics id="sm10">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>N</mml:mi></mml:munderover>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo>∣</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo>
<mml:msub>
<mml:mi>λ</mml:mi>
<mml:mtext mathvariant="italic">ij</mml:mtext></mml:msub></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>Each classifier is viewed as an expert with the membership or probability of the pixel belonging to every class as its output. <italic>T<sub>j</sub>(X)</italic> is the probability the pixel <italic>X</italic> being assigned to the <italic>j<sup>th</sup></italic> class, and p<italic><sub>i</sub> (C<sub>j</sub></italic>∣<italic>X)</italic> is the probability of pixel <italic>X</italic> being assigned to the <italic>j<sup>th</sup></italic> class by the <italic>i<sup>th</sup></italic> classifier, and λ<italic><sub>ij</sub></italic> is the weight. The simplest approach of the weighting scheme consists in giving all the individual outputs equal weights. Furthermore, the weights can be defined to not only weight the individual sources but also the individual classifiers [<xref ref-type="bibr" rid="b5-sensors-12-04764">5</xref>]. Thus, we select the specific class accuracy as the weights. It means that λ<italic><sub>ij</sub></italic> represents classification accuracy of the <italic>i<sup>th</sup></italic> classifier to the <italic>j<sup>th</sup></italic> class in this paper.</p>
<p>The final classification result is the class label corresponding to the maximum value of <italic>T<sub>j</sub>(X), j</italic> = 1,…, <italic>M</italic>:
<disp-formula id="FD10">
<label>(10)</label>
<mml:math id="mm11" display="block">
<mml:semantics id="sm11">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo>arg</mml:mo>
<mml:munder>
<mml:mrow>
<mml:mo>max</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>M</mml:mi></mml:mrow></mml:munder>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:semantics></mml:math></disp-formula></p></sec>
<sec>
<title>D-S Evidence Theory and its improvement</title>
<p>D-S evidence theory, which assigns probability to the sets, is able to handle the uncertainty caused by unknown factors [<xref ref-type="bibr" rid="b12-sensors-12-04764">12</xref>]. It uses discrimination framework, confidence function, likelihood function and probability allocation function to represent and process knowledge. Suppose that Θ = <italic>C</italic><sub>1</sub>, <italic>C<sub>2</sub></italic>,…, <italic>C<sub>i</sub></italic>… <italic>C<sub>M</sub></italic> is the discrimination framework and <italic>M</italic> is the number of classes, therefore basic probability allocation function <italic>m</italic> is a function from 2<sup>Θ</sup> to [0,1] and it meets the requirement of:
<disp-formula id="FD11">
<label>(11)</label>
<mml:math id="mm12" display="block">
<mml:semantics id="sm12">
<mml:mrow>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable columnalign="right">
<mml:mtr columnalign="right">
<mml:mtd columnalign="right">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>ϕ</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="right">
<mml:mtd columnalign="right">
<mml:mrow>
<mml:msub>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mo>⊆</mml:mo>
<mml:mi>Θ</mml:mi></mml:mrow></mml:msub>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>A</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>If there are two or more different evidences, orthogonal sum can be used to combine those evidences. Assume that <italic>Z</italic><sub>1</sub>, <italic>Z</italic><sub>2</sub>,…, <italic>Z<sub>n</sub></italic> are the probability allocation functions corresponding to evidence <italic>F</italic><sub>1</sub>, <italic>F<sub>2</sub></italic>,…, <italic>F<sub>n</sub></italic>, and their orthogonal sum <italic>Z</italic> = <italic>Z</italic><sub>1</sub> ⊕ <italic>Z<sub>2</sub></italic> … ⊕ <italic>Z<sub>n</sub></italic> is:
<disp-formula id="FD12">
<label>(12)</label>
<mml:math id="mm13" display="block">
<mml:semantics id="sm13">
<mml:mrow>
<mml:mi>Z</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>ϕ</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:semantics></mml:math></disp-formula>
<disp-formula id="FD13">
<label>(13)</label>
<mml:math id="mm14" display="block">
<mml:semantics id="sm14">
<mml:mrow>
<mml:mi>Z</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>A</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>K</mml:mi></mml:mfrac>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mo>∩</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:munder>
<mml:mrow>
<mml:munder>
<mml:mo>∏</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>≤</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi>n</mml:mi></mml:mrow></mml:munder>
<mml:mrow>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<disp-formula id="FD14">
<label>(14)</label>
<mml:math id="mm15" display="block">
<mml:semantics id="sm15">
<mml:mrow>
<mml:mi>K</mml:mi>
<mml:mo>=</mml:mo>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mo>∩</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>≠</mml:mo>
<mml:mi>Φ</mml:mi></mml:mrow></mml:munder>
<mml:mrow>
<mml:munder>
<mml:mo>∏</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>≤</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi>n</mml:mi></mml:mrow></mml:munder>
<mml:mrow>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>When various evidences are inconsistent or contradictory from each other, the result may be unreasonable. In order to alleviate this problem, an improved evidence theory algorithm was proposed and experimented in [<xref ref-type="bibr" rid="b61-sensors-12-04764">61</xref>], and it proved that the modified method was superior to traditional method while the evidences with high contradiction and inconsistency. For remote sensing image, different classifiers may generate different classified labels, resulting in the generation of evidence with high contradiction, so the improved evidence theory ensemble approach may improve the performance. The detailed equations are detailed as follows [<xref ref-type="bibr" rid="b61-sensors-12-04764">61</xref>]:
<disp-formula id="FD15">
<label>(15)</label>
<mml:math id="mm16" display="block">
<mml:semantics id="sm16">
<mml:mrow>
<mml:mi>Z</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>A</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>∈</mml:mo>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:munder>
<mml:mrow>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>…</mml:mo>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mi>n</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>n</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>+</mml:mo>
<mml:mi>K</mml:mi>
<mml:mo>×</mml:mo>
<mml:mo>∈</mml:mo></mml:mrow>
<mml:mo>×</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>n</mml:mi></mml:mfrac>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>n</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>A</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>where:
<disp-formula id="FD16">
<label>(16)</label>
<mml:math id="mm17" display="block">
<mml:semantics id="sm17">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>k</mml:mi>
<mml:mo>∼</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>2</mml:mn>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&lt;</mml:mo>
<mml:mi>j</mml:mi></mml:mrow></mml:munder>
<mml:mrow>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>∩</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>Φ</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>∈</mml:mo>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo>∈</mml:mo>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:munder>
<mml:mrow>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<disp-formula id="FD17">
<label>(17)</label>
<mml:math id="mm18" display="block">
<mml:semantics id="sm18">
<mml:mrow>
<mml:mo>∈</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo>exp</mml:mo></mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mover accent="true">
<mml:mi>k</mml:mi>
<mml:mo>∼</mml:mo></mml:mover></mml:mrow></mml:msup></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>ε</italic> is the confidence of evidence, <italic>k̃</italic> is the average of contradiction level between two evidences, and <italic>K</italic> is the total contradiction level of all evidences. This evidence combination method can reduce the limitations caused by evidence inconsistency. For multiple classifier combination of remote sensing, the result of each classifier can be viewed as a piece of evidence. Probability allocation function can be represented by the classification accuracy of specific class. For example, if a pixel belongs to the <italic>i<sup>th</sup></italic> class in a classifier, the basic probability is <italic>m</italic>(C<italic><sub>i</sub></italic>) = <italic>P<sub>i</sub>, m</italic>(Θ) = 1 − <italic>P<sub>i</sub></italic>, where <italic>P<sub>i</sub></italic> is the accuracy of the <italic>i<sup>th</sup></italic> class by the specific classifier. After the completion of evidence combination, the class with maximum evidence is selected as the final result.</p></sec>
<sec>
<title>Dynamic Classifier Selection [<xref ref-type="bibr" rid="b9-sensors-12-04764">9</xref>,<xref ref-type="bibr" rid="b62-sensors-12-04764">62</xref>]</title>
<p>Smits applied dynamic classifier selection and improved models to multiple classifier combination for remote sensing images [<xref ref-type="bibr" rid="b9-sensors-12-04764">9</xref>]. In this method, MCS are addressed that use estimates of each individual classifier's local accuracy in small regions of feature space surrounding an unknown test sample [<xref ref-type="bibr" rid="b9-sensors-12-04764">9</xref>]. The local regions are defined by the <italic>k</italic>—nearest neighbors (<italic>KNN</italic>). The drawback of this approach is the section of <italic>k</italic>, which is rather heuristic and subjective an exercise, and the classification result is sensitive to the choice of <italic>k</italic> [<xref ref-type="bibr" rid="b9-sensors-12-04764">9</xref>,<xref ref-type="bibr" rid="b63-sensors-12-04764">63</xref>,<xref ref-type="bibr" rid="b64-sensors-12-04764">64</xref>].</p>
<p>The concept of local accuracy assumes that classes maintain a certain continuity in the feature space and neighboring pixels are expected to maintain a stronger relationship than pixels that are further away [<xref ref-type="bibr" rid="b9-sensors-12-04764">9</xref>]. Dudani suggested that using distance-weighted <italic>k</italic>—NN can improve performance, especially with small training sample [<xref ref-type="bibr" rid="b65-sensors-12-04764">65</xref>]. Suppose that <italic>d<sub>k</sub>, d<sub>t</sub>, d<sub>m</sub></italic> are the distances of the <italic>k<sup>th</sup>, t<sup>th</sup></italic> and <italic>m<sup>th</sup></italic> samples to the pixel in the decreasing sort, the weight of the <italic>t<sup>th</sup></italic> sample is:
<disp-formula id="FD18">
<label>(18)</label>
<mml:math id="mm19" display="block">
<mml:semantics id="sm19">
<mml:mrow>
<mml:msub>
<mml:mi>ω</mml:mi>
<mml:mi>t</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>t</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>≠</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mn>1</mml:mn></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mtext>otherwise</mml:mtext></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>The local classification accuracy of the <italic>j</italic><sup>th</sup> classifier is:
<disp-formula id="FD19">
<label>(19)</label>
<mml:math id="mm20" display="block">
<mml:semantics id="sm20">
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msup>
<mml:mi>L</mml:mi>
<mml:mi>j</mml:mi></mml:msup>
<mml:mo>=</mml:mo>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>k</mml:mi></mml:mfrac>
<mml:mo>⋅</mml:mo>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>θ</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi>t</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi>t</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:munder>
<mml:mrow>
<mml:msub>
<mml:mi>ω</mml:mi>
<mml:mi>t</mml:mi></mml:msub>
<mml:mo>,</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>≤</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi>k</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>θ(X<sub>t</sub>)</italic> is the class label of the <italic>t<sup>th</sup></italic> neighboring sample, <italic>C<sub>j</sub>(X<sub>t</sub>)</italic> is the assigned class label of the <italic>j<sup>th</sup></italic> classifier to the <italic>t<sup>th</sup></italic> sample. Finally, the label of the classifier with the biggest local accuracy is selected as the final classification result.</p></sec>
<sec sec-type="methods">
<title>Differences between these methods</title>
<p>In addition, according to the individual output, classifier ensemble approach in a parallel combination can be divided into three levels: abstract level, rank level and measurement level [<xref ref-type="bibr" rid="b66-sensors-12-04764">66</xref>]. The abstract level combination methods are applied when each classifiers outputs a unique label [<xref ref-type="bibr" rid="b66-sensors-12-04764">66</xref>]. Rank level makes use of a ranked list of classes where ranking is based on decreasing likelihood. In the measurement level, probability values of the classes provided by the each classifier are used in the combination. Therefore, majority/weighted vote, fuzzy integral, evidence theory and dynamic classifier selection belong to the abstract level combination methods, while others are measurement level methods. <xref ref-type="table" rid="t1-sensors-12-04764">Table 1</xref> summarizes the classifier ensemble approaches in parallel combination. Weighted vote, fuzzy integral, Dempster–Shafer evidence theory and consensus theory require validation set to calculate the weights while other methods do not. Dynamic classifier selection calculates the distance between the samples so it requires the original image. And the computation time of dynamic classifier selection is more expensive than other approaches.</p></sec></sec>
<sec>
<title>2.4. Concatenation Combination</title>
<p>In the original concatenation combination approach, the classification result generated by a classifier is used as an input into the next classifier [<xref ref-type="bibr" rid="b31-sensors-12-04764">31</xref>]. The results obtained through each classifier are similarly passed onto the next classifier until a result is obtained through the final classifier in the chain, which can be clearly shown in <xref ref-type="fig" rid="f1-sensors-12-04764">Figure 1(a)</xref>. In this structure, decisions could be first made by a subset of the committee classifiers and their conclusions are then passed to another subset of classifiers. The construction of this algorithm is usually required to improve the accuracy of the entire system and to incorporate error checks [<xref ref-type="bibr" rid="b31-sensors-12-04764">31</xref>]. Two potential ways to improve concatenation combination are enhancing the input diversity of the final combiner, and selecting an advanced combination algorithm.</p></sec>
<sec>
<title>2.5. Modified Concatenation Combination Approaches</title>
<p>On the basis of the previous improvement ideas, we propose three modified concatenation combination approaches of MCS in this section. The overall architecture is given in <xref ref-type="fig" rid="f3-sensors-12-04764">Figure 3</xref>. All the proposed schemes contain two training stages. Among the three methods, SVM classifier is selected as the second classifier because of its satisfactory capability to deal with the classification problem [<xref ref-type="bibr" rid="b67-sensors-12-04764">67</xref>,<xref ref-type="bibr" rid="b68-sensors-12-04764">68</xref>].</p>
<p>Proposed method (1) is the original concatenation combination approach. Multi-layer perceptron neural network (MLPNN) is assigned as the first level classifier. The class probabilities generated by MLPNN are defined as the input of the second level. In order to enhance the diversity in concatenation combination approach, the class probabilities generated by the several classifiers are combined as the input in method (2). The class probabilities generated by MLPNN, Radial Basis Function neural network (RBFNN), C4.5 decision tree (DT), Minimum Mahalanobis distance classifier (MMDC) and SVM are treated as the input to train the new SVM classifier. Similar to method (2), method (3) adopts the input derived from the different source to enhance the diversity. Original remote sensing image and the class probabilities produced by MLPNN are defined as the input of the second level in method (3).</p></sec>
<sec>
<title>2.6. Diversity Measures</title>
<p>Kappa statistics, double fault, entropy and WCEC are used in this paper, and a novel measure is proposed for remote sensing based on the improvements in general MCS.</p>
<sec>
<title>Kappa statistic</title>
<p>This measure is calculated by the following equation [<xref ref-type="bibr" rid="b29-sensors-12-04764">29</xref>]:
<disp-formula id="FD20">
<label>(20)</label>
<mml:math id="mm21" display="block">
<mml:semantics id="sm21">
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mo stretchy="false">(</mml:mo>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mn>11</mml:mn></mml:mrow></mml:msup>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mn>00</mml:mn></mml:mrow></mml:msup>
<mml:mo>−</mml:mo>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow></mml:msup>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mn>01</mml:mn></mml:mrow></mml:msup>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:msub>
<mml:mi>q</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:msub>
<mml:mi>q</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>N</italic><sup>11</sup> represents the samples correctly classified by both classifiers, <italic>N</italic><sup>10</sup> is the samples correctly classified by the first classifier and incorrectly by the second classifier, <italic>N</italic><sup>01</sup> represents the samples incorrectly classified by the first classifier and correctly by the second classifier, <italic>N</italic><sup>00</sup> is the samples incorrectly classified by both classifier. <italic>p</italic><sub>1</sub> represents the samples correctly classified by the first classifier (it equals <italic>N</italic><sup>11</sup> plus <italic>N</italic><sup>10</sup>), <italic>p</italic><sub>2</sub> represents the samples correctly classified by the second classifier (it equals <italic>N</italic><sup>11</sup> plus <italic>N</italic><sup>01</sup>), <italic>q</italic><sub>1</sub> represents the samples incorrectly classified by the first classifier (it equals <italic>N</italic><sup>01</sup> plus <italic>N<sup>00</sup>), q<sub>2</sub></italic> represents the samples incorrectly classified by the second classifier (it equals <italic>N</italic><sup>10</sup> plus <italic>N</italic><sup>00</sup>).</p>
<p>It not only gives a measure of the degree of agreement but also has a test associated with itself to check if the apparent agreement can be attributed to chance only In addition, we also use a new kappa statistic <italic>k</italic>′ and the final value is the average of the class kappa statistic.</p></sec>
<sec>
<title>Double fault</title>
<p>Gacinto and Roli proposed the double-fault measure to select classifiers that are least related from a pool of classifiers. This diversity decreases when the value of the double-fault measure increases [<xref ref-type="bibr" rid="b55-sensors-12-04764">55</xref>]. This index is calculated by:
<disp-formula id="FD21">
<label>(21)</label>
<mml:math id="mm22" display="block">
<mml:semantics id="sm22">
<mml:mrow>
<mml:mi>D</mml:mi>
<mml:mi>F</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mn>00</mml:mn></mml:mrow></mml:msup></mml:mrow>
<mml:mi>N</mml:mi></mml:mfrac></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>N</italic> represents the number of sample.</p></sec>
<sec>
<title>Entropy</title>
<p>Entropy is calculated in the following [<xref ref-type="bibr" rid="b27-sensors-12-04764">27</xref>]:
<disp-formula id="FD22">
<label>(22)</label>
<mml:math id="mm23" display="block">
<mml:semantics id="sm23">
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>N</mml:mi></mml:mfrac>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>N</mml:mi></mml:munderover>
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>−</mml:mo>
<mml:mrow>
<mml:mo>⌈</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn></mml:mrow></mml:mrow>
<mml:mo>⌉</mml:mo></mml:mrow></mml:mrow></mml:mfrac>
<mml:mtext mathvariant="italic">min</mml:mtext>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mtext mathvariant="bold-italic">z</mml:mtext>
<mml:mi>m</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>L</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>l</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mtext mathvariant="bold-italic">z</mml:mtext>
<mml:mi>m</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo>}</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>L</italic> is the number of classifiers, <italic>N</italic> is the number of sample, ⌈⌉ rounds a number to the nearest integer. <italic>l</italic>(<bold><italic>z</italic></bold><italic><sub>m</sub></italic>) is the number of samples correctly classified by the classifiers <bold>z</bold>.</p>
<p>This measure is based on the assumption: if the results of ⌈<italic>L/2</italic>⌉ is the same, the diversity has the biggest value. The range of this measure is 0 to 1, in which 1 represents the biggest diversity value and 0 represents the lowest diversity value.</p></sec>
<sec>
<title>Weighted count of errors and correct (WCEC)</title>
<p><italic>WCEC</italic> is also proposed by [<xref ref-type="bibr" rid="b46-sensors-12-04764">46</xref>]. Compared to kappa statistics, WCEC not only gives a measure of the correctness of classifiers but also consider the diversity between two classifiers.
<disp-formula id="FD23">
<label>(23)</label>
<mml:math id="mm24" display="block">
<mml:semantics id="sm24">
<mml:mrow>
<mml:mtext mathvariant="italic">WCEC</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mn>11</mml:mn></mml:mrow></mml:msup>
<mml:mo>+</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mn>01</mml:mn></mml:mrow></mml:msup>
<mml:mo>+</mml:mo>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow></mml:msup></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo>−</mml:mo>
<mml:msubsup>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mtext mathvariant="italic">diff</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mn>00</mml:mn></mml:mrow></mml:msubsup>
<mml:mo>−</mml:mo>
<mml:mn>5</mml:mn>
<mml:msubsup>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mtext mathvariant="italic">same</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mn>00</mml:mn></mml:mrow></mml:msubsup></mml:mrow>
<mml:mi>N</mml:mi></mml:mfrac></mml:mrow></mml:semantics></mml:math></disp-formula>where 
<inline-formula>
<mml:math id="mm25" display="inline">
<mml:semantics id="sm25">
<mml:mrow>
<mml:msubsup>
<mml:mtext mathvariant="italic">N</mml:mtext>
<mml:mrow>
<mml:mtext mathvariant="italic">diff</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mn>00</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> represents the different errors occurring in both classifiers, 
<inline-formula>
<mml:math id="mm26" display="inline">
<mml:semantics id="sm26">
<mml:mrow>
<mml:msubsup>
<mml:mtext mathvariant="italic">N</mml:mtext>
<mml:mrow>
<mml:mtext mathvariant="italic">same</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mn>00</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> represents the same errors occurring in both classifiers.</p></sec>
<sec>
<title>Disagreement-accuracy measure</title>
<p>Combining similar classifiers does not make too much sense as the results will not be improved [<xref ref-type="bibr" rid="b31-sensors-12-04764">31</xref>]. If the sample is classified correctly or incorrectly in the classifiers, the final result is not improved. In order to consider both the classification accuracy and disagreement among the classification results comprehensively, we propose a new diversity measure, namely, disagreement and accuracy measure (<italic>DA</italic>). Firstly, we have made the classification results in descending order according to the overall accuracy. Then <italic>DA</italic> was calculated by the contribution of the classifiers to each sample. It can be defined as the following:
<disp-formula id="FD24">
<label>(24)</label>
<mml:math id="mm27" display="block">
<mml:semantics id="sm27">
<mml:mrow>
<mml:mi>D</mml:mi>
<mml:mi>A</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>N</mml:mi></mml:mfrac>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>N</mml:mi></mml:munderover>
<mml:mrow>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>L</mml:mi></mml:munderover>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mn>2</mml:mn>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>N</italic> is total number of samples, <italic>L</italic> is the number of classifiers, <italic>P<sub>i</sub></italic>(<bold>z</bold><italic><sub>j</sub></italic>) is the specific class accuracy of correctly classified sample <italic>i</italic> by the classifier <bold>z</bold><italic><sub>j</sub></italic>. This measure highlighted the contribution of classifier with high overall accuracy to the sample. That means if a sample is classified correctly by the classifier which has the highest overall accuracy, the contribution of the subsequent classifier to this sample should be decreased.</p></sec></sec></sec>
<sec sec-type="discussion">
<title>3. Experiments and Discussions</title>
<p>In order to evaluate the effectiveness and performance of MCS to remote sensing image classification, the aforementioned existing and modified methods are experimented in the three types of remote sensing images: high spatial resolution QuickBird image, airborne hyperspectral image, and medium-resolution multispectral Landsat ETM+ image.</p>
<p>The classifiers used in an ensemble should generally be accurate but different. Based on the survey to existing pixel-based classification algorithms by [<xref ref-type="bibr" rid="b3-sensors-12-04764">3</xref>] and many suggestions on the selection of classifiers by [<xref ref-type="bibr" rid="b7-sensors-12-04764">7</xref>], we have chosen nine classifiers to be used, including Maximum Likelihood Classifier (MLC), Minimum Euclidean Distance Classifier (MEDC), Minimum Mahalanobis distance classifier (MMDC), support vector machine (SVM), MLP neural network (MLPNN), RBF neural network (RBFNN), spectral angle mapper (SAM), J48(C4.5) decision tree classifier (DTC) and simplified fuzzy H-ARTMAP (SFH-ARTMAP). The descriptions of classifiers can be detailed in [<xref ref-type="bibr" rid="b68-sensors-12-04764">68</xref>–<xref ref-type="bibr" rid="b75-sensors-12-04764">75</xref>].</p>
<p>MLC, MDC, SAM, SVM in ENVI 4.5 and Bagging, AdaBoost, J48DT, MLPNN, RBFNN in Weka 3.6.3 software are used, while SFH-AREMAP is implemented in MATLAB 6.5 software [<xref ref-type="bibr" rid="b72-sensors-12-04764">72</xref>]. The other classifier ensemble approaches and diversity measures are all implemented in ENVI+IDL software.</p>
<p>In this paper, we mainly focus on the final results of MCS, so we adopt the default parameters in the above classifiers. The iterations of AdaBoost and Bagging are 10. The different classifier ensemble algorithms are experimented in the three typical images, but the diversity measures are only experimented on QuickBird image.</p>
<sec>
<title>3.1. Experiment 1: High Spatial Resolution Image Classification</title>
<p>QuickBird multispectral image (four bands of multi-spectral, spatial resolution: 2.44 m) is used as the example of high spatial resolution remote sensing image. The image size is 500 × 500. The study area is located in the northern suburb of Xuzhou city along the urban and rural connecting areas. Through the image analysis and field works, the image is classified into five land cover types: water, built-up area, green areas, vegetation and barren soil. <xref ref-type="fig" rid="f4-sensors-12-04764">Figure 4</xref> is the false color composite image (R: Band 4, G: Band 3, B: Band 2) of the study area.</p>
<p>In addition to original multispectral image, textural features extracted from the Band 4 which has the richest information are also added for classification. Mean and variance extracted by gray level co-occurrence matrix (GLCM) are used to describe the textural features. Two schemes are used for classification. The first scheme is only using the spectral features of four multispectral bands as the inputs, and the second scheme is using both four spectral features and two textural features as the inputs. Seven classifiers are used, including: MMDC, MLC, SVM, MLPNN, RBFNN, J48 DT, and SFH-ARTMAP. The accuracies of all classifiers using only spectral features and spectral together with textural features are listed in <xref ref-type="table" rid="t2-sensors-12-04764">Tables 2</xref> and <xref ref-type="table" rid="t3-sensors-12-04764">3</xref>. The classification result can be seen in <xref ref-type="fig" rid="f5-sensors-12-04764">Figures 5</xref> and <xref ref-type="fig" rid="f6-sensors-12-04764">6</xref>.</p>
<p>Comparing the test areas, the accuracy assessment demonstrates that all the classifiers perform very accurately and achieve overall accuracies of 85.92% (MMDC), 92.78% (MLC), 93.41% (SVM), 93.49% (MLP), 92.83% (RBFNN), 92.42% (J48 DTC) and 93.15% (SFH-ARTMAP) using spectral and texture features (seen <xref ref-type="table" rid="t3-sensors-12-04764">Table 3</xref>). As expected, the kappa values of the classification results using spectral and texture features are slightly higher than the ones only using spectral features. The specific accuracies of built-up areas of MMDC, MLC, MLPNN and SFH-ARTMAP are significantly lower (less than 79%). Using spectral and texture features together, most obvious is that the accuracies of built-up areas are increased by up to 7.8%, 8.14%, 8.72%, 9.44% of MMDC, MLC, MLPNN and SFH-ARTMAP, respectively. Furthermore, some other findings can be summarized as follows:
<list list-type="order">
<list-item>
<p>When textural features are introduced as the input of the classifiers, both the overall and class accuracies are enhanced in the most cases, especially for built-up areas, demonstrating the superiority of incorporating textural features in the classification process.</p></list-item>
<list-item>
<p>The classifiers have shown the different performance on the specific classes, indicating that the classifier performing well for one class may be poor for other classes. For instance, when only using spectral feature, MMDC performs weaker on the land cover type of “built-up areas” (48.48%) but SVM achieve the accuracy of 97.51% (<xref ref-type="table" rid="t2-sensors-12-04764">Table 2</xref>). The situation for spectral and texture features is similar to the one for spectral feature. The highlights in <xref ref-type="table" rid="t2-sensors-12-04764">Tables 2</xref> and <xref ref-type="table" rid="t3-sensors-12-04764">3</xref> are the highest accuracy of class among the classifiers. From the above analysis, it is necessary to combine multiple classifiers to find a better result than any individual classifiers. In the following experiments on QuickBird image, the classification results derived from both spectral and texture features are adopted as the individual classification results.</p></list-item></list></p>
<p>For Bagging and AdaBoost, J48 DTC, RBFNN and MLPNN are used as the base classifiers, and the accuracies are summarized in <xref ref-type="table" rid="t4-sensors-12-04764">Table 4</xref>. Both Bagging and AdaBoost obtain very accurately results and achieve very high overall accuracies (94.93% and 95.67% for J48 DTC, 95.09% and 95.43% for MLPNN, 93.36% and 95.35% for RBFNN). On the other hand, the accuracies of Bagging and AdaBoost increased by 0.89% to 1.96% and 1.94% to 2.25% for the three base classifiers. The classification maps can be seen in <xref ref-type="fig" rid="f7-sensors-12-04764">Figures 7</xref>–<xref ref-type="fig" rid="f9-sensors-12-04764">9</xref>. The theories of Bagging illustrate that it shows excellent performance when the base classifier is an unstable classifier such as decision tree, neural networks <italic>et al</italic>. [<xref ref-type="bibr" rid="b16-sensors-12-04764">16</xref>]. In most cases, the performance of AdaBoost is superior to Bagging. Our experimental results are compatible with the theoretical analysis. In addition, we also studied the influence of bootstrapped sample size to classification accuracy. Here, we choose four sample sizes: 25%, 50%, 75% and 100%. The general trend is that the overall accuracies increase slightly (less than 1%) when the sampling rate rise, although the accuracy of RBFNN is a bit unsteady under different sampling rate. Perhaps the reason is that the iterations of RBFNN are not enough (only 10 in the experiment). The results indicate that Bagging and AdaBoost outperform the base classifier in terms of overall accuracy.</p>
<p>In order to illustrate the effectiveness and universality of MCS, we choose different member classifiers applied to different parallel combination rules. For the parallel combination methods such as Bayesian average, logarithmic consensus and linear consensus, the member classifiers are MLC, SVM, MLPNN, RBFNN and SFH-ARTMAP. The member classifiers for others are SVM, MLPNN, RBFNN, MEDC and J48DT. <xref ref-type="fig" rid="f9-sensors-12-04764">Figure 9</xref> presents the classification results of MCS on parallel combination, and their accuracy statistics are shown in <xref ref-type="table" rid="t5-sensors-12-04764">Table 5</xref>. Among the individual classifiers, SVM classifier achieves the highest accuracy of 93.49%. Compared to SVM classifier, Bayesian average achieves the accuracy of 94.99%, with the improvement of 1.5%; Majority vote and fuzzy integral improve the accuracy from 93.49% to 94.74% and 93.49% to 94.37%, respectively. Other parallel combination approaches improve the accuracy slightly with the improvement of 0.28% to 1%. The “local” ensemble method (DWDCS) results in 93.98% accuracy with the improvement of 0.49%. Compared with the overall accuracies, the kappa coefficients show similar characteristics. Bayesian average obtains the highest accuracy for all parallel combination schemes.</p>
<p>Unlike other combination algorithms, DWDCS pays attention to the local optimum while other combination algorithms achieve the global optimum. The computation time of DWDCS is proportional to the test sample size and the nearest neighbors. In our experiments, the test sample size is more than 500 and the nearest neighbor is selected to be 15. The computation time of DWDCS is more than 10 min and other parallel combination strategies are less than 20 s.</p>
<p><xref ref-type="fig" rid="f10-sensors-12-04764">Figure 10</xref> presents the classification result of MCS based on concatenation combination and <xref ref-type="table" rid="t6-sensors-12-04764">Table 6</xref> shows the accuracy statistics. The accuracy assessment indicated that all the proposed concatenation combination algorithms perform very accurately and obtain overall accuracies of 95.21% (method 1), 97.06% (method 2) and 95.05% (method 3). Among all the classifier ensemble approaches in MCS, the proposed method 2 obtains the highest accuracy (97.06%) due to the following reasons:
<list list-type="order">
<list-item>
<p>This proposed method adopts the probabilities as the input of the second level and there is obvious diversity among these probabilities;</p></list-item>
<list-item>
<p>The output distributions of different classifier are different, for instance, the probability distribution of SVM and MLP are focused on 0 or 1, while the probability distribution of MEDC is ranged from 0.45 to 0.55. The linear combination rules such as linear consensus are not suited, and SVM can tackle non-linear problems to a good extent. Therefore, it can achieve the highest accuracy compared with others.</p></list-item></list></p>
<p>In the experiment of diversity measures, we choose five member classifiers from the classifier pool to be combined based on diversity measure. The classifiers in the pool are MLPNN (1), SVM (2), SFH-ARTMAP (3), RBFNN (4), MLC (5), J48DT (6), MMDC (7). Thus, there are twenty-one selection schemes of five classifiers. We have calculated the diversity of the all combination. The optimum combination corresponding to the diversity measure can be seen in <xref ref-type="table" rid="t7-sensors-12-04764">Table 7</xref>. Then, we adopted the different parallel combination approaches (majority voting, weighted voting, fuzzy integral, DS evidence theory) to evaluate the performance of the combination of classifiers selected by diversity measures.</p>
<p>From <xref ref-type="table" rid="t7-sensors-12-04764">Table 7</xref>, <italic>k, k</italic>′, <italic>entropy</italic> and <italic>WCEC</italic> diversity measures selected the same combination, which contains the classifiers: MLPNN, SVM, SFH-ARTMAP, J48DT, MMDC. <italic>DF</italic> selected the combination of MLPNN, SFH-ARTMAP, RBFNN, J48 DTC and MMDC. MLPNN, SFH-ARTMAP, RBFNN, MLC, J48DTC and MMDC were chosen by <italic>DA</italic>. The overall accuracies of the combination selected by <italic>DA</italic> are 95.52% (majority vote), 95.39% (weighted vote), 95.40% (fuzzy integral) and 95.36% (DS evidence theory). In contrast, the combination selected by <italic>DF</italic> results in the accuracies of 95.05% (majority vote), 94.79% (weighted vote), 94.76% (fuzzy integral) and 94.86% (DS evidence theory). The combination selected by other four measures achieves the accuracies of 94.83% (majority vote), 93.54% (weighted vote), 93.77% (fuzzy integral) and 93.55% (DS evidence theory). Comparison of these results with the accuracy assessment indicates that the combination selected by <italic>DA</italic> outperform the ones selected by other diversity measures in terms of overall accuracy. It demonstrates that the proposed diversity measure is a positive guidance for classifier combination. Furthermore, we ranked all the combination according to the <italic>DA</italic> measure in <xref ref-type="table" rid="t8-sensors-12-04764">Table 8</xref> and the highlights are the first three ones, whose accuracy produced by the parallel combination approaches can be seen in <xref ref-type="table" rid="t7-sensors-12-04764">Table 7</xref>.</p></sec>
<sec>
<title>3.2. Experiment 2: OMISII Hyperspectral Remote Sensing Image</title>
<p>Original airborne OMISII hyperspectral remote sensing image has 64 bands with spectral range: 0.45–1.09 <italic>μm</italic>. The spectral resolution is 10 <italic>nm</italic>. In this paper, Zhongguangcun, a high-tech zone of Beijing City is chosen as the study area. The five noisy bands are removed so it is remaining 59 bands for classification. The image size is 400 × 400. <xref ref-type="fig" rid="f11-sensors-12-04764">Figure 11(a)</xref> is the false color composite of the image by using Band 27, 25 and 2 as R, G and B components. Training samples and test samples are selected independently from the image. The land cover is classified into five classes: water, building, vegetation, forest and bare soil. From <xref ref-type="table" rid="t9-sensors-12-04764">Table 9</xref>, SVM classifier has the highest classification accuracy of forest (92.69%) and bare soil (92.97%); MLPNN classifier produces the highest classification accuracy of vegetation (98.10%) and building (98.34%); RBFNN generates the highest classification accuracy of water (95.72%). These results indicate the diversity among classifiers, which is similar to the experiment on QuickBird image. MLPNN achieves the highest overall accuracy of 92.53%. <xref ref-type="fig" rid="f11-sensors-12-04764">Figure 11(b-e)</xref> shows the classification results of SVM, J48 DTC, RBFNN and MLPNN respectively. Based on diversity measure among all classifier combination schemes, MMDC, MLPNN and SVM classifier are selected to generate the final classification.</p>
<p>From <xref ref-type="table" rid="t10-sensors-12-04764">Table 10</xref>, both AdaBoost and Bagging can enhance the classification accuracy of the base classifier. Bagging can improve the accuracy from 87.86% to 90.44% for J48 DTC, 89.46% to 89.88% for RBFNN and 92.53% to 93.09% for MLPNN. In contrast, AdaBoost performs a bit better (91% for J48 DTC, 90.65% for RBFNN and 93.27 for MLPNN) than bagging when the same base classifier is used. <xref ref-type="fig" rid="f12-sensors-12-04764">Figure 12</xref> are the classification results based on AdaBoost and Bagging using RBFNN, MLPNN, J48 DTC as the base classifiers. In classification ensemble based on parallel/concatenation combination, both improved D-S evidence theory and concatenation combination (proposed method 1) can obtain very accurately classification result with the accuracies of 92.65% and 93.58% than the best individual classifier (92.53%). This demonstrates that these methods are effectively to improve the accuracy. <xref ref-type="fig" rid="f13-sensors-12-04764">Figure 13(a)</xref> is the result of improved D-S evidence theory using MMDC, MLPNN and SVM as member classifiers. <xref ref-type="fig" rid="f13-sensors-12-04764">Figure 13(b)</xref> is the result of concatenation combination (proposed method 1).</p></sec>
<sec>
<title>3.3. Experiment 3: Medium Resolution Multi-Spectral Landsat ETM+ Image</title>
<p>A Landsat ETM+ image acquired on September 17th 2004 is used to validate the above methods. The used bands of Landsat ETM+ images are as follows: band 1: Blue band; band 2: Green band; band 3: Red band; band 4: NIR band; band 5: MIR band, band 7: LIR band. The image size is 500 × 500. The study area is the central part of Xuzhou City, China. Through the image analysis and field works, the land cover is classified into five classes: water, farmland, woodland, built-up land, and public green space. We use five member classifiers: RBFNN, J48DT, MMDC, MLC and MEDC. The adopted classifier combination methods are majority voting, weighted voting, D-S evidence theory, and fuzzy integral.</p>
<p><xref ref-type="table" rid="t11-sensors-12-04764">Table 11</xref> summarizes the accuracies of those classification results. MLC achieves the highest accuracy (91.19%). All the four classifier ensemble methods obtain higher overall accuracies than MLC classifier. The accuracy of majority vote increases from 91.19% to 93.55%. Weighted vote achieves the accuracy of 93.55%, with the improvement 2.36%. The accuracy is increased by up to 2.23% and 1.8% by DS evidence theory and fuzzy integral approaches. This indicates that these methods effectively improve the accuracy. <xref ref-type="fig" rid="f14-sensors-12-04764">Figure 14</xref> presents the classification results of different classifiers and classifier combination strategies.</p></sec>
<sec>
<title>3.4. Discussion</title>
<p>Based on the above analysis, we identified that the incorporation of diversity among the classifiers is of great importance in order to obtain better classification results via a MCS [<xref ref-type="bibr" rid="b31-sensors-12-04764">31</xref>]. Combining the similar classification results would not further improve the overall accuracy. Bagging uses bootstrap sampling to generate diversity [<xref ref-type="bibr" rid="b16-sensors-12-04764">16</xref>]. In order to enhance the diversity, concatenation combination constructs a proper dataflow so that diverse decisions could be first made by a subset of the committee classifiers and their conclusions are then set to another subset of classifiers. It is worthy to point out that the most important issue to combine the classifiers in parallel combination method is to examine the difference among them using diversity measures. In addition, the selection of an appropriate ensemble strategy is also an important issue of MCS. This paper presents a comprehensive list of the most common parallel combination approaches and also analyzes the situation in which each of them are most useful. We also experimentally prove the effectiveness of the three modified concatenation combination algorithms within certain images. Furthermore, we studied the diversity measures to select the optimum combination. Compared to other diversity measures, the proposed diversity measure considers both the difference and accuracy among the classifiers. Thus, it can select the optimum combination whose accuracy is higher than the selected ones by other measures.</p></sec></sec>
<sec sec-type="conclusions">
<title>4. Conclusions</title>
<p>This review discusses the intensive contributions of MCS-based work in remote sensing. We present a comprehensive review of the most popular approaches and analyze the situations in which each of them is most useful. Those algorithms are experimented with three typical remotely sensed images, high resolution QuickBird image, airborne hyperspectral OMISII image, and medium Landsat ETM+ image. Some modified algorithms are proposed based on the ideas of improving MCS performance, and we experimentally proved the usefulness of these proposed methods.</p>
<p>Most of the findings from this paper show that both concatenation and parallel combination can enhance classification accuracy, but their performances are affected by different factors such as selected member classifiers, classifier combination criterion, <italic>etc.</italic> Furthermore, according to our experimental results, diversity measures can play active guidance for the selection of multiple classifiers combination.</p></sec></body>
<back>
<ack>
<p>The authors would like to thank the anonymous reviewers for their valuable comments. The authors thank the support to this research from the National Natural Science Foundation of China (40871195, 41171323), Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD) and Natural Science Foundation of Jiangsu Province, China (BK2010182).</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-sensors-12-04764"><label>1.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Navalgund</surname><given-names>R.R.</given-names></name><name><surname>Jayaraman</surname><given-names>V.</given-names></name><name><surname>Roy</surname><given-names>P.S.</given-names></name></person-group><article-title>Remote sensing applications: An overview</article-title><source>Current</source><year>2007</year><volume>93</volume><fpage>1747</fpage><lpage>1766</lpage></citation></ref>
<ref id="b2-sensors-12-04764"><label>2.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>Y.</given-names></name></person-group><article-title>Ten years of technology advancement in remote sensing and the research in the CRC-AGIP lab in GCE</article-title><source>Geomatica</source><year>2010</year><volume>64</volume><fpage>173</fpage><lpage>189</lpage></citation></ref>
<ref id="b3-sensors-12-04764"><label>3.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname><given-names>D.</given-names></name><name><surname>Weng</surname><given-names>Q.</given-names></name></person-group><article-title>A survey of image classification methods and techniques for improving classification performance</article-title><source>Int. J. Remote Sens.</source><year>2007</year><volume>28</volume><fpage>823</fpage><lpage>870</lpage><pub-id pub-id-type="doi">10.1080/01431160600746456</pub-id></citation></ref>
<ref id="b4-sensors-12-04764"><label>4.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Briem</surname><given-names>G.</given-names></name><name><surname>Benediktsson</surname><given-names>J.</given-names></name><name><surname>Sveinsson</surname><given-names>J.</given-names></name></person-group><article-title>Multiple classifiers applied to multisource remote sensing data</article-title><source>IEEE Trans. Geosci. Remote Sens.</source><year>2002</year><volume>40</volume><fpage>2291</fpage><lpage>2299</lpage><pub-id pub-id-type="doi">10.1109/TGRS.2002.802476</pub-id></citation></ref>
<ref id="b5-sensors-12-04764"><label>5.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benediktsson</surname><given-names>J.A.</given-names></name><name><surname>Chanussot</surname><given-names>J.</given-names></name><name><surname>Fauvel</surname><given-names>M.</given-names></name></person-group><article-title>Multiple classifier systems in remote sensing: From basics to recent developments</article-title><source>Mult. Classif. Syst.</source><year>2007</year><volume>4472</volume><fpage>501</fpage><lpage>512</lpage></citation></ref>
<ref id="b6-sensors-12-04764"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Steele</surname><given-names>B.M.</given-names></name></person-group><article-title>Combining multiple classifiers: An application using spatial and remotely sensed information for land cover type mapping</article-title><source>Remote Sens. Environ.</source><year>2000</year><volume>74</volume><fpage>545</fpage><lpage>556</lpage><pub-id pub-id-type="doi">10.1016/S0034-4257(00)00145-0</pub-id></citation></ref>
<ref id="b7-sensors-12-04764"><label>7.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foody</surname><given-names>G.M.</given-names></name><name><surname>Boyd</surname><given-names>D.S.</given-names></name><name><surname>Sanchez-Hernandez</surname><given-names>C.</given-names></name></person-group><article-title>Mapping a specific class with an ensemble of classifiers</article-title><source>Int. J. Remote Sens.</source><year>2007</year><volume>28</volume><fpage>1733</fpage><lpage>1746</lpage><pub-id pub-id-type="doi">10.1080/01431160600962566</pub-id></citation></ref>
<ref id="b8-sensors-12-04764"><label>8.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Doan</surname><given-names>H.T.X.</given-names></name><name><surname>Foody</surname><given-names>G.M.</given-names></name></person-group><article-title>Increasing soft classification accuracy through the use of an ensemble of classifiers</article-title><source>Int. J. Remote Sens.</source><year>2007</year><volume>28</volume><fpage>4609</fpage><lpage>4623</lpage><pub-id pub-id-type="doi">10.1080/01431160701244872</pub-id></citation></ref>
<ref id="b9-sensors-12-04764"><label>9.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smits</surname><given-names>P.C.</given-names></name></person-group><article-title>Multiple classifier systems for supervised remote sensing image classification based on dynamic classifier selection</article-title><source>IEEE Trans. Geosci. Remote Sens.</source><year>2002</year><volume>40</volume><fpage>801</fpage><lpage>813</lpage><pub-id pub-id-type="doi">10.1109/TGRS.2002.1006354</pub-id></citation></ref>
<ref id="b10-sensors-12-04764"><label>10.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Waske</surname><given-names>B.</given-names></name><name><surname>van der Linden</surname><given-names>S.</given-names></name><name><surname>Benediktsson</surname><given-names>J.A.</given-names></name><name><surname>Rabe</surname><given-names>A.</given-names></name><name><surname>Hostert</surname><given-names>P.</given-names></name></person-group><article-title>Sensitivity of support vector machines to random feature selection in classification of hyperspectral data</article-title><source>IEEE Trans. Geosci. Remote Sens.</source><year>2010</year><volume>48</volume><fpage>2880</fpage><lpage>2889</lpage><pub-id pub-id-type="doi">10.1109/TGRS.2010.2041784</pub-id></citation></ref>
<ref id="b11-sensors-12-04764"><label>11.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname><given-names>S.</given-names></name><name><surname>Ghosh</surname><given-names>J.</given-names></name><name><surname>Crawford</surname><given-names>M.M.</given-names></name></person-group><article-title>Hierarchical fusion of multiple classifiers for hyperspectral data analysis</article-title><source>Pattern Anal. Appl.</source><year>2002</year><volume>5</volume><fpage>210</fpage><lpage>220</lpage><pub-id pub-id-type="doi">10.1007/s100440200019</pub-id></citation></ref>
<ref id="b12-sensors-12-04764"><label>12.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Kuncheva</surname><given-names>L.I.</given-names></name></person-group><source>Combining Pattern Classifiers: Methods and Algorithms</source><publisher-name>Wiley-Interscience</publisher-name><publisher-loc>Hoboken, NJ, USA</publisher-loc><year>2004</year></citation></ref>
<ref id="b13-sensors-12-04764"><label>13.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Rokach</surname><given-names>L</given-names></name></person-group><source>Pattern Classification Using Ensemble Methods</source><publisher-name>World Scientific</publisher-name><publisher-loc>Singapore</publisher-loc><year>2010</year></citation></ref>
<ref id="b14-sensors-12-04764"><label>14.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Biggio</surname><given-names>B.</given-names></name><name><surname>Fumera</surname><given-names>G.</given-names></name><name><surname>Roli</surname><given-names>F.</given-names></name></person-group><article-title>Multiple classifier systems for robust classifier design in adversarial environments</article-title><source>J. Mach. Learn. Cybern.</source><year>2010</year><volume>1</volume><fpage>27</fpage><lpage>41</lpage><pub-id pub-id-type="doi">10.1007/s13042-010-0007-7</pub-id></citation></ref>
<ref id="b15-sensors-12-04764"><label>15.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ceamanos</surname><given-names>X.</given-names></name><name><surname>Waske</surname><given-names>B.</given-names></name><name><surname>Benediktsson</surname><given-names>J.A.</given-names></name><name><surname>Chanussot</surname><given-names>J.</given-names></name><name><surname>Fauvel</surname><given-names>M.</given-names></name><name><surname>Sveinsson</surname><given-names>J.R.</given-names></name></person-group><article-title>A classiler ensemble based on fusion of support vector machines for classifying hyperspectral data</article-title><source>Int. J. Image Data Fusion</source><year>2010</year><volume>1</volume><fpage>293</fpage><lpage>307</lpage><pub-id pub-id-type="doi">10.1080/19479832.2010.485935</pub-id></citation></ref>
<ref id="b16-sensors-12-04764"><label>16.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Breiman</surname><given-names>L.</given-names></name></person-group><article-title>Bagging predictors</article-title><source>Mach. Learn.</source><year>1996</year><volume>24</volume><fpage>123</fpage><lpage>140</lpage></citation></ref>
<ref id="b17-sensors-12-04764"><label>17.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Freund</surname><given-names>Y.</given-names></name><name><surname>Schapire</surname><given-names>R.E.</given-names></name></person-group><article-title>Experiments with a New Boosting Algorithm</article-title><conf-name>Proceedings of the International Conference on Machine Learning</conf-name><conf-loc>Bari, Italy</conf-loc><conf-date>3–6 July 1996</conf-date><fpage>148</fpage><lpage>156</lpage></citation></ref>
<ref id="b18-sensors-12-04764"><label>18.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benediktsson</surname><given-names>J.A.</given-names></name><name><surname>Swain</surname><given-names>P.H.</given-names></name></person-group><article-title>Consensus theoretic classification methods</article-title><source>IEEE Trans. Syst. Man Cybern.</source><year>1992</year><volume>22</volume><fpage>688</fpage><lpage>704</lpage><pub-id pub-id-type="doi">10.1109/21.156582</pub-id></citation></ref>
<ref id="b19-sensors-12-04764"><label>19.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benediktsson</surname><given-names>J.A.</given-names></name><name><surname>Sveinsson</surname><given-names>J.R.</given-names></name></person-group><article-title>Hybrid consensus theoretic classification</article-title><source>IEEE Trans. Geosci. Remote Sens.</source><year>1997</year><volume>35</volume><fpage>833</fpage><lpage>843</lpage><pub-id pub-id-type="doi">10.1109/36.602526</pub-id></citation></ref>
<ref id="b20-sensors-12-04764"><label>20.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fauvel</surname><given-names>M.</given-names></name><name><surname>Chanussot</surname><given-names>J.</given-names></name><name><surname>Benediktsson</surname><given-names>J.A.</given-names></name></person-group><article-title>Decision fusion for the classification of urban remote sensing images</article-title><source>IEEE Trans. Geosci. Remote Sens.</source><year>2006</year><volume>44</volume><fpage>2828</fpage><lpage>2838</lpage><pub-id pub-id-type="doi">10.1109/TGRS.2006.876708</pub-id></citation></ref>
<ref id="b21-sensors-12-04764"><label>21.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ebeir</surname><given-names>L.D.</given-names></name><name><surname>Atinne</surname><given-names>P.A.L.</given-names></name><name><surname>Teen</surname><given-names>I.S.V.</given-names></name></person-group><article-title>Remote sensing classification of spectral,spatial and contextual data using multiple classifier systems</article-title><source>Image Rochester N. Y.</source><year>2001</year><volume>20</volume><fpage>584</fpage><lpage>589</lpage></citation></ref>
<ref id="b22-sensors-12-04764"><label>22.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Waske</surname><given-names>B.</given-names></name><name><surname>Benediktsson</surname><given-names>J.A.</given-names></name></person-group><article-title>Fusion of support vector machines for classification of multisensor data</article-title><source>IEEE Trans. Geosci. Remote Sens.</source><year>2007</year><volume>45</volume><fpage>3858</fpage><lpage>3866</lpage><pub-id pub-id-type="doi">10.1109/TGRS.2007.898446</pub-id></citation></ref>
<ref id="b23-sensors-12-04764"><label>23.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nemmour</surname><given-names>H.</given-names></name><name><surname>Chibani</surname><given-names>Y.</given-names></name></person-group><article-title>Multiple support vector machines for land cover change detection: An application for mapping urban extensions</article-title><source>ISPRS J. Photogramm. Remote Sens.</source><year>2006</year><volume>61</volume><fpage>125</fpage><lpage>133</lpage><pub-id pub-id-type="doi">10.1016/j.isprsjprs.2006.09.004</pub-id></citation></ref>
<ref id="b24-sensors-12-04764"><label>24.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Waske</surname><given-names>B.</given-names></name><name><surname>Braun</surname><given-names>M.</given-names></name></person-group><article-title>Classifier ensembles for land cover mapping using multitemporal SAR imagery</article-title><source>ISPRS J. Photogramm. Remote Sens.</source><year>2009</year><volume>64</volume><fpage>450</fpage><lpage>457</lpage><pub-id pub-id-type="doi">10.1016/j.isprsjprs.2009.01.003</pub-id></citation></ref>
<ref id="b25-sensors-12-04764"><label>25.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krogh</surname><given-names>A.</given-names></name><name><surname>Vedelsby</surname><given-names>J.</given-names></name></person-group><article-title>Neural network ensembles, cross validation, and active learning</article-title><source>Adv. Neural Inf. Process. Syst.</source><year>1995</year><volume>7</volume><fpage>231</fpage><lpage>238</lpage></citation></ref>
<ref id="b26-sensors-12-04764"><label>26.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chandra</surname><given-names>A.</given-names></name><name><surname>Yao</surname><given-names>X.</given-names></name></person-group><article-title>Evolving hybrid ensembles of learning machines for better generalisation</article-title><source>Neurocomputing</source><year>2006</year><volume>69</volume><fpage>686</fpage><lpage>700</lpage><pub-id pub-id-type="doi">10.1016/j.neucom.2005.12.014</pub-id></citation></ref>
<ref id="b27-sensors-12-04764"><label>27.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuncheva</surname><given-names>L.I.</given-names></name><name><surname>Whitaker</surname><given-names>C.J.</given-names></name></person-group><article-title>Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy</article-title><source>Mach. Learn.</source><year>2003</year><volume>51</volume><fpage>181</fpage><lpage>207</lpage><pub-id pub-id-type="doi">10.1023/A:1022859003006</pub-id></citation></ref>
<ref id="b28-sensors-12-04764"><label>28.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname><given-names>G.</given-names></name><name><surname>Wyatt</surname><given-names>J.</given-names></name><name><surname>Harris</surname><given-names>R.</given-names></name><name><surname>Yao</surname><given-names>X.</given-names></name></person-group><article-title>Diversity creation methods: A survey and categorisation</article-title><source>Inf. Fusion</source><year>2005</year><volume>6</volume><fpage>5</fpage><lpage>20</lpage><pub-id pub-id-type="doi">10.1016/j.inffus.2004.04.004</pub-id></citation></ref>
<ref id="b29-sensors-12-04764"><label>29.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Michail</surname><given-names>P.</given-names></name><name><surname>Benediktsson</surname><given-names>J.A.</given-names></name><name><surname>Ioannis</surname><given-names>K.</given-names></name></person-group><article-title>The effect of classifier agreement on the accuracy of the combined classifier in decision level fusion</article-title><source>IEEE Trans. Geosci. Remote Sens.</source><year>2002</year><volume>39</volume><fpage>2539</fpage><lpage>2546</lpage></citation></ref>
<ref id="b30-sensors-12-04764"><label>30.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foody</surname><given-names>G.M.</given-names></name></person-group><article-title>Classification accuracy comparison: Hypothesis tests and the use of confidence intervals in evaluations of difference, equivalence and non-inferiority</article-title><source>Remote Sens. Environ.</source><year>2009</year><volume>113</volume><fpage>1658</fpage><lpage>1663</lpage><pub-id pub-id-type="doi">10.1016/j.rse.2009.03.014</pub-id></citation></ref>
<ref id="b31-sensors-12-04764"><label>31.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ranawana</surname><given-names>R.</given-names></name><name><surname>Palade</surname><given-names>V.</given-names></name></person-group><article-title>Multi-classifier systems: Review and a roadmap for developers</article-title><source>Int. J. Hybrid Intell. Syst.</source><year>2006</year><volume>3</volume><fpage>1</fpage><lpage>41</lpage></citation></ref>
<ref id="b32-sensors-12-04764"><label>32.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuncheva</surname><given-names>L.I.</given-names></name></person-group><article-title>A theoretical study on six classifier fusion strategies</article-title><source>IEEE Trans. Pattern Anal. Mach. Intell.</source><year>2002</year><volume>24</volume><fpage>281</fpage><lpage>286</lpage><pub-id pub-id-type="doi">10.1109/34.982906</pub-id></citation></ref>
<ref id="b33-sensors-12-04764"><label>33.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Giacinto</surname><given-names>G.</given-names></name><name><surname>Roli</surname><given-names>F.</given-names></name><name><surname>Vernazza</surname><given-names>G</given-names></name></person-group><article-title>Comparison and combination of statistical and neural network algorithms for remote-sensing image classification</article-title><source>Neurocomputation in Remote Sensing Data Analysis</source><publisher-name>Springer-Verlag</publisher-name><publisher-loc>Berlin, Germany</publisher-loc><year>1997</year><fpage>117</fpage><lpage>124</lpage></citation></ref>
<ref id="b34-sensors-12-04764"><label>34.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ho</surname><given-names>T.K.</given-names></name><name><surname>Hull</surname><given-names>J.J.</given-names></name><name><surname>Srihari</surname><given-names>S.N.</given-names></name></person-group><article-title>Decision combination in multiple classifier systems</article-title><source>IEEE Trans. Pattern Anal. Mach. Intell.</source><year>1994</year><volume>16</volume><fpage>66</fpage><lpage>75</lpage><pub-id pub-id-type="doi">10.1109/34.273716</pub-id></citation></ref>
<ref id="b35-sensors-12-04764"><label>35.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giacinto</surname><given-names>G.</given-names></name></person-group><article-title>An approach to the automatic design of multiple classifier systems</article-title><source>Pattern. Recogn. Lett.</source><year>2001</year><volume>22</volume><fpage>25</fpage><lpage>33</lpage><pub-id pub-id-type="doi">10.1016/S0167-8655(00)00096-9</pub-id></citation></ref>
<ref id="b36-sensors-12-04764"><label>36.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ghosh</surname><given-names>A.</given-names></name><name><surname>Shankar</surname><given-names>B.U.</given-names></name><name><surname>Bruzzone</surname><given-names>L.</given-names></name><name><surname>Meher</surname><given-names>S.K.</given-names></name></person-group><article-title>Neuro-fuzzy-combiner: An effective multiple classifier system</article-title><source>Int. J. Knowl. Eng. Soft Data Paradig.</source><year>2010</year><volume>2</volume><fpage>107</fpage><lpage>129</lpage><pub-id pub-id-type="doi">10.1504/IJKESDP.2010.034678</pub-id></citation></ref>
<ref id="b37-sensors-12-04764"><label>37.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Woods</surname><given-names>K.</given-names></name><name><surname>Kegelmeyer</surname><given-names>W.P.</given-names></name><name><surname>Bowyer</surname><given-names>K.</given-names></name></person-group><article-title>Combination of multiple classifiers using local accuracy estimates</article-title><source>IEEE Trans. Pattern Anal. Mach. Intell.</source><year>1997</year><volume>19</volume><fpage>405</fpage><lpage>410</lpage><pub-id pub-id-type="doi">10.1109/34.588027</pub-id></citation></ref>
<ref id="b38-sensors-12-04764"><label>38.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Czyz</surname><given-names>J.</given-names></name><name><surname>Kittler</surname><given-names>J.</given-names></name><name><surname>Vandendorpe</surname><given-names>L.</given-names></name></person-group><article-title>Multiple classifier combination for face-based identity verification</article-title><source>Pattern. Recogn.</source><year>2003</year><volume>37</volume><fpage>1459</fpage><lpage>1469</lpage></citation></ref>
<ref id="b39-sensors-12-04764"><label>39.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rodriguez</surname><given-names>J.J.</given-names></name><name><surname>Kuncheva</surname><given-names>L.I.</given-names></name></person-group><article-title>Rotation forest: A new classifier ensemble method</article-title><source>IEEE Trans. Pattern Anal. Mach. Intell.</source><year>2009</year><volume>28</volume><fpage>1619</fpage><lpage>1630</lpage></citation></ref>
<ref id="b40-sensors-12-04764"><label>40.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benediktsson</surname><given-names>J.A.</given-names></name><name><surname>Kanellopoulos</surname><given-names>I.</given-names></name></person-group><article-title>Classification of multisource and hyperspectral data based on decision fusion</article-title><source>IEEE Trans. Geosci. Remote Sens.</source><year>1999</year><volume>37</volume><fpage>1367</fpage><lpage>1377</lpage><pub-id pub-id-type="doi">10.1109/36.763301</pub-id></citation></ref>
<ref id="b41-sensors-12-04764"><label>41.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fauvel</surname><given-names>M.</given-names></name><name><surname>Chanussot</surname><given-names>J.</given-names></name><name><surname>Benediktsson</surname><given-names>J.A.</given-names></name></person-group><article-title>Decision fusion for the classification of urban remote sensing images</article-title><source>IEEE Trans. Geosci. Remote Sens.</source><year>2006</year><volume>44</volume><fpage>2828</fpage><lpage>2838</lpage><pub-id pub-id-type="doi">10.1109/TGRS.2006.876708</pub-id></citation></ref>
<ref id="b42-sensors-12-04764"><label>42.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Zhou</surname><given-names>Z.H.</given-names></name><name><surname>When</surname><given-names>Semi-Supervised</given-names></name><name><surname>Learning</surname><given-names>Meets</given-names></name></person-group><article-title>Ensemble Learning</article-title><conf-name>Proceedings of the 8th International Workshop on Multiple Classifier Systems (MCS'09)</conf-name><conf-loc>Reykjavik, Iceland</conf-loc><conf-date>10–12 June 2009</conf-date><comment>Volume 5</comment><fpage>529</fpage><lpage>538</lpage></citation></ref>
<ref id="b43-sensors-12-04764"><label>43.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Brown</surname><given-names>G</given-names></name></person-group><article-title>Ensemble Learning</article-title><source>Encyclopedia of Machine Learning</source><person-group person-group-type="editor"><name><surname>Webb</surname><given-names>G.I.</given-names></name><name><surname>Sammut</surname><given-names>C.</given-names></name></person-group><publisher-name>Springer</publisher-name><publisher-loc>New York, NY, USA</publisher-loc><year>2010</year></citation></ref>
<ref id="b44-sensors-12-04764"><label>44.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname><given-names>L.</given-names></name><name><surname>Krzyzak</surname><given-names>A.</given-names></name><name><surname>Suen</surname><given-names>C.Y.</given-names></name></person-group><article-title>Methods of combining multiple classifiers and their applications to handwriting recognition</article-title><source>EEE Trans. Syst. Man Cybern.</source><year>1992</year><volume>22</volume><fpage>418</fpage><lpage>435</lpage><pub-id pub-id-type="doi">10.1109/21.155943</pub-id></citation></ref>
<ref id="b45-sensors-12-04764"><label>45.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rahman</surname><given-names>A.F.R.</given-names></name><name><surname>Fairhurst</surname><given-names>M.C.</given-names></name></person-group><article-title>Serial combination of multiple experts: A unified evaluation</article-title><source>Pattern Anal. Appl.</source><year>1999</year><volume>2</volume><fpage>292</fpage><lpage>311</lpage><pub-id pub-id-type="doi">10.1007/s100440050038</pub-id></citation></ref>
<ref id="b46-sensors-12-04764"><label>46.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aksela</surname><given-names>M.</given-names></name><name><surname>Laaksonen</surname><given-names>J.</given-names></name></person-group><article-title>Using diversity of errors for selecting members of a committee classifier</article-title><source>Pattern. Recogn.</source><year>2006</year><volume>39</volume><fpage>608</fpage><lpage>623</lpage><pub-id pub-id-type="doi">10.1016/j.patcog.2005.08.017</pub-id></citation></ref>
<ref id="b47-sensors-12-04764"><label>47.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gislason</surname><given-names>P.</given-names></name><name><surname>Benediktsson</surname><given-names>J.A.</given-names></name><name><surname>Sveinsson</surname><given-names>J.M.</given-names></name></person-group><article-title>Random forests for land cover classification</article-title><source>Pattern. Recogn. Lett.</source><year>2006</year><volume>27</volume><fpage>294</fpage><lpage>300</lpage><pub-id pub-id-type="doi">10.1016/j.patrec.2005.08.011</pub-id></citation></ref>
<ref id="b48-sensors-12-04764"><label>48.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bauer</surname><given-names>E.</given-names></name><name><surname>Kohavi</surname><given-names>R.</given-names></name></person-group><article-title>An empirical comparison of voting classification algorithms: Bagging, boosting, and variants</article-title><source>Mach. Learn.</source><year>1999</year><volume>36</volume><fpage>105</fpage><lpage>139</lpage><pub-id pub-id-type="doi">10.1023/A:1007515423169</pub-id></citation></ref>
<ref id="b49-sensors-12-04764"><label>49.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Breiman</surname><given-names>L.</given-names></name></person-group><article-title>Random forest</article-title><source>Mach. Learn.</source><year>2001</year><volume>45</volume><fpage>5</fpage><lpage>32</lpage><pub-id pub-id-type="doi">10.1023/A:1010933404324</pub-id></citation></ref>
<ref id="b50-sensors-12-04764"><label>50.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ho</surname><given-names>T.K.</given-names></name></person-group><article-title>The random subspace method for constructing decision forests</article-title><source>IEEE Trans. Pattern Anal. Mach. Intell.</source><year>1998</year><volume>20</volume><fpage>832</fpage><lpage>844</lpage><pub-id pub-id-type="doi">10.1109/34.709601</pub-id></citation></ref>
<ref id="b51-sensors-12-04764"><label>51.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Collins</surname><given-names>M.</given-names></name><name><surname>Schapire</surname><given-names>R.E.</given-names></name><name><surname>Singer</surname><given-names>Y.</given-names></name></person-group><article-title>Logistic regression, adaboost and bregman distances</article-title><source>Mach. Learn.</source><year>2002</year><volume>48</volume><fpage>31</fpage><lpage>44</lpage></citation></ref>
<ref id="b52-sensors-12-04764"><label>52.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Webb</surname><given-names>G.I.</given-names></name></person-group><article-title>MultiBoosting: A technique for combining boosting and wagging</article-title><source>Mach. Learn.</source><year>2000</year><volume>39</volume><fpage>159</fpage><lpage>196</lpage></citation></ref>
<ref id="b53-sensors-12-04764"><label>53.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>C.</given-names></name><name><surname>Zhang</surname><given-names>J.</given-names></name></person-group><article-title>RotBoost: A technique for combining Rotation Forest and AdaBoost</article-title><source>Pattern. Recogn. Lett.</source><year>2008</year><volume>29</volume><fpage>1524</fpage><lpage>1536</lpage><pub-id pub-id-type="doi">10.1016/j.patrec.2008.03.006</pub-id></citation></ref>
<ref id="b54-sensors-12-04764"><label>54.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moreno-Seco</surname><given-names>F.</given-names></name><name><surname>Inesta</surname><given-names>J.</given-names></name><name><surname>de Leon</surname><given-names>P.</given-names></name><name><surname>Mico</surname><given-names>L.</given-names></name></person-group><article-title>Comparison of classifier fusion methods for classification in pattern recognition tasks</article-title><source>Struct. Syntactic Stat. Pattern Recogn.</source><year>2006</year><volume>4109</volume><fpage>705</fpage><lpage>713</lpage></citation></ref>
<ref id="b55-sensors-12-04764"><label>55.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giacinto</surname><given-names>G.</given-names></name><name><surname>Roli</surname><given-names>F.</given-names></name></person-group><article-title>Design of effective neural network ensembles for image classification</article-title><source>Image Vis. Comput. J.</source><year>2001</year><volume>19</volume><fpage>697</fpage><lpage>705</lpage></citation></ref>
<ref id="b56-sensors-12-04764"><label>56.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shipp</surname><given-names>C.A.</given-names></name><name><surname>Kuncheva</surname><given-names>L.I.</given-names></name></person-group><article-title>Relationships between combination methods and measures of diversity in combining classifiers</article-title><source>Inf. Fusion</source><year>2001</year><volume>3</volume><fpage>135</fpage><lpage>148</lpage></citation></ref>
<ref id="b57-sensors-12-04764"><label>57.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kittler</surname><given-names>J.</given-names></name><name><surname>Hatef</surname><given-names>M.</given-names></name><name><surname>Duin</surname><given-names>R.P.W.</given-names></name><name><surname>Matas</surname><given-names>J.</given-names></name></person-group><article-title>On combining classifiers</article-title><source>IEEE Trans. Pattern Anal. Mach. Intell.</source><year>1998</year><volume>20</volume><fpage>226</fpage><lpage>239</lpage><pub-id pub-id-type="doi">10.1109/34.667881</pub-id></citation></ref>
<ref id="b58-sensors-12-04764"><label>58.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nemmour</surname><given-names>H.</given-names></name><name><surname>Chibani</surname><given-names>Y.</given-names></name></person-group><article-title>Multiple support vector machines for land cover change detection: An application for mapping urban extensions</article-title><source>ISPRS J. Photogramm. Remote Sens.</source><year>2006</year><volume>61</volume><fpage>125</fpage><lpage>133</lpage><pub-id pub-id-type="doi">10.1016/j.isprsjprs.2006.09.004</pub-id></citation></ref>
<ref id="b59-sensors-12-04764"><label>59.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Kong</surname><given-names>Z.</given-names></name><name><surname>Cai</surname><given-names>Z</given-names></name></person-group><article-title>Advances of Research in Fuzzy Integral for Classifiers' fusion</article-title><conf-name>Proceedings of the 8th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing—Volume 02</conf-name><conf-loc>Washington, DC, USA</conf-loc><conf-date>30 July–1 August 2007</conf-date><fpage>809</fpage><lpage>814</lpage></citation></ref>
<ref id="b60-sensors-12-04764"><label>60.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ben Abdallah</surname><given-names>A.C.</given-names></name><name><surname>Frigui</surname><given-names>H.</given-names></name><name><surname>Gader</surname><given-names>P</given-names></name></person-group><article-title>Adaptive local fusion with fuzzy integrals</article-title><source>IEEE Trans. Fuzzy Syst.</source><year>2012</year><comment>in press</comment></citation></ref>
<ref id="b61-sensors-12-04764"><label>61.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname><given-names>Q.</given-names></name><name><surname>Ye</surname><given-names>X.Q.</given-names></name><name><surname>Gu</surname><given-names>W.K.</given-names></name></person-group><article-title>A new combination rules of evidence theory (in Chinese)</article-title><source>Acta Electron. Sin.</source><year>2000</year><volume>8</volume><fpage>117</fpage><lpage>119</lpage></citation></ref>
<ref id="b62-sensors-12-04764"><label>62.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ko</surname><given-names>A.</given-names></name><name><surname>Sabourin</surname><given-names>R.</given-names></name><name><surname>Brittojr</surname><given-names>A.</given-names></name></person-group><article-title>From dynamic classifier selection to dynamic ensemble selection</article-title><source>Pattern. Recogn.</source><year>2008</year><volume>41</volume><fpage>1718</fpage><lpage>1731</lpage><pub-id pub-id-type="doi">10.1016/j.patcog.2007.10.015</pub-id></citation></ref>
<ref id="b63-sensors-12-04764"><label>63.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Woods</surname><given-names>K.</given-names></name><name><surname>Kegelmeyer</surname><given-names>W.P.</given-names></name><name><surname>Bowyer</surname><given-names>K.</given-names></name></person-group><article-title>Combination of multiple classifiers using local accuracy estimates</article-title><source>IEEE Trans. Pattern Anal. Mach. Intell.</source><year>1997</year><volume>19</volume><fpage>405</fpage><lpage>410</lpage><pub-id pub-id-type="doi">10.1109/34.588027</pub-id></citation></ref>
<ref id="b64-sensors-12-04764"><label>64.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giacinto</surname><given-names>G.</given-names></name><name><surname>Roli</surname><given-names>F.</given-names></name><name><surname>Fumera</surname><given-names>G.</given-names></name></person-group><article-title>Selection of image classifiers</article-title><source>Electron. Lett.</source><year>2000</year><volume>36</volume><fpage>420</fpage><lpage>422</lpage><pub-id pub-id-type="doi">10.1049/el:20000374</pub-id></citation></ref>
<ref id="b65-sensors-12-04764"><label>65.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dudani</surname><given-names>S.A.</given-names></name></person-group><article-title>The distance-weighted k-nearest-neighbor rule</article-title><source>IEEE Trans. Syst. Man Cybern.</source><year>1976</year><volume>SMC-6</volume><fpage>325</fpage><lpage>327</lpage><pub-id pub-id-type="doi">10.1109/TSMC.1976.5408784</pub-id></citation></ref>
<ref id="b66-sensors-12-04764"><label>66.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ruta</surname><given-names>D.</given-names></name><name><surname>Gabrys</surname><given-names>B.</given-names></name></person-group><article-title>An overview of classifier fusion methods</article-title><source>Comput. Inf. Syst.</source><year>2000</year><volume>7</volume><fpage>1</fpage><lpage>10</lpage></citation></ref>
<ref id="b67-sensors-12-04764"><label>67.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mountrakis</surname><given-names>G.</given-names></name><name><surname>Im</surname><given-names>J.</given-names></name><name><surname>Ogole</surname><given-names>C.</given-names></name></person-group><article-title>Support vector machines in remote sensing: A review</article-title><source>ISPRS J. Photogramm. Remote Sens.</source><year>2011</year><volume>66</volume><fpage>247</fpage><lpage>259</lpage><pub-id pub-id-type="doi">10.1016/j.isprsjprs.2010.11.001</pub-id></citation></ref>
<ref id="b68-sensors-12-04764"><label>68.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Melgani</surname><given-names>F.</given-names></name><name><surname>Bruzzone</surname><given-names>L.</given-names></name></person-group><article-title>Classification of hyperspectral remote sensing images with support vector machines</article-title><source>IEEE Trans. Geosci. Remote Sens.</source><year>2004</year><volume>42</volume><fpage>1778</fpage><lpage>1790</lpage><pub-id pub-id-type="doi">10.1109/TGRS.2004.831865</pub-id></citation></ref>
<ref id="b69-sensors-12-04764"><label>69.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pal</surname><given-names>M.</given-names></name><name><surname>Mather</surname><given-names>P.M.</given-names></name></person-group><article-title>An assessment of the effectiveness of decision tree methods for land cover classification</article-title><source>Remote Sens. Environ.</source><year>2003</year><volume>86</volume><fpage>554</fpage><lpage>565</lpage><pub-id pub-id-type="doi">10.1016/S0034-4257(03)00132-9</pub-id></citation></ref>
<ref id="b70-sensors-12-04764"><label>70.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foody</surname><given-names>G.M.</given-names></name><name><surname>Mathur</surname><given-names>A.</given-names></name></person-group><article-title>Toward intelligent training of supervised image classifications: Directing training data acquisition for SVM classification</article-title><source>Remote Sens. Environ.</source><year>2004</year><volume>93</volume><fpage>107</fpage><lpage>117</lpage><pub-id pub-id-type="doi">10.1016/j.rse.2004.06.017</pub-id></citation></ref>
<ref id="b71-sensors-12-04764"><label>71.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname><given-names>X.</given-names></name><name><surname>Weng</surname><given-names>Q.</given-names></name></person-group><article-title>Estimating impervious surfaces from medium spatial resolution imagery using the self-organizing map and multi-layer perceptron neural networks</article-title><source>Remote Sens. Environ.</source><year>2009</year><volume>113</volume><fpage>2089</fpage><lpage>2102</lpage><pub-id pub-id-type="doi">10.1016/j.rse.2009.05.014</pub-id></citation></ref>
<ref id="b72-sensors-12-04764"><label>72.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Witten</surname><given-names>I.H.</given-names></name><name><surname>Frank</surname><given-names>W</given-names></name></person-group><source>Data Mining: Practical Machine Learning Tools and Techniques</source><publisher-name>China Machine Press</publisher-name><publisher-loc>Beijing, China</publisher-loc><year>2006</year></citation></ref>
<ref id="b73-sensors-12-04764"><label>73.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dennison</surname><given-names>P.E.</given-names></name><name><surname>Halligan</surname><given-names>K.Q.</given-names></name><name><surname>Roberts</surname><given-names>D.A.</given-names></name></person-group><article-title>A comparison of error metrics and constraints for multiple endmember spectral mixture analysis and spectral angle mapper</article-title><source>Remote Sens. Environ.</source><year>2004</year><volume>93</volume><fpage>359</fpage><lpage>367</lpage><pub-id pub-id-type="doi">10.1016/j.rse.2004.07.013</pub-id></citation></ref>
<ref id="b74-sensors-12-04764"><label>74.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Quinlan</surname><given-names>J.R.</given-names></name></person-group><source>C4.5: Programs for Machine Learning</source><publisher-name>Morgan Kaufmann Publishers Inc.</publisher-name><publisher-loc>San Francisco, CA, USA</publisher-loc><year>1993</year></citation></ref>
<ref id="b75-sensors-12-04764"><label>75.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>W.</given-names></name><name><surname>Du</surname><given-names>P.</given-names></name><name><surname>Yin</surname><given-names>Z.X.</given-names></name></person-group><article-title>Application of simplified fuzzy H-ARTMAP network to hyperspectral remote sensing classification (in Chinese)</article-title><source>Sci. Surv. Mapp.</source><year>2009</year><volume>33</volume><fpage>113</fpage><lpage>115</lpage></citation></ref></ref-list>
<sec sec-type="display-objects">
<title>Figures and Tables</title>
<fig id="f1-sensors-12-04764" position="float">
<label>Figure 1.</label>
<caption>
<p>The combination style in MCS.</p></caption>
<graphic xlink:href="sensors-12-04764f1.gif"/></fig>
<fig id="f2-sensors-12-04764" position="float">
<label>Figure 2.</label>
<caption>
<p>The pseudo-code of Bagging and AdaBoost [<xref ref-type="bibr" rid="b16-sensors-12-04764">16</xref>,<xref ref-type="bibr" rid="b17-sensors-12-04764">17</xref>,<xref ref-type="bibr" rid="b47-sensors-12-04764">47</xref>].</p></caption>
<graphic xlink:href="sensors-12-04764f2.gif"/></fig>
<fig id="f3-sensors-12-04764" position="float">
<label>Figure 3.</label>
<caption>
<p>The structure of concatenation combination.</p></caption>
<graphic xlink:href="sensors-12-04764f3.gif"/></fig>
<fig id="f4-sensors-12-04764" position="float">
<label>Figure 4.</label>
<caption>
<p>False color composite of QuickBird remote sensing image.</p></caption>
<graphic xlink:href="sensors-12-04764f4.gif"/></fig>
<fig id="f5-sensors-12-04764" position="float">
<label>Figure 5.</label>
<caption>
<p>Classification results of using only spectral features.</p></caption>
<graphic xlink:href="sensors-12-04764f5a.gif"/>
<graphic xlink:href="sensors-12-04764f5b.gif"/></fig>
<fig id="f6-sensors-12-04764" position="float">
<label>Figure 6.</label>
<caption>
<p>Classification results of using both spectral and textural features.</p></caption>
<graphic xlink:href="sensors-12-04764f6.gif"/></fig>
<fig id="f7-sensors-12-04764" position="float">
<label>Figure 7.</label>
<caption>
<p>Classifications of J48 DTC Based on Bagging and AdaBoost.</p></caption>
<graphic xlink:href="sensors-12-04764f7.gif"/></fig>
<fig id="f8-sensors-12-04764" position="float">
<label>Figure 8.</label>
<caption>
<p>Classifications of RBFNN Based on Bagging and AdaBoost.</p></caption>
<graphic xlink:href="sensors-12-04764f8.gif"/></fig>
<fig id="f9-sensors-12-04764" position="float">
<label>Figure 9.</label>
<caption>
<p>Classification results based on parallel combination approaches.</p></caption>
<graphic xlink:href="sensors-12-04764f9.gif"/></fig>
<fig id="f10-sensors-12-04764" position="float">
<label>Figure 10.</label>
<caption>
<p>Classification results based on proposed concatenation combination algorithms.</p></caption>
<graphic xlink:href="sensors-12-04764f10.gif"/></fig>
<fig id="f11-sensors-12-04764" position="float">
<label>Figure 11.</label>
<caption>
<p>False color composite of OMIS image and classification results of individual classifier.</p></caption>
<graphic xlink:href="sensors-12-04764f11.gif"/></fig>
<fig id="f12-sensors-12-04764" position="float">
<label>Figure 12.</label>
<caption>
<p>Classification result based on AdaBoost and Bagging.</p></caption>
<graphic xlink:href="sensors-12-04764f12.gif"/></fig>
<fig id="f13-sensors-12-04764" position="float">
<label>Figure 13.</label>
<caption>
<p>Classification result using parallel and concatenation combination style.</p></caption>
<graphic xlink:href="sensors-12-04764f13.gif"/></fig>
<fig id="f14-sensors-12-04764" position="float">
<label>Figure 14.</label>
<caption>
<p>Classification results of Landsat ETM+ image using different schemes.</p></caption>
<graphic xlink:href="sensors-12-04764f14.gif"/></fig>
<table-wrap id="t1-sensors-12-04764" position="float">
<label>Table 1.</label>
<caption>
<p>Summary of the classifier ensemble approaches in parallel combination.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"><bold>Name</bold></th>
<th align="center" valign="top"><bold>hard labels</bold></th>
<th align="center" valign="top"><bold>soft labels</bold></th>
<th align="center" valign="top"><bold>validation set</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">Majority vote</td>
<td align="center" valign="top">Y</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">N</td></tr>
<tr>
<td align="left" valign="top">Weighted vote</td>
<td align="center" valign="top">Y</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">Y</td></tr>
<tr>
<td align="left" valign="top">Bayesian average</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">Y</td>
<td align="center" valign="top">N</td></tr>
<tr>
<td align="left" valign="top">Dempster-shafer evidence theory</td>
<td align="center" valign="top">Y</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">Y</td></tr>
<tr>
<td align="left" valign="top">Fuzzy integral</td>
<td align="center" valign="top">Y</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">Y</td></tr>
<tr>
<td align="left" valign="top">Consensus theory</td>
<td align="center" valign="top">Y</td>
<td align="center" valign="top">Y</td>
<td align="center" valign="top">Y</td></tr>
<tr>
<td align="left" valign="top">Dynamic classifier selection</td>
<td align="center" valign="top">Y</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">Y</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn1-sensors-12-04764">
<label>*</label>
<p>Note: “Y” and “N” mean whether or not the hard labels, soft labels or validation set are needed. Dynamic classifier selection method needs the original image to calculate the distance.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="t2-sensors-12-04764" position="float">
<label>Table 2.</label>
<caption>
<p>Classification accuracy of the first Scheme (only spectral features).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"><bold>Classifier</bold></th>
<th align="center" valign="top"><bold>Water</bold></th>
<th align="center" valign="top"><bold>Built-up Area</bold></th>
<th align="center" valign="top"><bold>Green Space</bold></th>
<th align="center" valign="top"><bold>Vegetation</bold></th>
<th align="center" valign="top"><bold>Barren Soil</bold></th>
<th align="center" valign="top"><bold>OA</bold></th>
<th align="center" valign="top"><bold>Kappa</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">MMDC</td>
<td align="center" valign="top"><bold>96.81</bold>%</td>
<td align="center" valign="top">48.48%</td>
<td align="center" valign="top">92.18%</td>
<td align="center" valign="top">95.64%</td>
<td align="center" valign="top">97.68%</td>
<td align="center" valign="top">84.41%</td>
<td align="center" valign="top">0.81</td></tr>
<tr>
<td align="left" valign="top">MLC</td>
<td align="center" valign="top">95.17%</td>
<td align="center" valign="top">75.00%</td>
<td align="center" valign="top">96.54%</td>
<td align="center" valign="top">92.01%</td>
<td align="center" valign="top"><bold>98.01</bold>%</td>
<td align="center" valign="top">90.80%</td>
<td align="center" valign="top">0.88</td></tr>
<tr>
<td align="left" valign="top">SVM</td>
<td align="center" valign="top">87.56%</td>
<td align="center" valign="top"><bold>97.51</bold>%</td>
<td align="center" valign="top"><bold>95.73</bold>%</td>
<td align="center" valign="top">89.05%</td>
<td align="center" valign="top">92.77%</td>
<td align="center" valign="top"><bold>92.59</bold>%</td>
<td align="center" valign="top"><bold>0.91</bold></td></tr>
<tr>
<td align="left" valign="top">MLPNN</td>
<td align="center" valign="top">92.12%</td>
<td align="center" valign="top">78.04%</td>
<td align="center" valign="top">96.98%</td>
<td align="center" valign="top"><bold>95.04</bold>%</td>
<td align="center" valign="top">97.80%</td>
<td align="center" valign="top">91.44%</td>
<td align="center" valign="top">0.89</td></tr>
<tr>
<td align="left" valign="top">RBFNN</td>
<td align="center" valign="top">94.23%</td>
<td align="center" valign="top">80.59%</td>
<td align="center" valign="top">95.25%</td>
<td align="center" valign="top">87.84%</td>
<td align="center" valign="top">96.10%</td>
<td align="center" valign="top">90.65%</td>
<td align="center" valign="top">0.88</td></tr>
<tr>
<td align="left" valign="top">J48 DTC</td>
<td align="center" valign="top">93.85%</td>
<td align="center" valign="top">89.19%</td>
<td align="center" valign="top">89.19%</td>
<td align="center" valign="top">86.75%</td>
<td align="center" valign="top">92.78%</td>
<td align="center" valign="top">91.92%</td>
<td align="center" valign="top">0.90</td></tr>
<tr>
<td align="left" valign="top">SFH-ARTMAP</td>
<td align="center" valign="top">92.77%</td>
<td align="center" valign="top">77.85%</td>
<td align="center" valign="top"><bold>97.25</bold>%</td>
<td align="center" valign="top">91.23%</td>
<td align="center" valign="top">96.39%</td>
<td align="center" valign="top">90.70%</td>
<td align="center" valign="top">0.88</td></tr></tbody></table></table-wrap>
<table-wrap id="t3-sensors-12-04764" position="float">
<label>Table 3.</label>
<caption>
<p>Classification accuracy of the second scheme (spectral and textural features).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"><bold>Classifier</bold></th>
<th align="left" valign="top"><bold>Water</bold></th>
<th align="left" valign="top"><bold>Built-up Area</bold></th>
<th align="left" valign="top"><bold>Green Space</bold></th>
<th align="left" valign="top"><bold>Vegetation</bold></th>
<th align="left" valign="top"><bold>Barren Soil</bold></th>
<th align="left" valign="top"><bold>OA</bold></th>
<th align="left" valign="top"><bold>Kappa</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">MMDC</td>
<td align="left" valign="top">88.08%</td>
<td align="left" valign="top">56.28%</td>
<td align="left" valign="top">93.61%</td>
<td align="left" valign="top"><bold>97.16</bold>%</td>
<td align="left" valign="top">96.76%</td>
<td align="left" valign="top">85.92%</td>
<td align="left" valign="top">0.82</td></tr>
<tr>
<td align="left" valign="top">MLC</td>
<td align="left" valign="top">95.59%</td>
<td align="left" valign="top">83.14%</td>
<td align="left" valign="top">97.11%</td>
<td align="left" valign="top">91.23%</td>
<td align="left" valign="top">98.34%</td>
<td align="left" valign="top">92.78%</td>
<td align="left" valign="top">0.91</td></tr>
<tr>
<td align="left" valign="top">SVM</td>
<td align="left" valign="top">90.56%</td>
<td align="left" valign="top"><bold>98.62</bold>%</td>
<td align="left" valign="top">97.39%</td>
<td align="left" valign="top">84.33%</td>
<td align="left" valign="top">94.51%</td>
<td align="left" valign="top">93.41%</td>
<td align="left" valign="top">0.92</td></tr>
<tr>
<td align="left" valign="top">MLPNN</td>
<td align="left" valign="top"><bold>97.60</bold> %</td>
<td align="left" valign="top">86.76%</td>
<td align="left" valign="top">95.12%</td>
<td align="left" valign="top">88.87%</td>
<td align="left" valign="top"><bold>98.84</bold>%</td>
<td align="left" valign="top"><bold>93.49</bold>%</td>
<td align="left" valign="top"><bold>0.92</bold></td></tr>
<tr>
<td align="left" valign="top">RBFNN</td>
<td align="left" valign="top">94.93%</td>
<td align="left" valign="top">88.66%</td>
<td align="left" valign="top">93.34%</td>
<td align="left" valign="top">89.35%</td>
<td align="left" valign="top">97.80%</td>
<td align="left" valign="top">92.83%</td>
<td align="left" valign="top">0.91</td></tr>
<tr>
<td align="left" valign="top">J48 DTC</td>
<td align="left" valign="top">92.82%</td>
<td align="left" valign="top">93.23%</td>
<td align="left" valign="top">95.69%</td>
<td align="left" valign="top">86.45%</td>
<td align="left" valign="top">96.51%</td>
<td align="left" valign="top">92.42%</td>
<td align="left" valign="top">0.90</td></tr>
<tr>
<td align="left" valign="top">SFH-ARTMAP</td>
<td align="left" valign="top">93.34%</td>
<td align="left" valign="top">87.29%</td>
<td align="left" valign="top"><bold>97.56</bold>%</td>
<td align="left" valign="top">90.87%</td>
<td align="left" valign="top"><bold>98.84</bold>%</td>
<td align="left" valign="top">93.15%</td>
<td align="left" valign="top">0.91</td></tr></tbody></table></table-wrap>
<table-wrap id="t4-sensors-12-04764" position="float">
<label>Table 4.</label>
<caption>
<p>Accuracy of AdaBoost and Bagging using different base classifier.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"><bold>Base classifier</bold></th>
<th align="center" valign="top"><bold>J48 DTC</bold></th>
<th align="center" valign="top"><bold>MLPNN</bold></th>
<th align="center" valign="top"><bold>RBFNN</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">Base classifier</td>
<td align="center" valign="top">92.42%</td>
<td align="center" valign="top">93.49%</td>
<td align="center" valign="top">92.83%</td></tr>
<tr>
<td align="left" valign="top">Bagging(25% samples)</td>
<td align="center" valign="top">94.22%</td>
<td align="center" valign="top">94.20%</td>
<td align="center" valign="top">93.76%</td></tr>
<tr>
<td align="left" valign="top">Bagging(50% samples)</td>
<td align="center" valign="top">94.55%</td>
<td align="center" valign="top">95.27%</td>
<td align="center" valign="top">93.39%</td></tr>
<tr>
<td align="left" valign="top">Bagging(75% samples)</td>
<td align="center" valign="top">94.30%</td>
<td align="center" valign="top">94.91%</td>
<td align="center" valign="top">93.40%</td></tr>
<tr>
<td align="left" valign="top">Bagging(100% samples)</td>
<td align="center" valign="top">94.93%</td>
<td align="center" valign="top">95.09%</td>
<td align="center" valign="top">93.36%</td></tr>
<tr>
<td align="left" valign="top">AdaBoost</td>
<td align="center" valign="top">95.67%</td>
<td align="center" valign="top">95.43%</td>
<td align="center" valign="top">95.35%</td></tr></tbody></table></table-wrap>
<table-wrap id="t5-sensors-12-04764" position="float">
<label>Table 5.</label>
<caption>
<p>Classification accuracy of parallel classifiers combination</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"><bold>Combination strategy</bold></th>
<th align="center" valign="top"><bold>OA</bold></th>
<th align="center" valign="top"><bold>Kappa</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">Bayesian average</td>
<td align="center" valign="top">94.99%</td>
<td align="center" valign="top">0.94</td></tr>
<tr>
<td align="left" valign="top">Logarithmic consensus</td>
<td align="center" valign="top">93.77%</td>
<td align="center" valign="top">0.92</td></tr>
<tr>
<td align="left" valign="top">Linear consensus</td>
<td align="center" valign="top">94.37%</td>
<td align="center" valign="top">0.93</td></tr>
<tr>
<td align="left" valign="top">Majority vote</td>
<td align="center" valign="top">94.74%</td>
<td align="center" valign="top">0.93</td></tr>
<tr>
<td align="left" valign="top">Evidence theory</td>
<td align="center" valign="top">93.76%</td>
<td align="center" valign="top">0.92</td></tr>
<tr>
<td align="left" valign="top">Improved evidence theory</td>
<td align="center" valign="top">94.18%</td>
<td align="center" valign="top">0.93</td></tr>
<tr>
<td align="left" valign="top">Fuzzy integral</td>
<td align="center" valign="top">94.37%</td>
<td align="center" valign="top">0.93</td></tr>
<tr>
<td align="left" valign="top">DWDCS</td>
<td align="center" valign="top">93.98%</td>
<td align="center" valign="top">0.92</td></tr></tbody></table></table-wrap>
<table-wrap id="t6-sensors-12-04764" position="float">
<label>Table 6.</label>
<caption>
<p>Classification accuracies of concatenation combination algorithms.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"><bold>Concatenation combination</bold></th>
<th align="center" valign="top"><bold>OA</bold></th>
<th align="center" valign="top"><bold>Kappa</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">Proposed method 1</td>
<td align="center" valign="top">95.21%</td>
<td align="center" valign="top">0.94</td></tr>
<tr>
<td align="left" valign="top">Proposed method 2</td>
<td align="center" valign="top">97.06%</td>
<td align="center" valign="top">0.96</td></tr>
<tr>
<td align="left" valign="top">Proposed method 3</td>
<td align="center" valign="top">95.05%</td>
<td align="center" valign="top">0.94</td></tr></tbody></table></table-wrap>
<table-wrap id="t7-sensors-12-04764" position="float">
<label>Table 7.</label>
<caption>
<p>The optimum combination selected by diversity measures.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="middle"><bold>Diversity</bold></th>
<th align="center" valign="top"/>
<th align="center" valign="middle"><bold>Classifiers combination</bold></th>
<th align="center" valign="middle"><bold>Majority vote</bold></th>
<th align="center" valign="middle"><bold>Weighted majority vote</bold></th>
<th align="center" valign="middle"><bold>Fuzzy integral</bold></th>
<th align="center" valign="middle"><bold>DS Evidence theory</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top"><italic>k</italic></td>
<td align="center" valign="top"/>
<td align="center" valign="top">1,2,3,6,7</td>
<td align="center" valign="top">94.83%</td>
<td align="center" valign="top">93.54%</td>
<td align="center" valign="top">93.77%</td>
<td align="center" valign="top">93.55%</td></tr>
<tr>
<td align="left" valign="top">k′</td>
<td align="center" valign="top"/>
<td align="center" valign="top">1,2,3,6,7</td>
<td align="center" valign="top">94.83%</td>
<td align="center" valign="top">93.54%</td>
<td align="center" valign="top">93.77%</td>
<td align="center" valign="top">93.55%</td></tr>
<tr>
<td align="left" valign="top">DF</td>
<td align="center" valign="top"/>
<td align="center" valign="top">1,3,4,6,7</td>
<td align="center" valign="top">95.05%</td>
<td align="center" valign="top">94.79%</td>
<td align="center" valign="top">94.76%</td>
<td align="center" valign="top">94.86%</td></tr>
<tr>
<td align="left" valign="top">Entropy</td>
<td align="center" valign="top"/>
<td align="center" valign="top">1,2,3,6,7</td>
<td align="center" valign="top">94.83%</td>
<td align="center" valign="top">93.54%</td>
<td align="center" valign="top">93.77%</td>
<td align="center" valign="top">93.55%</td></tr>
<tr>
<td align="left" valign="top">WCEC</td>
<td align="center" valign="top"/>
<td align="center" valign="top">1,2,3,6,7</td>
<td align="center" valign="top">94.83%</td>
<td align="center" valign="top">93.54%</td>
<td align="center" valign="top">93.77%</td>
<td align="center" valign="top">93.55%</td></tr>
<tr>
<td align="left" valign="top">DA</td>
<td align="center" valign="top">1</td>
<td align="center" valign="top">1,3,4,5,6</td>
<td align="center" valign="top">95.52%</td>
<td align="center" valign="top">95.39%</td>
<td align="center" valign="top">95.40%</td>
<td align="center" valign="top">95.36%</td></tr>
<tr>
<td align="left" valign="top"/>
<td align="center" valign="top">2</td>
<td align="center" valign="top">1,3,4,6,7</td>
<td align="center" valign="top">95.05%</td>
<td align="center" valign="top">94.79%</td>
<td align="center" valign="top">94.76%</td>
<td align="center" valign="top">94.86%</td></tr>
<tr>
<td align="left" valign="top"/>
<td align="center" valign="top">3</td>
<td align="center" valign="top">1,3,4,5,6</td>
<td align="center" valign="top">95.32%</td>
<td align="center" valign="top">95.15%</td>
<td align="center" valign="top">95.22%</td>
<td align="center" valign="top">95.20%</td></tr></tbody></table></table-wrap>
<table-wrap id="t8-sensors-12-04764" position="float">
<label>Table 8.</label>
<caption>
<p>Results of DP measure of all five classifiers' combination.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="top"><bold>Classifiers combination</bold></th>
<th align="center" valign="top"><bold>DP</bold></th>
<th align="center" valign="top"><bold>Rank</bold></th>
<th align="center" valign="top"><bold>Classifiers combination</bold></th>
<th align="center" valign="top"><bold>DP</bold></th>
<th align="center" valign="top"><bold>Rank</bold></th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">1,2,3,4,5</td>
<td align="center" valign="top">1.68520</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">1,3,4,5,7</td>
<td align="center" valign="top">1.74182</td>
<td align="center" valign="top">6</td></tr>
<tr>
<td align="center" valign="top">1,2,3,5,6</td>
<td align="center" valign="top">1.69234</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top"><bold>1,3,5,6,7</bold></td>
<td align="center" valign="top"><bold>1.75065</bold></td>
<td align="center" valign="top"><bold>3</bold></td></tr>
<tr>
<td align="center" valign="top">1,2,3,5,7</td>
<td align="center" valign="top">1.67308</td>
<td align="center" valign="top">16</td>
<td align="center" valign="top"><bold>1,3,4,6,7</bold></td>
<td align="center" valign="top"><bold>1.75104</bold></td>
<td align="center" valign="top"><bold>2</bold></td></tr>
<tr>
<td align="center" valign="top">1,2,3,4,6</td>
<td align="center" valign="top">1.69413</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">1,4,5,6,7</td>
<td align="center" valign="top">1.74351</td>
<td align="center" valign="top">5</td></tr>
<tr>
<td align="center" valign="top">1,2,3,4,7</td>
<td align="center" valign="top">1.67177</td>
<td align="center" valign="top">18</td>
<td align="center" valign="top">2,3,4,5,6</td>
<td align="center" valign="top">1.68378</td>
<td align="center" valign="top">11</td></tr>
<tr>
<td align="center" valign="top">1,2,3,6,7</td>
<td align="center" valign="top">1.68283</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">2,3,4,5,7</td>
<td align="center" valign="top">1.66168</td>
<td align="center" valign="top">21</td></tr>
<tr>
<td align="center" valign="top">1,2,4,5,6</td>
<td align="center" valign="top">1.68565</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">2,3,5,6,7</td>
<td align="center" valign="top">1.67331</td>
<td align="center" valign="top">15</td></tr>
<tr>
<td align="center" valign="top">1,2,4,5,7</td>
<td align="center" valign="top">1.66347</td>
<td align="center" valign="top">19</td>
<td align="center" valign="top">2,3,4,6,7</td>
<td align="center" valign="top">1.67285</td>
<td align="center" valign="top">17</td></tr>
<tr>
<td align="center" valign="top">1,2,5,6,7</td>
<td align="center" valign="top">1.67524</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">2,3,5,6,7</td>
<td align="center" valign="top">1.66311</td>
<td align="center" valign="top">20</td></tr>
<tr>
<td align="center" valign="top">1,2,4,6,7</td>
<td align="center" valign="top">1.67524</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">3,4,5,6,7</td>
<td align="center" valign="top">1.74569</td>
<td align="center" valign="top">4</td></tr>
<tr>
<td align="center" valign="top"><bold>1,3,4,5,6</bold></td>
<td align="center" valign="top"><bold>1.75277</bold></td>
<td align="center" valign="top"><bold>1</bold></td>
<td align="center" valign="top"/>
<td align="center" valign="top"/>
<td align="center" valign="top"/></tr></tbody></table></table-wrap>
<table-wrap id="t9-sensors-12-04764" position="float">
<label>Table 9.</label>
<caption>
<p>Classification accuracy of single classifier.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"><bold>Classifier</bold></th>
<th align="center" valign="top"><bold>Water</bold></th>
<th align="center" valign="top"><bold>Building</bold></th>
<th align="center" valign="top"><bold>Forest</bold></th>
<th align="center" valign="top"><bold>Vegetation</bold></th>
<th align="center" valign="top"><bold>Bared soil</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">SVM</td>
<td align="center" valign="top">80.57%</td>
<td align="center" valign="top">97.34%</td>
<td align="center" valign="top"><bold>92.69</bold>%</td>
<td align="center" valign="top">89.96%</td>
<td align="center" valign="top"><bold>92.97</bold>%</td></tr>
<tr>
<td align="left" valign="top">J48DT</td>
<td align="center" valign="top">88.07%</td>
<td align="center" valign="top">86.38%</td>
<td align="center" valign="top">88.80%</td>
<td align="center" valign="top">95.82%</td>
<td align="center" valign="top">80.92%</td></tr>
<tr>
<td align="left" valign="top">RBFNN</td>
<td align="center" valign="top"><bold>95.72</bold>%</td>
<td align="center" valign="top">88.70%</td>
<td align="center" valign="top">89.58%</td>
<td align="center" valign="top">90.87%</td>
<td align="center" valign="top">81.63%</td></tr>
<tr>
<td align="left" valign="top">MMDC</td>
<td align="center" valign="top">94.80%</td>
<td align="center" valign="top">96.35%</td>
<td align="center" valign="top">88.42%</td>
<td align="center" valign="top">95.82%</td>
<td align="center" valign="top">80.92%</td></tr>
<tr>
<td align="left" valign="top">MLPNN</td>
<td align="center" valign="top">87.77%</td>
<td align="center" valign="top"><bold>98.34</bold>%</td>
<td align="center" valign="top">90.35%</td>
<td align="center" valign="top"><bold>98.10</bold>%</td>
<td align="center" valign="top">88.69%</td></tr></tbody></table></table-wrap>
<table-wrap id="t10-sensors-12-04764" position="float">
<label>Table 10.</label>
<caption>
<p>Classification accuracy statistics.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"><bold>Method</bold></th>
<th align="center" valign="top"><bold>OA</bold></th>
<th align="center" valign="top"><bold>Kappa</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">SVM</td>
<td align="center" valign="top">90.72%</td>
<td align="center" valign="top">0.88</td></tr>
<tr>
<td align="left" valign="top">J48 DTC</td>
<td align="center" valign="top">87.86%</td>
<td align="center" valign="top">0.85</td></tr>
<tr>
<td align="left" valign="top">RBFNN</td>
<td align="center" valign="top">89.46%</td>
<td align="center" valign="top">0.87</td></tr>
<tr>
<td align="left" valign="top">MMDC</td>
<td align="center" valign="top">91.42%</td>
<td align="center" valign="top">0.89</td></tr>
<tr>
<td align="left" valign="top">MLPNN</td>
<td align="center" valign="top">92.53%</td>
<td align="center" valign="top">0.91</td></tr>
<tr>
<td align="left" valign="top">RBFNN based on bagging</td>
<td align="center" valign="top">89.88%</td>
<td align="center" valign="top">0.87</td></tr>
<tr>
<td align="left" valign="top">MLPNN based on bagging</td>
<td align="center" valign="top">93.09%</td>
<td align="center" valign="top">0.91</td></tr>
<tr>
<td align="left" valign="top">J48 DT based on bagging</td>
<td align="center" valign="top">90.44%</td>
<td align="center" valign="top">0.88</td></tr>
<tr>
<td align="left" valign="top">RBFNN based on AdaBoost</td>
<td align="center" valign="top">90.65%</td>
<td align="center" valign="top">0.88</td></tr>
<tr>
<td align="left" valign="top">J48 DT based on AdaBoost</td>
<td align="center" valign="top">91.00%</td>
<td align="center" valign="top">0.89</td></tr>
<tr>
<td align="left" valign="top">MLPNN based on AdaBoost</td>
<td align="center" valign="top">93.27%</td>
<td align="center" valign="top">0.91</td></tr>
<tr>
<td align="left" valign="top">Improved DS evidence theory</td>
<td align="center" valign="top">92.95%</td>
<td align="center" valign="top">0.91</td></tr>
<tr>
<td align="left" valign="top">Concatenation combination</td>
<td align="center" valign="top">93.58%</td>
<td align="center" valign="top">0.92</td></tr></tbody></table></table-wrap>
<table-wrap id="t11-sensors-12-04764" position="float">
<label>Table 11.</label>
<caption>
<p>Classification accuracies of Landsat ETM+ image.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"><bold>Method</bold></th>
<th align="left" valign="top"><bold>OA</bold></th>
<th align="left" valign="top"><bold>Kappa</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">RBFNN</td>
<td align="left" valign="top">88.09%</td>
<td align="left" valign="top">0.85</td></tr>
<tr>
<td align="left" valign="top">MMDC</td>
<td align="left" valign="top">86.35%</td>
<td align="left" valign="top">0.83</td></tr>
<tr>
<td align="left" valign="top">MLC</td>
<td align="left" valign="top">91.19%</td>
<td align="left" valign="top">0.89</td></tr>
<tr>
<td align="left" valign="top">MEDC</td>
<td align="left" valign="top">80.40%</td>
<td align="left" valign="top">0.76</td></tr>
<tr>
<td align="left" valign="top">J48 DTC</td>
<td align="left" valign="top">87.84%</td>
<td align="left" valign="top">0.85</td></tr>
<tr>
<td align="left" valign="top">Majority vote</td>
<td align="left" valign="top">93.55%</td>
<td align="left" valign="top">0.92</td></tr>
<tr>
<td align="left" valign="top">Weighted vote</td>
<td align="left" valign="top">92.56%</td>
<td align="left" valign="top">0.91</td></tr>
<tr>
<td align="left" valign="top">DS evidence theory</td>
<td align="left" valign="top">93.42%</td>
<td align="left" valign="top">0.92</td></tr>
<tr>
<td align="left" valign="top">Fuzzy integral</td>
<td align="left" valign="top">92.99%</td>
<td align="left" valign="top">0.91</td></tr></tbody></table></table-wrap></sec></back></article>
