<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Sensors</journal-id>
<journal-title>Sensors</journal-title>
<issn pub-type="epub">1424-8220</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/s110404372</article-id>
<article-id pub-id-type="publisher-id">sensors-11-04372</article-id>
<article-categories>
<subj-group>
<subject>Article</subject></subj-group></article-categories>
<title-group>
<article-title>Boosting-Based On-Road Obstacle Sensing Using Discriminative Weak Classifiers</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Adhikari</surname><given-names>Shyam Prasad</given-names></name><xref ref-type="aff" rid="af1-sensors-11-04372"><sup>1</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Yoo</surname><given-names>Hyeon-Joong</given-names></name><xref ref-type="aff" rid="af2-sensors-11-04372"><sup>2</sup></xref><xref ref-type="corresp" rid="c1-sensors-11-04372"><sup>*</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Kim</surname><given-names>Hyongsuk</given-names></name><xref ref-type="aff" rid="af1-sensors-11-04372"><sup>1</sup></xref></contrib></contrib-group>
<aff id="af1-sensors-11-04372">
<label>1</label> Division of Electronics and Information Engineering, Chonbuk National University, Jeonju 561-756, Korea; E-Mails: <email>shyam.rvision@hotmail.com</email> (S.P.A.); <email>hskim@jbnu.ac.kr</email> (H.K.)</aff>
<aff id="af2-sensors-11-04372">
<label>2</label> Department of IT Engineering, Sangmyung University, Chonan 330-720, Korea</aff>
<author-notes>
<corresp id="c1-sensors-11-04372">
<label>*</label> Author to whom correspondence should be addressed; E-Mail: <email>yoohj@smu.ac.kr</email>; Tel.: +82-10-3480-0809; Fax: +82-41-550-5355.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2011</year></pub-date>
<pub-date pub-type="epub">
<day>14</day>
<month>4</month>
<year>2011</year></pub-date>
<volume>11</volume>
<issue>4</issue>
<fpage>4372</fpage>
<lpage>4384</lpage>
<history>
<date date-type="received">
<day>14</day>
<month>1</month>
<year>2011</year></date>
<date date-type="rev-recd">
<day>20</day>
<month>3</month>
<year>2011</year></date>
<date date-type="accepted">
<day>12</day>
<month>4</month>
<year>2011</year></date></history>
<permissions>
<copyright-statement>© 2011 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
<copyright-year>2011</copyright-year>
<license>
<p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>This paper proposes an extension of the weak classifiers derived from the Haar-like features for their use in the Viola-Jones object detection system. These weak classifiers differ from the traditional single threshold ones, in that no specific threshold is needed and these classifiers give a more general solution to the non-trivial task of finding thresholds for the Haar-like features. The proposed quadratic discriminant analysis based extension prominently improves the ability of the weak classifiers to discriminate objects and non-objects. The proposed weak classifiers were evaluated by boosting a single stage classifier to detect rear of car. The experiments demonstrate that the object detector based on the proposed weak classifiers yields higher classification performance with less number of weak classifiers than the detector built with traditional single threshold weak classifiers.</p></abstract>
<kwd-group>
<kwd>weak classifiers</kwd>
<kwd>Haar-like features</kwd>
<kwd>AdaBoost</kwd>
<kwd>quadratic discriminant analysis</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>In pattern recognition, object detection generally is a two-class classification problem with two essential issues of feature selection and classifier design based on the selected features. Classifiers based on Haar-like features [<xref ref-type="bibr" rid="b1-sensors-11-04372">1</xref>] have been successfully used for object detection. Viola and Jones [<xref ref-type="bibr" rid="b2-sensors-11-04372">2</xref>] proposed an object detection framework where these Haar-like features are selected and classifier is trained using AdaBoost [<xref ref-type="bibr" rid="b3-sensors-11-04372">3</xref>]. This approach has become a popular framework for object detection and several extensions of this framework have been proposed. One of the extensions is the improvement in the boosting algorithm. Modified versions of AdaBoost such as Real AdaBoost [<xref ref-type="bibr" rid="b4-sensors-11-04372">4</xref>], FloatBoost [<xref ref-type="bibr" rid="b5-sensors-11-04372">5</xref>] and KLBoosting [<xref ref-type="bibr" rid="b6-sensors-11-04372">6</xref>] are available. Real AdaBoost is used for multi-view face detection [<xref ref-type="bibr" rid="b7-sensors-11-04372">7</xref>]. In addition to face detection [<xref ref-type="bibr" rid="b5-sensors-11-04372">5</xref>] Float Boost is also applied to hand shape detection [<xref ref-type="bibr" rid="b8-sensors-11-04372">8</xref>]. The other extension of the original framework is to use an extended set of Haar-like features so that different image patterns can be evaluated. In addition to the basic feature set of <xref ref-type="fig" rid="f1-sensors-11-04372">Figure 1(a)</xref>, an extended set of Haar-like features as shown in <xref ref-type="fig" rid="f1-sensors-11-04372">Figure 1(b,c)</xref> are introduced in [<xref ref-type="bibr" rid="b9-sensors-11-04372">9</xref>,<xref ref-type="bibr" rid="b10-sensors-11-04372">10</xref>], and [<xref ref-type="bibr" rid="b11-sensors-11-04372">11</xref>]. Mita <italic>et al</italic>. [<xref ref-type="bibr" rid="b12-sensors-11-04372">12</xref>] have selected multiple co-occurring linear weak classifiers to form a more efficient classifier. Boosting in a hierarchical feature space where the local Haar-like features are replaced by global features derived from PCA in later stages of boosting is introduced in [<xref ref-type="bibr" rid="b13-sensors-11-04372">13</xref>]. An extension of Haar-like features in which different weights, determined by techniques like Brute force search, Genetic algorithms and Fischer’s linear discriminant analysis, are assigned to the rectangles of Haar-like features is proposed in [<xref ref-type="bibr" rid="b14-sensors-11-04372">14</xref>]. Hybrid features composed of gradient features, Edgelet features and Haar-like features are used in [<xref ref-type="bibr" rid="b15-sensors-11-04372">15</xref>] for pedestrian detection.</p>
<p>The selection of threshold for the Haar-like features is not a trivial task and has not been explained in detail in [<xref ref-type="bibr" rid="b2-sensors-11-04372">2</xref>]. The weak classifiers based on single threshold Haar-like features are sub-optimal and not efficient for discriminating object and non-object. At later stages of the cascade these single threshold Haar-like features become too weak for discrimination and make boosting ineffective [<xref ref-type="bibr" rid="b13-sensors-11-04372">13</xref>]. In this paper, we propose a different set of weak classifiers for boosting that achieves higher classification accuracy with less number of weak classifiers. Unlike in [<xref ref-type="bibr" rid="b2-sensors-11-04372">2</xref>], the proposed weak classifiers do not require explicit thresholds be calculated for the Haar-like features and present a more general solution to the threshold selection problem. The proposed weak classifiers are equally efficient for discrimination at later stages of boosting also.</p>
<p>The rest of the paper is organized as follows: Section 2 describes the AdaBoost learning of the Haar-like features. Section 3 presents the proposed method for realizing efficient weak classifiers. Experimental setup and results are presented in Section 4, followed by concluding remarks in Section 5.</p></sec>
<sec>
<label>2.</label>
<title>Boosting of Weak Classifiers</title>
<p>This section describes the conventional weak classifiers and AdaBoost learning algorithm for constructing a strong classifier by selecting the weak classifiers.</p>
<sec>
<label>2.1.</label>
<title>Boosting of Weak Classifiers</title>
<p>The Haar-like features have scalar values that represent the difference in the sum of intensities between the adjacent rectangular regions. To capture the <italic>ad hoc</italic> knowledge about the domain, these features are evaluated at different positions and with different sizes exhaustively according to the base resolution of the classifier. For example, when the classifier resolution is 24 × 18 pixels, 91,620 features are generated from the five features in <xref ref-type="fig" rid="f1-sensors-11-04372">Figure 1(a,b)</xref>. Each feature is evaluated on all the training samples and the probability density for each of the object and non-object class is calculated as shown in <xref ref-type="fig" rid="f2-sensors-11-04372">Figure 2</xref>. In [<xref ref-type="bibr" rid="b2-sensors-11-04372">2</xref>], a single threshold that separates these two distributions is selected for each feature. These features along with their respective thresholds and polarity form the weak classifiers for the learning algorithm.</p>
<p>A weak classifier can be mathematically described as:
<disp-formula id="FD1">
<label>(1)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>f</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>θ</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mtext> if </mml:mtext>
<mml:mi mathvariant="italic">pf</mml:mi>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
<mml:mo>&gt;</mml:mo>
<mml:mi>p</mml:mi>
<mml:mi>θ</mml:mi></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
<mml:mtext>  otherwise     </mml:mtext></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:math></disp-formula>where <italic>x</italic> is the base resolution of the classifier, <italic>f</italic> the Haar-like feature, <italic>θ</italic> the threshold for the feature and <italic>p</italic> the polarity indicating the direction of inequality. The choice of optimal threshold for the features is not stated clearly in [<xref ref-type="bibr" rid="b2-sensors-11-04372">2</xref>] and shows to be a non-trivial task.</p></sec>
<sec>
<label>2.2.</label>
<title>AdaBoost</title>
<p>AdaBoost is a machine learning boosting algorithm that constructs a strong classifier by combining a set of weak classifiers. A small number of discriminative weak classifiers are selected by updating the sample distribution. The prediction of the strong classifier is produced through a weighted majority voting of the weak classifiers. Pseudo code of a variant of AdaBoost used in the implementation is given in Algorithm 1.</p></sec></sec>
<sec>
<title><bold>Algorithm 1.</bold> Pseudo code of Discrete AdaBoost</title>
<list list-type="simple">
<list-item>
<p>-Given example images (<italic>x</italic><sub>1</sub>, <italic>y</italic><sub>1</sub>),..., (<italic>x<sub>n</sub></italic>, <italic>y<sub>n</sub></italic>) where <italic>y<sub>i</sub></italic> = −1,1 for negative and positive examples.</p></list-item>
<list-item>
<p>-Initialize weights 
<inline-formula>
<mml:math>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>m</mml:mi></mml:mrow></mml:mfrac>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi></mml:mrow></mml:math></inline-formula>, where <italic>l</italic> and <italic>m</italic> are the number of positives and negatives respectively</p></list-item>
<list-item>
<p>-For <italic>t =</italic> 1,...,<italic>T</italic>:
<list list-type="order">
<list-item>
<p>Normalize the weights,
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi></mml:mrow></mml:msub>
<mml:mo>←</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mrow>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>n</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula></p></list-item>
<list-item>
<p>Select the best weak classifier with respect to the weighted error:
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>ɛ</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">t</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mtext>min</mml:mtext></mml:mrow></mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>θ</mml:mi></mml:mrow></mml:msub>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mi>i</mml:mi></mml:munder>
<mml:mrow>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>f</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>θ</mml:mi></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:math></disp-formula></p></list-item>
<list-item>
<p>Define <italic>h<sub>t</sub></italic>(<italic>x</italic>) = <italic>h</italic>(<italic>x</italic>, <italic>f<sub>t</sub></italic>, <italic>p<sub>t</sub></italic>, <italic>θ<sub>t</sub></italic>) where <italic>f<sub>t</sub></italic>, <italic>p<sub>t</sub></italic> and <italic>θ<sub>t</sub></italic> are minimizers of <italic>ɛ</italic><sub>t</sub></p></list-item>
<list-item>
<p>Update the weights: 
<inline-formula>
<mml:math>
<mml:mrow>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi></mml:mrow></mml:msub>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:msup></mml:mrow></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mtext>if</mml:mtext></mml:mrow></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi>t</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:msup></mml:mrow></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mtext>if</mml:mtext></mml:mrow></mml:mtd>
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>y</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>≠</mml:mo>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi>t</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:math></inline-formula>where,
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mi>t</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:mtext>ln</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>ɛ</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">t</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>ɛ</mml:mi></mml:mrow>
<mml:mi mathvariant="normal">t</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula></p></list-item></list></p></list-item>
<list-item>
<p>– The final strong classifier is:
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">final</mml:mi></mml:mrow></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">sign</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>T</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mi>t</mml:mi></mml:msub>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi>t</mml:mi></mml:msub>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula></p></list-item></list></sec>
<sec>
<label>3.</label>
<title>Proposed Weak Classifiers</title>
<p>This section describes the proposed weak classifiers which eliminate the need of explicit threshold for the Haar-like features. First we formulate the definition of the new weak classifiers based on Bayesian decision theory and quadratic discriminant analysis [<xref ref-type="bibr" rid="b16-sensors-11-04372">16</xref>]. Later we discuss the motivation to use and the relative advantage of the proposed weak classifiers over the traditional single threshold weak classifiers.</p>
<sec>
<label>3.1.</label>
<title>Bayesian Decision Rule</title>
<p>Given a set of features, the Bayesian decision theory for classification requires decision boundaries that minimize the error rate on the training data. Let us consider a two class problem with <italic>ω</italic><sub>1</sub> and <italic>ω</italic><sub>2</sub> as the state of nature. If <italic>x</italic> is the observed feature value, the decision boundary that minimizes the classification error is given in terms of the posterior probabilities as <italic>P</italic>(<italic>ω</italic><sub>1</sub>|<italic>x</italic>) = <italic>P</italic>(<italic>ω</italic><sub>2</sub>|<italic>x</italic>). The corresponding decision rule is: decide <italic>ω</italic><sub>1</sub> if <italic>P</italic>(<italic>ω</italic><sub>1</sub>|<italic>x</italic>) &gt; <italic>P</italic>(<italic>ω</italic><sub>2</sub>|<italic>x</italic>); else decide <italic>ω</italic><sub>2</sub>.</p></sec>
<sec>
<label>3.2.</label>
<title>Discriminant Function for Normal Density</title>
<p>One of the most useful ways to represent pattern classifiers is in terms of a set of discriminant functions <italic>g<sub>i</sub></italic>(<italic>x</italic>); <italic>i</italic> = 1, 2,…, <italic>c</italic>, where <italic>c</italic> is the number of categories to discriminate. The classifier is said to assign a feature <italic>x</italic> to class <italic>ω<sub>i</sub></italic> if:
<disp-formula id="FD2">
<label>(2)</label>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>&gt;</mml:mo>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext>  for all  </mml:mtext>
<mml:mi>j</mml:mi>
<mml:mo>≠</mml:mo>
<mml:mi>i</mml:mi></mml:mrow></mml:math></disp-formula></p>
<p>The effect of the decision rule is to divide the feature space into <italic>c</italic> decision regions. The regions are separated by decision boundaries, surfaces in the feature space where ties occur among the largest discriminant functions [<xref ref-type="bibr" rid="b16-sensors-11-04372">16</xref>]. Assuming the distribution of the univariate Haar-like features to be normal, <italic>i.e.</italic>, <italic>p</italic>(<italic>x</italic>|<italic>ω<sub>i</sub></italic>) ∼ <italic>N</italic>(<italic>μ<sub>i</sub></italic>, Σ<italic><sub>i</sub></italic>), the minimum error rate classification can be achieved by the use of discriminant function of the form given in <xref ref-type="disp-formula" rid="FD3">Equation (3)</xref> [<xref ref-type="bibr" rid="b16-sensors-11-04372">16</xref>]:
<disp-formula id="FD3">
<label>(3)</label>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>μ</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mi>t</mml:mi></mml:msup>
<mml:msubsup>
<mml:mrow>
<mml:mo>Σ</mml:mo></mml:mrow>
<mml:mi>i</mml:mi>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>μ</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo>−</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:mtext>ln</mml:mtext>
<mml:mn>2</mml:mn>
<mml:mi>π</mml:mi>
<mml:mo>−</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:mtext>ln</mml:mtext>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mo>Σ</mml:mo></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mo>+</mml:mo>
<mml:mtext>ln</mml:mtext>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>ω</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>where <italic>P</italic>(<italic>ω<sub>i</sub></italic>) is the priori probability of class <italic>ω<sub>i</sub></italic>. Taking a general univariate normal case with different variances for each category, the resulting discriminant function is given as:
<disp-formula id="FD4">
<label>(4)</label>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mi>t</mml:mi></mml:msup>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mi>x</mml:mi>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mi>x</mml:mi>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mn>0</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></disp-formula>where:
<disp-formula>
<mml:math display="block">
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:msubsup>
<mml:mrow>
<mml:mo>Σ</mml:mo></mml:mrow>
<mml:mi>i</mml:mi>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mo>Σ</mml:mo></mml:mrow>
<mml:mi>i</mml:mi>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi>μ</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mn>0</mml:mn></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:msubsup>
<mml:mrow>
<mml:mi>μ</mml:mi></mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>t</mml:mi></mml:msubsup>
<mml:msubsup>
<mml:mrow>
<mml:mo>Σ</mml:mo></mml:mrow>
<mml:mi>i</mml:mi>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi>μ</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>−</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:mtext>ln</mml:mtext>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mo>Σ</mml:mo></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mo>+</mml:mo>
<mml:mtext>ln</mml:mtext>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>ω</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
<p>The discriminant functions of <xref ref-type="disp-formula" rid="FD4">Equation (4)</xref> are inherently quadratic. The decision surfaces are hyperquadrics and in one dimensional case the decision regions needn’t be simply connected as shown in <xref ref-type="fig" rid="f3-sensors-11-04372">Figure 3</xref>. This observation motivates us to formulate new kind of weak classifiers without explicitly specifying the threshold for each weak classifier.</p></sec>
<sec>
<label>3.3.</label>
<title>Proposed Weak Classifiers</title>
<p>The proposed weak classifiers are based on the quadratic discriminant functions described above. Each Haar-like feature from the pool of 91,620 features is evaluated on the training samples and one-dimensional probability densities for object and non-object classes are calculated. Assuming the density of each feature to be normal, the distributions of feature on the object and non-object classes are parameterized by their maximum likelihood estimators, <italic>i.e</italic>., mean <italic>μ</italic> and variance Σ. The distribution for the object (positive) class is <italic>p</italic>(<italic>x</italic>|<italic>ω<sub>p</sub></italic>) ∼ <italic>N</italic>(<italic>μ<sub>p</sub></italic>, Σ<italic><sub>p</sub></italic>), and for non-object (negative) class is <italic>p</italic>(<italic>x</italic>|<italic>ω<sub>n</sub></italic>) ∼ <italic>N</italic>(<italic>μ<sub>n</sub></italic>, Σ<italic><sub>n</sub></italic>). The decision regions for the two distributions are given from <xref ref-type="disp-formula" rid="FD2">Equation (2)</xref>, <italic>i.e</italic>., assign the observed feature value <italic>x</italic> to class <italic>ω<sub>p</sub></italic> if:
<disp-formula id="FD5">
<label>(5)</label>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mi>p</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>&gt;</mml:mo>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mi>n</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>This decision rule divides the feature space into decision regions which needn’t be simply connected for the same class. The proposed weak classifiers for the Haar-like features are defined as:
<disp-formula id="FD6">
<label>(6)</label>
<mml:math display="block">
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>f</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>μ</mml:mi></mml:mrow>
<mml:mi>p</mml:mi></mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>μ</mml:mi></mml:mrow>
<mml:mi>n</mml:mi></mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mo>Σ</mml:mo></mml:mrow>
<mml:mi>p</mml:mi></mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mo>Σ</mml:mo></mml:mrow>
<mml:mi>n</mml:mi></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo stretchy="true">{</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mtext>if </mml:mtext>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mi>p</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>&gt;</mml:mo>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mi>n</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
<mml:mtext>                         otherwise</mml:mtext></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:math></disp-formula>where <italic>x</italic> is the base resolution of the classifier and <italic>f</italic> is the Haar-like feature. Since a more general model of the distribution is considered, the proposed weak classifiers are expected to perform better than the single threshold weak classifier.</p>
<p>For the weak classifiers of <xref ref-type="disp-formula" rid="FD1">Equation (1)</xref>, each feature produces a single scalar value and the decision boundary corresponds to a scalar threshold. But the choice of this threshold is not stated clearly in [<xref ref-type="bibr" rid="b2-sensors-11-04372">2</xref>] and determination of an optimal threshold is a nontrivial task. The proposed weak classifiers of <xref ref-type="disp-formula" rid="FD6">Equation (6)</xref> are more general and do not require any explicit representation of the threshold. In fact, the weak classifiers of <xref ref-type="disp-formula" rid="FD1">Equation (1)</xref> are a special case of the proposed weak classifiers when Σ<italic><sub>p</sub></italic> and Σ<italic><sub>n</sub></italic> are identical. The weak classifiers based on single threshold commonly employ “average of means” of the two distributions, <italic>i.e.</italic>, (μ<sub>p</sub> + μ<sub>n</sub>)/2, as decision threshold. Under this hypothesis, it is statistically observed that most of the Haar-like features are non-discriminative and inefficient for boosting.</p>
<p>The error rates of these single threshold weak classifiers selected at later stages of the boosting process become large as the sample distribution consists of samples which are difficult to discriminate as shown in <xref ref-type="fig" rid="f4-sensors-11-04372">Figure 4</xref>. The single threshold weak classifiers are not efficient in discriminating such distributions. The proposed weak classifiers are expected to efficiently discriminate the underlying distribution of <xref ref-type="fig" rid="f4-sensors-11-04372">Figure 4</xref>, as disjoint decision regions are also supported as shown in <xref ref-type="fig" rid="f3-sensors-11-04372">Figure 3</xref>.</p></sec></sec>
<sec sec-type="results">
<label>4.</label>
<title>Experimental Results</title>
<sec sec-type="methods">
<label>4.1.</label>
<title>Data Preparation</title>
<p>The experiments were carried out for detection of rear of cars. The experiments were done using 1,500 positive and 3,500 negative samples. The positive samples consisted of instances of rear of cars cropped from a video taken from a camera mounted at the front of a host car while driving in an urban environment. Each instance was resized to a base size of 24 × 18 pixels. The negative samples consisted of images cropped from random high resolution images that did not contain any instance of rear of car. Each negative sample was also resized to base size of 24 × 18 pixels. 1,000 positives and 3,000 negative samples were used for training the classifiers while the remaining 500 positive and 500 negative samples were used for validation. <xref ref-type="fig" rid="f5-sensors-11-04372">Figure 5</xref> shows some of the positive and negative samples used for the experiment.</p></sec>
<sec>
<label>4.2.</label>
<title>Performance Comparison between Proposed Weak Classifiers and Single Threshold Weak Classifiers</title>
<p>A single stage classifier was trained by AdaBoost on the training data using the proposed weak classifiers to achieve 100% hit rate on the positive samples and zero false positive on the negative samples. The final strong classifier achieved the required performance on the training data with a total of 69 proposed weak classifiers. The first weak classifier selected by AdaBoost yielded an error rate of 0.2. The subsequent selected weak classifiers yielded comparatively higher error rates. The worst error rate among the selected classifiers was 0.37 for the 66th classifier. The error rates of subsequent selected classifiers can be seen in <xref ref-type="fig" rid="f6-sensors-11-04372">Figure 6</xref>. Another strong classifier was trained on the same training data using the conventional single threshold based weak classifiers. These classifiers employed the average of means as the threshold. The final strong classifier required 225 weak classifiers to achieve similar performance on the training data. The error rate of the first selected weak classifier was 0.21 but it increased rapidly for the subsequent classifiers and the worst was 0.44 for the 208th classifier. A third strong classifier was trained on the same training data using single threshold based weak classifiers which employed Otsu’s method [<xref ref-type="bibr" rid="b17-sensors-11-04372">17</xref>] for optimal threshold selection. The final classifier consisted of 125 weak classifiers. The error rate of the first weak classifier was 0.22 and the highest error rate was 0.41 for the 124th classifier. <xref ref-type="fig" rid="f6-sensors-11-04372">Figure 6</xref> shows that the error rates of the proposed weak classifiers are consistently lower than the single threshold based counterparts.</p>
<p>Most of the features selected using the proposed weak classifiers have overlapping distributions of the object and non-object classes. Though these features have lower error rates and are boostable under the proposed hypothesis, they would have been rendered useless for boosting under the single threshold hypothesis. Some of the feature distributions are shown in <xref ref-type="fig" rid="f7-sensors-11-04372">Figure 7</xref>.</p>
<p>The selection of efficient classifiers at each round of boosting helps the learning algorithm to converge faster (in terms of the number of weak classifiers) on the training data as can be seen in <xref ref-type="fig" rid="f8-sensors-11-04372">Figure 8</xref>. Similar performance (in terms of hit rate and false positive rate) on the training data can be achieved with a significant reduction in the total number of weak classifiers by using the proposed weak classifiers over the conventional single threshold weak classifiers.</p>
<p>To investigate the generalization performance of the proposed weak classifiers, the strong classifiers were tested on a validation dataset. The validation data set consisted of 500 positive and 500 negative sample images that were not used for training. ROC curves were generated using the validation dataset. The points in the ROC curves were obtained by evaluating each of the strong classifiers against the validation dataset by sliding the stage threshold from −10 to +10 at step of 0.25. The thresholds for the stage were chosen because varying the thresholds in this range proved to be sufficient to generate the whole range in the ROC curves. The plot of the hit rate <italic>versus</italic> the false alarm rate for all the methods is given in <xref ref-type="fig" rid="f9-sensors-11-04372">Figure 9</xref>. From the ROC curves in <xref ref-type="fig" rid="f9-sensors-11-04372">Figure 9</xref>, we can see that the classifier trained using the proposed weak classifiers perform consistently better than the classifier trained using single threshold weak classifiers. The detection rate of the classifier based on the proposed weak classifiers is always higher than the classifier based on single threshold weak classifiers. And for a given detection rate, the classifier using the proposed weak classifier always has less false alarm rate than the classifiers using single threshold weak classifiers. The higher performance of the proposed method reflects the benefit of the usage of discriminant function based weak classifiers, which are more effective at discriminating car and non-car examples.</p></sec>
<sec>
<label>4.3.</label>
<title>Performance Comparison on Relatively Difficult Samples</title>
<p>In this experiment, the classifiers were trained on relatively difficult samples than those of Section 4.2. The positive samples contained 500 positive images from the original training set. The negative samples contained 2,000 negative images. The negative samples were the false positives generated when a 10 stage cascade was evaluated on random high resolution images. The 10 stage cascade was trained on the original training set using the single threshold weak classifiers. In this sense, the negative samples are relatively difficult for the single threshold weak classifiers to discriminate. Three different classifiers were trained using the three types of weak classifiers on this data to achieve 100% hit rate and zero false positives. The classifier using the proposed weak classifiers required only 36 features whereas the classifier using single threshold weak classifier employing average of means required 236 features and the classifier employing Otsu’s thresholding method required 90 features to achieve same performance on the training data. This shows that the proposed weak classifiers are equally efficient in discriminating difficult samples than the single threshold counterparts. The generalization performance of the trained classifiers was tested on the validation set as in Section 4.2. The ROC curves in <xref ref-type="fig" rid="f10-sensors-11-04372">Figure 10</xref>, generated against the validation dataset show significant performance improvement of the classifier trained using the proposed weak classifiers over the classifier using single threshold weak classifiers.</p></sec>
<sec>
<label>4.4.</label>
<title>Comparison of the Speed of the Object Detector</title>
<p>The speed of a cascaded object detector is directly proportional to the number of features evaluated per scanned sub-window in the image. To compare the speed of the detector, we trained three 15 stage cascade using the single threshold classifiers and the proposed weak classifiers on the UIUC Image database for car detection. The cascades were evaluated on the UIUC car test image set at different scales. For the classifier with single threshold employing average of means, an average of 14.5 features out of the total 248 features and for the one employing Otsu’s optimal threshold an average of 13 features out of the total of 230 were evaluated per sub-window, whereas for the proposed classifiers only an average of 8 features out of the total 131 features were evaluated per sub-window. <xref ref-type="table" rid="t1-sensors-11-04372">Table 1</xref> shows the total number of sub-windows scanned and the total features evaluated for seven images randomly sampled from the UIUC car test images at different scales. The feature value calculation time for the proposed classifier is the same as that for the single threshold Haar-like features. But from <xref ref-type="disp-formula" rid="FD3">Equation (3)</xref> we see that the proposed weak classifier requires additional multiplication and addition operation to make the class decision. This makes them relatively more expensive to compute than the single threshold classifiers. The experiments conducted show that the proposed weak classifier requires around 1.6 times more computation time than the single threshold classifier to make a class decision. However as seen from <xref ref-type="table" rid="t1-sensors-11-04372">Table 1</xref>, the single threshold classifiers need to evaluate on average around 1.6 times more features per sub-window than the proposed weak classifiers. This makes the speed of the proposed detector comparable to that of the conventional single threshold based detector.</p></sec></sec>
<sec sec-type="conclusions">
<label>5.</label>
<title>Conclusions</title>
<p>In this paper, we have proposed a new set of weak classifiers for efficient boosting. The proposed weak classifiers do not require an explicit decision threshold to be calculated as is required for the single threshold weak classifiers and present a general solution for the optimal threshold finding problem. The proposed quadratic discriminant analysis based solution significantly improves the ability of the weak classifiers to discriminate object and non-object classes. The experimental results demonstrate that the proposed weak classifiers have far less classification error rate than the single threshold weak classifiers. An object detector trained using the proposed weak classifiers using AdaBoost facilitated efficient boosting and the final classifier yielded higher classification performance with less number of weak classifiers than a detector built with traditional single threshold weak classifiers.</p></sec></body>
<back>
<ref-list>
<title>References</title>
<ref id="b1-sensors-11-04372"><label>1.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Papageorgiou</surname><given-names>CP</given-names></name><name><surname>Oren</surname><given-names>M</given-names></name><name><surname>Poggio</surname><given-names>T</given-names></name></person-group><article-title>A general framework for object detection</article-title><conf-name>Proceedings of the International Conference on Computer Vision</conf-name><conf-loc>Bombay, India</conf-loc><conf-date>4–7 January 1998</conf-date><fpage>555</fpage><lpage>562</lpage></citation></ref>
<ref id="b2-sensors-11-04372"><label>2.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Viola</surname><given-names>P</given-names></name><name><surname>Jones</surname><given-names>M</given-names></name></person-group><article-title>Rapid object detection using a boosted cascade of simple features</article-title><conf-name>Proceedings of Conference on Computer Vision and Pattern Recognition (CVPR)</conf-name><conf-loc>Kauai, HI, USA</conf-loc><conf-date>8–14 December 2001</conf-date></citation></ref>
<ref id="b3-sensors-11-04372"><label>3.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Freund</surname><given-names>Y</given-names></name><name><surname>Schapire</surname><given-names>R</given-names></name></person-group><article-title>A decision-theoretic generalization of on-line learning and an application to boosting</article-title><source>J. Comput. Syst. Sci</source><year>1997</year><volume>55</volume><fpage>119</fpage><lpage>139</lpage><pub-id pub-id-type="doi">10.1006/jcss.1997.1504</pub-id></citation></ref>
<ref id="b4-sensors-11-04372"><label>4.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schapire</surname><given-names>RE</given-names></name><name><surname>Singer</surname><given-names>Y</given-names></name></person-group><article-title>Improved boosting algorithms using confidence-rated predictions</article-title><source>Mach. Learn</source><year>1999</year><volume>37</volume><fpage>297</fpage><lpage>336</lpage><pub-id pub-id-type="doi">10.1023/A:1007614523901</pub-id></citation></ref>
<ref id="b5-sensors-11-04372"><label>5.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Liu</surname><given-names>C</given-names></name><name><surname>Shum</surname><given-names>HY</given-names></name></person-group><article-title>Kullback-leibler boosting</article-title><conf-name>Proceedings of Computer Vision and Pattern Recognition</conf-name><conf-loc>Madison, WI, USA</conf-loc><conf-date>16–22 June 2003</conf-date><fpage>587</fpage><lpage>594</lpage></citation></ref>
<ref id="b6-sensors-11-04372"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>SZ</given-names></name><name><surname>Zhang</surname><given-names>ZQ</given-names></name></person-group><article-title>Floatboost learning and statistical face detection</article-title><source>IEEE Trans. Pattern Anal. Mach. Intell</source><year>2004</year><volume>26</volume><fpage>1112</fpage><lpage>1123</lpage><pub-id pub-id-type="doi">10.1109/TPAMI.2004.68</pub-id><pub-id pub-id-type="pmid">15742888</pub-id></citation></ref>
<ref id="b7-sensors-11-04372"><label>7.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Wu</surname><given-names>B</given-names></name><name><surname>Ai</surname><given-names>H</given-names></name><name><surname>Huang</surname><given-names>C</given-names></name><name><surname>Lao</surname><given-names>S</given-names></name></person-group><article-title>Fast rotation invariant multi-view face detection based on real adaboost</article-title><conf-name>Proceedings of the 6th International Conference on Automatic Face and Gesture Recognition</conf-name><conf-loc>Seoul, Korea</conf-loc><conf-date>17–19 May 2004</conf-date><fpage>79</fpage><lpage>84</lpage></citation></ref>
<ref id="b8-sensors-11-04372"><label>8.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Ong</surname><given-names>EJ</given-names></name><name><surname>Bowden</surname><given-names>R</given-names></name></person-group><article-title>A boosted classifier tree for hand shape detection</article-title><conf-name>Proceedings of IEEE Conference on Automatic Face and Gesture Recognition</conf-name><conf-loc>Seoul, Korea</conf-loc><conf-date>17–19 May 2004</conf-date><fpage>889</fpage><lpage>894</lpage></citation></ref>
<ref id="b9-sensors-11-04372"><label>9.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lienhart</surname><given-names>R</given-names></name><name><surname>Maydt</surname><given-names>J</given-names></name></person-group><article-title>An extended set of Haar-like features for rapid object detection</article-title><source>Proc. IEEE Int. Conf. Image Process</source><year>2003</year><volume>1</volume><fpage>900</fpage><lpage>903</lpage></citation></ref>
<ref id="b10-sensors-11-04372"><label>10.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Wu</surname><given-names>B</given-names></name><name><surname>Ai</surname><given-names>H</given-names></name><name><surname>Huang</surname><given-names>C</given-names></name><name><surname>Lao</surname><given-names>S</given-names></name></person-group><article-title>Fast rotation invariant multi-view face detection based on real AdaBoost</article-title><conf-name>Proceedings of IEEE Conference on Automatic Face and Gesture Recognition</conf-name><conf-loc>Seoul, Korea</conf-loc><conf-date>17–19 May 2004</conf-date><fpage>79</fpage><lpage>84</lpage></citation></ref>
<ref id="b11-sensors-11-04372"><label>11.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Kolsch</surname><given-names>M</given-names></name><name><surname>Turk</surname><given-names>M</given-names></name></person-group><article-title>Robust hand detection</article-title><conf-name>Proceedings of IEEE Conference on Automatic Face and Gesture Recognition</conf-name><conf-loc>Seoul, Korea</conf-loc><conf-date>17–19 May 2004</conf-date><fpage>614</fpage><lpage>619</lpage></citation></ref>
<ref id="b12-sensors-11-04372"><label>12.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mita</surname><given-names>T</given-names></name><name><surname>Kaneko</surname><given-names>T</given-names></name><name><surname>Stenger</surname><given-names>B</given-names></name><name><surname>Hori</surname><given-names>H</given-names></name></person-group><article-title>Discriminative feature co-occurrence selection for object detection</article-title><source>IEEE Trans. Pattern Anal. Mach. Intell</source><year>2008</year><volume>30</volume><fpage>1257</fpage><lpage>1269</lpage><pub-id pub-id-type="doi">10.1109/TPAMI.2007.70767</pub-id><pub-id pub-id-type="pmid">18550907</pub-id></citation></ref>
<ref id="b13-sensors-11-04372"><label>13.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>D</given-names></name><name><surname>Li</surname><given-names>SZ</given-names></name><name><surname>Perez</surname><given-names>DG</given-names></name></person-group><article-title>Real-time face detection using boosting in hierarchical feature Spaces</article-title><conf-name>Proceedings of International Conference on Pattern Recognition</conf-name><conf-loc>Cambridge, UK</conf-loc><conf-date>August 2004</conf-date><fpage>411</fpage><lpage>414</lpage></citation></ref>
<ref id="b14-sensors-11-04372"><label>14.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pavani</surname><given-names>SK</given-names></name><name><surname>Delgado-Gomez</surname><given-names>D</given-names></name><name><surname>Frangi</surname><given-names>AF</given-names></name></person-group><article-title>Haar-like features with optimally weighted rectangles for rapid object detection</article-title><source>Pattern Recog</source><year>2010</year><volume>43</volume><fpage>160</fpage><lpage>172</lpage><pub-id pub-id-type="doi">10.1016/j.patcog.2009.05.011</pub-id></citation></ref>
<ref id="b15-sensors-11-04372"><label>15.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname><given-names>B</given-names></name><name><surname>Wang</surname><given-names>S</given-names></name><name><surname>Ding</surname><given-names>X</given-names></name></person-group><article-title>Multi features combination for pedestrian detection</article-title><source>J. Multimed</source><year>2010</year><volume>5</volume><fpage>79</fpage><lpage>84</lpage></citation></ref>
<ref id="b16-sensors-11-04372"><label>16.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Duda</surname><given-names>R</given-names></name><name><surname>Hart</surname><given-names>P</given-names></name><name><surname>Stork</surname><given-names>D</given-names></name></person-group><source>Pattern Classification</source><publisher-name>Wiley-Interscience</publisher-name><publisher-loc>Malden, MA, USA</publisher-loc><year>2000</year></citation></ref>
<ref id="b17-sensors-11-04372"><label>17.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Otsu</surname><given-names>NA</given-names></name></person-group><article-title>Threshold selection method from gray-level histograms</article-title><source>IEEE Trans. Syst. Man Cybern</source><year>1972</year><volume>9</volume><fpage>62</fpage><lpage>66</lpage></citation></ref></ref-list>
<sec sec-type="display-objects">
<title>Figures and Table</title>
<fig id="f1-sensors-11-04372" position="float">
<label>Figure 1.</label>
<caption>
<p>Examples of the Haar-like feature set. (<bold>a</bold>) Basic feature set which consists of two adjacent rectangles. (<bold>b</bold>) and (<bold>c</bold>) Extended feature sets which consist of different number and arrangement of rectangles, respectively.</p></caption>
<graphic xlink:href="sensors-11-04372f1.gif"/></fig>
<fig id="f2-sensors-11-04372" position="float">
<label>Figure 2.</label>
<caption>
<p>Example of feature value distributions. In [<xref ref-type="bibr" rid="b2-sensors-11-04372">2</xref>] a single threshold that separates the two distributions is used.</p></caption>
<graphic xlink:href="sensors-11-04372f2.gif"/></fig>
<fig id="f3-sensors-11-04372" position="float">
<label>Figure 3.</label>
<caption>
<p>Non-simply connected decision regions in one dimension.</p></caption>
<graphic xlink:href="sensors-11-04372f3.gif"/></fig>
<fig id="f4-sensors-11-04372" position="float">
<label>Figure 4.</label>
<caption>
<p>Typical distribution of feature values in later stages of boosting on the training data (described later).</p></caption>
<graphic xlink:href="sensors-11-04372f4.gif"/></fig>
<fig id="f5-sensors-11-04372" position="float">
<label>Figure 5.</label>
<caption>
<p>Example of the rear-of-car images (<bold>left</bold>) and the non-car images (<bold>right</bold>) used for training.</p></caption>
<graphic xlink:href="sensors-11-04372f5.gif"/></fig>
<fig id="f6-sensors-11-04372" position="float">
<label>Figure 6.</label>
<caption>
<p>Error rates of the weak classifiers selected by boosting using proposed weak classifiers and the single threshold weak classifiers.</p></caption>
<graphic xlink:href="sensors-11-04372f6.gif"/></fig>
<fig id="f7-sensors-11-04372" position="float">
<label>Figure 7.</label>
<caption>
<p>Distributions of feature values of the 6th (<bold>left</bold>) and the 9th (<bold>right</bold>) features selected by AdaBoost using the proposed weak classifiers. The features have error rate of 0.27 and 0.3 respectively under the proposed hypothesis. The single threshold approach will reject these features as inefficient since a single threshold is not sufficient to discriminate these types of distributions which are unimodal or close to unimodal.</p></caption>
<graphic xlink:href="sensors-11-04372f7.gif"/></fig>
<fig id="f8-sensors-11-04372" position="float">
<label>Figure 8.</label>
<caption>
<p>Plot of training error and the number of weak classifiers: the proposed weak classifiers and the single threshold (derived from average of means and Otsu’s optimal thresholding method) based weak classifiers.</p></caption>
<graphic xlink:href="sensors-11-04372f8.gif"/></fig>
<fig id="f9-sensors-11-04372" position="float">
<label>Figure 9.</label>
<caption>
<p>ROC curves for one stage classifiers trained using the proposed weak classifiers and single threshold (derived from average of means and Otsu’s optimal thresholding method) based weak classifiers.</p></caption>
<graphic xlink:href="sensors-11-04372f9.gif"/></fig>
<fig id="f10-sensors-11-04372" position="float">
<label>Figure 10.</label>
<caption>
<p>ROC curves for single stage classifier trained on difficult samples using the proposed weak classifiers and the single threshold weak classifiers. Single stage classifiers were trained on the negative samples acquired as false positives of an already trained 10 stage cascade using the single threshold based weak classifiers.</p></caption>
<graphic xlink:href="sensors-11-04372f10.gif"/></fig>
<table-wrap id="t1-sensors-11-04372" position="float">
<label>Table 1.</label>
<caption>
<p>Comparison of the speed of the detectors in terms of the average number of features evaluated per scanned window in the test images.</p></caption>
<table frame="box" rules="cols">
<thead>
<tr>
<th align="center" valign="middle" rowspan="4"><bold>S.No.</bold></th>
<th align="center" valign="middle" rowspan="4"><bold>Total sub-windows scanned</bold></th>
<th colspan="2" align="center" valign="middle" rowspan="2"><bold>Proposed Method</bold></th>
<th colspan="4" align="center" valign="middle"><bold>Single Threshold Method</bold>
<hr/></th></tr>
<tr>
<th colspan="2" align="center" valign="middle"><bold>Average of Means</bold></th>
<th colspan="2" align="center" valign="middle"><bold>Otsu’s Threshold</bold></th></tr>
<tr>
<th colspan="6" align="center" valign="bottom">
<hr/></th></tr>
<tr>
<th align="center" valign="middle"><bold>Total features evaluated</bold></th>
<th align="center" valign="middle"><bold>Average feature/sub-window</bold></th>
<th align="center" valign="middle"><bold>Total features evaluated</bold></th>
<th align="center" valign="middle"><bold>Average feature/sub-window</bold></th>
<th align="center" valign="middle"><bold>Total features evaluated</bold></th>
<th align="center" valign="middle"><bold>Average feature/sub-window</bold></th></tr>
<tr>
<th align="center" valign="middle" colspan="8">
<hr/></th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">1</td>
<td align="center" valign="top">63,313</td>
<td align="center" valign="top">521,545</td>
<td align="center" valign="top">8.23</td>
<td align="center" valign="top">868,037</td>
<td align="center" valign="top">13.7</td>
<td align="center" valign="top">790,047</td>
<td align="center" valign="top">12.4</td></tr>
<tr>
<td align="center" valign="top">2</td>
<td align="center" valign="top">63,618</td>
<td align="center" valign="top">541,644</td>
<td align="center" valign="top">8.51</td>
<td align="center" valign="top">929,962</td>
<td align="center" valign="top">14.61</td>
<td align="center" valign="top">859,328</td>
<td align="center" valign="top">13.5</td></tr>
<tr>
<td align="center" valign="top">3</td>
<td align="center" valign="top">85,106</td>
<td align="center" valign="top">692,548</td>
<td align="center" valign="top">8.13</td>
<td align="center" valign="top">1,173,507</td>
<td align="center" valign="top">14.9</td>
<td align="center" valign="top">1,032,627</td>
<td align="center" valign="top">12.13</td></tr>
<tr>
<td align="center" valign="top">4</td>
<td align="center" valign="top">87,378</td>
<td align="center" valign="top">763,826</td>
<td align="center" valign="top">8.74</td>
<td align="center" valign="top">1,302,362</td>
<td align="center" valign="top">14.9</td>
<td align="center" valign="top">1,122,984</td>
<td align="center" valign="top">12.85</td></tr>
<tr>
<td align="center" valign="top">5</td>
<td align="center" valign="top">40,810</td>
<td align="center" valign="top">366,326</td>
<td align="center" valign="top">8.97</td>
<td align="center" valign="top">653,903</td>
<td align="center" valign="top">16.3</td>
<td align="center" valign="top">620,030</td>
<td align="center" valign="top">15.19</td></tr>
<tr>
<td align="center" valign="top">6</td>
<td align="center" valign="top">82,354</td>
<td align="center" valign="top">688,846</td>
<td align="center" valign="top">8.36</td>
<td align="center" valign="top">1,078,020</td>
<td align="center" valign="top">13.1</td>
<td align="center" valign="top">1,052,336</td>
<td align="center" valign="top">12.77</td></tr>
<tr>
<td align="center" valign="top">7</td>
<td align="center" valign="top">58,590</td>
<td align="center" valign="top">492,783</td>
<td align="center" valign="top">8.41</td>
<td align="center" valign="top">842,548</td>
<td align="center" valign="top">14.38</td>
<td align="center" valign="top">753,639</td>
<td align="center" valign="top">12.86</td></tr></tbody></table></table-wrap></sec></back></article>
