<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Sensors</journal-id>
<journal-title>Sensors</journal-title>
<issn pub-type="epub">1424-8220</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/s120912386</article-id>
<article-id pub-id-type="publisher-id">sensors-12-12386</article-id>
<article-categories>
<subj-group>
<subject>Article</subject></subj-group></article-categories>
<title-group>
<article-title>Design of a Multi-Sensor Cooperation Travel Environment Perception System for Autonomous Vehicle</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Chen</surname><given-names>Long</given-names></name><xref ref-type="aff" rid="af1-sensors-12-12386"><sup>1</sup></xref><xref ref-type="aff" rid="af2-sensors-12-12386"><sup>2</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Li</surname><given-names>Qingquan</given-names></name><xref ref-type="aff" rid="af1-sensors-12-12386"><sup>1</sup></xref><xref ref-type="corresp" rid="c1-sensors-12-12386"><sup>*</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Li</surname><given-names>Ming</given-names></name><xref ref-type="aff" rid="af1-sensors-12-12386"><sup>1</sup></xref><xref ref-type="corresp" rid="c1-sensors-12-12386"><sup>*</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Zhang</surname><given-names>Liang</given-names></name><xref ref-type="aff" rid="af1-sensors-12-12386"><sup>1</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Mao</surname><given-names>Qingzhou</given-names></name><xref ref-type="aff" rid="af1-sensors-12-12386"><sup>1</sup></xref></contrib></contrib-group>
<aff id="af1-sensors-12-12386">
<label>1</label> State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, No.129, Luoyu Road, Wuhan, 430079, China; E-Mails: <email>lchen.whu@gmail.com</email> (L.C.); <email>zl200531610254@126.com</email> (L.Z.); <email>qzhmao@whu.edu.cn</email> (Q.M.)</aff>
<aff id="af2-sensors-12-12386">
<label>2</label> School of Electronic Information, Wuhan University, No.129, Luoyu Road, Wuhan, 430079, China</aff>
<author-notes>
<corresp id="c1-sensors-12-12386">
<label>*</label> Author to whom correspondence should be addressed; E-Mails: <email>qqli@whu.edu.cn</email> (Q.L.); <email>liming751218@gmail.com</email> (M.L.) Tel./Fax: +86-755-2653-6101.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2012</year></pub-date>
<pub-date pub-type="epub">
<day>12</day>
<month>09</month>
<year>2012</year></pub-date>
<volume>12</volume>
<issue>9</issue>
<fpage>12386</fpage>
<lpage>12404</lpage>
<history>
<date date-type="received">
<day>31</day>
<month>07</month>
<year>2012</year></date>
<date date-type="rev-recd">
<day>20</day>
<month>08</month>
<year>2012</year></date>
<date date-type="accepted">
<day>23</day>
<month>08</month>
<year>2012</year></date></history>
<permissions>
<copyright-statement>© 2012 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
<copyright-year>2012</copyright-year>
<license>
<p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>This paper describes the environment perception system designed for intelligent vehicle SmartV-II, which won the 2010 Future Challenge. This system utilizes the cooperation of multiple lasers and cameras to realize several necessary functions of autonomous navigation: road curb detection, lane detection and traffic sign recognition. Multiple single scan lasers are integrated to detect the road curb based on Z-variance method. Vision based lane detection is realized by two scans method combining with image model. Haar-like feature based method is applied for traffic sign detection and SURF matching method is used for sign classification. The results of experiments validate the effectiveness of the proposed algorithms and the whole system.</p></abstract>
<kwd-group>
<kwd>autonomous vehicle</kwd>
<kwd>travel environment perception system</kwd>
<kwd>multi-sensor cooperation</kwd>
<kwd>road and lane detection</kwd>
<kwd>traffic sign detection</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>Intelligent Vehicle System (IVS) is a comprehensive system which should have several necessary functions: travel environment perception, self-localization, path planning and vehicle control. Travel environment perception is the foundation of other functions in IVS. This paper introduces the multi-sensor fusion travel environment perception system designed for our autonomous vehicle SmartV-II. The main functions include road curb detection, lane detection and traffic sign recognition. With the help of this travel environment perception system, SmartV-II became the only robot to complete comprehensive test section of the 2010 Future Challenge in time, see <xref ref-type="fig" rid="f1-sensors-12-12386">Figure 1</xref>.</p>
<p>IVS has been studied for a long time, especially since the DAPRA Challenge held in 2005. Many effective solutions have been proposed for road and lane detection and traffic sign recognition.</p>
<sec>
<label>1.1.</label>
<title>Road and Lane Detection</title>
<p>Road can be mainly divided into structured road and unstructured road based on the structure information. The former means regular road with visible lane markings, such as highway and most urban road. For structured road, lane detection and following as the key technology have been studied over last two decades. Some effective lane detection systems have been proposed, such as AWSTM, AutoVue, RALPH ([<xref ref-type="bibr" rid="b1-sensors-12-12386">1</xref>–<xref ref-type="bibr" rid="b3-sensors-12-12386">3</xref>]), AURORA [<xref ref-type="bibr" rid="b4-sensors-12-12386">4</xref>], SCAR [<xref ref-type="bibr" rid="b5-sensors-12-12386">5</xref>], GOLD ([<xref ref-type="bibr" rid="b6-sensors-12-12386">6</xref>,<xref ref-type="bibr" rid="b7-sensors-12-12386">7</xref>]), and LOIS [<xref ref-type="bibr" rid="b8-sensors-12-12386">8</xref>]. These lane detection algorithms can be mainly grouped into two categories: edge based methods and model based methods. Edge based methods are most widely used [<xref ref-type="bibr" rid="b9-sensors-12-12386">9</xref>,<xref ref-type="bibr" rid="b10-sensors-12-12386">10</xref>].They are fast but highly dependent on the method used to extract the edges corresponding to the lane boundaries. When the road condition is complex, these methods may easily fail. Common road models include triangle model, straight line model, clothoid model, polynomial model and spline model, <italic>etc.</italic> Wang <italic>et al.</italic> [<xref ref-type="bibr" rid="b11-sensors-12-12386">11</xref>] computed the likelihood probability through fitting the detected features to the model, and Kang <italic>et al.</italic> [<xref ref-type="bibr" rid="b12-sensors-12-12386">12</xref>] and Wang <italic>et al.</italic> [<xref ref-type="bibr" rid="b13-sensors-12-12386">13</xref>] found the extreme value of the energy function to the lane location, then the Kalman filter was used for predicting the parameters of the model. These algorithms would be time-consuming because of the iterative operation. Unstructured road refers the irregular road without normal markings such as campus and park road, rural road and off-road. In the situation, researchers mainly focus on the natural road boundary and drivable range detection [<xref ref-type="bibr" rid="b14-sensors-12-12386">14</xref>–<xref ref-type="bibr" rid="b17-sensors-12-12386">17</xref>]. Lieb <italic>et al.</italic> [<xref ref-type="bibr" rid="b14-sensors-12-12386">14</xref>] used one-dimensional template matching and the sum of squared differences combined with optical flow to determine the most similar regions in front of vehicle. This method can hardly deal with the situation where there is an unexpected obstacle in the front. Dynamical sampling windows are used for training range detection in [<xref ref-type="bibr" rid="b15-sensors-12-12386">15</xref>], but the selected range can not represent the real road classes feature space well. Our previous solution of lane detection is reported in [<xref ref-type="bibr" rid="b18-sensors-12-12386">18</xref>]. In this paper, we apply a more believable method based on laser information for locating the road range, because the laser has more reliability depth information which is easier to find structural change. In [<xref ref-type="bibr" rid="b19-sensors-12-12386">19</xref>], a trigonometry based road detection method using laser scanner is proposed, which applies the relationship of neighboring three laser points. However, because of the ranging error, the relationship may be destroyed and this method will be less robust as the range increases. In this paper, a Z-Variance based road curb detection method is proposed, which is range independent. Chen <italic>et al.</italic> [<xref ref-type="bibr" rid="b20-sensors-12-12386">20</xref>] also introduced some recent developments of active vision in robotic systems.</p></sec>
<sec>
<label>1.2.</label>
<title>Traffic Sign Detection and Recognition</title>
<p>Traffic sign detection and recognition in realtime is a vital issue in IVS and Driver Assistance System (DAS). One decade age, realtime performing systems have been successfully achieved [<xref ref-type="bibr" rid="b21-sensors-12-12386">21</xref>–<xref ref-type="bibr" rid="b23-sensors-12-12386">23</xref>]. Traffic sign recognition usually consists of two components: detection and classification. First, the location of the traffic signs are found and the target rectangles are extracted in the detection stage. To which category does the candidate sign belong is the main issue needing to be addressed in the classification phase. For traffic sign detection, color segmentation is the most common method. RGB color model is widely used [<xref ref-type="bibr" rid="b24-sensors-12-12386">24</xref>]. RGB color space has a higher sensitivity to light intensity. Therefore, HIS and HSV which are not affected by the lighting changes have been used [<xref ref-type="bibr" rid="b25-sensors-12-12386">25</xref>,<xref ref-type="bibr" rid="b26-sensors-12-12386">26</xref>]. Some other authors also used YIQ [<xref ref-type="bibr" rid="b27-sensors-12-12386">27</xref>], YUV, L*a*b [<xref ref-type="bibr" rid="b28-sensors-12-12386">28</xref>] and CIE color spaces. Some authors developed databases of color pixels, look-up tables and hierarchical region growing techniques [<xref ref-type="bibr" rid="b26-sensors-12-12386">26</xref>,<xref ref-type="bibr" rid="b29-sensors-12-12386">29</xref>,<xref ref-type="bibr" rid="b30-sensors-12-12386">30</xref>]. Shape based method is usually used for a final detection after the color segmentation. Many circle, ellipse and triangle detection methods also have been used. Soetedjo and Yamada [<xref ref-type="bibr" rid="b31-sensors-12-12386">31</xref>] discussed ellipse detection in complex scene with neighborhood characteristics and symmetric features of the simple coding. Piccioli <italic>et al.</italic> [<xref ref-type="bibr" rid="b32-sensors-12-12386">32</xref>] analyzed the color information and geometrical characteristic of the edges to extract possible triangular or circular signs. For traffic sign classification, many methods have been employed for traffic signs classification such as template matching, LDA, SVM, ANN and other machine learning methods. OCR systems are applied in [<xref ref-type="bibr" rid="b28-sensors-12-12386">28</xref>,<xref ref-type="bibr" rid="b33-sensors-12-12386">33</xref>,<xref ref-type="bibr" rid="b34-sensors-12-12386">34</xref>] using the pictogram-based classification by template matching and cross-correlation. In [<xref ref-type="bibr" rid="b35-sensors-12-12386">35</xref>,<xref ref-type="bibr" rid="b36-sensors-12-12386">36</xref>], the authors make use of the LDA to distinguish between the road signs. The Multi-Layer Perception [<xref ref-type="bibr" rid="b37-sensors-12-12386">37</xref>] is widely used in the current approaches. Neural networks are also widely adopted [<xref ref-type="bibr" rid="b38-sensors-12-12386">38</xref>,<xref ref-type="bibr" rid="b39-sensors-12-12386">39</xref>]. Support vector machines (SVM) are largely adopted to classify the inner part of road signs [<xref ref-type="bibr" rid="b40-sensors-12-12386">40</xref>]. Random forests, an ensemble learning technique, are used in [<xref ref-type="bibr" rid="b41-sensors-12-12386">41</xref>] to classify signs, and a comparison is made between this technique and SVM and AdaBoost. In recent years, one of the most accepted and widely used approach in object detection has been proposed by Viola and Jones [<xref ref-type="bibr" rid="b42-sensors-12-12386">42</xref>]. Their approach is based on a cascade of detectors, where each detector is an ensemble of boosted classifiers based on the Haar-like features. Inspired by detector presented in [<xref ref-type="bibr" rid="b42-sensors-12-12386">42</xref>], we apply this method combined with color segmentation for the traffic sign detection. Different from above solutions, this paper presents a low-cost multi-sensor integrated system to realize the necessary functions based on several novel algorithms. The contributions of this paper are as follows:
<list list-type="order">
<list-item>
<p>By reasonably arranging several simple low-cost sensors, our system can realize complex functions without high-end sensors. Combination of cameras and lasers based road detection method can deal with not only structured road but also unstructured road.</p></list-item>
<list-item>
<p>Multiple sensors are skillfully installed for covering more view around the vehicle to satisfy the situation that the vehicle drives with high speed or passes a turn with high curvature.</p></list-item>
<list-item>
<p>Traffic signs are divided into six classes; for each class, we trained a classifier based on Haar-like features for the detection and the scale invariant feature SURF is used for the sign classification.</p></list-item></list></p>
<p>The rest of the paper is organized as follows. Section 2 introduces the layout of the sensors. Section 3 describes Z-variance based road curb detection. Section 4 presents two scans method for multiple lanes detection. Realtime traffic sign recognition is introduced in Section 5. Experiments and results are discussed in Section 6. Conclusions are given in Section 7.</p></sec></sec>
<sec>
<label>2.</label>
<title>Multi-Sensor Layout</title>
<p>The layout of the sensors for IVS should enable a wide view including not only the front view but also the left and right sides of the vehicle. Compared with the two successful vehicle in DAPRA Challenge, <italic>i.e.</italic>, BOSS [<xref ref-type="bibr" rid="b43-sensors-12-12386">43</xref>] from CMU and Stanley [<xref ref-type="bibr" rid="b44-sensors-12-12386">44</xref>] from Stanford University, our system uses lower cost sensors instead of the high-end laser scanners such as Velodyne and fixes several sensors in the front part of the vehicle to cover the area close to the vehicle. Our detection system arranges the layout of lasers and cameras in such a way that guarantees our range of perception should cover not only the front view of the ego vehicle but also the left and right view. This arrangement can deal with the situation where the vehicle prepares to drive through a turn with high speed. <xref ref-type="fig" rid="f2-sensors-12-12386">Figure 2</xref> shows the positions and coverage areas of the sensors. Three laser scanners are marked by 1, 2 and 3 in the upper figure. Laser 1 is mounted on the roof and Laser 2 and Laser 3 are mounted on the head of the vehicle, tilted downward to scan the road ahead. We can adjust the pitch angles <italic>ρ</italic><sub>1</sub>, <italic>ρ</italic><sub>2</sub> and <italic>ρ</italic><sub>3</sub> in order that the lasers can touch different distances ahead our vehicle. Three cameras with different pitch angles and heading angles are used for curb finding. When vehicle is traveling roughly along the straight line, the middle camera is used for lane detection. When it comes to turning, two aside cameras are chosen in order to cover the closer area around the vehicle. Data from different sensors will be transformed to the unique vehicle coordinate. Calibration is performed using OPENCV functions [<xref ref-type="bibr" rid="b45-sensors-12-12386">45</xref>] and the Camera Calibration Toolbox for MATLAB. The algorithm used is taken mainly from [<xref ref-type="bibr" rid="b46-sensors-12-12386">46</xref>].</p></sec>
<sec>
<label>3.</label>
<title>Road Curb Detection</title>
<sec>
<label>3.1.</label>
<title>Z-Variance Based Road Curb Detection</title>
<p>The laser scanner used for road shoulder detection is slanted down. The proposed method assumes that the road surface is flat. With this hypothesis, the elevation variance of the points on road surface is low, while the variance of Z value is high on the road boundary or curb. All the laser points are translated to the vehicle coordinate. Median filter is applied to filter out some tiny objects on the road such as leaves and road crack. The Z-variance of the <italic>i</italic>th point will be calculated by
<disp-formula id="FD1">
<label>(1)</label>
<mml:math id="mm1" display="block">
<mml:semantics id="sm1">
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>9</mml:mn></mml:mfrac>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>−</mml:mo>
<mml:mn>4</mml:mn></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>4</mml:mn></mml:mrow></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>The algorithm step is as follows:
<list list-type="order">
<list-item>
<p>Calculate the Z-variance of all points.</p></list-item>
<list-item>
<p>Select the points with Z-variances above the threshold <italic>t</italic>, and the segment between these two points with length wider than the vehicle will be selected as candidate road section.</p></list-item>
<list-item>
<p>Compare the mean value of height <italic>H</italic>, distance <italic>D</italic> between head of vehicle and midpoint of one section, then calculate weights for all candidate road sections by the following equation:
<disp-formula id="FD2">
<label>(2)</label>
<mml:math id="mm2" display="block">
<mml:semantics id="sm2">
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>α</mml:mi>
<mml:mo>⋅</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mtext>min</mml:mtext></mml:mrow></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mtext>min</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo stretchy="false">|</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:mrow></mml:mtd></mml:mtr></mml:mtable>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi>α</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>D</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mrow>
<mml:mtext>min</mml:mtext></mml:mrow></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mrow>
<mml:mtext>min</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo stretchy="false">|</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>H<sub>min</sub></italic> is the minimum height and <italic>D<sub>min</sub></italic> is the minimum distance, and <italic>α</italic> is a weighting factor. <italic>W<sub>i</sub></italic> ranges from 0 to 1.</p></list-item>
<list-item>
<p>The candidate road section with highest weight is considered as the real road which is expressed as pointpair, that is, left point (<italic>X<sub>l</sub>,Y<sub>l</sub></italic>) and right point (<italic>X<sub>R</sub>, Y<sub>R</sub></italic>).</p></list-item></list></p></sec>
<sec>
<label>3.2.</label>
<title>Multi-Laser Based Road Curb Fitting</title>
<p>To obtain the road boundary, only one single scan laser is not enough. Multiple lasers are combined to settle this problem. Three SICK laser scanners are used with scan range 2 m, 3.5 m and 6 m respectively. Road curb detection described above will be carried out with each laser dependently. Consequently, we can get three point-pairs which can be divided into left points ((<italic>X<sub>L</sub></italic><sup>2</sup>,<italic>Y<sub>L</sub></italic><sup>2</sup>),(<italic>X<sub>L</sub></italic><sup>3.5</sup>,<italic>Y<sub>L</sub></italic><sup>3.5</sup>) and (<italic>X<sub>L</sub></italic><sup>6</sup>,<italic>Y<sub>L</sub></italic><sup>6</sup>)) and right points ((<italic>X<sub>R</sub></italic><sup>2</sup>, <italic>Y<sub>R</sub></italic><sup>2</sup>),(<italic>X<sub>R</sub></italic><sup>3.5</sup>, <italic>Y<sub>R</sub></italic><sup>3.5</sup>) and (<italic>X<sub>R</sub></italic><sup>6</sup>, <italic>Y<sub>R</sub></italic><sup>6</sup>)). Finally, a parabola is used to fit the points on the same side, see <xref ref-type="fig" rid="f3-sensors-12-12386">Figure 3</xref>.</p>
<p>
<disp-formula id="FD3">
<label>(3)</label>
<mml:math id="mm3" display="block">
<mml:semantics id="sm3">
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi>α</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>b</mml:mi>
<mml:mi>y</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>c</mml:mi>
<mml:msup>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:semantics></mml:math></disp-formula></p></sec></sec>
<sec>
<label>4.</label>
<title>Lane Detection</title>
<p>For structured roads, this paper proposes a two scans method to detect multiple lanes. <xref ref-type="fig" rid="f4-sensors-12-12386">Figure 4</xref> is the proposed flow chart of multiple lanes detection method. Road image from top-middle camera is first preprocessed by top-hat transform and threshold. In mathematical morphology, top-hat transform is an operation that extracts small elements and details from given images. The top-hat extracts the objects that have not been eliminated by the opening. That is, it removes objects larger than the structuring element.</p>
<sec>
<label>4.1.</label>
<title>Imaging Model</title>
<p>Using the image model, we can rebuild the model of the lane plane in the 3D world space from the image in the 2D image space based on the inverse perspective mapping (IPM), and finally obtain the real width of lane markings and distance between two adjacent lane markings. The proposed image model is shown in <xref ref-type="fig" rid="f5-sensors-12-12386">Figure 5</xref>. <italic>W</italic> = (<italic>X, Y, Z</italic>) ∈ <italic>E</italic><sup>3</sup> denotes the world coordinate system <italic>WCS</italic> and <italic>I</italic> = (<italic>u, v</italic>) ∈ <italic>E</italic><sup>2</sup> denotes the image coordinate. Camera is located in <italic>C</italic>(<italic>d</italic>, 0, <italic>h</italic>) ∈ <italic>W</italic>, <italic>h</italic> is the height of the camera from the ground. Optical axis is parallel to the ground, <italic>γ</italic> is the angle between optical axis and the lane. <italic>α</italic> is horizontal view angle of the camera and <italic>β</italic> is vertical view angle. The mapping from <italic>W</italic> to <italic>I</italic> is given in <xref rid="FD4" ref-type="disp-formula">Equation (4)</xref> and the mapping from <italic>I</italic> to <italic>W</italic> is given in <xref rid="FD5" ref-type="disp-formula">Equation (5)</xref>, where <italic>H<sub>I</sub></italic> and <italic>W<sub>I</sub></italic> respectively represents horizontal resolution and vertical resolution of the camera, which can be acquired by calibration. The width of the lane marking decreases with increasing distance to the camera in perspective view. Based on imaging model, we can get the real distance Δ<italic>X</italic> in the <italic>WCS</italic> coordinate when the distance is Δ<italic>u</italic> in the line <italic>v</italic> in the image coordinate. The relationship is given in <xref rid="FD7" ref-type="disp-formula">Equation (7)</xref>.</p>
<p>
<disp-formula id="FD4">
<label>(4)</label>
<mml:math id="mm4" display="block">
<mml:semantics id="sm4">
<mml:mrow>
<mml:mtable columnalign="right">
<mml:mtr columnalign="right">
<mml:mtd columnalign="right">
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mi>I</mml:mi></mml:msub></mml:mrow>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mfrac>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>Y</mml:mi>
<mml:mo>×</mml:mo>
<mml:mo>tan</mml:mo>
<mml:mfrac>
<mml:mi>β</mml:mi>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:mo>×</mml:mo>
<mml:mo>cos</mml:mo>
<mml:mi>γ</mml:mi></mml:mrow></mml:mfrac>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="right">
<mml:mtd columnalign="right">
<mml:mrow>
<mml:mi>u</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>I</mml:mi></mml:msub></mml:mrow>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mfrac>
<mml:mi>X</mml:mi>
<mml:mrow>
<mml:mi>Y</mml:mi>
<mml:mo>tan</mml:mo>
<mml:mfrac>
<mml:mi>α</mml:mi>
<mml:mn>2</mml:mn></mml:mfrac></mml:mrow></mml:mfrac>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:semantics></mml:math></disp-formula>
<disp-formula id="FD5">
<label>(5)</label>
<mml:math id="mm5" display="block">
<mml:semantics id="sm5">
<mml:mrow>
<mml:mtable columnalign="right">
<mml:mtr columnalign="right">
<mml:mtd columnalign="right">
<mml:mrow>
<mml:mi>X</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>Y</mml:mi>
<mml:mo>tan</mml:mo>
<mml:mfrac>
<mml:mi>α</mml:mi>
<mml:mn>2</mml:mn></mml:mfrac></mml:mrow>
<mml:mrow>
<mml:mtext>cot</mml:mtext>
<mml:mi>γ</mml:mi></mml:mrow></mml:mfrac>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>u</mml:mi></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>I</mml:mi></mml:msub></mml:mrow></mml:mfrac>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="right">
<mml:mtd columnalign="right">
<mml:mrow>
<mml:mi>Y</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mo>×</mml:mo>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mi>I</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mi>I</mml:mi></mml:msub>
<mml:mo>−</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi>v</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>×</mml:mo>
<mml:mo>tan</mml:mo>
<mml:mfrac>
<mml:mi>β</mml:mi>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:mo>×</mml:mo>
<mml:mo>cos</mml:mo>
<mml:mi>γ</mml:mi></mml:mrow></mml:mfrac></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:semantics></mml:math></disp-formula>
<disp-formula id="FD6">
<label>(6)</label>
<mml:math id="mm6" display="block">
<mml:semantics id="sm6">
<mml:mrow>
<mml:mi mathvariant="normal">Δ</mml:mi>
<mml:mi>u</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="normal">Δ</mml:mi>
<mml:mi>X</mml:mi>
<mml:mo>×</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>I</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi>v</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mi>I</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext mathvariant="italic">cot</mml:mtext>
<mml:mi>γ</mml:mi>
<mml:mtext mathvariant="italic">cos</mml:mtext>
<mml:mi>γ</mml:mi>
<mml:mtext mathvariant="italic">tan</mml:mtext>
<mml:mfrac>
<mml:mi>β</mml:mi>
<mml:mn>2</mml:mn></mml:mfrac></mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mo>×</mml:mo>
<mml:mi>h</mml:mi>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mi>I</mml:mi></mml:msub>
<mml:mo>×</mml:mo>
<mml:mtext mathvariant="italic">tan</mml:mtext>
<mml:mfrac>
<mml:mi>α</mml:mi>
<mml:mn>2</mml:mn></mml:mfrac></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></disp-formula></p></sec>
<sec>
<label>4.2.</label>
<title>Two Scans Based Method for Multi-Lane Detection</title>
<p>After preprocessing, the gradient of each pixel will be calculated as follows:
<disp-formula id="FD7">
<label>(7)</label>
<mml:math id="mm7" display="block">
<mml:semantics id="sm7">
<mml:mrow>
<mml:mo>∇</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>y</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mo>∂</mml:mo>
<mml:mi>I</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo>∂</mml:mo>
<mml:mi>x</mml:mi></mml:mrow></mml:mfrac>
<mml:mo>,</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mo>∂</mml:mo>
<mml:msup>
<mml:mi>I</mml:mi>
<mml:mi>T</mml:mi></mml:msup></mml:mrow>
<mml:mrow>
<mml:mo>∂</mml:mo>
<mml:mi>y</mml:mi></mml:mrow></mml:mfrac>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>≈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>x</mml:mi></mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>y</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>D<sub>x</sub></italic> and <italic>D<sub>y</sub></italic> denote the gradient in <italic>x</italic> direction and <italic>y</italic> direction respectively. First, we want to get a most obvious lane, called the surest lane, based on the edge distribution function(EDF). EDF is the histogram of the gradient magnitude with respect to the orientation. We can estimate the magnitude and orientation by <xref rid="FD8" ref-type="disp-formula">Equation (8)</xref>. To compute this histogram, the angle <italic>θ</italic>(<italic>x, y</italic>) with the range [−90°, 90°] were quantized in 90 subintervals at a step of 2°. The surest lane is defined as the maximum value of the histogram. <xref ref-type="fig" rid="f6-sensors-12-12386">Figure 6(g)</xref> shows the RANSAC line fitting of the surest lane after the first scan. Starting from the surest lane, we can do the second scan. Other lanes could be fitted with the same method. <xref ref-type="fig" rid="f6-sensors-12-12386">Figure 6(j)</xref> shows the results of multiple lane detection. <xref ref-type="fig" rid="f6-sensors-12-12386">Figure 6(i,k)</xref> presents the global maxima and local maxima.</p>
<p>
<disp-formula id="FD8">
<label>(8)</label>
<mml:math id="mm8" display="block">
<mml:semantics id="sm8">
<mml:mrow>
<mml:mtable columnalign="right">
<mml:mtr columnalign="right">
<mml:mtd columnalign="right">
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mrow>
<mml:mo>∇</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>y</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo stretchy="false">|</mml:mo></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>x</mml:mi></mml:msub></mml:mrow>
<mml:mo stretchy="false">|</mml:mo></mml:mrow>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>y</mml:mi></mml:msub></mml:mrow>
<mml:mo stretchy="false">|</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="right">
<mml:mtd columnalign="right">
<mml:mrow>
<mml:mi>θ</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>y</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mo>tan</mml:mo>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msup>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>y</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:semantics></mml:math></disp-formula></p></sec></sec>
<sec>
<label>5.</label>
<title>Traffic Sign Detection and Classification</title>
<p>The proposed sign detection and recognition method includes two parts. The detection part is based on color segmentation, Haar-like wavelet features and AdaBoost classifier. The recognition part is based on feature matching method with the Speeded Up Robust Features (SURF). <xref ref-type="fig" rid="f7-sensors-12-12386">Figure 7</xref> is the flow chart of the traffic sign recognition system. Because Haar-like features are features of gray images, the detection method we proposed here is mainly based on the gray information. Since the shape information can mainly affect the Haar-like features, the main traffic signs that this paper copes with can be divided into six classes based on the shape, as shown in <xref ref-type="fig" rid="f8-sensors-12-12386">Figure 8</xref>.</p>
<sec>
<label>5.1.</label>
<title>Color-Based Segmentation</title>
<p>The color-based segmentation includes two steps: (1) color quantization, (2) ROI locking. In the first step, we extract the target color pixels. In the next step, we get the ROI from the pixels based on constraints on bounding box of the connected-components of the pixels. The main color includes: red, blue, yellow, white and black. In our detection method, we focus on the three colors: red, blue and yellow. The RGB color model is highly related to the light intensity. HSV color model is applied in this paper.</p>
<p>According to <xref ref-type="table" rid="t1-sensors-12-12386">Table 1</xref>, we can get the red, blue and yellow pixels from the original image. After the color segmentation, the detected pixels can form some connected regions, then we can get the enclosing rectangles (ER) of them. Based on some constraints on ER, we can wipe off many noise regions. First, the ER smaller than 20 × 20 pixels are considered as noise and not processed further. Second, the aspect ratio of ER is limited to 2. Third, the saturation of ER is no less than 0.5. The rest of ERs will be ignored. <xref ref-type="fig" rid="f9-sensors-12-12386">Figure 9</xref> shows the results of three color segmentation and ROI locking.</p></sec>
<sec>
<label>5.2.</label>
<title>AdaBoost for Traffic Sign Detection</title>
<p>The AdaBoost algorithm is a classifier learning method which combines a set of weak classifiers to construct a strong classifier and then assembles some strong classifiers to a cascade classifier. Feature selection is crucial for classifier. Motivated by the work of Tieu and Viola [<xref ref-type="bibr" rid="b47-sensors-12-12386">47</xref>], we use extended Haar-like features to train AdaBoost classifier for traffic signs detection.</p>
<p>
<disp-formula id="FD9">
<label>(9)</label>
<mml:math id="mm9" display="block">
<mml:semantics id="sm9">
<mml:mrow>
<mml:msub>
<mml:mtext mathvariant="italic">feature</mml:mtext>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>∈</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:munder>
<mml:mrow>
<mml:msub>
<mml:mi>ω</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>×</mml:mo>
<mml:mtext mathvariant="italic">RectSum</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>r</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>ω<sub>i</sub></italic> denotes the weight of rectangle, <italic>RectSum</italic>(<italic>r<sub>i</sub></italic>) is the integral of image by surrounded by rectangle <italic>r<sub>i</sub></italic>, <italic>feature<sub>j</sub></italic> is the <italic>j<sub>th</sub></italic> feature, <italic>n</italic> is arbitrarily chosen that represents the number of rectangles consisting of <italic>feature<sub>j</sub></italic>.</p></sec>
<sec>
<label>5.3.</label>
<title>SURF Matching for Classification</title>
<p>The proposed recognition method includes three steps: image scaling, SURF features extraction, features matching. The detected targets found in detection stage will be normalized to be of the same size (100 × 100) as the template which will be matched. Though SURF is a scale invariant feature, in this step we will make sure that the true sign contains enough features to be matched with the template sign. If the number of matched points is lower than a certain value, the candidate will be discarded as a noise. In order to make sure the certain value is adequate for all candidates, the image scaling is necessary. In this paper, we use bilinear interpolation for image scaling. Once the image is normalized, the SURF descriptor can be used for exacting the scale and rotation invariant features.</p>
<p>SURF [<xref ref-type="bibr" rid="b48-sensors-12-12386">48</xref>,<xref ref-type="bibr" rid="b49-sensors-12-12386">49</xref>] detector is chosen instead of the often used SIFT detector. SURF is developed to run substantially faster but possess comparable performance than SIFT. The resulting descriptor vector for all 4 × 4 sub-regions is of length 64. More details about SURF can be found in [<xref ref-type="bibr" rid="b48-sensors-12-12386">48</xref>] and [<xref ref-type="bibr" rid="b49-sensors-12-12386">49</xref>].</p>
<p>Because we have many template signs to be matched, in order to reduce the matching time, all the template signs are divided into six groups based on the color and the trained Adaboost classifiers. We used Approximate Nearest Neighbor (ANN) [<xref ref-type="bibr" rid="b50-sensors-12-12386">50</xref>] algorithm for matching. SURF features are first extracted from all the template signs which will be divided into eight groups and stored in a database. Then a candidate image is matched by individually comparing each feature of the candidate with the special database; the selection is made based on the classifier used and color information and the features are matched based on ANN. The image in the template database that gives the maximum number of matches with the candidate image is the target class. <xref ref-type="fig" rid="f10-sensors-12-12386">Figure 10</xref> shows some match results between the candidate signs and template signs. See [<xref ref-type="bibr" rid="b51-sensors-12-12386">51</xref>] for more details about the algorithm.</p></sec></sec>
<sec sec-type="results|discussion">
<label>6.</label>
<title>Results and Discussion</title>
<sec>
<label>6.1.</label>
<title>Road Curb Detection</title>
<p>In order to test the curb detection algorithm, we collected the synchronous laser data and the image data of the whole route in the Future Challenge 2010. The data set contains 9,230 frames as a combination of three laser scanners. If the road curb detected from the laser data is close to the scene in image, we consider it as true position. The final accuracy can reach 82%. <xref ref-type="fig" rid="f11-sensors-12-12386">Figure 11</xref> shows some results of the proposed road curb detection. The point in red denotes the road segment point obtained from our curb detection method. The red dashed line represents the fitting boundary based on the curb points.</p></sec>
<sec>
<label>6.2.</label>
<title>Lane Detection</title>
<p>The algorithm takes the mobile laboratory SmartV-II (<xref ref-type="fig" rid="f1-sensors-12-12386">Figure 1(b)</xref>) Wuhan University as the platform. The test image data is acquired by the analog Video Camera, which is mounted on the top of the Chery SUV with a fixed strut. The size of the recorded images is 640 × 480. For some special reason, we transform the video to 388 × 332. We tested the system under a variety of different road conditions, including structured road and unstructured road. The test data contains 15 videos and 4,319 frames in total, among which unstructured road (without lanes) consisting 2,891 frames and unstructured road (without lanes) consisting 1,428 frames. All the videos are taken on urban roads in Wuhan and Xi'an City, China. The average error rate under different conditions is lower than 9%. The average processing time is 20 ms per frame on a Pentium E5200 2.5 GHz computer. For comparison, we implemented the Canny/Hough Estimation of Vanishing Points (CHEVP) algorithm [<xref ref-type="bibr" rid="b13-sensors-12-12386">13</xref>]. Wang <italic>et al.</italic> proposes the CHEVP algorithm to initialize their B-Spline SNAKE tracking algorithm. Here, we just compare the detection algorithm instead of tracking. For all the 4,319 frames, the correct detection of CHEVP is lower than 30%, and for the 1,428 structured road frames, the correct detection is no more than 50%. The main reason is the Hough failed to grab many unobvious lines.</p>
<p><xref ref-type="fig" rid="f12-sensors-12-12386">Figure 12</xref> shows some results from the front camera under different road conditions. <xref ref-type="fig" rid="f12-sensors-12-12386">Figure 12(a)</xref> shows the roads with vehicle or shadow. <xref ref-type="fig" rid="f12-sensors-12-12386">Figure 12(b)</xref> shows the highway with orientation arrows markings. <xref ref-type="fig" rid="f12-sensors-12-12386">Figure 12(c)</xref> shows the highway with crosswalk warning markings. <xref ref-type="fig" rid="f12-sensors-12-12386">Figure 12(d)</xref> is the road with crosswalk markings. <xref ref-type="fig" rid="f12-sensors-12-12386">Figure 12(e)</xref> shows the road with pavement lettering markings.</p></sec>
<sec>
<label>6.3.</label>
<title>Traffic Sign Detection and Recognition</title>
<p>The test image data is acquired by the CCD Video Camera which is mounted on the top of the Chery SUV with a fixed strut. The size of the recorded images is of 640 × 480. We tested the system under a variety of different conditions. To evaluate the performance of the proposed method, 200 images were taken as test images, in which there are 281 traffic signs.</p>
<p>In this paper, six classifiers were trained for the six classes of signs listed in <xref ref-type="fig" rid="f8-sensors-12-12386">Figure 8</xref>. For all the classifiers, the number of position samples (PS) and negative samples (NS) are listed in <xref ref-type="table" rid="t2-sensors-12-12386">Table 2</xref>. Our method can detect road signs in 50 ms. In the 281 signs, there are 265 signs being correctly detected, 14 signs being missed, and 2 signs being false alarm. Thus the detection rate is 94.3%, demonstrating that the proposed detection method is effective and efficient. Some detection results are shown in <xref ref-type="fig" rid="f13-sensors-12-12386">Figure 13</xref> to demonstrate that our method is insensitive to many complex conditions.</p>
<p>The 265 detected traffic signs are used to evaluate the performance of the proposed method. Among the 265 signs, 244 signs are correctly classified and 14 signs are falsely classified. The recognition accuracy is 92.7%.</p></sec></sec>
<sec sec-type="conclusions">
<label>7.</label>
<title>Conclusions</title>
<p>In this paper, we propose a real-time traveling environment perception system for autonomous vehicle navigation. Our system makes use of the good aspects of laser and camera respectively. At the same time, the combination of multiple lasers and multiple cameras can cover all the front view of ego vehicle, and their information fusion can deal with tough situations. The functions of our perception system include road shoulder detection, lane detection and traffic sign recognition. Many experiment results show that our system is reliable in synthetic urban environment. Our future work will also introduce the Velodyne laser scanner to deal with more complex road conditions and make use of SLAM to develop our IVS.</p></sec></body>
<back>
<ack>
<p>The work described in this paper was supported by the National Natural Science Foundation of China (Grant No. 91120002 and No. 2011AA110403) and Major State Basic Research Development Program (No. 2010CB732100).</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-sensors-12-12386"><label>1.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Batavia</surname><given-names>P.</given-names></name><name><surname>Pomerleau</surname><given-names>D.</given-names></name><name><surname>Thorpe</surname><given-names>C.</given-names></name></person-group><article-title>Predicting Lane Position for Roadway Departure Prevention</article-title><conf-name>Proceedings of the IEEE Intelligent Vehicles Symposium</conf-name><conf-loc>Berlin, Germany</conf-loc><conf-date>October 1998</conf-date></citation></ref>
<ref id="b2-sensors-12-12386"><label>2.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Batavia</surname><given-names>P.</given-names></name></person-group><source>Driver Adaptive Warning Systems</source><comment>Technical Report CMU-RI-TR-98-07</comment><publisher-name>Carnegie Mellon University</publisher-name><publisher-loc>Pittsburgh, PA, USA</publisher-loc><month>March</month><year>1998</year></citation></ref>
<ref id="b3-sensors-12-12386"><label>3.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bertozzi</surname><given-names>M.</given-names></name><name><surname>Broggi</surname><given-names>A.</given-names></name><name><surname>Cellario</surname><given-names>M.</given-names></name><name><surname>Fascioli</surname><given-names>A.</given-names></name><name><surname>Lombardi</surname><given-names>P.</given-names></name><name><surname>Porta</surname><given-names>M.</given-names></name></person-group><article-title>Artificial Vision in Road Vehicles</article-title><source>Proc. IEEE</source><year>2002</year><volume>90</volume><fpage>1258</fpage><lpage>1271</lpage><pub-id pub-id-type="doi">10.1109/JPROC.2002.801444</pub-id></citation></ref>
<ref id="b4-sensors-12-12386"><label>4.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>M.</given-names></name><name><surname>Jochem</surname><given-names>T.</given-names></name><name><surname>Pomerleau</surname><given-names>D.</given-names></name></person-group><article-title>AURORA: A Vision-based Roadway Departure Warning System</article-title><conf-name>Proceedings of the 1995 IEEE/RSJ International Conference on Intelligent Robots and Systems 95, Human Robot Interaction and Cooperative Robots</conf-name><conf-loc>Pittsburgh, PA, USA</conf-loc><conf-date>August 1995</conf-date><comment>Volume 1</comment><fpage>243</fpage><lpage>248</lpage></citation></ref>
<ref id="b5-sensors-12-12386"><label>5.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Pomerleau</surname><given-names>D.</given-names></name><name><surname>Thorpe</surname><given-names>C.</given-names></name><name><surname>Emery</surname><given-names>L.</given-names></name></person-group><article-title>Performance Specification Development for Roadway Departure Collision Avoidance Systems</article-title><conf-name>Proceedings of the 4th World Congress on Intelligent Transport Systems</conf-name><conf-loc>Berlin, Germany</conf-loc><conf-date>October 1997</conf-date></citation></ref>
<ref id="b6-sensors-12-12386"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bertozzi</surname><given-names>M.</given-names></name><name><surname>Broggi</surname><given-names>A.</given-names></name></person-group><article-title>Vision-Based Vehicle Guidance</article-title><source>Computer</source><year>1997</year><volume>30</volume><fpage>49</fpage><lpage>55</lpage></citation></ref>
<ref id="b7-sensors-12-12386"><label>7.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bertozzi</surname><given-names>M.</given-names></name><name><surname>Broggi</surname><given-names>A.</given-names></name></person-group><article-title>GOLD: A Parallel Real-time Stereo Vision System for Generic Obstacle and lane detection</article-title><source>IEEE Trans. Imag. Proc.</source><year>1998</year><volume>7</volume><fpage>62</fpage><lpage>81</lpage><pub-id pub-id-type="doi">10.1109/83.650851</pub-id></citation></ref>
<ref id="b8-sensors-12-12386"><label>8.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Kluge</surname><given-names>K.</given-names></name><name><surname>Lakshmanan</surname><given-names>S.</given-names></name></person-group><article-title>A Deformable-Template Approach to Lane Detection</article-title><conf-name>Proceedings of the Intelligent Vehicles '95 Symposium</conf-name><conf-loc>Detroit, MI, USA</conf-loc><conf-date>25–26 September 1995</conf-date><fpage>54</fpage><lpage>59</lpage></citation></ref>
<ref id="b9-sensors-12-12386"><label>9.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Broggi</surname><given-names>A.</given-names></name></person-group><article-title>Robust Real-time Lane and Road Detection in Critical Shadow Conditions</article-title><conf-name>Proceedings of International Symposium on Computer Vision</conf-name><conf-loc>Coral Gables, FL, USA</conf-loc><conf-date>November 1995</conf-date><fpage>353</fpage><lpage>358</lpage></citation></ref>
<ref id="b10-sensors-12-12386"><label>10.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Paetzold</surname><given-names>F.</given-names></name><name><surname>Franke</surname><given-names>U.</given-names></name><name><surname>von Seelen</surname><given-names>W.</given-names></name></person-group><article-title>Lane Recognition in Urban Environment Using Optimal Control Theory</article-title><conf-name>Proceedings of the Intelligent Vehicles Symposium</conf-name><conf-loc>Dearborn, MI, USA</conf-loc><conf-date>October 2000</conf-date><fpage>221</fpage><lpage>226</lpage></citation></ref>
<ref id="b11-sensors-12-12386"><label>11.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname><given-names>Y.</given-names></name><name><surname>Shen</surname><given-names>D.</given-names></name><name><surname>Teoh</surname><given-names>E.</given-names></name></person-group><article-title>Lane Detection Using Spline Model</article-title><source>Pattern Recognit. Lett.</source><year>2000</year><volume>21</volume><fpage>677</fpage><lpage>689</lpage><pub-id pub-id-type="doi">10.1016/S0167-8655(00)00021-0</pub-id></citation></ref>
<ref id="b12-sensors-12-12386"><label>12.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Kang</surname><given-names>D.</given-names></name><name><surname>Choi</surname><given-names>J.</given-names></name><name><surname>Kweon</surname><given-names>I.</given-names></name></person-group><article-title>Finding and Tracking Road Lanes Using Line-snakes</article-title><conf-name>Proceedings of the Intelligent Vehicles Symposium</conf-name><conf-loc>Tokyo, Japan</conf-loc><conf-date>September 1996</conf-date><fpage>189</fpage><lpage>194</lpage></citation></ref>
<ref id="b13-sensors-12-12386"><label>13.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname><given-names>Y.</given-names></name><name><surname>Teoh</surname><given-names>E.</given-names></name><name><surname>Shen</surname><given-names>D.</given-names></name></person-group><article-title>Lane detection and tracking using B-Snake</article-title><source>Image Vis. Comput.</source><year>2004</year><volume>22</volume><fpage>269</fpage><lpage>280</lpage><pub-id pub-id-type="doi">10.1016/j.imavis.2003.10.003</pub-id></citation></ref>
<ref id="b14-sensors-12-12386"><label>14.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Lieb</surname><given-names>D.</given-names></name><name><surname>Lookingbill</surname><given-names>A.</given-names></name><name><surname>Thrun</surname><given-names>S.</given-names></name></person-group><article-title>Adaptive Road Following Using Self-supervised Learning and Reverse Optical Flow</article-title><conf-name>Proceedings of Robotics: Science and Systems</conf-name><conf-loc>Cambridge, MI, USA</conf-loc><conf-date>June 2005</conf-date></citation></ref>
<ref id="b15-sensors-12-12386"><label>15.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Wang</surname><given-names>J.</given-names></name><name><surname>Ji</surname><given-names>Z.</given-names></name><name><surname>Su</surname><given-names>Y.</given-names></name></person-group><article-title>Unstructured road detection using hybrid features</article-title><conf-name>Proceedings of the International Conference on Machine Learning and Cybernetics</conf-name><conf-loc>Baoding, China</conf-loc><conf-date>July 2009</conf-date><comment>Volume 1</comment><fpage>482</fpage><lpage>486</lpage></citation></ref>
<ref id="b16-sensors-12-12386"><label>16.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Foedisch</surname><given-names>M.</given-names></name><name><surname>Takeuchi</surname><given-names>A.</given-names></name></person-group><article-title>Adaptive Road Detection Through Continuous Environment Learning</article-title><conf-name>Proceedings of the 33rd Applied Imagery Pattern Recognition Workshop</conf-name><conf-loc>Washington, DC, USA</conf-loc><conf-date>October 2004</conf-date><fpage>16</fpage><lpage>21</lpage></citation></ref>
<ref id="b17-sensors-12-12386"><label>17.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Zhou</surname><given-names>S.</given-names></name><name><surname>Iagnemma</surname><given-names>K.</given-names></name></person-group><article-title>Self-supervised Learning Method for Unstructured Road Detection Using Fuzzy Support Vector Machines</article-title><conf-name>Proceedings of the IEEE /RSJ International Conference on Intelligent Robots and Systems</conf-name><conf-loc>Taipei, Taiwan</conf-loc><conf-date>October 2010</conf-date></citation></ref>
<ref id="b18-sensors-12-12386"><label>18.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>L.</given-names></name><name><surname>Li</surname><given-names>Q.</given-names></name><name><surname>Mao</surname><given-names>Q.</given-names></name><name><surname>Zou</surname><given-names>Q.</given-names></name></person-group><article-title>Block-constraint Line Scanning Method for Lane Detection</article-title><conf-name>Proceedings of the Intelligent Vehicles Symposium</conf-name><conf-loc>San Diego, CA, USA</conf-loc><conf-date>June 2010</conf-date><fpage>89</fpage><lpage>94</lpage></citation></ref>
<ref id="b19-sensors-12-12386"><label>19.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wijesoma</surname><given-names>W.</given-names></name><name><surname>Kodagoda</surname><given-names>K.</given-names></name><name><surname>Balasuriya</surname><given-names>A.</given-names></name></person-group><article-title>Road-boundary Detection and Tracking Using Ladar Sensing</article-title><source>IEEE Trans. Robot. Autom.</source><year>2004</year><volume>20</volume><fpage>456</fpage><lpage>464</lpage><pub-id pub-id-type="doi">10.1109/TRA.2004.825269</pub-id></citation></ref>
<ref id="b20-sensors-12-12386"><label>20.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>S.</given-names></name><name><surname>Li</surname><given-names>Y.</given-names></name><name><surname>Kwok</surname><given-names>N.</given-names></name></person-group><article-title>Active Vision in Robotic Systems: A Survey of Recent Developments</article-title><source>Int. J. Robot. Res.</source><year>2011</year><volume>30</volume><fpage>1343</fpage><lpage>1377</lpage><pub-id pub-id-type="doi">10.1177/0278364911410755</pub-id></citation></ref>
<ref id="b21-sensors-12-12386"><label>21.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Gavrila</surname><given-names>D.</given-names></name><name><surname>Philomin</surname><given-names>V.</given-names></name></person-group><article-title>Real-Time Object Detection for Smart Vehicles</article-title><conf-name>Proceedings of the Seventh IEEE International Conference on Computer Vision</conf-name><conf-loc>Kerkyra, Greece</conf-loc><conf-date>September 1999</conf-date><comment>Volume 1</comment><fpage>87</fpage><lpage>93</lpage></citation></ref>
<ref id="b22-sensors-12-12386"><label>22.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gavrila</surname><given-names>D.</given-names></name><name><surname>Franke</surname><given-names>U.</given-names></name><name><surname>Wohler</surname><given-names>C.</given-names></name><name><surname>Gorzig</surname><given-names>S.</given-names></name></person-group><article-title>Real Time Vision for Intelligent Vehicles</article-title><source>IEEE Instrum. Meas. Mag.</source><year>2001</year><volume>4</volume><fpage>22</fpage><lpage>27</lpage><pub-id pub-id-type="doi">10.1109/5289.930982</pub-id></citation></ref>
<ref id="b23-sensors-12-12386"><label>23.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Barnes</surname><given-names>N.</given-names></name><name><surname>Zelinsky</surname><given-names>A.</given-names></name></person-group><article-title>Real-time Radial Symmetry for Speed Sign Detection</article-title><conf-name>Proceedings of the Intelligent Vehicles Symposium</conf-name><conf-loc>Parma, Italy</conf-loc><conf-date>June 2004</conf-date><fpage>566</fpage><lpage>571</lpage></citation></ref>
<ref id="b24-sensors-12-12386"><label>24.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Andrey</surname><given-names>V.</given-names></name><name><surname>Jo</surname><given-names>K.</given-names></name></person-group><article-title>Automatic Detection and Recognition of Traffic Signs Using Geometric Structure Analysis</article-title><conf-name>Proceedings of the International Joint Conference on SICE-ICASE</conf-name><conf-loc>Busan, Korea</conf-loc><conf-date>October 2006</conf-date><fpage>1451</fpage><lpage>1456</lpage></citation></ref>
<ref id="b25-sensors-12-12386"><label>25.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname><given-names>H.</given-names></name><name><surname>Ran</surname><given-names>B.</given-names></name></person-group><article-title>Vision-based Stop Sign Detection and Recognition System for Intelligent Vehicles</article-title><source>Transp. Res. Rec.</source><year>2001</year><volume>1748</volume><fpage>161</fpage><lpage>166</lpage><pub-id pub-id-type="doi">10.3141/1748-20</pub-id></citation></ref>
<ref id="b26-sensors-12-12386"><label>26.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de la Escalera</surname><given-names>A.</given-names></name><name><surname>Armingol</surname><given-names>J.</given-names></name><name><surname>Mata</surname><given-names>M.</given-names></name></person-group><article-title>Traffic Sign Recognition and Analysis for Intelligent Vehicles</article-title><source>Image Vis. Comput.</source><year>2003</year><volume>21</volume><fpage>247</fpage><lpage>258</lpage><pub-id pub-id-type="doi">10.1016/S0262-8856(02)00156-7</pub-id></citation></ref>
<ref id="b27-sensors-12-12386"><label>27.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Kehtarnavaz</surname><given-names>N.</given-names></name><name><surname>Ahmad</surname><given-names>A.</given-names></name></person-group><article-title>Traffic Sign Recognition in Noisy Outdoor Scenes</article-title><conf-name>Proceedings of the Intelligent Vehicles 95 Symposium</conf-name><conf-loc>Detroit, MI, USA</conf-loc><conf-date>September 1995</conf-date><fpage>460</fpage><lpage>465</lpage></citation></ref>
<ref id="b28-sensors-12-12386"><label>28.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Siogkas</surname><given-names>G.</given-names></name><name><surname>Dermatas</surname><given-names>E.</given-names></name></person-group><article-title>Detection, Tracking and Classification of Road Signs in Adverse Conditions</article-title><conf-name>Proceedings of the 2006 IEEE Mediterranean Electrotechnical Conference</conf-name><conf-loc>Malaga, Spain</conf-loc><conf-date>May 2006</conf-date><fpage>537</fpage><lpage>540</lpage></citation></ref>
<ref id="b29-sensors-12-12386"><label>29.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de la Escalera</surname><given-names>A.</given-names></name><name><surname>Armingol</surname><given-names>J.</given-names></name><name><surname>Pastor</surname><given-names>J.</given-names></name><name><surname>Rodriguez</surname><given-names>F.</given-names></name></person-group><article-title>Visual Sign Information Extraction and Identification by Deformable Models for Intelligent Vehicles</article-title><source>Intell. Transp. Sys.</source><year>2004</year><volume>5</volume><fpage>57</fpage><lpage>68</lpage><pub-id pub-id-type="doi">10.1109/TITS.2004.828173</pub-id></citation></ref>
<ref id="b30-sensors-12-12386"><label>30.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Fleyeh</surname><given-names>H.</given-names></name></person-group><article-title>Color Detection and Segmentation for Road and Traffic Signs</article-title><conf-name>Proceedings of the Conference on Cybernetics and Intelligent Systems</conf-name><conf-loc>Singapore</conf-loc><conf-date>December 2004</conf-date><comment>Volume 2</comment><fpage>809</fpage><lpage>814</lpage></citation></ref>
<ref id="b31-sensors-12-12386"><label>31.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Soetedjo</surname><given-names>A.</given-names></name><name><surname>Yamada</surname><given-names>K.</given-names></name></person-group><article-title>Fast and Robust Traffic Sign Detection</article-title><conf-name>Proceedings of 2005 IEEE International Conference on Systems, Man and Cybernetics</conf-name><conf-loc>Waikoloa, HI, USA</conf-loc><conf-date>October 2005</conf-date><comment>Volume 2</comment><fpage>1341</fpage><lpage>1346</lpage></citation></ref>
<ref id="b32-sensors-12-12386"><label>32.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Piccioli</surname><given-names>G.</given-names></name><name><surname>De Micheli</surname><given-names>E.</given-names></name><name><surname>Parodi</surname><given-names>P.</given-names></name><name><surname>Campani</surname><given-names>M.</given-names></name></person-group><article-title>Robust Method for Road Sign Detection and Recognition</article-title><source>Imag Vis. Comput.</source><year>1996</year><volume>14</volume><fpage>209</fpage><lpage>223</lpage><pub-id pub-id-type="doi">10.1016/0262-8856(95)01057-2</pub-id></citation></ref>
<ref id="b33-sensors-12-12386"><label>33.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname><given-names>W.</given-names></name><name><surname>Chen</surname><given-names>X.</given-names></name><name><surname>Yang</surname><given-names>J.</given-names></name></person-group><article-title>Detection of Text on Road Signs From Video</article-title><source>IEEE Trans. Intell. Transp.</source><year>2005</year><volume>6</volume><fpage>378</fpage><lpage>390</lpage><pub-id pub-id-type="doi">10.1109/TITS.2005.858619</pub-id></citation></ref>
<ref id="b34-sensors-12-12386"><label>34.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Li</surname><given-names>L.</given-names></name><name><surname>Ma</surname><given-names>G.</given-names></name><name><surname>Ding</surname><given-names>S.</given-names></name></person-group><article-title>Identification of Degraded Traffic Sign Symbols Using Multi-Class Support Vector Machines</article-title><conf-name>Proceedings of the International Conference on Mechatronics and Automation</conf-name><conf-loc>Harbin, China</conf-loc><conf-date>August 2007</conf-date><fpage>2467</fpage><lpage>2471</lpage></citation></ref>
<ref id="b35-sensors-12-12386"><label>35.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Bahlmann</surname><given-names>C.</given-names></name><name><surname>Zhu</surname><given-names>Y.</given-names></name><name><surname>Ramesh</surname><given-names>V.</given-names></name><name><surname>Pellkofer</surname><given-names>M.</given-names></name><name><surname>Koehler</surname><given-names>T.</given-names></name></person-group><article-title>A System for Traffic Sign Detection, Tracking, and Recognition Using Color, Shape, and Motion Information</article-title><conf-name>Proceedings of the Intelligent Vehicles Symposium</conf-name><conf-loc>Las Vegas, NV, USA</conf-loc><conf-date>June 2005</conf-date><fpage>255</fpage><lpage>260</lpage></citation></ref>
<ref id="b36-sensors-12-12386"><label>36.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Keller</surname><given-names>C.</given-names></name><name><surname>Sprunk</surname><given-names>C.</given-names></name><name><surname>Bahlmann</surname><given-names>C.</given-names></name><name><surname>Giebel</surname><given-names>J.</given-names></name><name><surname>Baratoff</surname><given-names>G.</given-names></name></person-group><article-title>Real-Time Recognition of US Speed Signs</article-title><conf-name>Proceedings of the Intelligent Vehicles Symposium</conf-name><conf-loc>Eindhoven, the Netherlands</conf-loc><conf-date>June 2008</conf-date><fpage>518</fpage><lpage>523</lpage></citation></ref>
<ref id="b37-sensors-12-12386"><label>37.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Ishak</surname><given-names>K.</given-names></name><name><surname>Sani</surname><given-names>M.</given-names></name><name><surname>Tahir</surname><given-names>N.</given-names></name></person-group><article-title>A Speed Limit Sign Recognition System Using Artificial Neural Network</article-title><conf-name>Prceedings of the Conference on Research and Development</conf-name><conf-loc>Selangor, Malaysia</conf-loc><conf-date>June 2006</conf-date><fpage>127</fpage><lpage>131</lpage></citation></ref>
<ref id="b38-sensors-12-12386"><label>38.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Nguwi</surname><given-names>Y.</given-names></name><name><surname>Kouzani</surname><given-names>A.</given-names></name></person-group><article-title>Automatic Road Sign Recognition Using Neural Networks</article-title><conf-name>Proceedings of the 2006 International Joint Conference on Neural Networks</conf-name><conf-loc>Vancouver, BC, Canada</conf-loc><year>2006</year><fpage>3955</fpage><lpage>3962</lpage></citation></ref>
<ref id="b39-sensors-12-12386"><label>39.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Ach</surname><given-names>R.</given-names></name><name><surname>Luth</surname><given-names>N.</given-names></name><name><surname>Schinner</surname><given-names>T.</given-names></name><name><surname>Techmer</surname><given-names>A.</given-names></name><name><surname>Walther</surname><given-names>S.</given-names></name></person-group><article-title>Classification of Traffic Signs in Real-Time on a Multi-Core Processor</article-title><conf-name>Proceedings of the Intelligent Vehicles Symposium</conf-name><conf-loc>Eindhoven, the Netherlands</conf-loc><conf-date>June 2008</conf-date><fpage>313</fpage><lpage>318</lpage></citation></ref>
<ref id="b40-sensors-12-12386"><label>40.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Maldonado-Bascon</surname><given-names>S.</given-names></name><name><surname>Lafuente-Arroyo</surname><given-names>S.</given-names></name><name><surname>Siegmann</surname><given-names>P.</given-names></name><name><surname>Gomez-Moreno</surname><given-names>H.</given-names></name><name><surname>Acevedo-Rodriguez</surname><given-names>F.</given-names></name></person-group><article-title>Traffic Sign Recognition System for Inventory Purposes</article-title><conf-name>Proceedings of the Intelligent Vehicles Symposium</conf-name><conf-loc>Eindhoven, the Netherlands</conf-loc><conf-date>June 2008</conf-date><fpage>590</fpage><lpage>595</lpage></citation></ref>
<ref id="b41-sensors-12-12386"><label>41.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Kouzani</surname><given-names>A.</given-names></name></person-group><article-title>Road-Sign Identification Using Ensemble Learning</article-title><conf-name>Proceedings of the Intelligent Vehicles Symposium</conf-name><conf-loc>Xi'an, China</conf-loc><conf-date>June 2007</conf-date><fpage>438</fpage><lpage>443</lpage></citation></ref>
<ref id="b42-sensors-12-12386"><label>42.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Viola</surname><given-names>P.</given-names></name><name><surname>Jones</surname><given-names>M.</given-names></name></person-group><article-title>Robust Real-time Object Detection</article-title><source>Int. J. Comput. Vis.</source><year>2001</year><volume>57</volume><fpage>137</fpage><lpage>154</lpage></citation></ref>
<ref id="b43-sensors-12-12386"><label>43.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Urmson</surname><given-names>C.</given-names></name><name><surname>Anhalt</surname><given-names>J.</given-names></name><name><surname>Bagnell</surname><given-names>D.</given-names></name><name><surname>Baker</surname><given-names>C.</given-names></name><name><surname>Bittner</surname><given-names>R.</given-names></name><name><surname>Clark</surname><given-names>M.</given-names></name><name><surname>Dolan</surname><given-names>J.</given-names></name><name><surname>Duggins</surname><given-names>D.</given-names></name><name><surname>Galatali</surname><given-names>T.</given-names></name><name><surname>Geyer</surname><given-names>C.</given-names></name><etal/></person-group><article-title>Autonomous Driving in Urban Environments: Boss and the Urban Challenge</article-title><source>J. Field Robot.</source><year>2008</year><volume>25</volume><fpage>425</fpage><lpage>466</lpage><pub-id pub-id-type="doi">10.1002/rob.20255</pub-id></citation></ref>
<ref id="b44-sensors-12-12386"><label>44.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thrun</surname><given-names>S.</given-names></name><name><surname>Montemerlo</surname><given-names>M.</given-names></name><name><surname>Dahlkamp</surname><given-names>H.</given-names></name><name><surname>Stavens</surname><given-names>D.</given-names></name><name><surname>Aron</surname><given-names>A.</given-names></name><name><surname>Diebel</surname><given-names>J.</given-names></name><name><surname>Fong</surname><given-names>P.</given-names></name><name><surname>Gale</surname><given-names>J.</given-names></name><name><surname>Halpenny</surname><given-names>M.</given-names></name><name><surname>Hoffmann</surname><given-names>G.</given-names></name><etal/></person-group><article-title>Stanley: The Robot that Won the DARPA Grand Challenge</article-title><source>J. Robot. Sys.</source><year>2007</year><fpage>1</fpage><lpage>43</lpage></citation></ref>
<ref id="b45-sensors-12-12386"><label>45.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bradski</surname><given-names>G.</given-names></name><name><surname>Kaehler</surname><given-names>A.</given-names></name></person-group><source>Learning OpenCV: Computer vision with the OpenCV library</source><publisher-name>O'Reilly Media</publisher-name><year>2008</year></citation></ref>
<ref id="b46-sensors-12-12386"><label>46.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>Z.</given-names></name></person-group><article-title>Flexible Camera Calibration by Viewing a Plane from Unknown Orientations</article-title><conf-name>Proceedings of the Seventh IEEE International Conference on Computer Vision</conf-name><conf-loc>Kerkyra, Greece</conf-loc><conf-date>September 1999</conf-date><comment>Volume 1</comment><fpage>666</fpage><lpage>673</lpage></citation></ref>
<ref id="b47-sensors-12-12386"><label>47.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Viola</surname><given-names>P.</given-names></name><name><surname>Jones</surname><given-names>M.</given-names></name></person-group><article-title>Robust Real-Time Face Detection</article-title><source>Int. J. Comput. Vis.</source><year>2004</year><volume>57</volume><fpage>137</fpage><lpage>154</lpage><pub-id pub-id-type="doi">10.1023/B:VISI.0000013087.49260.fb</pub-id></citation></ref>
<ref id="b48-sensors-12-12386"><label>48.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Bay</surname><given-names>H.</given-names></name><name><surname>Tuytelaars</surname><given-names>T.</given-names></name><name><surname>Van Gool</surname><given-names>L.</given-names></name></person-group><article-title>SURf: Speeded Up Robust Ffeatures</article-title><conf-name>Proceedings of the 9th European Conference on Computer Vision</conf-name><conf-loc>Graz</conf-loc><conf-date>Austria, May 2006</conf-date><fpage>404</fpage><lpage>417</lpage></citation></ref>
<ref id="b49-sensors-12-12386"><label>49.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Fasel</surname><given-names>B.</given-names></name><name><surname>Van Gool</surname><given-names>L.</given-names></name></person-group><article-title>Interactive Museum Guide: Accurate Retrieval of Object Descriptions</article-title><conf-name>Proceedings of the 4th International Conference on Adaptive Multimedia Retrieval: User, Context, and Feedback</conf-name><conf-loc>Geneva</conf-loc><conf-date>Switzerland, July 2006</conf-date><fpage>179</fpage><lpage>191</lpage></citation></ref>
<ref id="b50-sensors-12-12386"><label>50.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arya</surname><given-names>S.</given-names></name><name><surname>Mount</surname><given-names>D.</given-names></name><name><surname>Netanyahu</surname><given-names>N.</given-names></name><name><surname>Silverman</surname><given-names>R.</given-names></name><name><surname>Wu</surname><given-names>A.</given-names></name></person-group><article-title>An Optimal Algorithm for Approximate Nearest Neighbor Searching Fixed Dimensions</article-title><source>J. ACM (JACM)</source><year>1998</year><volume>45</volume><fpage>891</fpage><lpage>923</lpage><pub-id pub-id-type="doi">10.1145/293347.293348</pub-id></citation></ref>
<ref id="b51-sensors-12-12386"><label>51.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>L.</given-names></name><name><surname>Li</surname><given-names>Q.</given-names></name><name><surname>Li</surname><given-names>M.</given-names></name><name><surname>Mao</surname><given-names>Q.</given-names></name></person-group><article-title>Traffic Sign Detection and Recognition for Intelligent Vehicle</article-title><conf-name>Proceedings of the Intelligent Vehicles Symposium</conf-name><conf-loc>Baden-Banden</conf-loc><conf-date>Germany, June 2011</conf-date><fpage>908</fpage><lpage>913</lpage></citation></ref></ref-list>
<sec sec-type="display-objects">
<title>Figures and Tables</title>
<fig id="f1-sensors-12-12386" position="float">
<label>Figure 1.</label>
<caption>
<p>(<bold>a</bold>) At approximately 3:04 pm on Oct 18, 2010, SmartV-II was the first robot to complete the Future Challenge; (<bold>b</bold>) Autonomous Vehicle SmartV-II, developed by Wuhan University.</p></caption>
<graphic xlink:href="sensors-12-12386f1.gif"/></fig>
<fig id="f2-sensors-12-12386" position="float">
<label>Figure 2.</label>
<caption>
<p>Lasers and cameras layout.</p></caption>
<graphic xlink:href="sensors-12-12386f2.gif"/></fig>
<fig id="f3-sensors-12-12386" position="float">
<label>Figure 3.</label>
<caption>
<p>Road curb fitting.</p></caption>
<graphic xlink:href="sensors-12-12386f3.gif"/></fig>
<fig id="f4-sensors-12-12386" position="float">
<label>Figure 4.</label>
<caption>
<p>Flowchart of two scans based lane detection method.</p></caption>
<graphic xlink:href="sensors-12-12386f4.gif"/></fig>
<fig id="f5-sensors-12-12386" position="float">
<label>Figure 5.</label>
<caption>
<p>Imaging Model. (<bold>a</bold>) The <italic>W</italic> space. (<bold>b</bold>) The <italic>xy</italic> plane in the <italic>W</italic> space. (<bold>c</bold>) The <italic>yz</italic> plane in the <italic>W</italic> space.</p></caption>
<graphic xlink:href="sensors-12-12386f5.gif"/></fig>
<fig id="f6-sensors-12-12386" position="float">
<label>Figure 6.</label>
<caption>
<p>Step by step results by proposed lane detection. (<bold>a</bold>) Original Image. (<bold>b</bold>) Image after open operation. (<bold>c</bold>) Image after dilate operation. (<bold>d</bold>) Image after top-hat transform. (<bold>e</bold>) Threshold. (<bold>f</bold>) First scan. (<bold>g</bold>) RANSAC. (<bold>h</bold>) Second scan. (<bold>i</bold>) Gradient contribution function after first scan. (<bold>j</bold>) RANSAC. (<bold>k</bold>) Gradient contribution function after second scan.</p></caption>
<graphic xlink:href="sensors-12-12386f6.gif"/></fig>
<fig id="f7-sensors-12-12386" position="float">
<label>Figure 7.</label>
<caption>
<p>Flow chart of traffic signs recognition system.</p></caption>
<graphic xlink:href="sensors-12-12386f7.gif"/></fig>
<fig id="f8-sensors-12-12386" position="float">
<label>Figure 8.</label>
<caption>
<p>Traffic Signs Classes.</p></caption>
<graphic xlink:href="sensors-12-12386f8.gif"/></fig>
<fig id="f9-sensors-12-12386" position="float">
<label>Figure 9.</label>
<caption>
<p>Color quantization and ROI locking. (<bold>a</bold>) original image, (<bold>b</bold>) ROI locking, (<bold>c</bold>) red segmentation, (<bold>d</bold>) blue segmentation, (<bold>e</bold>) yellow segmentation.</p></caption>
<graphic xlink:href="sensors-12-12386f9.gif"/></fig>
<fig id="f10-sensors-12-12386" position="float">
<label>Figure 10.</label>
<caption>
<p>SURF feature matching. The number of match points is 16, 11, 24, 7, 12, 7 according to priority.</p></caption>
<graphic xlink:href="sensors-12-12386f10.gif"/></fig>
<fig id="f11-sensors-12-12386" position="float">
<label>Figure 11.</label>
<caption>
<p>Some results of the road curb detection.</p></caption>
<graphic xlink:href="sensors-12-12386f11.gif"/></fig>
<fig id="f12-sensors-12-12386" position="float">
<label>Figure 12.</label>
<caption>
<p>Some examples of lane detection results.</p></caption>
<graphic xlink:href="sensors-12-12386f12.gif"/></fig>
<fig id="f13-sensors-12-12386" position="float">
<label>Figure 13.</label>
<caption>
<p>Detection of traffic signs under various conditions.</p></caption>
<graphic xlink:href="sensors-12-12386f13.gif"/></fig>
<table-wrap id="t1-sensors-12-12386" position="float">
<label>Table 1.</label>
<caption>
<p>Color quantization.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="top"/>
<th align="center" valign="top">Red</th>
<th align="center" valign="top">Blue</th>
<th align="center" valign="top">Yellow</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">Saturation</td>
<td align="center" valign="top"><italic>S</italic> &gt; 0.2</td>
<td align="center" valign="top"><italic>S</italic> &gt; 0.2</td>
<td align="center" valign="top"><italic>S</italic> &gt; 0.2</td></tr>
<tr>
<td align="center" valign="top">Hue</td>
<td align="center" valign="top">0 &lt; <italic>H</italic> &lt; 10320 &lt; <italic>H</italic> &lt; 360</td>
<td align="center" valign="top">200 &lt; <italic>H</italic> &lt; 270</td>
<td align="center" valign="top">20 &lt; <italic>H</italic> &lt; 100</td></tr></tbody></table></table-wrap>
<table-wrap id="t2-sensors-12-12386" position="float">
<label>Table 2.</label>
<caption>
<p>The number of PS and NS for the six trained classifiers.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="top"/>
<th align="center" valign="top">C1</th>
<th align="center" valign="top">C2</th>
<th align="center" valign="top">C3</th>
<th align="center" valign="top">C4</th>
<th align="center" valign="top">C5</th>
<th align="center" valign="top">C6</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">PS</td>
<td align="center" valign="top">3,125</td>
<td align="center" valign="top">1,276</td>
<td align="center" valign="top">794</td>
<td align="center" valign="top">648</td>
<td align="center" valign="top">963</td>
<td align="center" valign="top">346</td></tr>
<tr>
<td align="center" valign="top">NS</td>
<td align="center" valign="top">5,200</td>
<td align="center" valign="top">2,300</td>
<td align="center" valign="top">16,00</td>
<td align="center" valign="top">1,600</td>
<td align="center" valign="top">1,600</td>
<td align="center" valign="top">1,000</td></tr></tbody></table></table-wrap></sec></back></article>
