<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Sensors</journal-id>
<journal-title>Sensors</journal-title>
<issn pub-type="epub">1424-8220</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/sl21013333</article-id>
<article-id pub-id-type="publisher-id">sensors-12-13333</article-id>
<article-categories>
<subj-group>
<subject>Article</subject></subj-group></article-categories>
<title-group>
<article-title>Evaluation of Intrinsic Image Algorithms to Detect the Shadows Cast by Static Objects Outdoors</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Isaza</surname><given-names>Cesar</given-names></name><xref ref-type="aff" rid="af1-sensors-12-13333"><sup>1</sup></xref><xref ref-type="corresp" rid="c1-sensors-12-13333"><sup>*</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Salas</surname><given-names>Joaquín</given-names></name><xref ref-type="aff" rid="af1-sensors-12-13333"><sup>l</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Raducanu</surname><given-names>Bogdan</given-names></name><xref ref-type="aff" rid="af2-sensors-12-13333"><sup>2</sup></xref></contrib></contrib-group>
<aff id="af1-sensors-12-13333">
<label>1</label> CICATA Qro., Instituto Politécnico Nacional, Cerro Blanco 141, Col. Colinas del Cimatario, Santiago de Queretaro, C.P. 76090, Mexico; E-Mail: <email>jsalasr@ipn.mx</email></aff>
<aff id="af2-sensors-12-13333">
<label>2</label> Computer Vision Center, UAB Campus, Bellaterra 08193, Barcelona, Spain; E-Mail: <email>bogdan@cvc.uab.es</email></aff>
<author-notes>
<corresp id="c1-sensors-12-13333">
<label>*</label> Author to whom correspondence should be addressed; E-Mail: <email>cisazab0900@ipn.mx</email>; Tel.: +55-5729-6000 (ext. 81047); Fax. +55-5729-6000 (ext. 47127).</corresp></author-notes>
<pub-date pub-type="collection">
<year>2012</year></pub-date>
<pub-date pub-type="epub">
<day>01</day>
<month>10</month>
<year>2012</year></pub-date>
<volume>12</volume>
<issue>10</issue>
<fpage>13333</fpage>
<lpage>13348</lpage>
<history>
<date date-type="received">
<day>29</day>
<month>07</month>
<year>2012</year></date>
<date date-type="rev-recd">
<day>07</day>
<month>09</month>
<year>2012</year></date>
<date date-type="accepted">
<day>17</day>
<month>09</month>
<year>2012</year></date></history>
<permissions>
<copyright-statement>© 2012 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
<copyright-year>2012</copyright-year>
<license>
<p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>In some automatic scene analysis applications, the presence of shadows becomes a nuisance that is necessary to deal with. As a consequence, a preliminary stage in many computer vision algorithms is to attenuate their effect. In this paper, we focus our attention on the detection of shadows cast by static objects outdoors, as the scene is viewed for extended periods of time (days, weeks) from a fixed camera and considering daylight intervals where the main source of light is the sun. In this context, we report two contributions. First, we introduce the use of synthetic images for which ground truth can be generated automatically, avoiding the tedious effort of manual annotation. Secondly, we report a novel application of the intrinsic image concept to the automatic detection of shadows cast by static objects in outdoors. We make both a quantitative and a qualitative evaluation of several algorithms based on this image representation. For the quantitative evaluation, we used the synthetic data set, while for the qualitative evaluation we used both data sets. Our experimental results show that the evaluated methods can partially solve the problem of shadow detection.</p></abstract>
<kwd-group>
<kwd>video sequences</kwd>
<kwd>shadow detection</kwd>
<kwd>intrinsic images</kwd>
<kwd>illumination component</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>A shadow is the result of an opaque object obstructing light which otherwise would directly illuminate a surface. Shadows are present in almost every computer vision application, where they may give rise to undesired effects in methods including segmentation [<xref ref-type="bibr" rid="b1-sensors-12-13333">1</xref>,<xref ref-type="bibr" rid="b2-sensors-12-13333">2</xref>], recognition [<xref ref-type="bibr" rid="b3-sensors-12-13333">3</xref>], and tracking [<xref ref-type="bibr" rid="b4-sensors-12-13333">4</xref>–<xref ref-type="bibr" rid="b6-sensors-12-13333">6</xref>]. The main problem is that their existence may alter our interpretation of the scene, making our models drift—sometimes up to the point of failure. Consequently, it is desirable to detect them and to attenuate as much as possible their negative effects [<xref ref-type="bibr" rid="b7-sensors-12-13333">7</xref>,<xref ref-type="bibr" rid="b8-sensors-12-13333">8</xref>]. However, in some situations their presence is attractive as they may help to obtain 3D information for scene reconstruction [<xref ref-type="bibr" rid="b9-sensors-12-13333">9</xref>,<xref ref-type="bibr" rid="b10-sensors-12-13333">10</xref>], for instance.</p>
<p>The general problem of shadow detection can be classified depending on whether the objects casting the shadow are static [<xref ref-type="bibr" rid="b11-sensors-12-13333">11</xref>,<xref ref-type="bibr" rid="b12-sensors-12-13333">12</xref>] or moving [<xref ref-type="bibr" rid="b13-sensors-12-13333">13</xref>,<xref ref-type="bibr" rid="b14-sensors-12-13333">14</xref>]. However, a very important factor when considering this categorization is the scale of time. For instance, in outdoors the shadows cast by objects such as buildings, lamp posts, and trees during daylight can be interpreted as static if we consider in our analysis a temporal window of a few seconds. In this case, no significant changes will be perceived in the scene and existing techniques for moving cast shadow detection [<xref ref-type="bibr" rid="b15-sensors-12-13333">15</xref>] cannot be applied. On the other hand, if we consider a temporal window of a few hours, the same shadow could be interpreted as a moving object. Although the problem of detecting moving shadows has been extensively studied [<xref ref-type="bibr" rid="b13-sensors-12-13333">13</xref>,<xref ref-type="bibr" rid="b16-sensors-12-13333">16</xref>], the problem of detecting static shadows outdoors, over long periods of time, such as days, has received little attention [<xref ref-type="bibr" rid="b17-sensors-12-13333">17</xref>].</p>
<p>Our paper presents two main contributions. First, we introduce the use of synthetic images for which ground truth can be generated automatically, avoiding the tedious effort of manual annotation. In the process, we generated a custom database based on two image data sets (these data sets are publicly available for download at <ext-link xlink:href="http://imagenes.cicataqro.ipn.mx/shadows/" ext-link-type="uri">http://imagenes.cicataqro.ipn.mx/shadows/</ext-link>), one real and one synthetic. The real data set was acquired outdoors during several days using a fixed camera, which was overlooking a quiet area without moving objects. The synthetic data set was created using a rendering software. Secondly, we perform a quantitative and a qualitative evaluation of several algorithms for shadow detection based on the intrinsic image (the concept of intrinsic images was introduced by Barrow and Tenenbaum [<xref ref-type="bibr" rid="b18-sensors-12-13333">18</xref>] as a way to describe an image in terms of characteristics such as range, orientation, reflectance, color, texture, and incident light) representation [<xref ref-type="bibr" rid="b19-sensors-12-13333">19</xref>–<xref ref-type="bibr" rid="b22-sensors-12-13333">22</xref>], all of which uses the reflectance component. The quantitative evaluation was done for the synthetic data set, while the qualitative one was done with both data sets.</p>
<p>The rest of the paper is organized as follows. In Section 2, we present a survey of the existing research literature about shadow detection algorithms. Then in Section 3, we describe the synthetic and real data sets used in our evaluation. In Section 4, we recall the definition of intrinsic images and then present the algorithms used in our evaluation study. In Section 5, we present quantitative and qualitative results to compare the methods. Finally, Section 6 contains our conclusion and our ideas for future work.</p></sec>
<sec>
<label>2.</label>
<title>Related Work</title>
<p>In this section, we summarize work in shadow detection. For a clear presentation, we distinguish between static and moving shadows.</p>
<sec>
<label>2.1.</label>
<title>Static Shadow Detection</title>
<p>Static shadows give us clues about the scene represented in an image or a video sequence, including the shape, the objects, and the relative position of the light sources [<xref ref-type="bibr" rid="b23-sensors-12-13333">23</xref>]. Additionally, these shadows seem to modify the perceived shape and color of the objects [<xref ref-type="bibr" rid="b24-sensors-12-13333">24</xref>]. Some methods have exploited the color properties of objects as they are affected by shadows. For instance, Nagao <italic>et al.</italic> [<xref ref-type="bibr" rid="b11-sensors-12-13333">11</xref>] and Scanlan [<xref ref-type="bibr" rid="b25-sensors-12-13333">25</xref>] used histogram analysis to detect shadows. On the other hand, Jiang and Ward [<xref ref-type="bibr" rid="b26-sensors-12-13333">26</xref>] reported a method for identifying and classifying shadows in real images based on the constraints that shadows possess in both intensity and geometry. Suzuki <italic>et al.</italic> [<xref ref-type="bibr" rid="b27-sensors-12-13333">27</xref>] proposed a compensation method to remove shadows in aerial images transforming the red, green, and blue (RGB) values of the original image into hue, saturation and intensity (HSI) values. Some other researchers have exploited the relationship between the different homogeneous color regions in an image. This allows them to obtain first the edges of a shadow and then extract the shadow area. In addition, several color spaces have been explored in order to detect shadows, e.g., normalized red, green, and blue (rgb), hue (H) and saturation (S), <italic>c1c2c3</italic> and <italic>l1l2l3</italic> [<xref ref-type="bibr" rid="b28-sensors-12-13333">28</xref>]. Finlayson <italic>et al.</italic> [<xref ref-type="bibr" rid="b29-sensors-12-13333">29</xref>] proposed a method to process a 3-band color image to remove the shadows based on edge information. Moreover, Gevers and Stokman [<xref ref-type="bibr" rid="b30-sensors-12-13333">30</xref>] also used color information to classify edges based on whether the transition between regions is due to shadows, abrupt surface orientation changes, illumination, or material changes. Levine and Bhattacharyya [<xref ref-type="bibr" rid="b31-sensors-12-13333">31</xref>] developed a strategy that does not require camera calibration or other a priori information regarding the scene.</p></sec>
<sec>
<label>2.2.</label>
<title>Moving Shadow Detection</title>
<p>Unlike static shadows, the moving ones are associated with dynamic objects. In many situations, moving objects and their shadows are detected as one single region. The above effect may require a stage to separate the object from its cast shadows, like the one described in the method proposed by Sonoda and Ogata [<xref ref-type="bibr" rid="b32-sensors-12-13333">32</xref>], which is based on projective geometry. In addition, other authors proposed different strategies to detect moving shadows based on the use of diverse color models. For example, Horprasert <italic>et al.</italic> [<xref ref-type="bibr" rid="b33-sensors-12-13333">33</xref>] used a brightness and chromaticity color model. Moreover, Mikic <italic>et al.</italic> [<xref ref-type="bibr" rid="b34-sensors-12-13333">34</xref>] used a method that combines different color spaces. The authors considered three features at each pixel: brightness, normalized red and normalized blue. In this method, each feature is analyzed by a posterior probability estimator that computes probabilities for three classes: background, foreground and shadows. Nadimi and Bhanu [<xref ref-type="bibr" rid="b35-sensors-12-13333">35</xref>] proposed an algorithm to detect moving shadows in outdoor environments based on a spatio-temporal albedo test and a dichromatic reflection model. In addition to the color spaces, some shadow features such as transparency (a shadow always darkens the region upon which it falls) and homogeneity (the relationship between pixels under shadows is roughly linear) have been used to detect moving shadows in outdoor traffic scenes [<xref ref-type="bibr" rid="b36-sensors-12-13333">36</xref>]. However, this method considers a linear relationship between shadow and non-shadow regions and only gray-scale images are processed. On the contrary, Cucchiara <italic>et al.</italic> [<xref ref-type="bibr" rid="b37-sensors-12-13333">37</xref>] developed a method for segmenting moving objects without their shadows by using color information. This algorithm transformed the original input image from RGB color space to hue, saturation, and value (HSV). Recently, Sanin <italic>et al.</italic> [<xref ref-type="bibr" rid="b15-sensors-12-13333">15</xref>] presented a survey of several techniques for moving cast shadow detection. It is important to understand that the strategies these authors compared cannot be applied to the problem that we describe, because these techniques first detect changes in the scene (moving objects) and then classify the detected pixels as foreground (object) or shadow. Under the above assumption, the shadows cast by static objects will be part of the background; or, if a large enough interval of time is selected to perceive changes in the shape and position of the shadows cast by static objects, the moving shadow detection strategies will detect only the parts of the shadow that change, which is typically the region around the boundaries.</p></sec></sec>
<sec sec-type="methods">
<label>3.</label>
<title>Data Sets</title>
<p>For algorithm comparison, we used two data sets, one synthetic and one real (see <xref ref-type="fig" rid="f1-sensors-12-13333">Figure 1</xref>). The synthetic data set consists of two sequences and has been used for quantitative evaluation; meanwhile for the qualitative evaluation, we used real and synthetic sequences. To build the data sets, we considered only daytime, when the sun is by far the main source of light. It is important to mention that in scenarios lighted with other sources, such as fluorescent or street lights, the shadows cast by static objects will be always static and perhaps only changes in intensity will be perceived. In the next subsections we present details regarding the generation of these data sets.</p>
<sec>
<label>3.1.</label>
<title>Synthetic Data Set</title>
<p>A serious limitation in the systematic evaluation of algorithms to detect shadows cast by static objects during extended periods of time is the lack of a standard data set with annotated ground truth. Based on this fact, we used two synthetic sequences that simulate the changes in the sun's position over a long period of time (days) for a particular geographical position. The advantage of using synthetic images is that the ground truth is automatically generated.</p>
<p>The synthetic sequences were rendered using the POV-Ray software [<xref ref-type="bibr" rid="b38-sensors-12-13333">38</xref>]. We designed the first synthetic sequence in accordance to the one introduced by Masushita <italic>et al.</italic> in [<xref ref-type="bibr" rid="b20-sensors-12-13333">20</xref>], to analyze the problem of the computation of intrinsic images. This data set has 20 frames with a resolution of 512 × 384 pixels (width × height). It represents white lines on a road surface (see first, second and third columns in <xref ref-type="fig" rid="f2-sensors-12-13333">Figure 2</xref>). The shadow effect was created using a rectangular object out of the field of view of the camera, changing the light source positions on the horizontal axis. The second synthetic sequence simulates the sun position over 12 consecutive days. We generated 40 frames per day with a resolution of 512 × 384 pixels (width × height). We designed this data set considering objects that exist in a typical outdoor location. We included in our scenario static objects such as buildings, a road, trees, and bushes, among others (see fourth, fifth and sixth columns in <xref ref-type="fig" rid="f2-sensors-12-13333">Figure 2</xref>).</p>
<p>In the synthetic sequences, the shadow effect was created using the following procedures. First, we computed the sun position in a given geographical location. We used an off-the-shelf Geographical Positioning System (GPS) to obtain the latitude, longitude, and elevation of a camera installed in the real outdoors scenario. Then, using the algorithm presented by Reda and Andreas [<xref ref-type="bibr" rid="b41-sensors-12-13333">41</xref>] the sun position (elevation and azimuth angles at a given location) as a function of the local time and position of the observer was computed from dawn to dusk during twelve consecutive days. After that, the values of the elevation and azimuth angles were transformed to Cartesian coordinates for the rendering software to simulate the changes in the sun position.</p></sec>
<sec>
<label>3.2.</label>
<title>Real Data Set</title>
<p>In addition to the synthetic sequences, we recorded a real data set. For this purpose, we positioned a fixed camera on the roof of a building and took images from dawn to dusk during seven consecutive days (see seventh, eighth, and ninth columns in <xref ref-type="fig" rid="f2-sensors-12-13333">Figure 2</xref>). The camera was fixed to capture a motionless area, to facilitate the analysis of shadows cast by static objects during this long time interval. Each image has a resolution of 1,032 × 776 pixels (width × height). We selected the location, based on the challenge that it represents for the detection of shadows cast by static objects. In the scenario, there are regions that are shaded during all daylight and others with huge shadows. Additionally, some shadows have significant changes in the intensity, up to the point that it is difficult to define the boundaries. This data set also has isolated shadows cast by trees and shadows mixed that are cast by several objects. Another important feature of the data set is that only small fast traveling clouds appear in the sequence, resulting in the presence of shadows during all daylight.</p></sec></sec>
<sec>
<label>4.</label>
<title>Evaluating Algorithms</title>
<p>Our evaluation considers the algorithms to derive intrinsic images introduced by Weiss [<xref ref-type="bibr" rid="b19-sensors-12-13333">19</xref>], Matsushita <italic>et al.</italic> [<xref ref-type="bibr" rid="b20-sensors-12-13333">20</xref>], Land and McCann [<xref ref-type="bibr" rid="b39-sensors-12-13333">39</xref>], and Blake [<xref ref-type="bibr" rid="b21-sensors-12-13333">21</xref>]. Because all of the algorithms are based on the concept of intrinsic images, we review it first. Then, we describe in some detail each of the algorithms evaluated. Lastly, because the algorithms were primarily created to compute intrinsic images, we illustrate how they could in principle be used to detect shadows.</p>
<sec>
<label>4.1.</label>
<title>Intrinsic Images</title>
<p>The concept of intrinsic images was introduced by Barrow and Tenenbaum [<xref ref-type="bibr" rid="b18-sensors-12-13333">18</xref>]. It describes an image decomposition in terms of characteristics such as range, orientation, reflectance, and incident illumination. One of the simplest models is described by the product <italic>I</italic>(<bold>x</bold>, <italic>t</italic>) = <italic>R</italic>(<bold>x</bold>, <italic>t</italic>)<italic>L</italic>(<bold>x</bold>, <italic>t</italic>), where <italic>I</italic>(<bold>x</bold>, <italic>t</italic>) is an image, <bold>x</bold> is a pixel index, and <italic>t</italic> represents the frame index respect to time. The reflectance image <italic>R</italic>(<bold>x</bold>, <italic>t</italic>) represents the properties of the object to reflect light in the direction of the pixel <bold>x</bold>. The illumination <italic>L</italic>(<bold>x</bold>, <italic>t</italic>) describes the distribution of the incident light and accounts for some of the shading effects and shadows. Deriving this decomposition is a fundamentally ill-posed problem [<xref ref-type="bibr" rid="b42-sensors-12-13333">42</xref>]. Weiss shows that this problem can be solved if one considers the reflectance to be constant while the illumination varies [<xref ref-type="bibr" rid="b19-sensors-12-13333">19</xref>]. Matsushita <italic>et al.</italic> [<xref ref-type="bibr" rid="b20-sensors-12-13333">20</xref>] extended Weiss' method to handle scenes where the Lambertian assumption does not hold. They consider that both the illumination and reflectance may change. In their method, they analyze the magnitude of the gradient in the reflectance component. If the magnitude in a given position is larger than a given threshold, the illumination values at that position are removed and added to the reflectance image. As a result, for each input image two others are obtained, one for the reflectance and another for the illumination. Although with this method a reflectance component is obtained representing the texture in the scene, some texture appears in the illumination image even for different values of the threshold. Another strategy to derive intrinsic images was introduced by Land and McCann [<xref ref-type="bibr" rid="b39-sensors-12-13333">39</xref>]. They proposed the Retinex theory, which expresses that the reflectance component can be separated from the illumination if the later is assumed to vary slowly. An extension which uses color have been introduced by Finlayson [<xref ref-type="bibr" rid="b8-sensors-12-13333">8</xref>,<xref ref-type="bibr" rid="b40-sensors-12-13333">40</xref>]. In a different approach, a learning-based method to separate reflectance and illuminate was proposed by Tappen <italic>et al.</italic> [<xref ref-type="bibr" rid="b42-sensors-12-13333">42</xref>]. They successfully separated the reflectance and the illumination components for a light source in a synthetic data set. In 2009, Grosse <italic>et al.</italic> [<xref ref-type="bibr" rid="b22-sensors-12-13333">22</xref>] introduced an intrinsic image model with three terms: reflectance, illumination, and specularity <italic>C</italic>(<bold>x</bold>, <italic>t</italic>). All together, this decomposition is expressed as: <italic>I</italic>(<bold>x</bold>, <italic>t</italic>) = <italic>R</italic>(<bold>x</bold>, <italic>t</italic>)<italic>L</italic>(<bold>x</bold>, <italic>t</italic>) + <italic>C</italic>(<bold>x</bold>, <italic>t</italic>). Nonetheless, in this model, the problem of factoring the information between reflectance and illumination remains.</p></sec>
<sec>
<label>4.2.</label>
<title>Weiss' Algorithm</title>
<p>Weiss [<xref ref-type="bibr" rid="b19-sensors-12-13333">19</xref>] proposed that intrinsic components can be derived by using a sequence of images without motion acquired with a fixed camera in an outdoor scene under varying illumination conditions. This method uses the statistics of natural images [<xref ref-type="bibr" rid="b43-sensors-12-13333">43</xref>] and assumes that illumination images will give rise to sparse filter outputs. Then, the scene reflectance image is obtained by taking the median of the filtered image sequence in the log domain. Additionally, the method assumes that the scene is Lambertian and the fact that illumination images have less contrast than reflectance images.</p>
<p>Weiss uses the following equation to derive intrinsic images:
<disp-formula id="FD1">
<label>(1)</label>
<mml:math id="mm1" display="block">
<mml:semantics id="sm1">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>}</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>T</mml:mi></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mi>R</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mi>L</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>}</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>T</mml:mi></mml:msubsup></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>For convenience, Weiss worked in the log domain. In what follows, we will represent variables in the log domain using lower-case letters, e.g., <italic>i</italic>(<bold>x</bold>, <italic>t</italic>) to represent the logarithm of <italic>I</italic>(x, <italic>t</italic>). According to Weiss' method, a reflectance edge image is computed by taking the median along the time axis of the convolution between the derivative filter and a given image:
<disp-formula id="FD2">
<label>(2)</label>
<mml:math id="mm2" display="block">
<mml:semantics id="sm2">
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mtext mathvariant="italic">median</mml:mtext>
<mml:mo stretchy="false">[</mml:mo>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>m</mml:mi></mml:msub>
<mml:mo>∗</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">]</mml:mo></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>r</italic>(<bold>x</bold>) is the constant edge reflectance image and <italic>f<sub>m</sub></italic> represents the derivative niters <italic>f</italic><sub>0</sub> = [0, 1, −1] or <italic>f</italic><sub>1</sub> = [0, 1, −1]<italic><sup>T</sup></italic>. Then, the illumination edge images <italic>l</italic>(<bold>x</bold>, <italic>t</italic>) are computed subtracting the edge maps of the input and the reflectance images:
<disp-formula id="FD3">
<label>(3)</label>
<mml:math id="mm3" display="block">
<mml:semantics id="sm3">
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:semantics></mml:math></disp-formula></p></sec>
<sec>
<label>4.3.</label>
<title>Matsushita et al.'s Algorithm</title>
<p>Matsushita <italic>et al.</italic> [<xref ref-type="bibr" rid="b20-sensors-12-13333">20</xref>] analyzed the reflectance component based on the strategy proposed by Weiss and the derived time variable of reflectance and illumination images. They noticed that in real-world scenes, the Lambertian assumption is not sufficient to derive intrinsic images. Matsushita <italic>et al.</italic> analyzed the magnitude of the gradient of the reflectance edge image <italic>r</italic>(<bold>x</bold>) by assuming that texture information should not be present. Then, if the magnitude of the gradient in a given position of the reflectance image is larger than a given threshold <italic>T</italic>, the texture edge is removed from the illumination image <italic>l</italic>(<bold>x</bold>, <italic>t</italic>) and added to the time-varying reflectance image denote by
<disp-formula id="FD4">
<label>(4)</label>
<mml:math id="mm4" display="block">
<mml:semantics id="sm4">
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>+</mml:mo>
<mml:mi>l</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mspace width="0.2em"/>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mo>&gt;</mml:mo>
<mml:mi>T</mml:mi></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mspace width="0.2em"/>
<mml:mtext>otherwise</mml:mtext></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow>
<mml:mtext>and</mml:mtext></mml:mrow></mml:semantics></mml:math></disp-formula>
<disp-formula id="FD5">
<label>(5)</label>
<mml:math id="mm5" display="block">
<mml:semantics id="sm5">
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mspace width="0.2em"/>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mo>&gt;</mml:mo>
<mml:mi>T</mml:mi></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mspace width="0.2em"/>
<mml:mtext>otherwise</mml:mtext></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>After the time-varying reflectance and illumination component edge maps are obtained, a deconvolution process is applied to get a reconstructed image, as
<disp-formula id="FD6">
<label>(6)</label>
<mml:math id="mm6" display="block">
<mml:semantics id="sm6">
<mml:mrow>
<mml:mo>&lt;</mml:mo>
<mml:mover accent="true">
<mml:mi>r</mml:mi>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>,</mml:mo>
<mml:mover accent="true">
<mml:mi>l</mml:mi>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>&gt;</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi>g</mml:mi>
<mml:mo>∗</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mi>m</mml:mi></mml:munder>
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>m</mml:mi>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&lt;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>l</mml:mi>
<mml:mo>&gt;</mml:mo></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>r̂</italic> and <italic>l̂</italic> are the reflectance and illumination time-varying reconstructed images, 
<inline-formula>
<mml:math id="mm7" display="inline">
<mml:semantics id="sm7">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>m</mml:mi>
<mml:mo>′</mml:mo></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> is the inverse filter of <italic>f<sub>m</sub></italic>, and <italic>g</italic> is the filter which satisfies the equation:
<disp-formula id="FD7">
<label>(7)</label>
<mml:math id="mm8" display="block">
<mml:semantics id="sm8">
<mml:mrow>
<mml:mi>g</mml:mi>
<mml:mo>∗</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mi>m</mml:mi></mml:munder>
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>m</mml:mi>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>∗</mml:mo>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mi>δ</mml:mi></mml:mrow></mml:semantics></mml:math></disp-formula></p></sec>
<sec>
<label>4.4.</label>
<title>Gray Retinex Algorithm</title>
<p>The Retinex algorithm, proposed in [<xref ref-type="bibr" rid="b39-sensors-12-13333">39</xref>], analyzes logarithm image gradients. This method considers that small gradients are due to changes in the illumination, while large gradients represent texture. The threshold value that classifies the edges into reflectance or illumination is defined for the horizontal <italic>i<sub>x</sub></italic>(<bold>x</bold>, <italic>t</italic>) and vertical <italic>i<sub>y</sub></italic>(<bold>x</bold>, <italic>t</italic>) derivatives. This method can be applied to gray-scale images or each color band separately. The formal description for gray-scale images is:
<disp-formula id="FD8">
<label>(8)</label>
<mml:math id="mm9" display="block">
<mml:semantics id="sm9">
<mml:mrow>
<mml:msub>
<mml:mi>r</mml:mi>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mi>i</mml:mi>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mspace width="0.2em"/>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>i</mml:mi>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mo>&gt;</mml:mo>
<mml:mi>T</mml:mi></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mspace width="0.2em"/>
<mml:mtext>otherwise</mml:mtext></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>k</italic> can be either <italic>x</italic> or <italic>y</italic>.</p></sec>
<sec>
<label>4.5.</label>
<title>Color Retinex</title>
<p>An extension of the Retinex algorithm to color images has been proposed by Finlayson <italic>et al.</italic> [<xref ref-type="bibr" rid="b40-sensors-12-13333">40</xref>]. This method analyzes logarithm image gradients in color space. Here, two thresholds are considered, one for the chromaticity <italic>T<sub>C</sub></italic> and another for the brightness <italic>T<sub>B</sub></italic> subspace, as
<disp-formula id="FD9">
<label>(9)</label>
<mml:math id="mm10" display="block">
<mml:semantics id="sm10">
<mml:mrow>
<mml:msub>
<mml:mi>r</mml:mi>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msubsup>
<mml:mi>i</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>B</mml:mi></mml:msubsup>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mspace width="0.2em"/>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi>i</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>B</mml:mi></mml:msubsup>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mo>&gt;</mml:mo>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>B</mml:mi></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msubsup>
<mml:mi>i</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>C</mml:mi></mml:msubsup>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mspace width="0.2em"/>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi>i</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>C</mml:mi></mml:msubsup>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mo>&gt;</mml:mo>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>C</mml:mi></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mspace width="0.2em"/>
<mml:mtext>otherwise</mml:mtext></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>k</italic> can be either <italic>x</italic> or <italic>y</italic>.</p></sec>
<sec>
<label>4.6.</label>
<title>Shadow Detection</title>
<p>None of the methods described earlier was designed to estimate the position of the shadows. However, the illumination component, <italic>l</italic>(<bold>x</bold>, <italic>t</italic>), encodes information about shadows, <italic>S</italic>(<bold>x</bold>, <italic>t</italic>). So to compare their usefulness for shadow detection, we devised a method based on thresholding the illumination histogram. <xref ref-type="fig" rid="f3-sensors-12-13333">Figure 3</xref> illustrates an example of the original image, where illumination and its corresponding histogram are estimated with the method proposed by Matsushita <italic>et al.</italic> Then, each pixel of the illumination image is classified into shadow (<italic>C</italic>1) or non-shadow (<italic>C</italic>2). For this purpose, an experimental threshold (<italic>T</italic>) is selected and the shadow segmentation process is achieved, as
<disp-formula id="FD10">
<label>(10)</label>
<mml:math id="mm11" display="block">
<mml:semantics id="sm11">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn>
<mml:mspace width="0.2em"/>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:mi>l</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>&lt;</mml:mo>
<mml:mi>T</mml:mi></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn>
<mml:mspace width="0.2em"/>
<mml:mtext>otherwise</mml:mtext></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>We used the same procedure to detect shadows in all of the algorithms that were evaluated.</p></sec></sec>
<sec sec-type="results">
<label>5.</label>
<title>Experimental Results</title>
<p>The purpose of the experiments is twofold. On one hand, we show how the intrinsic image methods are used to detect shadows. On the other hand, we report quantitative and qualitative results based on the illumination images computed with the intrinsic derivation methods described in the previous section.</p>
<sec>
<label>5.7.</label>
<title>Intrinsic Image Results</title>
<p>To illustrate the results of the methods to derive intrinsic images, we selected three frames from each data set. In <xref ref-type="fig" rid="f2-sensors-12-13333">Figure 2</xref>, reflectance images computed with the evaluated algorithms are illustrated. The first row in this figure includes input images with shadows cast at different locations due to variations of the sun position. All of the columns of the second row show the reflectance image computed with the Weiss' method. Here, a static reflectance image for the respective sequence is obtained. In the third, fourth and fifth rows, reflectance images computed with the Matsushita <italic>et al.</italic>'s, gray and color Retinex methods are presented. The results show that shadow information is present in all reflectance images. Moreover, in the second and third rows of <xref ref-type="fig" rid="f2-sensors-12-13333">Figure 2</xref>, it may be seen that the reflectance images derived with the Weiss' and Matsushita <italic>et al.</italic>'s methods have more texture than the gray and color Retinex algorithms. Furthermore, it should be noted that the shadow detection process is based on the illumination component and not on the reflectance one. Thus, we selected the thresholds for all the methods to derive the intrinsic images based on the visual quality of the illumination component.</p>
<p>Results in <xref ref-type="fig" rid="f4-sensors-12-13333">Figure 4</xref> show that all illumination images contain texture information. In the frames computed using Weiss's method, the texture that represents the white lines is visible inside and outside of the shadow area, while in the frames obtained with Matshusita <italic>et al.</italic>'s method, the texture information is presented inside the shadow region. Also, a very smooth texture pattern appears outside the shadow region. Moreover, some boundaries of the shadow region estimated with the Matsushita <italic>et al.</italic>'s method become too diffuse as a result of the value of the threshold selected. This diffuse effect near the boundaries causes problems in the segmentation of shadows. In the illumination images computed with the gray and color Retinex algorithms, some purple and green color artifacts appear, because there is mixed information about gradients due to illumination and reflectance and one single threshold is not sufficient to separate both components.</p></sec>
<sec>
<label>5.2.</label>
<title>Quantitative Analysis</title>
<p>In this section, a quantitative assessment of the methods to detect shadows cast by urban infrastructure, based on the derived illumination image, is presented. To evaluate the performance of the methods systematically, we used the Receiver Operating Characteristic (ROC) analysis. For our evaluation, we used forty synthetic images (samples of these images are illustrated in fourth, fifth and sixth rows in <xref ref-type="fig" rid="f5-sensors-12-13333">Figure 5</xref>). The ground truth was computed automatically by subtracting the shadows from the shadowless image. First, the intrinsic image derivation methods were used to compute the reflectance and the illumination component of each frame. Then, a segmentation process based on the histogram analysis was applied. The curves in <xref ref-type="fig" rid="f6-sensors-12-13333">Figure 6</xref> represent the false positive shadow detection rate on the horizontal axis and the shadow detection rate on the vertical one.</p>
<p>In addition, different performance results can be obtained for the methods in relation to the measuring parameter. For example, if we consider the value measured at the point in the ROC space that is located at the northwest point, the result would show that the best method to detect shadows based on the derived illumination image is color Retinex. Nonetheless, if we selected the area under the curve as a measurement parameter [<xref ref-type="bibr" rid="b44-sensors-12-13333">44</xref>], the best method is that of Matsushita <italic>et al</italic>.</p>
<p>For a better analysis of <xref ref-type="fig" rid="f6-sensors-12-13333">Figure 6</xref>, we can consider the plots consisting of four regions. The first region <italic>R</italic><sub>1</sub>, according to the false positive rate, is between 0 and 0.01; the second region <italic>R</italic><sub>2</sub> between 0.01 and 0.1; the third <italic>R</italic><sub>3</sub> between 0.1 and 0.23, and the fourth one <italic>R</italic><sub>4</sub> with larger values than the previous one (vertical lines <xref ref-type="fig" rid="f6-sensors-12-13333">Figure 6</xref>).</p>
<p>In <italic>R</italic><sub>1</sub>, all methods express similar behavior and thus it is not fair to rank them. In <italic>R</italic><sub>2</sub>, the method proposed by Matsushita <italic>et al</italic> has a performance that is superior to the others. In <italic>R</italic><sub>3</sub>, a mixture of information in the performance of the methods appears, while in <italic>R</italic><sub>4</sub> a significant difference between the algorithms is apparent. In general, <xref ref-type="fig" rid="f6-sensors-12-13333">Figure 6</xref> shows that there are two main tendencies in the efficiency of the methods, one for gray and color Retinex and the other for Weiss and Matsushita <italic>et al</italic> This is due to the fact that these two methods are very similar.</p>
<p><xref ref-type="fig" rid="f7-sensors-12-13333">Figure 7</xref> illustrates the ranking of the methods in the four regions of <xref ref-type="fig" rid="f6-sensors-12-13333">Figure 6</xref>. We used the normalized cumulative value of true positive rate (TPR) in each region to rank the methods. Based on these results, we considered the second region to extract and present the reflectance and illumination images in <xref ref-type="fig" rid="f2-sensors-12-13333">Figures 2</xref> and <xref ref-type="fig" rid="f4-sensors-12-13333">4</xref>. Moreover, all shadow detection results in <xref ref-type="fig" rid="f5-sensors-12-13333">Figure 5</xref> were extracted from each respective data set with a 10% value for the false positive rate as the threshold parameter.</p></sec>
<sec>
<label>5.3.</label>
<title>Qualitative Analysis</title>
<p><xref ref-type="fig" rid="f5-sensors-12-13333">Figure 5</xref> illustrates some shadow regions correctly detected. In the second row, the ground truth of shadows is illustrated. The ground truth of the shadows has been annotated manually to serve as a qualitative comparison between the methods to derive intrinsic images and the detection of shadows.</p>
<p>The Weiss [<xref ref-type="bibr" rid="b19-sensors-12-13333">19</xref>] and Matsushita <italic>et al.</italic> [<xref ref-type="bibr" rid="b20-sensors-12-13333">20</xref>] methods show better results than the strategies based on the Retinex algorithm [<xref ref-type="bibr" rid="b39-sensors-12-13333">39</xref>,<xref ref-type="bibr" rid="b40-sensors-12-13333">40</xref>]. The results of the Retinex algorithms applied to the synthetic sequence with white lines on the road surface are less effective than the other methods. The problem with the Retinex algorithm is that it is too constrained. It assumes that small values of the magnitude of the gradient are always due to changes in the illumination of the scene, but in general this idea is not always true.</p>
<p>In the second synthetic sequence, which consists of several static objects in an urban scenario, the methods can detect isolated shadows with good visual performance (see the pine tree and the tower in columns 4–6 in <xref ref-type="fig" rid="f5-sensors-12-13333">Figure 5</xref>). However, other objects such as the house and the trees that appear in the back part of the scene behind the tower, together with the sky line, have poor visual quality. These results are due to the remnant texture information present in the illumination component before the application of the shadow detection process. In the real images of <xref ref-type="fig" rid="f5-sensors-12-13333">Figure 5</xref>, variations in the intensity value of large shaded regions cause the algorithms to only be partially successful. As a result, some regions that have shadows are not well detected. For instance, the region at the right side of the image with the white brick wall (see columns 7–9) is not well detected by any of the discussed methods. The Weiss and Matsushita <italic>et al.</italic> methods fail because these regions are shaded all day, so this causes the shadow information to appear in the reflectance image. Similarly, the gray and color Retinex algorithms fail to detect large shadow areas, because the magnitude of the gradient in the shadow edges is similar to those caused by texture or color.</p></sec></sec>
<sec sec-type="conclusions">
<label>6.</label>
<title>Conclusions</title>
<p>A primary goal of many computer vision algorithms is to attenuate the effects caused by shadows. Due to several factors, the problem of shadow detection is a complex and open research field. In this paper, we presented an evaluation of several intrinsic image base methods to detect shadows cast by static objects in outdoor locations.</p>
<p>Although these algorithms were not constructed with the purpose of detecting shadows cast by static objects in the outdoors, we can conclude from the experimental results that the efficiency of intrinsic image methods is relatively poor. Quantitatively, the best method to detect shadows after the intrinsic image components are derived is the algorithm proposed by Matsushita <italic>et al.</italic>, but only if we accept a false positive rate (FPR) between 1% and 10%. Finally, in terms of visual comparison and shadow detection accuracy, we conclude that, if the shadows are isolated, all of the methods can detect them.</p>
<p>Future work will focus on the exploration of alternatives to obtain intrinsic images without the texture information remaining in the illumination component or the shadow information remaining in the reflectance image.</p></sec></body>
<back>
<ack>
<p>This research was partially supported with a grant from IPN-SIP under grant contract 20121642. The authors would like to thank Paul Riley for many helpful comments about the English.</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-sensors-12-13333"><label>1.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Salvador</surname><given-names>E.</given-names></name><name><surname>Cavallaro</surname><given-names>A.</given-names></name><name><surname>Ebrahimi</surname><given-names>T.</given-names></name></person-group><article-title>Cast shadow segmentation using invariant color features</article-title><source>Comput. Vision Image Underst.</source><year>2004</year><volume>95</volume><fpage>238</fpage><lpage>259</lpage><pub-id pub-id-type="doi">10.1016/j.cviu.2004.03.008</pub-id></citation></ref>
<ref id="b2-sensors-12-13333"><label>2.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stander</surname><given-names>J.</given-names></name><name><surname>Mech</surname><given-names>R.</given-names></name><name><surname>Ostermann</surname><given-names>J.</given-names></name></person-group><article-title>Detection of moving cast shadows for object segmentation</article-title><source>Multimedia</source><year>1999</year><volume>1</volume><fpage>65</fpage><lpage>76</lpage></citation></ref>
<ref id="b3-sensors-12-13333"><label>3.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname><given-names>D.</given-names></name><name><surname>Li</surname><given-names>X.</given-names></name><name><surname>Liu</surname><given-names>Z.</given-names></name><name><surname>Yuan</surname><given-names>Y.</given-names></name></person-group><article-title>Cast shadow detection in video segmentation</article-title><source>Pattern Recog. Lett.</source><year>2005</year><volume>26</volume><fpage>91</fpage><lpage>99</lpage><pub-id pub-id-type="doi">10.1016/j.patrec.2004.09.005</pub-id></citation></ref>
<ref id="b4-sensors-12-13333"><label>4.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foresti</surname><given-names>G.</given-names></name></person-group><article-title>Object recognition and tracking for remote video surveillance</article-title><source>IEEE Trans. Circuits Syst. Video Technol.</source><year>1999</year><volume>9</volume><fpage>1045</fpage><lpage>1062</lpage><pub-id pub-id-type="doi">10.1109/76.795058</pub-id></citation></ref>
<ref id="b5-sensors-12-13333"><label>5.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hsieh</surname><given-names>J.</given-names></name><name><surname>Hu</surname><given-names>W.</given-names></name><name><surname>Chang</surname><given-names>C.</given-names></name><name><surname>Chen</surname><given-names>Y.</given-names></name></person-group><article-title>Shadow elimination for effective moving object detection by gaussian shadow modeling</article-title><source>Image Vision Comput.</source><year>2003</year><volume>21</volume><fpage>505</fpage><lpage>516</lpage><pub-id pub-id-type="doi">10.1016/S0262-8856(03)00030-1</pub-id></citation></ref>
<ref id="b6-sensors-12-13333"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leone</surname><given-names>A.</given-names></name><name><surname>Distante</surname><given-names>C.</given-names></name></person-group><article-title>Shadow detection for moving objects based on texture analysis</article-title><source>Pattern Recog.</source><year>2007</year><volume>40</volume><fpage>1222</fpage><lpage>1233</lpage><pub-id pub-id-type="doi">10.1016/j.patcog.2006.09.017</pub-id></citation></ref>
<ref id="b7-sensors-12-13333"><label>7.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Daum</surname><given-names>M.</given-names></name><name><surname>Dudek</surname><given-names>G.</given-names></name></person-group><article-title>On 3-D Surface Reconstruction Using Shape from Shadows</article-title><conf-name>Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</conf-name><conf-loc>Santa Barbara, CA, USA</conf-loc><conf-date>23–25 June 1998</conf-date><fpage>461</fpage><lpage>468</lpage></citation></ref>
<ref id="b8-sensors-12-13333"><label>8.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Finlayson</surname><given-names>G.</given-names></name><name><surname>Hordley</surname><given-names>S.</given-names></name><name><surname>Lu</surname><given-names>C.</given-names></name><name><surname>Drew</surname><given-names>M.</given-names></name></person-group><article-title>On the removal of shadows from images</article-title><source>IEEE Trans. Pattern Anal. Mach. Intell.</source><year>2006</year><volume>28</volume><fpage>59</fpage><lpage>68</lpage><pub-id pub-id-type="pmid">16402619</pub-id><pub-id pub-id-type="doi">10.1109/TPAMI.2006.18</pub-id></citation></ref>
<ref id="b9-sensors-12-13333"><label>9.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Poulin</surname><given-names>A.</given-names></name></person-group><article-title>Interactive Rendering of Trees with Shading and Shadows</article-title><conf-name>Proceedings of the Rendering Techniques: Eurographics Workshop</conf-name><conf-loc>London, UK</conf-loc><conf-date>25–27 June 2001</conf-date><publisher-name>Springer Verlag Wien</publisher-name><publisher-loc>Berlin/Heidelberg, Germany</publisher-loc><year>2001</year><fpage>183</fpage><lpage>196</lpage></citation></ref>
<ref id="b10-sensors-12-13333"><label>10.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Savarese</surname><given-names>S.</given-names></name><name><surname>Andreetto</surname><given-names>M.</given-names></name><name><surname>Rushmeier</surname><given-names>H.</given-names></name><name><surname>Bernardini</surname><given-names>F.</given-names></name><name><surname>Perona</surname><given-names>P.</given-names></name></person-group><article-title>3D Reconstruction by shadow carving: Theory and practical evaluation</article-title><source>Int. J. Comput. Vis.</source><year>2007</year><volume>71</volume><fpage>305</fpage><lpage>336</lpage><pub-id pub-id-type="doi">10.1007/s11263-006-8323-9</pub-id></citation></ref>
<ref id="b11-sensors-12-13333"><label>11.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nagao</surname><given-names>M.</given-names></name><name><surname>Matsuyama</surname><given-names>T.</given-names></name><name><surname>Ikeda</surname><given-names>Y.</given-names></name></person-group><article-title>Region extraction and shape analysis in aerial photographs</article-title><source>Comput. Graph. Image Process.</source><year>1979</year><volume>10</volume><fpage>195</fpage><lpage>223</lpage><pub-id pub-id-type="doi">10.1016/0146-664X(79)90001-7</pub-id></citation></ref>
<ref id="b12-sensors-12-13333"><label>12.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Barnard</surname><given-names>K.</given-names></name><name><surname>Finlayson</surname><given-names>G.</given-names></name></person-group><article-title>Shadow Identification Using Colour Ratios</article-title><conf-name>Proceedings of the 8th Color Imaging Conference</conf-name><conf-loc>Scottsdale, AZ, USA</conf-loc><conf-date>7–10 November 2000</conf-date><volume>2</volume><fpage>97</fpage><lpage>101</lpage></citation></ref>
<ref id="b13-sensors-12-13333"><label>13.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Prati</surname><given-names>A.</given-names></name><name><surname>Mikic</surname><given-names>I.</given-names></name><name><surname>Trivedi</surname><given-names>M.</given-names></name><name><surname>Cucchiara</surname><given-names>R.</given-names></name></person-group><article-title>Detecting moving shadows: Algorithms and evaluation</article-title><source>IEEE Trans. Pattern Anal. Machine Intell.</source><year>2003</year><volume>25</volume><fpage>918</fpage><lpage>923</lpage><pub-id pub-id-type="doi">10.1109/TPAMI.2003.1206520</pub-id></citation></ref>
<ref id="b14-sensors-12-13333"><label>14.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heikkila</surname><given-names>M.</given-names></name><name><surname>Pietikainen</surname><given-names>M.</given-names></name></person-group><article-title>A texture-based method for modeling the background and detecting moving objects</article-title><source>IEEE Trans. Pattern Anal. Machine Intell.</source><year>2006</year><volume>28</volume><fpage>657</fpage><lpage>662</lpage><pub-id pub-id-type="doi">10.1109/TPAMI.2006.68</pub-id></citation></ref>
<ref id="b15-sensors-12-13333"><label>15.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sanin</surname><given-names>A.</given-names></name><name><surname>Sanderson</surname><given-names>C.</given-names></name><name><surname>Lovell</surname><given-names>B.</given-names></name></person-group><article-title>Shadow detection: A survey and comparative evaluation of recent methods</article-title><source>Pattern Recog.</source><year>2012</year><volume>45</volume><fpage>1684</fpage><lpage>1695</lpage><pub-id pub-id-type="doi">10.1016/j.patcog.2011.10.001</pub-id></citation></ref>
<ref id="b16-sensors-12-13333"><label>16.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Prati</surname><given-names>A.</given-names></name><name><surname>Cucchiara</surname><given-names>R.</given-names></name><name><surname>Mikic</surname><given-names>I.</given-names></name><name><surname>Trivedi</surname><given-names>M.</given-names></name></person-group><article-title>Analysis and Detection of Shadows in Video Streams: A Comparative Evaluation</article-title><conf-name>Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</conf-name><conf-loc>Kauai, HI, USA</conf-loc><conf-date>8–14 December 2001</conf-date><volume>2</volume><fpage>571</fpage><lpage>576</lpage></citation></ref>
<ref id="b17-sensors-12-13333"><label>17.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Isaza</surname><given-names>C.</given-names></name><name><surname>Salas</surname><given-names>J.</given-names></name><name><surname>Raducanu</surname><given-names>B.</given-names></name></person-group><article-title>Toward the Detection of Urban Infrastructure's Edge Shadows</article-title><source>Advanced Concepts for Intelligent Vision Systems</source><publisher-name>Springer</publisher-name><publisher-loc>Berlin/Heidelberg, Germany</publisher-loc><year>2010</year><fpage>30</fpage><lpage>37</lpage></citation></ref>
<ref id="b18-sensors-12-13333"><label>18.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Barrow</surname><given-names>H.</given-names></name><name><surname>Tenenbaum</surname><given-names>J.</given-names></name></person-group><source>Recovering Intrinsic Scene Characteristics from Images</source><publisher-name>Artificial Intelligence Center, SRI International</publisher-name><publisher-loc>Menlo Park, CA, USA</publisher-loc><year>1978</year></citation></ref>
<ref id="b19-sensors-12-13333"><label>19.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Weiss</surname><given-names>Y.</given-names></name></person-group><article-title>Deriving Intrinsic Images from Image Sequences</article-title><conf-name>Proceedings of the IEEE International Conference on Computer Vision</conf-name><conf-loc>Vancouver, BC, Canada</conf-loc><conf-date>7–14 July, 2001</conf-date><volume>2</volume><fpage>68</fpage><lpage>75</lpage></citation></ref>
<ref id="b20-sensors-12-13333"><label>20.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matsushita</surname><given-names>Y.</given-names></name><name><surname>Nishino</surname><given-names>K.</given-names></name><name><surname>Ikeuchi</surname><given-names>K.</given-names></name><name><surname>Sakauchi</surname><given-names>M.</given-names></name></person-group><article-title>Illumination normalization with time-dependent intrinsic images for video surveillance</article-title><source>IEEE Trans. Pattern Anal. Machine Intell.</source><year>2004</year><volume>26</volume><fpage>1336</fpage><lpage>1347</lpage><pub-id pub-id-type="doi">10.1109/TPAMI.2004.86</pub-id></citation></ref>
<ref id="b21-sensors-12-13333"><label>21.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blake</surname><given-names>A.</given-names></name></person-group><article-title>Boundary conditions for lightness computation in mondrian world</article-title><source>Comput. Vis., Graph., Image Process.</source><year>1985</year><volume>32</volume><fpage>314</fpage><lpage>327</lpage><pub-id pub-id-type="doi">10.1016/0734-189X(85)90054-4</pub-id></citation></ref>
<ref id="b22-sensors-12-13333"><label>22.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Grosse</surname><given-names>R.</given-names></name><name><surname>Johnson</surname><given-names>M.</given-names></name><name><surname>Adelson</surname><given-names>E.</given-names></name><name><surname>Freeman</surname><given-names>W.</given-names></name></person-group><article-title>Ground Truth Dataset and Baseline Evaluations for Intrinsic Image Algorithms</article-title><conf-name>Proceedings of the IEEE International Conference on Computer Vision</conf-name><conf-loc>Kyoto, Japan</conf-loc><conf-date>27 September–4 October 2009</conf-date><fpage>2335</fpage><lpage>2342</lpage></citation></ref>
<ref id="b23-sensors-12-13333"><label>23.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Kawasaki</surname><given-names>H.</given-names></name><name><surname>Furukawa</surname><given-names>R.</given-names></name></person-group><article-title>Shape Reconstruction from Cast Shadows Using Coplanarities and Metric Constraints</article-title><conf-name>Proceedings of Asian Conference on Computer Vision</conf-name><conf-loc>Tokyo, Japan</conf-loc><conf-date>18–22 November 2007</conf-date><fpage>847</fpage><lpage>857</lpage></citation></ref>
<ref id="b24-sensors-12-13333"><label>24.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Kennedy</surname><given-names>J.</given-names></name></person-group><source>A Psychology of Picture Perception</source><publisher-name>Jossey-Bass</publisher-name><publisher-loc>Oxford, England</publisher-loc><year>1974</year></citation></ref>
<ref id="b25-sensors-12-13333"><label>25.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Scanlan</surname><given-names>J.</given-names></name><name><surname>Chabries</surname><given-names>D.</given-names></name><name><surname>Christiansen</surname><given-names>R.</given-names></name></person-group><article-title>A Shadow Detection and Removal Algorithm for 2-D Images</article-title><conf-name>Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing</conf-name><conf-loc>Albuquerque, NM, USA</conf-loc><conf-date>3–6 April 1990</conf-date><fpage>2057</fpage><lpage>2060</lpage></citation></ref>
<ref id="b26-sensors-12-13333"><label>26.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Jiang</surname><given-names>C.</given-names></name><name><surname>Ward</surname><given-names>M.</given-names></name></person-group><article-title>Shadow Identification</article-title><conf-name>Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition</conf-name><conf-loc>Champaign, IL, USA</conf-loc><conf-date>15–18 June 1992</conf-date><fpage>606</fpage><lpage>612</lpage></citation></ref>
<ref id="b27-sensors-12-13333"><label>27.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Suzuki</surname><given-names>A.</given-names></name><name><surname>Shio</surname><given-names>A.</given-names></name><name><surname>Arai</surname><given-names>H.</given-names></name><name><surname>Ohtsuka</surname><given-names>S.</given-names></name></person-group><article-title>Dynamic Shadow Compensation of Aerial Images Based on Color and Spatial Analysis</article-title><conf-name>Proceedings of the IEEE International Conference on Pattern Recognition</conf-name><conf-loc>Barcelona, Spain</conf-loc><conf-date>5 September 2000</conf-date><volume>1</volume><fpage>317</fpage><lpage>320</lpage></citation></ref>
<ref id="b28-sensors-12-13333"><label>28.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gevers</surname><given-names>T.</given-names></name><name><surname>Smeulders</surname><given-names>W.</given-names></name></person-group><article-title>Color based object recognition</article-title><source>Pattern Recog.</source><year>1999</year><volume>32</volume><fpage>453</fpage><lpage>464</lpage><pub-id pub-id-type="doi">10.1016/S0031-3203(98)00036-3</pub-id></citation></ref>
<ref id="b29-sensors-12-13333"><label>29.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Finlayson</surname><given-names>G.</given-names></name><name><surname>Hordley</surname><given-names>S.</given-names></name><name><surname>Drew</surname><given-names>M.</given-names></name></person-group><article-title>Removing Shadows from Images</article-title><conf-name>Proceedings of European Conference on Computer Vision</conf-name><conf-loc>Copenhagen, Denmark</conf-loc><conf-date>27 May 2002</conf-date><fpage>129</fpage><lpage>132</lpage></citation></ref>
<ref id="b30-sensors-12-13333"><label>30.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Gevers</surname><given-names>T.</given-names></name><name><surname>Stokman</surname><given-names>H.</given-names></name></person-group><article-title>Classifying Color Transitions into Shadow-geometry, Illumination, Highlight or Material Edges</article-title><conf-name>Proceedings of the IEEE International Conference on Image Processing</conf-name><conf-loc>Vancouver, BC, Canada</conf-loc><conf-date>10–13 September 2000</conf-date><volume>1</volume><fpage>521</fpage><lpage>524</lpage></citation></ref>
<ref id="b31-sensors-12-13333"><label>31.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levine</surname><given-names>M.</given-names></name><name><surname>Bhattacharyya</surname><given-names>J.</given-names></name></person-group><article-title>Removing shadows</article-title><source>Pattern Recog. Lett.</source><year>2005</year><volume>26</volume><fpage>251</fpage><lpage>265</lpage><pub-id pub-id-type="doi">10.1016/j.patrec.2004.10.021</pub-id></citation></ref>
<ref id="b32-sensors-12-13333"><label>32.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Sonoda</surname><given-names>Y.</given-names></name><name><surname>Ogata</surname><given-names>T.</given-names></name></person-group><article-title>Separation of Moving Objects and Their Shadows, and Application to Tracking of Loci in the Monitoring Images</article-title><conf-name>Proceedings of the IEEE International Conference on Signal Processing Proceedings</conf-name><conf-loc>Santa Barbara, CA, USA</conf-loc><conf-date>12–16 October 1998</conf-date><volume>2</volume><fpage>1261</fpage><lpage>1264</lpage></citation></ref>
<ref id="b33-sensors-12-13333"><label>33.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Horprasert</surname><given-names>T.</given-names></name><name><surname>Harwood</surname><given-names>D.</given-names></name><name><surname>Davis</surname><given-names>L.</given-names></name></person-group><article-title>A Statistical Approach for Real-Time Robust Background Subtraction and Shadow Detection</article-title><conf-name>Proceedings of the IEEE International Conference on Computer Vision</conf-name><conf-loc>Kerkyra, Corfu, Greece</conf-loc><conf-date>20–25 September 1999</conf-date><volume>99</volume><fpage>256</fpage><lpage>261</lpage></citation></ref>
<ref id="b34-sensors-12-13333"><label>34.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Mikic</surname><given-names>I.</given-names></name><name><surname>Cosman</surname><given-names>P.</given-names></name><name><surname>Kogut</surname><given-names>G.</given-names></name><name><surname>Trivedi</surname><given-names>M.</given-names></name></person-group><article-title>Moving Shadow and Object Detection in Traffic Scenes</article-title><conf-name>Proceedings of the IEEE International Conference on Pattern Recognition</conf-name><conf-loc>Barcelona, Spain</conf-loc><conf-date>3–8 September 2000</conf-date><volume>1</volume><fpage>321</fpage><lpage>324</lpage></citation></ref>
<ref id="b35-sensors-12-13333"><label>35.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Nadimi</surname><given-names>S.</given-names></name><name><surname>Bhanu</surname><given-names>B.</given-names></name></person-group><article-title>Moving Shadow Detection Using a Physics-based Approach</article-title><conf-name>Proceedings of the IEEE International Conference on Pattern Recognition</conf-name><conf-loc>Quebec City, PQ, Canada</conf-loc><conf-date>1–15 August 2002</conf-date><volume>2</volume><fpage>701</fpage><lpage>704</lpage></citation></ref>
<ref id="b36-sensors-12-13333"><label>36.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Bevilacqua</surname><given-names>A.</given-names></name><name><surname>Roffilli</surname><given-names>M.</given-names></name></person-group><article-title>Robust Denoising and Moving Shadows Detection in Traffic Scenes</article-title><conf-name>Proceedings of the IEEE Internatinoal Conference on Computer Vision and Pattern Recognition</conf-name><conf-loc>Kauai, HI, USA</conf-loc><conf-date>8–14 December 2001</conf-date><fpage>1</fpage><lpage>4</lpage></citation></ref>
<ref id="b37-sensors-12-13333"><label>37.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Cucchiara</surname><given-names>R.</given-names></name><name><surname>Grana</surname><given-names>C.</given-names></name><name><surname>Piccardi</surname><given-names>M.</given-names></name><name><surname>Prati</surname><given-names>A.</given-names></name></person-group><article-title>Detecting Objects, Shadows and Ghosts in Video Streams by Exploiting Color and Motion Information</article-title><conf-name>Proceedings of the IEEE International Conference on Image Analysis and Processing</conf-name><conf-loc>Palermo, Italy</conf-loc><conf-date>26–28 September 2001</conf-date><fpage>360</fpage><lpage>365</lpage></citation></ref>
<ref id="b38-sensors-12-13333"><label>38.</label><citation citation-type="web"><person-group person-group-type="author"><collab>The Persistence of Vision Raytracer Pty. Ltd.</collab></person-group><source>The Persistence of Vision Raytracer</source><comment>Available online: <ext-link xlink:href="http://www.povray.org" ext-link-type="uri">http://www.povray.org</ext-link> (accessed on 31 January 2012)</comment></citation></ref>
<ref id="b39-sensors-12-13333"><label>39.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Land</surname><given-names>E.</given-names></name><name><surname>McCann</surname><given-names>J.</given-names></name></person-group><article-title>Lightness and retinex theory</article-title><source>J. Opt. Soc. Am.</source><year>1971</year><volume>61</volume><fpage>1</fpage><lpage>11</lpage><pub-id pub-id-type="pmid">5541571</pub-id><pub-id pub-id-type="doi">10.1364/JOSA.61.000001</pub-id></citation></ref>
<ref id="b40-sensors-12-13333"><label>40.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Finlayson</surname><given-names>G.D.</given-names></name><name><surname>Hordley</surname><given-names>S.D.</given-names></name><name><surname>Drew</surname><given-names>M.S.</given-names></name></person-group><article-title>Removing Shadows from Images Using Retinex</article-title><conf-name>Proceedings of the Color Science and Engineering Systems, Technologies, and Applications. Color Imaging Conference</conf-name><conf-loc>Scottsdale, AZ, USA</conf-loc><conf-date>12 November 2002</conf-date><fpage>73</fpage><lpage>79</lpage></citation></ref>
<ref id="b41-sensors-12-13333"><label>41.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reda</surname><given-names>I.</given-names></name><name><surname>Andreas</surname><given-names>A.</given-names></name></person-group><article-title>Solar position algorithm for solar radiation applications</article-title><source>Solar Energy</source><year>2004</year><volume>76</volume><fpage>577</fpage><lpage>589</lpage><pub-id pub-id-type="doi">10.1016/j.solener.2003.12.003</pub-id></citation></ref>
<ref id="b42-sensors-12-13333"><label>42.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tappen</surname><given-names>M.</given-names></name><name><surname>Freeman</surname><given-names>W.</given-names></name><name><surname>Adelson</surname><given-names>E.</given-names></name></person-group><article-title>Recovering intrinsic images from a single image</article-title><source>Pattern Anal. Machine Intell.</source><year>2005</year><volume>27</volume><fpage>1459</fpage><lpage>1472</lpage><pub-id pub-id-type="doi">10.1109/TPAMI.2005.185</pub-id></citation></ref>
<ref id="b43-sensors-12-13333"><label>43.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Huang</surname><given-names>J.</given-names></name><name><surname>Mumford</surname><given-names>D.</given-names></name></person-group><article-title>Statistics of Natural Images and Models</article-title><conf-name>Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</conf-name><conf-loc>Fort Collins, CO, USA</conf-loc><conf-date>23–25 June 1999</conf-date><volume>1</volume><fpage>541</fpage><lpage>547</lpage></citation></ref>
<ref id="b44-sensors-12-13333"><label>44.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bradley</surname><given-names>A.</given-names></name></person-group><article-title>The use of the area under the ROC curve in the evaluation of machine learning algorithms</article-title><source>Pattern Recog.</source><year>1997</year><volume>30</volume><fpage>1145</fpage><lpage>1159</lpage><pub-id pub-id-type="doi">10.1016/S0031-3203(96)00142-2</pub-id></citation></ref></ref-list>
<sec sec-type="display-objects">
<title>Figures</title>
<fig id="f1-sensors-12-13333" position="float">
<label>Figure 1.</label>
<caption>
<p>Data sets used for the evaluation of algorithms. The first and second rows are synthetic and the third row is a real scenario. The images illustrate changes in the illumination due to the relative movement of the sun.</p></caption>
<graphic xlink:href="sensors-12-13333f1.gif"/></fig>
<fig id="f2-sensors-12-13333" position="float">
<label>Figure 2.</label>
<caption>
<p>Reflectance images. Top, input images with different illumination conditions; second, third, fourth, and fifth rows are reflectance images computed with the Weiss [<xref ref-type="bibr" rid="b19-sensors-12-13333">19</xref>], Matsushita <italic>et al.</italic> [<xref ref-type="bibr" rid="b20-sensors-12-13333">20</xref>], gray [<xref ref-type="bibr" rid="b39-sensors-12-13333">39</xref>] and color Retinex [<xref ref-type="bibr" rid="b40-sensors-12-13333">40</xref>] methods respectively.</p></caption>
<graphic xlink:href="sensors-12-13333f2.gif"/></fig>
<fig id="f3-sensors-12-13333" position="float">
<label>Figure 3.</label>
<caption>
<p>Example of an illumination histogram. (<bold>a</bold>) Input image; (<bold>b</bold>) illumination component computed with the method proposed by Matsushita <italic>et al.</italic>; (<bold>c</bold>) Corresponding histogram of the illumination with two Gaussian tendencies. The highest Gaussian tendency corresponds to non-shadow pixels.</p></caption>
<graphic xlink:href="sensors-12-13333f3.gif"/></fig>
<fig id="f4-sensors-12-13333" position="float">
<label>Figure 4.</label>
<caption>
<p>Corresponding illumination images of <xref ref-type="fig" rid="f2-sensors-12-13333">Figure 2</xref>. From top to bottom: Weiss [<xref ref-type="bibr" rid="b19-sensors-12-13333">19</xref>], Matsushita <italic>et al.</italic> [<xref ref-type="bibr" rid="b20-sensors-12-13333">20</xref>], gray [<xref ref-type="bibr" rid="b39-sensors-12-13333">39</xref>], and color Retinex Methods [<xref ref-type="bibr" rid="b40-sensors-12-13333">40</xref>].</p></caption>
<graphic xlink:href="sensors-12-13333f4.gif"/></fig>
<fig id="f5-sensors-12-13333" position="float">
<label>Figure 5.</label>
<caption>
<p>Qualitative results in the synthetic data set. The first and second rows are input and ground truth of shadows. The third, fourth, fifth, and sixth rows are the results of the Weiss [<xref ref-type="bibr" rid="b19-sensors-12-13333">19</xref>], Matsushita <italic>et al.</italic> [<xref ref-type="bibr" rid="b20-sensors-12-13333">20</xref>], gray [<xref ref-type="bibr" rid="b39-sensors-12-13333">39</xref>], and color Retinex methods [<xref ref-type="bibr" rid="b40-sensors-12-13333">40</xref>], respectively.</p></caption>
<graphic xlink:href="sensors-12-13333f5.gif"/></fig>
<fig id="f6-sensors-12-13333" position="float">
<label>Figure 6.</label>
<caption>
<p>ROC Curves for the Weiss [<xref ref-type="bibr" rid="b19-sensors-12-13333">19</xref>], Matsushita <italic>et al.</italic> [<xref ref-type="bibr" rid="b20-sensors-12-13333">20</xref>], gray [<xref ref-type="bibr" rid="b39-sensors-12-13333">39</xref>], and color Retinex methods [<xref ref-type="bibr" rid="b40-sensors-12-13333">40</xref>]. The vertical lines define the four regions where the algorithms are compared.</p></caption>
<graphic xlink:href="sensors-12-13333f6.gif"/></fig>
<fig id="f7-sensors-12-13333" position="float">
<label>Figure 7.</label>
<caption>
<p>Quantitative comparison of all the algorithms. The regions <italic>R</italic><sub>2</sub>, <italic>R</italic><sub>3</sub> and <italic>R</italic><sub>4</sub>, correspond to the interval of analysis defined by the vertical lines of <xref ref-type="fig" rid="f6-sensors-12-13333">Figure 6</xref>.</p></caption>
<graphic xlink:href="sensors-12-13333f7.gif"/></fig></sec></back></article>
